Domain Anchorage in GPT-4: A Computational Linguistic Analysis of Lexicographic Profiling and Its Implications for Unintended Information Dissemination

Challappa, Lekha; Zhang, Zijin; Garg, Rajiv

Files

0686.pdf (440.6 KB)

Date

2025-01-07

Authors

Challappa, Lekha

Zhang, Zijin

Garg, Rajiv

Starting Page

7036

Abstract

Our study expands upon recent work explaining in-context learning as implicit Bayesian inference, where language models infer shared latent concepts from examples. We analyze GPT-4's semantic attention post-domain priming, using computational linguistics to quantify response similarity to lexicographically independent queries with the same intent. We assess potential privacy breaches from inadvertent domain anchorage, examining how attention and embedding layers process linguistic patterns. We hypothesize that domain-specific words receiving higher gradient updates can introduce bias, create semantic echo chambers, and oversimplify relationships. Grounded in Mohamed Zakaria Kurdi's frameworks, this research uses lexical, semantic, syntactic, and positional similarities to analyze GPT-4's vector transformations and attention distributions. By simulating domain-specific interactions through declarative primes and interrogative inputs, we highlight significant privacy and ethical concerns, as the model may share information across users due to domain anchorage.

Keywords

Artificial Intelligence Security: Ensuring Safety, Trustworthiness, and Responsibility in AI Systems, domain anchorage, implicit profiling in AI, information dissemination, lexicographic similarity, semantic attention

URI

https://hdl.handle.net/10125/109692

Extent

10

Related To

Proceedings of the 58th Hawaii International Conference on System Sciences

Rights

Attribution-NonCommercial-NoDerivatives 4.0 International

Collections

Artificial Intelligence Security: Ensuring Safety, Trustworthiness, and Responsibility in AI Systems
