Domain Anchorage in GPT-4: A Computational Linguistic Analysis of Lexicographic Profiling and Its Implications for Unintended Information Dissemination
Files
Date
2025-01-07
Contributor
Advisor
Department
Instructor
Depositor
Speaker
Researcher
Consultant
Interviewer
Narrator
Transcriber
Annotator
Journal Title
Journal ISSN
Volume Title
Publisher
Volume
Number/Issue
Starting Page
7036
Ending Page
Alternative Title
Abstract
Our study expands upon recent work explaining in-context learning as implicit Bayesian inference, where language models infer shared latent concepts from examples. We analyze GPT-4's semantic attention post-domain priming, using computational linguistics to quantify response similarity to lexicographically independent queries with the same intent. We assess potential privacy breaches from inadvertent domain anchorage, examining how attention and embedding layers process linguistic patterns. We hypothesize that domain-specific words receiving higher gradient updates can introduce bias, create semantic echo chambers, and oversimplify relationships. Grounded in Mohamed Zakaria Kurdi's frameworks, this research uses lexical, semantic, syntactic, and positional similarities to analyze GPT-4's vector transformations and attention distributions. By simulating domain-specific interactions through declarative primes and interrogative inputs, we highlight significant privacy and ethical concerns, as the model may share information across users due to domain anchorage.
Description
Keywords
Artifical Intelligence Security: Ensuring Safety, Trustworthiness, and Responsibility in AI Systems, domain anchorage, implicit profiling in ai, information dissemination, lexicographic similarity., semantic attention
Citation
Extent
10
Format
Geographic Location
Time Period
Related To
Proceedings of the 58th Hawaii International Conference on System Sciences
Related To (URI)
Table of Contents
Rights
Attribution-NonCommercial-NoDerivatives 4.0 International
Rights Holder
Local Contexts
Email libraryada-l@lists.hawaii.edu if you need this content in ADA-compliant format.