Domain Anchorage in GPT-4: A Computational Linguistic Analysis of Lexicographic Profiling and Its Implications for Unintended Information Dissemination

Challappa, Lekha; Zhang, Zijin; Garg, Rajiv

Domain Anchorage in GPT-4: A Computational Linguistic Analysis of Lexicographic Profiling and Its Implications for Unintended Information Dissemination

dc.contributor.author	Challappa, Lekha
dc.contributor.author	Zhang, Zijin
dc.contributor.author	Garg, Rajiv
dc.date.accessioned	2024-12-26T21:10:56Z
dc.date.available	2024-12-26T21:10:56Z
dc.date.issued	2025-01-07
dc.description.abstract	Our study expands upon recent work explaining in-context learning as implicit Bayesian inference, where language models infer shared latent concepts from examples. We analyze GPT-4's semantic attention post-domain priming, using computational linguistics to quantify response similarity to lexicographically independent queries with the same intent. We assess potential privacy breaches from inadvertent domain anchorage, examining how attention and embedding layers process linguistic patterns. We hypothesize that domain-specific words receiving higher gradient updates can introduce bias, create semantic echo chambers, and oversimplify relationships. Grounded in Mohamed Zakaria Kurdi's frameworks, this research uses lexical, semantic, syntactic, and positional similarities to analyze GPT-4's vector transformations and attention distributions. By simulating domain-specific interactions through declarative primes and interrogative inputs, we highlight significant privacy and ethical concerns, as the model may share information across users due to domain anchorage.
dc.format.extent	10
dc.identifier.doi	10.24251/HICSS.2025.841
dc.identifier.isbn	978-0-9981331-8-8
dc.identifier.other	50cbd0b7-2ee7-4d8a-9c9d-8126304993e3
dc.identifier.uri	https://hdl.handle.net/10125/109692
dc.relation.ispartof	Proceedings of the 58th Hawaii International Conference on System Sciences
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.uri	https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject	Artifical Intelligence Security: Ensuring Safety, Trustworthiness, and Responsibility in AI Systems
dc.subject	domain anchorage, implicit profiling in ai, information dissemination, lexicographic similarity., semantic attention
dc.title	Domain Anchorage in GPT-4: A Computational Linguistic Analysis of Lexicographic Profiling and Its Implications for Unintended Information Dissemination
dc.type	Conference Paper
dc.type.dcmi	Text
prism.startingpage	7036

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 0686.pdf
Size:: 440.6 KB
Format:: Adobe Portable Document Format

Download

Collections

Artifical Intelligence Security: Ensuring Safety, Trustworthiness, and Responsibility in AI Systems