Domain Anchorage in GPT-4: A Computational Linguistic Analysis of Lexicographic Profiling and Its Implications for Unintended Information Dissemination
dc.contributor.author | Challappa, Lekha | |
dc.contributor.author | Zhang, Jenevieve | |
dc.contributor.author | Garg, Rajiv | |
dc.date.accessioned | 2024-12-26T21:10:56Z | |
dc.date.available | 2024-12-26T21:10:56Z | |
dc.date.issued | 2025-01-07 | |
dc.description.abstract | Our study expands upon recent work explaining in-context learning as implicit Bayesian inference, where language models infer shared latent concepts from examples. We analyze GPT-4's semantic attention post-domain priming, using computational linguistics to quantify response similarity to lexicographically independent queries with the same intent. We assess potential privacy breaches from inadvertent domain anchorage, examining how attention and embedding layers process linguistic patterns. We hypothesize that domain-specific words receiving higher gradient updates can introduce bias, create semantic echo chambers, and oversimplify relationships. Grounded in Mohamed Zakaria Kurdi's frameworks, this research uses lexical, semantic, syntactic, and positional similarities to analyze GPT-4's vector transformations and attention distributions. By simulating domain-specific interactions through declarative primes and interrogative inputs, we highlight significant privacy and ethical concerns, as the model may share information across users due to domain anchorage. | |
dc.format.extent | 10 | |
dc.identifier.doi | 10.24251/HICSS.2025.841 | |
dc.identifier.isbn | 978-0-9981331-8-8 | |
dc.identifier.other | 50cbd0b7-2ee7-4d8a-9c9d-8126304993e3 | |
dc.identifier.uri | https://hdl.handle.net/10125/109692 | |
dc.relation.ispartof | Proceedings of the 58th Hawaii International Conference on System Sciences | |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/ | |
dc.subject | Artifical Intelligence Security: Ensuring Safety, Trustworthiness, and Responsibility in AI Systems | |
dc.subject | domain anchorage, implicit profiling in ai, information dissemination, lexicographic similarity., semantic attention | |
dc.title | Domain Anchorage in GPT-4: A Computational Linguistic Analysis of Lexicographic Profiling and Its Implications for Unintended Information Dissemination | |
dc.type | Conference Paper | |
dc.type.dcmi | Text | |
prism.startingpage | 7036 |
Files
Original bundle
1 - 1 of 1