Simulating Strategic Reasoning: Comparing the Ability of Single LLMs and Multi-Agent Systems to Replicate Human Behavior

dc.contributor.authorSreedhar, Karthik
dc.contributor.authorChilton, Lydia
dc.date.accessioned2024-12-26T21:05:05Z
dc.date.available2024-12-26T21:05:05Z
dc.date.issued2025-01-07
dc.description.abstractWhen creating policies, plans, or designs for people, it is challenging for designers to foresee all of the ways in which people may reason and behave. Recently, Large Language Models (LLMs) have been shown to be able to simulate human reasoning. We extend this work by measuring LLMs’ ability to simulate strategic reasoning in the ultimatum game – a classic economics bargaining experiment. Experimental evidence shows human strategic reasoning is complex – people will often choose to “punish” other players to enforce social norms even at personal expense. We test if LLMs can replicate this behavior in simulation, comparing two structures: single LLMs and multi-agent systems. We compare their abilities to (1) simulate human-like reasoning in the ultimatum game, (2) simulate two player personalities, greedy and fair, and (3) create robust strategies that are logically complete and consistent with personality. Our evaluation shows that multi-agent systems are more accurate than single LLMs (88% vs. 50%) in simulating human reasoning and actions for personality pairs. Thus, there is potential to use LLMs to simulate human strategic reasoning to help decision and policy-makers perform preliminary explorations of how people behave in systems.
dc.format.extent10
dc.identifier.doihttps://doi.org/10.24251/HICSS.2025.100
dc.identifier.isbn978-0-9981331-8-8
dc.identifier.otherc34dbb2b-ecff-4a3f-b607-fe61fecf341a
dc.identifier.urihttps://hdl.handle.net/10125/108938
dc.relation.ispartofProceedings of the 58th Hawaii International Conference on System Sciences
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectAI Model Evaluation
dc.subjectlarge language models, multi-agent systems, social simulation, strategic reasoning
dc.titleSimulating Strategic Reasoning: Comparing the Ability of Single LLMs and Multi-Agent Systems to Replicate Human Behavior
dc.typeConference Paper
dc.type.dcmiText
prism.startingpage831

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
0083.pdf
Size:
589.08 KB
Format:
Adobe Portable Document Format