Abstract
Large language models (LLMs), through their exposure to massive collections of online text, learn to reproduce the perspectives and linguistic styles of diverse social and cultural groups. This capability suggests a powerful social scientific application—the simulation of empirically realistic, culturally situated human subjects. Synthesizing recent research in artificial intelligence and computational social science, we outline a methodological foundation for simulating human subjects and their social interactions. We then identify six characteristics of current models that are likely to impair the realistic simulation of human subjects: bias, uniformity, atemporality, disembodiment, linguistic cultures, and alien intelligence. For each of these areas, we discuss promising approaches for overcoming their associated shortcomings. Given the rate of change of these models, we advocate for an ongoing methodological program for the simulation of human subjects that keeps pace with rapid technical progress, and caution that validation against human subjects data remains essential to ensure simulation accuracy.
