Balancing Large Language Model Alignment and Algorithmic Fidelity in Social Science Research

Abstract

Generative artificial intelligence (AI) has the potential to revolutionize social science research. However, researchers face the difficult challenge of choosing a specific AI model, often without social science-specific guidance. To demonstrate the importance of this choice, we present an evaluation of the effect of alignment, or human-driven modification, on the ability of large language models (LLMs) to simulate the attitudes of human populations (sometimes called silicon sampling). We benchmark aligned and unaligned versions of six open-source LLMs against each other and compare them to similar responses by humans. Our results suggest that model alignment impacts output in predictable ways, with implications for prompting, task completion, and the substantive content of LLM-based results. We conclude that researchers must be aware of the complex ways in which model training affects their research and carefully consider model choice for each project. We discuss future steps to improve how social scientists work with generative AI tools.

Keywords

artificial intelligence alignment large language models silicon sampling computational social science

Get full access to this article

View all access options for this article.

References

Achen

Christopher H.

Bartels

Larry M.

. 2016. Democracy for Realists: Why Elections Do Not Produce Responsive Government. Princeton, NJ: Princeton University Press.

Aher

Gati V.

Arriaga

Rosa I.

Kalai

Adam Tauman

(2023) Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies. In Proceedings of the 40th International Conference on Machine Learning, ed. Krause

Andreas

Brunskill

Emma

Cho

Kyunghyun

Engelhardt

Barbara

Sabato

Sivan

Scarlett

Jonathan

. Vol. 202 of Proceedings of Machine Learning Research PMLR, pp. 337–371. https://proceedings.mlr.press/v202/aher23a.html .

Ahmed

Toufique

Devanbu

Premkumar

Treude

Christoph

Pradel

Michael

(2024) “Can LLMs Replace Manual Annotation of Software Engineering Artifacts?”. https://arxiv.org/abs/2408.05534 .

AI@Meta (2024) “Llama 3 Model Card”. https://github.com/meta-llama/llama3/blob/main/MODEL_CARD.md .

Anthropic (2024) “The Claude 3 Model Family: Opus, Sonnet, Haiku.” [Accessed 15-08-2024]. https://www-cdn.anthropic.com/de8ba9b01c9ab7cbabf5c33b80b7bbc618857627/Model_Card_Claude_3.pdf.

Arditi

Andy

Obeso

Oscar

Syed

Aaquib

Paleka

Daniel

Rimsky

Nina

Gurnee

Wes

Nanda

Neel

(2024) “Refusal in Language Models Is Mediated by a Single Direction.” arXiv preprint arXiv:2406.11717 .

Argyle

Lisa P.

Bail

Christopher A.

Busby

Ethan C.

Gubler

Joshua R.

Howe

Thomas

Rytting

Christopher

Sorensen

Taylor

et al. 2023b. “Leveraging AI for Democratic Discourse: Chat Interventions Can Improve Online Political Conversations At Scale.” Proceedings of the National Academy of Sciences 120(41): e2311627120.

Argyle

Lisa P.

Busby

Ethan C.

Fulda

Nancy

Gubler

Joshua R.

Rytting

Christopher

Wingate

David

. 2023a. “Out of One, Many: Using Language Models to Simulate Human Samples.” Political Analysis 3(3): 337–351. https://doi.org/10.1371/journal.pclm.0000429 .

Argyle

Lisa P.

Busby

Ethan C.

Gubler

Joshua R.

Lyman

Alex

Olcott

Justin

Pond

Jackson

Wingate

David

(2024) “Testing Theories of Political Persuasion Using Artificial Intelligence.” Working Paper .

10.

Askell

Amanda

Bai

Yuntao

Chen

Anna

Drain

Dawn

Ganguli

Deep

Henighan

Tom

Jones

Andy

et al. (2021) “A General Language Assistant as a Laboratory for Alignment.” CoRR abs/2112.00861. https://arxiv.org/abs/2112.00861.

11.

Bai

Yuntao

Jones

Andy

Ndousse

Kamal

Askell

Amanda

Chen

Anna

Das Sarma

Nova

Drain

Dawn

et al. (2022) “Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback”. https://arxiv.org/abs/2204.05862 .

12.

Bail

Christopher A.

. 2024. “Can Generative AI Improve Social Science?.” Proceedings of the National Academy of Sciences 121(21): e2314021121.

13.

Bender

Emily M.

Gebru

Timnit

McMillan-Major

Angelina

Shmitchell

Shmargaret

(2021) On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, pp. 610–623.

14.

Bisbee

James

Clinton

Joshua D.

Dorff

Cassy

Kenkel

Brenton

Larson

Jennifer M.

(2024) “Synthetic Replacements for Human Survey Data? The Perils of Large Language Models.” Political Analysis, pp. 1–16.

15.

Blumer

Herbert

. 1958. “Race Prejudice As a Sense of Group Position.” Pacific Sociological Review 1(1): 3–7.

16.

Boelaert

Julien

Coavoux

Samuel

Ollion

Etienne

Petev

Ivaylo D.

Präg

Patrick

(2024) “Machine Bias. Generative Large Language Models Have a View of Their Own.” osf.io/preprints/socarxiv/r2pnb.

17.

Caliskan

Aylin

Bryson

Joanna J.

Narayanan

Arvind

. 2017. “Semantics Derived Automatically From Language Corpora Contain Human-like Biases.” Science (New York, N.Y.) 356(6334): 183–186.

18.

Cheng

Myra

Durmus

Esin

Jurafsky

Dan

(2023) “Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models.” https://arxiv.org/abs/2305.18189 .

19.

Chu

Junjie

Liu

Yugeng

Yang

Ziqing

Shen

Xinyue

Backes

Michael

Zhang

Yang

(2024) “Comprehensive assessment of jailbreak attacks against llms.” arXiv preprint arXiv:2402.05668 .

20.

Chung

Eun Yi

Romano

Joseph P.

. 2013. “Exact and Asymptotically Robust Permutation Tests.” The Annals of Statistics 41(2): 484–507.

21.

Consensus (2024) “Consensus: AI-powered Academic Search Engine.” https://consensus.app/ .

22.

Copilot (2024) “Microsoft copilot.” https://copilot.microsoft.com/ .

23.

Costello

Thomas H.

Pennycook

Gordon

Rand

David G.

. 2024. “Durably Reducing Conspiracy Beliefs Through Dialogues with AI.” Science (New York, N.Y.) 385(6714): eqdg1814. https://www.science.org/doi/abs/10.1126/science.adq1814 .

24.

de Bolle

Monica

(2024) “AI’s carbon footprint appears likely to be alarming.” Peterson Institute for International Economics. https://www.piie.com/blogs/realtime-economics/2024/ais-carbon-footprint-appears-likely-be-alarming.

25.

Demszky

Dorottya

Yang

Diyi

Yeager

David S.

Bryan

Christopher J.

Clapper

Margarett

Chandhok

Susannah

Eichstaedt

Johannes C.

et al. 2023. “Using Large Language Models in Psychology.” Nature Reviews Psychology 2(11): 688–701. https://doi.org/10.1038/s44159-023-00241-5 .

26.

Dillion

Danica

Tandon

Niket

Yuling

Gray

Kurt

. 2023. “Can AI Language Models Replace Human Participants?.” Trends in Cognitive Sciences 27(7): 597–600. https://doi.org/10.1016/j.tics.2023.04.008 .

27.

Ding

Peng

Feller

Avi

Miratrix

Luke

. 2016. “Randomization Inference for Treatment Effect Variation.” Journal of the Royal Statistical Society Statitiscs Methodology, Series B 78: 655–671.

28.

Dubey

Abhimanyu

Jauhri

Abhinav

Pandey

Abhinav

Kadian

Abhishek

Al-Dahle

Ahmad

Letman

Aiesha

Mathur

Akhil

et al. (2024) “The Llama 3 Herd of Models.” https://arxiv.org/abs/2407.21783 .

29.

Edgell

Penny

Gerteis

Joseph

Hartmann

Douglas

. 2006. “Atheists As ‘Other’: Moral Boundaries and Cultural Membership in American Society.” Social Forces 71: 211–234.

30.

Elicit (2024) “Elicit: The AI Research Assistant.” https://elicit.com .

31.

Ellemers

Naomi

. 2018. “Gender Stereotypes.” Annual Review of Psychology 69: 275–298.

32.

Gerber

Alan S.

Green

Donald P.

. 2012. Field Experiments: Design, Analysis, and Interpretation. New York: W. W. Norton.

33.

Gilardi

Fabrizio

Alizadeh

Meysam

Kubli

Maël

. 2023. “ChatGPT Outperforms Crowd Workers for Text-annotation Tasks.” Proceedings of the National Academy of Sciences 120(30): e2305016120.

34.

Goldstein

Josh A.

Sastry

Girish

(2023) “The Coming Age of AI-powered Propaganda. How to Defend Against Supercharged Disinformation.” Foreign Affairs, p. 7.

35.

Google Gemini Team (2024) “Gemini: Advanced Conversational AI Models.” https://www.deepmind.com/gemini. Accessed: 2024-08-15.

36.

Hackenburg

Kobi

Margetts

Helen

. 2024. “Evaluating the Persuasive Influence of Political Microtargeting with Large Language Models.” Proceedings of the National Academy of Sciences 121(24): e2403116121.

37.

Zeyu

Huang

Chieh-Yang

Ding

Chien-Kuang Cornelia

Rohatgi

Shaurya

Huang

Ting-HaoKenneth

(2024) If in a Crowdsourced Data Annotation Pipeline, a GPT-4. CHI ’24 New York, NY, USA: Association for Computing Machinery. https://doi.org/10.1145/3613904.3642834 .

38.

Heseltine

Michael

vin Hohenberg

Bernhard Clemm

(2024) “Large language models as a substitute for human experts in annotating political text.” Research and Politics.

39.

Hewitt

Luke

Ashokkumar

Ashwini

Ghezae

Isaias

Willer

Robb

(2024) “Predicting Results of Social Science Experiments Using Large Language Models.” https://samim.io/dl/Predicting%20results%20of%20social%20science%20experiments%20using%20large%20language%20models.pdf .

40.

Horton

John J.

(2023) Large language models as simulated economic agents: What can we learn from homo silicus? Technical report National Bureau of Economic Research.

41.

Hutchings

Vincent L.

Valentino

Nicholas A.

. 2004. “The Centrality of Race in American Politics.” Annual Review of Political Science 7: 383–408.

42.

Jiaming

Liu

Mickel

Dai

Juntao

Pan

Xuehai

Zhang

Chi

Bian

Zhang

Chi

et al. (2023) “BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset.” https://arxiv.org/abs/2307.04657 .

43.

Jiang

Albert Q.

Sablayrolles

Alexandre

Mensch

Arthur

Bamford

Chris

Chaplot

Devendra Singh

de las Casas

Diego

Bressand

Florian

et al. (2023) “Mistral 7B.” https://arxiv.org/abs/2310.06825 .

44.

Jiang

Albert Q

Sablayrolles

Alexandre

Roux

Antoine

Mensch

Arthur

Savary

Blanche

Bamford

Chris

Chaplot

Devendra Singh

et al. (2024) “Mixtral of Experts.” https://arxiv.org/abs/2401.04088 .

45.

Kim

Junsol

Lee

Byungkyu

. 2024. “AI-Augmented Surveys: Leveraging Large Language Models and Surveys for Opinion Prediction.” https://arxiv.org/abs/2305.09620 .

46.

Kinder

Donald R.

Kam

Cindy D.

. 2009. Us Against them : Ethnocentric Foundations of American Opinion. Chicago: University of Chicago Press.

47.

Kirk

Robert

Mediratta

Ishita

Nalmpantis

Christoforos

Luketina

Jelena

Hambro

Eric

Grefenstette

Edward

Raileanu

Roberta

(2024) Understanding the Effects of RLHF on LLM Generalisation and Diversity. In The Twelfth International Conference on Learning Representations. https://openreview.net/forum?id=PXD3FAVHJT .

48.

Kleinberg

Jon

Lakkaraju

Himabindu

Leskovec

Jure

Ludwig

Jens

Mullainathan

Sendhil

. 2018. “Human Decisions and Machine Predictions.” The Quarterly Journal of Economics 133(1): 237–293. https://doi.org/10.1093/qje/qjx032 .

49.

Kozlowski

Austin C.

Kwon

Hyunku

Evans

James A.

(2024) “In Silico Sociology: Forecasting COVID-19 Polarization with Large Language Models.” https://arxiv.org/abs/2407.11190 .

50.

Kreps

Sarah

McCain

R. Miles

Brundage

Miles

. 2022. “All the News That’s Fit to Fabricate: AI-Generated Text As a Tool of Media Misinformation.” Journal of Experimental Political Science 9(1): 104–117.

51.

Lake

Thom

Choi

Eunsol

Durrett

Greg

(2024) “From Distributional to Overton Pluralism: Investigating Large Language Model Alignment.” https://arxiv.org/abs/2406.17692 .

52.

Lee

Sanguk

Peng

Tai-Quan

Goldberg

Matthew H.

Rosenthal

Seth A.

Kotcher

John E.

Maibach

Edward W.

Leiserowitz

Anthony

. 2024. “Can Large Language Models Estimate Public Opinion About Global Warming? An Empirical Assessment of Algorithmic Fidelity and Bias.” PLOS Climate 3(8): 1–14. https://doi.org/10.1371/journal.pclm.0000429 .

53.

Victoria R.

Chen

Yida

Saphra

Naomi

(2024) “ChatGPT Doesn’t Trust Chargers Fans: Guardrail Sensitivity in Context.” https://arxiv.org/abs/2407.06866 .

54.

Liu

Andy

Diab

Mona

Fried

Daniel

(2024) “Evaluating Large Language Model Biases in Persona-Steered Generation.” https://arxiv.org/abs/2405.20253 .

55.

Lyman, Alex, Bryce Hepner, Lisa P. Argyle, Ethan C. Busby, Joshua R. Gubler, and David Wingate (2025) “Replication Materials for Balancing Large Language Model Alignment and Algorithmic Fidelity in Social Science Research”. GitHub. https://github.com/AlexMLyman/Replication-Materials-for-Balancing-Large-Language-Model-Alignment-and-Algorithmic-Fidelity.

56.

Marcel

Binz

(2024) “Llama-3.1-Centaur-70B.” https://huggingface.co/marcelbinz/Llama-3.1-Centaur-70B-adapter .

57.

Mellon

Jonathan

Bailey

Jack

Scott

Ralph

Breckwoldt

James

Miori

Marta

Schmedeman

Phillip

(2024) “Do AIs know what the most important issue is? Using language models to code open-text social survey responses at scale.” Research and Politics.

58.

Mize

Trenton D.

. 2016. “Sexual Orientation in the Labor Market.” American Sociological Review 81: 1132–1160.

59.

Moore

Jared

Deshpande

Tanvi

Yang

Diyi

(2024) “Are Large Language Models Consistent over Value-laden Questions?” https://arxiv.org/abs/2407.02996 .

60.

Obermeyer

Ziad

Powers

Brian

Vogeli

Christine

Mullainathan

Sendhil

. 2019. “Dissecting Racial Bias in An Algorithm Used to Manage the Health of Populations.” Science (New York, N.Y.) 366(6464): 447–453.

61.

OpenAI (2023) “ChatGPT: A Conversational AI Model.” https://www.openai.com/chatgpt. Accessed: 2024-08-15.

62.

OpenAI

Josh Achiam

Adler

Steven

Agarwal

Sandhini

Ahmad

Lama

Akkaya

Ilge

Aleman

Florencia Leoni

Almeida

Diogo

et al. (2024) “GPT-4 Technical Report.” https://arxiv.org/abs/2303.08774 .

63.

Pachot

Arnault

Petit

Thierry

(2024) “Can Large Language Models Accurately Predict Public Opinion? A Review”.

64.

Palmer

Alexis

Spirling

Arthur

. 2023. “Large Language Models Can Argue in Convincing Ways About Politics, But Humans Dislike AI Authors: Implications for Governance.” Political Science 75(3): 281–291.

65.

Panch

Trishan

Mattie

Heather

Atun

Rifat

. 2019. “Artificial Intelligence and Algorithmic Bias: Implications for Health Systems.” Journal of Global Health 9(2): 010318.

66.

Park

Joon Sung

Zou

Carolyn Q.

Shaw

Aaron

Hill

Benjamin Mako

Cai

Carrie

Morris

Meredith Ringel

Willer

Robb

Liang

Percy

Bernstein

Michael S.

(2024) “Generative Agent Simulations of 1,000 People.” https://arxiv.org/abs/2411.10109 .

67.

Parrish

Alicia

Chen

Angelica

Nangia

Nikita

Padmakumar

Vishakh

Phang

Jason

Thompson

Jana

Htut

Phu Mon

et al. (2022) BBQ: A hand-built bias benchmark for question answering. In Findings of the Association for Computational Linguistics: ACL 2022, ed. Muresan

Smaranda

Nakov

Preslav

Villavicencio

Aline

. Dublin, Ireland: Association for Computational Linguistics, pp. 2086–2105. https://aclanthology.org/2022.findings-acl.165 .

68.

Phillips

Nolan E.

Levy

Brian L.

Sampson

Ryan J.

Small

Mario L.

Wang

Ryan Q.

. 2021. “The Social Integration of American Cities: Network Measures of Connectedness Based on Everyday Mobility Across Neighborhoods.” Sociological Methods & Research 50: 1189–1214.

69.

Yao

Wang

Jue

. 2024. “Performance and Biases of Large Language Models in Public Opinion Simulation.” Academy of Management Proceedings 2024(1): 10298. https://doi.org/10.5465/AMPROC.2024.10298abstract .

70.

Radford

Alec

Jeff

Child

Rewon

Luan

David

Amodei

Dario

Sutskever

Ilya

(2019) Language Models are Unsupervised Multitask Learners. https://api.semanticscholar.org/CorpusID:160025533 .

71.

Rafailov

Rafael

Sharma

Archit

Mitchell

Eric

Manning

Christopher D.

Ermon

Stefano

Finn

Chelsea

(2023) Direct Preference Optimization: Your Language Model is Secretly a Reward Model. In Thirty-seventh Conference on Neural Information Processing Systems. https://openreview.net/forum?id=HPuSIXJaa9 .

72.

Ren

Shaolei

Wierman

Adam

(2024) “The Uneven Distribution of AI’s Environmental Impacts.” Harvard Business Review. https://hbr.org/2024/07/the-uneven-distribution-of-ais-environmental-impacts.

73.

Risman

Barbara

. 2004. “Gender As a Social Structure.” Gender & Society 18: 429–450.

74.

Robinson

Joshua

Rytting

Christopher Michael

Wingate

David

(2023) “Leveraging Large Language Models for Multiple Choice Question Answering.” https://arxiv.org/abs/2210.12353 .

75.

Rothschild

Jacob E.

Howat

Adam J.

Shafranek

Richard M.

Busby

Ethan C.

. 2019. “Pigeonholing Partisans: Stereotypes of Party Supporters and Partisan Polarization.” Political Behavior 41: 423–443.

76.

Santurkar

Shibani

Durmus

Esin

Ladhak

Faisal

Lee

Cinoo

Liang

Percy

Hashimoto

Tatsunori

(2023) Whose opinions do language models reflect? In Proceedings of the 40th International Conference on Machine Learning. ICML’23 JMLR.org.

77.

Sherif

Muzafer

Harvey

O. J.

Jack White

Hood

William R.

Sherif

Carolyn W.

. 1961. Intergroup Conflict and Cooperation: The Robbers Cave Experiment. Norman, OK: Institute of Group Relations.

78.

Shoup

Ella

(2024) “AI and ESG: Understanding the Environmental Impact of AI and LLMs.” Holistic AI. https://www.holisticai.com/blog/environmental-impact-ai-llms.

79.

Srivastava

Aarohi

, et al. (2023) “Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.” Transactions on Machine Learning Research. https://openreview.net/forum?id=uyTL5Bvosj .

80.

Suzgun

Mirac

Gur

Tayfun

Bianchi

Federico

Daniel E.

Icard

Thomas

Jurafsky

Dan

Zou

James

(2024) “Belief in the Machine: Investigating Epistemological Blind Spots of Language Models.” https://arxiv.org/abs/2410.21195 .

81.

Team

Gemma

(2024) “Gemma.” https://www.kaggle.com/m/3301 .

82.

Tessler

Michael Henry

Bakker

Michiel A.

Jarrett

Daniel

Sheahan

Hannah

Chadwick

Martin J.

Koster

Raphael

Evans

Georgina

Campbell-Gillingham

Lucy

Collins

Tantum

Parkes

David C.

Botvinick

Matthew

Summerfield

Christopher

. 2024. “AI can help humans find common ground in democratic deliberation.” Science (New York, N.Y.) 386(6719): eadq2852.

83.

Thébaud

Sarah

Kornrich

Sabino

Ruppanner

Leah

. 2021. “Good Housekeeping, Great Expectations: Gender and Housework Norms.” Sociological Methods & Research 50: 1189–1214.

84.

Velez

Yamil

Liu

Patrick

(2024) “Confronting Core Issues: A Critical Assessment of Attitude Polarization Using Tailored Experiments.” American Political Science Review.

85.

Wei

Alexander

Haghtalab

Nika

Steinhardt

Jacob

. 2024. “Jailbroken: How Does Llm Safety Training Fail?.” Advances in Neural Information Processing Systems 36: 80079–80100.

86.

Wei

Jason

Tay

Bommasani

Rishi

Raffel

Colin

Zoph

Barret

Borgeaud

Sebastian

Yogatama

Dani

et al. (2022) “Emergent Abilities of Large Language Models.” https://arxiv.org/abs/2206.07682 .

87.

Whitehead

Andrew L.

Perry

Samuel L.

Baker

Joseph O.

. 2018. “Make America Christian Again: Christian Nationalism and Voting for Donald Trump in the 2016 Presidential Election.” Sociology of Religion 79: 147–171.

88.

Zihao

Liu

Deng

Gelei

Yuekang

Picek

Stjepan

(2024) “LLM Jailbreak Attack Versus Defense Techniques—A Comprehensive Study.” arXiv preprint arXiv:2402.13457 .

89.

Zhang

Shengyu

Dong

Linfeng

Xiaoya

Zhang

Sen

Sun

Xiaofei

Wang

Shuhe

Jiwei

et al. (2024) “Instruction Tuning for Large Language Models: A Survey.” https://arxiv.org/abs/2308.10792 .

90.

Zheng

Lianmin

Chiang

Wei-Lin

Sheng

Ying

Zhuang

Siyuan

Zhanghao

Zhuang

Yonghao

Lin

et al. (2024) Judging LLM-as-a-judge with MT-bench and Chatbot Arena. In Proceedings of the 37th International Conference on Neural Information Processing Systems. NIPS ’23 Red Hook, NY, USA: Curran Associates Inc.

91.

Zhu

Chiwei

Benfeng

Wang

Quan

Zhang

Yongdong

Mao

Zhendong

(2023) On the Calibration of Large Language Models and Alignment. In Findings of the Association for Computational Linguistics: EMNLP 2023, ed. Bouamor

Houda

Pino

Juan

Bali

Kalika

. Singapore: Association for Computational Linguistics, pp. 9778–9795. https://aclanthology.org/2023.findings-emnlp.654 .

92.

Ziems

Caleb

Held

William

Shaikh

Omar

Chen

Jiaao

Zhang

Zhehao

Yang

Diyi

(2024) “Can Large Language Models Transform Computational Social Science?”. https://arxiv.org/abs/2305.03514 .

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

6.17 MB