Sage Journals: Discover world-class research

Abstract

To address policy challenges such as climate change or pandemics effectively, policymakers require insights into the views of the general public. However, traditional large-scale quantitative methods like surveys and aggregated social media analytics lack nuance, while qualitative approaches such as interviews are labor intensive and thus limited to small samples. We discuss how artificial intelligence tools known as large language models (LLMs) could be leveraged to surface the detailed views of large numbers of citizens on policy issues. In particular, we showcase an LLM-supported method designed to provide both quantitative and qualitative insights from large samples of respondents who provide free-text responses to open-ended questions. We propose that such approaches could help policymakers efficiently integrate citizens’ input into their decision-making processes and give them timely, nuanced insights that complement those produced by established methods of obtaining large-scale public input.

Keywords

large language models (LLMs)citizen perspectives policymaking

Understanding citizens’ perspectives on societal challenges, such as climate change, artificial intelligence (AI), and pandemics, is crucial for the successful design, implementation, and adoption of public policies.^1–6 In the COVID-19 pandemic, for instance, a better understanding of citizens’ views and needs along with more effective communication could have strengthened public trust in such institutions as governments and health organizations.^7–11 This enhanced trust, in turn, might have attenuated the impact of misinformation and thereby led to higher vaccination rates and wider adoption of preventive measures.^9,10,12 In general, awareness of citizens’ perspectives on policy challenges and the measures proposed to address them can help policymakers align their decisions with public needs and expectations,^9–11 potentially facilitating more timely and appropriate responses. Policymakers and the public alike could thus benefit from having methods to (a) elicit citizens’ perspectives on a large scale and (b) analyze these perspectives quickly and with sufficient nuance.

Although quantitative methods such as surveys and polls^13,14 can collect opinions from large samples, they use closed-form questions—ones that have preset answers—which require pollsters to have a good idea in advance of likely responses.¹⁵ And more open-ended approaches that analyze large collections of text (such as social media postings or text messages) to identify broad themes and patterns^16,17 can likewise fail to capture nuance. Conversely, nuanced qualitative approaches^18–22 that provide more in-depth insights through open-ended questions and conversations (for example, interviews) allow for detailed and varied responses and have the potential to uncover new and unexpected insights. However, those approaches are labor intensive, which limits their use to small samples of people or would require considerable time and resources when applied to large samples.

Today’s digital society offers new opportunities for simplifying and accelerating the collection of citizens’ perspectives at scale^23–26 because recently emerged AI tools—particularly large language models (LLMs)—can quickly analyze vast numbers of open-ended responses from citizens and capture nuance. In the sidebar, “A Brief History: LLMs as Research Tools in the Behavioral, Cognitive, and Social Sciences,” we describe some key technological advances that gave rise to and underlie LLMs, and we note that LLMs have been applied to such policy-related tasks as gleaning citizen sentiments from online social postings and forums. Below, we propose that LLM-based approaches can be more broadly applied to other text sources relevant to policymaking, such as transcripts of brainstorming sessions, ^27,28 and that LLMs could potentially provide valuable, fresh insights into how citizens perceive policy issues. However, this proposal raises important questions: How can such policy perspectives be elicited, and what could policymakers expect to learn from those responses? To answer those questions and highlight the potential of using LLM-driven methods^29–32 to enhance policymaking, we describe an approach we applied in a recent study.^33,34 We also discuss how such approaches might complement established methods of eliciting the public’s opinions and help policymakers to efficiently integrate citizens’ insights into their decision-making processes.

An Approach to Surfacing Citizens’ Policy Perspectives at Scale

Our study showcasing the potential of an LLM-based approach featured a brainstorming task that elicited citizens’ ideas on policy issues.^33,34 In the study, conducted online, 300 U.K. citizens (quota matched to reflect the U.K.’s demographics in age, gender, and ethnicity) were asked to generate up to five ideas for each of five policy problems derived from the United Nations’ sustainable development goals³⁵—a set of objectives for meeting global challenges. For example, they were asked, “How could we improve vaccination rates in rural environments?” and “How could we reduce stress levels in our society?” They had three minutes to brainstorm per problem. Below, we focus on the vaccination responses; for details on the responses to the other questions, see Reference 33.

The task aimed to capture respondents’ first thoughts—that is, those that were immediately accessible and relevant when they brought the issue to mind. Respondents generated ideas such as “free bus to vaccination center” and “reminding people of the negative effects if not taken.” We then used an LLM to clean out ambiguity and extract specific elements—units of information we refer to as concepts—from their responses, such as “transport subsidy” or “vaccination center.” See Figure 1 for a schematic depiction of how the LLM extracted concepts from the respondents’ answers.

Figure 1.

Example of an LLM method for surfacing citizens’ policy perspectives, Step 1

Figure 2.

Example of an LLM method for surfacing citizens’ policy perspectives, Steps 2–4

LLMs can reveal which unique concepts are mentioned by a group of respondents as a whole or by subgroups (such as younger versus older citizens). Extraction of these concepts makes it possible to apply other analyses that provide deeper insights into the importance of the mentioned concepts. For instance, network analyses can clarify how the concepts link to one another and indicate whether and to what extent the ideas of different subgroups overlap. For example, do younger and older adults think of similar things when asked about vaccination? What about rural and urban citizens?

In particular, to obtain information that goes beyond concept frequencies, investigators can use the LLMs’ outputs to construct data representations known as semantic networks³⁶ that depict how closely the meanings of multiple concepts are related to one another (see the left panel of Figure 2). Semantic networks have been applied to study cognitive processes,³⁶ casting light on such processes as how people recall information^37–39 and connect concepts,^25,40,41 revealing how individual differences might affect these processes.⁴² In the policy realm, one way that the importance of a given concept to a given group of people can be assessed is by measuring its centrality, or connectedness to other concepts. For example, one can examine the degree centrality of a concept by counting the number of connections between it and others.

The right panel of Figure 2 illustrates the kind of centrality data that might appear in a survey like ours. After participants have been asked about ways to increase vaccination rates in rural environments, a degree centrality measure might indicate that the concept “health care” was more central to the thinking of one subgroup (A) when it pondered the vaccination problem whereas the concept “public infrastructure” was more central to Subgroup B.

Going beyond frequency measures to examine centrality also provides finer-grained insights, such as revealing when concepts that do not necessarily receive the most mentions can nonetheless be implicitly important to respondents. The strength of the influence of such concepts is somewhat akin to that of a person on social media who is not the most popular or visible individual but has significant influence because of interacting with many different groups.⁴³ Measures of centrality could provide insights or inform hypotheses about which concepts most influence the perspectives of different groups of citizens on policy issues and thus which concepts might need to be targeted to produce the most effective policy measures or communication campaigns for those different groups.

For example, in our study, we compared the answers of rural and urban subgroups to the question, “How could we increase vaccination rates in rural environments?” (see Figure 3). Although the rural citizens explicitly mentioned the concepts “vaccination” and “vaccination education” more frequently than the urban citizens did, it was the urban citizens for whom those concepts were most central (that is, those concepts were connected to a larger number of other concepts within that subgroup’s ideas on the question). Conversely, the concepts “education” and “public awareness” were mentioned more frequently by urban respondents but were more central to the ideas of rural respondents.

Figure 3.

How measures of centrality add insights not provided by frequency measures alone

The use of network analysis can be extended in various ways and to various problems. It can, for instance, be applied to different subsets of a larger population (say, to subpopulations divided by age, gender, or political leaning). Or, depending on the goal, other metrics can be applied.^36,44,45 For example, the overall distance between concepts in the network (the average path length) reveals how interconnected ideas are or how easily relationships between distant concepts can be bridged. Similarly, if concepts tend to cluster (a feature reflected by a metric called a clustering coefficient), this pattern indicates the presence of overarching topics that encompass related concepts in meaning. For example, concepts such as “mandatory school immunizations,” “workplace vaccine mandates,” and “international travel vaccination requirements” could appear close together under a broader heading that might be called vaccination requirements and mandates. (See References 46 and 47 for examples of how the findings of network analyses have provided insights into how to change people’s beliefs about genetically modified foods and the importance of childhood vaccines.)

Implications for Policymakers

Citizens’ perspectives may not always suggest ready-to-implement solutions, but they can reveal new and viable paths for addressing societal challenges.^3,48–52 In our study, respondents did not necessarily directly say use schools as vaccination centers to improve vaccination rates in rural areas, but the idea of using existing public infrastructure did emerge from our analysis of their perceptions of the problem. Although the insights we derived from this approach do not conclusively provide solutions, they highlight potentially promising and sometimes surprising avenues for further quantitative and qualitative investigations.

The promise is evident in the finding that the concept “vaccination education” was more central to the thinking of urban citizens, whereas “education” and “public awareness” were more central to the thinking of rural citizens. This difference could indicate differing interpretations of the concept “education” in the context of vaccination. If so, understanding of this nuance could help guide the development of tailored communication strategies that resonate more effectively with each group. For instance, a broad health campaign in rural areas might focus on general health education, highlighting the importance of preventive care, regular check-ups, nutrition, and hygiene. Vaccinations could be framed as part of maintaining overall health, rather than treated as an isolated issue. Such insights may help policymakers to not only use traditional methods for understanding citizens’ views more effectively (say, by helping the policymakers identify new closed-form survey items that directly capture such perspectives) but also to develop more effective data collection methods and better informed public health interventions.

Going forward, the approach could be applied to a variety of text inputs beyond those explored here, including interviews, focus groups, social media posts, forum discussions, and messaging applications—potentially providing a valuable new tool for computational social science.⁵³ It could also complement established citizen participation methods—for example, enriching citizen panels ^2,22,54,55 with prepanel brainstorming and near-live processing of session transcripts to streamline discussions and the generation of useful insights.

LLM-based approaches like the one presented here offer other advantages as well. One is their potential to be integrated into existing policymaking processes, significantly accelerating the cycle of consultation, analysis, and policy design. In contrast to established qualitative and quantitative approaches, they yield timely yet nuanced and granular insights at scale. For instance, consider one way policymakers might have revised the approach the Dutch government took when designing its COVID-19 exit strategy.⁹ Using a closed-form survey, it solicited input on eight proposals for the relaxation of lockdown measures from 30,000 citizens. Instead, in advance of generating the survey, they might have elicited brainstorming proposals from the public, along with their views on the pros and cons. These insights could then have identified additional proposals and questions to be included in the survey and could have potentially led to the adoption of a different, not yet considered proposal.

Furthermore, LLMs provide powerful out-of-the-box capabilities for extracting concepts from open-ended responses and constructing semantic networks across diverse domains^56,57 without requiring the LLM to first be customized to the task at hand (that is, without necessarily training or tuning it on additional data). Instead, using a process called prompt engineering,^58,59 analysts can adjust the instructions given to the LLM. (See Figure 1 for an example of the prompt used in our study.) The ease of using “off-the-shelf” LLMs should enable policymakers to integrate the perspectives of large numbers of citizens into their work much more quickly than has been true in the past.

Challenges and Future Research

Establishing the validity of LLM-based approaches to policymaking and mitigating potential risks are key challenges and priorities for future research. It is critical to systematically assess the reliability, accuracy, and representativeness of methods like the one discussed here. In particular, research needs to examine the quality of the extracted insights, ensuring that they accurately reflect the raw text inputs and the perspectives of different demographic groups without perpetuating biases ingrained in the models through biases in the training data.^{56,57,60–62} Human validation of insights derived through such LLM-based approaches is essential in this process.^20,63,64

Future research should also compare LLM-based insights derived from different data collection approaches (such as brainstorming,^27,28 group deliberations,⁶⁵ polls, and qualitative interviews), examining the relative strengths and weaknesses of the available approaches for different policy goals and settings. This comparison would yield actionable guidance for both policy researchers and practitioners about when to use which approach—and why. Research should further explore synergies between our proposed approach and other possible uses of LLMs in policy contexts,^66,67 including using LLMs to scope likely reactions to behavioral interventions (see References 32 and 68 through 70 for examples of how LLMs can be used to simulate and study human behavior), design conversational agents that can better engage with citizens,⁷¹ and enhance the productivity of discussions about controversial topics.⁷² The semantic networks generated by our approach could potentially be used to empirically ground such LLM-based approaches, similarly to how curated databases of documents can ground other uses of LLMs.⁷³

Incorporating LLMs into policymaking would require adhering to best practices in terms of ensuring transparency,^74,75 mitigating errors,^62,76 using culturally relevant training data,^77,78 conducting regular bias audits,^61,79,80 and prioritizing the use of truly open-source LLMs,^81,82 whose inner workings are more transparent than those of proprietary technologies. These efforts should be conducted within a robust regulatory framework, akin to those applied in the medical field.⁸³ This framework should align with ongoing regulatory initiatives⁸⁴ that aim to ensure fairness, transparency, and accountability in the application of LLMs to policymaking.^74,78

Conclusion

LLM-based methods offer tremendous potential benefits for policymaking and society at large but also present significant risks. To maximize the benefits and minimize the risks, researchers would be wise to invest time now in exploring how and when methods such as the one presented here could be safely integrated into behaviorally informed policymaking processes.^85-87

A Brief History: LLMs as Research Tools in the Behavioral, Cognitive, & Social Sciences

Large language models (LLMs) represent the most advanced form of natural language processing (NLP) models available today. Broadly, NLP involves using mathematical and computational approaches to extract meaning from text, including everyday written or spoken text. Applied to large collections of text, these methods can, for instance, classify their contents by categories,⁸⁸ extract information on specific entities⁸⁹ or broad topics,⁹⁰ and identify arguments^91,92 and sentiments.⁹³ Below, we present a brief history of NLP from its early years to the introduction of LLMs,⁹⁴ highlighting its connections to the behavioral, cognitive, and social sciences.

Early NLP (1950s–1980s) relied on rule-based methods to analyze sentence structure. However, because language depends heavily on context, these approaches often struggled to capture nuanced meaning.⁹⁵ In the 1990s, statistical methods to better address meaning in text (in other words, semantics)—particularly, latent semantic analysis (LSA)⁹⁶—were introduced to analyze large collections of text. Semantic analyses identify relationships between words based on how often they appear in similar contexts, and they enabled investigators to gain a more sophisticated understanding of language patterns.⁹⁷ Their advent kickstarted a wave of research in psychology and cognitive science that used semantic analysis to study cognition⁹⁸—for example, assessing people’s knowledge⁹⁹ or comprehension¹⁰⁰ of topics or determining whether texts were coherent.¹⁰¹

The subsequent integration of machine learning techniques into NLP in the early 2010s significantly advanced the tasks that semantic analyses could accomplish, most prominently through the development of word embedding approaches, such as word2vec¹⁰² and gloVe,¹⁰³ that capture the meaning of text by going beyond simple word co-occurrence statistics and broad patterns across documents to focus on the immediate context in which words appear. Word embeddings have been extensively used in social sciences and psychology—for example, to identify implicit biases,¹⁰⁴ understand how people mentally represent risk,⁴² and analyze honesty in political discourse.¹⁰⁵ They have also enriched the usefulness of tools like psychology-informed dictionaries¹⁰⁶ (such as the Linguistic Inquiry and Word Count, or LIWC¹⁰⁷), by attending to the meanings and context of words rather than just their presence or absence and thus provide deeper insights than do standard dictionary-based approaches that rely only on word lists or word frequency counts.

NLP technology took another leap forward with the introduction of transformers,¹⁰⁸ such as BERT,¹⁰⁹ which have revolutionized text representation by more effectively capturing the contexts in which words are embedded and the relationships between words. Through a so-called attention mechanism,¹⁰⁸ transformers can focus on relevant words in a text even if the words are far apart.

Transformers have enhanced the ability of AI models to generalize across tasks—that is, to perform tasks they have not been explicitly trained for. This includes identifying similar topics or sentiments across large-scale text corpora. As a result, transformers have enabled applications such as conducting personality assessments,¹¹⁰ identifying suicide risk,¹¹¹ and recognizing antivaccination attitudes.¹¹²

LLMs are essentially built on large-scale transformers that are trained on vast amounts of data and have greater capabilities than the original transformers had—including the ability to provide more accurate, more context-aware text processing and to generate human-like text.¹¹³ Notable examples include generative pretrained transformers—the foundation of ChatGPT—as well as Gemini, Llama, Qwen, Mistral, Claude, Grok, DeepSeek, and others. LLMs’ capabilities surpass those of earlier NLP approaches by dynamically adapting to context; enabling nuanced language representation; and offering researchers powerful tools^29,32 for studying cognitive processes,^114,115 mapping mental representations,¹¹⁶ and advancing psychological measurement and experimentation.^31,117 They have been used, for example, in the study of everyday choices,¹¹⁸ risk preferences,¹¹⁹ and narratives.^115,120 LLMs have also been applied to policy-related tasks such as making public affairs documents transparent to citizens,¹²¹ identifying issues important to citizens,¹²² and gauging sentiments¹²³ evident in citizens’ social media posts and discussions in online forums.¹²⁴

For a fuller description of how NLP has been used in the behavioral sciences, see Reference 125.

Key Points for Policymakers

Existing LLM models could be used to identify the views of large numbers of citizens and distinguish between the opinions and priorities of different subgroups.

Such information could enable policymakers to quickly assess the views of many citizens before surveys or interventions are designed, thus producing more meaningful outcomes.

An application of the method to transcripts from brainstorming sessions about ways to solve policy problems, such as the low uptake of vaccinations, has demonstrated the technique’s ability to provide nuanced insights into the views of multiple subgroups.

Policy researchers and practitioners should further explore the potential benefits and risks of the approach.

Footnotes

Author Note

We thank Susannah Goss for editing the manuscript and Christoph Abels, Samuel Aeschbach, Caedyn Stinson, and Mubashir Sultan from the Adaptive Rationality Center at Max Planck Institute for Human Development for their feedback.

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: E. L.-L. and S. M. H. were funded by the Deutsche Forschungsgemeinschaft (DFG)—Project number 458366841 (POLTOOLS—Assisting behavioural science and evidence-based policy making using online machine tools).

ORCID iDs

Ezequiel Lopez-Lopez

Stefan M. Herzog

References

Lezaun

Soneryd

(2007). Consulting citizens: Technologies of elicitation and the mobility of publics. Public Understanding of Science, 16(3), 279–297. https://doi.org/10.1177/0963662507079371

Brown

M. B.

(2006). Survey article: Citizen panels and the concept of representation. Journal of Political Philosophy, 14(2), 203–225. https://doi.org/10.1111/j.1467-9760.2006.00245.x

Tvinnereim

Fløttum

Gjerstad

Ø.

Johannesson

M. P.

Nordø

Å. D.

(2017). Citizens’ preferences for tackling climate change: Quantitative and qualitative analyses of their freely formulated solutions. Global Environmental Change, 46, 34–41. https://doi.org/10.1016/j.gloenvcha.2017.06.005

Culver

Howe

(2004). Calling all citizens: The challenges of public consultation. Canadian Public Administration, 47(1), 52–75. https://doi.org/10.1111/j.1754-7121.2004.tb01970.x

Bernauer

Gampfer

(2015). How robust is public support for unilateral climate policy? Environmental Science & Policy, 54, 316–330. https://doi.org/10.1016/j.envsci.2015.07.010

Jenny

M. A.

Betsch

(2022). Large-scale behavioural data are key to climate policy. Nature Human Behaviour, 6(11), 1444–1447. https://doi.org/10.1038/s41562-022-01479-4

Cairney

Wellstead

(2021). COVID-19: Effective policymaking depends on trust in experts, politicians, and the public. Policy Design and Practice, 4(1), 1–14. https://doi.org/10.1080/25741292.2020.1837466

Pearse

(2020). Deliberation, citizen science and COVID-19. The Political Quarterly, 91(3), 571–577. https://doi.org/10.1111/1467-923X.12869

Mouter

Hernandez

J. I.

Itten

A. V.

(2021). Public participation in crisis policymaking: How 30,000 Dutch citizens advised their government on relaxing COVID-19 lockdown measures. PLoS ONE, 16(5), Article e0250614. https://doi.org/10.1371/journal.pone.0250614

10.

Gadarian

S. K.

Goodman

S. W.

Pepinsky

T. B.

(2021). Partisanship, health behavior, and policy attitudes in the early stages of the COVID-19 pandemic. PLoS ONE, 16(4), Article e0249596. https://doi.org/10.1371/journal.pone.0249596

11.

Seale

Heywood

A. E.

Leask

Sheel

Thomas

Durrheim

D. N.

Bolsewicz

Kaur

(2020). COVID-19 is rapidly changing: Examining public perceptions and behaviors in response to this evolving pandemic. PLoS ONE, 15(6), Article e0235112. https://doi.org/10.1371/journal.pone.0235112

12.

Seddig

Maskileyson

Davidov

Ajzen

Schmidt

(2022). Correlates of COVID-19 vaccination intentions: Attitudes, institutional trust, fear, conspiracy beliefs, and vaccine skepticism. Social Science & Medicine, 302, Article 114981. https://doi.org/10.1016/j.socscimed.2022.114981

13.

Berinsky

A. J.

(2017). Measuring public opinion with surveys. Annual Review of Political Science, 20, 309–329. https://doi.org/10.1146/annurev-polisci-101513-113724

14.

Krosnick

J. A.

(2017). Improving question design to maximize reliability and validity. In Vannette

D. L.

Krosnick

J. A.

(Eds.), The Palgrave handbook of survey research (pp. 95–102). Springer. https://doi.org/10.1007/978-3-319-54395-6_13

15.

Schwarz

(1999). Self-reports: How the questions shape the answers. American Psychologist, 54(2), 93–105. https://doi.org/10.1037/0003-066X.54.2.93

16.

Ceron

Negri

(2016). The “social side” of public policy: Monitoring online public opinion and its mobilization during the policy cycle. Policy & Internet, 8(2), 131–147. https://doi.org/10.1002/poi3.117

17.

Belkahla Driss

Mellouli

Trabelsi

. (2019). From citizens to government policy-makers: Social media data analysis. Government Information Quarterly, 36(3), 560–570. https://doi.org/10.1016/j.giq.2019.05.002

18.

Weaver-Hightower

M. B.

(2014). A mixed methods approach for identifying influence on public policy. Journal of Mixed Methods Research, 8(2), 115–138. https://doi.org/10.1177/1558689813490996

19.

Hendren

Luo

Q. E.

Pandey

S. K.

(2018). The state of mixed methods research in public administration and public policy. Public Administration Review, 78(6), 904–916. https://doi.org/10.1111/puar.12981

20.

Brinkman

(2017). The interview. In Denzin

N. K.

Lincoln

Y. S.

(Eds.), The SAGE handbook of qualitative research (5th ed., pp. 576–599). SAGE Publishing.

21.

Guest

Bunce

Johnson

(2006). How many interviews are enough? An experiment with data saturation and variability. Field Methods, 18(1), 59–82. https://doi.org/10.1177/1525822X05279903

22.

Boulianne

Loptson

Kahane

(2018). Citizen panels and opinion polls: Convergence and divergence in policy preferences. Journal of Deliberative Democracy, 14(1), Article 4. https://doi.org/10.16997/jdd.294

23.

Hubert

R. B.

Estevez

Maguitman

Janowski

(2020). Analyzing and visualizing government-citizen interactions on Twitter to support public policy-making. Digital Government: Research and Practice, 1(2), Article 15. https://doi.org/10.1145/3360001

24.

Kowalski

Esteve

Jankin Mikhaylov

(2020). Improving public services by mining citizen feedback: An application of natural language processing. Public Administration, 98(4), 1011–1026. https://doi.org/10.1111/padm.12656

25.

Laurino

De Deyne

Cabana

Kaczer

(2023). The pandemic in words: Tracking fast semantic changes via a large-scale word association task. Open Mind, 7, 221–239. https://doi.org/10.1162/opmi_a_00081

26.

Nørbech

(2024). Does policy context matter for citizen engagement in policymaking? Evidence from the European Commission’s public consultation regime. European Union Politics, 25(1), 130–150. https://doi.org/10.1177/14651165231208995

27.

Paulus

P. B.

Kenworthy

J. B.

(2019). Effective brainstorming. In Paulus

P. B.

Nijstad

B. A.

(Eds.), The Oxford handbook of group creativity and innovation (pp. 287–305). Oxford University Press. https://doi.org/10.1093/oxfordhb/9780190648077.013.17

28.

Stroebe

Nijstad

B. A.

Rietzschel

E. F.

(2010). Beyond productivity loss in brainstorming groups: The evolution of a question. Advances in Experimental Social Psychology, 43, 157–203. https://doi.org/10.1016/S0065-2601(10)43004-X

29.

Bommasani

Hudson

D. A.

Adeli

Altman

Arora

von Arx

Bernstein

M. S.

Bohg

Bosselut

Brunskill

Brynjolfsson

Buch

Card

Castellon

Chatterji

Chen

Creel

Davis

J. Q.

Demszky

. . . Liang

(2022). On the opportunities and risks of foundation models. arXiv. https://doi.org/10.48550/arXiv.2108.07258

30.

Thirunavukarasu

A. J.

Ting

D. S. J.

Elangovan

Gutierrez

Tan

T. F.

Ting

D. S. W.

(2023). Large language models in medicine. Nature Medicine, 29(8), 1930–1940. https://doi.org/10.1038/s41591-023-02448-8

31.

Demszky

Yang

Yeager

D. S.

Bryan

C. J.

Clapper

Chandhok

Eichstaedt

J. C.

Hecht

Jamieson

Johnson

Jones

Krettek-Cobb

Lai

JonesMitchell

Ong

D. C.

Dweck

C. S.

Gross

J. J.

Pennebaker

J. W.

(2023). Using large language models in psychology. Nature Reviews Psychology, 2(11), 688–701. https://doi.org/10.1038/s44159-023-00241-5

32.

Grossmann

Feinberg

Parker

D. C.

Christakis

N. A.

Tetlock

P. E.

Cunningham

W. A.

(2023). AI and the transformation of social science research. Science, 380(6650), 1108–1109. https://doi.org/10.1126/science.adi1778

33.

Lopez-Lopez

Herzog

S. M.

(2024). Characterizing similarities and differences in people’s views based on open-ended expressions. OSF. https://doi.org/10.17605/OSF.IO/3H6RN

34.

Lopez-Lopez

Herzog

S. M.

(2024). Characterizing similarities and differences in people’s views based on open-ended expressions [Manuscript in preparation].

35.

United Nations Development Programme. (n.d.). Sustainable development goals. Retrieved April 1, 2025, from https://www.undp.org/sustainable-development-goals

36.

Siew

C. S. Q.

Wulff

D. U.

Beckage

N. M.

Kenett

Y. N.

(2019). Cognitive network science: A review of research on cognition through the lens of network representations, processes, and dynamics. Complexity, 2019, Article 2108423. https://doi.org/10.1155/2019/2108423

37.

Griffiths

T. L.

Steyvers

Firl

(2007). Google and the mind: Predicting fluency with PageRank. Psychological Science, 18(12), 1069–1076. https://doi.org/10.1111/j.1467-9280.2007.02027.x

38.

Hills

T. T.

Todd

P. M.

Lazer

Redish

A. D.

Couzin

I. D.

(2015). Exploration versus exploitation in space, mind, and society. Trends in Cognitive Sciences, 19(1), 46–54. https://doi.org/10.1016/j.tics.2014.10.004

39.

Hills

T. T.

Todd

P. M.

Goldstone

R. L.

(2008). Search in external and internal spaces: Evidence for generalized cognitive search processes. Psychological Science, 19(8), 802–808. https://doi.org/10.1111/j.1467-9280.2008.02160.x

40.

Wulff

D. U.

Hills

T. T.

Mata

(2022). Structural differences in the semantic networks of younger and older adults. Scientific Reports, 12, Article 21459. https://doi.org/10.1038/s41598-022-11698-4

41.

Siew

C. S. Q.

Guru

(2022). Investigating the network structure of domain-specific knowledge using the semantic fluency task. Memory & Cognition, 51(3), 623–646. https://doi.org/10.3758/s13421-022-01314-1

42.

Wulff

D. U.

Mata

(2022). On the semantic representation of risk. Science Advances, 8(27), Article eabm1883. https://doi.org/10.1126/sciadv.abm1883

43.

Everett

M. G.

Valente

T. W.

(2016). Bridging, brokerage, and betweenness. Social Networks, 44, 202–208. https://doi.org/10.1016/j.socnet.2015.09.001

44.

Newman

(2018). Networks. Oxford University Press.

45.

Herzog

S. M.

Hills

T. T.

(2019). Mediation centrality in adversarial policy networks. Complexity, 2019, Article 918504. https://doi.org/10.1155/2019/1918504

46.

Dalege

van der Does

(2022). Using a cognitive network model of moral and social beliefs to explain belief change. Science Advances, 8(33), Article eabm0137. https://doi.org/10.1126/sciadv.abm0137

47.

van der Does

Stein

D. L.

Fedoroff

Galesic

. (2021). Science communication in light of moral and social concerns: Testing a statistical physics model of belief change. OSF. https://doi.org/10.31219/osf.io/zs7dq

48.

Fischer

Amelung

Said

(2019). The accuracy of German citizens’ confidence in their climate change knowledge. Nature Climate Change, 9(10), 776–780. https://doi.org/10.1038/s41558-019-0563-0

49.

Tummers

(2019). Public policy and behavior change. Public Administration Review, 79(6), 925–930. https://doi.org/10.1111/puar.13109

50.

Topp

Mair

Smillie

Cairney

(2018). Knowledge management for policy impact: The case of the European Commission’s Joint Research Centre. Palgrave Communications, 4(1), 1–10. https://doi.org/10.1057/s41599-018-0143-3

51.

Taylor

A. L.

Dessai

Bruine

Bruin

(2014). Public perception of climate risk and adaptation in the UK: A review of the literature. Climate Risk Management, 4–5, 1–16. https://doi.org/10.1016/j.crm.2014.09.001

52.

Fox

Stoett

(2016). Citizen participation in the UN sustainable development goals consultation process: Toward global democratic governance? Global Governance, 22(4), 555–573. https://doi.org/10.1163/19426720-02204007

53.

Lazer

Pentland

Adamic

Aral

Barabási

A.-L.

Brewer

Christakis

Contractor

Fowler

Gutmann

Jebara

King

Macy

Roy

Van Alstyne

(2009). Computational social science. Science, 323(5915), 721–723. https://doi.org/10.1126/science.1167742

54.

Van Ryzin

G. G

. (2008). Validity of an online panel approach to citizen surveys. Public Performance & Management Review, 32(2), 236–262. https://doi.org/10.2753/PMR1530-9576320203

55.

Gauvreau

C. L.

Wight

Subasri

Palmer

Hayeems

Croker

Abelson

Fraser

Bombard

Hepburn

C. M.

Wilson

M. G.

Denburg

(2023). Access to novel drugs and therapeutics for children and youth: Eliciting citizens’ values to inform public funding decisions. Health Expectations, 26(2), 715–727. https://doi.org/10.1111/hex.13697

56.

Herderich

Lasser

Galesic

Aroyehun

S. T.

Garcia

Garland

(2024). Measuring complex psychological and sociological constructs in large-scale text. PsyArXiv Preprints. https://doi.org/10.31234/osf.io/tzc9p

57.

Le Mens

Kovács

Hannan

M. T.

Pros

. (2023). Uncovering the semantics of concepts using GPT-4. Proceedings of the National Academy of Sciences, 120(49), Article e2309350120. https://doi.org/10.1073/pnas.2309350120

58.

Wang

Chen

Deng

Wen

You

Liu

(2024). Prompt engineering in consistency and reliability with the evidence-based guideline for LLMs. npj Digital Medicine, 7, Article 41. https://doi.org/10.1038/s41746-024-01029-4

59.

Chen

Zhang

Langrené

Zhu

(2024). Unleashing the potential of prompt engineering in large language models: A comprehensive review. arXiv. https://doi.org/10.48550/arXiv.2310.14735

60.

Gallegos

I. O.

Rossi

R. A.

Barrow

Tanjim

M. M.

Kim

Dernoncourt

Zhang

Ahmed

N. K.

(2024). Bias and fairness in large language models: A survey. Computational Linguistics, 50(3), 1097–1179. https://doi.org/10.1162/coli_a_00524

61.

Kotek

Dockum

Sun

(2024). Gender bias and stereotypes in large language models. In Bernstein

Savage

Bozzon

(Eds.), Proceedings of the ACM Collective Intelligence Conference (pp. 12–24). ACM. https://doi.org/10.1145/3582269.3615599

62.

Lee

Frieske

Ishii

Bang

Y. J.

Madotto

Fung

(2023). Survey of hallucination in natural language generation. ACM Computing Surveys, 55(12), Article 248. https://doi.org/10.1145/3571730

63.

Chang

Wang

Yang

Zhu

Chen

Wang

Zhang

Chang

P. S.

Yang

Xie

(2024). A survey on evaluation of large language models. ACM Transactions on Intelligent Systems and Technology, 15(3), Article 39. https://doi.org/10.1145/3641289

64.

Liu

Yao

Ton

J. F.

Zhang

Guo

Cheng

Klochkov

Taufig

M. F.

(2024). Trustworthy LLMs: A survey and guideline for evaluating large language models’ alignment. arXiv. https://doi.org/10.48550/arXiv.2308.05374

65.

Pescetelli

Denis

(2023). Methods to scale deliberative decision-making to billions of simultaneous people. PsyArXiv Preprints. https://doi.org/10.31234/osf.io/cxhke

66.

Burton

J. W.

Lopez-Lopez

Hechtlinger

Rahwan

Aeschbach

Bakker

M. A.

Becker

J. A.

Berditchevskaia

Berger

Brinkmann

Flek

Herzog

S. M.

Huang

Kapoor

Narayanan

Nussberger

A.-M.

Yasseri

Nickl

Almaatouq

. . . Hertwig

(2024). How large language models can reshape collective intelligence. Nature Human Behaviour, 8, 1643–1655. https://doi.org/10.1038/s41562-024-01959-9

67.

Cui

Yasseri

(2024). AI-enhanced collective intelligence. Patterns, 5(11), Article 101074. https://doi.org/10.1016/j.patter.2024.101074

68.

Argyle

L. P.

Busby

E. C.

Fulda

Gubler

J. R.

Rytting

Wingate

(2023). Out of one, many: Using language models to simulate human samples. Political Analysis, 31(3), 337–351. https://doi.org/10.1017/pan.2023.2

69.

Hewitt

Ashokkumar

Ghezae

Willer

(2024). Predicting results of social science experiments using large language models. https://samim.io/dl/Predicting%20results%20of%20social%20science%20experiments%20using%20large%20language%20models.pdf

70.

Dillion

Tandon

Gray

(2023). Can AI language models replace human participants? Trends in Cognitive Sciences, 27(7), 597–600. https://doi.org/10.1016/j.tics.2023.04.008

71.

Costello

T. H.

Pennycook

Rand

D. G.

(2024). Durably reducing conspiracy beliefs through dialogues with AI. Science, 385(6714), Article eadq1814. https://doi.org/10.1126/science.adq1814

72.

Tessler

M. H.

Bakker

M. A.

Jarrett

Sheahan

Chadwick

M. J.

Koster

Evans

Campbell-Gillingham

Collins

Parkes

D. C.

Botvinick

Summerfield

(2024). AI can help humans find common ground in democratic deliberation. Science, 386(6719), Article eadq2852. https://doi.org/10.1126/science.adq2852

73.

Fan

Ding

Ning

Wang

Yin

Chua

T.-S.

(2024). A survey on RAG meeting LLMs: Towards retrieval-augmented large language models. Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (pp. 6491–6501). ACM. https://doi.org/10.1145/3637528.3671470

74.

Liao

Q. V.

Wortman Vaughan

(2024). AI transparency in the age of LLMs: A human-centered research roadmap. Harvard Data Science Review (Special Issue 5). https://doi.org/10.1162/99608f92.8036d03b

75.

Zhao

Chen

Yang

Liu

Deng

Cai

Wang

Yin

(2024). Explainability for large language models: A survey. ACM Transactions on Intelligent Systems and Technology, 15(2), Article 20. https://doi.org/10.1145/3639372

76.

Huckle

Williams

(2025). Easy problems that LLMs get wrong. In: Arai

(Ed.), Advances in information and communication: Proceedings of the 2025 Future of Information and Communication Conference (FICC; Vol. 1, pp. 313–322). Springer. https://doi.org/10.1007/978-3-031-84457-7_19

77.

Ouyang

Jiang

Almeida

Wainwright

C. L.

Mishkin

Zhang

Agarwal

Slama

Ray

Schulman

Hilton

Kelton

Miller

Simens

Askell

Wellinder

Christiano

Leike

Lowe

(2022). Training language models to follow instructions with human feedback. Advances in Neural Information Processing Systems, 35, 27730–27744. https://proceedings.neurips.cc/paper_files/paper/2022/file/b1efde53be364a73914f58805a001731-Paper-Conference.pdf

78.

Smart

Hutchinson

Amugongo

L. M.

Dikker

Zito

Ebinama

Wudiri

Wang

van Liemt

Sedoc

Olojo

Uwakwe

Wornyo

Schmer-Galunder

Smith-Loud

(2024). Socially responsible data for large multilingual language models. arXiv. https://arxiv.org/abs/2409.05247

79.

Atari

Xue

M. J.

Park

P. S.

Blasi

Henrich

(2023). Which humans? PsyArXiv Preprints. https://doi.org/10.31234/osf.io/5b26t

80.

Mökander

Schuett

Kirk

H. R.

Floridi

(2023). Auditing large language models: A three-layered approach. AI and Ethics, 4(4), 1085–1115. https://doi.org/10.1007/s43681-023-00289-2

81.

Hussain

Binz

Mata

Wulff

D. U.

(2024). A tutorial on open-source large language models for behavioral science. Behavior Research Methods, 56(8), 8214–8237 https://doi.org/10.3758/s13428-024-02455-8

82.

Widder

D. G.

Whittaker

West

S. M.

(2024). Why ‘open’ AI systems are actually closed, and why this matters. Nature, 635(8040), 827–833. https://doi.org/10.1038/s41586-024-08141-1

83.

Meskó

Topol

E. J.

(2023). The imperative for regulatory oversight of large language models (or generative AI) in healthcare. npj Digital Medicine, 6, Article 120. https://doi.org/10.1038/s41746-023-00873-0

84.

Novelli

Casolari

Hacker

Spedicato

Floridi

(2024). Generative AI in EU law: Liability, privacy, intellectual property, and cybersecurity. Computer Law & Security Review, 55, Article 106066. https://doi.org/10.1016/j.clsr.2024.106066

85.

Hallsworth

(2023). A manifesto for applying behavioural science. Nature Human Behaviour, 7, 310–322. https://doi.org/10.1038/s41562-023-01555-3

86.

van Bavel

. (2020). Behavioural insights for EU policymaking. In Šucha

Sienkiewicz

(Eds.), Science for policy handbook (pp. 196–205). Elsevier. https://doi.org/10.1016/B978-0-12-822596-7.00017-6

87.

Herzog

S. M.

Hertwig

(2025). Boosting: Empowering citizens with behavioral science. Annual Review of Psychology, 76, 851–881. https://doi.org/10.1146/annurev-psych-020924-124753

88.

Peng

Xia

Yang

Sun

P. S.

(2022). A survey on text classification: From traditional to deep learning. ACM Transactions on Intelligent Systems and Technology, 13(2), Article 31. https://doi.org/10.1145/3495162

89.

Al-Moslmi

Gallofré Ocaña

Opdahl

A. L.

Veres

(2020). Named entity extraction for knowledge graphs: A literature overview. IEEE Access, 8, 32862–32881. https://doi.org/10.1109/ACCESS.2020.2973928

90.

Rapkin

Atkinson

T. M.

Schofield

Bochner

B. H.

(2019). Leveraging latent dirichlet allocation in processing free-text personal goals among patients undergoing bladder cancer surgery. Quality of Life Research, 28(6), 1441–1455. https://doi.org/10.1007/s11136-019-02132-w

91.

Stab

Gurevych

(2017). Parsing argumentation structures in persuasive essays. Computational Linguistics, 43(3), 619–659. https://doi.org/10.1162/COLI_a_00295

92.

Lawrence

Reed

(2020). Argument mining: A survey. Computational Linguistics, 45(4), 765–818. https://doi.org/10.1162/coli_a_00364

93.

Ravi

(2015). A survey on opinion mining and sentiment analysis: Tasks, approaches, and applications. Knowledge-Based Systems, 89, 14–46. https://doi.org/10.1016/j.knosys.2015.06.015

94.

Khurana

Koli

Khatter

Singh

(2023). Natural language processing: State of the art, current trends, and challenges. Multimedia Tools and Applications, 82(3), 3713–3744. https://doi.org/10.1007/s11042-022-13428-4

95.

Patil

Boit

Gudivada

Nandigam

(2023). A survey of text representation and embedding techniques in NLP. IEEE Access, 11, 36120–36146. https://doi.org/10.1109/ACCESS.2023.3266377

96.

Landauer

T. K.

Dutnais

S. T.

(1997). A solution to Plato’s problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychological Review, 104(2), 211–240. https://doi.org/10.1037/0033-295X.104.2.211

97.

Bullinaria

J. A.

Levy

J. P.

(2007). Extracting semantic representations from word co-occurrence statistics: A computational study. Behavior Research Methods, 39(3), 510–526. https://doi.org/10.3758/BF03193020

98.

Bhatia

Aka

(2022). Cognitive modeling with representations from large-scale digital data. Current Directions in Psychological Science, 31(3), 207–214. https://doi.org/10.1177/09637214211068113

99.

Rehder

Schreiner

Wolfe

M. B. W.

Laham

Landauer

T. K.

Kintsch

(1998). Using latent semantic analysis to assess knowledge: Some technical considerations. Discourse Processes, 25(2–3), 337–354. https://doi.org/10.1080/01638539809545031

100.

Shapiro

A. M.

McNamara

D. S.

(2000). The use of latent semantic analysis as a tool for the quantitative assessment of understanding and knowledge. Journal of Educational Computing Research, 22(1), 1–36. https://doi.org/10.2190/M811-G475-WKMX-X0JH

101.

Foltz

P. W.

Kintsch

Landauer

T. K.

(1998). The measurement of textual coherence with latent semantic analysis. Discourse Processes, 25(2–3), 285–307. https://doi.org/10.1080/01638539809545029

102.

Mikolov

Chen

Corrado

Dean

(2013). Efficient estimation of word representations in vector space. arXiv. https://doi.org/10.48550/arXiv.1301.3781

103.

Pennington

Socher

Manning

(2014). GloVe: Global vectors for word representation. In Moschitti

Pang

Daelemans

(Eds.), Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP; pp. 1532–1543). Association for Computational Linguistics. https://doi.org/10.3115/v1/D14-1162

104.

Bhatia

Walasek

(2023). Predicting implicit attitudes with natural language data. Proceedings of the National Academy of Sciences, 120(25), Article e2220726120. https://doi.org/10.1073/pnas.2220726120

105.

Lasser

Aroyehun

S. T.

Carrella

Simchon

Garcia

Lewandowsky

(2023). From alternative conceptions of honesty to alternative facts in communications by US politicians. Nature Human Behaviour, 7(12), 2140–2151. https://doi.org/10.1038/s41562-023-01691-w

106.

Garten

Hoover

Johnson

K. M.

Boghrati

Iskiwitch

Dehghani

(2018). Dictionaries and distributions: Combining expert knowledge and large-scale textual data content analysis. Behavior Research Methods, 50(1), 344–361. https://doi.org/10.3758/s13428-017-0875-9

107.

Tausczik

Y. R.

Pennebaker

J. W.

(2010). The psychological meaning of words: LIWC and computerized text analysis methods. Journal of Language and Social Psychology, 29(1), 24–54. https://doi.org/10.1177/0261927X09351676

108.

Vaswani

Shazeer

Parmar

Uszkoreit

Jones

Gomez

A. N.

Kaiser

Polosukhin

(2017). Attention is all you need. In Guyon

Von Luxburg

Bengio

Wallach

Fergus

Vishwanathan

Garnett

(Eds.), Advances in Neural Information Processing Systems (NIPS 2017; Vol. 30, pp. 5998–6008). Curran Associates.

109.

Devlin

Chang

M.-W.

Lee

Toutanova

(2019). BERT: Pre-training of deep bidirectional transformers for language understanding. In Burstein

Doran

Solorio

(Eds.), Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human language technologies (Vol. 1, pp. 4171–4186). Association for Computational Linguistics. https://doi.org/10.18653/v1/N19-1423

110.

Abdurahman

Zou

Ungar

Bhatia

(2024). A deep learning approach to personality assessment: Generalizing across items and expanding the reach of survey-based research. Journal of Personality and Social Psychology, 126(2), 312–331. https://doi.org/10.1037/pspp0000480

111.

Matero

Idnani

Son

Giorgi

Zamani

Limbachiya

Guntuki

S. C.

Schwartz

H. A.

(2019). Suicide risk assessment with multi-level dual-context language and BERT. In Niederhoffer

Hollingshead

Resnik

Loveys

(Eds.), Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology (pp. 39–44). Association for Computational Linguistics. https://doi.org/10.18653/v1/W19-3005

112.

Holford

Lopez-Lopez

Fasce

Karlsson

L. C.

Lewandowsky

(2024). Identifying the underlying psychological constructs from self-expressed anti-vaccination argumentation. Humanities and Social Sciences Communications, 11, Article 926. https://doi.org/10.1057/s41599-024-03416-4

113.

Zhao

W. X.

Zhou

Tang

Wang

Hou

Min

Zhang

Dong

Yang

Chen

Jiang

Ren

Tang

Liu

. . . Wen

J.-R.

(2023). A survey of large language models. arXiv. https://doi.org/10.48550/arXiv.2303.18223

114.

Mihalcea

Biester

Boyd

R. L.

Jin

Perez-Rosas

Wilson

Pennebaker

J. W.

(2024). How developments in natural language processing help us in understanding human behaviour. Nature Human Behaviour, 8(10), 1877–1889. https://doi.org/10.1038/s41562-024-01938-0

115.

Michelmann

Kumar

Norman

K. A.

Toneva

(2025). Large language models can segment narrative events similarly to humans. Behavioral Reearch Methods, 57(1), 39. https://doi.org/10.3758/s13428-024-02569-z

116.

Aeschbach

Mata

Wulff

D. U.

(2025). Mapping mental representations with free associations: A tutorial using the R package associatoR. Journal of Cognition, 8(1). https://doi.org/10.5334/joc.407

117.

Hardy

Sucholutsky

Thompson

Griffiths

(2023). Large language models meet cognitive science: LLMs as tools, models, and participants. In Goldwarter

Anggoro

F. K.

Hayes

B. K.

Ong

D. C.

(Eds.), Proceedings of the Annual Meeting of the Cognitive Science Society, 45. https://escholarship.org/uc/item/6dp9k2gz

118.

Bhatia

van Baal

S. T.

Walasek

(2025). Computational analysis of 100 K choice dilemmas: Decision attributes, trade-off structures, and model-based prediction. Proceedings of the National Academy of Sciences, 122(17). https://doi.org/10.1073/pnas.2406489122

119.

Bhatia

(2024). Exploring variability in risk taking with large language models. Journal of Experimental Psychology: General, 153(7), 1838–1860. https://doi.org/10.1037/xge0001607

120.

Sap

Jafarpour

Choi

Horvitz

(2022). Quantifying the narrative flow of imagined versus autobiographical stories. Proceedings of the National Academy of Sciences, 119(45), Article e2211715119. https://doi.org/10.1073/pnas.2211715119

121.

Peña

Morales

Fierrez

Serna

Ortega-Garcia

Puente

Í.

Córdova

(2024). Leveraging large language models for topic classification in the domain of public affairs. In Coustaty

Fornés

(Eds.), Document analysis and recognition: ICDAR 2023 Workshops (pp. 20–33). Springer. https://doi.org/10.1007/978-3-031-41498-5_2

122.

dos Santos

V. G.

Santos

G. L.

Lynn

Benatallah

. (2024). Identifying citizen-related issues from social media using LLM-based data augmentation. In Guizzardi

Santoro

Mouratidis

Soffer

(Eds.), Advanced information systems engineering (pp. 531–546). Springer. https://doi.org/10.1007/978-3-031-61057-8_31

123.

Kwon

O. H.

Bhargava

Radaideh

Cooper

Joynt

Radaideh

M. I.

(2024). Sentiment analysis of the United States public support of nuclear power on social media using large language models. Renewable and Sustainable Energy Reviews, 200, Article 114570. https://doi.org/10.1016/j.rser.2024.114570

124.

Bhatia

(2024). Advancing policy insights: Opinion data analysis and discourse structuring using LLMs [Unpublished master’s thesis]. University of Central Florida.

125.

Feuerriegel

Maarouf

Bär

Geissler

Schweisthal

Pröllochs

Robertson

C. E.

Rathje

Hartmann

Mohammad

S. M.

Netzer

Siegel

A. A.

Plank

Van Bavel

J. J.

(2025). Using natural language processing to analyse text data in behavioural science. Nature Reviews Psychology, 4, 96–111. https://doi.org/10.1038/s44159-024-00392-z

126.

Günther

Rinaldi

Marelli

(2019).Vector-space models of semantic representation from a cognitive perspective: A discussion of common misconceptions. Perspectives on Psychological Science, 14(6), 1006–1033 https://doi.org/10.1177/1745691619861372

Surfacing citizens’ policy perspectives at scale in the age of large language models

Abstract

Keywords

An Approach to Surfacing Citizens’ Policy Perspectives at Scale

Implications for Policymakers

Challenges and Future Research

Conclusion

A Brief History: LLMs as Research Tools in the Behavioral, Cognitive, & Social Sciences

Key Points for Policymakers

Footnotes

Author Note

Declaration of Conflicting Interests

Funding

ORCID iDs

References