Abstract
Alternative approaches to personality measurement, such as open-ended narrative-based assessments, have potential advantages for organizational research and practice. In this research, we investigate factors that affect valid application of natural language processing (NLP) for scoring open-ended personality assessments and when, how, and why such assessments capture personality-related variance. Using a large sample of responses to open-ended assessments, we found that convergence between NLP scores and self-report target scores increased as the degree of customization and the sophistication of the underlying model increased, with the worst psychometric performance occurring for zero-shot large language model (LLM) scores and the best for fine-tuned LLM scores. Nevertheless, all scoring methods exhibited evidence of validity. Additionally, when models were trained to predict direct evaluations of the narrative responses, correlations with target scores were large (M = .83). NLP scores also exhibited discriminant and criterion-related validity evidence. However, validity was contingent upon the methodological rigor employed in developing writing prompts. Prompts designed to elicit trait-relevant information outperformed generic prompts, and this occurred because trait-specific prompts increased the amount of trait-relevant information (i.e., narrative units), which was associated with enhanced convergence with target scores.