Sage Journals: Discover world-class research

Abstract

Natural language processing (NLP) techniques are becoming increasingly popular in industrial and organizational psychology. One promising area for NLP-based applications is scale development; yet, while many possibilities exist, so far these applications have been restricted—mainly focusing on automated item generation. The current research expands this potential by illustrating an NLP-based approach to content analysis, which manually categorizes scale items by their measured constructs. In NLP, content analysis is performed as a text classification task whereby a model is trained to automatically assign scale items to the construct that they measure. Here, we present an approach to text classification—using state-of-the-art transformer models—that builds upon past approaches. We begin by introducing transformer models and their advantages over alternative methods. Next, we illustrate how to train a transformer to content analyze Big Five personality items. Then, we compare the models trained to human raters, finding that transformer models outperform human raters and several alternative models. Finally, we present practical considerations, limitations, and future research directions.

Keywords

personality scale development machine learning natural language processing text classification transformers

Get full access to this article

View all access options for this article.

References

Adhikari

Ram

Tang

Lin

(2019). DocBERT: Bert for document classification. ArXiv:1904.08398 [Cs] . http://arxiv.org/abs/1904.08398

Alberti

Lee

Collins

(2019). A BERT baseline for the natural questions. ArXiv:1901.08634 [Cs] . http://arxiv.org/abs/1901.08634

Allport

G. W.

(1937). Personality: A psychological interpretation (pp. xiv, 588). Holt.

Anderson

J. C.

Gerbing

D. W.

(1991). Predicting the performance of measures in a confirmatory factor analysis with a pretest assessment of their substantive validities. Journal of Applied Psychology, 76(5), 732-740. https://doi.org/10.1037/0021-9010.76.5.732

Ashton

M. C.

Lee

(2005). A defence of the lexical approach to the study of personality structure. European Journal of Personality, 19(1), 5-24. https://doi.org/10.1002/per.541

Ashton

M. C.

Lee

Perugini

Szarota

de Vries

R. E.

Di Blas

Boies

De Raad

(2004). A six-factor structure of personality-descriptive adjectives: Solutions from psycholexical studies in seven languages. Journal of Personality and Social Psychology, 86(2), 356-366. https://doi.org/10.1037/0022-3514.86.2.356

Azunre

(2021). Transfer learning for natural language processing . Manning Publications Co.

Bainbridge

T. F.

Ludeke

S. G.

Smillie

L. D.

(2022). Evaluating the Big Five as an organizing framework for commonly used psychological trait scales. Journal of Personality and Social Psychology, 122(4), 749-777. https://doi.org/10.1037/pspp0000395

Bansal

Jha

Munkhdalai

McCallum

(2020). Self-supervised meta-learning for few-shot natural language classification tasks. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) , 522-534. https://doi.org/10.18653/v1/2020.emnlp-main.38

10.

Bayer

Kaufhold

M.-A.

Reuter

(2022). A survey on data augmentation for text classification. ACM Computing Surveys, 55(7), 1-39. https://doi.org/10.1145/3544558

11.

Block

(1995). A contrarian view of the five-factor approach to personality description. Psychological Bulletin, 117(2), 187-215. https://doi.org/10.1037/0033-2909.117.2.187

12.

Block

(2010). The five-factor framing of personality and beyond: Some ruminations. Psychological Inquiry, 21(1), 2-25. https://doi.org/10.1080/10478401003596626

13.

Boyd

R. L.

Schwartz

H. A.

(2021). Natural language analysis and the psychology of verbal behavior: The past, present, and future states of the field. Journal of Language and Social Psychology, 40(1), 21-41. https://doi.org/10.1177/0261927X20967028

14.

Brown, T. B., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A., Agarwal, S., Herbert-Voss, A., Krueger, G., Henighan, T., Child, R., Ramesh, A., Ziegler, D. M., Wu, J., Winter, C., … D. Amodei (2020). Language models are few-shot learners. ArXiv:2005.14165 [Cs] . http://arxiv.org/abs/2005.14165

15.

Campion

M. C.

Campion

M. A.

Campion

E. D.

Reider

M. H.

(2016). Initial investigation into computer scoring of candidate essays for personnel selection. Journal of Applied Psychology, 101(7), 958-975. https://doi.org/10.1037/apl0000108

16.

Cer

Yang

Kong

Hua

Limtiaco

St. John

Constant

Guajardo-Cespedes

Yuan

Tar

Strope

Kurzweil

(2018). Universal Sentence Encoder for English. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing: System Demonstrations , 169-174. https://doi.org/10.18653/v1/D18-2029

17.

Chen

Zhong

Zha

Karypis

(2022). Meta-learning via language model in-context tuning (arXiv:2110.07814). arXiv. https://doi.org/10.48550/arXiv.2110.07814

18.

Chronopoulou

Baziotis

Potamianos

(2019). An embarrassingly simple approach for transfer learning from pretrained language models. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) , 2089-2095. https://doi.org/10.18653/v1/N19-1213

19.

Clark

L. A.

Watson

(2016). Constructing validity: Basic issues in objective scale development (p. 203). American Psychological Association. https://doi.org/10.1037/14805-012

20.

Clark

L. A.

Watson

(2019). Constructing validity: New developments in creating objective measuring instruments. Psychological Assessment, 31(12), 1412-1427. https://doi.org/10.1037/pas0000626

21.

Cloninger

C. R.

Svrakic

D. M.

Przybeck

T. R.

(1993). A psychobiological model of temperament and character. Archives of General Psychiatry, 50(12), 975-990. https://doi.org/10.1001/archpsyc.1993.01820240059008

22.

Colquitt

J. A.

Sabey

T. B.

Rodell

J. B.

Hill

E. T.

(2019). Content validation guidelines: Evaluation criteria for definitional correspondence and definitional distinctiveness. Journal of Applied Psychology, 104(10), 1243-1265. https://doi.org/10.1037/apl0000406

23.

Condon

(2019). Database of individual differences survey tools. Harvard Dataverse. https://doi.org/10.7910/DVN/T1NQ4V

24.

Condon

D. M.

(2018). The SAPA personality inventory: An empirically-derived, hierarchically-organized self-report personality assessment model. PsyArXiv. https://doi.org/10.31234/osf.io/sc4p9

25.

Condon

D. M.

Wood

Mõttus

Booth

Costantini

Greiff

Johnson

Lukaszewski

Murray

Revelle

Wright

A. G. C.

Ziegler

Zimmermann

(2020). Bottom-up construction of a personality taxonomy. European Journal of Psychological Assessment, 36(6), 923-934. https://doi.org/10.1027/1015-5759/a000626

26.

Conneau

Kiela

(2018, May). SentEval: An evaluation toolkit for universal sentence representations. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) . LREC 2018. https://www.aclweb.org/anthology/L18-1269

27.

Costa

P. T.

McCrae

R. R.

(1995). Domains and facets: Hierarchical personality assessment using the Revised NEO Personality Inventory. Journal of Personality Assessment, 64(1), 21-50. https://doi.org/10.1207/s15327752jpa6401_2

28.

Croce

Castellucci

Basili

(2020). GAN-BERT: Generative adversarial learning for robust text classification with a bunch of labeled examples. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics , 2114-2119. https://doi.org/10.18653/v1/2020.acl-main.191

29.

Cutler

Condon

D. M.

(2022). Deep lexical hypothesis: Identifying personality structure in natural language (arXiv:2203.02092). arXiv. https://doi.org/10.48550/arXiv.2203.02092

30.

De Boom

Van Canneyt

Demeester

Dhoedt

(2016). Representation learning for very short texts using weighted word embedding aggregation. Pattern Recognition Letters, 80, 150-156. https://doi.org/10.1016/j.patrec.2016.06.012

31.

de Rosa

G. H.

Papa

J. P.

(2021). A survey on text generation using generative adversarial networks. Pattern Recognition, 119, 108098. https://doi.org/10.1016/j.patcog.2021.108098

32.

DeGeest

D. S.

Schmidt

(2015). A rigorous test of the fit of the circumplex model to big five personality data: Theoretical and methodological issues and two large sample empirical tests. Multivariate Behavioral Research, 50(3), 350-364. https://doi.org/10.1080/00273171.2015.1004568

33.

Devlin

Chang

M.-W.

Lee

Toutanova

(2019). BERT: Pre-training of deep bidirectional transformers for language understanding. ArXiv:1810.04805 [Cs] . http://arxiv.org/abs/1810.04805

34.

DiStefano

Motl

R. W.

(2009). Personality correlates of method effects due to negatively worded items on the Rosenberg Self-Esteem Scale. Personality and Individual Differences, 46(3), 309-313. https://doi.org/10.1016/j.paid.2008.10.020

35.

Domingos

(2012). A few useful things to know about machine learning. Communications of the ACM, 55(10), 78-87. https://doi.org/10.1145/2347736.2347755

36.

Eichstaedt

J. C.

Kern

M. L.

Yaden

D. B.

Schwartz

H. A.

Giorgi

Park

Hagan

Tobolsky

Smith

L. K.

Buffone

Iwry

Seligman

Ungar

L. H.

(2020). Closed and open vocabulary approaches to text analysis: A review, quantitative comparison, and recommendations. PsyArXiv. https://doi.org/10.31234/osf.io/t52c6

37.

Elman

J. L.

(1990). Finding structure in time. Cognitive Science, 14(2), 179-211. https://doi.org/10.1207/s15516709cog1402_1

38.

Fleiss

J. L.

(1981). Balanced incomplete block designs for inter-rater reliability studies. Applied Psychological Measurement, 5(1), 105-112. https://doi.org/10.1177/014662168100500115

39.

Foa

E. B.

Huppert

J. D.

Leiberg

Langner

Kichic

Hajcak

Salkovskis

P. M.

(2002). The Obsessive-Compulsive Inventory: Development and validation of a short version. Psychological Assessment, 14(4), 485-496. https://doi.org/10.1037/1040-3590.14.4.485

40.

Geng

Huang

Chen

(2021). Recent advances in open set recognition: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(10), 3614-3631. https://doi.org/10.1109/TPAMI.2020.2981604

41.

Goldberg

L. R.

(1993). The structure of phenotypic personality traits. American Psychologist, 48(1), 26-34. https://doi.org/10.1037/0003-066X.48.1.26

42.

Goldberg

L. R.

Johnson

J. A.

Eber

H. W.

Hogan

Ashton

M. C.

Cloninger

C. R.

Gough

H. G.

(2006). The international personality item pool and the future of public-domain personality measures. Journal of Research in Personality, 40(1), 84-96. https://doi.org/10.1016/j.jrp.2005.08.007

43.

Goldberg

L. R.

Saucier

(2016). The Eugene-Springfield community sample: Information available from the research participants (Tech. Rep. No. 56-1). Oregon Research Institute.

44.

Goldberg

L. R.

Velicer

W. F.

(2006). Principles of exploratory factor analysis. Differentiating normal and abnormal personality (2nd ed., pp. 209-237). Springer Publishing Company.

45.

Goodfellow

I. J.

Mirza

Xiao

Courville

Bengio

(2015). An empirical investigation of catastrophic forgetting in gradient-based neural networks (arXiv:1312.6211). arXiv. https://doi.org/10.48550/arXiv.1312.6211

46.

Götz

Maertens

Linden

D. S.

van der . (2021). Let the algorithm speak: How to use neural networks for automatic item generation in psychological scale development. PsyArXiv. https://doi.org/10.31234/osf.io/m6s28

47.

Halder

Akbik

Krapac

Vollgraf

(2020). Task-aware representation of sentences for generic text classification. Proceedings of the 28th International Conference on Computational Linguistics, 3202-3213. https://doi.org/10.18653/v1/2020.coling-main.285

48.

Harris

Z. S.

(1954). Distributional structure. WORD, 10(2–3), 146-162. https://doi.org/10.1080/00437956.1954.11659520

49.

Hattie

(1985). Methodology review: Assessing unidimensionality of tests and items. Applied Psychological Measurement, 9(2), 139-164. https://doi.org/10.1177/014662168500900204

50.

Haynes

S. N.

Richard

D. C. S.

Kubany

E. S.

(1995). Content validity in psychological assessment: A functional approach to concepts and methods. Psychological Assessment, 7(3), 238-247. https://doi.org/10.1037/1040-3590.7.3.238

51.

Liu

Gao

Chen

(2021). DeBERTa: Decoding-enhanced BERT with disentangled attention. ArXiv:2006.03654 [Cs]. http://arxiv.org/abs/2006.03654

52.

Hendrycks

Liu

Wallace

Dziedzic

Krishnan

Song

(2020). Pretrained transformers improve out-of-distribution robustness. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics , 2744-2751. https://doi.org/10.18653/v1/2020.acl-main.244

53.

Hickman

Thapa

Tay

Cao

Srinivasan

(2020). Text preprocessing for text mining in organizational research: Review and recommendations. Organizational Research Methods, 114, 1-33. https://doi.org/10.1177/1094428120971683

54.

Hinkin

T. R.

(1998). A brief tutorial on the development of measures for use in survey questionnaires. Organizational Research Methods, 1(1), 104-121. https://doi.org/10.1177/109442819800100106

55.

Hinkin

T. R.

Tracey

J. B.

(1999). An analysis of variance approach to content validation. Organizational Research Methods, 2(2), 175-186. https://doi.org/10.1177/109442819922004

56.

Hochreiter

Schmidhuber

(1997). Long short-term memory. Neural Computation, 9(8), 1735-1780. https://doi.org/10.1162/neco.1997.9.8.1735

57.

Hofstee

W. K.

de Raad

Goldberg

L. R.

(1992). Integration of the Big Five and circumplex approaches to trait structure. Journal of Personality and Social Psychology, 63(1), 146-163. https://doi.org/10.1037/0022-3514.63.1.146

58.

Hommel

B. E.

Wollang

F.-J. M.

Kotova

Zacher

Schmukle

S. C.

(2022). Transformer-based deep neural language modeling for construct-specific automatic item generation. Psychometrika, 87(2), 749-772. https://doi.org/10.1007/s11336-021-09823-9

59.

Hopwood

C. J.

Donnellan

M. B.

(2010). How should the internal structure of personality inventories be evaluated? Personality and Social Psychology Review, 14(3), 332-346. https://doi.org/10.1177/1088868310361240

60.

Howard

Ruder

(2018). Universal language model fine-tuning for text classification. ArXiv:1801.06146 [Cs, Stat] . http://arxiv.org/abs/1801.06146

61.

Hsu

Chang

Lin

(2010). A practical guide to support vector classification [Technical Report]. National Taiwan University. https://www.csie.ntu.edu.tw/~cjlin/papers/guide/guide.pdf

62.

Ilmini

W. M. K. S.

Fernando

T. G. I.

(2017). Computational personality traits assessment: A review. 2017 IEEE International Conference on Industrial and Information Systems (ICIIS) , 1-6. https://doi.org/10.1109/ICIINFS.2017.8300416

63.

Jiao

Lissitz

R. W.

(2020). Application of artificial intelligence to assessment. IAP.

64.

John

O. P.

Angleitner

Ostendorf

(1988). The lexical approach to personality: A historical review of trait taxonomic research. European Journal of Personality, 2(3), 171-203. https://doi.org/10.1002/per.2410020302

65.

John

O. P.

Naumann

L. P.

Soto

C. J.

(2008). Paradigm shift to the integrative Big Five trait taxonomy: History, measurement, and conceptual issues. In Handbook of personality: Theory and research (3rd ed, pp. 114-158). The Guilford Press.

66.

Kalyan

K. S.

Rajasekharan

Sangeetha

(2021). AMMUS: A survey of transformer-based pretrained models in natural language processing. ArXiv:2108.05542 [Cs] . http://arxiv.org/abs/2108.05542

67.

Kennedy

Ashokkumar

Boyd

R. L.

Dehghani

(2021). Text analysis for psychology: Methods, principles, and practices. PsyArXiv. https://doi.org/10.31234/osf.io/h2b8t

68.

Keskar

N. S.

McCann

Varshney

L. R.

Xiong

Socher

(2019). CTRL: A conditional transformer language model for controllable generation. ArXiv:1909.05858 [Cs]. http://arxiv.org/abs/1909.05858

69.

Kobayashi

V. B.

Mol

S. T.

Berkers

H. A.

Kismihók

Den Hartog

D. N.

(2018a). Text mining in organizational research. Organizational Research Methods, 21(3), 733-765. https://doi.org/10.1177/1094428117722619

70.

Kobayashi

V. B.

Mol

S. T.

Berkers

H. A.

Kismihók

Den Hartog

D. N.

(2018b). Text classification for organizational researchers: A tutorial. Organizational Research Methods, 21(3), 766-799. https://doi.org/10.1177/1094428117719322

71.

Kowsari

Meimandi

K. J.

Heidarysafa

Mendu

Barnes

L. E.

Brown

D. E.

(2019). Text classification algorithms: A survey. Information, 10(4), 150. https://doi.org/10.3390/info10040150

72.

Krippendorff

(2018). Content analysis: An introduction to its methodology. Sage.

73.

Kuhn

(2021). caret: Classification and regression training [Manual]. https://CRAN.R-project.org/package=caret

74.

Lan

Chen

Goodman

Gimpel

Sharma

Soricut

(2020). ALBERT: A lite BERT for self-supervised learning of language representations. ArXiv:1909.11942 [Cs] . http://arxiv.org/abs/1909.11942

75.

Lee

Fyffe

Son

Jia

Yao

(2023). A paradigm shift from “human writing” to “machine generation” in personality test development: An application of state-of-the-art natural language processing. Journal of Business and Psychology, 38(1), 163-190. https://doi.org/10.1007/s10869-022-09864-6

76.

Levenson

(1981). Differentiating among internality, powerful others, and chance. In Lefcourt

H. M.

(Ed.), Research with the locus of control construct (pp. 15-63). Academic Press. https://doi.org/10.1016/B978-0-12-443201-7.50006-3

77.

Liang

Jiang

Zuo

Liu

Gao

Chen

Zhao

(2022). No parameters left behind: Sensitivity guided adaptive learning rate for training large transformer models (arXiv:2202.02664). arXiv. https://doi.org/10.48550/arXiv.2202.02664

78.

Liddy

(2001). Natural language processing. In Encyclopedia of library and information science (2nd ed.). Marcel Decker, Inc. https://surface.syr.edu/istpub/63

79.

Liu

Ott

Goyal

Joshi

Chen

Levy

Lewis

Zettlemoyer

Stoyanov

(2019). RoBERTa: A robustly optimized BERT pretraining approach. ArXiv:1907.11692 [Cs]. http://arxiv.org/abs/1907.11692

80.

Liu

Lin

Sun

(2020). Representation learning for natural language processing. Springer Singapore. https://doi.org/10.1007/978-981-15-5573-2

81.

Loevinger

(1957). Objective tests as instruments of psychological theory. Psychological Reports, 3, 635-694. https://doi.org/10.2466/PR0.3.7.635-694

82.

Marsh

H. W.

Lüdtke

Muthén

Asparouhov

Morin

A. J. S.

Trautwein

Nagengast

(2010). A new look at the Big Five factor structure through exploratory structural equation modeling. Psychological Assessment, 22(3), 471-491. https://doi.org/10.1037/a0019227

83.

McCrae

R. R.

Costa

P. T.

(1987). Validation of the five-factor model of personality across instruments and observers. Journal of Personality and Social Psychology, 52(1), 81-90. https://doi.org/10.1037/0022-3514.52.1.81

84.

Meehl

P. E.

Rosen

(1955). Antecedent probability and the efficiency of psychometric signs, patterns, or cutting scores. Psychological Bulletin, 52(3), 194-216. https://doi.org/10.1037/h0048070

85.

Mikolov

Chen

Corrado

Dean

(2013). Efficient estimation of word representations in vector space. ArXiv:1301.3781 [Cs] . http://arxiv.org/abs/1301.3781

86.

Min

Peng

Shoss

Yang

(2021). Using machine learning to investigate the public’s emotional responses to work from home during the COVID-19 pandemic. Journal of Applied Psychology, 106, 214-229. https://doi.org/10.1037/apl0000886

87.

Mirończuk

M. M.

Protasiewicz

(2018). A recent overview of the state-of-the-art elements of text classification. Expert Systems with Applications, 106, 36-54. https://doi.org/10.1016/j.eswa.2018.03.058

88.

Miyajiwala

Ladkat

Jagadale

Joshi

(2022). On sensitivity of deep learning based text classification algorithms to practical input perturbations. In Arai

(Ed.), Intelligent computing (pp. 613-626). Springer International Publishing. https://doi.org/10.1007/978-3-031-10464-0_42

89.

Nangia

Bowman

S. R.

(2019). Human vs. muppet: A conservative estimate of human performance on the glue benchmark. ArXiv:1905.10425 [Cs] . http://arxiv.org/abs/1905.10425

90.

Norman

W. T.

(1963). Toward an adequate taxonomy of personality attributes: Replicated factor structure in peer nomination personality ratings. The Journal of Abnormal and Social Psychology, 66(6), 574-583. https://doi.org/10.1037/h0040291

91.

Padhy

Nado

Ren

Liu

Snoek

Lakshminarayanan

(2020). Revisiting one-vs-all classifiers for predictive uncertainty and out-of-distribution detection in neural networks. ArXiv:2007.05134 [Cs, Stat] . http://arxiv.org/abs/2007.05134

92.

Pan

S. J.

Yang

(2010). A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering, 22(10), 1345-1359. https://doi.org/10.1109/TKDE.2009.191

93.

Pandey

S. K.

(2019). Applying natural language processing capabilities in computerized textual analysis to measure organizational culture. Organizational Research Methods, 22(3), 765-797. https://doi.org/10.1177/1094428117745648

94.

Paunonen

S. V.

Jackson

D. N.

(2000). What is beyond the big five? Plenty!. Journal of Personality, 68(5), 821-835. https://doi.org/10.1111/1467-6494.00117

95.

Peng

Chen

(2020). An empirical study of multi-task learning on BERT for biomedical text mining. Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing, 205-214. https://doi.org/10.18653/v1/2020.bionlp-1.22

96.

Pennington

Socher

Manning

(2014). Glove: Global vectors for word representation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 1532-1543. https://doi.org/10.3115/v1/D14-1162

97.

Peters

M. E.

Neumann

Iyyer

Gardner

Clark

Lee

Zettlemoyer

(2018). Deep contextualized word representations. ArXiv:1802.05365 [Cs]. http://arxiv.org/abs/1802.05365

98.

Peters

M. E.

Ruder

Smith

N. A.

(2019). To tune or not to tune? Adapting pretrained representations to diverse tasks (arXiv:1903.05987). arXiv. http://arxiv.org/abs/1903.05987

99.

Peterson

Seligman

M. E. P.

(2004). Character strengths and virtues: A handbook and classification (pp. xiv, 800). Oxford University Press.

100.

Phang

Févry

Bowman

S. R.

(2019). Sentence encoders on STILTS: Supplementary training on intermediate labeled-data tasks. ArXiv:1811.01088 [Cs] . http://arxiv.org/abs/1811.01088

101.

Pilehvar

M. T.

Camacho-Collados

(2020). Embeddings in natural language processing: Theory and advances in vector representations of meaning. Synthesis Lectures on Human Language Technologies, 13(4), 1-175. https://doi.org/10.2200/S01057ED1V01Y202009HLT047

102.

Preacher

K. J.

MacCallum

R. C.

(2003). Repairing Tom Swift’s electric factor analysis machine. Understanding Statistics, 2(1), 13-43. https://doi.org/10.1207/S15328031US0201_02

103.

Putka

D. J.

McCloy

R. A.

Diaz

(2008). Ill-structured measurement designs in organizational research: Implications for estimating interrater reliability. The Journal of Applied Psychology, 93(5), 959-981. https://doi.org/10.1037/0021-9010.93.5.959

104.

Radford

Narasimhan

Salimans

Sutskever

(2018). Improving language understanding by generative pre-training (pp. 1-12) [Technical Report]. OpenAI. https://www.cs.ubc.ca/~amuham01/LING530/papers/radford2018improving.pdf

105.

Rahman

Khan

Porikli

(2018). A unified approach for conventional zero-shot, generalized zero-shot, and few-shot learning. IEEE Transactions on Image Processing, 27(11), 5652-5667. https://doi.org/10.1109/TIP.2018.2861573

106.

Reimers

Gurevych

(2019). Sentence-BERT: Sentence embeddings using Siamese BERT-networks. ArXiv:1908.10084 [Cs] . http://arxiv.org/abs/1908.10084

107.

Revelle

(2021). psych: Procedures for psychological, psychometric, and personality research [Manual]. https://CRAN.R-project.org/package=psych

108.

Roady

Hayes

T. L.

Kemker

Gonzales

Kanan

(2020). Are open set classification methods effective on large-scale datasets? PLoS ONE, 15(9), e0238302. https://doi.org/10.1371/journal.pone.0238302

109.

Rosellini

A. J.

Brown

T. A.

(2021). Developing and validating clinical questionnaires. Annual Review of Clinical Psychology, 17, 55-81. https://doi.org/10.1146/annurev-clinpsy-081219-115343

110.

Ruder

(2017). Transfer learning—Machine learning’s next frontier . http://ruder.io/transfer-learning/

111.

Ruder

(2021). Recent advances in language model fine-tuning . http://ruder.io/recent-advances-lm-fine-tuning

112.

Rudkowsky

Haselmayer

Wastian

Jenny

Emrich

Sedlmair

(2018). More than bags of words: Sentiment analysis with word embeddings. Communication Methods and Measures, 12(2–3), 140-157. https://doi.org/10.1080/19312458.2018.1455817

113.

Russell

D. W.

(2002). In search of underlying dimensions: The use (and abuse) of factor analysis in personality and social psychology bulletin. Personality and Social Psychology Bulletin, 28(12), 1629-1646. https://doi.org/10.1177/014616702237645

114.

Saarikoski

Joutsijoki

Jarvelin

Laurikkala

Juhola

(2015). On the influence of training data quality on text document classification using machine learning methods. International Journal of Knowledge Engineering and Data Mining, 3(2), 143-169. https://doi.org/10.1504/IJKEDM.2015.071284

115.

Sanh

Debut

Chaumond

Wolf

(2020). DistilBERT, a distilled version of BERT: Smaller, faster, cheaper and lighter. ArXiv:1910.01108 [Cs] . http://arxiv.org/abs/1910.01108

116.

Saucier

(1997). Effects of variable selection on the factor structure of person descriptors. Journal of Personality and Social Psychology, 73(6), 1296-1312. https://doi.org/10.1037/0022-3514.73.6.1296

117.

Saucier

Goldberg

L. R.

(1998). What is beyond the big five? Journal of Personality, 66, 495-524. https://doi.org/10.1111/1467-6494.00022

118.

Scao

T. L.

Rush

A. M.

(2021). How many data points is a prompt worth? ArXiv:2103.08493 [Cs]. http://arxiv.org/abs/2103.08493

119.

Schick

Schütze

(2021). It’s not just size that matters: Small language models are also few-shot learners. ArXiv:2009.07118 [Cs] . http://arxiv.org/abs/2009.07118

120.

Schwaba

Rhemtulla

Hopwood

C. J.

Bleidorn

(2020). A facet atlas: Visualizing networks that describe the blends, cores, and peripheries of personality structure. PLoS ONE, 15(7), 0236893. https://doi.org/10.1371/journal.pone.0236893

121.

Short

J. C.

Broberg

J. C.

Cogliser

C. C.

Brigham

K. H.

(2010). Construct validation using computer-aided text analysis (CATA): An illustration using entrepreneurial orientation. Organizational Research Methods, 13(2), 320-347. https://doi.org/10.1177/1094428109335949

122.

Short

J. C.

McKenny

A. F.

Reid

S. W.

(2018). More than words? Computer-aided text analysis in organizational behavior and psychology research. Annual Review of Organizational Psychology and Organizational Behavior, 5(1), 415-435. https://doi.org/10.1146/annurev-orgpsych-032117-104622

123.

Shrestha

Y. R.

V. F.

Puranam

von Krogh

(2021). Algorithm supported induction for building theory: How can we use prediction models to theorize? Organization Science, 32(3), 856-880. https://doi.org/10.1287/orsc.2020.1382

124.

Smith

R. W.

Min

M. A.

Haynes

N. J.

Clark

M. A.

(2022). A content validation of work passion: Was the passion ever there? Journal of Business and Psychology, 38(1), 191-213. https://doi.org/10.1007/s10869-022-09807-1

125.

Song

Salcianu

Song

Dopson

Zhou

(2021). Fast wordpiece tokenization. ArXiv:2012.15524 [Cs] . http://arxiv.org/abs/2012.15524

126.

Speer

A. B.

(2021). Scoring dimension-level job performance from narrative comments: Validity and generalizability when using natural language processing. Organizational Research Methods, 24(3), 572-594. https://doi.org/10.1177/1094428120930815

127.

Sun

Qiu

Huang

(2020). How to fine-tune BERT for text classification? ArXiv:1905.05583 [Cs]. http://arxiv.org/abs/1905.05583

128.

Tellegen

Waller

N. G.

(2008). Exploring personality through test construction: Development of the multidimensional personality questionnaire. In The SAGE handbook of personality theory and assessment, vol 2: Personality measurement and testing (pp. 261-292). Sage Publications, Inc. https://doi.org/10.4135/9781849200479.n13

129.

Vabalas

Gowen

Poliakoff

Casson

A. J.

(2019). Machine learning algorithm validation with a limited sample size. PLoS ONE, 14(11), 0224365. https://doi.org/10.1371/journal.pone.0224365

130.

Vaswani

Shazeer

Parmar

Uszkoreit

Jones

Gomez

A. N.

Kaiser

Polosukhin

(2017). Attention is all you need. ArXiv:1706.03762 [Cs]. http://arxiv.org/abs/1706.03762

131.

Vodrahalli

Gerstenberg

Zou

(2022). Uncalibrated models can improve human-AI collaboration. ArXiv:2202.05983 [Cs] . http://arxiv.org/abs/2202.05983

132.

von Davier

(2018). Automated item generation with recurrent neural networks. Psychometrika, 83(4), 847-857. https://doi.org/10.1007/s11336-018-9608-y

133.

Wang

Pruksachatkun

Nangia

Singh

Michael

Hill

Levy

Bowman

S. R.

(2019). SuperGLUE: A stickier benchmark for general-purpose language understanding systems. ArXiv Preprint 1905.00537 .

134.

Wang

Fang

Khabsa

Mao

(2021). Entailment as Few-Shot Learner. ArXiv:2104.14690 [Cs]. http://arxiv.org/abs/2104.14690

135.

Wilson

E. B.

(1927). Probable inference, the law of succession, and statistical inference. Journal of the American Statistical Association, 22(158), 209-212. https://doi.org/10.2307/2276774

136.

Wolf, T., Debut, L., Sanh, V., Chaumond, J., Delangue, C., Moi, A., Cistac, P., Rault, T., Louf, R., Funtowicz, M., Davison, J., Shleifer, S., von Platen, P., Ma, C., Jernite, Y., Plu, J., Xu, C., Scao, T. L., Gugger, S., … (2020). HuggingFace’s transformers: State-of-the-art natural language processing. ArXiv:1910.03771 [Cs]. http://arxiv.org/abs/1910.03771

137.

Worthington

R. L.

Whittaker

T. A.

(2006). Scale development research: A content analysis and recommendations for best practices. The Counseling Psychologist, 34(6), 806-838. https://doi.org/10.1177/0011000006288127

138.

Yang

Dai

Yang

Carbonell

Salakhutdinov

Q. V.

(2020). XLNET: Generalized autoregressive pretraining for language understanding. ArXiv:1906.08237 [Cs]. http://arxiv.org/abs/1906.08237

139.

Yin

Rajani

N. F.

Radev

Socher

Xiong

(2020). Universal natural language processing with limited annotations: Try few-shot textual entailment as a start. ArXiv:2010.02584 [Cs] . http://arxiv.org/abs/2010.02584

140.

Zellers

Bisk

Schwartz

Choi

(2018). SWAG: A large-scale adversarial dataset for grounded commonsense inference. ArXiv:1808.05326 [Cs] . http://arxiv.org/abs/1808.05326

141.

Zhang

Katiyar

Weinberger

K. Q.

Artzi

(2021). Revisiting few-sample BERT fine-tuning. ArXiv:2006.05987 [Cs] . http://arxiv.org/abs/2006.05987

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.03 MB

“Transforming” Personality Scale Development: Illustrating the Potential of State-of-the-Art Natural Language Processing

Abstract

Keywords

Get full access to this article

References

Supplementary Material