Abstract
The field of machine learning has recently made substantial progress in reducing the amount of labeled training data required to build new models. These “cheaper” learning techniques hold considerable potential for the social sciences, where the development of large labeled training datasets is often a major practical impediment. In this article we review three “cheap” techniques developed in recent years: weak supervision, transfer learning, and prompt engineering. For the latter, we also review the particular case of zero-shot prompting of large language models. For each technique, we provide a guide to how it works, and we demonstrate its application and assess its systematic biases across two different, realistic social science tasks paired with three different dataset compositions. We show good performance for all techniques, and we demonstrate that prompting of large language models can achieve high accuracy at very low cost, although its biases must be taken into account.
