Argument component classification in academic writings

Abstract

Argumentation in academic writing is a challenging task required to communicate clear ideas. Exposed ideas have to be supported by reasoned arguments. Arguments are composed of components such as premises and conclusions. In this paper, we present an approach to classify argumentative components using language models and machine learning algorithms on a new corpus of academic theses and research proposals. We explore the use of lexical, syntactic, semantic and indicator features to tackle this task. We found that lexical features provide the best efficacy for the classification. For language models, the best features were syntactical. But our experiments showed that a document occurrence representation with unigrams achieved the best accuracy. We also tested the conclusions about the representation and classifier on theses according to their study level (undergraduate, master, and doctoral). We analyzed the information gain of features and found patterns that are part of argumentative markers.

Keywords

Computer-assisted argument analysis academic writing argumentation studies argument components annotated theses corpus

Get full access to this article

View all access options for this article.

References

Al-Rfou

, Perozzi

and Skiena

, Polyglot: Distributed word representations for multilingual nlp, In Proceedings of the Seventeenth Conference on Computational Natural Language Learning , Sofia, Bulgaria. Association for Computational Linguistics (2013), 183–192.

Bird

, Klein

and Loper

, Natural Language Processing with Python. O’Reilly Media, Inc., 1, 2009.

Briz

, Pons

and Portolés

, Diccionario de partículas discursivas del español. In El diccionario como puente entre las lenguas y culturas del mundo. Actas del II Congreso Internacional de Lexicografía Hispánica, Alicante, Biblioteca Virtual Cervantes, Biblioteca Virtual Miguel de Cervantes, Alicante, (2008), 217–227.

Cabrera

J.M.

, Escalante

H.J.

and Montes-y

, Gómez, Distributional term representations for short-text categorization, In International Conference on Computational Linguistics and Intelligent Text Processing (2013), 335–346. Springer.

Cabrio

and Villata

, Combining textual entailment and argumentation theory for supporting online debates interactions, In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2, ACL ’12 , Association for Computational Linguistics (2012), 208–212.

Capaldi

, Cómo ganar una discusiónGedisa, 1990.

Carstens

and Toni

, Towards relation based argumentation mining, In Proceedings of the 2ndWorkshop on Argumentation MiningDenver, CO, Association for Computational Linguistics, (2015), 29–34.

Daxenberger

, Eger

, Habernal

, Stab

and Gurevych

, What is the essence of a claim? cross-domain claim identification, Association for Computational Linguistics, In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (2017), 2045–2056.

Fierro

, Fuentes

, Pérez

and Quezada

, 200k+ crowdsourced political arguments for a new chilean constitution, In Proceedings of the 4th Workshop on Argument Mining , Copenhagen, Denmark (2017) 1–10. Association for Computational Linguistics.

10.

Freeman

J.B.

, Argument Structure: Representation and Theory, Springer2011.

11.

García-Gorrostieta

J.M.

, López-López

and González-López

, Towards automatic assessment of argumentation in theses justifications. In Lavoué

É.

, Drachsler

, Verbert

, Broisin

and Pérez-Sanagustín

, editors, Data Driven Approaches in Digital Education: 12th European Conference on Technology Enhanced LearningEC-TEL Tallinn, Estonia, Proceedings, Springer International Publishing, (2017), 54–66.

12.

González-López

and López-López

, Colección de tesis y propuesta de investigación en tics: Un recurso para su análisis y estudio,n Educativa, In XIII Congreso Nacional de Investigaci’o (2015), 1–15.

13.

Gorrostieta

J.M.G.

and López-López

, Argumentation identification for academic support in undergraduate writings. In Verbert

, Sharples

and Klobučar

, editors, Adaptive and Adaptable Learning: 11th European Conference on Technology Enhanced LearningEC-TEL Lyon, France, Proceedings, Springer International Publishing (2016), 98–109.

14.

Goudas

, Louizos

, Petasis

and Karkaletsis

, Argument extraction from news, blogs, and social media, In Hellenic Conference on Artificial Intelligence (2014) 287–299. Springer.

15.

Green

, Identifying argumentation schemes in genetics research articles, Association for Computational Linguistics, In Proceedings of the 2nd Workshop on Argumentation Mining (2015), 12–21.

16.

Habernal

and Gurevych

, Argumentation mining in user-generated web discourse, Computational Linguistics43(1) (2017), 125–179.

17.

Hall

, Frank

, Holmes

, Pfahringer

, Reutemann

and Witten

I.H.

, The weka data mining software: An update, ACM SIGKDD Explorations Newsletter11(1) (2009), 10–18.

18.

Jurafsky

and Martin

J.H.

, Speech and Language Processing2, Prentice-Hall Inc., Upper Saddle River, NJ, USA, 2009.

19.

Kirschner

, Eckle-Kohler

and Gurevych

, Linking the thoughts: Analysis of argumentation structures in scientific publications, Association for Computational Linguistics, In Proceedings of the 2nd Workshop on Argumentation Mining (2015), 1–11.

20.

Landis

J.R.

and Koch

G.G.

, The measurement of observer agreement for categorical data, Biometrics33(1) (1977), 159–174.

21.

Lindsay

, Scientific Writing, CSIRO Publishing, 2011.

22.

López Ferrero

and García

, Negroni La argumentación en los géneros académicos,In Actas del Congreso Internacional La Argumentaci’onUniversidad de Buenos Aires (2003), 1121–1129.

23.

Mochales

and Moens

M.-F.

, Study on the structure of argumentation in case law, In Proceedings of the 2008 Conference on Legal Knowledge and Information Systems , IOS Press (2008), 11–20.

24.

Mochales

and Moens

M.-F.

, Argumentation mining, Artificial Intelligence and Law19(1) (2011), 1–22.

25.

Moens

M.-F.

, Boiy

, Palau

R.M.

and Reed

, Automatic detection of arguments in legal texts, ACM, In Proceedings of the 11th International Conference on Artificial Intelligence and Law (2007), 225–230.

26.

Nguyen

and Litman

, Extracting argument and domain words for identifying argument components in texts, Association for Computational Linguistics, In Proceedings of the 2nd Workshop on Argumentation Mining (2015), 22–28.

27.

Padró

and Stanilovsky

, Freeling 3.0: Towards wider multilinguality, Istanbul, Turkey, European Language Resources Association (ELRA), In Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC’12) (2012), 2473–2479.

28.

Pedregosa

, Varoquaux

, Gramfort

, Michel

, Thirion

, Grisel

, Blondel

, Prettenhofer

, Weiss

, Dubourg

, Vanderplas

, Passos

and Cournapeau

, Brucher

, Perrot

and Duchesnay

, Scikit-learn: Machine learning in python, Journal of Machine Learning Research12(Oct) (2011), 2825–2830.

29.

Persing

and Ng

, Modeling argument strength in student essays, Beijing, China, Association for Computational Linguistics, In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (2015), 543–552.

30.

Rodríguez

C.F.

, Diccionario de conectores y operadores del español, Arco/Libros, 2009.

31.

Sánchez

, Avendaño, Los conectores discursivos: Su empleo en redacciones de estudiantes universitarios costarricenses,a y Lingüística de la Universidad de Costa Rica, Revista de Filolog’ı31(2) (2005).

32.

Sardianos

, Katakis

I.M.

, Petasis

and Karkaletsis

, Argument extraction from news, Association for Computational Linguistics, In Proceedings of the 2nd Workshop on Argumentation Mining (2015), 56–66.

33.

Stab

and Gurevych

, Identifying argumentative discourse structures in persuasive essays, Association for Computational Linguistics, In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) (2014), 46–56.

34.

Stab

C.M.E.

, Argumentative Writing Support by means of Natural Language Processing, PhD thesis, Technische Universität Darmstadt, 2017.

35.

Stolcke

, Srilm – an extensible language modeling toolkit, In Proceedings of the 7TH International Conference on Spoken Language Processing (2002), 901–904.

36.

Toulmin

S.E.

, The uses of argument, Cambridge University Press2003.

37.

Villalba

M.P.G.

and Saint-Dizier

, Some facets of argument mining for opinion analysis, In Proceedings of the 2012 International Conference on Computational Models of Argument (2012), 23–34.

38.

Walton

, Fundamentals of critical argumentation, Cambridge University Press, 2005.

39.

Walton

, Reed

and Macagno

, Argumentation schemes, Cambridge University Press2008.

40.

Wyner

and Bench-Capon

, Towards an extensible argumentation system, In Proceedings of the Ninth European Conferences on Symbolic and Quantitative Approaches to Reasoning with Uncertainty (2007) 283–294. Springer.

41.

Wyner

, Mochales-Palau

, Moens

M.-F.

and Milward

, Approaches to Text Mining Arguments from Legal Cases, Springer, 2010, pp. 60–79 .