Abstract
A latent semantic analysis (LSA)-based automated summary assessment system is described and applied to a real learning-from-text task in a distance education context. We examine the use of automated content, plagiarism, and text coherence measures, together with average word weights, and their impact on predicting human judges' summary scores. A first regression analysis showed that interparagraph coherence is independent of superficial text variables, supporting its inclusion in a general regression model alongside the content and plagiarism measures. The final regression model explains a considerable proportion of the variability in human judgments of the summaries. Finally, we discuss several methodological implications and further applications of the automated summary scoring technique developed in this study.
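As a minimal sketch of the kind of LSA-based content measure the abstract refers to (not the authors' implementation; the function name `lsa_content_score`, the TF-IDF weighting, and the dimensionality parameter `n_dims` are illustrative assumptions), a summary can be scored by its cosine similarity to the source text after both are projected into a reduced semantic space:

```python
# Sketch only: LSA content scoring via truncated SVD over a term-document
# matrix, with cosine similarity between each summary and the source text.
# TF-IDF weighting and n_dims are assumptions, not the paper's settings.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.decomposition import TruncatedSVD
from sklearn.metrics.pairwise import cosine_similarity

def lsa_content_score(source_text, summaries, n_dims=100):
    """Score each summary by cosine similarity to the source in LSA space."""
    docs = [source_text] + summaries
    # Build a weighted term-document matrix (TF-IDF is one common choice
    # of word weighting; the paper also considers average word weights).
    tdm = TfidfVectorizer(stop_words="english").fit_transform(docs)
    # Reduce to a latent semantic space; cap the dimensionality by the
    # matrix rank so the decomposition stays well defined.
    k = min(n_dims, tdm.shape[1] - 1, len(docs) - 1)
    lsa = TruncatedSVD(n_components=k).fit_transform(tdm)
    # Cosine similarity of each summary vector to the source vector.
    return cosine_similarity(lsa[1:], lsa[:1]).ravel()

scores = lsa_content_score(
    source_text="The full instructional text the students read...",
    summaries=["Student summary one...", "Student summary two..."],
)
```

Scores of this kind would then serve as one predictor, alongside plagiarism and interparagraph coherence measures, in a regression model of human summary ratings.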
