
Abstract

This article describes a new database of English word-usage patterns. It improves on older efforts by including television and personal commentaries among the sources for the main corpus. More than a third of a million words were sampled from media and nonmedia sources and analyzed to produce a parsimonious listing of 6505 words (types) and their frequencies. The reliability and validity of this list were established in a variety of ways, and a computer program based on the list was used to analyze two different sets of data (an exploratory set and one representing an a priori hypothesis about word usage). A mere 206 different words accounted for 57% of all the words in the corpus, and 95% of the words in this small set had their roots in Middle English or some older form of English.
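
The coverage figure in the abstract (a small set of types accounting for a large share of all tokens) can be illustrated with a minimal sketch. This is not the article's program: the function names, the regular-expression tokenizer, and the sample sentence are assumptions chosen for illustration only.

```python
from collections import Counter
import re


def type_frequencies(text: str) -> Counter:
    """Count occurrences of each word type, ignoring case.

    The tokenizer (runs of letters and apostrophes) is an assumption;
    the article does not specify how words were segmented.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    return Counter(tokens)


def coverage_of_top_types(freqs: Counter, k: int) -> float:
    """Return the share of all tokens covered by the k most frequent types."""
    total = sum(freqs.values())
    if total == 0:
        return 0.0
    top = sum(count for _, count in freqs.most_common(k))
    return top / total


# Hypothetical sample text, not drawn from the article's corpus.
sample = "the cat and the dog saw the bird and the fish"
freqs = type_frequencies(sample)
share = coverage_of_top_types(freqs, 2)
print(f"{len(freqs)} types; top 2 cover {share:.0%} of tokens")
```

Applied to a full corpus, calling `coverage_of_top_types(freqs, 206)` would give the kind of proportion the abstract reports, although the exact value depends on tokenization choices the abstract does not describe.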



A Parsimonious Technique for the Analysis of Word-Use Patterns in English Texts and Transcripts
