Can Recall Data Be Trusted? Evaluating Reliability of Interview Data on Traditional Multilingualism in Highland Daghestan

Abstract

In this article, we address the issue of reliability of quantitative data on multilingualism of the past obtained as recall data. More specifically, we investigate whether the interviewees’ assessments of the language repertoires of their late relatives (indirect data) provide results that are quantitatively similar to those obtained from the people of the same age range themselves (direct data). The empirical data we use come from an ongoing field study of traditional multilingualism in Daghestan (Russia). We trained machine learning models to see whether they can detect differences in indirect and direct data. We conclude that our indirect quantitative data on L2 other than Russian are essentially similar to direct data, while there may be a small but systematic underestimation when reporting others’ knowledge of Russian.

Get full access to this article

View all access options for this article.

References

Aikhenvald

A. Y.

1996. Areal diffusion in northwest Amazonia: The case of Tariana. Anthropological Linguistics 38:73–116.

de Nicola

Giné

. 2014. How accurate are recall data? Evidence from coastal India. Journal of Development Economics 106:52–65.

Dobrushina

2013. How to study multilingualism of the past: Investigating traditional contact situations in Daghestan. Journal of Sociolinguistics 17:376–93.

Dobrushina

Staferova

Belokon

(eds.). 2017. Atlas of Multilingualism in Dagestan Online. Linguistic Convergence Laboratory, HSE. (Available online at https://multidagestan.com).

Dobrushina

Daniel

Koryakov

. 2020. Atlas of multilingualism in Daghestan: A case study in diachronic sociolinguistics. Languages of the Caucasus 4:1–37.

Dobrushina

Daniel

Koryakov

. 2021. Languages and sociolinguistics of the Caucasus. In The Oxford handbook of languages of the Caucasus, eds. Polinksy

, 27–66. Oxford: Oxford University Press.

Dobrushina

Kozhukhar

Moroz

. 2019. Gendered multilingualism in highland Daghestan: Story of a loss. Journal of Multilingual and Multicultural Development 40:115–32.

Dobrushina

Moroz

. 2021. The speakers of minority languages are more multilingual. International Journal of Bilingualism. International Journal of Bilingualism 25:921–38.

Genko

2005. Tabasaransko–Russkij slovar’. (Tabasaran–Russian dictionary). Moscow: Academia.

10.

Hicks

R. E.

2017. From multilingualism to bilingualism: Changes in language use, language value, and social mobility among Engdewu speakers in the Solomon Islands. Journal of Multilingual and Multicultural Development 38:857–70.

11.

Hunter

J. D.

2007. Matplotlib: A 2D graphics environment. Computing in Science & Engineering 9:90–95.

12.

Jourdan

2007. Linguistic paths to urban self in postcolonial Solomon Islands. In Consequences of contact: Language ideologies and sociocultural transformations in Pacific societies, eds. Makihara

Schieffelin

B. B.

, 30–48. Oxford: Oxford University Press.

13.

Khanina

Meyerhoff

. 2018. A case-study in historical sociolinguistics beyond Europe: Reconstructing patterns of multilingualism in a linguistic community in Siberia. Journal of Historical Sociolinguistics 4:221–51.

14.

Kish

. 1959. Some statistical problems in research design. American Sociological Review 24:328–38.

15.

Koryakov

Yu.

Staferova

B., D. A.

Belokon

A. A.

Dobrushina

N. R.

. 2019. Population of Dagestan from data of different census years. Linguistic Convergence Laboratory, HSE University. (Available online at https://multidagestan.com/census).

16.

Lavrov

L I

. 1978. Istoriko–ètnografičeskie Očerki Kavkaza. (Historical ethnographic survey of the Caucaus). Leningrad: Nauka.

17.

Lüpke

2016. Uncovering small-scale multilingualism. Critical Multilingualism Studies 4:35–74.

18.

Matthew

L. E.

Romero

S. F.

. 2012. Nahuatl and Pipil in colonial Guatemala: A central American counterpoint. Ethnohistory 59:765–83.

19.

Mullan

Sills

Bauch

. 2014. The reliability of retrospective data on asset ownership as a measure of past household wealth. Field Methods 26:223–38.

20.

Nesvig

2012. Spanish men, Indigenous language, and informal interpreters in postcontact Mexico. Ethnohistory 59:739–64.

21.

Nichols

2013. The vertical archipelago: Adding the third dimension to linguistic geography. In Space in language and linguistics, eds. Auer

Hilpert

Stukenbrock

Szmrecsanyi

, 38–60. Berlin: De Gruyter.

22.

Pedregosa

Varoquaux

Gramfort

Michel

Thirion

Grisel

Blondel

Prettenhofer

Weiss

Dubourg

Vanderplas

Passos

Cournapeau

Brucker

Perrot

Duchesnay

. 2011. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research 12:2825–30.

23.

Philips

2011. Unexpected languages: Multilingualism and contact in eighteenth-and nineteenth-century North America. American Indian Culture and Research Journal 35:19–42.

24.

Rosenbaum

P. R.

Rubin

D. B.

. 1983. The central role of the propensity score in observational studies for causal effects. Biometrika 70:41–55.

25.

Schwaller

J. F.

2012. The expansion of Nahuatl as a lingua franca among priests in sixteenth-century Mexico. Ethnohistory 59:675–90.

26.

Sergeeva

1967. Archincy. Leningrad: Nauka.

27.

Thomason

S. G.

Kaufman

. 1992. Language contact, Creolization, and genetic linguistics. Los Angeles: University of California Press.

28.

Volkova

1974. Ètničeskij sostav naseleniâ severnogo Kavkaza v XVIII-načale XX veka. (Ethnic composition of the population of the northern Caucasus in 18th to early 20th century). Leningrad: Nauka.

29.

Wickham

2016. Ggplot2: Elegant graphics for data analysis. New York: Springer-Verlag.