In panel studies on sensitive topics, respondent-generated identification codes are often used to link records across surveys. However, usually a substantial number of cases are lost due to the codes. These losses may cause biased estimates. Using more components and linking the codes by the Levenshtein string distance function will reduce the losses. In a simulation study and two field experiments, the proposed procedure outperforms the methods previously applied.
Alonzo, T.A., and M.S. Pepe.2007. Development and evaluation of classifiers . In Topics in biostatistics, ed. W. T. Ambrosius, 89-116. Totowa, NJ: Humana .
2.
Barnea, Z., M. Teichman, and G. Rahav.1992. Personality, cognitive, and interpersonal factors in adolescent substance use: A longitudinal test of an integrative model. Journal of Youth and Adolescence21:187-201.
3.
Bloor, M.2005. Population estimation without censuses or surveys: A discussion of mark-recapture methods illustrated by results from three studies . Sociology39:121-38.
4.
Bourgeois, F., and J.C. Lassalle.1971. An extension of the Munkres algorithm for the assignment problem to rectangular matrices. Communications of the ACM14:802-4.
5.
Carifio, J., and R. Biron.1978. Collecting sensitive data anonymously: The CDRGP technique. Journal of Alcohol and Drug Education23:47-66.
6.
---. 1982. Collecting sensitive data anonymously: Further findings on the CDRGP technique. Journal of Alcohol and Drug Education27:38-70.
7.
Damrosch, S.P.1986. Ensuring anonymity by use of subject-generated identification codes. Research in Nursing and Health9:61-63.
8.
DiIorio, C., J.E. Soet, D. VanMarter, T.M. Woodring, and W.N. Dudley.2000. An evaluation of a self-generated identification code. Research in Nursing and Health23:167-74.
9.
El-Khorazaty, M.N., P. B. Imrey, G.G. Koch, and H.B. Wells.1977. Estimating the total number of events with data from multiple-record systems: A review of methodological strategies . International Statistical Review45:129-57.
10.
Faden, V.B., N.L. Day, M. Windle, J.W. Grube, B.S.G. Molina, W. E. Pelham, E.M.Gnagy, T.K.Wilson, K.M. Jackson, and K.J. Sher.2004. Collecting longitudinal data through childhood, adolescence, and young adulthood: Methodological challenges. Alcoholism: Clinical and Experimental Research28:330-40.
11.
Foxcroft, D.R., and G. Lowe.1995. Adolescent drinking, smoking and other substance use involvement: Links with perceived family life. Journal of Adolescence18:159-77.
12.
Galanti, M.R., R. Siliquini, L. Cuomo, J.C. Melero, M. Panella, and F. Faggiano.2007. Testing anonymous link procedures for follow-up of adolescents in a school-based trial: The EU-DAP pilot study . Preventive Medicine44:174-7.
13.
Griffin, J.R., R.C. Holliday, E. Frazier, and R.L. Braithwaite . 2009. The BRAVE (building resiliency and vocational excellence) program: Evaluation findings for a career-oriented substance abuse and violence preventive interventionJournal of Health Care for the Poor and Underserved20:798-816.
14.
Groves, W.E.1974. Patterns of college students drug use and life styles . In Drug use: Epidemiological and sociological approaches , eds. E. Josephson and E. E. Carroll, 241-75. Washington: Hemisphere.
15.
Grube, J.W., and M. Morgan.1990. Attitude-social support interactions: Contingent consistency effects in the prediction of adolescent smoking, drinking, and drug use. Social Psychology Quarterly53:329-39.
16.
Grube, J.W., M. Morgan, and K.A. Kearney.1989. Using self-generated identification codes to match questionnaires in panel studies of adolescent substance use . Addictive Behaviors14:159-71.
17.
Haberman, P.W., E. Josephson, A. Zanes, and J. Elinson.1972. High school drug behavior: A methodological report on pilot studies. In Proceedings of the First International Conference on Student Drug Surveys, eds. S. Epstein and S. Allen, 103-21. Farmingdale : Baywood.
18.
Hassard, T.H.1986. Writing the book of life: Medical record linkage . In The fascination of statistics, eds. R. J. Brook, G. C. Arnold and T. H. Hassard, 25-46. New York: Dekker .
19.
Herzog, T.N., F.J. Scheuren, and W.E. Winkler.2007. Data quality and record linkage techniques . New York: Springer.
20.
Hogben, L., M.M. Johnstone, and K.W. Cross.1948. Identification of medical documents . British Medical Journal4552:632-35.
21.
Honig, F.1995. When you can’t ask their names: Linking anonymous respondents with the Hogben number. Australian Journal of Public Health19:94-96.
22.
Jaro, M.A.1989. Advances in record-linkage methodology as applied to matching the 1985 census of Tampa, Florida. Journal of the American Statistical Association84:414-20.
23.
Josephson, E., and M.A. Rosen.1978. Panel loss in a high school drug study . In Longitudinal research on drug use: Empirical findings and methodological issues, ed. D. B. Kandel , 115-33. New York: Wiley.
24.
Kandel, D.B.1973. Adolescent marihuana use: The role of parents and peers . Science181:1067-70.
25.
Kandel, D.B., E. Single, and R.C. Kessler.1976. The epidemiology of drug use among New York state high school students: Distribution, trends, and change in rates of use. American Journal of Public Health66:43-53.
26.
Kearney, K.A., R.H. Hopkins, A.L. Mauss, and R.A.Weisheit.1984. Self-generated identification codes for anonymous collection of longitudinal questionnaire data. Public Opinion Quarterly48:370-78.
27.
Krzanowski, W. J., and D. J. Hand. 2009. ROC curves for continuous data. Boca Raton: CRC Press.
28.
Lanza-Kaduce, L.1988. Perceptual deterrence and drinking and driving among college students. Criminology26:321-41.
29.
Lee, B.C., J.D. Westaby, and R.L. Berg.2004. Impact of a national rural youth health and safety initiative: Results from a randomized controlled trial. American Journal of Public Health94:1743-49.
30.
Levenshtein, V.I.1966. Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics-Doklady10:707-10.
31.
Mahon, N.E., and A. Yarcheski.1988. Loneliness in early adolescents: An empirical test of alternate explanations. Nursing Research37:330-35.
32.
McAlister, A., and N.P. Gordon.1986. Attrition bias in a cohort study of substance abuse onset and prevention. Evaluation Review10:853-59.
33.
McGloin, J., S. Holcomb, and D.S. Main.1996. Matching anonymous pre-posttests using subject-generated information. Evaluation Review20:724-36.
34.
McKeganey, N., M. Barnard, A. Leyland, I. Coote, and E. Follet.1992. Female streetworking prostitution and HIV infection in Glasgow. British Medical Journal305:801-4.
35.
Mellanby, A.R., R.G. Newcombe, J. Rees, and J.H. Tripp. 2001. A comparative study of peer-led and adult-led school sex education. Health Education Research16:481-92.
36.
Morgan, M., and J.W. Grube.1991. Closeness and peer group influence . British Journal of Social Psychology30:159-69.
37.
Morgenstern, M., G. Wiborg, B. Isensee, and R. Hanewinkel.2008. School-based alcohol education: Results of a cluster-randomized controlled trial. Addiction104:402-12.
38.
Newcombe, H.B.1988. Handbook of record linkage: Methods for health and statistical studies, administration, and business. Oxford: Oxford University Press.
39.
Newcombe, H.B., and J.M. Kennedy.1962. Record linkage: Making maximum use of the discriminating power of identifying information. Communications of the ACM5:563-66.
Rothman, E.F., M.R. Decker, and J.G. Silverman.2006. Evaluation of a teen dating violence social marketing campaign: Lessons learned when the null hypothesis was accepted . New Directions for Evaluation110:33-44.
42.
Schnell, R., T. Bachteler, and S. Bender.2004. A toolbox for record linkage. Austrian Journal of Statistics33:125-33.
43.
Schnell, R., T. Bachteler, and J. Reiher.2006. Die Anwendung statistischer Record-Linkage-Methoden auf selbst-generierte Codes bei Längsschnittuntersuchungen. ZA-Information59:128-42.
Stuart, R.B.1974. Teaching facts about drugs: Pushing or preventing ? Journal of Educational Psychology66:189-201.
46.
Swets, J.A., and R.M. Pickett.1982. Evaluation of diagnostic systems: Methods from signal detection theory. New York: Academic Press.
47.
Tripp, J.H., A. R. Mellanby, F.A.Phelps, H.A. Curtis, and N.J. Crichton.1994. A method for determining rates of sexual activity in school children . AIDS Care6:453-57.
48.
Winchester, L., S. Dobbinson, C. Rissel, and A. Bauman.1996. Anonymous record linkage using respondent-generated identification codes: A tool for health promotion research. Health Promotion Journal of Australia6:52-54.
49.
Winkler, W.E.1995. Matching and record linkage. In Business survey methods, eds. B. G. Cox, D. A. Binder, B. NanjammaChinnappa, A.Christianson, M. J. Colledge and P. S. Kott, 335-84. New York: Wiley.
50.
Yurek, L.A., J. Vasey, and D. Sullivan Havens. 2008. The use of self-generated identification codes in longitudinal research. Evaluation Review32:435-52.