Abstract
OBJECTIVE:
To generate normative data on the Trail Making Test (TMT) across 11 countries in Latin America, with country-specific adjustments for gender, age, and education, where appropriate.
METHOD:
The sample consisted of 3,977 healthy adults who were recruited from Mexico, Argentina, Peru, Paraguay, Honduras, Chile, Cuba, Puerto Rico, Guatemala, El Salvador, and Bolivia. Each subject was administered the TMT as part of a larger neuropsychological battery. A standardized five-step statistical procedure was used to generate the norms.
RESULTS:
The final multiple linear regression models for the TMT-A explained 23– 50% of the variance, and the final multiple linear models for the TMT-B explained 22– 49% of the variance. Although there were gender differences on the TMT in Mexico, Peru, Paraguay, and Honduras, only Honduras had an effect size greater than 0.3. As a result, gender-adjusted norms were generated for the Trail Making Test-A, but not B, in this country.
CONCLUSIONS:
The present study is the first to create norms for the TMT in Latin America. As a result, this study will have important implications for the practice of neuropsychology in the future.
Introduction
The Trail Making Test (TMT) is one of the most widely used neuropsychological assessment instruments and is the most common instrument for the assessment of attention (Rabin, Barr, & Burton, 2005). This paper-and-pencil test is easily administered, is in the public domain, and can be reproduced without permission (Lezak, Howieson, & Loring, 2004). Originally developed to assess general intelligence as part of the Army Individual Test Battery (1944), it was later included in the Halstead-Reitan Battery (Reitan & Wolfson, 1985), with detailed administration instructions updated by Spreen and Strauss (1998).
The TMT is considered a measure of psychomotor speed, visual scanning, attention, sequencing, and mental flexibility (Spreen & Strauss, 1998; Mitrushina, Boone, & D’Ella, 1999), and consists of two parts: Part A and Part B (TMT-A and TMT-B). In Part A, the goal is to connect consecutively numbered semi-randomly distributed circles on a sheet of paper as quickly as possible by drawing lines between them, without lifting the pencil from the paper (Lezak, Howieson, & Loring, 2004). Part B requires the subject to connect on a separate worksheet in ascending and alternating order the same number of circles which contain numbers and letters (i.e., 1-A-2-B-3, etc.). TMT-A is considered to be a measure of visual search/attention skills and psychomotor speed, as its performance has been shown to correlate with scores on other timed tasks which require visual search (e.g., WAIS-III Digit Symbol Coding; Sanchez-Cubillo et al., 2009). TMT-B, on the other hand, is thought of as a measure of executive control, cognitive flexibility, and set shifting, as it is correlated with performance on cognitive alternation and task-switching tests, as well as increased activation of frontal cortices on fMRI studies and results of prefrontal cortex lesion studies (Crowe, 1998; Arbuthnott & Frank, 2000; Yochim, Baldo, Nelson, & Delis, 2007; Jacobson, Blanchard, Connolly, Cannon, & Garavan, 2011). Both parts of the test have exhibited high test-retest reliability (at least 0.76 for Part A, and 0.82 for Part B in recently reported studies), with the coefficient values generally higher for TMT-B compared to A (Lezak et al., 2004; Seo et al., 2006; Wagner, Helmreich, Dahmen, Lieb, & Tadić, 2011).
The subject’s performance on the Trail Making Test yields two scores: times to completion (in seconds) for Parts A and B. Additionally, derived scores (i.e., difference B– A, and ratio B:A) are oftentimes used in clinical practice to remove the speed component from the test performance, provide a more pure measure of executive control, and serve as a possible symptom validity indicator (Arbuthnott & Frank, 2000; Drane, Yuspeh, Huthwaite, & Klingler, 2002; Lezak et al., 2004; Egeland, & Langfjaeran, 2007; Ashendorf, Jefferson, O’Connor, Chaisson, Green, & Stern, 2008; Sanchez-Cubillo et al., 2009).
The TMT is considered to be one of the best measures of general brain function (Armitage, 1946; Spreen & Benton, 1965; Reitan & Wolfson, 1985) and is sensitive to many dysfunctions in both adult and pediatric patient populations (Reitan & Wolfson, 1985). Due to its wide applicability, the TMT has been utilized to measure the cognitive effects of hepatic encephalopathy (Conn, 1977; Riggio et al., 2011), cognitive deterioration in HIV positive patients (Chalermchai et al., 2013; Selnes et al., 1991) and polydrug users (McCaffrey, Krahula, & Heimberg, 1989), patients with head trauma (Leininger, Gramling, Farrell, Kreutzer, & Peck, 1990; Thaler et al., 2012), Alzheimer’s disease (Amieva et al., 1998; Terada et al., 2013), Parkinson’s disease, supranuclear palsy (Pellecchia et al., 2015; Pillon et al., 1995), mild cognitive impairment, and normal aging (Ashendorf et al., 2008), among other disorders.
Despite its extensive use in neuropsychological and neuropsychiatric populations, the performance on the TMT has been associated with cultural and demographic factors (Agranovich & Puente, 2007; Horton & Roberts, 2003). While gender showed little relation with performance in adults (Tombaugh, 2004), increased age and lower education are related to lower test scores (Bornstein & Suga, 1988; Periañez et al., 2007; Wecker, Kramer, Wisniewski, Delis, & Kaplan, 2000; Woods, Wyma, Herron, & Yund, 2015). Ethnicity, cultural background, and degree of acculturation have also been linked to performance on the TMT (Ardila, 2001; Fernández & Marcopulos, 2008).
It has been recommended in the past to take cultural variables into consideration at each stage of neuropsychological evaluation, including administration and interpretation of results (Ardila, 1995Golden & Thomas, 2000; Ardila & Moreno, 2001). However, until relatively recently the preponderance of normative TMT data has been obtained mainly from Western, well-educated, and English-speaking countries (e.g., U.S., Canada; Tombaugh, 2004; Spreen & Strauss, 1998; Selnes et al., 1991; Goul & Brown, 1970). To ameliorate this problem, normative studies of the TMT have recently been carried out in other regions of the world, namely, in Japan (Abe et al., 2004; Hashimoto et al., 2006), Korea (Seo et al., 2006), Spain (Peña-Casanova et al., 2009), Turkey (Cangoz, Karakoc, & Selekler, 2009), China (Wang et al., 2011), and the Czech Republic (Bezdicek, 2012), and Portugal (Cavaco et al., 2013).
There is lack of normative neuropsychological data in Latin America, where to the best of our knowledge only two normative studies about the TMT have taken place. Campanholo and colleagues (2014) administered a battery of neuropsychological tests which included the TMT to 1025 healthy native Portuguese speakers from five regions of Brazil, obtaining normative data stratified by age (into six groups, ranging from 18 to over 70 years old) and years of education (four groups, 0– 13 years). Fernández and colleagues (2002) obtained the TMT normative data from a sample of 251 healthy adults (ages 15– 70 y.o.) from all educational levels (0– 24 years) in Argentina, and observed differential performance as a function of age and education. As relatively few individuals were included in certain cells of this normative study (e.g., only 9 participants over 60 years old with high level of education) generalizability of the findings may be somewhat limited. Even in the normative data obtained in Western countries, there is a great deal of variability, such that depending upon the norms used, an otherwise normal performance could be classified as pathological, and vice versa (Fernández & Marcopulos, 2008). The present study aims to establish normative data, stratified by age and educational level, for 11 countries in Latin America.
Method
Participants
The sample consisted of 3,977 healthy individuals who were recruited from Argentina, Bolivia, Chile, Cuba, El Salvador, Guatemala, Honduras, Mexico, Paraguay, Peru, and, Puerto Rico. The participants were selected according to the following criteria: a) were between 18 to 95 years of age, b) were born and currently lived in the country where the protocol was conducted, c) spoke Spanish as their native language, d) had completed at least one year of formal education, e) were able to read and write at the time of evaluation, f) scored ≥23 on the Mini-Mental State Examination (MMSE, Folstein, Folstein, & McHugh, 1975), g) scored ≤4 on the Patient Health Questionnaire– 9 (PHQ-9, Kroenke, Spitzer, & Williams, 2001), and h) scored ≥90 on the Barthel Index (Mahoney, & Barthel, 1965).
Participants with self-reported neurologic or psychiatric disorders were excluded due to a potential effect on cognitive performance. Participants were volunteers from the community and signed an informed consent. Twelve participants were excluded from the analyses, with a final sample of 3965 participants. Socio-demographic and participant characteristics for each of the countries’ samples have been reported elsewhere (Guàrdia-Olmos, Peró-Cebollero, Rivera, & Arango-Lasprilla, 2015). The multi-center study was approved by the Ethics Committee of the coordinating site, the University of Deusto, Spain.
Instrument administration
The TMT consists of two parts: TMT-A and B. In the TMT-A the individual must draw a line to connect 25 numbers in ascending order, which are circled and randomly distributed on a sheet of paper. The task requirements are similar for the TMT-B, except that the person alternates between numbers and letters (1-A, 2-B, 3-C, etc.), the latter being significantly more difficult (Drane, Yuspeh, Huthwaite, & Klingler, 2002). The score is the time that an individual takes to finish the task in each test. The time limit for TMT-A is 100 seconds (maximum score) and 300 secondsfor TMT-B.
Statistical analyses
The detailed statistical analyses used to generate the normative data for this test are described in Guàrdia-Olmos et al. (2015). In summary, the data manipulation process for each country-specific dataset involved five-steps: a) t – tests for independent samples and effect sizes (r) were conducted to determine gender effects. If the effect size was larger than 0.3, gender was included in the model with gender dummy coded and female as the reference group (male = 1 and female = 0). b) A multivariable regression model was used to specify the predictive model including gender (if effect size was larger than 0.3), age as a continuous variable, and education as a dummy coded variable with 1 if the participant had >12 years of education and 0 if participants had 1– 12 years of education. If gender, age and/or education was not statistically significant in this multivariate model with an alpha of 0.05, the non-significant variables were removed and the model was re-run. Then a final regression model was conducted that included age (if statistically significant in the multivariate model), dichotomized education (if statistically significant in the multivariate model), and/or gender (if effect size was greater than 0.3) ; c) residual scores were calculated based on this final model (); d) using the SD e (residual) value provided by the regression model, residuals were standardized: z = e i /SD e , with SD e (residual) = the standard deviation of the residuals in the normative sample; and e) standardized residuals were converted to percentile values (Strauss et al., 2006). Using each country’s dataset, these steps were applied to TMT A & B errors.
Results
Trail Making Test – A
Regarding the effect of gender on TMT-A, the t-tests showed significant differences between men and women in the countries of Honduras, Mexico, Paraguay, and Peru. Table 1 shows the results of the gender analysis by country on TMT-A scores. As shown in Table 1, the effect sizes for all countries except Honduras were less than 0.3, and therefore gender was only taken into account to generate TMT-A normative data for the Honduras sample.
The final eleven TMT-A scores multivariate linear regression models for each country are shown in Table 2. In all countries, except Puerto Rico, the TMT-A scores decreased for those with more than 12 years of education (see Table 2), and, in all countries, TMT-A scores increased in a linear fashion as a function of age. The amount of variance explained in TMT-A scores ranged from 23% (in Argentina) to 50% (in Paraguay).
Trail Making Test – B
Regarding the effect of gender on TMT-B scores, the t-test showed significant differences between men and women in the countries of Honduras, Mexico, and Paraguay. Table 3 shows the results of the gender analysis by country on TMT-B. As shown in Table 3, the effect sizes for all countries were less than 0.3, and therefore gender was not taken into account to generate TMT-B normative data.
The final eleven TMT-B multivariate linear regression models for each country are shown in Table 4. In all countries, TMT-B scores decreased for those with more than 12 years of education (see Table 4) and, TMT-B scores increased in a linear fashion as a function of age. The amount of variance explained in TMT-B scores ranged from 22% (in Cuba) to 49% (in Honduras).
Normative procedure
Norms (e.g., a percentile score) for the TMT A & B scores were established using the five-step procedure described above. To facilitate the understanding of the procedure to obtain the percentile associated with a score on this test, an example will be given. Suppose you need to find the percentile score for an Argentine man, who is 50 years old and has 17 years of education. He has a score of 40 (seconds) on TMT-A. The steps to obtain the percentile for this score are: a) Check Table 1 to determine if the effect size of gender in the country of interest (Argentina) on this test and time point (TMT-A) is greater than 0.3 by country. The column labelled r in Table 1 indicates the effect size and the superscript notation b next to the number indicates that the number is larger than 0.3. In this example, the effect size is 0.079, which is not greater than 0.3. For Argentines on this test, gender does not influence scores to a sufficient degree to take it into account gender when determining the percentile. b) Find Argentina in Table 2, which provides the final regression models by country for TMT-B. Use the B weights to create an equation that will allow you to obtain the predicted TMT-B score. The corresponding B weights are multiplied by the actual age and dichotomized education scores and added to a constant in order to calculate the predicted value. In this case, the predicted TMT-A would be calculated using the equation (the values have been rounded for presentation in the formula). The subscript notation i indicate the person of interest. The person’s age is 50, but the education variable is not continuous in the model. Years of education is split into either 1 to 12 years (and assigned a 0) or more than 12 years (and assigned a 1) in the model. Since our hypothetical person in the example has 17 years of education, his educational level value is 1. Thus the predicted value is ). c) In order to calculate the residual value (indicated with an e in the equation), we subtract the actual value from the predicted value we just calculated (). In this case, it would be e i = 40 - 31.662 = 8.338. d) Next, consult the SD e column in Table 2 to obtain the country-specific SD e (residual) value. For Argentina it is 10.264. Using this value, we can transform the residual value to a standardized z score using the equation (e i /SD e ). In this case, we have 8.338/10.264 = 0.812. In the case of TMT A & B, the order of the scores were reversed (e.g., the z score sign changed from negative to positive or positive to negative) in order to maintain an interpretation of improved performance, higher percentile. Thus – 0.812 is the standardized z score for an Argentine man aged 50 and 17 years of education and a score of 40 on TMT-A. e) The last step is to use look-up the tables in the statistical reference books (e.g. Strauss et al., 2006) or use a trusted online calculator like the one available at http://www.measuringu.com/pcalcz.php. In the online calculator, you would enter the z score and choose a one-sided test and note the percent of area after hitting the submit button. In this case, the probability of -0.812 corresponds to the 21st percentile. Please remember to use the appropriate tables that correspond to each test when performing these calculations. If the percentile for the TMT-B scores is desired, Tables 3-4 must be used.
User-friendly normative data
The five-step normative procedures explained above can provide more individualized norms. However, this method can be prone to human error due to the number of required computations. To enhance user-friendliness, the authors have completed these steps for a range of raw scores based on small age range groupings (see Guàrdia-Olmos et al., 2015) and created tables so that clinicians can more easily use to obtain a percentile range associated with a given raw score on this test. These tables are available by country and type of test in the Appendix. In order to obtain an approximate percentile for the above example (converting a raw score of 40 for an Argentine man who is 50 years old and has 17 years of education) using the simplified normative tables provided, the following steps are recommended. (1) First, identify the appropriate table ensuring the specific country and test. In this case, the table for TMT-A for Argentina can be found in Table A1. (2) Note if the title of the table indicates that it is only to be used for one specific gender. In this case, gender is not specified. Thus Table A1 is used for both males and females. (3) Next, the table is divided based on educational level (1 to 12 vs. more than 12 years of education). Since this man has 17 years of education, he falls into the more than 12 years of education category. These data can be found in the top section of the table. (4) Determine the age range most appropriate for the individual. In this case, 50 fall into the column 48– 52 years of age. (5) Read down the age range column to find the approximate location of the raw score the person obtained on the test. Reading down the 48– 52 column, the score of 40 obtained by this Argentine man corresponds to an approximate percentile of 20.
The percentile obtained via this user-friendly table method (20th) is slightly different than the more exact one (21st) obtained following the individual conversion steps above because the table method is based on an age range (e.g., individuals aged 48– 52) instead of the exact age (individuals aged 50). If the exact score is not listed in the column, you must estimate the percentile value from the listed raw scores.
Discussion
The purpose of the current study was to generate normative data on the TMT across 11 countries in Latin America, with country-specific adjustments for gender, age, and education, where appropriate. The final multiple linear regression models explained between 23– 50% of the variance in TMT-A scores and 22– 49% of the variance TMT-B scores. On the TMT-A, gender differences emerged in several countries, although only Honduras reached an effect size greater than 0.3. Similarly, on the TMT-B, there were several gender differences, but none reached an effect size of 0.3. Although gender-based norms have often been used in neuropsychological assessment, these findings generally conformed to those found in the research literature showing gender to have little association with TMT performance (Tombaugh, 2004). As a result, the performance of the current sample on the TMT in terms of gender likely is not different from other normative samples. In light of the previous literature and because the gender differences in TMT performance in the current study generally showed small effect sizes, gender-adjusted norms were not generated, except for in Honduras. Except in Honduras on the TMT-A, gender-adjustments should not be made in calculating percentiles for the TMT in LatinAmerica.
Both TMT scores decreased linearly as a function of education in almost all countries. However, there was no effect of education for the TMT-A in Puerto Rico. These general effects of education resonate with the prior literature showing that education has been inversely associated with TMT scores (Stuss et al., 1987; Wecker et al., 2000; Bornstein & Suga, 1988). Therefore, neuropsychologists in Latin America should use education-adjusted norms generated in this study for each country on the TMT, except in Puerto Rico for the TMT-A. Various countries in Latin America likely have major differences in their quality of education, and as a result, the current TMT education adjustments should be used in their respective Latin American countries. Perhaps these differences in education are the largest between Puerto Rico and the other countries in this study, given that Puerto Rico is a territory of the United States, and therefore has one of the more advanced educational systems. This could have accounted for the consistent educational effect on TMT-A scores in all countries exceptPuerto Rico.
TMT scores increased with advancing age in all countries in this study. This robust finding is consistent with the previous literature showing older age to be associated with higher TMT scores (Stuss et al., 1987; Wecker et al., 2000; Bornstein & Suga, 1988). When considering the previous findings, those from the current study suggests that TMT corrections for age should be made in all Latin American countries tested in this study.
Limitations and future directions
The current study has several limitations, and as a result directions for future research. First, the TMT is a very common neuropsychological assessment instrument in Latin America, but many other common instruments should be normed following the same procedures in this study to improve their use in Latin America as well. Similarly, future studies needs to examine the ecological validity and psychometrics of the TMT and these other common neuropsychological instruments in Latin America. Researchers should create instruments within Latin American cultures with high ecological validity, considering that the TMT was developed and validated initially in a Western culture which differs in many ways from the various cultures present in Latin America. Developing assessments in the context of local cultures, instead of simply translating and norming instruments from other cultural contexts, would represent a crucial advance in neuropsychological assessment throughoutthe region.
Second, neuropsychologists should use caution when applying the TMT norms from this study in conducting assessments with the TMT in countries other than those in which data were collected. Future research needs to create TMT norms in countries in Latin America that were not included in this study, including Ecuador, Uruguay, Venezuela, and Panama. Despite this limitation, the TMT norms from the current study may actually be more accurate in other Latin American countries than some of the norms that neuropsychologists in those countries currently apply. The generalizability of the current norms to other Latin American countries is an important area of future research.
Third, all participants in the current study spoke Spanish as a primary language, but they may have spoken secondary languages as well, such as local dialects or English. TMT performance may be different among bilingual individuals from Latin America, so future studies need to examine possible influences of bilingualism on TMT performance. Participants were all recruited from distinct regions or cities in each country, instead of nationally within the countries. However, this was the largest TMT normative study to date in Latin America, or in any global region, and it is a first step toward larger, nationally representative studies. The sample was also limited in that it contained many participants with fewer than 12 years of education, but those who were unable to read were excluded. As a result, the current TMT norms may not apply well to illiterate adults, so future studies should be conducted with illiterate individuals, as well as those with neurological conditions and children.
Despite these limitations, only limited studies have produced TMT norms in Spanish-speaking populations such as Spanish-speakers from Spain (Peña-Casanova et al., 2009) and Argentina (Fernandez et al., 2002). Therefore, this study was the first systematic study to create TMT norms across 11 countries in Latin America with almost 4,000 participants. It was the largest, most comprehensive TMT normative study to date in any global region, and its norms have the potential to improve the standard of neuropsychological assessment with the TMT in Latin America unlike any study before it.
Appendix
Normative data for the TMT-A stratified by age and education levels for ARGENTINA
Normative data for the TMT-A stratified by age and education levels for BOLIVIA
Normative data for the TMT-A stratified by age and education levels for CHILE
Normative data for the TMT-A stratified by age and education levels for CUBA
Normative data for the TMT-A stratified by age and education levels for EL SALVADOR
Normative data for the TMT-A stratified by age and education levels for GUATEMALA
Normative data for the TMT-A stratified by age and education levels and gender for HONDURAS: MALES only
Normative data for the TMT-A stratified by age education level, and gender for HONDURAS: FEMALES only
Normative data for the TMT-A stratified by age and education levels for MEXICO
Normative data for the TMT-A stratified by age and education levels for PARAGUAY
Normative data for the TMT-A stratified by age and education levels for PERU
Normative data for the TMT-A stratified by age for PUERTO RICO
Normative data for the TMT-B stratified by age and education levels for ARGENTINA
Normative data for the TMT-B stratified by age and education levels for BOLIVIA
Normative data for the TMT-B stratified by age and education levels for CHILE
Normative data for the TMT-B stratified by age and education levels for CUBA
Normative data for the TMT-B stratified by age and education levels for EL SALVADOR
Normative data for the TMT-B stratified by age and education levels for GUATEMALA
Normative data for the TMT-B stratified by age and education levels for HONDURAS
Normative data for the TMT-B stratified by age and education levels for MEXICO
Normative data for the TMT-B stratified by age and education levels for PARAGUAY
Normative data for the TMT-B stratified by age and education levels for PERU
Normative data for the TMT-B stratified by age and education levels for PUERTO RICO
Normative data for the TMT-B stratified by age and education levels for PERU
Normative data for the TMT-B stratified by age and education levels for PUERTO RICO
