Abstract
The question of whether age and facial identity are processed through a shared or parallel route has scarcely been examined, despite being of theoretical relevance for face processing models. For the first time, the Garner speeded classification paradigm was applied to assess the independence of age and facial identity processing. Across three experiments, participants made either age or facial identity judgements while both dimensions varied (Filtering) or only one dimension varied while the other remained constant (Baseline). Garner interferences, represented by slower response times for the Filtering condition compared to the Baseline condition, were recorded for both Experiments 1 (familiar, cropped, single-image stimuli) and 2 (unfamiliar, cropped, single-image stimuli). A weaker Garner interference was recorded for Experiment 3 (familiar, naturalistic, multi-image stimuli). Garner interference for the first two experiments is indicative of the shared route hypothesis for identity and age perception. However, findings from Experiment 3 suggest that these effects are weaker for naturalistic images, and the implications of this are discussed.
Keywords
Introduction
Traditional theoretical face processing models (Bruce & Young, 1986) considered judgements of age to be one of several visually derived semantic codes defined as information which can be inferred from visual cues with relative accuracy. These visually derived semantic codes, which also include sex and attribute judgements, were considered independent by-products of the operation of face recognition but not essential components for face recognition. However, the theory that these processes are independent has since been challenged, suggesting, for example, that sex judgements and facial identity are processed through a single route (Ganel & Goshen-Gottstein, 2002). Age perception, within Bruce and Young’s (1986) theoretical framework, was not considered any further, and subsequent theoretical models of face processing do not include age (Burton et al., 1999; Haxby et al., 2000; Valentine, 1991). Yet, the face undergoes progressive transformations across the lifespan, altering the structure of the face, facial features, and skin texture and tone. However, some idiosyncratic physical properties are retained, and these can be used in the recognition of familiar faces from images taken decades apart (Mileva et al., 2020). This suggests that age may, in fact, be an important component of face processing, and it is not yet established whether or to what extent face recognition and age perception share a single processing route.
The interaction activation and competition (IAC) model (Burton et al., 1999) proposes that the architecture of face processing is structured around pools of units which connect bidirectionally to units in other pools. This structure comprises a Face Recognition Unit (FRU), Person Identity Nodes (PIN), Semantic Information Unit (SIU) and Word Recognition Unit (WRU). There is one FRU for each face, and this is view-independent, while one PIN unit contains information on a person, rather than the face. At PIN, the model proposes that other known aspects of the person converge, such as the name (WRU) and occupation (SIU). The IAC does not account for the perception of age within this architectural structure. It can be argued that age would be stored in the SIU such that known information about the age of an identity can be drawn once the PIN is activated. However, unlike the name, occupation and other semantic information, age is a fundamental visual feature, which is relatively unstable and transforms slowly over time. Although age could be comparable to other visually derived facial information, specifically sex and emotional expression, it is also unique. Sex is typically stable over time, and an observer can gain exposure to a range of emotional expressions within a single meeting. By contrast, different ages of a person are not typically viewed during the learning phase of a face, such as in the example of a single encounter. Therefore, in comparison to exposure to emotional expression, exposure to different ages of the same identity is more infrequent in everyday interactions.
An alternative theory, as suggested by Bruce and Young (1986), is that age perception, being a visually derived semantic code with no functional purpose, is just a by-product of facial recognition. From a functional perspective, forming an age-independent representation of a face would be beneficial, since it would allow for a robust facial recognition processing system that is not impaired by age-related facial changes over time. Computer recognition models have been shown to recognise familiar faces from images depicting the face decades apart, and stable idiosyncratic features might be key to recognising faces despite these ageing transformations (Mileva et al., 2020). However, in human perception, it is not clear to what extent independent representations of facial identity, such as idiosyncratic features or features associated with fundamental aspects of facial identity recognition, contribute to the recognition of identities across ages. There is some overlap in low-level perceptual features that have been identified as important for both facial identity recognition and age perception, such as pigmentation (i.e. surface reflection, luminance and colour contrast), and alterations to these features due to ageing can also change representation of identity (Galper, 1970; Johnston & Edmonds, 2009; Porcheron et al., 2013; Russell et al., 2006). This suggests that aspects of the face that are important for face recognition are also changeable, at least to some degree, during the ageing process. However, the unnatural image distortions created by the experimental manipulations implemented in such studies (e.g. using photographic negatives) make generating inferences about the overlap between key features used in facial identity recognition and age perception challenging. Nonetheless, a bottom–up account of crosstalk has been reported between other aspects of face perception, specifically, for example, the perception of gender and age, because of shared features underlying both these mechanisms (Fitousi, 2021). Additionally, there may also be top–down influences whereby representations of age and identity overlap at the level of prior knowledge, stereotypes and social expectations, or a dynamic interplay between the two accounts, as described in Freeman and Ambady’s (2011) model of person construal. Thus, for example, in the context of age and identity, knowledge of the release date of an actor’s movie can enhance the processing of their current age.
Therefore, it is not inconceivable that the processing of age shares some underlying mechanism with facial identity processing. However, Haxby et al.’s (2000) neural model of face perception proposes that the brain networks for changeable aspects of the face are independent of invariant aspects of the face. In this model, changeable aspects of face perception refer to the perception of movement and information that facilitates social communication, such as gaze direction, emotional expressions and facial speech. By contrast, invariant aspects are associated with stable dimensions, such as facial identity, gender and race. The posterior superior temporal sulcus is responsible for processing changeable information, while the fusiform gyrus is used for the invariant face information (Haxby et al., 2000; Hoffman & Haxby, 2000; Kanwisher et al., 1997), although the degree of separation between the functional roles of the different brain regions is not clear (Haxby et al., 2000).
Age, however, is not directly considered within Haxby et al.’s (2000) neural system, and whether age should be classified as a changeable or invariant aspect of the face is not specified. Although there is no dispute that age does involve significant changes to the face (Porcheron et al., 2013), in many ways it is unlike other changeable aspects considered in the model. Specifically, the transformations to the face due to ageing occur over a longer period; although age provides social context and impacts on social interactions, it does not primarily function to convey social information in the same direct and fleeting way as emotional expression, facial speech movements and gaze direction.
Parallel or shared route? Insights from empirical studies
Taken together, theoretical models of face processing often do not directly account for the perception of age. If age information is to be considered within the same category as other changeable non-identity face information, theoretical models would suggest that age and facial identity operate independently. A small set of cognitive and behavioural studies support the independence hypothesis. Examination of the dissociation of ability between different face perception tasks in those with impaired facial recognition ability suggests that the ability to extract other non-identity facial information remains intact (e.g. Chatterjee & Nakayama, 2012; Nunn et al., 2001). Specifically, for example, individuals with developmental prosopagnosia do not differ from the control group in their ability to order computer-generated unfamiliar faces from young to old (Chatterjee & Nakayama, 2012). These studies provide some support for the independent route. However, the ability to estimate age using different tasks, natural images and varying difficulty levels was not assessed (Chatterjee & Nakayama, 2012). Furthermore, the nature and severity of face processing impairment may affect the range of cognitive, perceptual and face-related processes impacted (Bate et al., 2019; Behrmann et al., 2005; Le Grand et al., 2006; Rosenthal & Avidan, 2018). This makes drawing conclusions regarding the (in)dependence of age and facial identity from research with this population challenging.
By contrast, other behavioural studies challenge the notion that age perception and facial identity operate parallel routes. In a repetition priming study, the ages of primed familiar and famous faces were classified faster and more accurately than non-primed faces (Dagovitch & Ganel, 2010). Priming was achieved by asking participants to provide judgements of the face images (attractiveness, trustworthiness, pleasantness ratings) during the priming phase, which was then followed by rating the age of these faces (Dagovitch & Ganel, 2010). However, there is some debate concerning whether patterns recorded in these studies reflect mere exposure effects or true repetition priming (for review, see Butler & Berry, 2004). This caveat limits the conclusions that could be drawn about (in)dependence of familiar face processing and age perception. Furthermore, it provides little indication as to the interaction of age perception with unfamiliar face processing. An applied study examining unfamiliar face matching and age verification on accuracy when checking ID cards (Robertson & Burton, 2021) suggests that there may be an interaction between age perception and unfamiliar face processing. A striking interference between the two tasks was recorded, such that the task of verifying age by a neurotypical population resulted in an increase in face matching error by a margin of 46%, and subsequent experiments excluded an increase in cognitive load as the reason (Robertson & Burton, 2021). Although this study did not directly assess independence in processing, the interference in performance suggests that there may be a degree of interaction.
Studies vary in their use of familiar and unfamiliar faces; however, there is now a substantial body of evidence demonstrating that familiar and unfamiliar faces are processed differently (for review, see Johnston & Edmonds, 2009), and findings from familiar and unfamiliar faces should not be conflated (Burton, 2013). In the context of age processing, estimating the age of familiar faces may involve some degree of retrieval of semantic information related to any prior knowledge the observer holds about the identity (Bruce & Young, 1986). Therefore, perception of age of familiar faces across the lifespan may be supported by an additional top–down component not available for unfamiliar faces. In fact, when comparing familiar and unfamiliar faces on an age decision task, familiar faces recorded a significant accuracy advantage (Bruyer et al., 1991, 2007). Although this effect was reliant on perceptual difficulty and present only when the faces were presented for brief exposures, it does suggest that familiar and unfamiliar faces may interact differently with age processing.
In sum, although previous face perception models account for the processing of non-identity aspects of face perception (e.g. sex, race, emotional expression; Bruce & Young, 1986; Burton et al., 1999; Haxby et al., 2000; Valentine, 1991), age and how it interacts with facial identity processing is not accounted for in any direct or complete way. Considering age to be like other non-identity aspects of face processing is also problematic. This is because of the slow progressive transformation, which is a unique characteristic of age not shared with any of the other aspects of face processing, such as race, which is stable over time, or emotional expression, which changes frequently over just a few seconds. Many insights have been gleaned indirectly through studies examining other aspects of face processing (Chatterjee & Nakayama, 2012; Nunn et al., 2001; Robertson & Burton, 2021), and few studies have made direct attempts to understand whether facial identity processing and age perception are processed through parallel or shared routes (Dagovitch & Ganel, 2010).
The findings from these studies are inconsistent, and the question of whether identity and age processing operate a shared or parallel route remains unresolved, leaving a gap in our understanding of age processing within existing face processing frameworks. Therefore, over a series of three experiments, 1 the present study will systematically examine the question of the independence of facial identity and age processing for both familiar and unfamiliar faces using the Garner speeded classification paradigm. This paradigm has been used previously to assess the independence of different dimensions of face processing (see Algom & Fitousi, 2016), but has not yet been applied to examine facial identity and age processing.
The Garner speeded classification paradigm
The Garner speeded classification paradigm is a well-established approach for examining the independence of a wide range of perceptual dimensions, including shape perception, numerical cognition, auditory perception and face perception (see e.g. Algom & Fitousi, 2016; Atkinson et al., 2005; Ganel & Goshen-Gottstein, 2002; Schweinberger & Soukup, 1998; Wang et al., 2013). This paradigm is based on the theoretical concept of selective attention, that is, the ability to focus on relevant stimuli while suppressing responses to irrelevant or conflicting information. Therefore, in this paradigm, participants selectively attend to one aspect of the stimulus (i.e. the task-relevant dimension) while ignoring another aspect (i.e. the task-irrelevant dimension; see Algom & Fitousi, 2016; Garner, 1976; Fitousi, 2023). The theoretical concept is similar to that of the Stroop effect, whereby failure to selectively attend to a single stimulus dimension suggests that the stimulus is composed of dimensions that interact in processing, that is, integral dimensions. Thus, integral dimensions are processed as a unified whole, and examples of integral dimensions include colour ink and colour word (see Eidels et al., 2010), hue and saturation (Garner, 1975), loudness and timbre (Melara & Marks, 1990). By contrast, successful selective attention suggests that the dimensions of the stimulus do not interact in processing, that is, separable dimensions (for a detailed description, see Algom & Fitousi, 2016). Thus, separable dimensions can be deconstructed and processed independently, and examples of separable dimensions include colour and shape (Garner, 1975), size and brightness (Garner, 1977), and colour and texture (Cant et al., 2008). Converging evidence from other methodologies provide further support for the distinction between integral and separable dimensions (see Algom & Fitousi, 2016). However, a detailed account of the mechanisms underlying Garner interference and its relationship to other methodologies is beyond the scope of this paper, but comprehensive reviews and critical evaluations can be found in the works of Algom and Fitousi (2016), Fitousi (2023) and Melara and Mounts (1993).
This paradigm usually comprises three blocks of trials. In the Baseline condition, the task-irrelevant dimension is held constant while the task-relevant dimension varies. In the Filtering condition, the task-irrelevant dimension will vary randomly. In the correlated condition, the task-irrelevant dimension will vary consistently across all trials with the relevant dimension. Taking emotional expression and facial identity as an example, if the task here is to classify facial identity while ignoring emotional expression, the task-relevant dimension is facial identity, and the task-irrelevant dimension is emotional expression. In the Baseline condition, the identity of faces (i.e. task relevant) will differ, but facial expression (i.e. task irrelevant) will be held constant across all trials in this block (e.g. Persons A and B always express anger). In the Filtering condition, the task-irrelevant facial expression dimension will vary randomly (e.g. Persons A and B randomly express anger or disgust). In the correlated condition, changes to the facial expression will vary consistently with changes to facial identity (e.g. Person A always expresses anger and Person B always expresses disgust). The dimensions can be replaced with other aspects of the face, including facial identity and facial age, which are relevant to this paper. A graphical illustration of the Baseline, Filtering and Correlation blocks using facial identity and age as dimensions is presented in Figure 1.

Graphical illustration depicting which stimulus combinations would be presented for each of the Baseline, Filtering and Correlation Blocks.
The difference in response times across these three conditions reveals the success of selective attention to a single aspect of the stimulus. A measure of Garner interference is calculated by computing the difference in mean response times between the Baseline and Filtering conditions. Slower response times in the Filtering condition would indicate that the task-irrelevant dimension influenced the ability to selectively attend to the task-relevant dimension. An analogous calculation is also often performed to calculate the difference in mean response times between the correlated and the baseline conditions. If response times for the correlated condition are faster than the baseline condition, this is called Redundancy Gain. The two dimensions are characterised as separable if the speed and accuracy of performance are comparable across all three conditions. By contrast, if the results reveal both Garner interference and Redundancy gain, then the dimensions are considered integral (see Algom & Fitousi, 2016).
Another possible outcome is asymmetric interference. This term describes a scenario where one dimension influences a second dimension, but the reverse is not found. For example, facial identity has been found to interfere with emotional expression classification, but emotional expression does not interfere with facial identity recognition (Schweinberger & Soukup, 1998). Although Garner suggested that this was likely the result of a hierarchy in processing, asymmetry may instead be determined by baseline discriminability (see Algom & Fitousi, 2016). Discriminability is defined as the difference separating two stimuli along a single dimension (Melara & Mounts, 1993). Baseline discriminability is matched across both dimensions when the recorded speed and accuracy are comparable for the two dimensions in the Baseline condition. The effect of matched and mismatched baseline discriminability between the two dimensions has been examined through a series of studies using a range of perceptual stimuli. This work demonstrates that asymmetry is usually recorded when relative baseline discriminability (RBD) between the two dimensions is mismatched (e.g. Algom et al., 1996, 2022; Melara & Mounts, 1994; Pansky & Algom, 1999). Typically, the highly discriminable dimension leads to failure of selective attention to the less discriminable dimension but not the opposite (Melara & Mounts, 1993).
Garner effects in face processing
The Garner paradigm has provided an effective tool for examining whether facial identity and other dimensions of face perception (such as emotional expression and facial speech) are processed along a shared or parallel route (see Algom & Fitousi, 2016; Fitousi & Wenger, 2013; Wang et al., 2013). Findings from several Garner studies provide evidence for the shared route hypothesis across dimensions of familiarity and sex (Ganel & Goshen-Gottstein, 2002), and identity and expression (Ganel & Goshen-Gottstein, 2004). In other studies, Garner interference was found to be asymmetric. For example, interference of facial identity on emotional expression is recorded, but not the reverse (Schweinberger & Soukup, 1998; Wang et al., 2013). Similarly, sex classification influences emotion classification but not the opposite (Atkinson et al., 2005). Differences in discriminability of the two dimensions may explain the asymmetry recorded for facial identity processing and other facial dimensions when using the Garner Speeded Classification paradigm (Le Gal & Bruce, 2002; Wang et al., 2013). However, baseline discriminability may not explain all cases of asymmetry. For example, when discriminability is matched such that the speed of classifications of these two dimensions in the Baseline condition is equal, asymmetry was upheld (Schweinberger & Soukup, 1998; Schweinberger et al., 1999).
There are also some inconsistencies in findings across studies, such that some studies have found symmetric Garner interference between two dimensions (e.g. identity and facial expression) (Fitousi & Wenger, 2013; Ganel & Goshen-Gottstein, 2002), while others have recorded asymmetry (Schweinberger & Soukup, 1998; Schweinberger et al., 1999), or no interference at all (Wang et al., 2013). Disparities in findings may be attributed to variations in the stimuli used. Specifically, for example, whether RBD was controlled for, and whether the faces used were familiar or unfamiliar. With regards to the latter, face familiarity has been found to moderate the presence and strength of Garner interference in some studies (Ganel & Goshen-Gottstein, 2002, 2004; Kaufmann & Schweinberger, 2004). Thus, both these stimulus attributes will be considered in the present study.
The present study
Although age has been considered in studies assessing independence from emotional expression, and has also been found to operate independently of sex and race (Fitousi, 2020), few studies to date have directly examined the independence of age and facial identity processing (Dagovitch & Ganel, 2010). The Garner speeded classification paradigm will be used to systematically investigate the independence of the processing of age and facial identity. If age and facial identity share a processing mechanism, and are therefore integral, slower response times in the Filtering compared to the Baseline condition will be recorded. By contrast, null effects will be recorded across all experiments if facial identity and age processing are separable.
Given that existing theoretical models do not explicitly account for age perception (Bruce & Young, 1986; Burton et al., 1999; Haxby et al., 2000), in addition to the inconsistent findings on the interaction between age and facial identity processing in other empirical studies (e.g. Chatterjee & Nakayama, 2012; Dagovitch & Ganel, 2010; Robertson & Burton, 2021), predicting a specific outcome is challenging. The findings from these experiments will serve to advance our understanding of age perception within existing theoretical frameworks of facial identity processing.
To this end, in Experiment 1, the Garner paradigm will be used, where participants will classify the age and facial identity of well-known famous faces across the two conditions (i.e. Baseline and Filtering). Given that previous studies have found familiarity effects on decision tasks of age (Bruyer et al., 2007) and other visually derived semantic codes, such as gender (Rossion, 2002) and ethnicity (Bruyer et al., 2004), as well as identity (Ganel & Goshen-Gottstein, 2002, 2004), an experiment with unfamiliar faces will be included. Thus, in Experiment 2, the design will be identical, but the facial identity stimuli will comprise lesser-known famous faces (e.g. international persons). The reason for using international individuals is that faces of unfamiliar people with known ages at different time points are more difficult to source.
Previous studies have reported that cues external to facial features can facilitate categorisation by encouraging picture-based strategies (Goshen-Gottstein & Ganel, 2000). Given that the same stimuli are repeatedly presented to participants, any salient non-face marker, such as differences in face contour, could act as an artificial cue that may interfere with the categorisation task. Therefore, a similar approach to standardising stimuli (i.e. cropping faces into an oval shape) used in other studies (e.g. Wang et al., 2013) will be used here for Experiments 1 and 2 to discourage participants from using such picture-based strategies. However, a criticism of the Garner speeded classification paradigm is the difficulty with generalising the findings to the normal face recognition system. This is because of the constrained nature of cropped stimuli, which is different from how we encounter faces in everyday life (see Burton, 2013). To address this, Experiment 3 will use multiple naturalistic images of famous identities comprising a high degree of within-person variability.
Experiment 1
Participants
To determine the sample size required for the main analysis, a 2 × 2 ANCOVA with one covariate, a power analysis was performed based on a study that examined the age classification paradigm for age and emotion categorisation. This study reported a significant interaction effect for condition (Filtering, Baseline) by categorisation (age, emotional expression) with an effect size of η p 2 = .28 (extracted from Karnadewi & Lipp, 2011), whereby there was a slower performance in the Filtering condition compared to the baseline condition such that varying age interfered with emotional expression but not the opposite. The power analysis computed using G*Power revealed a required sample size of 36 participants to achieve a power of 0.95 with an alpha level of .05. Therefore, 40 to 50 male and female participants were considered sufficient. All participants were required to have a normal or corrected-to-normal vision. To minimise any cross-race influence on facial recognition performance (Bothwell et al., 1989; Meissner & Brigham, 2001) and age perception (Dehon & Brédart, 2001; Rhodes, 2009), only data from Caucasian participants were included. Participants recruited were over the age of 18, with no upper limit on the age range. However, there is some evidence for an age bias in both facial recognition (Rhodes & Anastasi, 2012) and age perception (Davis & Attard-Johnson, 2022; Moyse & Brédart, 2012); therefore, participant age was included as a covariate in the analysis. No other inclusion or exclusion criteria were specified.
In total, 67 participants took part and fully completed Experiment 1. Of these, 11 were removed because they did not meet the criteria for ethnicity, and 1 because they had a percentage error rate of greater than 40%. This resulted in 55 remaining participants, 26 (16 male, Mage = 34, SDage = 9.52, range = 20–55) completed the ‘Age Classification’ task, and 29 (16 male, 1 non-binary, Mage = 38, SDage = 11.77, range = 19–66) completed the ‘Facial Identification’ task. Participants were randomly assigned to one of the two tasks. Participants who did not pass the initial prescreen or practice trials, or who did not see the task to completion, have not been reported here. Participants were recruited on Testable using the ‘Verified Minds’ participant pool between 12 February and 11 April 2024. All participants were compensated for their time. Ethical approval was obtained from the Institution’s Ethics Board (Ref: 52251), and all participants provided consent to take part, which was recorded electronically using Testable.
Stimuli
The faces used were standardised using the open-source GNU Image Manipulation Programme (GIMP) software. All faces were cropped into an oval shape measuring around 8.5 cm (width) × 12cm (height). Faces depicted a neutral facial expression and were presented in a grey scale. To minimise variations across the dimensions other than those being examined (i.e. age and facial identity), all faces were male and Caucasian to maintain consistency for sex and race. Images that fit the criteria depicting faces in their 20s and their 50s were sourced through an extensive Google image search.
For Experiment 1, two models comprising famous faces (Matt Damon and Ewan McGregor) were used. These faces were chosen because both individuals are of similar age (born in 1970 and 1971, respectively) and began their film careers around the same period in the 1990s, having performed in a similar number of films. Therefore, the opportunity for exposure to both identities was roughly similar throughout their adult life. Nonetheless, to ensure that both identities are familiar to participants, a familiarity check was included. Pilot data from 20 male and female participants (Mean age = 38 years, SD = 10.14) were collected for the four images to establish perceived age for the older (McGregorMean = 44 years and DamonMean = 43 years) and younger faces (McGregorMean = 30 years and DamonMean = 32 years). The image pairs for the younger and older faces of the two identities were also compared for similarity using a 7-point Likert scale (1 = ‘not at all similar’ and 7 = ‘extremely similar’), and this revealed a low similarity score between Matt Damon and Ewan McGregor for both the older faces (M = 2.15, SD = 1.14) and younger faces (M = 2.35, SD = 1.39). Therefore, the images were considered sufficiently distinguishable.
Procedure
Participants completed a familiarity check by rating the faces (both older and younger versions) on their familiarity using a 5-point Likert scale. Only participants who provided a rating of 4 and 5 (highly familiar) were invited to complete the experiment; those who did not meet the criteria were screened out. Participants completed a familiarisation phase to ensure that they were clear of the identities (Model A vs. Model B) and the two age categories (younger vs. older). For this, there were 20 familiarisation trials to become acquainted with the four photographs and the appropriate response keys; thus, participants were shown each photograph 5 times and given 5 s to respond. If an accuracy of at least 90% was not achieved, the familiarisation trials were repeated until reaching a maximum of three attempts. Participants who did not pass the initial accuracy test by the third attempt were screened out.
Participants were randomly assigned to a task by the experiment software, and depending on the condition assigned to, participants were instructed to classify the faces as either young or old, or Matt or Ewan, using the keyboard. Participants were informed that speed and accuracy were both important and to respond using the ‘Z’ and ‘M’ keys on a standard keyboard. Arrangement of the left and right response keys was balanced between participants, and the order was randomly generated by the experimental software. Each cycle consisted of three blocks: the two Baseline blocks and one Filtering block. In the Baseline blocks, participants judged the faces on one dimension (facial identity or age), while the other dimension was held constant (either Face A or Face B). Therefore, the Baseline block consists of two parts, such that participants in the age classification group will judge age (young or old) for Model A and Model B separately. In the Filtering block, participants classified faces on one dimension while the other dimension varied. Therefore, participants in the age classification condition made age judgements for both Models A and B. Every participant completed three cycles. Within each cycle, the blocks were presented in random order. The primary task comprised a total of 576 trials. Each trial started with a fixation cross for 300 ms followed by a single face presented in the centre of the screen for 2,500 ms. A short break was provided every 48 trials (see the online Supplemental Material for further details).
Results
Data processing
Data were processed using R software (R Core Team, 2023) in the RStudio interface (Posit Team, 2024) using the R package ‘tidyverse’ (Wickham et al., 2019). Trials with a response time shorter than 150 ms were removed from the analysis. Incorrect trials and trials without a response were removed. The percentage of trials removed is reported in Tables 1 to 3. For the remaining trials, participants’ mean reaction time (RT) aggregated across all Baseline blocks was calculated, producing a single value representing mean Baseline RT. The same calculation was performed for the Filtering condition. For the analysis, a 2 (condition: Baseline, Filtering) × 2 (classification task: age, facial identity) mixed-factorial ANCOVA, with participant age included as a covariate, was performed with partial eta square reported for effect sizes. Any significant interactions were followed up with post hoc t-tests adjusted for multiple comparisons using Bonferroni Correction.
Mean RTs (in ms) and proportion of error (%) indicating average percentage of trials excluded for the Baseline and Filtering trials in Experiment 1 (Famous Faces).
Note. RT = reaction time; SD = standard deviation.
Mean RTs (in ms) and proportion of error (%) indicating the average percentage of trials excluded for the Baseline and Filtering trials in Experiment 2 (Unfamiliar Faces). SD = standard deviation.
Note. RT = reaction time; SD = standard deviation.
Mean RTs (in ms) and proportion of error (%) indicating the average percentage of trials excluded for the Baseline and Filtering trials in Experiment 3 (Ambient Famous Faces).
Note. RT = reaction time; SD = standard deviation.
This analysis was complemented with Bayes factors (BF), which can inform us about the null hypothesis, which is important for assessing independence, which will be operationalised as a null effect (Dienes, 2014). A BF10 of greater than 3 represents substantial support for the alternative hypothesis, and smaller than 0.3 represents substantial support for the null hypothesis, and a BF between 0.3 and 3 represents only weak evidence (Jeffreys, 1961; Wetzels et al., 2011). The traditional ANCOVA and Bayesian ANCOVA were both performed in JASP (version 0.18). The Analysis Protocol was detailed in the Registered Report and datasets generated and analysed during the current study are available in the Open Science Framework repository (URL: https://osf.io/85b2s/?view_only=d1da2126efff4932bd804124abe4277b).
Analysis
Within the Baseline conditions, the RTs for the two separate identities in the baseline condition of the facial identification task (Mdiff = 4 ms) and for the two ages in the baseline condition of the age classification task were comparable (Mdiff = 11 ms). Of primary interest are the RTs combined across identities and ages for the Filtering and Baseline conditions, which are summarised in Table 1. The 2 × 2 mixed factorial ANCOVA revealed that participant age was not a significant covariate, F (1, 52) = 1.11, p = .295, η p 2 = .021. There was no effect of the classification task, F (1, 52) = 2.57, p = .115, η p 2 = .05. A main effect of condition (Baseline vs. Filtering) was present, F (1, 52) = 29.38, p < .001, η p 2 = .36, such that response times in the Filtering condition were on average 62 ms slower than the Baseline condition, indicating the presence of Garner interference. The analysis also revealed an interaction between condition and classification task, F (1, 52) = 15.63, p < .001, η p 2 = .23.
The data were also subjected to a 2 × 2 Bayesian ANCOVA. This revealed moderate evidence in favour of the null model for participant age as a covariate (BF10 = 0.75). The analysis revealed anecdotal evidence for the effect of the classification task (BF10 = 1.36) and strong evidence for the interaction between classification and condition (BFinc = 15.89). Strong evidence for the effect of condition was found (BF10 = 4.61 × 109), which supports the presence of Garner Interference.
To unpack the interaction effect, four comparisons were performed. Post hoc t-tests were performed using an alpha level of .0125 (Bonferroni adjusted for four comparisons, 0.05/4 = 0.0125). When separated by the age and identity classification tasks, paired t-tests revealed an increase in response time (40 ms) from Baseline to Filtering in the age classification task, t(25) = 5.25, p < .001, d = 1.03 (BF10 = 1,181, very strong). A similar pattern was found for the identity classification task (81 ms), t(28) = 8.69, p < .001, d = 1.61 (BF10 = 6.052 × 106, very strong). This demonstrates the presence of GI in both the Age and Facial Identity tasks.
Independent samples t-tests comparing Baseline RTs for age and identity classification revealed that RTs for the Baseline condition were similar for both the age and identity classification tasks, t(53) = 0.97, p = .336, d = 0.26 (BF10 = 0.403; weak). Finally, the t-tests also demonstrated that the mean RTs for the Filtering trials were slightly lower for the identity classification task compared to the age classification task, but this did not reach the threshold for significance (a = .0125), t(53) = 2.54, p = .014, d = 0.26 (BF10 = 3.70; weak).
Discussion
Experiment 1 demonstrated that varying the age or identity of famous faces interfered with the classification of the other dimension, indicating the presence of Garner interference. Importantly, this interference was symmetric, supporting the notion that facial identity and age processing are integral dimensions. The relationship between age and facial identity mirrors patterns recorded between identity and other facial attributes such as sex or race (e.g. Fitousi & Wenger, 2013; Ganel & Goshen-Gottstein, 2002). Additionally, the similar mean RTs for the Baseline tasks suggest that the two dimensions have equal relative baseline discriminability (RBD). RBD has been considered as a possible explanation for inconsistencies in Garner interference patterns across other studies (Schweinberger & Soukpu, 1998; Schweinberger et al., 1999). By matching RBD in this experiment, the likelihood of interference stemming from Baseline differences was reduced. Experiment 1 focussed on familiar faces using famous faces as stimuli. However, there are known differences in the processing of familiar and unfamiliar faces (see Burton, 2013; Johnston & Edmonds, 2009). Specifically, familiar face recognition benefits from an advantage in accuracy and speed in comparison to unfamiliar face recognition, which is thought to rely more heavily on a less sophisticated picture-based strategy. Therefore, to determine whether the findings from Experiment 1 extend to unfamiliar face processing, a second experiment was conducted.
Experiment 2
The aim of Experiment 2 was to examine whether Garner interference recorded for famous faces extends to unfamiliar face processing. In Experiment 2, the design was identical, but unfamiliar faces were used. Before completing the main task, participants learned the unfamiliar faces via a practice task, and therefore, these faces became learned unfamiliar faces. There are two key differences between famous and unfamiliar faces that may influence the speed of classification in this study. First, the use of internal facial features becomes more important for highly familiar faces, whereas external features are more relied upon for unfamiliar face identification (Ellis et al., 1979; Kramer et al., 2018; Young et al., 1985). In this experiment, the faces were cropped to exclude external facial features, such as hair, which may result in a disadvantage for unfamiliar face classification. Second, drawing on the IAC model (Burton et al., 1999), highly familiar faces hold extra semantic information in the Personal Information Unit and Semantic Information Unit. This extra information may be accessed and may serve to facilitate the classification of age and identity. Indeed, previous studies have found a familiarity advantage, with faster response times for sex classifications of familiar faces compared to unfamiliar faces (Edmonds et al., 2013; Richards & Ellis, 2009). Consequently, it is expected that overall slower response times for the classification of age and identity for unfamiliar faces will be revealed.
However, predicting differences in the presence or absence of Garner interference for highly familiar and unfamiliar faces is more complex. Most studies examining familiarity effects within the Garner paradigm focus on emotional expression as one dimension (e.g. Ganel & Goshen-Gottstein, 2002, 2004; Kaufmann & Schweinberger, 2004). Emotional expression is theoretically distinct from other non-identity face aspects, such as sex and race, due to its complex interconnected affective neural networks (e.g. Bruce & Young, 1986; Haxby et al., 2000). Nonetheless, familiarity has been shown to affect the presence and strength of Garner interference in studies examining emotional expression and identity (Ganel & Goshen-Gottstein, 2002, 2004; Kaufmann & Schweinberger, 2004), with stronger interference typically found for familiar faces. Therefore, it is possible that the interference between identity and age for unfamiliar faces is smaller or even absent.
Participants
In total, 72 participants took part in the experiment, and 14 were removed because they did not meet the criteria for ethnicity. The error rates of remaining participants fell below 40% and therefore no additional participants were removed. This resulted in 58 remaining participants, 30 (11 male, 1 non-binary, Mage = 35, SDage = 14.75, range = 18–71) completed the ‘Age Classification’ task, and 28 (11 male, 1 non-binary, Mage = 33, SDage = 11.90, range = 19–59) completed the ‘Facial Identification’ task. Participants who did not pass the initial prescreen or practice trials, or who did not see the task to completion, have not been reported here. Participants were recruited on Testable using the ‘Verified Minds’ participant pool between 12 February and 11 April 2024 and had not participated in the previous experiment.
Stimuli and procedure
For Experiment 2, two unfamiliar models were used. However, there is no existing database for images of unfamiliar faces taken decades apart to represent the young and old conditions required for this study. Therefore, Caucasian celebrities known outside of the United Kingdom were used instead. Several images sourced from an extensive Google search were piloted for perceived age, and the two identities with ages most comparable to Experiment 2 were two European celebrities, Fabrizio Faniello (a Maltese singer) and Johnny de Mol (a Dutch actor). Pilot data from 20 male and female participants were collected for the four images to establish perceived age for the older (FanielloMean = 38 and de MolMean = 42 years) and younger faces (FanielloMean = 28 and de MolMean = 32 years). The image pairs for the young and older faces of the two identities were also compared for similarity using a 7-point Likert scale (1 = ‘not at all similar’ and 7 = ‘extremely similar’), and this revealed a low similarity score between Fabrizio Faniello and Johnny de Mol for both the older faces (M = 2.3, SD = 1.08) and younger faces (M = 1.9, SD = 1.12). Therefore, the images were considered sufficiently distinguishable.
A familiarity check was performed before commencing with the experiment to ensure that participants were not familiar with the faces at the time of the experiment. Before commencing the study, participants completed a familiarity check on a pre-screen by rating the faces (both older and younger versions) on their familiarity using a 5-point Likert scale. Only participants who provided a rating of 0 (not at all familiar) were invited to complete the experiment. The rest of the procedure was identical to Experiment 1, and to keep the instructions also identical, the unfamiliar models were assigned the false names ‘Matt’ and ‘Ewan’.
Results
For the separate stimuli within the Baseline condition, the RTs for the two separate identities of the facial identification task (Mdiff = 4 ms) were comparable, though slightly larger for the two ages in the Baseline condition of the age classification task (Mdiff = 21 ms). The main analysis concerns the data for the RTs combined across identities and ages for the Filtering and Baseline conditions, and these data are summarised in Table 2. A 2 (condition: Baseline, Filtering) × 2 (classification task: age, facial identity) mixed-factorial ANCOVA, with participant age included as a covariate, was performed. The analysis found a significant effect for the covariate of age, F (1, 55) = 12.33, p < .001, η p 2 = .183. Therefore, with age accounted for, the analysis found no effect of classification task on response time, F (1, 55) = 0.037, p = .848, η p 2 = .0006. A main effect of condition was found, F (1, 55) = 38.44, p < .001, η p 2 = .411, such that RTs were on average 96 ms slower in the Filtering condition compared to Baseline. This means that Garner interference is present for both tasks. There was no significant interaction between condition and classification task, F (1, 55) = 3.85, p = .055, η p 2 = .065.
The data were also subjected to a 2 × 2 Bayesian ANCOVA. This revealed strong evidence for the effect of participant age as a covariate (BF10 = 38.97). Once participant age is accounted for, the analysis revealed weak evidence for the effect of classification task (BF10 = 0.42) and for the interaction between classification and condition (BFinc = 1.14). Strong evidence for the effect of condition was found (BF10 = 7.142 × 1017), which supports the presence of Garner interference.
Additional analyses: familiarity comparison
To explore whether there were any differences in RTs for familiar and unfamiliar faces, separate 2 (familiarity: familiar, unfamiliar) × 2 (condition: Baseline, Filtering) frequentist and Bayesian ANCOVAs (participant age as a covariate) were performed for the identity and age classification tasks. This analysis is exploratory and was not specified in the Registered Report.
For the identity classification task, this analysis revealed a main effect of condition, F (1, 54) = 39.98, p < .001, η p 2 = .43, and as expected, the RTs in the Filtering condition were overall slower than the Baseline condition. There was no main effect of familiarity, F (1, 54) = 0.04, p = .844, η p 2 = .0007, and no interaction effect, F (1, 54) = 2.92, p = .093, η p 2 = .051. The covariate of participant age was also not significant, F (1, 54) = 0.89, p = .350, η p 2 = .016. The Bayesian equivalent analysis further supports this pattern, demonstrating strong evidence for condition (BF10 = 1.38 × 1016) and anecdotal evidence for the interaction (BFinc = 1.65). In addition, BF revealed anecdotal evidence in favour of the null hypothesis for familiarity (BF10 = 0.31).
For the age classification task, this analysis revealed a main effect of condition, F (1, 53) = 21.82, p < .001, η p 2 = 0.29, and no main effect of familiarity, F (1, 53) = 2.71, p = .106, η p 2 = .049. In contrast to the identity task, an interaction between condition and familiarity was found, F (1, 53) = 15.28, p < .001, η p 2 = .22. To understand the interaction, independent t-tests comparing familiar and unfamiliar faces for Baseline and Filtering conditions were performed. These revealed faster RTs for the age classification of familiar faces (594 ms) compared to the unfamiliar faces (672 ms) in the Filtering condition, t(54) = 2.21, p = .032, d = 0.59, but the difference (554 ms vs. 588 ms, respectively) was not significant for the Baseline condition, t(54) = 1.04, p = .302, d = 0.23. The Bayesian equivalent analysis further supports this pattern, demonstrating extreme evidence for condition (BF10 = 9.61 × 1010) and strong evidence for the interaction (BFinc = 67.21). In addition, BF revealed anecdotal evidence for familiarity (BF10 = 1.11). Furthermore, BF for the independent samples t-tests revealed anecdotal evidence in favour of the null hypothesis for the differences in RTs for the Baseline condition (BF10 = 0.42), and anecdotal evidence in favour of the alternative hypothesis for the Filtering condition (BF10 = 1.96).
Discussion
Taken together, these findings indicate that Garner interference is present and symmetric, and they provide support for the notion that facial identity and age processing are integral dimensions for both familiar and unfamiliar faces. The findings also show that the two dimensions were equally discriminable, as evidenced by similar overall response times (RTs) in the Baseline tasks. RTs were compared directly for the familiar task in Experiment 1 and the unfamiliar task in Experiment 2. For the identity classification task, contrary to the prediction that familiarity would reveal an advantage in identity classification due to the more proficient use of internal facial cues and additional semantic information, responses were not significantly faster than for unfamiliar faces. However, an interaction was found for the age classification task. Specifically, in the Filtering condition, the ages of familiar faces were categorised faster than unfamiliar faces. Thus, although familiarity did not affect the presence of Garner interference or the classification of identity, there was some detectable slowing down of age classification when unfamiliar face identities varied. It may be that the stored internal representation of familiar faces facilitates the processing of age to some extent by allowing participants to disregard identity more rapidly and selectively attend to age. However, it should be noted that Bayes Factor analysis only found anecdotal evidence for this effect, and there was also no strong evidence in favour of the null hypothesis (i.e. that familiarity does not affect the age of identity classification in the Garner paradigm). Importantly, however, both highly familiar and unfamiliar faces support the shared route hypothesis in relation to facial identity processing. Theoretical implications are discussed in more detail in the General Discussion.
Experiment 3
In recent years, researchers have argued against generalising experimental results from highly artificial and tightly constrained stimuli, for instance, images with deliberate removal or manipulation of facial features of faces or other low-level image characteristics (see Burton, 2013). Furthermore, it is difficult to capture a true representation of a real person within a single image, given the large within-person variability in photos of the same face (Burton, 2013; Jenkins et al., 2011). To address this, Experiment 3 used an adaptation of the Garner speeded classification task to investigate whether effects are present with stimuli more likely to be encountered in daily life, specifically naturalistic images of faces. Additionally, multiple images, as opposed to a single image, representing each identity were used.
Participants
In total, 66 participants completed the experiment, and 17 were removed because they did not meet the criteria for ethnicity. Error rates of remaining participants fell below 40% and therefore no additional participants were removed. This resulted in 49 remaining participants, 24 (12 male, Mage = 36, SDage = 9.14, range = 20–58) completed the ‘Age Classification’ task, and 25 (14 male, Mage = 38, SDage 9.7, range = 23–51) completed the‘Facial Identification’task. Participants who did not pass the initial prescreen or practice trials, or who did not see the task to completion, have not been reported here. Participants were recruited on Testable using the ‘Verified Minds’ participant pool between 11 April and 1 July 2024, and were independent of the participants in the previous two experiments.
Stimuli and procedure
For Experiment 3, 20 ambient images for each of two famous identities from Experiment 1 were used, with equal numbers (10 each) depicting younger faces (under 30) and older faces (over 40). The images of faces were sourced from a Google search. Pilot data from 21 male and female participants were collected for all 40 images to establish perceived age for the older (McGregorMean = 40 and DamonMean = 43 years) and younger sets of faces (McGregorMean = 25 and DamonMean = 23 years). The images were adjusted to be of the same size, and although the viewpoint and angle varied, all internal features were clearly visible. The background and face, as well as external features, were included. No other manipulations to the faces were made. Only participants who provided a rating of 4 and 5 (highly familiar) were invited to complete the experiment. The rest of the procedure was identical to the previous experiments, including the random presentation of the order of the images.
Results
Within the Baseline conditions, the RTs for the two separate identities in the baseline condition of the facial identification task (Mdiff = 28 ms) were similar but larger than the previous two experiments, and comparable for the age classification task (Mdiff = 13 ms). Data of main interest for the RTs combined across identities and ages for the Filtering and Baseline conditions are summarised in Table 3. The 2 × 2 mixed factorial ANCOVA revealed that participant age was a significant covariate, F (1, 46) = 4.40, p = .04, η p 2 = .009. With age accounted for, there was no main effect of condition (i.e. Filtering vs. Baseline), F (1, 46) = 1.20, p = .16, η p 2 = .042, indicating an absence of the Garner interference. There was also no interaction effect, F (1, 46) = 1.20, p = .16,η p 2 = .042. The analysis found a main effect of classification task, F (1, 46) = 4.80, p = .033, η p 2 = .095, such that response times on the age classification task were slightly slower than the identification task.
The equivalent Bayesian analysis demonstrates weak evidence for the covariate of age (BF10 = 1.14). However, in contrast to the traditional ANCOVA, the analysis revealed strong evidence for the effect of condition (i.e. Filtering vs. Baseline; BF10 = 4,505), indicating a slowing down of response time for Filtering trials, providing evidence for the presence of Garner interference. There was weak evidence for the effect of the classification task (BF10 = 0.94) and for the interaction effect (BF10 = 1.51).
Discussion
Experiment 3 replicated the previous experiments with a few adjustments. First, naturalistic images of famous faces were used, and second, there were multiple images representing each identity and age as opposed to a single image of each. This was done to improve the generalisability of the findings using images of faces more likely to be encountered in daily life (Burton, 2013). As with the previous experiments, results show that the two dimensions had equal relative baseline discriminability since the overall RT for the Baseline tasks was similar.
A small Garner interference was present, but the effect was weaker than in the previous experiments, although the frequentist analysis did not detect a significant effect, BF revealed strong evidence of Garner interference for both the Age and ID classification tasks. On visual inspection of the descriptive data, the error rate is higher, and Baseline RTs are slower when compared to the previous experiments. This indicates that categorising highly variable faces either by age or identity required more cognitive resources or a later stage of processing compared to the more highly controlled stimuli in the earlier experiments. This may be a result of processing additional visual information, such as colour, skin tone and external features, which were not present in the cropped greyscale face stimuli used in Experiments 1 and 2. However, adding the second irrelevant dimension only resulted in a modest slowing down. These results are consistent with Burton’s (2013) review on the limitations of generalising experimental results from highly artificial face stimuli to inform us about the normal face recognition system. The implications of these results for the parallel or shared route hypothesis are discussed in the next section.
General discussion
Empirical work to date has examined whether facial identity and other aspects of the face, specifically, emotional expression, sex and race, are processed through a single route or in parallel to each other (e.g. Craig & Lipp, 2023; Ganel & Goshen-Gottstein, 2002; Wang et al., 2013). However, few studies have considered the interconnectedness of facial age with identity and non-identity aspects of the face (Alonso-Prieto et al., 2015; Fitousi, 2020). For the first time, the Garner Speeded Classification Task was employed to examine the relationship between facial identity processing and facial age perception. Using the standard Garner task, Experiments 1 and 2 revealed symmetric Garner interference, providing support for the shared route hypothesis. This was the case for both familiar and unfamiliar faces. Experiment 3 used an adaptation of the Garner task by including multiple naturalistic images of two familiar identities as opposed to cropped single images per identity. In contrast to the first two experiments, only a weak Garner interference was present.
Shared or parallel route for identity and age perception?
Experiments 1 and 2 recorded Garner interference for both the ‘age judgement’ and the ‘identification judgement’ condition. This is illustrated by a slowing down of response times when classifying a face across one dimension (e.g. age for the ‘age judgement’ condition) while the task-irrelevant dimension varies (e.g. identity for the ‘age judgement’ condition) (Filtering condition), as compared to when the task-irrelevant dimension is kept constant (Baseline condition). In accordance with the theoretical principles of the Garner Speeded Classification Task (Algom & Fitousi, 2016; Garner, 1976), the presence of Garner interference demonstrates a failure to selectively attend to a single stimulus dimension while ignoring the second. A difficulty in selectively attending to one dimension suggests that age and facial identity are integral dimensions and thus, processed along a shared route. Furthermore, the influence of the age and identity dimensions was bidirectional, i.e., age interfered with the speed of facial identification in the facial identification condition and vice versa. These findings support the hypothesis that age and identity are processed along a shared route.
These findings are similar to behavioural data from a small set of studies using the Garner Speeded Classification Task, demonstrating that facial identity and another aspect of the face, sex, are also processed via a shared route (e.g. Ganel & Goshen-Gottstein, 2002; Ganel et al., 2002). However, intriguingly, age and sex have been found to operate in parallel to each other (see Fitousi, 2020). This suggests that although there is evidence that age and sex are processed independently, they are both individually integrated with facial identity. The same applies to the relationship between age, race and facial identity. Specifically, there is evidence for the parallel-route hypothesis between race and age (Fitousi, 2020; Alonso-Prieto et al., 2015) and race and sex (Fitousi, 2020), but support for a shared route between race and identity (Bruyer et al., 2004).
By contrast, findings on emotional expression and identity (Fitousi & Wenger, 2013; Schweinberger & Soukup, 1998; Wang et al., 2013), and age, sex and race (see Atkinson et al., 2005; Karnadewi & Lipp, 2011; Le Gal & Bruce, 2002) point towards partial asymmetric crosstalk indicated by asymmetric Garner interference. Specifically, in the studies where asymmetric interference is observed, emotional expression classification is influenced by all four of the above face processes (Baudouin et al., 2002; Krebs et al., 2011; Schweinberger & Soukup, 1998; Wang et al., 2013), but there is no published record of asymmetry occurring in the opposite direction. Additionally, a smaller set of experiments record an absence of interference altogether (Craig & Lipp, 2023; Le Gal & Bruce, 2002) or symmetric interference (Ganel & Goshen-Gottstein, 2004; Fitousi & Wenger, 2013).
Variations in methodology may account for the discrepant findings in previous work. For instance, a mismatch in RBD (i.e. similarity of response times and accuracy in the Baseline conditions for the two single dimensions; Melara & Mounts, 1993) has been found to produce asymmetric Garner interference in some studies (Wang et al., 2013) but not all (Schweinberger & Soukup, 1998; Schweinberger et al., 1999). Across all three experiments in the present study, RTs and accuracy were similar across both age and identity dimensions in the Baseline conditions, which indicates that RBD was matched. Therefore, the possibility of obtaining asymmetric interference because of mismatched RBD was reduced. Directly manipulating Baseline discriminability would be an important avenue for investigation to help shed light on its direct influence on the symmetry of Garner interference.
Although the evidence points towards the shared route hypothesis for age and identity perception, there is, however, an important caveat to consider. Experiment 3, using unmodified naturalistic images of faces, found a weaker effect of Garner interference. This finding is consistent with Burton’s (2013) proposal that conclusions drawn from experiments using faces stripped of their natural characteristics should not be extended to our natural face recognition system. A similar discrepancy between artificially manipulated face stimuli and naturalistic faces has been recorded in studies examining the independence of sex and identity. In these studies, a pattern of results supporting the shared route hypothesis was only found for cropped faces (Goshen-Gottstein & Ganel, 2000), and not for intact faces (Bruce et al., 1987).
However, there is also evidence to the contrary. Ganel et al. (2002) experimentally manipulated intact and cropped images for the sex-identity dimensions in a Garner paradigm across four experiments and found Garner effects indicative of a shared route when both faces removed of hair, and intact faces were used. Interestingly, and similar to the findings from Experiment 3, Garner interference was absent when faces were both intact and multiple faces were used, in contrast to the single-cropped face paradigm. Garner interference has previously been recorded when using multiple images (e.g. Ganel & Goshen-Gottstein, 2002); thus, it appears that there may be a potential interactive effect of using multiple images and intact ambient faces which weakens Garner interference. The reason for this is unclear. However, one potential explanation is that single-cropped image paradigms may encourage the use of picture-based heuristics, relying more on low-level image processing. This could account for the lack of differences in Garner interference observed with familiar and famous faces in Experiments 1 and 2, as the task may involve earlier visual processing stages rather than the deeper identity recognition or age perception. By contrast, using multiple varied and natural images might discourage low-level image processing, possibly requiring engagement with later processing stages, where age and identity could be processed more independently. This would suggest that findings from single-cropped image face stimuli may not represent our deeper face recognition system. However, there is an alternative consideration. If the single-cropped image approach does encourage low-level picture-based strategy, then one might also predict that this would result in no Garner interference at all, which is not the case in Experiments 1 and 2.
Taken together, the findings from this study suggest that age and identity are processed along a shared route, as evidenced by symmetric Garner interference between the two dimensions. However, it is important to acknowledge the weaker Garner effect recorded for ambient face images and its potential implications. Different experimental designs, such as single versus multiple images, naturalistic versus modified, familiar versus unfamiliar and the specific combination of dimensions being investigated across multiple studies (e.g. Fitousi, 2020; Schweinberger & Soukup, 1998; Ganel et al., 2002), add complexity to the interpretation of these differences. Therefore, further systematic investigation is needed to fully clarify the conditions under which facial attributes are processed independently or along shared routes, potentially challenging the dichotomous perspective that these two processes are either separate or integrated.
Implications for face perception models
While acknowledging the complexities discussed in the previous section, the overall findings are in favour of the shared route hypothesis and add potentially new insights to existing theoretical models. Consider first the traditional Bruce and Young’s (1986) theoretical model of face processing. Within this framework, age judgements are considered one of several visually derived semantic codes, information which can be inferred with relative accuracy from visual cues alone and are independent by-products of face recognition, but not essential for face recognition. Thus, this model proposed a parallel route between visually derived semantic codes (sex, race and age) and facial identity recognition. Contrary to Bruce and Young’s (1986) early model, sex, race and age share a route with facial identification as evidenced by the symmetric interference recorded in the current study and other Garner speeded classification tasks (Ganel & Goshen-Gottstein, 2002; Ganel et al., 2002). Additionally, findings from this work and previous behavioural experiments suggest that these non-identity aspects of the face operate independently of each other (e.g. Alonso-Prieto et al., 2015; Fitousi, 2020). Bruce and Young (1986) also proposed that there was an expression code which was important for emotional expression recognition but not important for facial identification. In fact, the inconsistent partial interaction recorded between emotional expression classification and facial identity, age, race and sex supports the notion that emotional expression recognition is distinct from other non-identity aspects of the face (i.e. age, sex and race).
Haxby et al.’s (2000) neurocognitive model of face perception expanded on Bruce and Young’s (1986) model and separated these aspects of the face into two categories. The core system, which involves the processing of invariant aspects of the face (such as sex, race and facial identity), and the extended system, which processes changeable facial information (such as emotional expression and facial speech patterns), for which the neural networks are independent. The extended system also includes personal knowledge, such as biographical knowledge, episodic memories and personality traits. Haxby et al. (2000) emphasise the distinction between the invariant aspects of the faces which underlie recognition of identity, and changeable aspects of the face which facilitate social communication but not identity recognition. Facial age is not explicitly considered in Haxby’s model, leaving the question of whether age is to be considered as part of the core system or extended system unresolved. In this study, age interacts with facial identity in a similar way to known invariant facial aspects (e.g. sex), compared with changeable aspects (e.g. emotional expression), thus lending support to the possibility that age also forms part of the core system.
Furthermore, face familiarity did not influence the presence of Garner interference in these experiments, but familiarity has been found to influence Garner interference for emotional expression (Ganel et al., 2002) and speech pattern perception (Schweinberger & Soukup, 1998). In Haxby’s model (Haxby & Gobbini, 2010), the extended system comprises emotional expression and speech pattern, as well as aspects typically associated with facial familiarity, such as person knowledge. Therefore, emotional expression and person knowledge overlap within this extended system, which may explain why face familiarity affects Garner interference, given that top–down semantic knowledge is present for familiar and not unfamiliar faces. By contrast, if age processing forms part of the core system, separate from the extended system where person knowledge resides, age processing is unaffected by face familiarity.
However, few studies have examined the effect of face familiarity directly with facial aspects other than speech patterns and emotional expression; therefore, further research is needed for a clearer pattern to emerge and theoretical conclusions to be drawn. Additionally, although Haxby groups these facial characteristics into two systems (i.e. core and extended), how the different aspects of the face interact within these groups is not clear. The independence between sex, age and race, and the interconnectedness of all three with facial identity, may point towards further nuance in the delineation between individual processes within the core system. However, this notion also requires further investigation.
Future directions
Despite the implications for a shared route between age and facial identity perception, there are additional questions to consider. First, in this study, age was measured using a dichotomous ‘young’ and ‘old’ classification to fit within the standard design of the Garner speeded classification paradigm. However, age perception can be measured using different approaches, such as numeric estimation by providing a precise estimate of the age, classification of faces as over or under the age of 18, and ordering and ranking faces by age. However, a recent study shows that different measurement approaches tap into different cognitive processes, and some processes may share more underlying cognitive mechanisms with facial identity than others (Attard-Johnson et al., 2024). This may explain discrepancies in studies relating to the behavioural interaction between age perception and facial identification. For instance, individuals with developmental prosopagnosia have been found to have unimpaired age processing, pointing towards a parallel-route (see Bennetts et al., 2024; Chatterjee & Nakayama, 2012; Nunn et al., 2001), while other studies have found an interaction between age and face processing in neurotypical participants (see Dagovitch & Ganel, 2010; Robertson & Burton, 2021). These studies varied substantially in the methodological approach for measuring age perception. Therefore, although the simple age classification task described in this study offers evidence towards the shared route hypothesis, it is possible that there are nuances in this relationship based on the methodological approach used to measure age perception, and further research is needed to tease this apart. Furthermore, although the actual chronological age difference between the young and older categories was between 20 and 30 years, the age gap based on perceived ages was around 10 years. It may be informative to also compare identities across wider perceived age gaps, such as childhood and adulthood, revealing more drastic effects of ageing on facial identity. However, such a manipulation may affect RBD as with a wider gap, the age judgement classification task may become easier relative to the identity judgement task.
Second, RBD was matched across all three experiments, and it is not known whether a mismatch in discriminability along one of the stimulus dimensions would have replicated the asymmetric results obtained in some other face processing Garner studies (e.g. Schweinberger & Soukpu, 1998; Schweinberger et al., 1999; Wang et al., 2013). Though this may be a general limitation of the Garner speeded classification task, potentially explaining some inconsistencies in symmetry of results, previous work having manipulated RBD directly has demonstrated that RBD cannot explain all cases of asymmetry (Schweinberger & Soukpu, 1998; Schweinberger et al., 1999). Future work could consider directly manipulating RBD for the identity–age relationship, too.
Experiments 1 and 2 utilised artificially cropped images of the identities to minimise the possibility of non-face cues facilitating categorisation by encouraging picture-based strategies of classification (Goshen-Gottstein & Ganel, 2000). However, the constrained nature of the cropped stimuli is different from how we would encounter faces in everyday life (see Burton, 2013), and therefore may limit the generalisability of the findings in these experiments to normal face recognition. For this reason, a third experiment, which was an adaptation of the Garner speeded classification paradigm, was implemented. In Experiment 3, multiple ambient images of the identities were used. Although the same pattern emerged, there was a discrepancy between the frequentist and Baysian (BF) analysis. Specifically, strong evidence for the alternative hypothesis was found using BF, but the frequentist analysis was non-significant. The reason for this discrepancy is unclear; however, BF is more robust and reliable with smaller sample sizes (e.g. Hox et al., 2012; Lee & Song, 2004; van de Schoot et al., 2014). Furthermore, although the naturalistic images were used only for familiar faces, there is merit to applying this methodology to unfamiliar faces in the future. Unfamiliar face recognition is more vulnerable to the effects of naturalistic images and within-person variability than familiar face recognition (Burton, 2013); therefore, the presence and strength of Garner interference may be affected for naturalistic images.
Finally, Garner Interference is a gold standard measure of dimensional interaction and selective attention; however, the precise mechanisms underlying the Garner Interference are not yet fully understood. It is beyond the scope of this paper to theorise about the underlying mechanisms of the paradigm itself. Recent novel accounts propose a binding account of facial dimensions (Fitousi, 2017a,b) and offer an alternative mechanism of binding explaining the Garner Interference (Fitousi, 2023). Specifically, the proposition is that facial attributes (e.g. identity, age, sex, emotion) interact in a binding process created via the concept of a ‘face file’, and that this binding can occur within- and across-independent processing routes, thus resulting in patterns that do not fully conform to the dual-route model. Additionally, sequential binding across consecutive trials can predict the magnitude of Garner interference, suggesting that the strength of feature integration influences the strength of interference recorded. For a detailed explanation of this account, see Fitousi (2017a,b; 2023). Future research may consider applying alternative and potentially converging methodologies to the examination of identity and age. One such example is the conjoint measurement approach, which has been used to study the effect of ageing of faces on perceived gender (Fitousi, 2021), finding that observers integrate gender and age information.
Conclusion
Using the Garner speeded classification task, the current study provides evidence for a shared route between facial age and identity processing. Considering the body of work using Garner interference, it seems that emotional expression relates more inconsistently with facial identity, age, sex and race. While studies examining the identity–age, identity–sex and identity–race interconnectedness thus far appear to support the shared route hypothesis. By contrast, studies examining the interconnectedness of age–sex, age–race, sex–race, age–race suggest that these processes occur independently via a parallel route. However, the number of studies examining emotional expression far outnumbers other non-identity aspects of the face. Therefore, further research is needed to build a clearer picture of the interconnectedness between the different non-identity aspects of the face and facial identity. Furthermore, differences emerged between highly controlled and naturalistic face stimuli, which highlight the complexities with interpretations across studies employing different methodologies.
Supplemental Material
sj-docx-1-qjp-10.1177_17470218251360238 – Supplemental material for Examining independence of facial identity and age processing using the Garner Speeded Classification Paradigm
Supplemental material, sj-docx-1-qjp-10.1177_17470218251360238 for Examining independence of facial identity and age processing using the Garner Speeded Classification Paradigm by Janice Attard-Johnson in Quarterly Journal of Experimental Psychology
Footnotes
Declaration of conflicting interests
The author declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author received no financial support for the research, authorship, and/or publication of this article.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
