Abstract
Blindsight and other examples of unconscious knowledge and perception demonstrate dissociations between judgment accuracy and metacognition: Studies reveal that participants’ judgment accuracy can be above chance while their confidence ratings fail to discriminate right from wrong answers. Here, we demonstrated the opposite dissociation: a reliable relationship between confidence and judgment accuracy (demonstrating metacognition) despite judgment accuracy being no better than chance. We evaluated the judgments of 450 participants who completed an artificial-grammar-learning (AGL) task. For each trial, participants decided whether a stimulus conformed to a given set of rules and rated their confidence in that judgment. Using a subset of each participant’s responses, we identified participants who performed at chance on the discrimination task and then assessed the accuracy and the confidence-accuracy relationship of their remaining responses. Analyses revealed above-chance metacognition among participants who did not exhibit decision accuracy. This important new phenomenon, which we term
The phenomenon of blindsight (Weiskrantz, Warrington, Sanders, & Marshall, 1974) has had a powerful influence on the development of psychology and neuroscience because it challenges the intuition that metacognitive awareness must necessarily accompany discriminative accuracy. Studies of blindsight, which may be exhibited following damage to the primary visual cortex, demonstrate that substantial decision accuracy (e.g., discriminating between visual stimuli) can occur in the absence of metacognitive insight into that ability; blindsight patients classically report being blind to the stimuli that they so deftly categorize. In this article, we introduce a related phenomenon that has the potential to similarly transform psychology’s understanding of metacognition and its relationship to the distinction between conscious and unconscious processing. We term this phenomenon blind insight.
Metacognition, and in particular the ability to assess the accuracy of knowledge states, is fundamental to understanding executive processes (e.g., Koriat, 2007), the nature of memory (e.g., Mazzoni, Scoboria, & Harvey, 2010), good educational practice (e.g., Koriat, 2012), gambling (Lueddeke & Higham, 2011), development (Beck, McColgan, Robinson, & Rowley, 2011), cognitive differences between species (Smith, Beran, Couchman, Coutinho, & Boomer, 2009), social interaction (Frith, 2012), mental illness (Hamm et al., 2012), and the distinction between conscious and unconscious processes in both perception (Kanai, Walsh, & Tseng, 2010) and learning (Dienes & Seth, 2010).
Given the importance of metacognition to such a wide variety of research endeavors, there has been strong motivation both to refine its accurate, bias-free measurement and to elucidate the underlying cognitive architecture. Signal detection theory (SDT) provides a useful method to measure stimulus-discrimination accuracy independently of response bias (Lau, 2008; Lau & Passingham, 2006; Macmillan & Creelman, 2005) and has been widely adopted and extended for the assessment of metacognition (Barrett, Dienes, & Seth, 2013; Galvin, Podd, Drga, & Whitmore, 2003; Ko & Lau, 2012; Maniscalco & Lau, 2012; Rounis, Maniscalco, Rothwell, Passingham, & Lau, 2010; see Fig. 1). The measure of sensitivity provided by SDT is generally termed Type I

Schematic illustrating general principles of signal detection theory. The dashed curve shows the signal distribution when the stimulus is absent (or ungrammatical, new, etc.), and the solid curve shows the distribution when the stimulus is present (or grammatical, old, etc.). The index
In its classical form, SDT offers a hierarchical framework whereby information available to the metacognitive judgment derives from the same signal exploited by the first-order discriminative process. Indeed, it can be theoretically demonstrated that an SDT framework (with some straightforward assumptions) cannot give rise to metacognitive insight in the absence of decision accuracy (Barrett et al., 2013). Although an arrangement in which confidence in a judgment derives from the strength of the signal driving the first-order decision has intuitive appeal, a purely bottom-up hierarchical configuration is at odds with both neuroanatomical and neurophysiological evidence. A growing body of data indicates that both bottom-up (feed-forward) and top-down (feedback, recurrent) connections and processing make crucial contributions to perception, with the latter being particularly vital to attentional grouping and awareness (Bowman, Schlaghecken, & Eimer, 2006; Jaskowski & Verleger, 2007; Salin & Bullier, 1995).
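The claim that a classical SDT observer cannot show metacognitive insight without decision accuracy can be illustrated with a small simulation. The sketch below is not the authors' model. It assumes an equal-variance Gaussian signal, an unbiased criterion, and a hypothetical rule under which a judgment counts as confident when the signal lies more than `conf_threshold` from the criterion. The function name and all parameter values are illustrative.

```python
import random


def simulate_sdt(d_prime, n_trials=20000, conf_threshold=0.5, seed=0):
    """Simulate an equal-variance SDT observer whose confidence reads the
    same signal as the first-order decision.

    Returns (overall accuracy, accuracy when confident, accuracy when guessing).
    All parameter values are illustrative assumptions.
    """
    rng = random.Random(seed)
    criterion = d_prime / 2.0  # unbiased Type I criterion (assumption)
    tallies = {True: [0, 0], False: [0, 0]}  # confident? -> [n correct, n trials]
    for _ in range(n_trials):
        grammatical = rng.random() < 0.5
        signal = rng.gauss(d_prime if grammatical else 0.0, 1.0)
        says_grammatical = signal > criterion
        correct = says_grammatical == grammatical
        # Hypothetical confidence rule: distance of the signal from the criterion.
        confident = abs(signal - criterion) > conf_threshold
        tallies[confident][0] += int(correct)
        tallies[confident][1] += 1
    n_correct = tallies[True][0] + tallies[False][0]
    acc_confident = tallies[True][0] / tallies[True][1]
    acc_guess = tallies[False][0] / tallies[False][1]
    return n_correct / n_trials, acc_confident, acc_guess


if __name__ == "__main__":
    for d in (0.0, 1.0):
        overall, conf, guess = simulate_sdt(d)
        print(f"d'={d}: overall={overall:.3f}, confident={conf:.3f}, guess={guess:.3f}")
```

Under these assumptions, a sensitivity of zero leaves confident and guess judgments equally close to chance, whereas a positive sensitivity makes confident judgments more accurate than guesses. A pattern in which confidence discriminates correct from incorrect judgments while overall accuracy sits at chance does not arise in this simple setup.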
Research exploring individual differences in metacognition is similarly suggestive of interactions between low-level sensory decisions and metacognitive processes. Fleming, Weil, Nagy, Dolan, and Rees (2010) demonstrated that individual differences in metacognitive performance on a perceptual decision task were correlated with gray-matter volume in the anterior prefrontal cortex and white-matter microstructure connected with this region. Crucially, the anterior prefrontal cortex receives input from higher-order cortical regions rather than from early sensory regions, which is consistent with a role in metacognitive judgment rather than in simple perceptual decisions. In contrast, other decision-making-related regions (e.g., posterior parietal cortex) receive inputs from early sensory regions and have been shown to support the primary perceptual decision (Kiani & Shadlen, 2009). Other researchers have demonstrated a dissociation between reaction times and confidence that is also at odds with typical models of how confidence arises (Wilimzig, Tsuchiya, Fahle, Einhaeuser, & Koch, 2008).
Although the application of SDT to metacognition enjoys increasing popularity, it is by no means the only approach to modeling the confidence-accuracy relationship. The metamemory literature offers a range of theoretical approaches based on concepts such as cue utilization (Koriat, 2007). In cue utilization, factors as diverse as fluency and brightness have been shown to influence confidence (Busey, Tunnicliff, Loftus, & Loftus, 2000; Oppenheimer, 2008), though such cues can be unrelated to the accuracy of first-order judgments and, therefore, may not provide a veridical source of metacognition.
In the research reported here, we focused specifically on the SDT framework. We sought to evaluate whether metacognition and first-order decision accuracy can be dissociated in a manner incompatible with the SDT framework and, in so doing, offer clear constraints on the type of model able to account for this characteristically human cognitive process. To accomplish this, we examined metacognitive performance in artificial-grammar learning (AGL; Pothos, 2007; Reber, 1967), a paradigm in which, after incidental exposure to apparently random strings of letters, participants classify new strings as obeying or contravening an inherent set of rules. The AGL task has proven particularly useful in the study of implicit learning and is well known for demonstrating decision accuracy in the absence of confidence (i.e., a knowledge state equivalent to blindsight; e.g., Dienes, Altmann, Kwan, & Goode, 1995). Here, we revealed the opposite dissociation, blind insight, by establishing an unbiased selection of participants who exhibited chance performance and then examining their metacognitive accuracy.
Method
Participants
Participants were 450 student volunteers (227 male, 223 female) ages 18 to 40 years (
Materials
Two finite-state grammars (Grammar A and Grammar B, both from Reber, 1969) were used to generate grammar strings between five and nine characters in length. Training sets comprised either 15 or 16 strings (depending on the experiment) selected from the grammar to which the participant had been assigned and repeated three times in random orders. The test set comprised either 60 or 64 strings (depending on the experiment), half from each grammar, none of which had been used during training. Strings were presented in black on a white background at the center of a computer screen.
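For readers unfamiliar with finite-state grammars, the following sketch shows how strings of five to nine characters can be generated by a random walk through a transition table. The table `TOY_GRAMMAR` is a made-up example for illustration only; it is not Grammar A or Grammar B from Reber (1969), and the function names are hypothetical.

```python
import random

# Hypothetical toy grammar: state -> list of (letter, next_state).
# A next_state of None means the string ends after that letter.
TOY_GRAMMAR = {
    0: [("M", 1), ("V", 2)],
    1: [("T", 1), ("V", 3)],
    2: [("X", 2), ("R", 3)],
    3: [("M", 4), ("T", 0)],
    4: [("R", None), ("X", None)],
}


def generate_string(rng, min_len=5, max_len=9):
    """Random-walk the grammar, rejecting strings outside the length range."""
    while True:
        state, letters = 0, []
        while state is not None and len(letters) <= max_len:
            letter, state = rng.choice(TOY_GRAMMAR[state])
            letters.append(letter)
        if state is None and min_len <= len(letters) <= max_len:
            return "".join(letters)


def is_grammatical(string):
    """Check whether a string can be produced by a complete walk of the grammar."""
    state = 0
    for letter in string:
        if state is None:  # the grammar already ended, but letters remain
            return False
        transitions = dict(TOY_GRAMMAR[state])
        if letter not in transitions:
            return False
        state = transitions[letter]
    return state is None  # the walk must finish on an exit transition


if __name__ == "__main__":
    rng = random.Random(1)
    print([generate_string(rng) for _ in range(5)])
```

Rejection sampling keeps the generator simple: walks that end too early or run too long are discarded and retried, so every returned string is both grammatical and within the length limits.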
Procedure
Training strings were presented under the guise of a short-term memory task, with each string presented for memorization for 5 s, followed by a brief recall task before the next string appeared. The presentation order of both training and test strings was separately randomized for each participant. After training, participants were informed that the order of letters in the training strings had obeyed a complex set of rules and that they were to classify a new set of strings, exactly half of which would obey the same rules. Test strings were presented one at a time, and participants were asked to indicate the following without time constraints: (a) how familiar the string seemed to them on a scale from 0 to 100, (b) whether or not the string was grammatical (i.e., obeyed the rules), (c) how confident they were in their grammaticality judgment on a scale from 50 to 100 (50
Design
A dual-grammar design was employed in which half the participants were trained on Grammar A and half on Grammar B. At test, all participants classified the same set of test strings, all of which were different from the training strings. Precisely half of the test strings conformed to Grammar A, and half conformed to Grammar B. Thus, the nongrammatical test strings for one group were grammatical for the other group, which eliminated the need for an untrained control group. The key independent variable was grammatical status, manipulated within subject (grammatical vs. ungrammatical). There were two dependent variables of interest: first-order decision accuracy, for which we computed
Results
Approach to analysis
Our objective was to assess whether reliable metacognitive accuracy could exist in the absence of first-order accuracy. To test this, we identified that subset of participants whose decision accuracy was equivalent to chance. To avoid incorrect inferences, it was important that this selection be robust to biases arising from regression toward the mean. Specifically, analysis needed to be conducted on a sample of trials that had not itself been subject to the bias imposed by the selection process. This was accomplished by selecting participants on the basis of a subset of their trials and analyzing the remainder.
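The logic of selecting on one subset of trials and analyzing the remainder can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' analysis code: the selection subset is taken to be the first `n_selection` trials, and "equivalent to chance" is approximated by a hypothetical tolerance band around 50% correct rather than by the criterion used in the article.

```python
def select_and_hold_out(trials_by_participant, n_selection, tolerance=0.05):
    """Select participants near chance on their first n_selection trials and
    return their accuracy on the remaining (held-out) trials.

    trials_by_participant maps a participant id to a list of booleans
    (True = correct) in presentation order. The tolerance band around 0.5 is
    a hypothetical selection criterion used only for illustration.
    """
    held_out = {}
    for pid, trials in trials_by_participant.items():
        selection, remainder = trials[:n_selection], trials[n_selection:]
        if not selection or not remainder:
            continue  # cannot select or cannot analyze without trials
        selection_acc = sum(selection) / len(selection)
        if abs(selection_acc - 0.5) <= tolerance:
            held_out[pid] = sum(remainder) / len(remainder)
    return held_out


if __name__ == "__main__":
    data = {
        "p1": [True, False, True, False, True, True, True, True],
        "p2": [True, True, True, True, False, False, False, False],
    }
    print(select_and_hold_out(data, n_selection=4))
```

Because the held-out trials play no part in selection, their accuracy is not pulled toward 50% by the selection step itself, which is the regression-toward-the-mean bias the paragraph above describes.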
While a repeated random subsampling method might typically be applied to select trial subsets in a maximally unbiased fashion, our data contained a predictable linear trend that precluded this approach. It is an established phenomenon in AGL that performance (

Mean classification performance (
Where performance changes systematically in this way, chance performance on a random subsample of trials cannot reliably predict chance performance on the remainder. Consequently, we adopted a linear sampling approach (i.e., selecting participants who performed at chance in early test trials and analyzing their later trials), thus taking advantage of the tendency for performance to decrease over time. For this approach to be effective, the selected subset needs to be sufficiently large that performance for that subset is representative of performance across subsequent trials. We first attempted a selection including participants for whom
The same selection process was applied to identify participants who reliably performed above chance (
We further computed a Bayes factor to establish the extent to which these data provide evidence for the null hypothesis (
Metacognitive accuracy in the absence of first-order accuracy
Figure 3 illustrates the mean

Mean first-order accuracy (
Although these analyses were based on
Similarly, the difference in the percentage of correct judgments when participants were confident versus not confident, known as the
The source of metacognitive accuracy
To explore the source of the observed metacognition seen in participants lacking decision accuracy, we conducted a 2 (judgment: grammatical vs. ungrammatical) × 2 (confidence: confident vs. guess) within-subject analysis of variance on the proportion of correct judgments (see Fig. 4). This analysis revealed no main effect of judgment,

Mean proportion of correct grammaticality judgments among participants who did not exhibit first-order accuracy. Results are shown for “grammatical,” “ungrammatical,” and all judgments made with and without confidence. Error bars indicate ±1
Overall, these findings suggest that the observed metacognition (in the absence of decision accuracy) reflects a tendency for judgments made without confidence to exhibit below-chance accuracy, while confident judgments remain at chance. One possible interpretation of this difference is that there was some form of implicit error monitoring taking place that was expressed as reduced confidence where a wrong answer was made. If this was the case, however, the information exploited by the error-monitoring process was clearly unavailable to the preceding classification judgment.
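The comparison between confident and guess judgments can be expressed as a simple per-participant computation. The sketch below assumes that each trial is recorded as a (correct, confidence rating) pair and that a rating of 50 on the 50-to-100 scale marks a guess; both the data layout and the function name are hypothetical.

```python
def accuracy_by_confidence(trials, guess_rating=50):
    """Return (accuracy when confident, accuracy when guessing).

    trials is a list of (correct, confidence) pairs, where correct is a bool
    and confidence is a rating from 50 to 100. Treating guess_rating as the
    boundary for a guess is an assumption made for this illustration.
    """
    confident = [correct for correct, rating in trials if rating > guess_rating]
    guesses = [correct for correct, rating in trials if rating <= guess_rating]

    def proportion(values):
        return sum(values) / len(values) if values else float("nan")

    return proportion(confident), proportion(guesses)


if __name__ == "__main__":
    example = [
        (True, 80), (False, 90), (True, 70), (False, 60),
        (False, 50), (False, 50), (True, 50), (False, 50),
    ]
    acc_confident, acc_guess = accuracy_by_confidence(example)
    print(acc_confident, acc_guess, acc_confident - acc_guess)
```

In this made-up example, confident judgments sit at chance while guesses fall below it, which is the qualitative pattern described above; the numbers themselves are not data from the study.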
Effect of delay between judgments
Participants made grammaticality and confidence judgments consecutively, with confidence judgments necessarily following the grammaticality judgments. This arrangement gives rise to two potential issues. The first is that during the momentary pause between judgments, participants’ knowledge state may continue to stabilize, and metacognitive performance, deriving from the latter judgment, may show greater accuracy as a result. Some evidence for this has been identified in the context of reaction-time responses (Baranski & Petrusic, 2001; Charles, Van Opstal, Marti, & Dehaene, 2013; Pleskac & Busemeyer, 2010). However, in our experiments, there was no time constraint on judgments and, therefore, no obvious reason why a (first-order) judgment would be made before a stable knowledge state had been achieved. Furthermore, when Tunney and Shanks (2003, Experiments 1a and 1b) contrasted
A second issue arises from the potential for participants to make errors when reporting grammaticality judgments. As there were no time constraints, very few errors were anticipated; nonetheless, if a participant were confident that a string was grammatical but inadvertently pressed “ungrammatical,” or vice versa, then they might choose to report no confidence to reflect that error. Assuming they were applying veridical knowledge, this could result in below-chance accuracy for judgments attributed no confidence. For example, if we assume that participants’ knowledge on average permitted 60% classification accuracy (10% above chance), then when applied without error, their judgments would have 60% accuracy and be reported to have been made with some confidence. In contrast, when they applied that knowledge but inadvertently pressed the wrong button (and realized this), the judgments would have 40% accuracy (10% below chance) and would be reported to have been made with no confidence. As can be seen, the maximum extent to which the accuracy of no-confidence judgments could be reduced below chance by this mechanism is limited to the equivalent above-chance accuracy of confident judgments (10% in this illustration). Therefore, if this account applies, we should have observed above-chance accuracy in confident judgments at least equivalent to the below-chance accuracy of judgments made without confidence. This was not observed. Judgments without confidence were 8% below chance (
Inequality of variances
In an SDT model, if the underlying signals exploited to classify grammatical and ungrammatical strings had unequal variances, this could in principle result in an inflated estimate of
The parameters detailed here apply to both the following simulation and the criterion-jitter simulation described in the following section: On each trial, the grammaticality signal was generated as a Gaussian random variable, with
We simulated the experiment 1,000 times, assuming a
Criterion jitter
If the criterion employed in making grammaticality judgments was subject to jitter, this could result in an underestimate of
Discussion
We exploited the AGL paradigm to evaluate metacognitive performance in participants who lacked first-order decision accuracy. Analysis was conducted on data independent of that used in the selection of participants, and additional analyses and simulations eliminated effects of a delay between judgments, unequal variance, and criterion jitter as alternative explanations for the findings. The results revealed significant metacognitive discrimination independent of first-order decision accuracy. Specifically, confidence reports expressed reliable knowledge of whether judgments had been right or wrong despite the judgments themselves showing chance levels of discrimination. While the phenomenon of blindsight challenges the intuition that metacognitive performance must necessarily follow from reliable decision accuracy, the phenomenon of blind insight challenges the intuition that decision accuracy must necessarily exist for there to be metacognitive discrimination of the veracity of those first-order judgments. While we see no reason to expect this phenomenon to be unique to the context of AGL, additional research is needed to determine the extent to which our results generalize across distinct paradigms, including perceptual decisions.
What are the implications of our results for theoretical models of metacognition? Models that rest on SDT fit naturally with bottom-up hierarchical arrangements in which low-level discriminations provide the signals supporting high-level metacognitive discriminations. These models can naturally account for dissociations between (low-level) decision accuracy and metacognition as seen in blindsight and unconscious knowledge by simply assuming that a failure in the metacognitive process can leave lower-level discriminative processes intact. In contrast, blind insight represents a dissociation that is fundamentally at odds with a purely bottom-up hierarchical relationship relating first-order decision processes to metacognition, because the absence of reliable decision accuracy precludes the availability of signals supporting above-chance metacognitive performance on these models. Our observation of blind insight therefore establishes that the metacognitive process must either draw on information additional to that available to the first-order decision process or exploit the same information in a substantially different way. Such an arrangement is not readily implemented by models that adhere closely to SDT in assuming that metacognitive judgments are made on the same signal underlying first-order decisions (Clifford et al., 2008; Maniscalco & Lau, 2012; Pleskac & Busemeyer, 2010; Scott & Dienes, 2008; Snodgrass et al., 2004). While amendments to these models might accommodate the blind-insight phenomenon, any such amendments would represent a fundamental departure from the standard SDT framework. In short, significant metacognition (
Accounting for blind insight therefore requires a model architecture that couples metacognitive performance less closely to the signal driving first-order judgments. Progress in this direction has been made by Timmermans, Schilbach, Pasquali, and Cleeremans (2012), who describe a “hybrid” neural network model in which first-order decision processes and second-order metacognitive processes are supported by independent networks. While both networks are feed-forward architectures trained using standard back-propagation algorithms, the metacognitive network takes as input not simply the output of the first-order network but rather the difference between its input and output. It is interesting that during training on a blindsight simulation, this model exhibited a pattern of results similar to blind insight; however, this was only a transient stage of model dynamics rather than a stable state as in our data. Moreover, their model remains faithful to the assumptions of SDT by proposing unidirectional bottom-up signal flow (back-propagation is used only for updating connection strengths).
Given the inability of SDT-based models to account for blind insight, our data suggest that a more radical revision of metacognition models is required. One potential direction for revision would take into account the evidence, mentioned in the Introduction, that neural dynamics underlying perceptual decisions involve counterflowing bottom-up and top-down neural signals (Bowman et al., 2006; Jaskowski & Verleger, 2007; Salin & Bullier, 1995). A framework for interpreting these countercurrent dynamics is provided by
In summary, blind insight demonstrates a previously undescribed dissociation between second-order awareness and first-order performance and in so doing presents a critical challenge to prevailing models of metacognition.
Footnotes
Declaration of Conflicting Interests
The authors declared that they had no conflicts of interest with respect to their authorship or the publication of this article.
Funding
This work was supported by the Economic and Social Research Council (Grant No. RES-062-23-1975), an Engineering and Physical Sciences Leadership Fellowship to A. K. Seth (Grant No. EP/G007543/1), an Engineering and Physical Sciences Fellowship to A. B. Barrett (Grant No. EP/L005131/1), the European Research Council project Collective Experience of Empathic Data Systems project (Grant No. 258749; FP7-ICT-2009-5), and a donation from the Dr. Mortimer and Theresa Sackler Foundation via the Sackler Centre for Consciousness Science.
Open Practices
All data and materials have been made publicly available via Open Science Framework and can be accessed at https://osf.io/ivdk4/files/. The complete Open Practices Disclosure for this article can be found at http://pss.sagepub.com/content/by/supplemental-data. This article has received badges for Open Data and Open Materials. More information about the Open Practices badges can be found at https://osf.io/tvyxz/wiki/view/ and
.
