Abstract
Background:
Excessive checking is a common, debilitating symptom of obsessive–compulsive disorder. To further examine cognitive processes underpinning checking behaviour, and clarify how and why checking develops, we designed a novel operant paradigm for rats, the observing response task. The present study used the observing response task to investigate checking behaviour following excitotoxic lesions of the medial prefrontal cortex, nucleus accumbens core and dorsal striatum, brain regions considered to be of relevance to obsessive–compulsive disorder.
Methods:
In the observing response task, rats pressed an ‘observing’ lever for information (provided by light onset) about the location of an ‘active’ lever that provided food reinforcement. Following training, rats received excitotoxic lesions of the regions described above and performance was evaluated post-operatively before histological processing.
Results:
Medial prefrontal cortex lesions selectively increased functional checking with a less-prominent effect on non-functional checking and reduced discrimination accuracy during light information periods. Rats with nucleus accumbens core lesions made significantly more checking responses than sham-lesioned rats, including both functional and non-functional checking. Dorsal striatum lesions had no direct effect on checking per se, but reduced both active and inactive lever presses, and therefore changed the relative balance between checking responses and instrumental responses.
Conclusions:
These results suggest that the medial prefrontal cortex and nucleus accumbens core are important in the control of checking, perhaps via their role in processing uncertainty of reinforcement, and that dysfunction of these regions may therefore promote excessive checking behaviour, possibly relevant to obsessive-compulsive disorder.
Keywords
Introduction
Excessive checking is a common, debilitating symptom of obsessive–compulsive disorder (OCD) (Fontenelle et al., 2006). However, the neural mechanisms that underlie the development of excessive checking are not fully understood.
Functional neuroimaging and neuropsychological studies suggest that OCD is related to cortico-striatal hyperactivity (Graybiel and Rauch, 2000), particularly within the orbitofrontal-striatal circuitry (Menzies et al., 2008). Recent studies have implicated a wider network of regions in OCD pathology, including the nucleus accumbens (NAc), amygdala and the anterior cingulate cortical (ACC) region of the medial prefrontal cortex (mPFC) (Figee et al., 2011; Milad and Rauch, 2012). The NAc is clearly and directly implicated in OCD; OCD patients show altered ventral striatal activity when anticipating reward (Figee et al., 2011; Marsh et al., 2015; Remijnse et al., 2006) and effective deep-brain stimulation targets for treatment-refractory OCD patients are often in or around the ventral striatum (Greenberg et al., 2010). Although there is less direct evidence for ACC involvement in the OCD phenotype, ACC dysfunction is relevant to OCD development in a number of important ways. For example, ACC is involved in error detection and conflict monitoring (Van Veen and Carter, 2002) and electrophysiological and functional neuroimaging studies demonstrated increased error-related negativity (i.e. error detection) in OCD patients (Fitzgerald et al., 2005; Gehring et al., 2000; Ursu et al., 2003). Dysfunction of mPFC may also impair goal-directed control, leading to excessive habit formation, which may contribute to OCD symptoms (Gillan and Robbins, 2014). In rats, lesions to either the prelimbic cortex (PL), or its projection regions within the dorsomedial striatum, prevent acquisition of goal-directed learning and render performance habitual (Corbit and Balleine, 2003; Yin et al., 2005).
We recently established the observing response task (ORT) as a translational, operant test of checking behaviour, to examine the cognitive processes underpinning compulsive checking and its development (Eagle et al., 2014). In the ORT, rats can press a lever (the observing/checking response) that gives information about the location of future rewards. Uncertainty about which lever is reinforced can be reduced using the light-cue information provided by the ‘observing’ response; thus, checking can increase as a consequence of increased outcome uncertainty. Treatment with the dopamine D2/D3 receptor agonist quinpirole increased both ‘functional’ (for information) and also ‘non-functional’ (perseverative) checking in the ORT (Eagle et al., 2014). This quinpirole-induced checking is comparable with findings from a well-established open-field model of checking in rodents (Szechtman et al., 1998), in which excessive returns to a ‘home base’ in the open-field are interpreted as ‘checking behaviour’. Evidence from open-field checking also clearly implicates the NAc core in control of the vigour or extent of checking (Ballester González et al., 2015; Dvorkin et al., 2010). NAc core lesions increased ‘checking’ to an extent comparable with checking in unlesioned, quinpirole-treated rats (Dvorkin et al., 2010) and delayed, but did not prevent, the development of checking (Ballester González et al., 2015). Cortical input into the neural circuitry of checking is less clear; in contrast with NAc lesions, orbitofrontal cortex (OFC) lesions did not directly increase checking but produced changes in goal-directed activity in the open-field checking task. However, both the ACC and PL project to the NAc core (Brog et al., 1993; Vertes, 2004) and either may be a critical component of the cortico-striatal circuitry that normally moderates excessive checking. The role of these cortical structures to control checking behaviour remains to be investigated, alongside the role of their alternate striatal projection, the dorsal striatum (DStr).
NAc, DStr and mPFC are all potential sites-of-action for quinpirole in the ORT given their high concentration of D2 receptors. Subchronic quinpirole increased dopamine synthesis in the DStr and the NAc (Rowlett et al., 1995) but decreased dopamine levels in the mPFC (infralimbic cortex (IL), prelimbic cortex (PL) and ACC) (Sullivan et al., 1998). Similarly, there is reduced neuronal activity in the NAc core, DStr and mPFC of rats sensitised to quinpirole (Carpenter et al., 2003; Richards et al., 2005).
In the present study, we investigated the effects of inactivation of DStr, NAc core and mPFC, in ORT checking behaviour, using excitoxic (fibre-sparing) lesions of these structures, to determine the role of these regions in the neural circuitry underpinning control of checking behaviour.
Materials and methods
Subjects
Subjects were adult male Lister hooded rats sourced from Charles River, UK. Rats weighed 227 ± 8 g initially and 338 ± 21 g at the time of surgery (Experiment 1). Rats weighed 268 ± 2 g initially and 411 ± 5 g at the time of surgery (Experiment 2). Rats were housed in groups of four, in cages enriched with cardboard tubes, in a temperature-controlled room (minimum 22°C) under a reversed 12 h light-dark cycle (lights on 19:00; lights off 07:00). Rats were maintained at approximately 95% of their free-feeding weight and received 15–20 g of food daily (task reinforcer pellets plus laboratory chow given 1–2 h following the daily test session); this restricted weight gain to approximately 5 g per week. Water was available ad libitum for the duration of the procedures. All experiments were conducted in accordance with the United Kingdom Animals (Scientific Procedures) Act, 1986.
Apparatus
The apparatus and testing procedure have been described previously (Eagle et al., 2014). Testing took place in 12 operant-conditioning chambers (Med Associates). The chamber configuration is shown in Figure 1. Each chamber had two retractable levers, with a light above each, to the left and right of a central food well (Figure 1(a)). Illumination of the light above a lever signalled that the lever was currently active and delivered food pellets when pressed (Figure 1(b)). A third lever, the observing lever, was located in the centre panel of the back wall at the same height above the chamber floor as the active/inactive levers. If the observing lever was extended, a lever press turned on the light above the active lever if it was previously unlit (Figure 1(c)). A house light in the chamber roof was illuminated throughout the session. A pellet dispenser delivered 45 mg Noyes formula P pellets (TestDiet; Purina) into the food well when the active lever was pressed. Chamber operation and on-line data collection were controlled with the Observing Response Task program (written by A.C. Mar) and the Whisker server software (Cardinal and Aitken, 2010). Rats were tested between 5 and 7 days per week.

(a–d) The observing response task: (a) Training 1. The observing lever (i) on the back panel of the box was retracted. Rats were trained to press two levers (ii) on the front panel of the chamber. A light was illuminated above each lever to indicate ‘active’ status (iii). Completion of lever presses requirement gave a food pellet in a central food well (iv), (b) Training 2: lever discrimination. One front panel lever was active and the light above was illuminated (v); the other lever was inactive and the light above was unlit. The active lever and illuminated-light location switched on a pre-determined schedule (vi), (c) Observing response task. Both levers were extended but neither light was lit above (vii). The observing lever was extended, and a single press on the observing lever (viii) illuminated the light above the active lever for 15 s and (d) extra observing lever presses (EOLPs) (ix); when the active-lever light was illuminated, any further observing lever presses had no consequence, but were recorded as extra observing lever presses.
Behavioural training
Rats received two training sessions per day for 15 training days (sessions 1–30), and one session per day during and subsequent to the last 3 days of observing lever training.
Training 1 – lever acquisition
Rats were trained to lever press for food pellets. Both front panel levers were presented and active (i.e. resulted in pellet delivery). The light above each lever was illuminated for the whole session. The observing lever remained retracted. Rats were reinforced with a food pellet on a fixed ratio (FR) 1 (sessions 1–2) or FR3 (session 3) schedule for completing the required presses on one or the other lever. Each session was terminated after 21 min or 200 rewards, whichever was sooner.
Training 2 – lever discrimination
Rats were trained to discriminate active from inactive levers. Both front panel levers were presented, one active and one inactive. The observing lever remained retracted. The light above the active lever was lit and the light above the inactive lever was unlit. The position of the active lever/light switched on a fixed time (FT) of 90 s schedule. The sequence always began with left lever active, which promoted more rapid learning; the rats could begin each session with predictably rewarded active lever location. Left and right levers were active for equal duration per session. An active lever press delivered food pellets on a pre-determined schedule of reinforcement (see below). Inactive lever pressing gave no consequence. If a rat switched from active to inactive lever, the active lever responses within a partially completed ratio were not reset to zero. However, the schedule requirement was restarted following a switch in location of active lever. Sessions ended after 21 min or 200 reward pellets. Rats were reinforced under the following schedules: FR3 increasing to FR10, variable ratio (VR) 5–15 increasing to VR10–20.
The observing response task
Rats were trained to make observing responses that ‘produce discriminative stimuli associated with the conditions of availability of primary reinforcement, but do not alter the availability of primary reinforcement’ (Wyckoff, 1952). At the beginning of the session, both front panel levers were presented, but there was no light illuminated above either lever. The observing lever was extended. One observing lever press (Figure 1(c)) illuminated the light above the currently active lever for a pre-determined period (30 s during training down to 15 s during the final version of the task). If the active lever switched location during the observing period, the light position switched correspondingly. While the active lever light was illuminated, any further observing lever presses had no consequence, but were recorded as extra observing lever presses (EOLPs; Figure 1(d)). EOLPs did not extend the period of light illumination. The active lever switched sides under an FT90s schedule. Rats were reinforced for active lever presses on a VR10–20 schedule. The session ended after 21 min or 200 reward pellets, whichever was sooner. Rats received nine sessions of observing lever training. Rats were trained twice a day (days 1–3; observing light duration of 30 s), decreasing to once a day (days 4–6; observing light duration of 15 s). A mean of sessions 31–33 constituted the pre-lesion baseline session.
Behavioural challenges
Rats were challenged with manipulations of task contingencies in order to assess the effects of changing reward uncertainty on observing.
Unpredicted reward omission
Unpredicted reward omission (removal of expected reward) has previously been shown to significantly increase checking responses (Eagle et al., 2014). We tested the hypothesis that removal of reward/reinforcement pellets, when reward was expected, during a single reward omission session would increase checking. During reward omission, the session was identical to baseline (FT90s, VR10–20, OLP FR1 (15 s)), but the food reinforcer was delivered outside the test chamber (so all food delivery cues were identical except for food availability in the magazine).
Combined uncertainty of active lever location and contingency degradation
Because the effect of the mPFC lesion was small, we decided to test this group under an uncertainty schedule. We therefore tested the hypothesis that increased uncertainty about active lever location and response requirement would increase observing responses in this group. Uncertainty was increased by switching active/inactive lever location less predictably – from a FT90s to a variable time (VT) of 20–120 s schedule. For the contingency degradation, the response requirement changed from VR10–20 to variable interval (VI) of 10–20, and so behaviour became less linked to reinforcement. Thus, during reward uncertainty, the schedule was VT20-120s, VI10-20s, OLP FR1 (15 s). Combined unpredictability of active/inactive lever location (VT20–120s from VT90s) and effective contingency degradation (VI10–20s from VR10–20) was shown to significantly increase checking responses (d’Angelo PhD thesis, unpublished observations).
Behavioural measures
The main measures on the ORT are detailed in the following:
Surgery
Experiment 1
Following training, rats were allocated to three groups matched for baseline task performance of OLPs, EOLPs, active lever presses and rewards earned. Animals received lesions of the NAc core (n = 12), DStr (n = 12) or sham lesions (NAc core site, n = 6; DStr site, n = 6; total, n = 12). Rats were anaesthetised and secured in a stereotaxic frame (David Kopf Instruments). For all surgeries, the incisor bar was adjusted until the heights of lambda and bregma were equal so as to obtain a flat skull position. Rats were anaesthetised with inhaled isoflurane carried in medical oxygen, induced at 5% and maintained at 1%–4% concentrations at a flow rate of 2 L/min. Upon exposure of the skull, a dental drill was used to make small holes in the skull above the sites of microinjection. Lesion coordinates were derived using a stereotaxic atlas (Paxinos and Watson, 2005), using bregma as the origin. The dorsoventral reading was taken from dura. Animals were allowed up to 2 weeks to recover prior to behavioural re-training. For the first 24 h post-surgery, rats were singly housed and then returned to their pre-surgical groups. For 3 days post-surgery, rats received meloxicam analgesia in their drinking water (30 mg/L).
Excitotoxic lesions were made using 0.09 M quinolinic acid dissolved in 0.1 M phosphate-buffered saline (PBS; vehicle), with pH adjusted to 7.2–7.4 using 0.1 M NaOH. The toxin was infused using a 31-gauge stainless steel injector (Cooper’s Needle Works) connected via polyethylene tubing to a 10 μL glass Hamilton syringe (Hamilton Bonaduz AG) mounted on a microinfusion pump (Harvard Apparatus, Ltd.). Lesions were made according to parameters in Table 1. The injector was left in place at each site for a determined period following infusion in order to allow the diffusion of the toxin away from the injection site. Sham surgery was carried out in the same manner, except that vehicle was infused instead of toxin.
Lesion coordinates used for lesions of the DStr, NAc core and mPFC.
DStr: dorsal striatum; NAc core: nucleus accumbens core; mPFC: medial prefrontal cortex; AP: anterior-posterior; ML: medial-lateral; DV:dorsoventral.
DV coordinates are from dura. Toxin infusion parameters: quinolinic acid, 0.09 M, via cannula.
Experiment 2
Following training, rats were allocated to two groups matched for pre-lesion task performance of OLPs, EOLPs, active lever presses and rewards earned. Animals received lesions of the mPFC (n = 16) or sham lesions (n = 14). The apparatus and procedures were similar to those in Experiment 1. Lesions were made according to parameters in Table 1.
Post-operatively, rats with sham lesions to each of the two different lesion sites were compared to assess whether they could be treated as one group for further analysis. Within these control groups, there was no evidence that the site of vehicle infusion had any effect on the primary measures of ORT performance. The NAc core group made more EOLPs than the DStr group pre-operatively (
Histology
Rats were terminally anaesthetised with sodium pentobarbitone and perfused transcardially with 0.01 M PBS followed by formaldehyde solution (Experiment 1%–4% paraformaldehyde in PBS; Experiment 2%–10% neutral buffered formalin). Brains were removed, postfixed in the respective fixative and transferred into 20% sucrose in 0.01 M PBS before sectioning on a freezing microtome. Coronal sections (60 μm) were stained with cresyl violet and lesion locations were mapped onto standardised sections of the rat brain (Paxinos and Watson, 2005).
Data analysis
Data were initially explored using box-plots and tests of homogeneity of variance so that outliers were identified and removed. Behavioural data were subjected to analysis of variance (ANOVA). A significance level of
Results
Histological analysis
Lesions were classified as acceptable if they showed significant damage or gliosis to the target area, with damage in both hemispheres, and no significant bilateral damage to the neighbouring structures.
Experiment 1
Representative photomicrographs of both lesions and sham controls, and schematic representations of the extent of damage to the striatum caused by quinolinic acid infusions, are shown in Figure 2 (NAc core) and Figure 3 (DStr). (a) Five out of 12 rats with NAc core lesions were excluded as follows: Two rats had significant unilateral damage to the DStr. Three rats had significant damage to the NAc shell region, sparing most of the NAc core region. Seven of 12 NAc core lesion rats were determined to have appropriate lesions. The lesion started at approximately bregma +1.7 and extended to bregma +0.2. For all rats, the lesion encroached slightly into the lateral NAc shell. (b) Six of 12 DStr rats were excluded as the lesion extended bilaterally into the NAc core. Six of 12 DStr rats were determined to have appropriate lesions. The lesion started at approximately bregma +2.2 and extended to bregma –0.4. (c) One of 12 sham lesion rats was excluded due to extensive unilateral damage throughout the whole striatum. In all lesioned animals, there was considerable cell loss and gliosis in the lesioned regions, accompanied by striatal shrinkage. In DStr-lesioned rats, the extensive tissue shrinkage resulted in a visible widening of the lateral ventricles. One rat had baseline OLPs more than 2 standard deviations higher than the respective group mean OLPs and was removed from further analysis. Therefore, final group sizes for this experiment were Sham, n = 10; NAc core, n = 7; DStr, n = 6.

Histological analysis of NAc core lesions: (a) schematic diagrams of NAc core lesions. Grey scale shading indicates extent of neuronal loss across subjects, with each subject represented as a separate stacked layer. Diagrams are modified from Paxinos and Watson (2005) and (b) photomicrographs of cresyl-stained coronal sections, depicting typical lesions of NAc core-lesioned (left) and sham-lesioned (right) rats. Arrows indicate the site of lesion.

Histological analysis of DStr lesions: (a) schematic diagrams of DStr lesions. Grey scale shading indicates extent of neuronal loss across subjects, with each subject represented as a separate stacked layer. Diagrams are modified from Paxinos and Watson (2005) and (b) photomicrographs of cresyl-stained coronal sections, depicting typical lesions of DStr-lesioned (top) and sham-lesioned (bottom) rats. Arrows indicate the site of lesion.
Experiment 2
Representative photomicrographs of both mPFC lesions and sham-lesioned controls and schematic representations of the extent of damage to the mPFC caused by quinolinic acid infusions are shown in Figure 4. Two of 16 rats with mPFC lesions were excluded because they had little or no apparent damage to the target regions of the mPFC. For all of the remaining rats in the mPFC group, each had significant damage to the anterior cingulate and prelimbic cortex, in both hemispheres. In two cases, there was minor encroachment of the lesion into medial orbital cortex. In six cases, there was minor unilateral encroachment of the lesion into the dorsal infralimbic cortex. There was no damage to the anterior parts of the striatum in any of the lesion group. For the control group, there was no evidence of lesion damage to the target area. Two rats (one lesion and one sham) had baseline OLPs more than 2 standard deviations higher than their respective group mean OLPs and were removed from further analysis. Therefore, final group sizes for this experiment were Sham, n = 13; mPFC, n = 13.

Histological analysis of mPFC lesions: (a) schematic diagrams of mPFC lesions. Grey scale shading indicates extent of neuronal loss across subjects, with each subject represented as a separate stacked layer. Diagrams are modified from Paxinos and Watson (28) and (b) photomicrographs of cresyl-stained coronal sections, depicting typical lesions of mPFC-lesioned (left) and sham-lesioned (right) rats. Arrows indicate the site of lesion.
Pre-surgical ORT performance
Pre-operatively, prospective lesion groups were matched for OLPs, EOLPs, total active lever presses and rewards earned. Following removal of subjects, there were no significant baseline differences in pre-operative performance between groups with respect to any measure (Table 2; all
Pre-surgical ORT performance.
NAc core: nucleus accumbens core; DStr: dorsal striatum; mPFC: medial prefrontal cortex; OLPs: observing lever presses; EOLPs: extra observing lever presses; ALP on: active lever presses, light on; ALP off: active lever presses, light off; ILP on: inactive lever presses, light on; ILP off: inactive lever presses, light off; Experiment 1: lesions of the NAc core, DStr and controls. Experiment 2: lesions of the mPFC and controls.
Data are expressed as mean ± SEM.
Post-surgical ORT performance
Following recovery from surgery, rats were tested for 20 daily sessions under the standard schedule of the task (FT90s, VR10-20, OLP FR1 (15 s)).
Effects on checking
Functional checking (OLPs)

The effect of NAc core, DStr and mPFC lesions on checking. Figures show pre-surgery baseline sessions (three sessions) and 20 post-surgical sessions for NAc core (left), DStr lesion (middle) and mPFC (right) rats and sham-operated controls. (a, b, c) functional OLPs; (d, e, f) non-functional EOLPs. Significance is denoted as follows: #
Non-functional checking (EOLPs)
Effects on instrumental responding
Experiment 1
Figure 6(a) shows that NAc core lesions significantly reduced the rate of active lever presses during periods when the active lever light was unlit (pre-post block × lesion, block × lesion:
Figure 7 indicates that DStr lesions led to a significant impairment in instrumental responding, both when the active lever light was lit and when it was unlit.

The effect of NAc core lesions on instrumental responding. Figure shows pre-surgery baseline (three sessions) and 20 post-surgical sessions for NAc core rats and sham-operated controls: (a) ALP off, (b) ILP off, (c) % Active on and (d) % Active off. Significance is denoted as follows: #

The effect of DStr lesions on instrumental responding. Figures show pre-surgery baseline sessions (three sessions) and 20 post-surgical sessions for DStr lesion rats and sham-operated controls: (a) ALP on, (b) ALP off, (c) ILP on, (d) ILP off, (e) % Active on and (f) % Active off. Significance is denoted as follows: #
Periods when the active lever light was lit
Figure 7(a) indicates that DStr lesions reduced the rate of active lever presses during periods when the active lever light was lit (pre-post block × lesion, block 1:
Although rats with DStr lesions reduced their rate of inactive lever presses relative to pre-lesion baseline during periods when the active lever light was lit, this was not statistically significant (Figure 7 (c); pre vs post, all blocks
Periods when the active lever light was unlit
Figure 7(b) indicates that DStr lesions reduced the rate of active lever presses during periods when the active lever light was unlit (pre-post block × lesion, block 1:
Similarly, Figure 7(d) shows that DStr lesions reduced the rate of inactive lever presses during periods when the active lever light was unlit (pre-post block × lesion, block 1:
Figure 7(f) shows that during periods when the light was unlit, the impairment in instrumental responding led to an impairment in discrimination accuracy (pre-post block × lesion, block 1:
Experiment 2
Figure 8(b) shows that mPFC lesions increased the rate of inactive lever presses, but only during periods when the active lever light was lit (pre-post block × lesion, block × lesion:

The effect of mPFC lesions on instrumental responding. Figure shows pre-surgery baseline sessions (three sessions) and 20 post-surgical sessions for mPFC lesion rats and sham-operated controls: (a) ALP on, (b) ILP on, (c) % Active on and (d) % Active off. Significance is denoted as follows: #
Performance during unpredicted reinforcer omission
Following post-surgical testing, rats completed one omission-of-reinforcer session and five (Experiment 1) or three (Experiment 2) recovery baseline sessions under the standard schedule of the task (see Supplementary Material).
Performance during uncertainty
Effects on checking
Functional checking (OLPs)
Figure 9(a) shows that the uncertainty manipulation increased OLP irrespective of lesion group (pre-post block × lesion, block,

Effects of uncertainty on checking. Figures show baseline sessions (three sessions under the standard schedule of the task) and 25 sessions of uncertainty for mPFC lesion rats and sham-operated controls: (a) OLPs and (b) EOLPs. Significance is denoted as follows: **
However, as evident in Figure 9(a), there was a strong trend for the mPFC lesion group to exhibit more OLP than the Shams, as had been shown previously (Figure 5(c)).
Non-functional checking (EOLPs)
Figure 9(b) shows that there was no effect of uncertainty on the rate of non-functional checking in either group (pre-post block × lesion, block,
Effects on instrumental responding
Figure S6 shows that when reinforcement was made more uncertain, there was no effect of uncertainty on any measure of instrumental responding (pre-post block × lesion, all measures,
Summary of main findings
Summary of behavioural effects of NAc core, DStr and mPFC lesions on the ORT during all experimental challenges.
NAc core: nucleus accumbens core; DStr: dorsal striatum; mPFC: medial prefrontal cortex; OLPs: observing lever presses; EOLPs: extra observing lever presses; ALP on: active lever presses, light on; ALP off: active lever presses, light off; ILP on: inactive lever presses, light on; ILP off: inactive lever presses, light off; -: no significant difference relative to Shams; ↓: significant decrease relative to Shams; ↑: significant increase relative to Shams; +: significant increase relative to recovery baseline.
Discussion
Selective excitotoxic damage to the mPFC significantly increased functional and non-functional checking and reduced discrimination accuracy during light information periods. NAc core lesions significantly increased both functional and non-functional checking, as well as transiently impairing accuracy during periods without information. DStr lesions led to a substantial reduction in instrumental responding, producing profound changes in task performance.
Neural substrates of checking
mPFC and NAc core implicated in functional checking behaviour
The finding that both mPFC- and NAc core-lesioned rats increased functional checking behaviour suggests that these regions, because of their known anatomical interconnectivity, form a functional PFC–striatal circuit that is critical for the control of checking. The dorsal mPFC projects to both the NAc core and the medial DStr (Heidbreder and Groenewegen, 2003), and it is possible that both sectors of the striatum are implicated in checking. The present findings are consistent with those of ‘open-field’ checking (Szechtman et al., 1998), in which the NAc core appears to exert inhibitory control over certain components of compulsive checking (Ballester González et al., 2015; Dvorkin et al., 2010; Tucci et al., 2014). In the case of NAc core lesions, there were also a later developing increase in non-functional checking which may relate to models of OCD. For mPFC lesions, lesioned rats did exhibit increased mean levels of non-functional checking (Figure 5(f)), although the high variability in this measure precluded attainment of statistical significance. Overall, these findings support that NAc core-mPFC circuitry may contribute to maladaptive non-functional checking behaviour, with a possible focus on the nucleus accumbens.
DStr and checking
DStr-lesioned rats, unlike NAc core-lesioned rats, were not different from controls in their rate of checking. However, the large size of these DStr lesions, which encompassed both the medial and lateral striatum, makes interpretation of the results less straightforward, given the substantial evidence of functional heterogeneity between striatal subregions (Devan et al., 2011; White, 2009). Analysis of the data indicates that although DStr lesions did not increase or decrease checking numerically, they did substantially reduce instrumental responding, and consequently, the ratio of checking to instrumental responding was greatly increased. Thus, DStr lesions produced a qualitatively similar effect on checking to that of NAc core lesions – although in the latter, a weaker reduction in instrumental responding was accompanied by a genuine increase in checking. Therefore, although superficially different, both NAc core and DStr lesions induced a similar behavioural profile in the ORT.
Theories that might explain increased functional checking
Conditioned reinforcement versus information-seeking
Two key hypotheses exist regarding the maintenance of observing or checking. The conditioned reinforcement hypothesis posits that discriminative stimuli maintain observing responses because these stimuli are associated with primary reinforcement (Dinsmoor, 1983). By contrast, the information or uncertainty reduction hypothesis argues that observing is maintained because it predicts the availability and non-availability of reinforcement (Berlyne, 1960). The profile of ORT performance supports the hypothesis that rather than responding for conditioned reinforcement, rats may be checking for information (potentially to reduce uncertainty) about the location of the active (i.e. currently rewarded) lever, consistent with Berlyne’s information-seeking hypothesis of observing. First, checking increased during periods of reward uncertainty or reward omission, when rats were potentially no longer receiving expected feedback for correct responses. Thus, rats had learned the meaning of the observing light but, under baseline task conditions, were choosing alternative strategies. Second, lesions of the NAc core have been shown to reduce the ability of a food-associated conditioned reinforcer to support the acquisition of a new instrumental response (Parkinson et al., 1999). Had the observing light acquired general affective value (through being associated with food availability), then NAc core lesions should have abolished, or at least reduced, checking, but increases were in fact observed. Therefore, the evidence is in favour of functional checking representing information-seeking and uncertainty reduction in rodents.
Deficient inhibitory response control
Increased functional checking could also possibly result from enhanced exploration or greater impulsivity, perhaps due to failures of behavioural inhibition mechanisms leading to hyperactivity (Maldonado-Irizarry and Kelley, 1994; Wu et al., 1993) or impulsivity (Dalley et al., 2011). However, increased checking was unlikely to reflect a generalised increase in responding, since in contrast to the increase in checking, NAc core lesions reduced instrumental responding for food and the NAc core group was not different from controls in a test of locomotor activity. It is also unlikely that increased checking in mPFC-lesioned rats was the result of generalised hyperactivity as lesioned rats did not make more instrumental responses during periods without information compared with controls.
Increased checking is functional, arising from impaired discrimination of reward contingencies
NAc core lesions impaired rats’ ability to discriminate without information (when the active lever light was unlit). Therefore, the increased checking produced by these lesions may have arisen to compensate for this deficit by providing additional discriminatory information and is in fact fully functional. Optimal ORT performance requires rats to retrieve information about the likely location of the active lever in the absence of the exteroceptive visual cue (i.e. lever light). Therefore, they must use internally generated cues, including recent experience of reinforcement contingencies (spatio-temporal associations with reward) to guide response choice. Such a failure of discrimination performance is consistent with impairments following NAc core lesions in several studies of spatial discrimination learning (Annett et al., 1989; Klein et al., 2004; Schacter et al., 1989; Seamans and Phillips, 1994; Smith-Roe et al., 1999), although some of those studies found effects in the nucleus accumbens shell region rather than the core.
The DStr lesion produced a similar profile to the NAc core lesion in terms of impaired discrimination and preserved checking behaviour but the deficit in the uncued instrumental discrimination in the DStr group was very large compared to that seen following NAc core lesions, suggesting that the DStr group failed to compensate for this discrimination incapacity by elevating functional checking behaviour. Alternatively, high levels of motor output were disproportionately reduced by the DStr lesion, preventing any potential compensatory increases in observing.
Effects of mPFC and striatal lesions on instrumental responding
The reduction in instrumental responding for food following NAc core lesions is unlikely to reflect impaired motor output (see above) or motivational impairment (Cardinal et al., 2002; Parkinson et al., 2000), given the intact approach responses to the active lever when lit and the unimpaired progressive ratio performance in these animals (see Supplementary Material). The impaired spatial discrimination of reinforcement contingencies may reflect specific impairments in spatial processing or in the use of interoceptive discriminative cues in relationship to reward. Thus, they may arise because of a failure of working memory for recent reward outcomes following responding in the absence of the explicit light cue.
The extensive nature of the DStr lesion makes it likely that several processes contributing to the maintenance and discrimination of instrumental responding were impaired. Not only were these DStr-lesioned rats less active but they also had reduced breakpoints on the progressive ratio schedule (see Supplementary Material). However, a primary motivational deficit seems unlikely given their normal rates of food consumption and maintenance of body weight. These findings are consistent with an extensive literature showing specific deficits in motor and motivational function following more discrete DStr lesions (Eagle et al., 1999; Fricker et al., 1996; Pisa, 1988; Whishaw et al., 1986). It is likely that the profound instrumental discrimination impairment was caused by fundamental impairments in action-outcome processing known to be produced by selective dorsomedial striatal lesions (Yin et al., 2004, 2005, 2006), possibly in combination with spatial working memory deficits as above.
The deficit in instrumental discrimination by the mPFC-lesioned rats appeared to be relatively selective in the absence of overall reductions in instrumental responding. Thus, mPFC-lesioned rats made more presses on the ‘inactive’ lever during information periods, suggestive of either a basic impairment in memory for the rule concerning reinforcement availability, a mild attentional deficit in relation to the informative visual cue or an impairment of cognitive control. There is previous evidence of a role for the mPFC in attentional functions (Birrell and Brown, 2000; Muir et al., 1996; Ragozzino et al., 1999); however, the extended duration of the visual cue implies that the attentional load in this task was relatively small. Moreover, although the increased responding on the inactive lever during the explicit visual cue is suggestive of impaired inhibitory response control, there were no other indications of such a general executive deficit. Therefore, given the role of the mPFC, specifically the prelimbic cortex (Corbit and Balleine, 2003), in action-outcome learning, it seems likely that the present deficits in reinforcement discrimination are a consequence of impairments in this inhibitory control process. The behavioural effects of mPFC lesions appeared to show some recovery with repeated testing. However, when reinforcement was made more uncertain, some of the original increases in observing became more evident in the mPFC group (Figures 9(a)). In general, reinforcement uncertainty increased observing, consistent with the hypothesis that this behaviour arises from a need to sample information under uncertainty.
General implications
In terms of providing an adequate model of OCD, it is important to distinguish functional from non-functional checking, the severe symptoms of OCD checking presumably relating to the former. The present study has not demonstrated unequivocal non-functional checking as the increased observing could in most instances be attributed to impairments of uncertainty processing. However, there is considerable individual variability in the proportion of functional to non-functional checking and quinpirole treatment can certainly greatly enhance the latter. Therefore, the present ORT paradigm may be useful for determining how normal information-seeking can become pathological, as expressed by non-functional checking. By implementing the ORT paradigm, we have been able in this study to begin to define the neural networks controlling normal information-seeking in the context of reward and how this potentially may lead to aberrant checking behaviour.
In summary, we have shown that damage to the NAc core and mPFC significantly increased functional checking in the ORT. This is consistent with increased checking following NAc core lesions in the ‘open-field’ model of checking (Dvorkin et al., 2010). The results imply that the NAc core and mPFC form a functional PFC–striatal circuit that is critical for the control of checking behaviour and also provide support for an information-seeking account of checking in the ORT.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.
Funding
This research was funded by a Wellcome Trust Senior Investigator Award (104631/Z/14/Z) awarded to T.W. Robbins. Work was completed at the Behavioural and Clinical Neuroscience Institute, University of Cambridge, Cambridge, UK, supported by a joint award from the Medical Research Council and Wellcome Trust (G00001354). C. d’Angelo was supported by a studentship from the Medical Research Council.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
