Abstract
Objective:
Studies using proton magnetic resonance spectroscopy reveal substantial inconsistencies in the levels of brain glutamate, glutamine and glutamate + glutamine across schizophrenia spectrum disorders. This systematic review employs qualitative and quantitative methods to analyse the patterns and relationships between glutamatergic metabolites, schizophrenia spectrum disorders and brain regions.
Methods:
A literature search was conducted using various databases with keywords including glutamate, glutamine, schizophrenia, psychosis and proton magnetic resonance spectroscopy. Inclusion criteria were limited to case-control studies that reported glutamatergic metabolite levels in adult patients with a schizophrenia spectrum disorder diagnosis – i.e. first-episode psychosis, schizophrenia, treatment-resistant schizophrenia and/or ultra-treatment-resistant schizophrenia – using proton magnetic resonance spectroscopy at 3 T or above. Pooled study data were synthesized and analysed.
Results:
A total of 92 studies met the inclusion criteria, including 2721 healthy controls and 2822 schizophrenia spectrum disorder participants. Glu levels were higher in the basal ganglia, frontal cortex and medial prefrontal of first-episode psychosis participants, contrasting overall lower levels in schizophrenia participants. For Gln, strong differences in metabolite levels were evident in the basal ganglia, dorsolateral prefrontal cortex and frontal cortex, with first-episode psychosis showing significantly higher levels in the basal ganglia. In glutamate + glutamine, higher metabolite levels were found across schizophrenia spectrum disorder groups, particularly in the basal ganglia and dorsolateral prefrontal cortex of treatment-resistant schizophrenia participants. Significant relationships were found between metabolite levels and medication status, clinical measures and methodological variables.
Conclusion:
The review highlights abnormal glutamatergic metabolite levels throughout schizophrenia spectrum disorders and in specific brain regions. The review underscores the importance of standardized future research assessing glutamatergic metabolites using proton magnetic resonance spectroscopy due to considerable literature heterogeneity.
Introduction
The glutamate hypothesis for schizophrenia asserts that glutamatergic dysfunction is a key causal contributor to both the pathogenesis and the symptomatology of schizophrenia (Egerton and Stone, 2012). Advances in neuroimaging techniques have permitted comprehensive testing of this hypothesis via examination of the extent of glutamatergic dysfunction in people living with schizophrenia spectrum disorders (SSD) and the corresponding clinical consequences. Despite the prominence of the glutamate hypothesis, ongoing research on the role of brain glutamate in schizophrenia using proton magnetic resonance spectroscopy (1H-MRS), although promising, has not produced robust evidence to support the hypothesis. Several systematic reviews and meta-analyses have been conducted on 1H-MRS studies assessing glutamatergic metabolites across the clinical spectrum of schizophrenia (Iwata et al., 2018; Kumar et al., 2020; Marsman et al., 2013; Merritt et al., 2016, 2021; Nakahara et al., 2022; Poels et al., 2014; Salavati et al., 2015; Smucny et al., 2021). The brain areas most consistently affected appear to be the anterior cingulate cortex (ACC), the basal ganglia, the midcingulate cortex and the medial prefrontal (MPF) cortex. Glutamatergic metabolite levels, however, do not present a uniform pattern across brain areas or SSD. A summary of these findings can be found in Table 1.
Summary of 1H-MRS glutamatergic findings.
↑: Higher levels; ↓: lower levels; ACC: anterior cingulate cortex; BG: basal ganglia; DLPFC: dorsolateral prefrontal cortex; FEP: first-episode psychosis; Gln: glutamine; Glu: glutamate; Glx: Glu + Gln; Hipp: hippocampus; MCC: midcingulate cortex; MFC: medial frontal cortex; MPF: medial prefrontal region; OCC: occipital cortex; PFC: prefrontal cortex; 1H-MRS: proton magnetic resonance spectroscopy; SSD: schizophrenia spectrum disorders; TRS: treatment-resistant schizophrenia; uTRS: ultra-TRS.
The table provides a summary of the outcomes obtained from meta-analyses and systematic reviews investigating glutamatergic metabolites using 1H-MRS studies across SSD (Iwata et al., 2018; Kumar et al., 2020; Marsman et al., 2013; Merritt et al., 2016, 2021; Nakahara et al., 2022; Poels et al., 2014; Salavati et al., 2015; Smucny et al., 2021).
Contributing to this complex pattern of results is the considerable methodological diversity in the field, particularly regarding MRI field strength. Both glutamate (Glu) and glutamine (Gln) have similar chemical compositions and narrow chemical shift ranges which often leads to significant MRS spectral overlap between the two metabolite resonances (i.e. Glx), with greater contamination taking place at lower field strengths (Snyder and Wilman, 2010). Higher field strengths (e.g. 3 T and above) increase proton polarization and, thus, signal intensity, which allows for a higher degree of spectral resolution and provides better separation of metabolite signals from background noise (Snyder and Wilman, 2010). For this reason, the current systematic review will collate all 1H-MRS findings to date using a field strength of 3 T or above used to investigate glutamatergic metabolite (i.e. Glu, Gln and Glx) levels across four distinct SSD groups (i.e. first-episode psychosis [FEP], schizophrenia, treatment-resistant schizophrenia [TRS] and ultra-TRS [uTRS]) across varying brain regions.
Though valuable, meta-analyses can often fail to capture nuances present in a vast and heterogeneous range of studies. Previous meta-analyses have highlighted the significant discrepancy between studies, particularly the small sample sizes, varying brain regions, variability among patient diagnoses, differing 1H-MRS methodologies, difficulty discerning between Glu and Gln at lower field strengths, etc. (Iwata et al., 2018; Kumar et al., 2020; Marsman et al., 2013; Merritt et al., 2016, 2021; Nakahara et al., 2022; Poels et al., 2014; Salavati et al., 2015; Smucny et al., 2021). Employing a meta-analytical method that solely combines data from various studies might not be optimal, potentially missing crucial findings, overlooking nuanced variations and diminishing their impact. Given the discrepancy in findings and the vast dataset, a standard systematic qualitative synthesis inadequately assesses glutamatergic neurometabolite levels in schizophrenia. This systematic review employs a mixed-methods approach, combining quantitative and qualitative analyses. Pooled effect sizes (Cohen’s d) will be calculated to estimate the overall effect of glutamatergic metabolite levels between SSD groups and healthy controls (HC) across brain regions. In addition, a qualitative synthesis will explore patterns and trends in findings to provide context to the quantitative results. The impact of medication status and methodological variables on metabolite quantification will also be assessed. This approach addresses limitations in previous work by providing a multidirectional assessment of the evidence related to the role of glutamatergic neurometabolites in schizophrenia. While prior reviews included research up to November 2020 (Nakahara et al., 2022), this review distinguishes itself by incorporating the latest research up to June 2022 with studies not included in previous reviews, thereby expanding the timeframe and collective sample size with an additional 447 SSD participants and 414 HC.
Methods
Data sources and search strategy
This systematic review was conducted in accordance with Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) guidelines. The systematic review protocol for this work was published as a preprint in PsyArXiv Preprints in August 2020 (Lopes et al., 2020). Deviations from the review protocol were: (1) to allow comparisons between participants within an SSD and HC, only studies with data on these two groups were included, (2) for longitudinal studies, only the first timepoint data was included, (3) where the same participant data was used in different studies only the publication with the largest reported sample size was included and (4) in studies with mixed diagnoses only data from the diagnostic groups included in our analysis were included.
A systematic computerized literature search of articles up to 28 June 2022 was conducted using Web of Science, PsychNet, EBSCOhost and PubMed. Search terms were selected to represent schizophrenia throughout its phenotypic presentations, Glu and Gln and 1H-MRS. These search terms were: (glutamate OR glutamine) AND (schizo* OR psychosis) AND (MRS OR ‘magnetic resonance spectroscopy’ OR ‘MR spectroscopy’) AND (patient OR participant OR people OR person).
Selection criteria
Studies were included if (1) they were part of an original empirical investigation, containing (2) adult human participant samples, of which (3) one or more participant groups had a diagnosis of FEP, schizophrenia, schizoaffective disorder or schizophreniform disorder, whose (4) brain metabolite concentration data included Glu and/or Gln (5) that was collected using 1H-MRS. Exclusion criteria included (1) animal studies, (2) case studies, (3) studies using post-mortem tissue and (4) 1H-MRS field strengths below 3 T. The authors were contacted for this information if the above criteria were not made explicit in the study. Studies were excluded if they were not case-controlled to include HC and if they did not report on independent glutamatergic statistical data. Inclusion was limited to studies performed in patients with FEP, schizophrenia, TRS and/or uTRS; studies were excluded if participant diagnostic information was not discriminated against or if collated 1H-MRS data pertained to mixed diagnoses.
Data collection
The selection procedure is illustrated in Figure 1 and followed PRISMA standards (Moher et al., 2009). A two-staged screening process was employed to determine record eligibility and was outlined in a preprint (Lopes et al., 2020). Duplicate articles were removed. Possible articles were initially screened independently (J.J.L.) for inclusion through reading article title, abstract and keyword screening. The full-text screening was then conducted independently by two assessors (J.J.L. and S.P.C.) for studies potentially meeting the inclusion criteria; discrepancies between reviewers were either resolved by consensus or arbitrated by a third reviewer (S.L.R.), where necessary.

PRISMA flow diagram for literature search and exclusion process.
Data extraction
Statistical data for clinical participants and HC, in terms of their glutamatergic metabolite concentrations, were extracted from the main text or Supplemental Materials. Statistical data included sample size (n), means (
Extracted data also included the SSD group, brain region investigated, specific glutamatergic metabolite(s) studied and 1H-MRS acquisition details, including magnetic field strength (T), echo time (TE), voxel size and 1H-MRS sequence. Data pertaining to participant clinical measures (i.e. medication status and symptom rating scales) were also included in this review. These included participant antipsychotic status and, where possible, chlorpromazine equivalent dose, the Positive and Negative Symptoms Scale (PANSS), the Scale for the Assessment of Negative Symptoms (SANS), the Scale for the Assessment of Positive Symptoms (SAPS) and the Brief Psychiatric Rating Scale (BPRS).
Statistical significance was noted, and Cohen’s d effect sizes were calculated for each between-group comparison (i.e. between SSD participants and HC), representing the magnitude of difference in glutamatergic metabolite levels between SSD participants and HC. For this analysis, a significance level of < 0.05 was used to determine the statistical significance of a study. Effect size was calculated using Cohen’s d, where standardized mean differences of 0.2, 0.5 and 0.8 correspond to small, medium and large effect sizes, respectively, while values below 0.2 were deemed negligible.
Statistical analyses
Cohen’s d was used to measure the effect size of glutamatergic metabolite concentration differences between SSD participants and HC. For the quantitative analyses of this review, Cohen’s d was chosen as the dependent variable due to it being a standardized measure of the strength of a relationship, independent of sample sizes. Furthermore, to account for the potential increased reliability of studies with larger sample sizes, the analyses in this review employed SSD group participant sample sizes as a weighting factor.
Kolmogorov–Smirnov tests revealed the Cohen’s d values for the metabolite concentrations across brain region and SSD groups were not normally distributed (p < 0.001). As such, this review employed non-parametric tests for the following analyses. Kruskal–Wallis tests were used to assess the relationship between Cohen’s d and SSD groups, brain regions and medication status, with separate analyses for Glu, Gln and Glx; Spearman’s correlation coefficients were used to explore the strength and direction of association between clinical measures and chlorpromazine equivalent doses with effect size. Chi-square tests of independence were used to examine the relationships between echo time, field strength and ¹H-MRS sequence and effect size (Cohen’s d). Echo times were categorized into short (TE < 30 ms), medium (30 ms ⩽ TE < 80 ms) and long (TE ⩾ 80 ms), while effect sizes were categorized into direction and magnitude in the following manner: negative large (d ⩽ −0.80), medium (−0.80 < d ⩽ −0.50) and small (d ⩽ −0.20), null (−0.20 < d < 0.20) and positive small (d ⩽ 0.20), medium (0.50 ⩽ d < 0.80) and large (d ⩾ 0.80). Pearson’s chi-square test and Cramer’s V were used to assess the association, with adjusted standardized residuals exceeding ± 3.2 used to identify significant contributions in the contingency table.
Unless otherwise stated, the above analyses were performed using IBM SPSS Statistics (Version 29). Effect size values indicate the direction and magnitude of glutamatergic metabolite level differences in participants within an SSD group with respect to HC.
The findings of this review are presented in three parts: first, a summary of the key characteristics of the reported studies; second, a qualitative analysis of the collected data, highlighting emerging trends and patterns and third, the above quantitative statistical analyses are performed and the relationships with Cohen’s d are reported.
Results
The literature and cross-reference searches initially returned 389 records which matched the search criteria. After the removal of duplicates, screening and assessment for eligibility, a total of 92 studies met our inclusion criteria and were included in the review (Figure 1). From the 92 studies, 11 studies did not provide participant means and standard deviations (Aoyama et al., 2011; Briend et al., 2020; Bustillo et al., 2014; Cadena et al., 2018; Chang et al., 2007; Gurler et al., 2021; Hugdahl et al., 2015; Jauhar et al., 2018; Rauchmann et al., 2020; Shakory et al., 2018; Théberge et al., 2003). While some studies assessed Glu, Gln and Glx in isolation within specific brain regions, the majority assessed multiple metabolites (e.g. creatine, N-acetylaspartate, myo-inositol) and voxel placements as part of one study. Many compared SSD and healthy control groups, leading to the number of finding reports being greater than the number of studies.
A total of 330 1H-MRS between-group comparisons were found, of which 265 reported statistically significant differences between SSD and healthy control participants. Notably, nine of the included studies (Bryant et al., 2021; Ćurčić-Blake et al., 2017; Kaminski et al., 2020; Kegeles et al., 2012; Leptourgos et al., 2022; Rowland et al., 2013; Shakory et al., 2018; Tayoshi et al., 2009; Wang et al., 2022) categorized SSD participants based on demographic characteristics, such as gender and age (e.g. male vs female, young vs old, etc.). For the purpose of this review, these subgroup analyses were integrated into their respective broader SSD group (e.g. FEP), resulting in an adjusted total of 304 between-group comparisons. Effect size was calculated for all but one between-group comparison where no relevant statistical data were reported (Théberge et al., 2003).
Study characteristics
Thirty-one studies recruited participants with FEP, 61 with schizophrenia, 5 with TRS and 2 with uTRS. In these studies, a total of 125 between-group comparisons were obtained for Glu levels, 47 for Gln levels and 132 for Glx levels; from these, 26, 2 and 39 normalized Glu, Gln and Glx, respectively, to creatine. Neuroanatomical regions of interest included: ACC, basal ganglia (including the associative striatum, caudate, putamen, striatum and substantia nigra), centrum semiovale, cerebellum, dorsolateral prefrontal cortex (DLPFC), frontal lobe (including the frontal cortex, frontal white matter [WM], medial frontal cortex [MFC] and orbitofrontal region), inferior frontal gyrus (IFG), insula, MPF (including the prefrontal cortex [PFC], medial PFC, prefrontal WM and ventromedial PFC), the temporal lobe (including the auditory complex, hippocampus, superior temporal gyrus [STG], superior temporal sulcus [STS], temporal cortex and temporal WM), thalamus and visual cortex (including the occipital cortex [OCC], OCC WM, parietal cortex and parieto-occipital cortex [POC]).
Sample sizes of included participants ranged from 6 to 114 for the patient group and 10 to 89 for HC. Seventy-four studies were performed at a magnetic field strength of 3 T, 6 at 4 T and 12 at 7 T. Regarding 1H-MRS sequences methods, point-resolved spectroscopy (PRESS) was used in 61 studies, stimulated echo acquisition mode (STEAM) in 17 studies, Mesher-Garwood (MEGA)-PRESS in 10 studies, semi-localized by adiabatic selective refocusing (sLASER) in three studies and spin echo full intensity acquired localized (SPECIAL) pulse sequence and MEGA-sLASER were used in one study each. Echo times varied between 5 and 500 ms. Study characteristics are provided in Table 2.
Study characteristics.
+: higher levels; –: lower levels; ACC: anterior cingulate cortex; Cr: creatine; CT: constant-time; CS: centrum semiovale; dACC: dorsal ACC; DLPFC: dorsolateral prefrontal cortex; FEP: first-episode psychosis; Gln: glutamine; Glu: glutamate; Glx: Glu + Gln; HC: healthy controls; IF: inferior frontal; L: left; MEGA: Mesher-Garwood; MFC: medial frontal cortex; MPF: medial prefrontal region; MPFC: medial prefrontal cortex; MS: milliseconds; OCC: occipital cortex; PFC: prefrontal cortex; POC: parieto-occipital cortex; PRESS: point-resolved spectroscopy sequence; PR: phase rotation; 1H-MRS: proton magnetic resonance spectroscopy; R: right; rACC: rostral ACC; Sequence: 1H-MRS sequence; sLASER: semi-localization by adiabatic selective refocusing; SPECIAL: spin echo full intensity acquired localized; SSD: schizophrenia spectrum disorders; ST: superior temporal; STEAM: stimulated echo acquisition mode; Sz: schizophrenia; T: field strength in Tesla; TE: echo time; TRS: treatment-resistant schizophrenia; uTRS: ultra-TRS; VMPF: ventromedial prefrontal cortex; WM: white matter.
‘n SDD’ and ‘n HC’ represent maximum number of participants who undertook 1H-MRS investigation. Actual sample size may vary depending on SSD, 1H-MRS sequence, voxel placement, metabolite and excluded participant data. ‘Result’ shows reported study SSD glutamatergic metabolite level results as statistically significant, represented by higher (+) and lower (–) levels, statistically not significant (NS) or as statistical significance not reported (NR), compared with HC.
Means and standard deviations were not provided in the study and, as such, the effect size was calculated using t-values instead.
The following descriptions of between-group comparison results denote the direction and magnitude of glutamatergic metabolite effect size (Cohen’s d) levels in participants within an SSD group compared with HC, where positive effect sizes indicate higher metabolite levels in SSD groups.
Qualitative analyses
Participant demographics
A total of 2721 HC and 2822 participants with an SSD diagnosis were included in this review, with a mean age ranging from 21 to 66 years across the SSD groups and 21 to 70 years in HC (Table 3). A consistent general trend of higher male participation (i.e. SSD = 67%; HC = 62%) compared with female was observed. Moreover, a clear age progression was noted, from FEP through to uTRS, with participants with more advanced or resistant forms of schizophrenia tending to be older. Overall, there was no significant difference in mean age between SSD participants and HC (p = 0.492).
Participant demographics and clinical characteristics.
AP: antipsychotics; BPRS: Brief Psychiatric Rating Scale; CPZ eq.: chlorpromazine equivalent doses; FEP: first-episode psychosis; HC: healthy controls; PANSS: Positive and Negative Syndrome Scale; SANS: Scale for the Assessment of Negative Symptoms; SAPS: Scale for the Assessment of Positive Symptoms; SSD: schizophrenia spectrum disorders; TRS: treatment-resistant schizophrenia; uTRS: ultra-TRS.
Age, chlorpromazine equivalent doses and clinical measure scores represent mean averages (mean standard deviations) calculated after using SSD group sample size as a weighing factor. Chlorpromazine equivalent doses in mg.
Clinical characteristics
Medication status
Clinical participants were described as either being antipsychotic-naïve, antipsychotic-free, being treated with antipsychotics, being treated with clozapine, or mixed (i.e. some participants were medicated, while others were not). FEP studies primarily focused on antipsychotic-naïve individuals (61%), with schizophrenia groups mainly assessing participants medicated with antipsychotics (52%) or mixed medication states (37%). In both TRS and uTRS groups, all participants were treated with antipsychotics (TRS: 60%), particularly clozapine (TRS: 40%; uTRS: 100%). A progression in chlorpromazine equivalent doses was noted, from FEP to uTRS (Table 3).
Clinical measures
Overall, followed by the BPRS (26%), the PANSS was the most frequently employed measure (56%) and the only one used among all SSD groups. Both tests captured overall symptom burden and were often accompanied by subscale scores for specific symptom domains (e.g. positive, negative, general). Apart from the SANS and the SAPS, studies did not generally combine different clinical measures within the same participants. Compared with the schizophrenia group, all other SSD group participants tended to have higher positive, negative and general symptom scores (Table 3).
FEP-specific characteristics
Thirteen studies employed specific criteria for the duration of psychotic symptoms prior to study inclusion. The average reported range of time from symptom onset to study entry was 31.85 (SD = 17.90) months. Eleven studies reported an average illness duration (i.e. the total time elapsed since symptom onset) of 16.67 (SD = 8.14) months. Sixteen studies reported a mean duration of untreated psychosis (i.e. time elapsed between the first noticeable onset of positive psychotic symptoms and the start of consistent antipsychotic treatment) of 9.45 (SD = 7.53) months.
Trends in regional glutamatergic metabolite levels
Overall, approximately 20% of all reported effect size (Cohen’s d) findings comparing SSD participants with HC were statistically significant. Statistically significant findings generally exhibited medium to large effect sizes (Cohen’s d). Regarding the direction of effect sizes, while Gln and Glx levels tend to be generally higher in SSD groups compared with HC, Glu was generally lower in SSD groups.
Following are the main trends in effect size magnitude and direction in glutamatergic metabolite levels of SSD groups in relation to HC across brain regions in statistically significant reports (Figure 2). Brain regions with a small number of between-group comparisons (i.e. n < 20) were not included in this summary. A detailed account of the findings is provided in Supplemental Materials.

Significant regional glutamatergic metabolite variations between SSD and HC.
Anterior cingulate cortex
The ACC revealed consistent lower levels of Glu for FEP and schizophrenia than HC. Gln appeared lower in schizophrenia. Glx levels were generally higher across all SSD groups.
Basal ganglia
Higher levels of Glu and Glx were noted in FEP than HC.
Medial prefrontal cortex
Lower levels of Glx were found in schizophrenia than HC.
Temporal region
In schizophrenia, while Glu appeared to have both higher and lower levels, Glx levels were generally lower than HC.
Thalamus
Findings show consistent higher levels of Gln in schizophrenia than HC.
Quantitative analysis of effect size
Brain voxels with fewer than 15 between-group comparisons were categorized by their respective brain region (i.e. basal ganglia, frontal lobe, MPF, temporal lobe and visual cortex) for analysis. However, 18 between-group comparisons, corresponding to regions of the centrum semiovale, cerebellar cortex, IFG and insula, were excluded from analysis due to their small number and inability to be categorized by brain region.
Comparisons across brain regions for SSD group
Figure 3 presents a comparative analysis of SSD groups and HC across various brain regions, focusing on the levels of each glutamatergic metabolite. The strength of the detected effects is represented through a colour gradient, with different shades indicating the magnitude of effect sizes (Cohen’s d). Accompanying letters in the figure denote the statistical significance of differences in effect size between SSD groups; groups that share the same letter are not significantly different in their Cohen’s d values. Effect sizes (η2) for differences between SSD groups and HC ranked Cohen’s d values were summarized across brain regions (Table 4).

Brain region-specific effect size means of Glu, Gln and Glx in SSDs.
Relationship between Cohen’s d and SSD group across brain regions.
ACC: anterior cingulate cortex; DLPFC: dorsolateral prefrontal cortex; Gln: glutamine; Glu: glutamate; Glx: Glu + Gln; MPFC: medial prefrontal cortex.
Effect sizes (η2) were conducted on ranked Cohen’s d values.
Glutamate
The basal ganglia (η² = 0.446, p < 0.001) and the frontal cortex (η² = 0.423, p < 0.001) displayed the greatest Cohen’s d mean differences for Glu, followed by the MPF (η² = 0.174, p < 0.001). Notably, while individuals with FEP exhibited moderately higher levels of Glu in the basal ganglia (d = 0.576), participants with schizophrenia exhibited moderate decreases in Glu within the visual cortex (d = −0.624). Pairwise comparisons found that FEP and schizophrenia consistently differed across brain regions (except ACC, p = 1.0, Bonferroni-corrected). In the ACC, the negative Cohen’s d in FEP and schizophrenia differed significantly from the positive Cohen’s d in TRS and uTRS with a moderate effect (η² = 0.095, p < 0.001). Within the DLPFC, all SSD groups revealed consistent negative Cohen’s d compared with controls, with a moderate effect (η² = 0.063, p < 0.001). A significant difference in Cohen’s d between TRS and uTRS was noted only in the basal ganglia (p < 0.001).
Glutamine
Only FEP and schizophrenia groups were assessed for Gln levels. The analysis revealed strong notable Cohen’s d differences in the basal ganglia (η² = 0.665, p < 0.001), the DLPFC (η² = 0.523, p < 0.001) and frontal cortex (η² = 0.520, p < 0.001). Across all regions, except the MPF (p = 0.418), pairwise comparisons revealed FEP and schizophrenia differed significantly in Cohen’s d (p < 0.001, Bonferroni-corrected) and, with the exception of the thalamus, typically differed in direction. Of note, compared with HC, significantly positive large Cohen’s d was found in the basal ganglia (d = 1.219) for FEP participants compared with controls, and positive medium Cohen’s d in the DLPFC (d = 0.506) and thalamus (d = 0.670) in schizophrenia.
Glx
A general trend of positive Cohen’s d emerged in SSD groups across brain regions, with TRS notably showing moderate positive Cohen’s d in both the basal ganglia (d = 0.705) and the DLPFC (d = 0.527). uTRS exhibited a negative Cohen’s d in the DLPFC (d = −0.324), significantly different from the positive Cohen’s d of the remaining SSD groups (η² = 0.108; p < 0.001). Markedly large Cohen’s d differences were found between SSD groups in the thalamus (η² = 0.370, p < 0.001), the MPF (η² = 0.313, p < 0.001), the frontal cortex (η² = 0.284, p < 0.001) and basal ganglia (η² = 0.148, p < 0.001). SSD Cohen’s d differed significantly across all brain regions (p < 0.001), except for the temporal region (η² = 0.004, p = 0.133).
Glutamatergic levels Cohen’s d and clinical variables
Medication status
Significant differences were found in glutamatergic metabolite levels across all categories of medication status (p < 0.001). Pairwise comparisons revealed significant differences in Cohen’s d between all medication statuses (p < 0.05, Bonferroni-corrected) in Gln, with some exceptions in Glu and Glx. In Glu, no significant differences in metabolite levels were found between individuals treated with clozapine and those who were antipsychotic-naïve (p = 0.589). In Glx, no significant differences were found between individuals treated with mixed antipsychotics and those who were antipsychotic-free (p = 1.00), between antipsychotic-free individuals and those treated with clozapine (p = 0.442) and between individuals treated with clozapine and antipsychotic-naive participants (p = 1.00). Regarding chlorpromazine equivalent doses, though significant Cohen’s d correlations were found with both Glu and Gln (p < 0.001; Table 5), these were found to be negligible (Glu: rs = 0.123; Gln: rs = 0.173).
Relationship strength between Cohen’s d and clinical variables.
BPRS: Brief Psychiatric Rating Scale; CPZ eq.: chlorpromazine equivalent doses; Gln: glutamine; Glu: glutamate; Glx: Glu + Gln; PANSS: Positive and Negative Syndrome Scale; SANS: Scale for the Assessment of Negative Symptoms; SAPS: Scale for the Assessment of Positive Symptoms.
Monotonic relationships were assessed using the Spearman correlation coefficient (rs).
Clinical measures
Significant correlations were found between glutamatergic metabolite level Cohen’s d and most clinical measure scores (p < 0.001; Table 5). Particularly, in Glu, strong monotonic relationships were found between Cohen’s d and the BPRS positive scale (rs = 0.647), followed by the SANS (rs = 0.437) and the SAPS (rs = 0.461), and a moderate association with the PANSS general scale (rs = 0.363). In Gln, the BPRS revealed a strong correlation with the positive scale (rs = 0.643), followed by a moderate relationship in both the total score (rs = 0.350) and the negative scale (rs = 0.350). Notably, Glx exhibited inverse moderate relationships between Cohen’s and the SANS (rs = −0.341), the SAPS (rs = −0.298) and the BPRS positive scale (rs = −0.315). In the PANSS, weak to moderate positive relationships were found in the positive scale (rs = 0.295) and general scale (rs = 0.294).
Glutamatergic levels Cohen’s d and methodological variables
Statistically significant associations were found between Cohen’s d for all three glutamatergic metabolites and echo time, field strength and MRS sequence (Table 6). Figure 4 depicts the adjusted standardized residuals across these associations, highlighting areas of significant deviation from the expected frequencies under the assumption of independence.
Association strength between Cohen’s d and methodological variables.
Gln: glutamine; Glu: glutamate; Glx: Glu + Gln.
The strength of association was calculated using Cramer’s V (V).

Standardized residuals between Cohen’s d and methodological variables.
Glutamate
Analyses revealed a moderate to possibly strong association between Glu levels Cohen’s d and echo time (Table 6; χ2 = 624.08, df = 12, p < 0.001, V = 0.313), field strength (χ2 = 529.31, df = 12, p < 0.001, V = 0.287) and MRS sequence (χ2 = 771.89, df = 30, p < 0.001, V = 0.219). Notably, while short echo times yielded fewer than expected positive Cohen’s d (Figure 4), medium echo times showed a greater number of both negative and positive Cohen’s d. Regarding field strength, while scans at 3 T appeared to yield more positive Cohen’s d than expected, the opposite was noted at 7 T. As for MRS sequence, overall positive Cohen’s d were more frequent with MEGAPRESS and PRESS, as opposed to less frequent with STEAM.
Glutamine
In Gln, these associations were even more pronounced, with echo time (χ2 = 165.21, df = 6, p < .001, V = 0.346), field strength (χ2 = 516.18, df = 12, p < 0.001, V = 0.427) and MRS sequence (χ2 = 625.11, df = 24, p < 0.001, V = 0.332) displaying robust connections with Cohen’s d (Table 6). Overall, Gln had the fewest cells with adjusted residuals that deviated from the expected (Figure 4).
Glx
Cohen’s d in Glx had a moderate to possibly strong association with methodological variables (echo time: χ2 = 339.26, df = 12, p < 0.001, V = .221; field strength: χ2 = 401.61, df = 12, p < 0.001, V = 0.239; MRS sequence: χ2 = 457.80, df = 18, p < 0.001, V = .208). Of note, field strengths of 3 T and MEGA-sLASER MRS sequences had the fewest deviations from expected.
Discussion
Using the largest pooled dataset to date, this review presents a comprehensive examination of the variations in glutamatergic metabolite levels across different SSD groups and brain regions, as well as across clinical and methodological variables. Given the large and diverse datasets in this field, organizing the data into coherent themes or patterns poses significant challenges. By leveraging an integrated approach that synthesizes both qualitative observations and quantitative analyses, this review provides a more robust understanding of the intricacies underlying these variations.
Regional glutamatergic metabolite differences across SSD groups
Qualitative trends
Approximately 20% (n = 57) of comparisons revealed significant differences in glutamatergic metabolite levels between SSD groups and HC, generally with medium to large effect sizes. Overall, compared with HC, most studies reporting significant metabolite levels found higher Gln (60%) and Glx (70%) levels in SSD groups, while Glu (58%) was generally reported lower in SSD groups. The overall body of evidence, however, remains mixed, with no uniform pattern across SSD groups or brain regions (Figure 2). Noteworthy trends were highlighted in the metabolite measurements across brain regions and SSD groups. Specifically, compared with HC, most studies reporting significance in the ACC exhibited lower Glu levels in FEP and schizophrenia and higher Glx levels throughout SSD groups. Studies reporting significance in the basal ganglia mainly showed elevated Glu and Glx levels in FEP, suggesting the possibility of distinct regional variations in glutamatergic metabolite alterations early in the disease progression. In the thalamus, studies suggest higher levels of Gln in schizophrenia groups, which is consistent with previous meta-analyses (Merritt et al., 2016; Salavati et al., 2015).
Quantitative insights
Our findings reveal significant variability in the distribution of glutamatergic metabolite concentrations across SSD groups or brain regions, suggesting a complex pathology that varies with disease state and neuroanatomy, as well as varying patterns of glutamatergic metabolite alterations across SSD groups and different brain regions. However, for Glu, schizophrenia revealed consistently lower Cohen’s d values across all brain regions, except for the basal ganglia (d = 0.003), supporting the hypothesis of lower overall Glu levels across the brain in this SSD group, compared with HC.
Significant differences in Cohen’s d between SSD groups and HC were found across all brain regions, with the exception of the temporal region. Compared with HC, the basal ganglia exhibited the largest Cohen’s d mean difference for Gln in the FEP group, and the most statistically significant Gln differences between both FEP and schizophrenia groups. These significant differences in Cohen’s d between FEP and schizophrenia were maintained throughout most brain regions and glutamatergic metabolites (Figure 3), highlighting an early hyperglutamatergic state, which may normalize or invert as the illness progresses or with treatment, or potentially reflect different stages of glutamatergic dysfunction. These findings are consistent with previous meta-analyses which highlighted higher levels of Glu and Glx in the basal ganglia of patients with FEP and schizophrenia (Merritt et al., 2016; Nakahara et al., 2022; Poels et al., 2014). This evidence joins previous work in supporting a glutamatergic dysfunction in the basal ganglia which is apparent at both early stages and throughout illness progression. Table 7 provides an updated summary of these findings.
Summary of 1H-MRS glutamatergic metabolite level findings.
↑: higher levels; ↓: lower levels; ACC: anterior cingulate cortex; BG: basal ganglia; DLPFC: dorsolateral prefrontal cortex; FEP: first-episode psychosis; Gln: glutamine; Glu: glutamate; Glx: Glu + Gln; 1H-MRS: proton magnetic resonance spectroscopy; MPF: medial prefrontal region; SSD: schizophrenia spectrum disorders; Thal: thalamus; TRS: treatment-resistant schizophrenia; uTRS: ultra-TRS.
The table provides a summary of the findings in this review regarding glutamatergic metabolite levels across SSD groups and brain regions, compared with HC; in
These findings, however, must be considered in the broader context of participant medication status, as well as methodological factors, as these may influence the measurement of glutamatergic metabolite concentrations. In addition, the predominance of male participants (SSD = 67%) highlights a potential gender bias into the findings. Sex-specific factors, such as hormonal differences and variations in drug metabolism (Brand et al., 2022), may affect glutamatergic system modulation differently in males and females.
Medication status
Most SSD participants (74%) were being treated with antipsychotic medication at the time of their enrolment, which could influence the measurement of glutamatergic metabolite concentrations in the brain (Egerton et al., 2017, 2021; Van der Heijden et al., 2004). Significant variations in Glu, Gln and Glx levels across different medication statuses suggest that these metabolites may be particularly sensitive to antipsychotic medication effects and serve as potential pharmacokinetic biomarkers. The lack of significant differences in Glu levels between clozapine-treated and antipsychotic-naïve participants, alongside the negligible correlations with chlorpromazine equivalent doses, contributes to the understanding that clozapine treatment may contribute to Glu modulation (Fukuyama et al., 2019; Hribkova et al., 2022; McQueen et al., 2021). While antipsychotic-naïve FEP individuals exhibit distinct glutamatergic profiles, the progression to higher chlorpromazine equivalent doses in uTRS does not appear to linearly correlate with Glu or Gln levels, suggesting that the effects of antipsychotics on glutamatergic metabolites are not straightforward and may be influenced by long-term antipsychotic treatment, illness progression, or individual patient factors. Future research should aim to elucidate the effects of individual antipsychotic drugs and dosages on neurometabolite levels.
Clinical measures
This review found significant relationships between Cohen’s d for glutamatergic metabolite levels and clinical measures, underscoring the potential relevance of these metabolites in the pathophysiology and symptomatology of schizophrenia. Regarding symptom scores, non-schizophrenia groups tend to have higher scores on positive, negative and general symptoms, suggesting that the severity and presentation of symptoms may evolve with illness progression and treatment history. Moderate and strong monotonic relationships between both Glu and Gln and most clinical measures, mainly the BPRS, SANS and SAPS, suggest a significant association between glutamatergic dysfunction and psychotic symptoms.
Methodological considerations
The statistically significant associations between Cohen’s d for glutamatergic metabolites echo time, field strength and 1H-MRS sequence highlight the importance of considering methodological factors in neuroimaging studies. These moderate to strong associations can considerably impact the MRS measurement of metabolite concentrations.
The association between glutamatergic metabolite levels and echo time may reflect the sensitivity of metabolite measurements to T2 relaxation effects, which can vary with echo time. While shorter echo times can minimize J-evolution and T2 relaxation, longer echo times can decrease macromolecule contributions and improve the separation of Glu and Gln peaks (Wong et al., 2018). Notably, the results suggest that shorter and longer echo times yield fewer positive Cohen’s d values for Glu, while negative values are more prevalent at short and medium echo times.
Field strength also appears to influence the magnitude of the observed effects. Though higher field strengths amplify proton polarization and signal intensity, resulting in improved spectral resolution and clearer separation of metabolite signals from noise (Snyder and Wilman, 2010), only in Gln were there few deviations from the expected at 7 T. In Glu, field strengths of 7 T yielded fewer positive Cohen’s d values and more negative values, the opposite of 3 T. Researchers conducting 1H-MRS investigations should carefully consider the potential impact of field strength selection on their findings, particularly related to Glu and Glx measurements.
Furthermore, specific sequences like MEGAPRESS and PRESS, as opposed to STEAM, tended to yield more positive Cohen’s d values for Glu, implying that certain 1H-MRS sequences could be more reliable for detecting changes in glutamatergic metabolites. These findings warrant further investigation to understand the underlying mechanisms by which different 1H-MRS sequences influence the detection of glutamatergic metabolites.
Future investigations into the influence of methodological factors on glutamatergic metabolite measurements should explore the interaction and relationship between these factors and neurometabolite quantification. In addition, the regional heterogeneity of the brain and its potential impact on these measurements should be examined.
Strengths, limitations and future directions
A significant strength of this review is the incorporation of a large number of studies and comparisons, which enhances the statistical power and generalizability of the findings. The combination of qualitative and quantitative approaches allowed for a comprehensive assessment of the data, capturing both the magnitude of effects and underlying patterns. Compared with earlier reviews, ours extends up to June 2022, encompassing recent studies not included in previous meta-analyses, the latest of which only covered literature up to November 2020 (Nakahara et al., 2022), and an additional 447 SSD participants. Furthermore, weighting of Cohen’s d averages based on SSD group participant sample size further improved the precision of the effect size estimates and ensured that studies with larger sample sizes, which provide more reliable estimates of the true effect, have a proportional influence on the overall analysis. In addition, the addition of eta-square values and Cramer’s V provided a more thorough analysis in considering the magnitude and relevance of the detected effects.
However, the main limitation of this review is the limited number of available studies, particularly those examining Gln, TRS and uTRS. Also, several studies investigating FEP fail to consider symptom onset and lack follow-up evaluations to track illness progression. Thus, conclusions drawn regarding FEP might not accurately depict its role in the progression towards schizophrenia, given that FEP can either remain stable, transition into schizophrenia or develop into other disorders. Moreover, the heterogeneity in brain voxel placement and size, 1H-MRS methodology and scanner specifications hinder the ability to draw significant statistical relationships between studies due to the lack of direct comparability among these variables. In addition, while the review attempted to control for medication status, the complexity of antipsychotic treatment regimens, polypharmacy, symptomatic presentation and their impact on glutamatergic metabolism could not be fully disentangled. This variability underscores the need for future research employing standardized methodologies, larger sample sizes and comprehensive assessment of clinical heterogeneity.
To better understand the involvement of glutamatergic metabolites in the pathophysiology of schizophrenia in future research, we propose the following: (1) Focus on brain regions consistently showing changes in glutamatergic metabolite levels, such as the ACC, basal ganglia and DLPFC in FEP, the ACC and thalamus in schizophrenia and the ACC in TRS and uTRS; (2) adopt an operational definition of FEP with a focus on the onset of initial psychotic symptoms; (3) conduct follow-up diagnostic assessments for FEP to monitor progression into schizophrenia, distinguishing early stages of schizophrenia from other conditions; (4) increase the number of TRS and uTRS studies; (5) assess Glu and Gln metabolite levels individually rather than the combined signals (i.e. Glx) for better differentiation; (6) opt for a field strength of 3 T or above to avoid Glu signal contamination with Gln; (7) increase statistical power with larger sample sizes and (8) employ greater efforts in reducing recruitment gender bias. These recommendations stem from the evidence presented in this review, which provides support for the glutamate hypothesis of schizophrenia by presenting evidence of consistent abnormal levels of glutamatergic metabolites throughout the course of SSD, particularly in the basal ganglia, DLPFC, thalamus and visual cortex. Nonetheless, further research is needed to fully elucidate the precise mechanisms underlying the relationship between glutamate abnormalities and the emergence and manifestation of symptoms throughout the course of the SSD.
Conclusion
The results of this comprehensive review emphasize evidence of altered levels of glutamatergic metabolites across the SSD, with a particular focus on certain areas of the brain. Given the considerable heterogeneity in the literature, we highlight the importance of standardized future research for the assessment of glutamatergic metabolites using 1H-MRS.
Supplemental Material
sj-docx-1-anp-10.1177_00048674241254216 – Supplemental material for Glutamatergic neurotransmission in schizophrenia: A systematic review and quantitative synthesis of proton magnetic resonance spectroscopy studies across schizophrenia spectrum disorders
Supplemental material, sj-docx-1-anp-10.1177_00048674241254216 for Glutamatergic neurotransmission in schizophrenia: A systematic review and quantitative synthesis of proton magnetic resonance spectroscopy studies across schizophrenia spectrum disorders by Jamie J Lopes, Sean P Carruthers, Denny Meyer, Brian Dean and Susan L Rossell in Australian & New Zealand Journal of Psychiatry
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship and/or publication of this article: S.L.R. holds a Senior National Health and Medical Research Council (NHMRC) Fellowship (GNT1154651).
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
