Abstract
While assessments of transparent reporting practices in meta-analyses are not uncommon in the field of health sciences interventions, they are limited in the social sciences and to our knowledge are non-existent in criminology. Modified PRISMA 2020 checklists were used to assess transparency and reproducibility of reporting for a sample of 33 meta-analyses of intervention/prevention evaluations published in scholarly journals between 2016 and 2021. Results indicate that the average rate of transparent reporting practices was 63%; adherence varied considerably across studies and subscales, with low rates of adherence for some core checklist items. Overwhelmingly, studies were not reproducible in their entirety; article word count was significantly correlated with reproducibility (r = 0.4028, p < .03). These findings suggest that substantial changes to reporting practices are necessary to meet traditional meta-analytic claims of transparency and reproducibility. Study limitations include sample size, coding instruments, and coding subjectivity.
Meta-analysis is an increasingly popular quantitative research synthesis technique (see Williams et al., 2017) that involves explicit and systematic methods to identify a population of relevant studies and statistically synthesize results across studies to produce an average effect. The technique differs from a traditional narrative literature review, in which findings from a non-systematically generated set of studies are discussed and summarized qualitatively. While meta-analysis is not without its criticisms (e.g., Bailar, 1997; Berk, 2007), proponents argue for its strengths with respect to decreasing bias in study identification (e.g., Thompson & Belur, 2016), utility in summarizing large and/or disparate bodies of research (e.g., Siddaway et al., 2019; Wilson, 2001), and proficiency in addressing the oft-cited concern of “proliferation without accumulation” (e.g., Wells, 2009), in which the volume of studies on a given topic increases but definitive knowledge does not increase accordingly.
When compared with traditional forms of research synthesis such as narrative review and vote-counting, meta-analysis has several advantages (see Borenstein et al. (2009) for a more detailed overview). These include, primarily, (a) greater precision in estimating the effect in question (through converting individual study results into a common metric to enable cross-study pooling; Wilson, 2001), (b) a focus on the direction and size of each study’s treatment impact rather than on statistical significance (which recognizes that non-significant results are rarely effects of zero and that statistical significance is often a function of sample size; Wells, 2009), (c) the ability to quantitatively examine potential causes of variations in treatment effect magnitude (such as the impact of study, treatment, measurement, or participant characteristics; Williams et al., 2017), (d) increased objectivity regarding study selection and inclusion in the analysis by way of a pre-defined set of criteria (Thompson & Belur, 2016), and (e) methodological transparency which limits hidden biases and assumptions and enables replication of the research by others (Gough et al., 2017; Lipsey & Wilson, 2001; Siddaway et al., 2019). With respect to (e), methodological transparency suggests that, if provided with the complete list of inclusion/exclusion criteria and sources for study identification (i.e., bibliographic databases, grey literature sources), along with the set of methodological decisions concerning effect size calculation and study pooling, independent researchers could theoretically reproduce the findings of the original study. The purpose of this paper is to examine whether recent journal publications of meta-analyses in the field of criminology meet the expectations of transparency and reproducibility. Given that meta-analyses are often highly cited resources which are presumed to present the most comprehensive summative evidence in a field, ensuring that these reports are explicit about subjective research decisions, and that implementation of these decisions is verifiable by third parties, is essential.
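To make advantage (a) concrete: a binary recidivism outcome reported as an odds ratio can be placed on the same scale as a continuous outcome reported as a standardized mean difference. The short sketch below is our own illustration (not drawn from the works cited above) of the logistic-distribution approximation described by Borenstein et al. (2009):

```python
import math

def log_odds_ratio_to_d(odds_ratio: float) -> float:
    """Convert an odds ratio to Cohen's d via the logistic approximation
    d = ln(OR) * sqrt(3) / pi (see Borenstein et al., 2009, on converting
    among effect sizes)."""
    return math.log(odds_ratio) * math.sqrt(3) / math.pi

# A hypothetical treatment that halves the odds of recidivism (OR = 0.5)
# maps to d of roughly -0.38, so it can be pooled with studies that
# report standardized mean differences directly.
print(round(log_odds_ratio_to_d(0.5), 2))  # -0.38
```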
Transparency and Reproducibility
That meta-analyses are both transparent and reproducible is a common narrative in the literature touting the strengths of meta-analysis in comparison to other forms of research synthesis. A quick foray into classic meta-analysis handbooks and renowned authors in the field highlights these claims. For example, Lipsey and Wilson (2001) write in Practical Meta-Analysis: “Good meta-analysis is conducted as a structured research technique in its own right and hence requires that each step be documented and open to scrutiny…. By making the research summarizing process explicit and systematic, the consumer can assess the author’s assumptions, procedures, evidence, and conclusions rather than take on faith that the conclusions are valid” (pp. 5–6).
Likewise, as noted by Borenstein et al. (2009) in Introduction to Meta-Analysis: “…because all of the decisions are specified clearly, the mechanisms are transparent…. While the reviewers and readers may still differ on the substantive meaning of the results (as they might for a primary study), the statistical analysis provides a transparent, objective, and replicable framework for this discussion” (p. xxiii).
Transparency and reproducibility are overlapping but not synonymous constructs. “Transparency” refers to the completeness of reporting in a given meta-analysis document, such that all methodological elements, decision points, findings, and conclusions are presented in full. Meta-analysis requires an elaborate sequence of steps; not surprisingly, studies vary in their degree of transparency in reporting of processes and decisions concerning the literature search, inclusion criteria for candidate studies, and methods for computing effect sizes and quantitatively pooling the results. Further, despite the systematized processes in literature searches and cross-study pooling that are inherent to meta-analysis, syntheses range in overall quality.
“Reproducibility” refers to whether the results of a given meta-analysis could be reproduced in their entirety based on the details presented in the original report (see Lakens et al., 2016). While many elements of transparency are also required for reproducibility, some, such as multi-coder involvement in data extraction, implications of the findings, and financial support for the study, are not. On the other hand, reproducibility requires a level of information not traditionally expected of transparency, such as study-level detail on effect size computation and related decisions on sample size when the primary report is not clear. As per Williams et al. (2017), “With so many potential sources of variance across these decisions, it is easy to imagine investigators coming to conclusions that differ, at least slightly” (p. 269). Reproducibility is possible only if meta-analyses present all decision and data points in full.
Why do Transparency and Reproducibility Matter?
As meta-analyses purportedly represent the summative state of the current literature on a given topic, they are often widely read, influential documents (Lakens et al., 2017; Polanin et al., 2020). While transparency in all forms of research is important, it is even more indispensable for meta-analysis. Critics contend that, given the large series of subjective decision-making steps involved in such syntheses, meta-analyses may produce results that are misleading at best and erroneous at worst (Ferguson & Kilburn, 2010; Ioannidis, 2016). Further, obscurity concerning methodological processes has been raised as a validity concern (Gotzsche et al., 2007; Jones et al., 2005; Lakens et al., 2017). For example, a meta-review by Ford et al. (2009) reported a 100% error rate across eight systematic reviews of interventions to treat irritable bowel syndrome; errors included the use of trials that were ineligible according to the stated review inclusion criteria, the omission of trials that should have been included (according to those same criteria), errors in data extraction, and errors in pooled treatment impact calculation. Given the critical importance of the consolidation of existing literature to our understanding of current knowledge, as well as the “replication crisis” often lamented in the field (e.g., Losel, 2018; Pridemore et al., 2018), the production of valid and robust meta-analyses is imperative.
Importantly, we note that the issue of transparent and reproducible reporting in meta-analysis is distinct from the issue of meta-analysis methodological quality. While the two may be linked in some cases (that is, a low-quality study may be characterized by low-quality reporting; e.g., Tunis et al., 2013), it is certainly possible for a high-quality study to display a lack of transparency in its reporting of methods and findings. Similarly, a low-quality study suffering from methodological concerns may adhere to high standards of reporting. Systematic review and meta-analysis involve a series of sequential steps in which subjective decision-making is introduced repeatedly; different analysts often make different choices at each decision point, and whether a given study is “high” versus “low” quality with respect to some of these decisions is up for debate. Far less debatable is the fact that high-quality, transparent reporting in meta-analysis allows third party reviewers and readers to ascertain study quality themselves.
Lakens et al. (2016) list additional benefits to reproducibility, including the potential for independent analysts to subsequently adjust study inclusion criteria or methods of effect size calculation and conduct new analyses. These new analyses may result in different conclusions than the first synthesis, similar to the frequent finding in primary study replication wherein the results of the original study are not supported (see, for example, Derry et al. (2006) on the null impacts of acupuncture once inclusion criteria for the synthesis are limited to randomized, double-blind trials). Another benefit to complete transparency in reporting is the increased ease for future meta-analytic updates as new empirical research on the topic is published, or as new analytic techniques are developed which may be utilized to increase the validity of effect size estimation or study aggregation (Lakens et al., 2016; Polanin et al., 2020).
Reporting Guidelines for Meta-Analyses
Reporting guidelines for systematic reviews and meta-analyses are not new; multiple frameworks exist, such as the MARS (Meta-Analysis Reporting Standards; Cooper, 2010) and the Campbell Collaboration’s MEC2IR reporting standards (The Methods Group of the Campbell Collaboration, 2019). Other frameworks focus on assessing the methodological quality of systematic reviews and meta-analyses, most notably AMSTAR (A MeaSurement Tool to Assess systematic Reviews; Shea et al., 2017). With respect to reporting guidelines, arguably the most widely used checklist is the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement (Page et al., 2021a, 2021b). The PRISMA statement was designed to be used by authors of systematic reviews and meta-analyses to improve reporting, and by journal peer reviewers and editors as a tool for critical appraisal of the reporting of systematic reviews.
The PRISMA 2020 statement and associated checklists were released in March 2021 as an update of the previous guidelines published in 2009 (available at http://www.prisma-statement.org/PRISMAStatement/PRISMAStatement). The updated guidelines contain few substantive differences in terms of actual content to be reported; rather, the 2020 guidelines include expanded subsections for coding of key elements noted in the 2009 guidelines. The 2009 statement was “widely endorsed and adopted, as evidenced by its co-publication in multiple journals, citation in over 60,000 reports (Scopus, August 2020), endorsement from almost 200 journals and systematic review organizations, and adoption in various disciplines” (Page et al., 2021a). The 2020 checklist contains seven sections (e.g., Introduction, Methods, Results) and a total of 42 checklist items (Page et al., 2021a, 2021b). In addition, the PRISMA statement is accompanied by a series of “extensions” that provide reporting guidelines for different types or aspects of meta-analyses/systematic reviews. Most relevant to the current study is the 16-item “PRISMA for Searching” (PRISMA-S) checklist, which provides an expanded reporting checklist for the information sources and search strategies involved in a systematic review (Rethlefsen et al., 2021).
Prior Research Examining Transparency and Reproducibility
With respect to the PRISMA reporting guidelines, research examining systematic reviews and meta-analyses finds low adherence to the guidelines overall. For example, Sun et al. (2019) examined 64 systematic reviews and meta-analyses of nursing interventions for Alzheimer’s patients by scoring studies on the PRISMA 2009 checklist. The mean PRISMA score across all studies was 19.3 out of a possible 27 points (SD = 4.17), with six items reported in less than 50% of the studies. Similarly, Peters et al. (2015) used the PRISMA checklist to rate meta-analyses of otorhinolaryngologic articles published in the top five ear, nose, and throat journals. Peters and colleagues found a median score of 54.4% on the checklist overall, with a lower adherence rate for the PRISMA-Abstracts checklist (41.7%; a separate checklist of recommendations for transparent reporting in abstracts). Other research examining PRISMA-Abstracts has reported similarly low rates of checklist compliance, including Tsou and Treadwell (2016), with 60% of items reported on average across 200 reviews, and Maticic et al. (2019), with a mean adherence rate of 42% across 244 studies.
Research on transparency and/or reproducibility in the social sciences is relatively limited and has predominantly been conducted in the field of psychology (e.g., Aytug et al., 2012; Brugha et al., 2012; Dieckmann et al., 2009). With respect to reproducibility, Lakens and a team of 14 colleagues (2017) attempted to reproduce 20 published meta-analyses in the psychological sciences. The authors noted “there was unanimous agreement across all seven teams involved in extracting data from the literature that reproducing published meta-analyses was much more difficult than we expected” (p. 8). Results suggest that 25% of the studies could not be reproduced at all due to missing data, and, of the remaining studies, there were frequently differences between the effect sizes calculated by Lakens et al. and those reported by the original authors due to lack of reporting of effect size conversion equations used, lack of clarity on sample sizes, and so forth. With respect to transparency, Hohn et al. (2020) examined reporting practices in a random sample of 345 psychological meta-analyses published between 2009 and 2014. Using 45 items from the Quality Assessment for Systematic Reviews—Revised (QUASR-R; Slaney et al., 2017), the authors compared the actual practices of studies (e.g., search methods) versus those that were reported, and identified several areas of concern. Major gaps in reporting included primary study quality (reported by 36% of the 345 reviews), type of meta-analytic model used (e.g., random effects, fixed effects; reported by 80%), and power analyses (reporting almost non-existent).
Most relevant to the current study is a recent article by Polanin et al. (2020), who examined transparency and reproducibility in published meta-analyses in the journal Psychological Bulletin over a 30-year period (1990–2020). Based on the PRISMA checklist, Polanin and colleagues developed a 34-item checklist to represent important components of transparency and reproducibility, then scored 150 studies on the checklist. The authors found relatively weak adherence overall; just over half (55%) of all checklist items were reported in each study. Some examples of Polanin et al.’s specific findings are that only 58% of authors reported their source of funding, 2% reported using a systematic review protocol, 64% defined the criteria for study population eligibility, 95% listed eligible outcome measures, 77% specified the search terms used in databases, 48% reported dates of searches, 77% reported on data transformations, and 57% mentioned publication bias.
The Current Study
While prior research in the health sciences and psychology suggests that the transparency and reproducibility of meta-analyses are low, to date no study has examined meta-analyses in criminology. The goal of the current study is to conduct a preliminary inquiry into whether criminological meta-analyses published in scholarly journals meet the traditional claims of transparency and reproducibility, by (1) scoring the extent to which best practices in transparent reporting are followed, based on the PRISMA 2020 checklist and PRISMA-Search checklist, and (2) using a PRISMA-based Reproducibility checklist to assess overall study reproducibility and identify any common barriers. Importantly, we note that the PRISMA 2020 guidelines were not in existence when any of the studies in the current sample were published. We underscore that the purpose of the current study is not to rate meta-analysts on how well they adhered to reporting guidelines that were not yet established, nor is it to criticize authors for failing to observe existing guidelines. Rather, we examine how closely recent meta-analyses in criminology adhere to one example of state-of-the-art reporting guidelines, and identify where areas for improvement exist. We reiterate that whether a meta-analysis is reported in a manner that is highly transparent, and/or in a manner that would allow reproduction by others, should not be confused with a measure of the quality of the meta-analysis or its methodology (e.g., see the AMSTAR 2 checklist by Shea et al., 2017); it is a reflection of the manner in which the study was reported.
Methods
Data
To locate recently published meta-analyses of intervention/prevention evaluations in the field of criminology, we searched Criminal Justice Abstracts (EBSCOhost) with date limiters of January 1, 2016 to March 10, 2021. The following terms were combined in an Abstract search: “meta analysis” AND (recidiv* OR arrest* OR charge* OR convict* OR incarcerat* OR offend* OR offense* OR offence* OR crim*). Given the general difference in conceptual orientation, we limited the inclusion of studies to those presenting a meta-analysis of intervention/prevention program evaluations, as opposed to a meta-analysis of risk factors, attitudes, behaviors, actuarial assessments (e.g., the Static-99), drug treatments, and so forth. In addition, as we were interested in the transparency of reporting of meta-analyses published in scholarly journals, we limited studies to those published in peer-reviewed journals (as opposed to technical reports or Campbell Collaboration reports, which can range up to 100+ pages). The nature of the planned analyses and presentation of results was at the individual study level; this was to enable a preliminary assessment of transparency and reproducibility, demonstrate study-level similarities and differences, and ascertain potential areas for improvement. We emphasize that the goal of the search was to identify a fairly small set of recent meta-analyses in the field of criminology; the search was not structured to be a systematic literature search across numerous potentially relevant databases and grey literature sources.
Instrumentation
(1) Transparency.
The PRISMA 2020 checklist and the PRISMA-Search checklist were used to assess the transparency of reporting across the sample of studies. Each checklist is described below.
PRISMA 2020
Table 1. PRISMA 2020 Checklist (Page et al., 2021a) and Coding Detail for Current Study.
Notes: a Core = the item is a required element of a meta-analysis and all studies were coded on it. Optional = the item is an optional element of a meta-analysis; studies were coded 99 if it was assumed that the element was not implemented, and 0 if it was clear that the element was implemented but not fully reported. Not included = the item in the original PRISMA 2020 checklist was not included in the current study’s modified checklist because it is coded in more detail in another checklist (PRISMA-Abstracts or PRISMA-Search).
Of the 38 PRISMA 2020 items, we consider 27 to be “core” reporting items for a meta-analysis, and 11 items to be “optional.” Core reporting items are those that are central to the conduct of a basic systematic review and meta-analysis (see Polanin et al. (2020) for a discussion of mandatory vs. optional criteria). For example, core items include #12: description of outcome effect measures, #16a: results of the search and selection process, and #22d: a discussion of any limitations of the evidence included in the review.
Optional reporting items are steps that may certainly improve the quality of a meta-analysis if they are implemented, but that are not indispensable to completing a basic synthesis of the literature (and in some cases may not be feasible given a small set of studies). These include item #11: specify methods used to assess risk of bias in included studies, #13f: describe any sensitivity analyses used, #20d: present results of heterogeneity investigations, and #22: present assessments of certainty in the body of evidence for each outcome assessed. Full details on core versus optional reporting items are presented in Table 1.
PRISMA-Search
The PRISMA-Search checklist is an extension of the PRISMA 2020. The checklist encompasses 16 items; for example, #1: database names, #5: citation searching, and #8: full search strategies. To avoid issues associated with coding double-barreled items, we modified this checklist by adding subcategories to three items as follows: we split item #4: Online resources and browsing into (a) hand-searched journals and (b) websites; we split item #5: Citation searching into (a) reference lists of included studies and (b) reference lists of existing reviews; and we split item #8: Full search strategies into (a) database search strategies and (b) search strategies for other sources.
For the modified PRISMA-Search checklist, the total possible score was 19 points. In addition, we computed sub-scores for “Information and methods” (subtotal = 9 points) and “Search strategies” (subtotal = 7 points). Both core (n = 6; e.g., #1: name each individual database searched; #9: limits and restrictions) and optional (n = 13; e.g., #3: list any study registries searched; #6: indicate whether studies were sought by contacting authors or experts) reporting items were included. See Table 2 for the complete checklist and the modifications made.
Table 2. PRISMA-Search Checklist (Rethlefsen et al., 2021) and Coding Detail for Current Study.
Notes: a Core = the item is a required element of a systematic search and all studies were coded on it. Optional = the item is an optional element of a systematic search; studies were coded 99 if it was assumed that the element was not implemented, and 0 if it was clear that the element was implemented but not fully reported.
(2) Reproducibility.
A checklist to assess study reproducibility was developed based on the PRISMA 2020 checklists. The Reproducibility checklist is intended to assess whether the meta-analysis could be reproduced using the information presented in the article; this checklist is notably shorter than the full 38-item PRISMA 2020 checklist because, as previously noted, not all elements of transparency are necessary for study reproduction. Importantly, we did not actually attempt to replicate any of the searches, effect size calculations, or meta-analyses, as this task was beyond the scope of the paper and we were not focused on attempting to validate the procedures or results of the included studies. Rather, we reviewed each study and, based on our prior experience in coding for and conducting multiple meta-analyses and our familiarity with the necessary level of detail for various data points, determined whether sufficient information was reported to enable us to reproduce each study element. The 22-item checklist contains two sections (Search, n = 13 items; Results, n = 9 items); see Table 6.
Coding
All coding was completed by the two study authors. The first author coded all studies in the sample, and the second author repeated the same process in order to validate the initial codes. Discrepancies and disagreements between the first and second round of coding were discussed and multiple iterations of coding were conducted in order to reach 100% consensus on codes across all studies.
(1) Transparency.
Each study was coded on all PRISMA checklist items using a scoring system in which 0 = implemented but not reported at all, 0.5 = implemented but partially reported, and 1 = implemented and adequately reported. As it is largely uncommon for authors to explicitly report that they did not implement a step in a meta-analysis, for the checklist items that we designated as optional we coded 99 if the item was not reported. In other words, in many cases we assumed that a study did not report on a PRISMA checklist item because the authors did not implement the item.
This coding is distinct from situations involving a clear reporting omission from an item such as study eligibility criteria, which would have been coded as 0 or 0.5.
(2) Reproducibility.
Scoring options were 0/1 with respect to whether the article reported the checklist item in sufficient detail for it to be reproduced by third party authors. Again, the code 99 denotes that there was no evidence an optional component was implemented by the study authors.
Analytic Approach
We assessed the transparency of reporting for each study by summing the scores across all items and computing the adherence rate (i.e., sum score/total possible items). Rates were computed for the PRISMA 2020 and PRISMA-Search checklist scores, as well as the two associated subscales in each. With respect to the calculation of the adherence rate, the denominator for each study (i.e., total possible items) was reduced by omitting the “99” codes. As such, the rate of adherence is not impacted by unreported, optional checklist items.
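As a minimal sketch of this computation (our own, with hypothetical item scores; the actual coding used the checklists in Tables 1 and 2):

```python
def adherence_rate(item_scores: list[float]) -> float:
    """Adherence rate for one study: sum of item scores divided by the
    number of scoreable items. Items coded 99 (optional elements assumed
    not implemented) are dropped from numerator and denominator alike."""
    scoreable = [s for s in item_scores if s != 99]
    return sum(scoreable) / len(scoreable)

# Hypothetical study: three core items scored 1, 0.5, and 0, plus two
# optional items, one fully reported (1) and one assumed unimplemented (99).
print(adherence_rate([1, 0.5, 0, 1, 99]))  # 2.5 / 4 = 0.625
```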
Similarly, we assessed the reproducibility of each study by determining whether sufficient information was presented in the report to enable full reproduction of each reported component. If yes, the item was scored as 1; if no, the item was scored as 0. Any 0’s indicate that the study is not reproducible at that particular step. The number of 1s and 0s for each study were tallied, to give an overall indication of which components were the most and least reproducible.
Last, to assess the potential impact of journal page limitations on transparency and reproducibility, we examined the correlation between checklist scores and article length. The word count for each article’s .pdf file was calculated using the “counting characters” extension in Google Chrome.
Results
Search Results
The search resulted in 145 hits, of which 33 were meta-analyses of intervention/prevention programs published from January 2016 to February 2021 and were selected for inclusion. Almost all of the meta-analyses were published in criminology journals such as Aggression & Violent Behavior, Criminal Justice & Behavior, and Journal of Experimental Criminology (n = 31). One was published in Psychiatry, Psychology, & Law, and one was published in Addiction. The types of interventions examined were diverse; for example, crisis intervention teams, music therapy in correctional settings, protection orders for domestic violence, and electronic monitoring. The outcomes examined were primarily recidivism and crime; some studies measured alternative outcomes such as psychological functioning, educational attainment, and police officer use of force. All included studies are denoted with an asterisk in the References section.
Transparency
PRISMA 2020 Checklist
Table 3. PRISMA 2020 Checklist and PRISMA-Search Checklist Total and Subscale Average Scores.
Notes: a Denominators were adjusted for each study based on the number of 99 codes (e.g., 38 − (# of 99s)). b Methods subscale: checklist items 8–15, including sub-items. c Results subscale: checklist items 16–22, including sub-items. d Information & Methods subscale: checklist items 1–7, including sub-items. e Search Strategies subscale: checklist items 8a–13, including sub-items.
Figure 1 shows the item adherence scores across the 38 PRISMA 2020 items. With respect to the 27 “core” items, while the majority of studies received high scores on adherence to items such as study rationale (94%), objectives (86%), type of effect measures used (100%), and type of results synthesis (92%), other items had substantially lower rates of reporting. In particular, few studies described their approach to displaying results (6%), presented summary characteristics of each pooled analysis (3%), mentioned whether or not the study had a registered protocol (9%), or mentioned the availability of data/code (15%).
Figure 1. PRISMA 2020 Average Item Adherence Scores. Notes: a Item descriptions are available in Table 1; as noted in the Methods section, PRISMA 2020 items #2, #5, #6, and #7 were not included. b Numbers in parentheses represent the number of studies reporting on the item; n = 33 for all core reporting items, with variable counts for the optional items (for example, 15 studies implemented checklist item #11). c Diagonal bar lines represent optional reporting items.
Figure 1 differentiates the 11 “optional” PRISMA 2020 checklist items with diagonal lines in the bar chart. In general, studies that implemented optional components tended to report adequately on these components. For example, of the 15 studies that implemented a study-level risk of bias assessment, 73% adequately described the methodological approach. Similarly, of the 18 studies that implemented a sensitivity analysis, 64% adequately reported the methodological approach and 78% adequately reported the analytic results.
PRISMA-Search Checklist
Results for the PRISMA-Search checklist indicate that none of the 33 studies implemented all 19 items on the checklist. As shown in Table 3, the average sum score was 7.58 (range 2.5–12.5 items), and the average number of items implemented was 9.79 (range 6–14). Across the 33 studies, the average transparency of reporting was 77% (range 42%–94%). Subscale scores for the PRISMA-Search checklist are also shown in Table 3. For the “Information & Methods” subscale, the average rate of adherence was 86% (range 0%–100%), while the average adherence score for “Search Strategies” was 75% (range 25%–100%).
Figure 2 shows the average adherence of reporting for each of the 19 individual items on the PRISMA-Search checklist. The majority of the items were optional and are indicated by diagonal lines in the bar chart. For the core reporting items, adherence was highest for item #9: limits and restrictions for included studies (100%), item #15: total number of records documented (95%), and item #1: database names listed (91%). Reporting was lower for item #13: dates of each search (80%) and item #8a: inclusion of full database search strategies (71%), and markedly lower for item #16: description of the search records deduplication method (27%). For the optional checklist items, reporting adherence rates were lowest for search filters (item #10; 0%) and for listing complete search strategies from non-database sources (item #8b; 26%). In general, high rates of adherence were shown for the optional checklist items, although many items were not implemented by the majority of studies in the set. For example, only six studies reported searching study registries (item #3), four studies reported other methods of searching for literature (item #7), and three studies noted using peer review for their search strategy (#14).
Figure 2. PRISMA-Search Average Item Adherence Scores. Notes: a Numbers in parentheses represent the number of studies reporting on the item; n = 33 for all core reporting items, with variable counts for the optional items. b Diagonal bar lines represent optional reporting items.
Reproducibility
Table 4. Study-Level Reproducibility of Search.
Note: ✓ = the item has sufficient reporting for reproducibility; X = the item was implemented but was not adequately reported for reproducibility; blank cell = the item was not implemented by study authors and is not relevant to reproducibility. Item key: 1 = individual database names, 2 = study registries, 3 = hand-searched journals, 4 = websites, 5 = reference lists of included studies, 6 = existing review reference lists, 7 = authors contacted, 8 = all other search methods used, 9 = included search terms for databases, 10 = included website search strategies, 11 = study inclusion criteria, 12 = dates of searches, 13 = listed total records and exclusions at each search phase.
Table 5. Study-Level Reproducibility of Results.
Note: ✓ = the item has sufficient reporting for reproducibility; X = the item was implemented but was not adequately reported for reproducibility; blank cell = the item was not implemented by study authors and is not relevant to reproducibility. Item key: 14 = outcome variables, 15 = coding of bias within studies, 16 = types of effect size calculation, 17 = study eligibility, 18 = methods of preparing data for syntheses, 19 = method(s) of synthesizing results, 20 = approach to heterogeneity analyses, 21 = approach to sensitivity analyses, 22 = method for handling risk of bias across studies.
Taken together, while three studies presented a reproducible search strategy and three other studies provided reproducible meta-analytic results, none of the 33 meta-analyses in our sample would be reproducible in its entirety. The total number of non-reproducible elements per study (i.e., across Search and Results) ranged from 1 to 7, with the average study stopping short of reproducibility by approximately four items (M = 4.12, SD = 1.57).
Table 6. Reproducibility Checklist Items (n = 22) and Item-Level Reproducibility.
For example, study #10 did not report on four of the nine search items implemented. While the authors listed three examples of the 18 databases that they searched (including Criminal Justice Abstracts, National Criminal Justice Reference Service, and Web of Science), this is insufficient for third party reproduction based on the published article alone. Similarly, the authors present examples of websites that they searched but do not provide a complete list; further, they do not specify the search terms used in website searches, and do not report the specific dates of all searches implemented.
The most common roadblocks to reproducibility of the meta-analytic results were items #18 (“describes data preparation”; only 16 of 33 studies (48%) reported this item sufficiently), #17 (“describes study eligibility for different analyses”; 15 of 26 studies (58%) did not sufficiently report this item), and #20 (“describes approach to any heterogeneity analysis”; 11 of 28 studies (39%) did not report the methodology in a way that would enable reproduction of results).
Article Length, Transparency, and Reproducibility
Last, given that reporting transparency and reproducibility may be in part a function of article length, we examined correlations between article word count, PRISMA 2020 score, and reproducibility score. With respect to word count, the 33 articles ranged from 6,232 to 19,714 words, with a mean of 11,543 words (SD = 3,395). As noted previously, the mean PRISMA 2020 score was 63% (SD = 11%), and the mean reproducibility score was 73% (SD = 11%).
As anticipated, article word count was significantly correlated with study reproducibility score, with a moderately sized Pearson r = 0.4028 (p < .03). Longer articles were more likely to score highly with respect to reproducibility. Conversely, PRISMA 2020 scores were not correlated with article word count, suggesting that reporting adherence/transparency is not directly related to length of the article or, potentially, any journal page length maximums.
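For illustration, the form of this analysis can be sketched as follows (hypothetical values stand in for the 33 actual word count/score pairs, which we do not reproduce here):

```python
from scipy.stats import pearsonr

# Hypothetical (word count, reproducibility score) pairs for illustration;
# the analysis reported above used the 33 studies in the sample.
word_counts = [6232, 8100, 9950, 11543, 13200, 15800, 19714]
repro_scores = [0.55, 0.64, 0.59, 0.73, 0.77, 0.82, 0.91]

r, p = pearsonr(word_counts, repro_scores)
print(f"Pearson r = {r:.4f}, p = {p:.4f}")
```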
Discussion
A central premise of meta-analysis is that reporting is transparent and findings are reproducible (Borenstein et al., 2009; Lipsey & Wilson, 2001). Indeed, that studies are objectively and systematically selected and synthesized is arguably one of the method’s marquee features, particularly in comparison to traditional narrative research syntheses. Yet, prior studies indicate that methodological transparency in meta-analyses is often lacking, rendering the reproduction of results near impossible (Hohn et al., 2020; Lakens et al., 2017). As research on transparency and/or reproducibility of meta-analyses in the social sciences is limited and to our knowledge is non-existent within criminology, the current study sought to examine whether meta-analyses in the field of criminology meet these traditional claims.
Our systematic (albeit purposely limited in scope) search of the literature yielded a set of 33 meta-analyses of intervention programs published in scholarly journals between January 2016 and February 2021. Modified versions of the PRISMA 2020 checklist (38 items) and the PRISMA-Search checklist (19 items) were used to assess transparency of reporting and adherence to PRISMA reporting guidelines. Our findings indicate that the average rate of transparency within the sample was moderate (63%); however, adherence varied considerably across studies and across subscales. In particular, information in the Results sections of meta-analyses was reported more adequately than information in the Methods sections (76% vs. 61% adherence). Additionally, findings from the PRISMA-Search checklist indicate a fairly high rate of adherence to the reporting of search methods overall (77%): the studies in our sample adequately reported “Information & Methods” items such as the full list of databases and websites used (86% average adherence), and sufficiently reported the search strategies used in the meta-analysis, such as search terms and inclusion criteria (75%). Overall, while “core” and “optional” checklist items were both generally reported with sufficient detail, some core items had low rates of adherence. This finding is concerning, as the core checklist items are those most central to conducting a basic systematic review and meta-analysis, and they represent information that should be reported with utmost transparency. Failure to clearly and sufficiently report core items not only creates potential gaps in the transparency of decision-making; opaque reporting practices may also considerably undermine the perceived rigor and validity of the general findings and minimize the overall credibility of study conclusions.
With respect to the core items in PRISMA 2020, few studies in this sample of meta-analyses provided information concerning review registration and protocol availability (item #24a, 9%), and few noted the availability of data, code, or other materials (item #27, 15%). While we do not contest the value of a pre-registered review protocol and the utility of having data and code made publicly available, these components are not currently standard in the field of criminology, and lack of reporting on these items may be more a reflection of norms in criminology (versus health sciences research) than a lack of reporting transparency. A difference in norms is also evident for some items with low reporting rates in the PRISMA-S checklist, including item #3: study registries (only six studies reported using registries), item #10: search filters (no studies reported using filters), item #11: prior work (only two studies noted using search strategies from other literature reviews), item #14: peer review (only three studies used a search peer review process), and item #16: deduplication (27% of studies mentioned efforts to handle duplication in search results, but only three studies mentioned the use of software). These items do not appear standard in the field at the current time, and we contend that their lack of implementation is less a reflection of a lack of rigor and/or transparency and more a reflection of differences in research norms between meta-analyses of criminological interventions and meta-analyses of health interventions (for which PRISMA 2020 was developed; Page et al., 2021a).
Considering how few official guidelines exist in the field of criminology with respect to commitments for data sharing and transparency, these findings are perhaps not altogether surprising. When compared with disciplinary norms in other fields such as psychology, it appears that criminology is lagging behind with respect to established guidelines for reporting and data sharing. For instance, section 8.14 of the American Psychological Association’s (APA) Ethics Code “instructs researchers to allow other competent professionals access to the data on which their published results are based” (APA, 2020). Further, many APA journals now require authors to provide data availability statements that link shared data, materials, and/or codes (for the purposes of reproducing results or replicating procedures), or authors must specify their ethical or legal reasons for not sharing (APA, 2020). Without clear guidelines of reporting standards in criminology, an acceptable level of “adherence” will continue to be difficult to achieve.
Altogether, while it is encouraging that most studies have reasonably transparent reporting based on state-of-the-art PRISMA standards, our findings point to several areas in need of improvement. Specifically, of the eight core items in the Methods section of the PRISMA 2020 checklist, five had adherence rates below 60%: #8: selection process, #9: data collection process, #10b: data items (variables), #13b: preparation of data, and #13c: synthesis display (see Figure 1). As items #8, 9, 10b, and 13b all represent items where at least some level of subjective decision-making is required, transparency is especially vital for third party appraisal of methodological objectivity and rigor.
Additionally, two of the six core items in the Results section of the PRISMA 2020 checklist were below 60% on adherence: #16b: study exclusions, and #20a: summary characteristics of each analysis. With respect to item #16b, this phase of the study selection process requires authors to make a number of decisions to ensure that the set of studies is commensurate (e.g., with respect to population, intervention type, outcome measures, etc.). In general, listing the number of studies excluded at each phase of the search process as a result of the various inclusion criteria reduces ambiguity about the decision-making process, which may have a considerable impact on the perceived objectivity of study selection (e.g., that the studies were not “cherry picked”) and helps assure readers that there are no hidden biases. Finally, our findings suggest that summarizing the characteristics and risks of bias for each synthesis in a given meta-analysis (for example, when studies present a series of smaller syntheses that examine homogeneous program types (e.g., multi-session vs. single-session) or participant types (e.g., adults vs. youths)) is uncommon in meta-analyses of crime prevention/criminal-justice-related interventions. Across the set of 33 studies, only one study adequately reported results on this checklist item.
To assess reproducibility, we used a 22-item checklist that we developed based on the PRISMA 2020. Our findings demonstrate that, overwhelmingly, studies did not meet the standard of reproducibility: none of the 33 studies in our sample could be successfully reproduced in their entirety. While five of the checklist items were entirely reproducible across the set of 33 studies (i.e., 100% of the studies sufficiently reported this information), six items had a rate of reproducibility below 60%, including four in the Search section (see Table 6). The search strategy items in question (i.e., #3: listed all hand-searched journals, #6: listed all existing review reference lists that were searched, #7: listed all authors contacted, #10: included full website search strategy) generally involve potentially long lists, which would consume a large amount of manuscript space if presented in full. It is possible that authors who do not consider the reproducibility of their study (or who deprioritize it) may choose to omit or partially report this information when faced with the challenge of adhering to a journal’s submission guidelines with respect to word count.
The significant relationship between article word count and overall reproducibility scores in the current study supports this explanation. With respect to the items in the Results section (i.e., item #17: described study eligibility for each synthesis, and item #18: described methods of preparing data for syntheses), it may be that such descriptions are prohibitively long and complex depending on the number (and nature) of the primary studies being synthesized. For example, in our experience, recidivism data that are primarily 0/1 outcomes computed as odds ratios tend to be much simpler than data computed as standardized mean differences (or those involving multiple levels of nesting or treatment groups that require aggregation). Fully describing all data preparation for a study might in some cases require a half page of explanation (or more); if the meta-analysis contains a large pool of studies, this requirement is unrealistic for presentation in a journal manuscript. In these situations, supplementary online material that includes raw data may be the most reasonable means by which to ensure full transparency and reproducibility.
Summary of Critical Findings.
Recommendations
The results of this study lend themselves to several recommendations for the field. With respect to academic journals and organizations, criminology journals that publish research syntheses should clarify in their submission guidelines that page length maximums will be extended for meta-analyses, or that online supplementary materials for meta-analyses are expected. Transparency in reporting is hampered by requirements that authors keep a manuscript to 25 or even 35 pages, which are the stated page maximums for some of the journals in which the current set of articles was published. In addition, editors of journals that welcome the submission of meta-analyses could consider requesting a completed reporting checklist, such as PRISMA, along with the manuscript to enhance transparency. Influential journals in the field of criminology should also consider adopting an approach similar to that of APA journals and require data sharing/availability statements from meta-analyses to enable other researchers to reproduce results and/or replicate methodological procedures. Further, if the quality of reporting is to be improved, guidance and ethics statements can be developed for meta-analysts in the field by organizations such as the American Society of Criminology and the Academy of Criminal Justice Sciences.
With respect to authors, we strongly recommend that meta-analysts bear in mind both the PRISMA 2020 guidelines for reporting transparency and the tenet of reproducibility when designing their research and drafting their manuscripts. As meta-analysts are known for their repeated calls for primary evaluators to “do better” in reporting elements of research design (e.g., Lipsey, 2001), we recognize the irony in making this suggestion. Nonetheless, it is clear that meta-analyses suffer the same issues of imperfect reporting, but with potentially even more problematic implications. Regarding reproducibility, we recommend that meta-analysts adopt a policy of double-coding all phases of systematic reviews, from study inclusion decisions to effect size calculation. Surprisingly, very few of the studies in the current set reported double-coding of all data; many used inter-rater reliability (IRR) assessments on a portion of overlapped studies. In our experience, no matter the coder’s proficiency, errors in data extraction are inevitable; as such, a high IRR is likely not satisfactory for reproducibility. While the intense labor involved in coding for meta-analyses is well known, we suggest that coding be conducted by two reviewers, with 100% overlap on all studies and disagreements resolved by a third author (see Buscemi et al., 2006). If double-coding all studies is not possible (e.g., because the data set is prohibitively large), we recommend that a second reviewer code a random sample comprising a substantial portion of the studies in the set, with IRR subsequently assessed, as sketched below.
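Where IRR is assessed on a double-coded subset, agreement can be quantified with a chance-corrected statistic such as Cohen’s kappa. The sketch below is a hypothetical illustration using scikit-learn’s cohen_kappa_score; the studies we reviewed varied in how (and whether) they computed IRR:

```python
from sklearn.metrics import cohen_kappa_score

# Hypothetical inclusion decisions (1 = include, 0 = exclude) from two
# independent coders screening the same ten candidate studies.
coder_a = [1, 0, 1, 1, 0, 1, 0, 0, 1, 1]
coder_b = [1, 0, 1, 0, 0, 1, 0, 1, 1, 1]

# Observed agreement is 0.80; correcting for chance agreement yields
# kappa of about 0.58, conventionally labeled "moderate."
print(cohen_kappa_score(coder_a, coder_b))
```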
Recommendations and Implications of the Review for Practice, Policy, and Research.
Notes: a American Society of Criminology; b Academy of Criminal Justice Sciences.
Limitations
There are several limitations to our study. First, the sample of included studies was fairly small (n = 33). This was intentional in order to identify a manageable set of recently published studies on which to test the application of PRISMA 2020 guidelines, develop and test our Reproducibility checklist, and present findings at the individual study level. While the search was systematic in that we used a priori inclusion criteria and specific date limiters in the Criminal Justice Abstracts database, we by no means suggest that this set of 33 studies is the universe of meta-analyses of criminological prevention/intervention program evaluations conducted during this time period. In addition, we purposely limited the search to meta-analyses focused on prevention/intervention program evaluations in the field of criminology, and results may not generalize to meta-analyses of other topics. Further, given our focus on journal publications, we specifically excluded Campbell Collaboration meta-analyses and the results from this study do not extend to these publications. Second, this study involved a series of decisions with respect to coding instruments and procedures. While we selected the PRISMA 2020 reporting checklist as our measure of transparency, other checklists such as the MARS (Cooper, 2010) may arguably have been more applicable. The focus of the paper was not on PRISMA; rather, it was to use a state-of-the-art transparency guideline as a benchmark to assess the transparency of recent publications. Third, with respect to coding, some element of subjectivity was apparent when applying the checklists. To ensure accurate coding to the extent possible, both reviewers familiarized themselves with the PRISMA 2020 statement, and both reviewers coded the full dataset with a requirement that disagreements were discussed and concurrence on 100% of the codes was reached. The same process was used for the Reproducibility checklist. Relatedly, we acknowledge that there is a degree of subjectivity in ratings on the components of the Reproducibility checklist. Finally, a thoughtful reviewer queried how it was possible to assess reproducibility without actually reproducing anything. We acknowledge this as a potential limitation to the study, and emphasize that the study’s focus is on meta-analysis reporting (as opposed to validating meta-analysis methods or conclusions). While we relied on our own meta-analytic expertise and experience in conducting meta-analyses to determine whether sufficient information was reported to theoretically allow for replication, it is certainly possible that an attempt to formally replicate the 33 studies in our set would have led to different study results.
Conclusion
Meta-analyses are summative research syntheses and have the potential to be influential pieces of literature, provided their methods and conclusions are accepted as systematic, objective, and valid. It is incumbent on meta-analysts to adhere to principles of transparency and reproducibility, as the research they produce may influence decisions with respect to criminal justice policy, with notable implications for public safety and for equity in treatment impacts on diverse populations. The results of the current study suggest that meta-analyses in the field of criminology are moderately transparent, with several noted areas for improvement. Substantial changes to reporting practices are necessary for reproducibility to become reality rather than mythical claim. While an adherence rate of 63% for transparency does not mean much in isolation, comparisons between fields (e.g., social science vs. natural science) or within the same field over time (e.g., crime prevention/intervention meta-analyses published between 2016 and 2020 vs. 2021–2025) could yield useful information about how adherence to transparent reporting in criminology compares to that in other fields (i.e., perhaps 63% adherence is comparatively high), or whether reporting practices have changed over time. Future research should continue to examine these issues in criminology using PRISMA-based checklists (or other validated reporting tools) such as the ones derived for the current study, with potential modifications such as removing items that are not considered norms in the field. Applying these checklists to a larger sample over a lengthier time period would allow for the assessment of a broader array of correlates, such as whether transparency and reproducibility of reporting are related to recency of publication, research topic, size of the study set, number of analyses conducted, journal metrics, methodological quality ratings on AMSTAR 2, and so forth.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
