Abstract
In 2000, this journal published an influential case–control study identifying dynamic risk factors for sexual recidivism (Hanson & Harris, 2000). In 2017, updated recidivism information for the same sample was obtained, with an average follow-up of 20 years. The current study compared the risk factors that differentiated between sexual recidivists and nonrecidivists across the two research designs: the original case–control design and the updated prospective cohort design. Of the 82 comparisons, 50 favored the prospective design and 32 favored the case–control design; however, most of the differences were small and nonsignificant. Static and dynamic risk factors were approximately equivalent between study designs. Factors identified as sex-specific (e.g., sexual deviancy) were also equivalent between designs, whereas general risk factors (e.g., substance use) were more likely to be identified in the prospective design. Overall, case–control studies can be used for the identification of risk factors, especially for low base rate behaviors such as sexual recidivism.
Evidence-based corrections requires knowledge of recidivism risk and protective factors (i.e., characteristics empirically associated with the outcomes of interest; Bonta & Andrews, 2024; Hanson, 2009). One of the following two research designs is typically used to identify such factors: (a) prospective cohort designs (i.e., assessment of factors at Time 1 are correlated with the outcome of interest at Time 2), and (b) case–control designs (i.e., factors are compared between a group that has experienced the outcome of interest and a group that has not, without a follow-up period). Of the two, prospective cohort designs are usually preferred because they can clearly establish the temporal order of the variables (Mann, 2003; Sedgwick, 2013). Temporal order is necessary for applied prediction tasks and is a key criterion for causal theories (Kraemer et al., 1997). In addition to concerns about temporal order, case–control studies are also influenced by the variables used to match cases (Meehl, 1970). Case–control studies are, nonetheless, widely used because they can be completed relatively quickly, particularly for low base rate behaviors (Rothman et al., 2008; Song & Chung, 2010), such as sexual recidivism.
More than twenty years ago, this journal (Criminal Justice and Behavior) published a highly influential case–control study of dynamic risk factors for sexual recidivism (Hanson & Harris, 2000). The Hanson and Harris (2000) study has been widely cited, with over 1,300 citations in Google Scholar as of July 2024, including 36 citations in 2023 alone, two decades after it was originally published. The study also informed the development of the STABLE-2007 and ACUTE-2007 (Hanson et al., 2007) sexual recidivism risk tools, which are among the most used in Canada and the United States (Bourgon et al., 2018; McGrath et al., 2010). But given the inherent limitations of case–control studies, how much should we trust the Hanson and Harris results? More generally, to what extent do the recidivism risk and protective factors identified in any case–control study replicate in prospective studies? Although many factors have been examined in both case–control and prospective studies, the studies have been different (one study of variable x is case–control; another is prospective). Consequently, it is hard to tell whether any observed differences in the outcome of these two types of studies can be attributed to differences in the initial sample selection, location, or any of a myriad of other, unmeasured features varying across research studies. We are unaware of any previous study of recidivism risk factors directly comparing the results of a case–control design and a truly prospective design using the same variables, with the same sample, in the same setting. The current study fills this gap by comparing risk factors differentiating sexual recidivists and nonrecidivists in Hanson and Harris’ (2000) original case–control study to the factors predicting sexual recidivism in a 20-year prospective follow-up of the same sample.
Overview of Prospective and Case–Control Designs
Prospective designs can take many forms; however, prospective cohort studies are commonly used when studying recidivism risk factors. Prospective cohort studies start with a defined group who vary on a characteristic (or set of characteristics); after a follow-up period, some of the group members will have the outcome and some will not. Those with and without the outcome are then compared on the original factors that were measured at baseline (Mann, 2003; Sedgwick, 2013). Temporal order is established because the variables are measured prior to the outcome, limiting the risk of recall bias. Although prospective cohort studies are often considered the gold standard in prediction studies, they are resource and time intensive. Truly prospective cohort designs are particularly problematic when assessing outcomes that occur rarely (e.g., suicide) or manifest over a long period (e.g., life expectancy of infants born in 2024; Mann, 2003; Sedgwick, 2013). Furthermore, prospective cohort designs are susceptible to bias stemming from attrition between timepoints, especially if the reason for the attrition is associated with the factors assessed at baseline (Sedgwick, 2013).
Like prospective cohort designs, case–control studies compare those with and without the outcome on a predefined set of variables. The key difference is that those with the outcome are selected because they have the outcome. There are several different methods of selecting those without the outcome (i.e., the control group), usually involving some form of matching from a larger pool of potentially eligible cases (Song & Chung, 2010). For example, researchers could compare all cases of teen suicide in a metropolitan area with a random sample of teenagers from the same city during the same time period. Because researchers determine the number of cases in each group, case–control studies make it possible to study very low frequency outcomes, as well as outcomes that occur many years later (Rothman et al., 2008; Song & Chung, 2010). In general, case–control studies require fewer participants for equivalent statistical power, allow for the use of existing file information, and can be conducted more quickly and easily than prospective designs.
As with all observational studies, both prospective cohort and case–control studies are susceptible to the influence of unmeasured, third variables. Prospective cohort studies often address this problem by measuring a wide range of variables and then using them as statistical controls. In the case–control design, it is common to match cases on variables that could influence the outcome but are not the focus of the investigation (e.g., age, gender, geographic location, income; Cologne & Shibata, 1995; Meehl, 1970). Although matching is common practice in case–control studies, it is not without limitations. Any variable used as a matching variable cannot be used to compare the groups on the outcome of interest (by design, the effect would be zero). Less obviously, matching decreases the perceived effects of correlated variables. If, for example, low income is related to high criminal behavior, matching cases on age would decrease the association between income and crime because young adults typically make less money than older adults. Furthermore, as Meehl (1970) described, matching cases on any measured variable, m, would be expected to systematically mismatch the cases on some other, unknown variable, z. By controlling for variable m, the level of the mismatch on variable z could be quite high.
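The attenuation described above can be illustrated with a toy simulation. All variable names and coefficients here are hypothetical (not drawn from the study): income partly depends on age, crime propensity depends on income, and conditioning on narrow age strata (a stand-in for age-matching) shrinks the observed income–crime correlation.

```python
import math
import random

def corr(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(vx * vy)

def matching_attenuation_demo(n=20000, seed=1):
    """Compare the income-crime correlation in a full cohort with the
    average correlation inside 5-year age bands (a proxy for matching
    cases on age). Hypothetical data-generating process throughout."""
    rng = random.Random(seed)
    ages = [rng.uniform(18, 65) for _ in range(n)]
    # Income rises with age, plus individual noise (arbitrary units).
    incomes = [0.8 * a + rng.gauss(0, 5) for a in ages]
    # Lower income -> higher crime propensity, plus noise.
    crimes = [-0.5 * inc + rng.gauss(0, 5) for inc in incomes]

    overall = corr(incomes, crimes)

    # "Match" on age: correlate income with crime within each age band,
    # then average across bands.
    within = []
    for lo in range(18, 65, 5):
        idx = [i for i, a in enumerate(ages) if lo <= a < lo + 5]
        if len(idx) > 2:
            within.append(corr([incomes[i] for i in idx],
                               [crimes[i] for i in idx]))
    matched = sum(within) / len(within)
    return overall, matched
```

Because age explains part of the variance in income, removing age variation (by stratifying) leaves less income variance to covary with crime, so the within-band correlation is weaker than the cohort-wide correlation.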
Risk Factors for Sexual Recidivism
The effect matching has on the underlying constructs is difficult to anticipate (Meehl, 1970). In the context of sexual recidivism risk assessment, two major constructs have been identified: (a) sex-crime-specific factors and (b) general antisociality (Babchishin et al., 2012; Brouillette-Alarie et al., 2016; Lehmann et al., 2013). Sex-crime-specific factors describe problems with atypical sexual interests and poor sexual self-regulation, contributing to sexual crime but not necessarily to nonsexual crime. The sex-crime-specific dimension is exemplified by characteristics such as early onset of sexual offending, engaging in diverse sex crimes, atypical sexual interests (e.g., pedophilia, exhibitionism), sexual preoccupation, and emotional identification with children (a correlate of pedophilia). The general antisociality dimension describes a general propensity for rule-breaking (sexual or otherwise), and includes factors such as adjustment problems in childhood, poor self-regulation and problem-solving, antisocial attitudes and personality traits (e.g., callousness), and relationship conflicts (Hanson & Bussière, 1998; Hanson & Morton-Bourgon, 2004; Helmus & Thornton, 2015).
In their case–control study, Hanson and Harris (2000) examined both static (historical, largely unchangeable) and dynamic (potentially changeable) risk factors in a sample of slightly over 400 men with a history of sexual offending who had served part of their sentence on community supervision. Sexual recidivists (n = 208) and nonrecidivists (n = 201) were matched on certain demographic characteristics (gender, geographic location, jurisdiction), as is customary in case–control studies. Given Hanson and Harris (2000) were primarily interested in dynamic risk factors, they also attempted to match cases on sex-crime-specific static risk factors (age, victim type, sexual criminal history). As well, they approximately matched on sexual recidivism risk level using the Rapid Risk Assessment for Sex Offense Recidivism (RRASOR; Hanson, 1997). Overall, Hanson and Harris (2000) found that the static factors that best differentiated recidivists from nonrecidivists were scores on validated risk assessment measures for general violence (e.g., VRAG, Quinsey et al., 1998), indicators of sexual deviance (e.g., number of paraphilias, victim type), IQ, and history of childhood maltreatment (e.g., sexual abuse and neglect). Examples of promising stable dynamic factors were antisocial attitudes (e.g., sees self at no risk for recidivism), poor social influences, and sexual entitlement. Because the groups were matched on sex-specific static factors, the effect sizes of the general risk factors (e.g., antisocial attitudes) may have been inflated; conversely, the variables correlated with the matching variables would be expected to show reduced effects compared to what they would have shown in prospective studies. Any of the variables explicitly used in matching (e.g., RRASOR scores) could not be meaningfully tested. Consequently, it is an open question whether the same types of risk factors (i.e., sex-crime-specific vs. general) identified through this case–control design would replicate in a prospective cohort design.
Current Study
In 2017, the data set created by Hanson and Harris (2000) was updated to include prospective recidivism information for the same individuals from the original study. The current study compared the predictive accuracy of static and stable dynamic risk factors identified through a case–control design versus a prospective cohort design. Overall, we expected more differentiation between recidivists and nonrecidivists in the prospective cohort design than in the case–control design because the prospective design would have less matching bias. Given that the original study had matched on sex-crime-specific factors, we expected relatively larger effects from sex-specific risk factors in the prospective study and relatively larger effects from general antisociality factors in the case–control study.
Methods
Participants
The original sample comprised 409 men who were supervised in the community for a sexually motivated offense between 1992 and 1997 (Hanson & Harris, 2000). Cases were selected from all Canadian provincial correctional systems (except Prince Edward Island) and all regions of the Correctional Service of Canada. All the men had been convicted of sexual assault involving physical contact against a person who was not in their immediate family (i.e., cases of offending against their own children/stepchildren were excluded, as were those with purely noncontact sexual offense histories). In the original study, 208 were classified as having recently sexually recidivated while on community supervision, whereas 201 were currently on supervision without any known reoffense as of the 1997 data collection. Recidivism was indexed by charges for sexual offenses (68%), violations of community supervision for sexually motivated behavior (26%), and nonsexual charges with sexual motivations (e.g., stealing underwear; 6%). Most of the sexually motivated violations were for sex crime behaviors (e.g., exhibitionism, sexual contact with a minor) that eventually resulted in charges and convictions; however, some of the community supervision violations were for behavior that indicated that a new sexual offense was imminent, but not actualized (e.g., approaching victims, possession of rape kits). None of the original sexual recidivism events included purely technical violations, such as nondisclosure of consenting intimate relationships or possession of legal pornographic materials.
Once a recidivism case was identified, nonrecidivist cases were selected from the same location (e.g., Metro X Probation Office) and jurisdiction (provincial/federal). Among the available cases at that office, the nonrecidivist case was selected that minimized static differences between the recidivist and nonrecidivist groups. As well, an effort was made to match on other salient characteristics, such as major mental illness (e.g., schizophrenia) and ethnicity (e.g., Indigenous status). Finally, the researchers aimed to match on the risk for sexual recidivism using RRASOR (Hanson, 1997) total scores.
Updated criminal history information obtained in 2017 was used to identify subsequent recidivism events for the nonrecidivist group and to correct errors in the original classification. The most common error was pseudo-recidivism (i.e., individuals had new charges for behavior that predated the beginning of their supervision period), resulting in 25 cases being reclassified from recidivists to nonrecidivists. As well, two cases were removed because they did not fit the sampling frame of the original study (one case with a very old recidivism event; one case in which nonsexual offending was mistaken for a sexual offense). Consequently, the original 409 cases were reclassified as 180 recidivists and 227 nonrecidivists (n = 407) for the purpose of the case–control analyses. As a result, some cases that were (supposedly) matched in the original study were not matched in the current sample (e.g., a particular probation office may have contributed two nonrecidivists rather than a matched set of one recidivist and one nonrecidivist).
The prospective analyses considered only the 227 men who were classified as nonrecidivists in 1997. Although some of these men had prior sexual and nonsexual offenses, all had successfully completed at least six months of community supervision (average of 24 months) at the time of data collection. The median at-risk date for the nonrecidivists was 1996 (range from 1981 to 2002) in the updated data set. Subsequent criminal history information identified 57 of these 227 men as having committed a new sexual offense. Recidivism information was not available for 14 cases. Consequently, the prospective study compared 57 men who were known to have sexually reoffended after their assessment date in 1997 to 156 men who had no new recidivism events as of the follow-up end date in 2017.
At time of release, the average age of the complete sample (n = 407) was 39.4 years (SD = 11.4, range from 18 to 74). Most of the men were born in Canada (87.3%) and White (85.0%; 8.8% Indigenous; 2.9% Black; 3.3% unknown/other). Median education level was Grade 10. Based on the Static-99R (Phenix et al., 2017), the sexual recidivism risk of this group was above average: Very low risk (1%); Below average risk (4.9%); Average risk (28.7%); Above average risk (30.7%); Well above average risk (34.6%).
Original Data Collection
The selection of variables was guided by Andrews’ (1994) work on criminogenic factors, Hanson and Bussière’s (1996, 1998) meta-analyses on sex-crime-specific factors, and consultations with researchers, correctional managers, and community supervision officers. The final data collection instrument benefited from focus groups with experienced probation/parole officers and extensive pilot testing. The original project obtained ethics clearance from 14 institutional review boards. Selection and training of the field team members who would be completing interviews followed Canada-wide position advertising, resumé review, and in-person interviews including role-plays and mock telephone cold calls. Ecological validity was increased by including an active-duty probation officer on the interview team (seconded to the data collection team). All coders received a week-long in-person training that involved mock interviews, practice scoring on actual de-identified cases, and team-building exercises. Once data collection started, the team was actively supported by frequent telephone communication, periodic onsite visits, and an ever-evolving e-mail “Decision Log” that addressed ongoing coding issues and other guidance. To increase the likelihood that the information concerning the predictor variables described the case before the recidivism event, coders first established two critical time points in the interviewees’ lives, one in the month preceding the recidivism event and another six months earlier (e.g., changed jobs; returned from maternity leave; experienced a severe weather event). The interviewees were asked to link their responses about the recidivist/nonrecidivist cases to these two critical time periods in their own lives. Coding of contemporaneous contact notes was completed using the same interview coding form as the officer interview, utilizing the same two critical time periods. Data were then couriered to Ottawa for double-entry data punching.
Further procedural details can be found in Hanson and Harris (1998, 2000).
Updated Recidivism
Two graduate-level research assistants working for Public Safety Canada linked recidivism information obtained in 2017 to the previously collected case information. Recidivism events were primarily identified by Canadian Police Information Center (CPIC) criminal history records held by the Royal Canadian Mounted Police. These records were supplemented by news articles searched via Google, identifying seven additional recidivism events (five contact, one noncontact, one sexually motivated violation). Sexual recidivism was based on new charges (45 cases) or sex-related community supervision violations (12 cases). Fifteen of the 57 recidivism cases involved only noncontact sexual offending (e.g., voyeurism, exhibitionism, child sexual exploitation materials). Follow-up time began at the date of assessment and ended with death (n = 18 of the 213; median death date of 2010, range from 1998 to 2016), deportation (n = 1), the date of the most recent CPIC record (2017), or the start of a period of incarceration that extended into one of these dates. The average follow-up time was 20.6 years (median = 20.7, SD = 4.1). At-risk time excluded periods of incarceration for nonsexual offenses.
Classification of Risk Factors
All three authors independently categorized the list of risk factors from Hanson and Harris (1998, 2000) as either sex-crime-specific, general, or neither (see S1 in the Supplemental Materials). Using Fleiss’ (1971) kappa, interrater agreement was good for the sex-specific (kappa = 0.671) and general (kappa = 0.605) categories, but poor for the other/neither category (kappa = 0.364). Given that kappa is heavily influenced by post hoc endorsement rates (penalizing infrequently used categories), interrater agreement was also indexed using the Brennan-Prediger coefficient (Moss, 2023), which indicated good agreement for all three categories (0.700 for sex-specific; 0.606 for general; and 0.718 for other/neither). Consensus was reached on any variable without perfect agreement. Reliability coefficients were calculated using the irrCAC package in R (Gwet, 2019).
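The contrast between kappa and the Brennan-Prediger coefficient can be made concrete with a small sketch. This is one common multi-rater formulation (average pairwise agreement against a fixed 1/q chance level), not the authors' code (they used the irrCAC package in R), and the demo ratings are hypothetical.

```python
def brennan_prediger(ratings, n_categories):
    """Multi-rater Brennan-Prediger agreement coefficient.

    `ratings` is a list of items, each a list of the category labels
    assigned by every rater. Observed agreement is the average
    proportion of agreeing rater pairs per item; chance agreement is
    fixed at 1/q for q categories, so (unlike kappa) rarely endorsed
    categories are not penalized.
    """
    total = 0.0
    for item in ratings:
        r = len(item)
        pairs = r * (r - 1) / 2
        agree = sum(1 for i in range(r) for j in range(i + 1, r)
                    if item[i] == item[j])
        total += agree / pairs
    p_obs = total / len(ratings)
    p_chance = 1 / n_categories
    return (p_obs - p_chance) / (1 - p_chance)

# Hypothetical ratings by three coders into three categories
# (sex-specific, general, neither) -- illustrative only.
demo = [
    ["sex", "sex", "sex"],
    ["sex", "sex", "general"],
    ["general", "general", "general"],
    ["sex", "general", "neither"],
]
```

Because the chance term is a constant 1/q rather than a function of the observed marginal rates, a category used on only a handful of variables (like other/neither here) can still show good agreement.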
Data Use
The current study was conducted under a data sharing agreement with Public Safety Canada. This study received ethics approval from the Dalhousie University Research Ethics Board (#2021-5662). The recidivism data set used in this study has been previously used by Aelick et al. (2020) in a study of mental health variables, Lee et al. (2020) in a meta-analysis on the predictive validity of sexual recidivism risk tools for individuals of Indigenous heritage in Canada, and Hanson et al. (2024) in a study of sexual recidivism rates.
Analysis Plan
Differences between the recidivists and nonrecidivists were computed for the case–control and prospective cohort designs. The question of primary interest was the difference between these differences (i.e., differences in effect sizes). The effect size for ordinal (e.g., no = 0, possibly = 1, yes = 2), count (e.g., number of prior offenses), and interval-like variables (risk scale scores, age) was the area under the curve (AUC), which is insensitive to base rates and extreme values (Ruscio, 2008). For dichotomous variables, the effect size was the logged odds ratio, with 0.5 added to each cell as a variance stabilization adjustment (Fleiss et al., 2003; Hanson, 2022). These were calculated in logit metric and reported in odds ratio metric for ease of interpretation. When interpreting the effect sizes, the following benchmarks can be used: small (.56 to .63), moderate (.64 to .70), and large (> .71) for AUCs (Rice & Harris, 2005); small (.71 to .43; 1.4 to 2.3), moderate (.44 to .27; 2.4 to 3.7), and large (< .27; > 3.7) for odds ratios (see Hanson, 2022).
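The two effect size metrics can be sketched as follows. This is a minimal illustration, not the authors' code: a rank-based AUC (probability that a randomly chosen recidivist scores above a randomly chosen nonrecidivist, ties counted as half) and the logged odds ratio with 0.5 added to each cell.

```python
import math

def auc_rank(recid_scores, nonrecid_scores):
    """AUC as the probability that a randomly chosen recidivist scores
    higher than a randomly chosen nonrecidivist (ties count as 0.5)."""
    wins = 0.0
    for r in recid_scores:
        for n in nonrecid_scores:
            if r > n:
                wins += 1.0
            elif r == n:
                wins += 0.5
    return wins / (len(recid_scores) * len(nonrecid_scores))

def log_odds_ratio(a, b, c, d):
    """Logged odds ratio for a 2x2 table with 0.5 added to each cell
    as a variance stabilization adjustment (a = recidivists with the
    factor, b = recidivists without, c = nonrecidivists with,
    d = nonrecidivists without). Returns the log OR and its SE."""
    a, b, c, d = a + 0.5, b + 0.5, c + 0.5, d + 0.5
    lor = math.log((a * d) / (b * c))
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    return lor, se
```

`math.exp(lor)` converts back to the odds-ratio metric reported in the tables.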
The differences between the effect sizes were reported in several ways. First, we simply counted the number of times that the effect size was larger (in the expected direction) in the case–control analyses compared to the prospective cohort analyses. When the effect sizes were larger in one design than the other, we considered that the results favored that design. Next, we counted the number of times the magnitude of these differences was larger than expected by chance based on the standard error of the differences.
The generic form for the standard error of a difference score is as follows (Ley, 1972): σ(a−b) = √(σa² + σb² − 2rσaσb), where σa and σb are the standard errors of the two effect size estimates and r is the correlation between them.
In our study, r (the correlation between the variables) was computed separately for the AUC values (r = 0.7026, k = 51) and the logged odds ratios (r = 0.5497, k = 33). The variances for the AUC values were calculated using IBM/SPSS Version 29.0.2.0; the variances of the logged odds ratios were calculated in Microsoft Excel using the standard formula with 0.5 added to each cell (Hanson, 2022). Ninety-five percent confidence intervals for the differences were calculated as ± 1.96σa-b. When the confidence intervals do not include zero, they are statistically significant at the .05 alpha level. These confidence intervals should be considered approximate because the application of the normal probability distribution to Ley’s (1972) formula assumes equal variances (at the population level), which is unlikely to be the case for effect sizes based on different sample sizes. Nevertheless, the relative consistency of sample sizes within each design made the standard error of the differences a useful metric for identifying plausible differences across designs. No correction was made for multiple comparisons because keeping the p value at .05 provided a more sensitive test of the differences of effects between the two designs. All numbers in the case–control and prospective cohort analyses were verified independently by the first and second authors.
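The confidence-interval computation described above can be sketched as follows. The inputs in the usage example are illustrative values, not the study's per-variable estimates.

```python
import math

def diff_ci(es_a, var_a, es_b, var_b, r, z=1.96):
    """95% CI for the difference between two correlated effect size
    estimates, using the standard error of a difference score:
    sigma(a-b) = sqrt(var_a + var_b - 2 * r * sd_a * sd_b)."""
    se = math.sqrt(var_a + var_b
                   - 2 * r * math.sqrt(var_a) * math.sqrt(var_b))
    diff = es_a - es_b
    return diff - z * se, diff + z * se
```

For example, with hypothetical AUCs of .55 and .60, equal variances of .01, and r = .7026, the resulting interval spans zero, so that difference would not be significant at the .05 level.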
Results
Overall, the relationship of the variables to sexual recidivism was similar in the case–control and prospective cohort studies, with the expected trend toward larger effects in the prospective design. Of the 82 comparisons, 50 favored the prospective design (i.e., were larger) and 32 favored the case–control design. However, most of these differences were small. Only 27 of the 82 comparisons were statistically significant (17 favoring prospective; 10 favoring case–control; using p < .05 without correction for multiple comparisons). The same patterns were found for the static, historical variables (Table 1: 30 favored prospective; 20 favored case–control) and the stable dynamic variables (Table 2: 21 favored prospective; 11 favored case–control). Given that the criminal history variables were used to match cases, it is not surprising that all eight of the criminal history variables in Table 1 showed larger effects in the prospective study than the case–control study; five of these differences were statistically significant. The RRASOR, which also guided case matching, showed an AUC of .55 in the case–control study and .60 in the prospective study.
Effect Sizes for Sexual Recidivism of Static, Historical Variables in a Case–Control Design Compared to a Prospective Design
Note. AUC used for ordinal and interval variables; odds ratios used for dichotomous variables. All differences were computed such that positive values (for AUC values) and ratios greater than one (for odds ratios) indicate the effects were larger (in the expected direction) for the case–control design than for the prospective design. For the standard error of the differences, the correlation was estimated at 0.7026 (n = 51) for the AUC values and 0.5497 (n = 33) for the logged odds ratios. Differences for AUC values are not significant when the confidence interval includes zero. Differences in the odds ratios are not significant when the confidence interval for the ratio includes 1. Bolded values indicate that the comparison was statistically significant (p < .05). AUC = area under the curve; CI = confidence interval; VRAG = Violence Risk Appraisal Guide; RRASOR = Rapid Risk Assessment for Sex Offense Recidivism.
Difference not calculated because there was no expectation as to the direction of the effect.
Effect Sizes for Sexual Recidivism of Stable, Dynamic Variables in a Case–Control Design Compared to a Prospective Design
Note. AUC used for ordinal and interval variables; odds ratios used for dichotomous variables. All differences were scaled such that positive values (for AUC values) and ratios greater than one (for odds ratios) indicate the effects were larger (in the expected direction) for the case–control design than for the prospective design. For the standard error of the differences, the correlation was estimated at 0.7026 (n = 51) for the AUC values and 0.5497 (n = 33) for the logged odds ratios. Differences for AUC values are not significant when the confidence interval includes zero. Differences in the odds ratios are not significant when the confidence interval for the ratio includes 1. Bolded values indicate that the comparison was statistically significant (p < .05). AUC = area under the curve; CI = confidence interval.
Difference not calculated because there was no expectation as to the direction of the effect.
Of the 82 variables, 62 were classified prior to analyzing the data as either sex-crime-specific risk factors (28 variables) or general risk factors (34 variables; see S1 in Supplemental Materials). Of the 28 sex-crime-specific variables, equal numbers showed larger effects in the case–control design (14 variables) and the prospective design (14 variables); only seven of these comparisons were statistically significant (four favoring case–control; three favoring prospective). For the 34 general crime variables, the effects tended to be larger in the prospective design (21 variables) than in the case–control design (13 variables); again, only a small number of these comparisons were statistically significant (seven favoring prospective; four favoring case–control). This pattern of results was opposite to the pattern we predicted (we had expected the sex-crime-specific variables to show larger effects in the prospective design than in the case–control design). Tables summarizing the findings can be found in S2 and S3 of the Supplemental Materials.
Discussion
The choice of a specific research method is guided by the strengths and limitations of the different approaches, their costs, and the available resources. When identifying risk factors for sexual recidivism, case–control studies are often used because they are relatively easy to implement and require fewer resources (e.g., time, money) than prospective studies. Despite these advantages, prospective studies are still considered the gold standard because the temporal order between risk factors and the outcome of interest is easily established. But to what extent are the findings between the two research designs equivalent? To our knowledge, no other study of sexual recidivism risk factors has compared the results obtained from a case–control study to those of a prospective follow-up of the same sample. Our results support the following conclusions: (a) case–control and prospective cohort designs can provide similar information on risk factors for sexual recidivism; (b) matching on specific factors in case–control studies can have unexpected effects on the observed relationships; and (c) the factors identified through both designs are largely consistent with the broader literature on sexual recidivism risk.
Case–Control vs. Prospective Cohort Designs
Given the strengths associated with prospective cohort designs (e.g., established temporal order), we expected that more risk factors would differentiate sexual recidivists from nonrecidivists in the prospective analyses compared to the original case–control study. Although there were more comparisons that favored the prospective cohort rather than case–control design, the differences were small and mostly nonsignificant. There also did not appear to be any consistent pattern in whether static or stable dynamic risk factors emerged in either research design. Slightly more of the comparisons of stable dynamic risk factors favored the prospective rather than the case–control design; however, once again, these differences were small and mostly nonsignificant. These results provide support for the ability of case–control designs to identify risk factors for sexual recidivism. Not all case–control studies, however, are created equal: it is necessary to consider the quality of the study. In this case, Hanson and Harris (2000) relied on information from multiple sources (e.g., file information, interviews with parole officers) and made a concerted effort to establish the temporal order of factors, which may have contributed to the replicability of the original findings using a prospective design.
Given that Hanson and Harris (2000) matched on static sex-specific factors, we expected that this matching would amplify general risk factors in the case–control design whereas sex-specific risk factors would be more prominent in the prospective cohort design. Unsurprisingly, the factors that were used in matching for the case–control design (e.g., victim type—offended against boy; sexual offending history) showed stronger effects in the prospective design than in the case–control design. Contrary to expectations, however, sex-specific risk factors were otherwise equally distributed between the case–control and prospective designs; instead, general risk factors were more likely to be identified in the prospective cohort design. Our assumptions concerning the effect of matching on sex-specific factors in the case–control study were, in this case, completely wrong!
These findings highlight a key criticism of case–control studies: the effect that matching on specific risk factors will have on other, correlated factors is difficult, if not impossible, to discern a priori (Meehl, 1970). The goal of matching on static, sex-specific risk factors in the original Hanson and Harris (2000) study was to amplify the effects of dynamic risk factors, given that identifying dynamic factors was the main goal of the study. However, the two main constructs predicting sexual recidivism (sex-crime-specific and general antisociality) are likely correlated and share common variance (Babchishin et al., 2012). The unanticipated consequence of this matching may have been to suppress both sex-crime-specific and general antisociality factors in the case–control study. Despite this limitation, the pattern of results for individual risk factors was surprisingly consistent across both designs. Although the strength of the effect size sometimes changed, the direction of the relationship between the risk factor and sexual recidivism was generally consistent across both research designs, which provides further support for the use of case–control studies in the identification of risk factors for sexual recidivism.
Risk Factors Overall
The risk factors that differentiated between sexual recidivists and nonrecidivists across both study designs were largely consistent with the broader literature on risk and protective factors for sexual recidivism (Hanson & Bussière, 1998; Hanson & Morton-Bourgon, 2004; Helmus & Thornton, 2015). When examining static, sex-specific risk factors, for example, indicators of sexual deviancy, victim type (e.g., boys), and sexual offending history were all related to sexual recidivism. When examining static, general risk factors, indicators of antisocial personality (e.g., psychopathy scores, diagnosis of antisocial personality disorder) were good predictors of sexual recidivism in both designs. Somewhat surprisingly, the relationship between the VRAG and sexual recidivism, which was among the largest in the case–control design, became even larger in the prospective design, despite the scale not having been designed to predict sexual recidivism. The VRAG and its updated version (VRAG-R; Rice et al., 2013) have shown significant discrimination between sexual recidivists and nonrecidivists in past studies (.62 and .63, respectively; Olver & Sewall, 2018); however, the effect size in the prospective cohort design was larger (.76 compared with .62 in Olver and Sewall's [2018] study). Although the VRAG may be related to sexual recidivism, the recommendation, based on the results of larger meta-analyses, is to use tools that were designed to predict sexual recidivism (e.g., Hanson & Morton-Bourgon, 2009).
Sex-specific dynamic factors that emerged in both designs included attitudes supportive of sexual offending (i.e., rape, child molestation, and sexual entitlement), victim access, and being on antiandrogen medication. Dynamic general risk factors that significantly predicted sexual recidivism were consistent with the Central Eight risk/need factors (Bonta & Andrews, 2024) and included substance use, negative social influences, and antisocial lifestyle.
Somewhat inconsistent with the broader literature on sexual recidivism risk was the positive and significant relationship between childhood sexual abuse and sexual recidivism across both research designs. Although the rates of childhood sexual abuse are relatively high among men who have committed a sexual offense compared to the general population (Whitaker et al., 2008; evidence from case–control studies), previous prospective studies have not found a consistent association between a history of sexual abuse and subsequent recidivism. Meta-analyses of adult samples have found no association between childhood sexual abuse and sexual recidivism (e.g., Hanson & Bussière, 1998), whereas at least one meta-analysis of youth samples has found a significant association between childhood sexual abuse and sexual recidivism (Mallie et al., 2010). Further examination of this potential relationship is required and may have implications for the type of interventions adopted for individuals with abuse histories (e.g., trauma-informed care; Dalsklev et al., 2021).
Other findings may have implications for the scoring of specific tools designed to predict sexual recidivism. Consistent with other studies (Hanson & Thornton, 2003), the number of sexual index offenses was not significantly related to sexual recidivism, whereas a history of past sexual offenses was. This pattern of results supports the decision by scale developers to exclude considerations of index offenses and, instead, focus on the density of past sexual offending (e.g., Static-99/R [Hanson & Thornton, 2000; Helmus et al., 2012] and Static-2002R [Hanson & Thornton, 2003; Helmus et al., 2012]). Another finding with potential scoring implications pertained to the interaction between the sex and age of the victim. Although having male victims has long been accepted as an established risk factor for sexual recidivism, the effect was only evident for boy victims; adult male victims showed the opposite effect, indicating lower risk. If this pattern holds, it will have consequences for the scoring of "male victim" items in sexual recidivism risk tools, such as Static-99R, Static-2002R, Risk Matrix-2000/S (Thornton et al., 2003), and the Minnesota Sex Offender Screening Tool-4 (MnSOST-4; Duwe, 2019), all of which have an item for male victims that does not differentiate between boys and men.
Implications
Meehl (1970) doubted the utility of case–control designs: "To put it most extremely, the so-called ex post facto 'experiment' is fundamentally defective for many, perhaps most, of the theoretically significant purposes to which it has been put" (p. 374). Although case–control studies introduce a certain amount of uncertainty, especially concerning the effects of matching, the results of the current study do not support Meehl's dire claim. On the contrary, risk factors identified in the case–control design tended to also emerge in the prospective design; for no variable was there a reversal of a risk factor between the two designs (i.e., statistically significant in different directions). Differences between the designs were usually small and mostly nonsignificant, indicating that case–control studies can and should be used to identify risk factors, especially for outcomes where prospective designs may be impractical, time-consuming, and expensive.
Of course, not all case–control studies are created equal; the quality of the study design will matter for the validity of the results. A primary concern with case–control studies is that the ease with which they can be implemented may lead to less rigor than the design requires (Rothman, 1986; Schulz & Grimes, 2002). Several recommendations can be made to strengthen the case–control study design. The most important consideration is ensuring equivalency between the two groups (Schulz & Grimes, 2002; Song & Chung, 2010). To control for systematic differences, both groups should come from the same population, location, and setting. In other words, the control group should be representative of the population from which the individuals who experienced the outcome of interest were drawn (Schulz & Grimes, 2002).
Once cases are preselected, additional individual-level matching variables may be necessary to control for factors that could influence the results but that have no substantive connection to the research questions (e.g., age, sex, any factor that is considered a confound; Song & Chung, 2010). Caution should be taken and a strong rationale provided for matching on these factors given that the overall effect that matching will have on the comparisons between the groups cannot be fully known. It should also be noted that the limitations associated with matching do not simply disappear in prospective cohort designs. Preselection still occurs, and the effects of this preselection on the results are also uncertain (Mann, 2003).
Another consideration in case–control designs is the quality and the sources of information used to identify and measure the risk factors and outcome of interest. Whenever possible, multiple sources of information should be used, and every attempt should be made to establish that risk factors occurred before the outcome of interest. In the case of Hanson and Harris (2000), information was gathered from extensive file review and through interviews with probation/parole officers. Furthermore, additional steps were taken to establish temporal ordering of the factors and the sexual recidivism outcome. For example, during interviews, parole officers were specifically asked to think about their clients 6 months prior to the current interview and to compare any changes in the factors. Case–control studies that are carefully planned are those most likely to provide reliable and valid results (Schulz & Grimes, 2002).
In the current study, some of the observed differences between the effect sizes for the case–control and prospective studies would have been due to differences in study design and some would have been due to random error. Although prospective designs generally have more credibility than case–control designs, we do not recommend that evidence-based practitioners privilege findings from one or another of the research designs in the current study because most of the apparent differences are likely attributable to random error. Instead, we recommend that practitioners look to meta-analytic reviews for the best evidence of risk and protective factors. The current findings do suggest, however, that case–control studies should be included in systematic and meta-analytic reviews, and that reviewers should examine study design as a moderator variable.
The current findings contrast with those of Weisburd et al. (2001), who found meaningfully different results between stronger and weaker study designs. Specifically, they found that evaluations of criminal justice programs that used randomized assignment found smaller effects, and were more likely to identify adverse effects, than observational studies. Although there is no single "scientific method," some research designs assume canonical importance in certain research communities (one meaning of Kuhn's [1970] paradigms). For example, Kraemer and colleagues (1997) restrict the use of the term "risk factor" to variables for which temporal precedence has been demonstrated, and explicitly rule out case–control studies as the basis of valid inferences concerning potentially causal factors. We think they have gone too far. In particular, evidence-based practitioners who inferred psychologically meaningful risk factors from the Hanson and Harris (2000) case–control study would have stood on solid ground. The difficulty, of course, is that all research is fallible. Scientific inquiry is a problem-solving task that cannot be reduced to any particular set of research designs (Hanson, 1958; Wolman, 1971). The current findings raise the confidence that should be accorded to case–control designs but do not diminish the importance of truly prospective studies, as well as other modes of inquiry. Scientific conclusions are strengthened by replication within a progressing, orderly program of research (Lakatos, 1970).
Limitations
Although novelty is a virtue, it is also a limitation. As far as we are aware, this is the only study of recidivism risk factors that has directly compared the results of case–control and prospective designs. Consequently, any conclusions must be tempered because they are based on only this one sample. Whether the findings would generalize to other comparisons between case–control studies and prospective cohort designs needs to be examined. One of our main recommendations is that researchers with access to case–control samples should, when possible, plan for a prospective follow-up of these same individuals. Another limitation to consider is the sample size for the prospective cohort design, which was relatively small because only the nonrecidivists from the original case–control study were included. Power to detect differences in effect sizes between the two designs was likely attenuated. Despite these limitations, the quality of the information from the original case–control study and the exceptionally long follow-up time of the prospective cohort study provide confidence in the validity of the risk factors that were identified across both designs.
Conclusion
Overall, case–control studies can and should be used when determining risk factors for low-frequency outcomes, such as sexual recidivism. The quality of the case–control study matters, and every effort should be made to collect high-quality information from multiple sources and to establish the temporal order of the factors and the outcome of interest. Although matching may be necessary in case–control designs, researchers should recognize that it is difficult to anticipate the effect that matching will have on the results. Replication studies across diverse samples are therefore needed and, when possible, samples in case–control studies should be followed up prospectively to compare results across the different methodological designs.
Supplemental Material
sj-docx-1-cjb-10.1177_00938548241291155 – Supplemental material for Where Should We Intervene, 20 Years Later? Case–Control and Prospective Cohort Designs Provide Similar Answers by Julie Blais, R. Karl Hanson and Andrew J. R. Harris in Criminal Justice and Behavior
Data Availability Statement
The current study was conducted under a data sharing agreement with Public Safety Canada.
