Abstract
To improve our understanding of the latent constructs of intimate partner violence (IPV) risk, we explored the underlying factor structure of combined items from three IPV risk assessment tools and examined whether the factors predict recidivism outcomes. Data were examined for 251 adult men who were charged with violence against their past or current female intimate partners and whose files were referred for a comprehensive threat assessment from 2010 to 2016 in Canada. Results suggested six underlying risk factors, two of which significantly predicted IPV, any violent, and any recidivism outcomes in a 4-year average follow-up with 227 men. However, only one factor (
Keywords
Intimate partner violence (IPV)—generally defined as the actual, attempted, or threatened physical harm of a current or former intimate partner—represents the largest single category of calls to police in North America (e.g., Burczycka et al., 2018; Klein, 2009). However, knowledge gaps remain about how best to intervene in IPV cases, particularly given the finite resources available to the criminal legal system. Considerable research demonstrates that the effects of criminal legal and therapeutic interventions are maximized when their intensity is determined by the level of risk (e.g., Risk-Need-Responsivity [RNR] principles; Bonta & Andrews, 2024). Effective risk assessment helps identify persons who are in the highest risk category and thus is considered the cornerstone of offender management (Bonta & Andrews, 2024). Thus, optimizing IPV risk assessment tools is imperative to enhancing IPV risk assessment and risk management.
Theoretical understandings of violence risk are important to the development of risk assessment instruments, yet the available theories on IPV recidivism have been noted for their failure to produce predictive models (Bell & Naugle, 2008). Although the literature contains a plethora of theories on factors contributing to the onset of IPV, theories on risk factors predictive of repeated IPV (or IPV recidivism) remain scarce. For example, Hilton (2021) notes that risk factors that explain why people commit IPV in the first place are not necessarily useful for distinguishing who—among men who have encountered the criminal legal system for IPV—will recidivate. Instead, researchers must rely on risk factors from empirical research, literature reviews, or clinical observations to develop risk assessment tools for IPV recidivism. While this approach is helpful to identify the strongest predictors of IPV recidivism, it does little to further our understanding of the relationship between the latent factors that underlie these risk items and recidivism. Furthermore, it is unclear whether commonly used IPV risk tools contain overlapping risk constructs, as well as the most relevant risk information to reliably predict recidivism. Until well-established theories are available to inform IPV risk assessment, researchers must use a bottom-up approach to understand the underlying factor structure of IPV risk, which would then inform new or refine current theories of IPV recidivism. Therefore, the present study aims to explore the factor structure of commonly used IPV risk assessment measures and examine whether the underlying risk factors that comprise them are strongly related to, and predict, IPV recidivism.
IPV Risk Assessment Measures and Their Factor Structure
Several violence risk assessment tools have been developed to assess the risk of repeat IPV. One meta-analysis identified 39 different measures that were either developed for or applied to IPV risk assessment (van der Put et al., 2019), with the Ontario Domestic Assault Risk Assessment (ODARA; Hilton et al., 2004) and Spousal Assault Risk Assessment guide (SARA; Kropp & Hart, 2000) being among the most used tools. The ODARA is a 13-item risk assessment tool (see Table 1 for full list of items) designed for use by frontline police officers and other workers at the scene of an IPV call (Rice et al., 2010) to evaluate the risk of man-to-woman IPV recidivism among men who have been identified as having committed at least one act of IPV (Hilton, 2021; Hilton et al., 2004; Hilton, Harris, & Rice, 2010). The tool has good predictive validity for IPV recidivism with AUCs ranging from .65 to .74 in individual studies and meta-analyses (Hilton et al., 2008; Hilton & Harris, 2009; Messing & Thaller, 2013; van der Put et al., 2019).
List of ODARA, SARA, and B-SAFER Items
The ODARA is an actuarial tool developed through empirical research and interpreted using statistical data with minimal theoretical influence on the potential predictors studied. Items in actuarial risk assessment tools are predetermined and selected because they are easily accessible and have been empirically demonstrated to increase risk of recidivism through follow-up studies. These tools also have explicit rules for scoring individual items, and there is typically an algorithm for obtaining a total score related to numeric risk of recidivism in actuarial tables. Correspondingly, the ODARA items have good predictive value but limited explanatory value.
By contrast, the SARA and the abbreviated Brief Spousal Assault Form for the Evaluation of Risk (B-SAFER) are structured professional judgment (SPJ) tools comprising risk factors rationally derived from literature reviews and clinical considerations (Kropp & Hart, 2000; Kropp et al., 1999). These tools typically contain a structured list of predetermined risk factors that are empirically or theoretically related to recidivism, but the method of combining items into a total score is not specified, and the overall risk evaluation is left to the professional judgment of the evaluator (Boer et al., 1997). The SARA was developed as an inventory of risk factors for IPV cases (Kropp et al., 1995, 1999) and contains 20 items divided into five content areas (referred to as
The B-SAFER is a 15-item (see Table 1 for full list of items) version of the SARA designed for use by police officers (Kropp & Hart, 2004) and was developed with the purpose of improving upon the limitations of the SARA. Empirical studies have found some support for its predictive validity. For example, the summed risk score predicted any recidivism in 40 men imprisoned for an offense against an intimate partner (AUC = .76; Loinaz, 2014). However, it did not significantly predict IPV recidivism in 158 men court-ordered to treatment for violence or stalking (AUC = .55; Gerbrandij et al., 2018). The SARA and B-SAFER are organized with a focus on their practical utility in the field rather than based on the authors’ conceptualization of an underlying factor for IPV risk.
Currently, risk of IPV recidivism is understood as involving distinct constructs (e.g., McDonagh et al., 2024), but the precise number and nature of these constructs are unclear. For example, the SARA and B-SAFER items are categorized into four and three sections, respectively (see Table 1), suggesting that both measures contain multiple underlying risk factors. While the factor structure of the SARA has been empirically examined by Hilton, Harris, and Rice (2010), the B-SAFER’s factor structure has yet to be empirically investigated in the literature.
Only two published studies have empirically examined the factor structure of IPV risk measures. Hilton, Harris, and Rice (2010) conducted an exploratory factor analysis (EFA) of combined items from the ODARA, Danger Assessment (Campbell, 1986), SARA, Domestic Violence Screening Instrument (Hisashima, 2008), and Psychopathy Checklist–Revised (Hare, 1991). They found four distinct underlying risk factors. Items in the first factor reflected attitudes toward IPV, plus socio-affective, emotional, and interpersonal aspects of psychopathy. Items in the second factor reflected antisocial patterns (e.g., general criminality, substance abuse, prior arrests). Items in the third factor reflected IPV-related technical violations. Items in the fourth factor reflected threats, use of weapons, and sexual assault items from all included measures. Pham and Jung (2022) factor analyzed ODARA items using a sample of 234 men charged for a violent offense against their past or current female intimate partners and found that the measure comprised three distinct factors related to antisocial patterns, victim vulnerabilities, and index offense.
Both Hilton, Harris, and Rice (2010) and Pham and Jung (2022) found that factors reflecting the antisocial patterns construct strongly predicted IPV recidivism, consistent with prior research demonstrating the utility of the antisocial patterns construct to predicting IPV recidivism (e.g., Fitzgerald & Graham, 2016; Hilton & Radatz, 2021; López-Ossorio et al., 2017). However, both studies found that many of the other underlying risk factors were unrelated to IPV recidivism, such as those reflecting characteristics of the index offense and victim circumstances. These findings are inconsistent with the literature (e.g., Connor-Smith et al., 2011; López-Ossorio et al., 2017), underscoring the need to not only explore the latent constructs that comprise commonly used risk assessment tools but also to examine whether they are relevant for the prediction of IPV recidivism.
The Present Study
Given the popularity of the ODARA and SARA in IPV risk assessment (van der Put et al., 2019), as well as the development of the B-SAFER (and ODARA) for frontline use (Kropp & Hart, 2004), the aim of the present study was to shed light on the factor structure of IPV risk as measured by these three tools. To that end, first, we factor analyzed their combined items. Next, we examined whether the resulting factors significantly predicted general, any violent, and IPV recidivism, using a series of receiver operating characteristic (ROC) and Cox regression analyses. Given past findings of at least three latent factors that comprise IPV risk assessment measures (i.e., Hilton, Harris, & Rice, 2010; Pham & Jung, 2022), we hypothesized that the EFA would also reveal at least three factors. We also expected that few factors would significantly predict recidivism outcomes. Relatedly, we expected that factors reflecting antisocial patterns would significantly and independently predict recidivism outcomes better than other factors.
Method
Sample
Our sample consisted of 300 adult men charged or accused of violence against their past or current women intimate partners and whose files were referred between 2010 and 2016 to a comprehensive threat assessment service in Canada (Ennis et al., 2015). The men’s mean age was 35.3 (
Measures
Ontario Domestic Assault Risk Assessment
The Ontario Domestic Assault Risk Assessment’s (ODARA) 13 items (see Table 1) are scored dichotomously as “1”
Spousal Assault Risk Assessment
The Spousal Assault Risk Assessment’s (SARA) 20 items (see Table 1) are scored using a 3-point nominal scale, where “0” indicates
Brief Spousal Assault Form for the Evaluation of Risk
The Brief Spousal Assault Form for the Evaluation of Risk’s (B-SAFER) 15 items (see Table 1) are also scored using the same 3-point scale as the SARA. As with the SARA, assessors score the B-SAFER items, which then inform a summary rating of risk. Note that the B-SAFER also includes
Recidivism
We coded criminal record data from local, provincial, and federal sources retrieved from 2016 to 2020, specifically incidents following the date of release from custody. We used three recidivism outcomes: (a) Any new convictions and/or charges, excluding supervision or technical violations (Any Recidivism), (b) any new violent convictions and/or charges (e.g., assault) against anyone (e.g., strangers, acquaintances, partners, family members, etc.; Any Violent Recidivism), and (c) any new violent convictions and/or charges against a past or current intimate partner (including, but not limited to, the index partner, which is appropriate since the tools we used were not designed to assess risk against only one victim; Any IPV Recidivism). The average follow-up was 4 years (
Procedure and Sources of Information
This research was approved by the research ethics boards at the authors’ four primary institutions. A formal notice of collaboration was obtained from the Integrated Threat and Risk Assessment Centre where the study took place. We identified 300 cases referred by police for threat assessment between 2010 and 2016 based on two criteria, such that cases must have: (1) full or provisional threat assessment reports to ensure that sufficient documents and information were available to reliably extract, and accurately code, data, and (2) an identifiable index offense that matched with the ODARA-defined index offense (Hilton et al., 2004). From 2015 to 2021, all cases were coded for the ODARA, SARA, and B-SAFER items, and recidivism outcomes (see Measures) by undergraduate and graduate research assistants who completed standard online training on all three measures (see Hilton & Ham, 2015, for the ODARA training description).
While coders were blind to information on reoffending and other relevant behaviors after the assessment date, they had access to the same case file information that threat assessors used for their evaluation. These files included a referral form (e.g., demographic information, historical occurrences of IPV), official criminal records, and police occurrence documents (e.g., investigator notes, documented evidence, arrest details). In addition, reports by the assigned certified threat assessor typically included a case history (e.g., childhood, employment, substance use history), list of criminal convictions/sentences, intimate relationship history including associated violence, case management recommendations, and risk assessment (e.g., ODARA, SARA, or both). Some files also included reports from mental health agencies and victim interview documentation. To identify any post-index violent and nonviolent offenses at follow-up, we retrieved criminal records up to 2020, and the corresponding police occurrence reports for any post-index violent charges to identify whether the victim was an intimate partner.
To ensure continued adherence to coding instructions and avoid coding drift or complacency, coders were tested for coding accuracy on at least one case previously coded by the authors and at least one subsequent test case (also previously coded by the authors). These procedures, including examination of IRR with 32 cases, have been described in a previous publication (Hilton et al., 2021). Interrater reliability (ICC) was above .80 for ODARA and B-SAFER items (past items only; see Measures) and between .70 and .79 for SARA items.
Statistical Analyses
We explored the factor structure of the ODARA, SARA, and B-SAFER Past items using EFA in MPlus Version 8.5 (Muthén & Muthén, 2015). First, we analyzed a polychoric correlation matrix of the measured items to reveal any underlying processes in the scales (Floyd & Widaman, 1995) because polychoric correlations are ideal with linearly related, ordinal, and binary variables (Flora & Curran, 2004; Holgado-Tello et al., 2010). We then used the weighted least square estimator method to extract factors as it is robust against small sample sizes, any violations of normality (Flora & Curran, 2004) and performs well regardless of ceiling and floor effects in ordinal data with sample sizes as small as 200 (Brown, 2006). For factor retention, we examined the scree plot, Kaiser’s criterion, parallel analysis, and Velicer’s minimum average partial test. To maximize interpretability, we used the oblique rotation method, which computes the degree to which items load onto factors (Tabachnick & Fidell, 2019).
We used general rules regarding cutoff values as guidelines for factor structure fit: root mean square error of approximation (RMSEA; no greater than .06; Hu & Bentler, 1999), comparative fit index (CFI; .95 or greater; Brown, 2006), and standardized root mean square residual (SRMR; no greater than .08; Hu & Bentler, 1999; Schmitt, 2011). In addition to placing items with factor loadings of .30 to .40 or greater onto a factor (Cudeck & O’Dell, 1994; Schmitt & Sass, 2011), we also investigated whether the standardized factor loading significantly differed from zero to assess whether an item significantly loaded onto a factor (Cudeck & O’Dell, 1994; Schmitt, 2011; Schmitt & Sass, 2011). To determine the significance and protect against Type 1 error, a correction procedure for correlated factors was used to compute the appropriate α level (see Cudeck & O’Dell, 1994; Schmitt, 2011). The
We generated AUC statistics to examine each factor’s predictive validity for predicting all recidivism outcomes via the ROC analyses in SPSS version 27. AUC is the most used and recommended effect size statistic for evaluating the prediction of recidivism and is typically preferred to other measures of predictive accuracy that are more affected by recidivism base rates (Helmus & Babchishin, 2017; Rice & Harris, 2005). AUC values are traditionally interpreted as the probability of a recidivist scoring higher than a non-recidivist on a given measure. Generally, AUCs above .50 indicate prediction exceeding chance and AUCs of above .55, .63, and .70 correspond to small, moderate, and large effects, respectively (Rice & Harris, 2005).
Next, we conducted pairwise comparisons of the AUCs to test for significant differences in the factors’ predictive validity, using a Bootstrap test (Carpenter & Bithell, 2000). ROC analyses were generated using the pROC statistical package in R (Robin et al., 2011) and all statistical analyses were based on the nonparametric bootstrapping method (Carpenter & Bithell, 2000), with 10000 replications, to test for significant differences in AUCs of two ROC curves. In each replication, the original measurements were resampled with replacement, where the same number of case and control observations as in the original sample were selected in each bootstrap replicate. Both new ROC curves corresponding to this new sample were built, and the resampled AUCs and their difference,
In addition, we conducted Cox regression analyses in SPSS version 27 to test the incremental predictive validity of each EFA factor. We computed the length of time until recidivism from the date of release from custody to the first date of post-index arrest or charge for a new IPV offense and any new violent offense for survival analyses. Incremental prediction was not tested for any recidivism because the date of the first general offense was not coded.
Results
Factor Structure of IPV Risk Measures
We entered 48 items from the ODARA, SARA, and B-SAFER Past items in the EFA to assess their underlying factor structure. Eleven items violated the multicollinearity assumption and were subsequently removed from the EFA, resulting in a final inclusion of 37 items in the EFA. Note that our sample size is smaller than previously recommended (e.g., Comrey & Lee, 1992; Tabachnick & Fidell, 2019); however, we conducted the Kaiser–Meyer–Olkin (KMO; Kaiser, 1974) test for sampling adequacy and found that the sampling is sufficient, with an overall KMO value of .67, exceeding the recommended cutoff value of .60 (Kaiser, 1970, 1974). Of the 300 cases, 49 were missing values on at least one ODARA item; therefore, the final sample in the EFA was 251. The general factor showed acceptable internal consistency, with omega total and omega hierarchical values of .85 and .72, respectively.
According to the factor retention methods, all of which suggested a different number of factors to retain, we examined the 9-, 7-, and 6-factor models further. Although the 9- and 7-factor structures had good fit statistics (see Supplemental Table S1 [available in the online version of this article] for model fit statistics for models 1 through 9), many items from both models cross-loaded (i.e., loaded onto more than one factor) and contained negative residual variances (i.e., variances of the observed variables after accounting for all shared variance), suggesting that the factors were unstable and the models are not a good fit for the data. We then examined the 6-factor model, which had good fit, with RMSEA of .031 (90% CI [.022, .039]), CFI of .916, and SRMR of .076. Only five items cross-loaded in the 6-factor model and the factors appeared to be more interpretable; that is, items similar in content clustered together (see Supplemental Table S2 for factor loadings). Therefore, the 6-factor model seemed to fit these data best, and together, the factors accounted for 37.99% of the variance.
Factor 1 had poor internal consistency (ω = .58; α = .56) and consisted of six items that concern the victim; thus, we labeled this factor
Prediction of Recidivism Outcomes
Table 2 displays the AUC values of the six EFA factors for predicting IPV, any violent, and any recidivism. Of the 251 men with complete scores on all risk assessment measures used in the present study, 24 men were further excluded from analyses on the prediction of recidivism outcomes because eight were missing follow-up data and 16 were not released from custody or were deceased at the time of follow-up, resulting in 227 men with complete risk assessment and follow-up data. For analyses predicting IPV recidivism only, seven men were additionally excluded because information on whether they violently reoffended against a past or current partner was unavailable. Eighty (36.4%) of the remaining 220 men had a post-index IPV offense. Results indicated that only Factors 2 (
Predictive Validities of Six Factors Retained from the Exploratory Factor Analysis for Recidivism Outcomes
For any violent recidivism, of the 227 men with complete risk assessment and follow-up data, 112 (49.3%) had a post-index violent offense. Results indicated that only Factors 2 (
For any recidivism, of the 227 men with complete risk assessment and follow-up data, 148 (65.2%) had at least one post-index offense. Results indicated that only Factors 2 (
Pairwise Comparisons Among EFA Factors
We also examined whether differences between paired ROC curves (see Supplemental Figures S1–S6) among EFA factors were statistically significant for all recidivism outcomes (see Table 3). For IPV recidivism, only one significant difference emerged between Factors 2 and 6 (
Pairwise Comparisons of the Six Factors Retained from the Exploratory Factor Analysis in the Prediction of Recidivism Outcomes
Incremental Validity of EFA Factors
Next, we examined whether any of the EFA factors incrementally predicted IPV and any violent recidivism over time. Among IPV recidivists (
Cox Regression Survival Analysis: Six Factors Retained From the Exploratory Factor Analysis in the Prediction of Intimate Partner Violence and Any Violent Recidivism
For any violent recidivism, 112 (49.3%) recidivated and the mean time until their first post-index arrest or charge for any violent offense was 505 days (
Discussion
Our factor analysis of the ODARA, SARA, and B-SAFER revealed that IPV risk is multidimensional, with at least six latent constructs, which is consistent with past research (Hilton, Harris, & Rice, 2010; Pham & Jung, 2022). As expected and consistent with past research (e.g., Hilton, Harris, & Rice, 2010; Pham & Jung, 2022), Factors 2 (
Surprisingly, Factor 6 (
We have previously commented on the difficulty of coding mental health and victim-vulnerability variables garnered during police investigations, which may result in failure to identify relationships between these variables and recidivism. Better information could facilitate more accurate coding of IPV- and victim-specific risk variables and permit a fairer test of their predictive validity. It is also possible that some risk factors are more relevant for specific perpetrator subtypes than for others. There is substantial evidence that men who engage in IPV are a heterogenous group who differ on general criminality and violence, severity of violence, antisocial and psychopathic traits, personality traits, weapon use, victim injury, and likelihood of recidivism (Holtzworth-Munroe & Meehan, 2004; Holtzworth-Munroe & Stuart, 1994; Peters et al., 2023). Indeed, violence severity may only be predictive of IPV recidivism for men who are generally violent/antisocial but not for those who exclusively target their family members (Goldstein et al., 2016), suggesting that the relationship between risk factors and IPV recidivism may depend on perpetrator subtype. Similarly, it is reasonable to hypothesize that Hilton, Harris, and Rice’s (2010) third factor comprising IPV-related technical violations might be more relevant for IPV offenders with personality problems related to anxious attachment (e.g., borderline/dysphoric abusers: Holtzworth-Monroe & Stuart, 1994) than for abusers with fewer emotional and behavioral self-management issues. Future research should assess the moderating effects of perpetrator subtype on the relationship between risk factors and IPV recidivism.
Limitations
Our sample size was smaller than previously recommended for factor analysis and tests of incremental validity (e.g., Comrey & Lee, 1992; Tabachnick & Fidell, 2019), which reduced statistical power and stability of our results. However, the KMO test for sampling adequacy revealed an overall value of .67, suggesting that the sampling is sufficient (Kaiser, 1970, 1974). It is also generally suggested that factors with a large number of high factor loadings are reliable regardless of sample size (Guadagnoli & Velicer, 1988). Our factors contained several items with high factor loadings (e.g., .50 and above; see Supplemental Table S2), suggesting that the factors may be reliable despite the small sample size. Interrater reliability was lower for items that assessed psychological constructs and victim concern (Hilton et al., 2021), which could have rendered the underlying factors invalid or unstable and reduced the predictive validity of the factors reported. Furthermore, this study was conducted with high-risk men in one specific law enforcement organization in Canada (see Ennis et al., 2015; Hilton et al., 2021); therefore, generalizability to other populations, jurisdictions, or agencies cannot be assumed. We also had limited information to examine the data from an intersectional perspective. A growing area of focus is intersectionality (e.g., Batastini et al., 2022), which examines how a person’s overlapping identities, such as gender, sexuality, race, and socioeconomic status intersect to shape their lived experiences, and potentially their responsiveness to intervention. Future research should attempt to replicate these findings through the lens of intersectionality, using a larger sample from other organizations with additional sources of data (i.e., interviews).
Given the high-risk nature of our sample, many men were likely subject to relatively intensive management efforts. If such efforts reduced their likelihood of recidivism, this could have attenuated the predictive validities found in the present study. Moreover, individuals who are arrested, charged, or convicted in relation to a general or other non-IPV crime may no longer be at risk for IPV while in custody for these crimes (i.e., competing risks of recidivism; Noordzij et al., 2013). However, we could not account for competing risks of recidivism because we did not collect the required information. Future research should attempt to replicate these findings while accounting for competing risks of recidivism, as well as investigate potential moderating effects of risk management interventions on the relationship between risk and recidivism.
Conclusion
Our findings suggest that there are multiple dimensions that comprise IPV risk, directing professionals toward further potential needs of men who engage in IPV. If future research replicates these findings and demonstrates a similar number and nature of these domains, then IPV interventions could eventually be tailored to focus on the risk domains most strongly associated with IPV recidivism, in line with RNR principles. The extent to which correctional services address characteristics that are most strongly related to offending behavior significantly affects their treatment success. Evidently, more work is needed to improve our current understanding of IPV risk, particularly the need for theoretically informed risk assessments.
Supplemental Material
sj-docx-1-cjb-10.1177_00938548251357789 – Supplemental material for The Factor Structure of Intimate Partner Violence Risk
Supplemental material, sj-docx-1-cjb-10.1177_00938548251357789 for The Factor Structure of Intimate Partner Violence Risk by Anna T. Pham, Kevin L. Nunes, N. Zoe Hilton, Liam Ennis and Sandy Jung in Criminal Justice and Behavior
Footnotes
Authors’ Note:
The views expressed are those of the authors and not necessarily those of the Department of National Defence Canada. This research was supported by funding from the Social Sciences and Humanities Research Council of Canada (Grant #890-2014-0007). Dr. Hilton is an author of the ODARA and declares a financial interest in a publication cited in this article. We would like to thank the Integrated Threat and Risk Assessment Centre, and Sean Bois, Jessica Brandon, and Ethan Davidge for their assistance in the data collection. We also wish to express our appreciation to Carissa Augustyn, Renee Bencic, Martina Faitakis, Sacha Maimone, Adam Morrill, Alicia LaPierre, Lynden Perrault, and Farron Wielinga for their research assistance.
Supplemental Material
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
