Abstract
Different women experience hormonal contraceptives differently, reporting side effects on their sexuality that range from negative to positive. But research on such causal effects of hormonal contraceptives on psychological outcomes struggles both to identify average causal effects and capture the high heterogeneity in women’s treatment responses. In this study, we leveraged longitudinal data to improve our ability to separate the causal effects of hormonal contraceptives from other sources of association, including observed and unobserved confounding, reverse causality, and attrition. In this programmatic registered report (programmatic registered stage 1 protocol: https://osf.io/kj3h2; date of in-principle acceptance: 28/09/2023), we analyzed data from up to 5,041 women (23,130 observations), who participated in PAIRFAM, a German longitudinal panel dataset consisting of 14 waves, using Bayesian multilevel regressions. To deal with confounding and probe the robustness of findings, we implemented two analysis approaches: adjusted regression analysis and inverse probability of treatment weighting approach. We found evidence for positive average treatment effects of hormonal contraceptives on sexual frequency and sexual satisfaction, but no robust evidence for effects on desired sexual frequency. Furthermore, to move beyond average treatment effects, we analyzed heterogeneity in treatment responses. We found relatively high heterogeneity in individual treatment effects on sexual frequency and sexual satisfaction. Interindividual differences were not systematically related to individual treatment effects, and those treatment effects did not predict women’s decisions about which contraceptive method to use in the long run. Our results contribute to understanding the effects of hormonal contraceptives on sexuality in a naturalistic setting, where women adapt their choice of contraceptive method to their own experiences.
The impact of hormonal contraceptives on women’s sexuality has been discussed since their approval in 1960. Before their invention, only so-called barrier methods existed, which prevent fertilization by blocking the union of egg and sperm (e.g., condoms, diaphragms, cervical caps, and chemical spermicides). In contrast, hormonal contraceptives (including oral hormonal contraceptives, but also hormonal implants, hormonal shots, skin patches, and vaginal rings) include synthetic hormones (progestins and sometimes synthetic estrogens) that enter the bloodstream and, in most cases, prevent ovulation (Watkins, 2012).
By altering the endocrine system, hormonal contraceptives can have effects on other aspects of the female body and brain—including negative medical and psychological side effects. For instance, two randomized controlled trials reported small negative effects of oral hormonal contraceptives on sexual desire, arousal, and pleasure (Zethraeus et al., 2016) as well as sexual interest (Lundin et al., 2018). But a recent review by Both et al. (2019) found that only a minority of women reported changes in sexual functioning and concluded that the effects of hormonal contraceptives on sexual functioning—and sexual desire in particular—are understudied and therefore poorly understood.
Beyond its relevance for health and intimate relationships, human sexuality is also closely tied to personality psychology. Yet, historically, sexuality-related descriptors were deliberately excluded in the lexical approach that laid the foundation for the Big Five personality traits model. Although Allport and Odbert (1936) original compilation included such terms, subsequent reductions by Norman (1967) and Goldberg (1982) progressively removed them. Among the explicit exclusion criteria for adjectives was any link to sex and slanginess, as documented by Saucier and Goldberg (1996). The implications of these exclusions for understanding sexual dimensions of personality in humans are discussed in detail by Schmitt and Buss (2000). These practices likely contributed to human sexuality being treated as peripheral to personality research. More recent work suggests that this boundary is artificial, as interindividual differences in human sexuality show systematic associations with Big Five personality traits (e.g., Bourdage et al., 2007; Schmitt & Shackelford, 2008). Situating research on hormonal contraceptives and sexuality within this broader personality framework helps to highlight its relevance for understanding enduring individual differences.
Experiments are considered the gold standard to answer causal research questions, such as the effects of hormonal contraceptives on sexuality. However, experimental evidence can only partly tell us how these effects affect women’s everyday lives. As Graham (2019) points out, women’s experiences with hormonal contraceptives are highly heterogeneous—ranging from negative side effects to no effects to positive effects. These heterogeneous responses to hormonal contraceptive use might be due to varying sensitivity to hormones (Kiesner, 2017). Such differences in sensitivity are also supported by evidence that ovulatory cycle shifts with average increases in sexual desire and self-perceived attractiveness during the fertile phase vary between women (Arslan et al., 2021; Schleifenbaum et al., 2021). Hormonal contraceptives inhibit ovulation, and so hormonal contraceptive users no longer experience the same ovulatory cycle shifts. Heterogeneous effects of hormonal contraceptives might therefore be due to varying sensitivity to ovulatory cycle shifts before starting hormonal contraceptive use, with sensitive women showing stronger effects and insensitive women showing smaller effects on sexuality.
Such differences in the effects of hormonal contraceptives can be studied in an experimental context, as suggested by Hill and Mengelkoch (2022) who propose a precision medicine approach. They suggest researchers collect detailed information about contraceptive methods, duration of contraceptive use, mental health history, as well as sexual activity and relationship status as important potential moderators of the relationships between hormonal contraceptive use and psychological outcomes (see Box 3 and 4 in Hill & Mengelkoch, 2022).
Carefully isolated experimental settings are valuable to establish the effects of (individual) hormonal contraceptives on women’s sexuality. In contrast, in everyday life, women actively choose between different non-hormonal and hormonal contraceptive methods and often try multiple methods during their lifespan. As women try to find a balance between efficacy, ease-of-use, as well as desirable and undesirable side effects, the causal effects of synthetic hormones are interwoven with confounding, attrition effects, and reverse causality. This poses unique causal inference challenges, but also allows one to investigate additional research questions such as whether side effects determine which contraceptive women eventually choose. Furthermore, the different requirements of observational data collection (as opposed to randomized clinical trials) make it easier to include a broad range of variables such as personality, thus making it possible to more thoroughly investigate potential predictors of women’s heterogeneous responses to hormonal contraceptives.
The current study aims to close the gap between the available experimental and correlational evidence about the relationship between hormonal contraceptives and women’s sexuality. By analyzing the effects of starting and discontinuing hormonal contraceptives on sexuality in a longitudinal dataset with around 5,000 women, observed over up to 14 yearly waves 1 (years of data collection: 2008–2022), we aim to answer questions about potentially heterogeneous average treatment effects of hormonal contraceptives in real world settings while accounting for (un)observed confounders as well as attrition effects.
Empirical Evidence of Positive and Negative Effects of Hormonal Contraceptives on Sexuality
Hormonal contraceptives contain synthetic versions of progesterone (also called progestin) and sometimes estrogen, which inhibit the natural production of progesterone and estrogens as well as the natural production of pituitary hormones (luteinizing hormone and follicle-stimulating hormone). This reduction of natural hormonal fluctuation across the menstrual cycle prevents the maturation of the ovarian follicle and therefore hinders ovulation (Frye, 2006). In general, women who are using hormonal contraceptives have lower levels of estradiol, progesterone, follicle stimulating hormones, luteinizing hormones, and total and free testosterone as well as higher levels of sex-binding globulins (Gaspard et al., 1983; Zethraeus et al., 2017; Zimmerman et al., 2014). Their endogenous hormone levels remain constantly similar to those found in the early follicular phase of normally cycling women (Mishell et al., 1972).
This intervention into the endocrine system (Stomati et al., 1998) has been hypothesized to negatively affect women’s sexuality (Both et al., 2019). Some empirical evidence supports these hypotheses regarding sexual functioning (e.g., Læssøe et al., 2014) and libido (e.g., Lee et al., 2017; Lundin et al., 2018; Zethraeus et al., 2016), as well as sexual activity, arousal, pleasure, orgasm, and lubrication (Smith et al., 2014). While hypotheses and evidence for negative side effects of hormonal contraception exist, the use of hormonal contraception has also been hypothesized to positively affect women’s sexuality through several mechanisms, including, for example, overcoming the fear of unwanted pregnancy during sexual activity (Blumenstock & Barber, 2022) and the resolution of painful or troublesome gynecologic disorders (Both et al., 2019). Empirical evidence in support of positive effects on sexuality has been reported concerning sexual functioning (e.g., Oranratanaphan & Taneepanichskul, 2006), libido (McCoy & Matyas, 1996), and, most strongly, sexual frequency (e.g., Caruso et al., 2005; McCoy & Matyas, 1996). In addition, women using hormonal contraceptives reported higher sexual satisfaction (e.g., Caruso et al., 2005) and higher relationship satisfaction (e.g., Taggart et al., 2018).
Obstacles to Estimating Psychological Effects of Hormonal Contraceptives
Taken together, evidence concerning potential psychological effects of hormonal contraceptives remains inconclusive. While randomized-controlled trials provide somewhat consistent evidence of small negative average treatment effects on various aspects of women’s sexuality (e.g., Graham et al., 1995; Lundin et al., 2018; Sabatini & Cagiano, 2006; Zethraeus et al., 2016; but see Oranratanaphan and Taneepanichskul (2006) and Strufaldi et al. (2010) for evidence of positive causal effects of certain methods of hormonal contraception), evidence based on correlational data often shows no or even positive relationships between the use of hormonal contraceptives and sexuality (e.g., Caruso et al., 2005; McCoy & Matyas, 1996; but see Wallwiener et al. (2010, 2015) for evidence of a negative relationship). Some reviews about potential effects of hormonal contraceptives conclude that there are negative effects of hormonal contraceptives (Lee et al., 2017) or no effects of hormonal contraceptives (Pastor et al., 2013). However, most reviews conclude that the effects of hormonal contraceptives on sexuality have not been well studied and remain controversial (Both et al., 2019; Burrows et al., 2012; Davis & Castaño, 2004; Schaffir, 2006).
Several explanations for this mixed and inconclusive body of evidence are plausible: (1) Contraceptive method and dosage effects: differing psychological responses are due to differences between hormonal contraceptives (e.g., application methods or different dosages of synthetic progesterone and estrogen; for supporting empirical evidence see e.g., Boozalis et al., 2016; Læssøe et al., 2014; Sabatini & Cagiano, 2006; Strufaldi et al., 2010) (2) Treatment heterogeneity: differing psychological responses are due to interindividual differences between women (Graham, 2019) and studies systematically vary in sampling procedures (e.g., some only including women with a regular ovulatory cycle) (3) Treatment heterogeneity leading to selective attrition: women who experience negative effects of hormonal contraceptives discontinue them, leaving only women who experience no effects or positive effects in the group of hormonal contraceptive users in correlational studies (4) Confounders: pre-existing differences in women influence the decision what contraceptive method to use and affect psychological outcomes, leading to differences between the groups of hormonal contraceptive users and non-hormonal contraceptive users in correlational studies (5) Reverse causality: in some cross-sectional studies, relationships between psychological outcomes and hormonal contraceptive use might occur because the outcome influences the contraceptive choice (e.g., higher frequency of vaginal intercourse might lead to the decision to start using hormonal contraceptives).
Randomized controlled trials with a placebo control group are regarded as the superior approach for estimating the average treatment effect of hormonal contraceptives and their contraceptive efficacy. They can also expand the knowledge about (1) contraceptive method and dosage effects and (2) treatment heterogeneity. While the estimated effects will not be biased through (4) confounders and (5) reverse causality as their impacts are nullified by randomization, this also means that the design cannot inform us about the extent to which these two affect correlations between contraceptive usage and outcomes in everyday life. Furthermore, this design is not optimized to inform us about how (3) treatment heterogeneity might lead to selective attrition in everyday life. A related concern is sometimes termed healthy user bias: the women who volunteer for a randomized controlled trial will not include, for example, women who, based on previous experience, fear bouts of severe depression if they are assigned to hormonal contraception. By randomly assigning different forms of contraceptives to women, they remove the decision process to start or to discontinue using contraceptives that is inherent to real world settings. In addition, owing to their cost, randomized controlled trials usually have small sample sizes that preclude the rigorous investigation of subgroups, heterogeneity, and uncommon side effects. Finally, trials with a non-hormonal contraceptive control group are uncommon, in part because pharmaceutical trials tend to focus on comparing different formulations and in part because many non-hormonal methods are less efficacious, increasing the risk of unplanned pregnancies. For example, in the randomized trial with a non-hormonal contraceptive control group by Zethraeus et al. (2016, 2017) women were blinded and did not know whether they were using hormonal contraceptives. To avoid unwanted pregnancies, all women were instructed to use additional non-hormonal contraceptive methods during the study and received free condoms (Zethraeus et al., 2017). Therefore, any beneficial effects resulting from knowing that one is using a highly effective birth control method (Both et al., 2019) may be underestimated in such blinded randomized controlled trials.
Observational Cross-Sectional and Longitudinal Designs
In comparison to randomized-controlled trials, observational cross-sectional designs also capture any association induced by the decision process. Therefore, (3) selective attrition, (4) confounders, and (5) reverse causality will often bias the estimated effects. At the same time, they are usually based on larger sample sizes and include users of multiple contraceptive methods as well as those who use no contraceptive method at all. They operate like photographs of the real world. While they only show patterns at one specific time point, they still provide important pieces of the picture (such as the associations between demographic variables and contraceptive method) that could not be obtained based on randomized controlled trials alone. Going beyond mere associations, we can at least attempt to infer causal effects from cross-sectional data, if we are willing to transparently discuss and defend the necessary strong assumptions and statistical adjustments (e.g., Botzet et al., 2021).
One way to reduce the number of assumptions necessary for causal identification in observational data is examining change over time within individuals, because many of the potential confounding factors that vary between individuals are held constant by design. Longitudinal designs can rule out between-subject confounders by allowing the use of within-subject analyses (Rohrer & Murayama, 2023). Therefore, time-invariant confounders can be ruled out when estimating causal effects based on appropriately specified longitudinal designs.
Such panel studies operate like a series of photos: 2 We can track change, but still have to be cautious not to confuse cause and effect, since multiple events can occur in the interim—a longitudinal design alone is no guarantee of appropriate causal inference. Still, given transparent assumptions and adequate statistical control, we can at least attempt to infer causal effects. Specific statistical models are needed to remove confounders (Hamaker et al., 2015) and all modeling decisions ultimately reflect assumptions about the underlying causal network (Rohrer & Lucas, 2020).
Given the correct modeling decisions, time-invariant confounders are automatically controlled for in longitudinal designs. As they do not vary within a woman, they will not induce spurious correlations between her time-varying predictor and her time-varying outcome. Time-varying confounders on the other hand are not automatically controlled by longitudinal designs, but instead need to be accounted for (Rohrer & Murayama, 2023). A time-varying confounder might affect a woman’s choice of contraceptive method as well as the outcome of interest at a given time. For example, an ineffable or at least unmeasured shift from a casual to a steadier exclusive relationship may affect the decision to use hormonal contraceptives. In addition, this shift could cause more frequent sexual activity at a later time. In a longitudinal design that only measures hormonal contraceptive use and sexual activity but not this relationship shift, it will appear like there is a positive causal effect of hormonal contraceptives on sexual activity.
Some of these time-varying confounders might not have been observed in the available dataset or might even be completely unobservable—they thus cannot be accounted for in the statistical analysis. Such unobserved confounders bias the estimate no matter what analytic strategy is used, which we analyzed in our simulations reported in the supplement. However, additional sensitivity analyses can be conducted to estimate the influence unobserved confounders would need to have to fully account for the remaining observed relationship between treatment and outcome, thus providing at least the opportunity to make an educated guess about the internal validity of the results (for early work on sensitivity analysis for unobserved confounders see Rosenbaum & Rubin, 1983).
Longitudinal designs investigating potential medical effects of hormonal contraception are relatively common (e.g., Eng et al., 2008; Riggs et al., 2007; Wang et al., 2016), although all of these studies implement randomized treatment assignment rather than an observational approach. To our knowledge, only two studies investigated effects of hormonal contraception on sexuality with an observational longitudinal design. Blumenstock and Barber (2022) analyzed data from a weekly survey over 2.5 years from 893 women. They showed that women had a higher sexual frequency when they were using hormonal contraceptives. Frequency of sexual intercourse increased after starting using hormonal contraception, remained high for several months, and then slowly declined. Ott et al. (2008) showed in a 41-month long study with 328 participants that sexual interest based on daily diaries did not change when women started using oral contraceptives. But when women stopped using oral contraceptives, sexual interest decreased.
To summarize, causal inference from longitudinal data is only possible on the basis of assumptions. We strive to make our analysis goal (Lundberg et al., 2021) and the assumptions underlying our causal identification strategy as transparent as possible. In addition, we apply two different analytical approaches with different underlying assumptions.
Heterogeneity in Treatment Responses
While evidence for a negative average treatment effect on sexuality based on randomized controlled trials exist (Zethraeus et al., 2016), self-reports by women indicate that individual treatment effects on sexuality might vary widely (Malmborg et al., 2016). Heterogeneity in treatment responses might be caused by individual differences in responses to steroids (Kiesner, 2017). To our knowledge, treatment heterogeneity of hormonal contraceptives on sexuality has not been estimated quantitatively. Based on longitudinal data analyses, individual treatment effects on sexuality for each woman can be estimated and the distribution of individual treatment effects and their uncertainty can be visualized.
Estimating individual treatment effects allowed us to answer further questions about the underlying causal network connecting hormonal contraceptives and sexuality. Is there a large number of women who experience either positive or negative effects? Do women use their own experience with individual effects of hormonal contraceptives on sexuality to make a decision about their contraceptive method? For example, are women who experience adverse effects of hormonal contraceptives on sexuality more likely to stop using them during a specific time span? In addition, we want to answer the question whether interindividual differences like demography and personality predict individual treatment effects. Older women might be more likely to experience beneficial side effects of hormonal contraceptives on sexuality because they found the method that fits them best. In line with this reasoning, empirical findings suggest that higher age was associated with less negative side effects of hormonal contraceptive use on depression with particularly strong negative effects during adolescents (Skovlund et al., 2016). Nevertheless, these findings might be accounted for by other explanations, e.g., a possible decrease in sensitivity to steroid hormones with age or a specifically strong sensitivity to steroid hormones during puberty. Women with higher scores on openness might be more likely to experience beneficial side effects as well because they are more likely to try out different contraceptive methods until they find their perfect method. Other personality dimensions might be related to negative or positive individual treatment effects. For example, women with higher scores on neuroticism may experience more positive psychological effects as their heightened worries about unwanted pregnancies are reduced.
Focusing on individual treatment effects of hormonal contraceptives on sexuality allowed us to broaden our understanding about the individual nature of potential effects of hormonal contraceptives as well as confounding and attrition effects.
The Current Study
In the current study we aimed to answer the questions whether hormonal contraceptive use influences women’s sexuality (over and above attrition effects, accounting for observed and unobserved confounders) as well as whether and to which extent the effects of hormonal contraceptives on sexuality vary between users. Outcomes included desired sexual frequency in the last three months as a measure for libido, reported sexual frequency in the last three months, and sexual satisfaction. By using a longitudinal design, we can partly rule out alternative explanations such as reverse causality. Analyses were based on the German Family Panel (PAIRFAM), a panel dataset containing information about contraceptive use and women’s sexuality from more than 5,000 women over 14 waves, starting in 2008 (Brüderl et al., 2021; Huinink et al., 2011).
Conceptual Design and Underlying Assumptions
The conceptual design of the study, including all underlying assumptions, is outlined in Figure 1. These two graphs correspond to the two analytical approaches that were used to estimate the causal effect of hormonal contraceptives on the three outcomes. Conceptual design of the analyses approaches. Note. Panel A shows the conceptual design and assumptions underlying the adjusted regression model. Panel B shows the conceptual design and assumptions underlying the inverse probability of treatment weighted (IPTW) regression model
The graph in panel A shows the adjusted regression approach, which estimates the effect of contraceptive method on the outcome while controlling for the respective outcome in the previous wave, contraceptive method in the previous wave, and their interaction in the previous wave, as well as potential observed time-varying confounders (i.e., demography, relationship information). In addition, the potential influence of unobserved (and unobservable) confounders was estimated.
The graph in panel B corresponds to the conceptual design underlying the inverse probability of treatment weighting approach (IPTW; Thoemmes & Ong, 2016). For this approach, individuals were weighted by their probability to receive a specific treatment, in our case hormonal contraceptive use. This weight for each individual is modeled with effects of the outcome in the previous wave, contraceptive method in the previous wave, and their interaction in the previous wave, as well as potential observed time-varying confounders (i.e., demography, relationship information) on the treatment itself (i.e., hormonal contraceptive use). When estimating the effect of hormonal contraceptives on the respective outcome this weight was taken into account.
Why implement two approaches instead of only one line of analyses? According to Thoemmes and Ong (2016), the adjusted regression approach has several disadvantages: (1) regressions with different numbers of covariates can be estimated easily and therefore may introduce biases through cherry-picking (Rubin, 2001); (2) the adjusted regression approach relies on the untested key assumption that the relationships between the covariates and the outcome are modeled appropriately (more narrowly described as the linearity assumption, see Gutman & Rubin, 2017); (3) any comparisons between the treated and the untreated group might be due to extrapolation because there are no treated participants who are comparable to the untreated participants (King & Zeng, 2006).
While we agree that the IPTW approach outperforms adjusted regression analysis in estimating the causal effect of a treatment on an outcome in many possible scenarios (Fuentes et al., 2021), the first two disadvantages of adjusted regression mentioned above can also apply to the IPTW approach: (1) models estimating the individual weights are regression models that can be performed as easily with a different number of covariates and therefore potential bias through cherry-picking is not meaningfully precluded, and (2) the IPTW approach relies on the untested key assumption that the relationships between the covariates and the treatment are modeled appropriately (as opposed to the relationships between covariates and outcome, see assumptions of adjusted regression approach).
We addressed the first concern of both approaches (introduction of bias through cherry-picking) by carefully laying out the assumed underlying causal network and preregistering our models in form of a registered report before having access to the data. To address the second concern (nonlinearity between covariates and outcome or treatment, respectively), we decided to perform and compare both approaches to estimate the causal effect of hormonal contraceptives on the outcome robustly under different sets of assumptions. Nevertheless, both approaches still rely on the assumptions of (1) no unobserved confounders; (2) positivity (i.e., every individual having a probability of receiving the treatment that is larger than 0 and smaller than 1); and (3) a correct specification of the underlying models (Thoemmes & Ong, 2016). To estimate the dependency of our analyses on these three underlying assumptions, we tested the proposed models with different specifications based on simulated data with varying data generating mechanisms. The models, simulations, and results are described in more detail in the supplement. Given our interest in the immediate effects of hormonal contraceptive use (rather than the lagged effects after one year), and to avoid adding superfluous complexity, we decided against a popular alternative modeling approach (RI-CLPM, Hamaker et al., 2015) which simultaneously attempts to estimate causal effects pointing into the opposite direction.
In addition, we estimated the potential influence of unobserved confounders on the average treatment effect. We ran additional sensitivity analysis to estimate how sensitive the results are to hidden bias. Although a sensitivity analysis does not compensate for unobserved confounding, it quantifies how large the hidden bias would need to be to change the conclusions substantially (see methods section).
Estimands
Study Design
Note. This table is adapted based on the study design template provided by Peer Community In Registered Report here: https://rr.peercommunityin.org/help/guide_for_authors. PAIRFAM = German Family Panel (Brüderl et al., 2021; Huinink et al., 2011).
First of all, we were interested in overall descriptive patterns, including the percentage of hormonal contraceptive users across observations and common patterns in use and switches of hormonal contraceptives. Based on the full sample of all eligible women participating in PAIRFAM, we examined descriptives and general trends over the course of the study.
Second, we were interested in why women choose hormonal contraceptive methods. To get a better understanding of potential causes, we investigated whether time-varying covariates predicted the choice of contraceptive method. This was based on the IPTW model as this approach explicitly models how likely women are to use hormonal contraceptive methods. Our empirical estimands were quantified as percentage points based on marginal effects.
Third, we wanted to estimate the average treatment effect of hormonal contraceptive use on all three outcomes. Therefore, adjusted as well as IPTW regression models were performed to estimate the causal effect, taking into account observed confounders. In addition, the sensitivity of the models to unobserved confounders was estimated. Our empirical estimand was the unstandardized mean difference in the outcome between non-hormonal and hormonal contraceptive use. For the reported sexual frequency outcome, this difference can be seen as a very rough approximation of the percentage change in sexual frequency. 3
In addition, we were interested in treatment heterogeneity. Therefore, we investigated individual treatment effects on the outcome based on the adjusted regression models (see section Analysis for an explanation why we did not investigate treatment effect heterogeneity in the context of IPTW regression models). To help interpret this quantity, we visualized the distribution and uncertainty of individual estimates and reported for how many women we estimated negative and positive effects.
Furthermore, we wanted to explore the correlation between individual treatment effects and age as well as the correlations between individual treatment effects and Big Five personality traits. While these analyses were less focused on causal identification, they might still provide tentative evidence for substantively plausible causal hypotheses.
In addition, we wanted to investigate whether women’s individual treatment effects on sexuality informed their decision of which contraceptive method to use by investigating the correlation between estimated individual treatment effects and the number of years using hormonal contraceptives during the course of PAIRFAM. Ideally, we would have sufficient data to instead estimate individual treatment effects (e.g., using all but the last observation of each participant) to predict individual behavior (e.g., contraceptive method in the very last observation of each participant). However, in the context of the available data, this would result in very low statistical power, and we thus decided on a different approach which would only provide very rough evidence for potential assortment based on experiences with contraceptive methods. Such an assortment based on experiences would result in the type of selective attrition explained above and may provide a partial explanation for the mixed evidence concerning effects of hormonal contraceptives on sexuality.
Methods
Data
Analyses were based on data from a German panel study called PAIRFAM (Brüderl et al., 2021; Huinink et al., 2011). It contains information about contraceptive use and women’s sexuality from more than 5,000 women. The longitudinal design consists of annual waves with the first data collection in 2008 and the latest available data from 2022 (wave 14). The ethics committee of the Faculty of Management, Economics, and Social Sciences of the University of Cologne approved PAIRFAM. Huinink et al. (2011) provide a detailed description of the PAIRFAM dataset. In addition, the present manuscript contains supporting information including rmd files and html files for the blind code, the simulation code, planned analyses, data wrangling, and conducted analyses online at https://osf.io/u8ntf/.
The data on which our analyses are based were already available and can be used for scientific purposes; the Leibniz Institute for the Social Sciences (GESIS) grants access to the scientific community. Only one of the authors had previously accessed the PAIRFAM data; JMR was granted access to Release 7.0 (waves 1–7) in 2016 within the context of a different research project but never actually worked with the data beyond an initial screening of the included variables to determine suitability for her research question (birth order effects on personality). Thus, some of the data used to answer this research question had been previously downloaded by one of the authors, but we certify that we have not observed any part of the data relevant to the present research question (Level 3 bias control based on the categorization in Table 1 by Peer Community In Registered Report; https://rr.peercommunityin.org/help/guide_for_authors).
Exclusion Process and Participants
We excluded individuals who did not identify as female. Furthermore, once a woman crossed the age of 50 or reported to be (post-)menopausal, her data (including subsequent waves) were excluded, but previous waves of data collection remained in the analysis. In addition, we excluded all individual waves of data in which participants indicated being in a homosexual relationship or only reported homosexual relationships in the past, were pregnant, trying to become pregnant, gave birth to a child in the last year, were currently breastfeeding, or indicated using the morning-after-pill or an unknown contraceptive method. Besides these exclusion criteria explicitly mentioned in stage 1 of this registered report (see Table S1 in the supplement for details on all deviations from the in-principle accepted registered report stage 1), the implemented models for effects of hormonal contraception on sexuality only used information from one wave if the information from the previous wave was also available (because we used predictors from the previous waves in our models). In addition, we could only include information from participants if (1) hormonal contraception was available in the previous and the current wave (2) all additional predictors were available in the current wave and (3) one of the three outcomes was available in the previous and the current wave. To make these data exclusion steps following from modeling decisions more explicit, we now list them as additional exclusion criteria in stage 2 of this registered report.
In further separate robustness analyses, we additionally excluded waves in which participants indicated that they are sterilized, as well as all subsequent waves of those participants. We also excluded all waves in which participants indicated that their partner is sterilized and all waves in which women indicated using no contraceptive method, 4 an intrauterine device as a contraceptive method, or hormonal methods other than the oral contraceptive pill. In addition, we excluded all waves in which women indicated that they had never been sexually active. All exclusion criteria, reasons for exclusion, and excluded observations are summarized in Table S2 in the supplement.
In addition to these robustness analyses, which focused on excluding specific women or waves that might bias the estimates of the originally registered main analysis, we would have liked to conduct exploratory subanalyses based only on women who reported being in a homosexual relationship or who have reported only homosexual relationships in the past (otherwise using the same exclusion criteria as in the originally registered main analysis). While we hoped to gain some initial insight into the potential effects of hormonal contraceptives on sexuality of homosexual women, the sample size after applying our registered exclusion criteria was already too small to perform any meaningful analysis (n = 188, observations = 539). We had the information from the previous and the current wave that was necessary to perform the proposed models for only 62 exclusively homosexual women (213 observations). Of these, only three women reported a switch from non-hormonal to hormonal contraception and no women reported a switch from hormonal to non-hormonal contraception (compared to our registered threshold of 200 homosexual women reporting a switch between hormonal and non-hormonal contraception at least once). As the threshold of a sufficient number of exclusively homosexual women was not met, we did not perform the additional exploratory subanalyses based on this sample.
Variables
All variables, including the predictor variable, potential time-varying confounders, outcome variables, and variables used to investigate treatment heterogeneity are listed in Table S3 in the supplement. The original German item wording can be found here: https://www.pairfam.de/dokumentation/fragebogen/.
The predictor hormonal contraception was based on the items about the contraceptive method; participants were able to report multiple contraceptive methods. Hormonal contraception was coded as 0 if participants indicated that they use no contraceptive method at all. The variable hormonal contraception was also coded as 0 if participants indicated that they use no hormonal contraceptive method and at least one of the following methods: condom; intrauterine device; 5 diaphragm, foam, suppository, gel; natural birth control; female sterilization; male sterilization; or withdrawal method, coitus interruptus. In addition, the variable hormonal contraception was coded as 0 if participants were never sexually active in their life, as these participants were not asked about their contraceptive method. The variable hormonal contraception was coded as 1 if participants indicated that they use a birth control pill, mini pill, or other hormonal method (implant, patch, ring), even if they additionally use non-hormonal methods. Exclusion criteria for originally registered main as well as robustness analyses based on the contraceptive method are described above.
Simulation
In order to contrast our different analytical approaches, we compared the performance of our models (conceptually summarized in Figure 1) on data simulated under different data generating mechanisms. These simulations are discussed in more detail in the supplement and the implications are mentioned in the section Analysis. Based on these simulation results, to estimate the average treatment effects and treatment effect heterogeneity, we performed adjusted regression analysis without accounting for systematic missingness. In addition, we estimated the average treatment effects based on the IPTW approach accounting for systematic missingness.
Analysis
To answer the question whether hormonal contraceptive use influences women’s sexuality, and to separate these potential causal effects from confounders and attrition effects, we used two different analytical approaches, as outlined in Figure 1. This decision was based on simulations contrasting our different analytical approaches and comparing the performance of our models on data simulated under different data generating mechanisms (for more details on the simulation see the supplement). Based on these simulation results, to estimate the average treatment effects and treatment effect heterogeneity, we performed adjusted regression analysis without accounting for systematic missingness. In addition, we estimated the average treatment effects based on the IPTW approach with stabilized, truncated weights at 1% (Thoemmes & Ong, 2016) accounting for systematic missingness. All planned analyses can be found in form of an rmd file and an html file: https://osf.io/u8ntf/.
All Bayesian models included a random intercept and a random slope for hormonal contraceptive use nested within participants. In addition, each model included information from the previous wave about the outcome, hormonal contraception, and their interaction as predictors. 6 In order to be able to estimate the causal relationship between the hormonal contraception and the outcome, we controlled for individual mean levels of hormonal contraceptive use across observations (see Bafumi & Gelman, 2007; Hamaker & Muthén, 2020, for further information); this approach effectively controls for stable confounding influences that work between women (time-invariant confounders). For both models, potential time-varying confounders included linear effects for log transformed net income, educational attainment, and fertility plans; a thin-plate spline effect (Wood, 2003) for age; and a categorical effect for number of children (no children, one child, two children, three or more children).
Furthermore, relationship duration was included as a nested variable. This allowed us to model a linear association with relationship duration which is only informed by women who are in a relationship, while simultaneously including those who are not in the analysis. Technically, we achieved this by including a dummy coded variable for current relationship status (single vs. non-single) and its interaction with relationship duration as a predictor. No main effect of relationship duration was included in the model. Relationship duration for singles was set to −1; this value is arbitrary and does not affect the resulting estimates because when multiplied with the relationship status dummy, relationship duration for singles is dropped from the analysis. In addition, we included two dummy coded variables: one indicating whether a woman started a relationship between the previous wave and the current wave and one indicating whether a woman became single between the previous wave and the current wave.
In the IPTW approach the outcome in the first model was the contraceptive method. The first model results in an estimated weight which was then included in the second model. In the IPTW approach, the effects were additionally weighted for systematic missingness based on weights provided by PAIRFAM. 7 Separate analysis for desired sexual frequency, reported sexual frequency, and sexual satisfaction as outcomes were performed. All included variables are listed in Table S3 in the supplement.
To answer the question whether interindividual differences predicted individual treatment effects, we extracted individual treatment effect estimates from the adjusted regression analysis and subsequently correlated them with age and the Big Five personality traits at draw level. Finally, these correlations were averaged across draws (we call this approach “correlate, then average across draws”; Ly et al. (2017) for example use the term “plausible values”). 8
To answer the question whether women guide their contraceptive method choices by deciding against hormonal contraceptive methods after experiencing adverse effects, we again used individual treatment effect estimates from the adjusted regression analysis, this time correlating them with the proportion of years using hormonal contraceptives (observations in which hormonal contraceptives were used divided by total number of observations) at draw level. Finally, these correlations were averaged across draws (we call this approach “correlate, then average across draws”). 9 This analysis can potentially provide tentative evidence for assortment based on experiences with contraceptive methods.
Additionally, given the possibility of unobserved confounding, we ran sensitivity analysis to estimate how sensitive our results are to hidden bias. We calculated E-values for the effect of hormonal contraception on all outcomes (VanderWeele & Ding, 2017). As VanderWeele and Ding (2017) write “The E-value is defined as the minimum strength of association, on the risk ratio scale, that an unobserved confounder would need to have with both the treatment and the outcome to fully explain away a specific treatment-outcome association, conditional on the measured covariates.“ A large E-value implies that unobserved confounding would need to be relatively substantial to explain away an effect. Conversely, a small E-value implies that even just a little unobserved confounding would be able to explain away the estimated effect. E-values are one of the few approaches to unobserved confounding that can be applied to longitudinal designs (VanderWeele et al., 2020).
All planned analyses can be found in the form of an rmd file and an html file: https://osf.io/u8ntf/. In addition, all models are outlined in the supplement using a simplified readable notation.
Model Convergence
Using the default settings for brms, a lot of the models had divergent transitions (between 6 and 240 diverging transitions). Therefore, we increased the sampler’s target acceptance rate during Stan’s adaptation period in these models from brms’s default 0.80 to 0.99 (Stan Development Team, 2024). With this increased average proposal acceptance probability, none of the originally registered main models had divergent transitions, except for the adjusted regression analysis with sexual frequency as an outcome (4 divergent transitions), which we considered harmless. In all originally registered main models, the Rhats were at most 1.01, indicating that the 4 chains of the models converged. 10
Deviations From Stage 1 Concerning the Analysis Sample
In contrast to our registered analysis plan, we decided to report the results from robustness analysis 6 (additional exclusion of women who had never been sexually active) as our main analysis. All results based on the original sample for the originally registered main analysis are additionally presented in the supplement. We did so because we realized that including women who had never been sexually active likely induces bias. This particularly applies to the IPTW approach and less so to the adjusted regression analysis, with the overall consequence that including these women makes results more dependent on modeling choices (i.e., means of covariate adjustment) that should not make a substantive difference. Table S1 in the supplement provides all deviations from the in-principle accepted registered report stage 1 and the reasons for the deviations.
In the originally registered main analysis (and all other robustness analyses), women who had never been sexually active were classified as non-hormonal contraceptive users (even though they were not asked about their contraceptive method) and their sexual frequency was 0 (never). This coding decision appears to systematically bias results, because women who are not sexually active over a longer period of time are incorporated in the analyses as non-hormonal contraceptive users (because by study design, they are not asked questions about contraception usage) with the lowest score for sexual frequency (nobody else was assigned this score). The adjusted regression analysis seems to partly account for this bias by controlling for hormonal contraception in the previous wave, the linear effect of sexual frequency from the previous wave (which was 0 = never as well), and their interaction on the sexual frequency in the current wave. In contrast, in the IPTW approach the control for hormonal contraception in the previous wave, the linear effect of sexual frequency from the previous wave, and their interaction on hormonal contraceptive use in the current wave seems to be unable to account for this bias, presumably because the effect of sexual frequency from the previous wave was modeled as a linear effect. Thus, when including these women, estimates from the IPTW approach were about twice as large as estimated based on the adjusted regression analysis (see the section Robustness of Results for more information). When excluding these women, results are much less dependent on the choice of the data-analytic approach and thus more robust. The new exclusion implies that we should not generalize our conclusions to women who start being sexually active and/or start using hormonal contraception.
For women who are being sexually active for the first time, several factors may make their experience different from that of women who have been sexually active for a longer period of time. Initially, these women may be in the process of exploring and understanding their sexual preferences, desires, and boundaries, which may influence their sexual frequency and satisfaction. These factors may result in a unique pattern of sexual behavior that differs from those with more established sexual routines and experiences. Women who begin using hormonal contraceptives without being sexually active may experience different dynamics than those who are already sexually active. For these women, the decision to use hormonal contraception may be driven by reasons other than birth control, such as managing menstrual cycles, reducing menstrual pain, or addressing hormonal imbalances. Without the context of sexual activity, their experience with contraception may be primarily focused on managing these health concerns.
Results
Description of Samples
In PAIRFAM, 18,912 individuals provided a total of 104,661 observations (i.e., annual assessments of individuals). After applying our predefined exclusion criteria and excluding observations that could not be incorporated into the analyses because of modeling decisions, up to 23,130 observations from n = 5,041 women were eligible for our main analysis (registered as robustness analysis 6) estimating effects of hormonal contraception on sexuality. 11 Due to further missingness in single outcomes, the sample sizes for our main analysis vary between outcomes. In the main analysis estimating the effect of hormonal contraception on sexual frequency 4,403 women with 19,801 observations were included (sexual frequency was only measured in waves 2 to 14); in the main analysis estimating the effect of hormonal contraception on desired sexual frequency 3,057 women with 9,939 observations were included (desired sexual frequency was only measured in waves 7 to 14); in the main analysis estimating the effect of hormonal contraception on sexual satisfaction 4,983 women with 22,622 observations were included. The full exclusion process for the main analysis is outlined in Figure S3 in the supplement. The total eligible sample sizes for estimating effects of hormonal contraception on sexuality for all samples are displayed in Table S4 in the supplement.
Descriptives
Overall, after applying all exclusion criteria n = 5,041 women with 23,130 observations were eligible for our main analysis estimating effects of hormonal contraception on sexuality. 12 Each woman was observed between 1 and 13 times (the first wave was always excluded because no information from the previous wave was available); with an average of 4.59 observations (standard deviation of 3.46). 13 Table S5 in the supplement displays the number of waves (for all categorical variables) or means, standard deviations, and ranges (for all continuous variables) for all included women and all observations.
Table S6 in the supplement shows zero-order correlations of all variables included in the same analysis averaged within women across all available observations. Hormonal contraception measured in the current wave correlated positively with hormonal contraception measured in the previous wave (r = .86 [95% CI: .86, .87]). As is to be expected, both measures (hormonal contraception in the current wave, hormonal contraception in the previous wave) correlated positively with the average frequency of using hormonal contraception (r = .95 [.94, .95] and r = .95 [.94, .95], respectively).
Hormonal contraception in the current wave correlated negatively with income, years of education, relationship duration, completed fertility plans, and age (ordered from weakest to strongest negative correlation, rs ranging from −.11 [−.14, −.08] to −.50 [−.52, −.48]), as well as positively with the end of a relationship in the last year (r = .11 [.09, .14]) and the start of a relationship in the last year (r = .15 [.12, .17]).
Each sexuality variable measured in the current wave correlated positively with its value in the previous wave (rs ranging from .71 [.70, .72] to .82 [.81, .83]). When measured in the current wave, sexual frequency correlated positively with sexual satisfaction (r = .57 [.55, .59]), while desired sexual frequency correlated negatively with sexual frequency (r = −.35 [−.38, −.32]) and sexual satisfaction (r = −.34 [−.37, −.31]). Hormonal contraception in the current wave correlated positively with sexual frequency in the current wave (r = .22 [.19, .24]) and sexual satisfaction in the current wave (r = .16 [.13, .19]), but did not correlate with desired sexual frequency in the current wave (r = −.02 [−.05, .01]). For the remaining covariates measured in the current wave, being in a relationship showed the strongest and most consistent correlations with sexual frequency (r = .44 [.42, .47]), sexual satisfaction (r = .26 [.24, .29]), and desired sexual frequency (r = −.27 [−.30, −.24]).
Descriptive Patterns in Hormonal Contraceptive Use
Of the 23,130 observations of 5,041 women included in our main analysis, 12,321 (53%) were observations in which women reported using non-hormonal contraceptive methods and 10,809 (47%) were observations in which women reported using hormonal contraceptive methods. Over the whole available information (including previous waves), 1,537 women (30%) reported using only non-hormonal contraceptives, 1,335 women (26%) reported using only hormonal contraceptives, and 2,169 (43%) women reported using non-hormonal and hormonal contraceptives at least once. The 2,169 women who used both non-hormonal and hormonal contraceptives reported 2,716 switches (total number of switches across all 5,041 women: mean = 0.54, sd = 0.93, min = 0, max = 7) with 1,584 switches from hormonal to non-hormonal and 1,132 switches from non-hormonal to hormonal contraceptives.
Predictors of Hormonal Contraceptive Use
Figure S4 in the supplement displays the three separate models predicting hormonal contraceptive use in the current wave from, among other predictors, (a) sexual frequency in the previous wave and its interaction with hormonal contraceptive use in the previous wave; (b) desired sexual frequency in the previous wave and its interaction with hormonal contraceptive use in the previous wave; and (c) sexual satisfaction in the previous wave and its interaction with hormonal contraceptive use in the previous wave.
Stability of Contraceptive Method Use
In all three models the effect of the average frequency of using hormonal contraception was significant (model (a): unstandardized regression coefficient b = 6.25 [6.02, 6.49]; model (b): b = 7.18 [6.70, 7.68]; model (c): b = 6.02 [5.82, 6.24]). Using average marginal effects, this corresponds to an increase around 4% in the probability to use hormonal contraception, when women’s average frequency of using hormonal contraception increases by 10% (model (a): 4.16% [4.04%, 4.28%]; model (b): 4.30% [4.10%, 4.51%]; model (c): 4.11% [4.00%, 4.22%]). In addition, hormonal contraception in the previous wave was a significant predictor in all three models (model (a): b = 0.73 [0.48, 0.98]; model (b): b = 1.58 [0.89, 2.30]; model (c): b = 1.09 [0.84, 1.34]). This corresponds to an increase of around 15% in the probability to use hormonal contraception for women who used hormonal contraception in the previous wave compared to women who used non-hormonal contraception (model (a): 15.30% [13.80%, 16.80%]; model (b): 14.30% [12.20%, 16.50%]; model (c): 15.40% [14.00%, 16.80%]).
Sexuality
The use of hormonal contraception in the current wave was negatively predicted by sexual frequency in the previous wave (model (a): b = −0.13 [−0.19, −0.07]) and its interaction with hormonal contraceptive use in the previous wave (model (a): b = 0.27 [0.20, 0.34]). If a woman used non-hormonal contraceptives in the previous wave, for each increase of 1 on the scale for sexual frequency this corresponds to a decrease of −0.65% [−0.95%, −0.35%] in the probability to use hormonal contraception in the current wave. If a woman used hormonal contraceptives in the previous wave, for each increase of 1 on the scale for sexual frequency this corresponds to an increase of 1.17% [0.75%, 1.59%] in the probability to use hormonal contraception in the current wave. Similarly, the use of hormonal contraception in the current wave was negatively predicted by sexual satisfaction in the previous wave (model (c): b = −0.04 [−0.07, −0.02]) and its interaction with hormonal contraceptive use in the previous wave (model (c): b = 0.08 [0.04, 0.12]). If a woman used non-hormonal contraceptives in the previous wave, each increase of 1 on the scale for sexual satisfaction corresponds to a decrease of −0.25% [−0.40%, −0.09%] in the probability to use hormonal contraception in the current wave. If a woman used hormonal contraceptives in the previous wave, each increase of 1 on the scale for sexual satisfaction corresponds to an increase of 0.31% [0.10%, 0.51%] in the probability to use hormonal contraception in the current wave. In other words, with increased sexual frequency and increased sexual satisfaction, the model predicted that women were more likely to stick with the contraceptive method reported in the previous wave.
Neither the effect of desired sexual frequency in the previous wave (model (b): b = −0.03 [−0.18, 0.13]) nor its interaction with hormonal contraceptive use in the previous was a significant predictor of hormonal contraceptive use in the current wave (model (b): b = 0.03 [−0.17, 0.21]).
Relationship and Family Situation
The probability of using hormonal contraceptives was significantly predicted by relationship status in models (a) and (c) (model (a): b = 0.32 [0.07, 0.58]; model (c): b = 0.37 [0.15, 0.60]; but not in model (b): b = 0.01 [−0.29, 0.34]) corresponding to an increase around 6% in the probability to use hormonal contraception for women being in a relationship compared to women being single (model (a): 5.42% [1.19%, 9.44%]; model (c): 6.40% [2.62%, 10.30%]). The probability of using hormonal contraceptives was significantly predicted by a relationship start in the last year in all three models (model (a): b = 0.64 [0.41, 0.87]; model (b): b = 0.50 [0.16, 0.83]; model (c): b = 0.49 [0.28, 0.71]) corresponding to an increase around 3% to 4% in the probability to use hormonal contraception for women who started a relationship in the last year (model (a): 4.24% [2.69%, 5.78%]; model (b): 2.94% [0.94%, 4.90%]; model (c): 3.31% [1.92%, 4.83%]). Ending a relationship was a significant negative predictor of hormonal contraceptive use in all three models (model (a): b = −0.36 [−0.60, −0.12]; model (b): b = −0.60 [−0.97, −0.23]; model (c): b = −0.29 [−0.51, −0.06]) corresponding to a decrease around 3% in the probability to use hormonal contraception for women who ended a relationship in the last year (model (a): −2.43% [−4.10%, −0.79%]; model (b): −3.67% [−6.12%, −1.38%]; model (c): −2.00% [−3.54%, −0.42%]). In addition, in all three models, the probability of using hormonal contraceptives was significantly predicted by relationship duration for women in a relationship (model (a): b = −0.02 [−0.03, −0.01]; model (b): b = −0.02 [−0.03, −0.001]; model (c): b = −0.02 [−0.03, −0.01]) corresponding to a decrease around 0.12% in the probability to use hormonal contraception for each additional year women spent in a relationship (model (a): −0.13% [−0.20%, −0.06%]; model (b): −0.10% [−0.19%, −0.003%]; model (c): −0.13% [−0.19%, −0.07%]).
The positive effect of having one child compared to no children was significant in all three models (model (a): b = 0.23 [0.04, 0.42]; model (b): b = 0.30 [0.01, 0.60]; model (c): b = 0.21 [0.03, 0.39]). This corresponds to an increase around 2% in the probability to use hormonal contraception for women having one child compared to women who have no children (model (a): 1.46% [0.24%, 2.74%]; model (b): 1.79% [0.05%, 3.45%]; model (c): 1.38% [0.23%, 2.56%]). The positive effect of having two children compared to no children was significant in model (a) and model (c) (model (a): b = 0.25 [0.05, 0.46]; (c): b = 0.22 [0.03, 0.41]; but not in model (b): b = 0.28 [−0.02, 0.58]). This corresponds to an increase around 1.5% in the probability to use hormonal contraception for women having two children compared to women who have no children (model (a): 1.66% [0.36%, 2.94%]; model (c): 1.49% [0.24%, 2.72%]). The positive effect of completed fertility plans was significant in model (b) (model (b): b = 0.29 [0.02, 0.57]; but not in model (a): b = 0.12 [−0.07, 0.30] and model (c): b = 0.06 [−0.11, 0.22]). This corresponds to an increase around 2% in the probability to use hormonal contraception for women having completed their fertility plans compared to women who have not completed their fertility plans (model (b): 1.72% [0.14%, 3.30%]).
Income and Education
The negative effect of income was significant in model (a) (model (a): b = −0.02 [−0.04, −0.002]; but not in model (b): b = −0.01 [−0.04, 0.01] and model (c): b = −0.01 [−0.03, 0.01]). This corresponds to a decrease around 0.15% in the probability to use hormonal contraception, for each increase of 1 in the log transformed net income per month (model (a): −0.15% [−0.28%, −0.02%]). The negative effect of education was significant in model (c) (model (c): b = −0.03 [−0.05, −0.004]; but not in model (a): b = −0.02 [−0.04, 0.002] and model (b): b = −0.03 [−0.06, 0.003]). This corresponds to a decrease around 0.17% in the probability to use hormonal contraception, for each increase of one year of education (model (c): −0.17% [−0.31%, −0.03%]). All other predictors were non-significant across all three models.
Average Treatment Effects of Hormonal Contraceptive Use on Sexuality
Adjusted Regression Analysis
Figure 2 displays the three separate adjusted regression analyses estimating the effects of hormonal contraceptive use and additional predictors on sexual frequency, desired sexual frequency, and sexual satisfaction. Hormonal contraception in the current wave positively predicted sexual frequency and sexual satisfaction. For sexual frequency, this corresponds to an unstandardized mean difference between non-hormonal contraceptive use and hormonal contraceptive use of 0.30 [0.24, 0.35] on a 7-point scale (ranging from 1 = no sexual intercourse during the last three months to 7 = daily). This unstandardized mean difference can be seen as a very rough approximation of the percentage change in sexual frequency (as the response scale of this item is very roughly a log-transformed version of frequency). For sexual satisfaction, the unstandardized mean difference between non-hormonal contraceptive use and hormonal contraceptive use was 0.29 [0.18, 0.39] on a 11-point scale (ranging from 0 = very unsatisfied to 10 = very satisfied). Hormonal contraception in the current wave negatively predicted desired sexual frequency indicating a small unstandardized mean difference between non-hormonal contraceptive use and hormonal contraceptive use of −0.06 [−0.12, −0.02] on a 5-point scale. As sexual frequency differed between the groups and desired sexual frequency was measured relative to the actual sexual frequency (“If it was only up to you, would you like to have less or more sexual intercourse compared to the last three months?”) on a scale from 1 = a lot less; 2 = a little bit less; 3 = same amount; 4 = a little bit more; to 5 = a lot more, this result cannot, on its own, be interpreted as an overall higher sexual desire for women using non-hormonal contraceptives. The distribution of desired sexual frequency for women included in this model using non-hormonal and hormonal methods is displayed in Figure S11 in the supplement. Adjusted regression analysis estimating the effects of hormonal contraceptive use and other predictors on sexual frequency, desired sexual frequency, and sexual satisfaction. Note. HC = hormonal contraception. Figure S12 in the supplement shows the thin-plate spline effects of age on the sexuality outcomes
Inverse Probability of Treatment Weighting
We had registered (1) adjusted regression analyses that do not account for systematic missingness (already summarized in the previous section) and (2) inverse probability of treatment weighting models (IPTW) that do account for systematic missingness. The resulting effect estimates are contrasted in Figure 3. Estimated effects of hormonal contraceptive use on sexual frequency, desired sexual frequency, and sexual satisfaction for different modeling approaches
Compared to the estimate based on the adjusted regression analysis (b = 0.30 [0.24, 0.35]), the estimate in the IPTW approach for the effects of hormonal contraceptive use on sexual frequency (b = 0.42 [0.32, 0.52]) was slightly larger (difference in regression coefficients = 0.12 [0.003, 0.24]). The estimate in the IPTW approach for the effect of hormonal contraceptive use on desired sexual frequency was non-significant (b = −0.05 [−0.13, 0.04]), compared to the significant negative effect estimated based on the adjusted regression analysis (b = −0.06 [−0.12, −0.02]; however, the two estimates did not differ significantly from each other as the difference in regression coefficients was 0.02 [−0.08, 0.11]). The estimate in the IPTW approach for the effects of hormonal contraceptive use on sexual satisfaction (b = 0.40 [0.24, 0.56]) did not differ from the estimate based on the adjusted regression analysis (b = 0.29 [0.18, 0.39]; difference in regression coefficients = 0.11 [−0.08, 0.31]).
As we found systematic differences between these two approaches for sexual frequency as an outcome, the question arises whether these are due to the different model classes (regression analyses versus IPTW models) or due to the fact that only the IPTW models accounted for systematic missingness. We thus additionally implemented (1) adjusted regression analyses accounting for systematic missingness and (2) IPTW models without accounting for systematic missingness. As can be seen in Figure 3, the differences arise from the different models rather than from whether or not systematic missingness is accounted for.
Robustness of Results
Figure 4 contrasts the estimated effects of hormonal contraceptive use from our main analysis, the originally registered main analysis, and all five robustness analyses (which implement different exclusion criteria, see Table S4 in the supplement). All analyses supported a positive effect of hormonal contraceptive use on sexual frequency and sexual satisfaction. Estimated effects of hormonal contraceptive use based on all robustness analyses. Note. Additional exclusion criteria compared to the originally registered main analysis in the main analysis (registered as robustness analysis 6): women never sexually active (up to n = 5,041 and observations = 23,130); robustness analysis 1: women sterilized (up to n = 5,586 and observations = 25,201); robustness analysis 2: partner sterilized (up to n = 5,618 and observations = 25,159); robustness analysis 3: no contraceptive method (up to n = 5,114 and observations = 20,870); robustness analysis 4: intrauterine device as a contraceptive method (up to n = 5,358 and observations = 23,231); and robustness analysis 5: hormonal methods other than the oral contraceptive pill (up to n = 5,601 and observations = 25,023). For more details see section Exclusion Process and Participants
For sexual frequency as an outcome in the adjusted regression analysis, results based on the sample for the main analysis (b = 0.30 [0.24, 0.35]) indicated smaller positive effects than the results based on all other analysis samples. Similarly, the estimate for the effect of hormonal contraceptive use on sexual frequency in the IPTW approach based on the main analysis (b = 0.42 [0.32, 0.52]) was smaller than the results based on all other analysis samples. As explained above, our main analyses deviate from the originally registered main analyses in that all observations in which women indicated that they had never been sexually active were excluded. See Table S1 in the supplement for all deviations from the in-principle accepted registered report stage 1 and the reasons for the deviations. In the originally registered main analysis and in all remaining robustness analyses for these observations sexual frequency was coded as 0 (never). Considering the other two outcomes, desired sexual frequency and sexual satisfaction, the estimated effects of hormonal contraceptive use did not differ significantly between the various sets of exclusion criteria.
Sensitivity Analysis
To estimate how sensitive our results are to unobserved confounders, E-values (VanderWeele & Ding, 2017) for the effect of hormonal contraception on sexual frequency, desired sexual frequency, and sexual satisfaction were estimated based on the results of the adjusted regression analysis. The E-Value indicates how strong an association of the unmeasured confounders with both variables (hormonal contraception and the sexuality outcome) conditional on all observed covariates would need to be to explain away the effect of hormonal contraception on the sexuality outcome; weaker confounding would not suffice to explain away the effect (wording according to VanderWeele & Ding, 2017). The E-value is interpreted as a risk ratio for the associations between the unmeasured confounders and hormonal contraception as well as the sexuality outcome. The E-Values were 1.59 (outcome: sexual frequency), 1.22 (desired sexual frequency), and 1.57 (sexual satisfaction).
Heterogeneity in Treatment Effects of Hormonal Contraceptive Use on Sexuality
Heterogeneity in and Reliability of Treatment Effects of Hormonal Contraceptive Use on Sexuality
Note. ATE = average treatment effect (unstandardized regression coefficient b); ITE = individual treatment effect. All numbers are based on the adjusted regression analysis. Total sample sizes for the three analyses: sexual frequency: n = 4,403 women and 19,801 observations; desired sexual frequency: n = 3,057 women and 9,939 observations; and sexual satisfaction: n = 4,983 women and 22,622 observations.
Taking a closer look at the model-implied point estimates of the individual treatment effects, for all three outcomes, some women experienced negative effects and some women experienced positive effects of hormonal contraceptive use. However, the point estimates of individual treatment effects were estimated very imprecisely, so that only few of them were significantly different from zero when considered in isolation. Putting these pieces together, while we can say that heterogeneity was high, the number of observations per woman was not sufficient to estimate anyone’s treatment effect with certainty (see Table 2). Figure 5 displays selected individual treatment effects (as displaying all of them would not be legible). The figure includes all women who contributed at least 11 observations to the model (7 observations for desired sexual frequency) and reported using non-hormonal and hormonal contraception at least once. The large confidence intervals illustrate the uncertainty in the estimates of the individual treatment effects. To provide an additional quantification of this uncertainty, we computed the reliability of the estimated random slopes. To get an estimate of the unweighted reliability, we subtracted the fraction of the uncertainty of the random slope (reflected by the variance across MCMC draws) by the average between-person variance (reflected by the variance of the random slope) from 1 (see Arslan, 2024, for a detailed description of this procedure and the code that we adjusted for our analysis). The reliability of the random slopes for all outcomes was very low (≤.17, see last column in Table 2), indicating a high uncertainty in the estimates of the individual treatment effects. Distribution of selected individual treatment effects of hormonal contraception on a) sexual frequency (n = 137); b) desired sexual frequency (n = 181); and c) sexual satisfaction (n = 193). Note. The figure includes all women, who contributed at least 11 observations to the model (7 observations for desired sexual frequency) and reported using non-hormonal and hormonal contraception at least once. The green individual treatment effects do not differ significantly from zero, the orange individual treatment effects differ significantly from zero. The dotted purple line indicates the average treatment effect
Predictors of Individual Treatment Effects
Individual treatment effect estimates for sexual frequency and sexual satisfaction correlated 14 positively with each other (r = .36 [.33, .39], n = 4,346). Individual treatment effect estimates for desired sexual frequency correlated negatively with individual treatment effect estimates for sexual frequency (r = −.24 [−.27, −.20], n = 3,048) and sexual satisfaction (r = −.30 [−.35, −.25], n = 3,034).
“Correlate, Then Average Across Draws” Relationships Between Individual Treatment Effects and Age, Big Five Personality Traits, as Well as Years Spent on Hormonal Contraception
Note. HC = hormonal contraception; ITE = individual treatment effect. Square brackets indicate 95% credibility intervals. Bold numbers indicate that the 95% credible interval excluded zero.
Using this approach, we found two small correlations for which the 95% credible interval excluded zero. These were between the interindividual treatment effects on sexual satisfaction and (a) neuroticism (r = .07 [.04, .11], n = 4,570), meaning that women who scored higher on neuroticism experienced more positive effects of hormonal contraceptive use on sexual satisfaction and (b) conscientiousness (r = −.04 [−.07, −.01], n = 4,571), meaning that women who scored higher on conscientiousness experienced less positive effects of hormonal contraceptive use on sexual satisfaction. While some of them were significant, most of them were rather small in magnitude and no common pattern across sexuality outcomes emerged. Neither neuroticism nor conscientiousness showed significant correlations with the individual treatment effects for the other outcomes.
Contraceptive Decisions and Individual Treatment Effects
We additionally wanted to investigate whether women guide their contraceptive choices based on their experiences. In fact, our analysis of predictors of hormonal contraceptive use already provided evidence pointing into that direction: women with higher sexual frequency and higher sexual satisfaction were more likely to stick with the method they were previously using. However, our registered analysis went further and asked whether the individual treatment effects (i.e., not just whether women are more or less sexually satisfied, but whether hormonal contraceptives have a negative or positive effect on their sexual satisfaction) correlated with the amount of time that women used hormonal contraceptives. Again, we used the “correlate, then average across draws” approach. 15 That is, we correlated the individual treatment effects with the proportion of years of hormonal contraceptive use at draw level. Finally, these correlations were averaged across draws. None of the “correlate, then average across draws” relationships were significant (see the last column of Table 3).
Discussion
Based on a large longitudinal study, we aimed to answer the questions whether hormonal contraceptive use influences women’s sexuality as well as whether and to which extent the effects of hormonal contraceptives on sexuality vary between users. Women’s hormonal contraceptive use was highly stable over time in our study. Besides the stability, the current relationship situation, number of children and fertility plans, sexual frequency in the previous wave and sexual satisfaction in the previous wave, as well as potentially income and education predicted the use of hormonal contraception. Controlling for a number of potential confounding variables, we found strong and robust support for positive average treatment effects of hormonal contraceptive use on sexual frequency and sexual satisfaction across two different analytic approaches and across six robustness analyses. In contrast, evidence for a negative effect of hormonal contraceptive use on desired sexual frequency was not robust across analytic approaches. Beyond these average treatment effects, we found relatively high heterogeneity for individual treatment effects of hormonal contraception on sexual frequency and sexual satisfaction, but not on desired sexual frequency. However, the high uncertainty around individual treatment effects suggests that the number of observations per woman was too small to estimate individual treatment effects reliably. In addition, the one year interval between waves might have been too large to estimate individual treatment effects: women might have switched back and forth between methods in between measurement occasions. The estimated individual treatment effects were not consistently associated with other interindividual differences (age and Big Five personality traits) or contraceptive decisions across the longitudinal study.
Descriptive Patterns in Hormonal Contraceptive Use and Predictors of Hormonal Contraceptive Use
Our results suggest that across 5.6 years (mean number of current and previous waves), about one half of the women exclusively reported non-hormonal or hormonal methods, and the other half of the women reported a switch between non-hormonal and hormonal methods. About 60% of these switches were from hormonal to non-hormonal methods and 30% from non-hormonal to hormonal methods. Only a few women reported several switches. Overall, the percentage of women using hormonal methods declined across the course of PAIRFAM, from 2008 to 2021. These results are in line with a representative survey in Germany comparing contraceptive use between the years 2011 and 2018 showing that the use of oral contraceptives (i.e., the most popular form of hormonal contraception) declined by 6 percentage points while the use of condoms (i.e., the most popular form of non-hormonal contraception) increased by 9 percentage points (Bundeszentrale für gesundheitliche Aufklärung, 2018). The most commonly mentioned concerns about hormonal contraception in Western countries are effects on menstruation, physical side effects, mental health effects and effects on sexuality as well as concerns about future fertility or a general wish to use more natural methods (Le Guen et al., 2021). The discontinuation rate for oral contraceptives was around 28% in an analysis of 23 countries from 1990 to 2008 (Ali & Cleland, 2010). The trend away from hormonal contraception is not generalizable to all countries, though. According to the United Nations et al. (2022), oral contraceptives are the most common method in 36 countries (e.g., Algeria, Morocco), while condoms are the most common method in 27 countries (including Germany).
There was relatively strong stability in the broad choice of contraceptive method, as indicated by the large predictive contributions of average frequency of hormonal contraceptives and use of hormonal contraceptives in the previous wave. In addition, women were more likely to switch their contraceptive method if sexual frequency and sexual satisfaction were low.
Beyond these method- and sexuality-related predictors, the current relationship situation predicted the choice of contraceptive methods. Women in a relationship, especially women who started a new relationship in the last year, were more likely to use hormonal contraceptive methods. Beyond this initial increase in hormonal contraceptive use after the start of a relationship, relationship duration was a negative predictor of hormonal contraceptive use. In addition, ending a relationship negatively predicted the use of hormonal contraception. Besides the current relationship situation, women who had one child (and potentially two children, even though this result was not robust across all analyses) were more likely to use hormonal contraception compared to women who had no children. In line with these results, we found some (albeit non-robust) evidence that women who said that they had completed their fertility plans were more likely to use hormonal contraceptive methods.
We also found some non-robust evidence for negative effects of income and education on hormonal contraceptive use. These results potentially indicate that women with a higher income and more education were less likely to use hormonal contraceptives. As these effects are rather small and did not emerge across models, they need to be interpreted cautiously. It is quite possible that the effects of income and education on hormonal contraceptive use differ depending on the details of the health care system and how accessible it renders such contraceptives (e.g., is a prescription required? Does health insurance cover the cost of the necessary appointments and of the contraceptives?).
Overall, the choice of contraceptive method appears to be relatively stable and changes in contraceptive methods are mainly explained by low sexual frequency and sexual satisfaction as well as the current relationship and family situation.
Average Treatment Effects on Sexuality
We found strong support for positive average treatment effects of hormonal contraception on sexual frequency and sexual satisfaction across all analytical approaches and all robustness analyses. When women use hormonal contraceptives, they have a higher sexual frequency and a higher sexual satisfaction then when they use non-hormonal contraceptives, and these effects are robust to the inclusion of the potential confounding effects discussed in the previous section. These results are in line with results from an earlier project based on cross-sectional data that aimed to disentangle, to some extent, causal effects of hormonal contraceptives from selection effects by controlling for potential confounders and estimating the sensitivity to unobserved confounders (Botzet et al., 2021). Applying these methodologies to longitudinal data in the current project advances our understanding of the causal nature of the effect by addressing reverse causality and attrition besides confounding as potential alternative explanations. Based on the performed sensitivity analysis, unobserved confounding would need to be relatively strong to fully account for the estimated effects of hormonal contraceptive use on sexual frequency, desired sexual frequency, and sexual satisfaction.
Our results are in line with existing correlational evidence finding positive effects of hormonal contraception on sexuality concerning sexual functioning (e.g., Oranratanaphan & Taneepanichskul, 2006), libido (McCoy & Matyas, 1996), sexual satisfaction (e.g., Caruso et al., 2005), and, most strongly, sexual frequency (e.g., Caruso et al., 2005; McCoy & Matyas, 1996). Most importantly, our results are also in line with two other studies finding support for positive effects of hormonal contraceptive use on sexuality in longitudinal datasets (Blumenstock & Barber, 2022; Ott et al., 2008). Taking the available evidence together supports the hypothesis that the use of hormonal contraception positively affects women’s sexuality; potentially through several mechanisms, including, for example, overcoming the fear of unwanted pregnancy during sexual activity (Blumenstock & Barber, 2022) and the resolution of painful or troublesome gynecologic disorders (Both et al., 2019).
In contrast, our results are not consistent with existing experimental evidence that hormonal contraceptive use negatively affects sexual functioning (e.g., Læssøe et al., 2014) and libido (e.g., Lee et al., 2017; Lundin et al., 2018; Zethraeus et al., 2016), as well as sexual activity, arousal, pleasure, orgasm, and lubrication (Smith et al., 2014). Therefore, based on the correlational data, we found no support for the hypothesis that the intervention in the endocrine system through the use of hormonal contraceptives (Stomati et al., 1998) negatively affects women’s sexuality (Both et al., 2019).
The evidence concerning negative average treatment effects on desired sexual frequency (our available measure for libido) is neither constant across analytical approaches nor robustness analyses. It should therefore be interpreted very cautiously, especially because the sample size of available observations was considerably smaller (desired sexual frequency was only measured from wave 7 to wave 14). Even if we take the potential negative effect of hormonal contraceptive use found in the adjusted regression analysis at face value, its interpretation would be ambiguous because of the way desired sexual frequency was measured (“If it was only up to you, would you like to have less or more sexual intercourse compared to the last three months?”). Because the sexual desire item was phrased relative to sexual behavior, we have to interpret this result in light of the positive effect of HC on sexual frequency. So, potentially, desire is constant, but because frequency is lower among non-HC users, our desire outcome showed a negative effect. Overall, the evidence of an average treatment effect of hormonal contraceptive use on desired sexual frequency seems not very robust and very inconsistent across studies (e.g., Botzet et al., 2021; Lee et al., 2017; Lundin et al., 2018; McCoy & Matyas, 1996; Zethraeus et al., 2016). If an average treatment effect of hormonal contraceptive use on libido exists, it is probably relatively small and large sample sizes are needed to detect it.
We found evidence for strong positive effects of hormonal contraceptive use on sexuality when women are not randomly assigned to one contraceptive method but make free (and potentially to some degree informed) decisions about their contraceptive method.
Taking the evidence for positive effects of hormonal contraceptive use and a high heterogeneity together, this makes it very likely that women in our sample, who experience adverse effects of hormonal contraceptive use, are more likely to discontinue using hormonal contraceptives. Therefore, the estimated effect of hormonal contraceptive use on sexuality in the current study, does not recover the average treatment effect across all women, but is closer to the average treatment effect on the treated.
Differences in Results Based on Modelling Approach
To our surprise, for the outcome of sexual frequency, we found significant differences between the estimated average treatment effects depending on whether we used adjusted regression analysis or the IPTW approach. Estimated average treatment effects were slightly larger for the IPTW approach compared to the adjusted regression analysis (see Figure 3). As registered in stage 1 of this registered report, adjusted regression analysis was treated as the main analysis and the IPTW approach was treated as a robustness analysis for identifying the average treatment effect. Both analytic approaches supported positive average treatment effects of hormonal contraceptive use on sexual frequency, but the question that still remains is how strong this effect actually is.
As outlined in the introduction, both approaches try to estimate the causal effect of a treatment on an outcome and they come with different advantages and disadvantages. 16 Our adjusted regression analysis relies on the key assumption that the relationships between the covariates and the sexuality outcome are modeled appropriately (Gutman & Rubin, 2017), while the IPTW approach relies on the key assumption that the relationships between the covariates and hormonal contraceptive use are modeled appropriately. For both assumptions, there are reasons to doubt that these assumptions hold true. Considering the regression analyses, relationship duration could affect sexual frequency in a non-linear manner in the form of a sudden drop in sexual frequency after a certain relationship duration, biasing the results. Considering the IPTW approach, education could affect the choice of contraceptive methods in a non-linear manner in the form of a sudden increase in hormonal contraceptive use after a certain degree of educational attainment, biasing the results. We cannot discern which modeling approach is more biased here, and thus we cannot clearly favor one over the other. Maybe more importantly, both approaches still rely on the key assumption of no unobserved confounders, even if our sensitivity analyses suggest that such confounders would need to be fairly influential to completely explain away the estimated effects.
As we cannot decide which of the two approaches correctly retrieves the underlying causal effect of hormonal contraceptive use, we would like to use our results as an example to urge researchers to consider different analytical approaches when estimating causal effects and to lay out all of the tested and untested assumptions as well as the specification of the underlying model.
Individual Treatment Effects
Besides the average treatment effects, we were interested in the heterogeneity of effects when estimating individual treatment effects. For both sexual frequency and sexual satisfaction we found support for relatively strong heterogeneity in individual treatment effects, ranging from negative to positive effects in line with Graham (2019) summary of anecdotal evidence. This finding questions the idea that hormonal contraceptive use affects sexuality only through a uniform and universal biological pathway.
Where could such heterogeneity come from? From a biological perspective, our findings are consistent with the idea that women have varying sensitivity to hormones (Kiesner, 2017) resulting in heterogeneous responses to hormonal contraceptive use. The heterogeneous pattern could also be explained by non-universal psychological mechanisms. For example, there might be interindividual differences in the expectations and worries about the effectiveness of contraception and the likelihood of unplanned pregnancy (Both et al., 2019). This might result in differential effects of hormonal contraceptive use on sexuality.
At the same time, we found no common pattern across sexuality outcomes for relationships between individual treatment effects and potential predictors of individual treatment effects (including age and Big Five personality traits). Unfortunately, the high uncertainty around individual treatment effects as indicated by the low reliability makes it impossible to rule out any of the predictors. Future studies interested in the effects of hormonal contraceptive use on sexuality should further investigate the heterogeneity in individual treatment effects and explore their relationship with other markers of interindividual differences.
What could these markers of interindividual differences be? First, self-reported side effects (i.e., side effects mentioned by women) might be positively related to experienced side effects (i.e., individual treatment effects). This could provide a first test of whether interindividual differences in self-report questionnaires reflect individual treatment effects at all. Second, instead of a general measure of neuroticism, fear of unwanted pregnancy might be positively related to individual treatment effects, with women with a higher fear of unwanted pregnancy showing positive effects of hormonal contraceptives on sexual frequency and sexual satisfaction. Third, general attitudes toward the use of hormonal contraceptives might also be related to individual treatment effects, with women who are very skeptical about the use of hormonal contraceptives (but who still use them, e.g., for other health reasons) showing negative effects of hormonal contraceptives on sexual frequency and sexual satisfaction. Finally, we would hypothesize that hormone sensitivity (Kiesner, 2017) would be negatively related to individual treatment effects, with women with higher sensitivity experiencing more negative effects of hormonal contraceptives on sexual frequency and sexual satisfaction.
We also did not find significant correlations between the estimated individual treatment effects and the number of years of hormonal contraceptive use over the course of PAIRFAM, which we would interpret as preliminary evidence against potential assortment based on experiences with contraceptive methods. Again, the interpretability of this analysis is limited by the low reliability of the individual treatment effects. Furthermore, the analysis would have at best provided indirect evidence for assortment. Another analysis we conducted can be used to answer a slightly different but related question, namely whether generally low sexual frequency or low sexual satisfaction (which may or may not be attributable to hormonal contraception) predict switches in contraceptive methods. This analysis provides another type of indirect evidence for assortment: In the models predicting hormonal contraceptive use in the current wave, women were more likely to switch their contraceptive method if they experienced lower levels of sexual frequency and sexual satisfaction. However, another recent study using a similar analysis found no support for women’s experiences of lower levels of sexual frequency and sexual satisfaction predicting switches in contraceptive methods (Draxler et al., 2024). We believe that a better empirical test of the underlying idea would require more extensive longitudinal data. In such data, one could use all but the last observation per individual to estimate the individual treatment effect, and then test whether this effect predicts the contraceptive method reported in the last observation of the individual.
Contributions to Personality Psychology
Human sexuality has long been treated as peripheral in personality psychology, in part because sexuality-related descriptors were excluded from early lexical studies that laid the foundation for the Big Five personality traits (Saucier & Goldberg, 1996; Schmitt & Buss, 2000). Yet, individual differences in sexuality abound and accumulating evidence demonstrates systematic associations between human sexuality and personality traits (e.g., Bourdage et al., 2007; Schmitt & Shackelford, 2008), underscoring that this boundary is artificial. Against this backdrop, our study contributes not only to knowledge about the psychological effects of hormonal contraceptives, but also to understanding the interplay between human sexuality and personality.
In addition, our analyses illustrate how robust analytical strategies can be applied to large-scale longitudinal panel data to address complex causal questions in personality research—an issue with which the field has grappled with for some time (Atherton et al., 2021). A key element of this approach is the explicit definition of estimands, which ensures that theoretical research questions are directly connected to statistical analyses (Lundberg et al., 2021). By focusing on the average treatment effect, we were able to provide a clear causal contrast and evaluate its robustness across different modeling assumptions. Longitudinal data played a central role in strengthening causal inference, as repeated observations within the same women allowed us to control for time-invariant confounders and to examine changes in contraceptive use over time. This design enabled us to investigate switches between hormonal and non-hormonal contraceptive methods within individuals, thereby reducing biases that are inherent in purely cross-sectional analyses. Importantly, the comparison of analytical approaches also underscored how methodological choices can shape substantive conclusions: while both strategies pointed to positive effects of hormonal contraceptive use on sexual frequency, the estimated magnitude of the effect differed between methods, highlighting the value of triangulating evidence across multiple approaches.
Taken together, our work contributes to the broader debate on how to draw causal inferences from observational data in personality science. Rather than treating causality as a simple binary, it is useful to conceptualize causal inference as a continuum of plausibility (Janson, 2012) and as inherently probabilistic rather than deterministic (Parascandola, 2011). Within this perspective, our study illustrates how carefully chosen estimands, longitudinal designs, and complementary analytical strategies can move personality research beyond mere associations and toward more credible causal claims. In doing so, we provide both a framework for future studies in this area and a guide for applying these methods to similar research questions in personality psychology.
Limitations
Our current study was a registered report applying two different analytical approaches and six robustness analyses based on a large panel dataset combining all the relevant information about contraceptive use, covariates, and sexuality. By examining change over time within individuals, we reduced the number of assumptions necessary for causal identification and held many of the potential confounding factors that vary between individuals constant. In addition, the assumed underlying causal network was transparently outlined before performing analyses and the assumptions were questioned in the discussion. Therefore, our study aimed to provide an estimation of the effect of hormonal contraceptive use on sexuality in a naturalistic setting where women decide which contraceptive method to use.
Nevertheless, our study comes with a few limitations. First, the two modeling approaches we used to estimate the average treatment effect differed substantially in their estimations of the average treatment effect questioning the robustness of our results and limiting their generalizability. In future research focusing on identifying causal effects based on correlational data, combining different analytical approaches could be beneficial to determine a range of possible effects (and might additionally inform us about systematic differences between these analytical approaches). Second, our outcome variables were only measured with one item each and often deviated from items in existing literature (especially in the case of desired sexual frequency as a measure of libido). Future research could try to implement more nuanced measures (e.g., solitary and dyadic libido; masturbation frequency) or measure sexuality in a more extensive way (e.g., sexual functioning, or using a diary method). And third, the time interval between observations (one year) was potentially too long to capture all of the switches in contraceptive methods that happened between waves. As we had no indicator of the time between switch and reporting of outcomes, we are unable to conclude whether the observed effects appear only shortly after changing contraceptive methods or after some time has passed. Diary studies (e.g., Blumenstock & Barber, 2022; Ott et al., 2008) investigating very short time frames found support for positive effects of hormonal contraceptive use on sexuality as well; more research focusing on an intermediate time frame (e.g., one month, one menstrual cycle) would be helpful to understand the effects of hormonal contraceptive use on sexuality further.
Conclusion
The results of our current study support the conclusion that, on average, hormonal contraceptive use has beneficial effects on sexual frequency and sexual satisfaction. In addition, it provides evidence that the heterogeneity in women’s sexual responses to hormonal contraceptive use is high. Because women differ so much, there is promise for a tailored approach to contraception. This would mean that reliable estimates of individual treatment effects help women make informed decisions about which contraceptives have the desired safety and side effect profile for them. To obtain such reliable estimates, we need more granular data. Our study was based on a panel dataset and can be likened to a series of photographs—much can be missed in the moments we did not capture. Next, we should aim for movies, daily data on contraceptive use and sexuality over long time spans.
Supplemental Material
Supplemental Material - Positive Treatment Effects and High Heterogeneity of Hormonal Contraceptive Use on Women’s Sexuality
Supplemental Material for Positive Treatment Effects and High Heterogeneity of Hormonal Contraceptive Use on Women’s Sexuality by Laura J. Botzet, Julia M. Rohrer, Lars Penke, Ruben C. Arslan in Personality Science.
Footnotes
Author Note
This is the final version of the manuscript submitted to Personality Science. All previous versions can be found on the Open Science Framework:
.
This manuscript is a stage 2 article based on a programmatic registered report stage 1 (Botzet et al., 2023). The registry for the programmatic registered report stage 1 including in-principal acceptance can be found here: https://osf.io/kj3h2. Based on the programmatic registered report stage 1 two separate stage 2 articles were planned. The current article focuses on sexuality (including desired sexual frequency, reported sexual frequency, and sexual satisfaction as outcomes) and the other stage 2 article focuses on well-being (including depressiveness, life satisfaction, and self-esteem as outcomes). All other parts of the stage 1 registered report apply for both stage 2 articles. Based on the preregistration deviation table template by Willroth and Atherton (2024),
in the supplement provides all details about the deviations from the in-principle accepted registered report stage 1, the reasons for the deviations, and the potential impact on the readers’ interpretation of our study.
Acknowledgements
We would like to thank Tita Gonzalez Avilés, who executed our blind code on the PAIRFAM dataset before we received in-principal acceptance of our programmatic registered report stage 1. Gonzalez Avilés is an independent researcher and is not otherwise involved in this manuscript. We would also like to thank Zoe Rau for diligently reviewing the supportive HTML documents.
Author Contributions
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data Accessibility Statement
The data on which our analyses are based, a German panel study called PAIRFAM (Brüderl et al., 2021; Huinink et al., 2011), were already available and can be used for scientific purposes. The Leibniz Institute for the Social Sciences (GESIS) grants access to the scientific community.
Supplemental Material
Supplemental material for this article is available online. Depending on the article type, these materials may include a Transparency Checklist, a Transparent Peer Review File, and optional materials provided by the authors. In addition, all previous versions of the manuscript and the supplement as well as rmd files and html files for the blind code, the simulation code, planned analyses, data wrangling, and conducted analyses are available online at
.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
