Abstract
Although widely cited as strong evidence that sexual selection has shaped human facial-attractiveness judgments, findings suggesting that women’s preferences for masculine characteristics in men’s faces are related to women’s hormonal status are equivocal and controversial. Consequently, we conducted the largest-ever longitudinal study of the hormonal correlates of women’s preferences for facial masculinity (
Keywords
Exaggerated sex-typical (i.e., masculine) characteristics in men have been proposed as indicators of a strong immune system that would be inherited by offspring but also linked to reduced willingness to invest time and other resources in personal relationships (Gangestad, Simpson, Cousins, Garver-Apgar, & Christensen, 2004; Gildersleeve, Haselton, & Fales, 2014; Little, Jones, & DeBruine, 2011; Penton-Voak et al., 1999; Perrett et al., 1998). Given this proposed trade-off between the benefits and costs of choosing a masculine mate, researchers have hypothesized that women could maximize the benefits of their mate choices by mating with masculine men when fertile, while forming long-term relationships with relatively feminine men (Gangestad et al., 2004; Gildersleeve et al., 2014; Little et al., 2011; Penton-Voak et al., 1999).
Consistent with this hypothesis, some studies have reported that women show stronger preferences for masculine characteristics in men’s faces when in hormonal states associated with high fertility (e.g., during the ovulatory phase of the menstrual cycle or when not using hormonal contraceptives; Ditzen, Palm-Fischbacher, Gossweiler, Stucky, & Ehlert, 2017; Johnston, Hagel, Franklin, Fink, & Grammer, 2001; Little, Burriss, Petrie, Jones, & Roberts, 2013; Little & Jones, 2012; Little, Jones, Penton-Voak, Burt, & Perrett, 2002; Penton-Voak et al., 1999; Penton-Voak & Perrett, 2000; Roney & Simmons, 2008; Roney, Simmons, & Gray, 2011; Vaughn, Bradley, Byrd-Craven, & Kennison, 2010; Welling et al., 2007). These effects are widely cited as evidence that sexual selection has shaped women’s judgments of men’s facial attractiveness (Fink & Penton-Voak, 2002; Gangestad & Simpson, 2000; Grammer, Fink, Møller, & Thornhill, 2003; Thornhill & Gangestad, 1999).
The claim that women’s preferences for facial masculinity are related to their hormonal status has been influential. However, it is also highly controversial (see Gildersleeve et al., 2014, and Wood, Kressel, Joshi, & Louie, 2014, for meta-analyses drawing opposite conclusions about the robustness of hypothesized links between women’s masculinity preferences and hormonal status). In particular, recent work has highlighted four potentially serious methodological problems with research on the hormonal correlates of masculinity preferences.
First, sample sizes are usually small, meaning that studies are very underpowered (Gangestad et al., 2016). For example, the mean sample size of within-subjects studies reporting significant effects of hormonal status on facial-masculinity preferences is 40 women (Mdn = 34). Consequently, results from previous studies are difficult to interpret (Blake, Dixson, O’Dean, & Denson, 2016; Gangestad et al., 2016).
Second, hormonal status is typically assessed using self-reported menstrual cycle data (e.g., number of days since onset of last menses or number of days until expected date of next menses; Harris, 2013; Johnston et al., 2001; Little & Jones, 2012; Muñoz-Reyes et al., 2014; Penton-Voak & Perrett, 2000; Penton-Voak et al., 1999; Scott et al., 2014; Zietsch, Lee, Sherlock, & Jern, 2015). This method is imprecise and prone to bias (Blake et al., 2016; Gangestad et al., 2016; Harris, 2013).
Third, many studies use between-subjects designs. Use of between-subjects designs in this research is potentially problematic because, even with large samples, the substantial genetic contribution to individual differences in facial-masculinity preferences (Zietsch et al., 2015) could obscure subtle effects of hormonal status. Thus, although several recent studies testing for possible effects of hormonal status on facial-masculinity preferences have reported null results (Harris, 2013; Marcinkowska et al., 2016; Muñoz-Reyes et al., 2014; Scott et al., 2014; Zietsch et al., 2015), it is noteworthy that these studies all used between-subjects designs.
Fourth, studies using within-subjects designs typically test women on only two occasions (Johnston et al., 2001; Little et al., 2013; Little & Jones, 2012; Penton-Voak et al., 1999; Roney et al., 2011). This limited approach may not adequately capture complex changes in hormonal status (e.g., Roney & Simmons, 2013).
The current study directly addressed all of these potentially serious methodological problems by recruiting 584 heterosexual women for a longitudinal (i.e., within-subjects) study in which both women’s hormonal status and preferences for masculinity in men’s faces were repeatedly assessed (519 women completed at least 5 test sessions, 176 women completed at least 10 test sessions). Changes in women’s hormonal status were assessed by measuring steroid hormones from saliva samples and also by tracking within-subjects changes in hormonal contraceptive use.
Method
Participants
A total of 598 heterosexual White women who reported that they either were not using any form of hormonal contraceptive (i.e., had natural menstrual cycles) or were using the combined oral contraceptive pill were recruited for the study. Data from 14 of these women were excluded from the data set because they reported hormonal contraceptive use inconsistently within a single block of test sessions. Thus, the final data set was 584 women (age:
Number of Women Who Completed Five, Four, Three, Two, or One Weekly Test Sessions in Block 1, Block 2, and Block 3
Overall, 45 women reported changing their hormonal contraceptive status between blocks during the study: 15 women reported changing from using the combined oral contraceptive pill to not using the combined oral contraceptive pill, and 30 women reported changing from not using the combined oral contraceptive pill to using the combined oral contraceptive pill.
Stimuli
The methods we used to manufacture stimuli to test women’s preferences for facial masculinity have been used in many previous studies (e.g., Harris, 2013; Johnston et al., 2001; Little & Jones, 2012; Marcinkowska et al., 2016; Muñoz-Reyes et al., 2014; Penton-Voak et al., 1999; Penton-Voak & Perrett, 2000; Scott et al., 2014; Welling et al., 2007; Zietsch et al., 2015). Responses to stimuli manufactured using these methods predict women’s actual partner choices (DeBruine et al., 2006). They have also been shown to be very similar to responses to stimuli manufactured using other methods for manipulating sexually dimorphic characteristics in face images (DeBruine et al., 2006). The stimuli from this study can be found in the Supplemental Material available online.
First, we manufactured a female prototype (i.e., average) face by using specialist software (Tiddeman, Burt, & Perrett, 2001) to average the shape, color, and texture information from images of 50 young White women’s faces. A male prototype face was also manufactured in this way by averaging the shape, color, and texture information from images of 50 young White men’s faces.
Next, we randomly selected 10 images from the set of 50 individual male faces. We then created a feminized and a masculinized version of each of these 10 male images by adding or subtracting 50% of the linear (i.e., vector) differences in 2-D shape between symmetrized versions of the female and male prototypes to (or from) each individual image. This process created 10 pairs of face images in total, with each pair consisting of a feminized and a masculinized version of one of the individual face images. Examples of these stimuli are shown in Figure 1. Note that our feminized and masculinized versions of faces differed in sexually dimorphic shape characteristics only (i.e., were matched in other regards, such as identity, color, and texture; Tiddeman et al., 2001).

Examples of masculinized (a) and feminized (b) versions of men’s faces used to assess facial-masculinity preferences in our study.
Procedure
In each test session, women reported their current romantic partnership status (partnered or unpartnered), reported their hormonal contraceptive use status (using the combined oral contraceptive pill, not using any form of hormonal contraceptive), reported whether they were currently taking a scheduled break from the pill (and, if so, how many days into this scheduled break they were), provided a saliva sample, and completed two face-preference tests (one assessing men’s attractiveness for a short-term relationship, the other assessing men’s attractiveness for a long-term relationship). Attractiveness of men for short-term relationships and long-term relationships were measured separately because hormonal status has previously been shown to influence women’s masculinity preferences when assessing men’s attractiveness for short-term, but not long-term, relationships (Little & Jones, 2012; Penton-Voak et al., 1999).
In the two face-preference tests, women were shown the 10 pairs of male faces, each pair consisting of a masculinized and feminized version of a given individual. Women were instructed to select the more attractive face in each pair and to indicate the strength of that preference by choosing from the options “slightly more attractive,” “somewhat more attractive,” “more attractive,” and “much more attractive.” This procedure has been used to assess masculinity preferences in previous studies (e.g., Zietsch et al., 2015).
In the short-term-attractiveness test, women were given the following information:
You are looking for the type of person who would be attractive in a short-term relationship. This implies that the relationship may not last a long time. Examples of this type of relationship would include a single date accepted on the spur of the moment, an affair within a long-term relationship, and possibility of a one-night stand.
In the long-term-attractiveness test, women were given the following information:
You are looking for the type of person who would be attractive in a long-term relationship. Examples of this type of relationship would include someone you may want to move in with, someone you may consider leaving a current partner to be with, and someone you may, at some point, wish to marry (or enter into a relationship on similar grounds as marriage).
Trial order within each test was fully randomized, and the order in which the two face-preference tests were completed in each test session was also fully randomized. Definitions of short-term and long-term relationships were taken from previous studies (Little & Jones, 2012; Penton-Voak et al., 2003).
Responses on the face-preference test were coded using scales from 0.5 to 3.5 for masculinized faces and −0.5 to −3.5 for feminized faces. Masculinized faces were rated as “slightly more attractive” (0.5), “somewhat more attractive” (1.5), “more attractive” (2.5), or “much more attractive” (3.5) than the feminized face; feminized faces were rated as “slightly more attractive” (−0.5), “somewhat more attractive” (−1.5), “more attractive” (−2.5), or “much more attractive” (–3.5) than the masculinized face. On both scales, higher scores indicate stronger masculinity preferences, and the scales are centered on chance (i.e., 0).
Each woman’s average masculinity-preference score was calculated separately for the short-term and long-term judgments for each test session. Higher scores indicate stronger masculinity preferences.
In each face-preference test, the 10 trials assessing preferences for sexually dimorphic shape characteristics were interspersed among 30 filler trials assessing preferences for other facial traits.
Saliva samples
Participants provided a saliva sample via passive drool (Papacosta & Nassis, 2011) in each test session. Participants were instructed to avoid consuming alcohol and coffee in the 12 hr prior to participation and avoid eating, smoking, drinking, chewing gum, or brushing their teeth in the 60 min prior to participation. Each woman’s test sessions took place at approximately the same time of day to minimize effects of diurnal changes in hormone levels (Bao et al., 2003; Veldhuis et al., 1988).
Saliva samples were frozen immediately and stored at −32° C until being shipped on dry ice to the Salimetrics Lab (Suffolk, United Kingdom) for analysis, where they were assayed using the Salivary 17β-Estradiol Enzyme Immunoassay Kit 1-3702 (
Hormone levels more than 3 standard deviations from the sample mean for that hormone or where Salimetrics indicated levels were outside their sensitivity range were excluded from the data set (~1% of hormone measures were excluded for these reasons). The descriptive statistics given above do not include these excluded values. Values for each hormone were centered on their subject-specific means to isolate effects of within-subjects changes in hormones. They were then scaled so the majority of the distribution for each hormone varied from −.5 to .5 to facilitate calculations in the linear mixed models. Since hormone levels were centered on their subject-specific means, women with only one value for a hormone could not be included in analyses considering hormone levels.
Analyses
Linear mixed models were used to test for possible effects of hormonal status on women’s facial-masculinity preferences. Analyses were conducted using R version 3.3.2 (R Core Team, 2016), with lme4 version 1.1-13 (Bates, Maechler, Bolker, & Walker, 2014) and lmerTest version 2.0-33 (Kuznetsova, Brockhoff, & Christensen, 2013). The dependent variable was masculinity-preference score, which was centered on chance. The relationship context for which women had judged men’s attractiveness was effect-coded (short-term = +0.5 and long-term = −0.5) and included as an independent variable in all analyses. Random slopes were specified maximally following the recommendations of Barr, Levy, Scheepers, and Tily (2013) and Barr (2013). Full model specifications and full results for each analysis are given in the Supplemental Material.
Results
General preferences and relationship-context effect
Significant intercepts in all analyses indicated that women generally preferred masculinized to feminized versions of men’s faces. Masculinity preferences were also significantly stronger in the short-term than long-term relationship context in all analyses. The one exception was in the analyses described under Hypothesis 4. There, the relationship-context effect was not significant, probably because these analyses were less powerful than our other analyses. Full results for these effects are given in the Supplemental Material.
Hypothesis 1: do facial-masculinity preferences track changes in measured steroid hormone levels in women not using hormonal contraceptives?
The fertile phase of the menstrual cycle is characterized by the combination of high estradiol and low progesterone (Gangestad & Haselton, 2015; Puts et al., 2013). Additionally, some previous studies have suggested that changes in women’s masculinity preferences are positively correlated with changes in estradiol (Roney & Simmons, 2008; Roney et al., 2011) and negatively correlated with changes in progesterone (Jones et al., 2005; Puts, 2006). We therefore used linear mixed models to test for possible effects of estradiol, progesterone, and their interaction on women’s facial-masculinity preferences. Masculinity-preference scores could range from −3.5 to 3.5 (0 indicated no preference; higher scores indicated stronger masculinity preferences). This analysis included all women who were not using any form of hormonal contraceptive when tested (
We conducted additional analyses to test for previously reported effects of testosterone (Welling et al., 2007) and cortisol (Ditzen et al., 2017) on masculinity preferences, and for hypothesized effects of estradiol-to-progesterone ratio on mating-related behavior (Eisenbruch, Simmons, & Roney, 2015). These analyses also showed no evidence that women’s preferences for masculine men were related to their hormone levels (see the Supplemental Material).
At the suggestion of a reviewer, we also tested for an interaction between the effects of testosterone and cortisol (see the Supplemental Material). The rationale for testing this interaction was that some research suggests that behavioral effects of testosterone are more pronounced when cortisol is low (the dual-hormone hypothesis; see Mehta & Prasad, 2015). Although there was a significant interaction between testosterone and cortisol, β = 0.51,
A reviewer also asked that we repeat each of the analyses described above, controlling for effects of test session order on masculinity preferences. Doing so did not alter the patterns of results (i.e., no nonsignificant effects became significant and no significant effects became nonsignificant). These analyses are reported in the Supplemental Material.
Hypothesis 2: do women not using hormonal contraceptives show stronger facial-masculinity preferences than women using the combined oral contraceptive pill?
Studies reporting that women not using hormonal contraceptives show stronger facial-masculinity preferences than do women using hormonal contraceptives have been interpreted as converging evidence that women’s hormonal status influences their facial-masculinity preferences (Little et al., 2013). To investigate this issue in our data set, we first used linear mixed models to compare the facial-masculinity preferences of women using the combined oral contraceptive pill (
Stronger masculinity preferences in women using hormonal contraceptives have been reported in one other study (Cobey, Little, & Roberts, 2015). We suggest that these between-groups differences reflect effects of lifestyle or personality factors that are correlated with contraceptive use rather than hormonal effects.
Hypothesis 3: do facial-masculinity preferences of women using the combined oral contraceptive pill change when they are taking inactive pills?
In women using the combined oral contraceptive pill, fertility-linked hormone levels are affected when women are not taking active pills (i.e., the scheduled “hormone-free interval” or “break”) during their monthly cycle of oral contraceptive use (van Heusden & Fauser, 2002). If women’s masculinity preferences are influenced by their hormonal status, one would then expect women’s facial-masculinity preferences to change during this scheduled break. To investigate this possibility, we used linear mixed models to compare the facial-masculinity preferences of women (
Possible moderating role of partnership status
Some previous research has suggested that the magnitude of hormone-linked changes in women’s masculinity preferences is moderated by their partnership status (i.e., whether or not they had a romantic partner; Penton-Voak et al., 1999). Thus, we repeated each of the analyses described above, including partnership status and all possible interactions between partnership status and the other predictors (see the Supplemental Material). These additional analyses also showed no evidence that women’s salivary steroid hormone levels were related to their facial-masculinity preferences or that oral contraceptive use decreased masculinity preferences.
Hypothesis 4: do facial-masculinity preferences change when women start or stop using the combined oral contraceptive pill?
During the course of the current study, 45 women changed their hormonal contraceptive use by either switching from using no hormonal contraceptive to using the combined oral contraceptive pill, or vice versa. There was a mean time of 360 days (
Because changes in oral contraceptive use could be associated with a change in partnership status, we repeated this analysis controlling for possible effects of changes in women’s partnership status (see the Supplemental Material). This additional analysis also did not show any evidence that using the combined oral contraceptive pill weakened women’s masculinity preferences.
Preferences for additional facial traits
Some previous studies have tested for effects of hormonal status on other aspects of women’s face preferences, such as preferences for femininity in women’s faces, facial symmetry, facial averageness, and apparent facial health (for a review, see Jones et al., 2008). Consequently, we also tested for effects of hormonal status on women’s preferences for these facial characteristics.
All male face preferences were assessed in the same short-term and long-term blocks with trial order fully randomized. All female face preferences were tested in a separate block, again with trial order fully randomized. The order in which women completed the short-term male attractiveness, long-term male attractiveness, and female attractiveness preference tasks in each test session was fully randomized. Femininity in women’s faces was manipulated using identical methods to those that were used to manipulate masculinity in men’s faces. Methods used to manipulate facial symmetry, facial averageness, and apparent facial health are reported by Quist et al. (2012), Jones, DeBruine, and Little (2007), and Wincenciak et al. (2015), respectively.
Analyses of these preferences using the same type of models we used to test for effects of hormonal status on masculinity preferences also showed no clear evidence that face preferences were consistently related to women’s hormonal status. Notably, we did not replicate putative effects of ovarian hormones on women’s preferences for symmetry or apparent health previously reported for women not using hormonal contraceptives (for a review, see Jones et al., 2008). Full results, along with the data, analysis files, and stimuli, are publicly available at osf.io/9b4y7. These full results include a significant negative effect of cortisol on preferences for male facial symmetry and a significant negative effect of progesterone on preferences for male facial averageness. Neither of these results were a priori predictions, so we suggest they should be treated as preliminary findings.
Discussion
Collectively, our analyses showed no compelling evidence that changes in women’s salivary hormone levels are associated with their facial-masculinity preferences or that the combined oral contraceptive pill decreases women’s masculinity preferences. 1 This was despite having a much larger sample size, having tested participants more often, and having used more reliable measures of hormonal status (e.g., measurements of multiple steroid hormones from saliva samples) than previous studies. Thus, the current study presents evidence against the popular and influential hypothesis that changes in women’s facial-masculinity preferences track changes in their hormonal status (Ditzen et al., 2017; Johnston et al., 2001; Little et al., 2013; Little & Jones, 2012; Penton-Voak et al., 1999; Penton-Voak & Perrett, 2000). Analyses of preferences for other facial traits (symmetry, averageness, apparent health) that some previous research had suggested may track changes in hormonal status also showed no compelling evidence for consistent effects of hormonal status on face preferences. Although we did observe a significant negative effect of cortisol on preferences for male facial symmetry and a significant negative effect of progesterone on preferences for male facial averageness, these findings were not predicted a priori and should be treated as preliminary. Indeed, given that symmetry and averageness are correlated in faces (see Jones et al., 2007), it is unclear why steroid hormones would have different effects on preferences for these facial characteristics.
A crucial piece of the rationale for predicting hormone-linked changes in women’s preferences for facial masculinity is the claim that facial masculinity is a cue of men’s heritable immunocompetence (Penton-Voak et al., 1999). Our null results for hormonal status and facial-masculinity preferences add to a growing body of evidence calling this assumption into question (Lee et al., 2014; Scott et al., 2014). Rather than functioning as a cue of men’s immunocompetence, men’s facial masculinity may primarily function as a cue of their intrasexual competitiveness (for a review, see Puts, 2010).
Although we found no evidence that women’s masculinity preferences are linked to their hormonal status, our analyses do suggest that women show stronger preferences for masculine facial characteristics when assessing men’s attractiveness for short-term relationships than when assessing men’s attractiveness for long-term relationships. Although this pattern of results is consistent with the proposal that perceived costs associated with choosing a masculine mate cause women’s preferences for masculinity in long-term partners to be weaker than preferences for masculinity in short-term partners (Little et al., 2011), we emphasize here that the effect of relationship context on masculinity preference was small.
In summary, and by contrast with previous research using smaller samples and less precise measures of hormonal status, our analyses show no compelling evidence for links between women’s hormonal status and preferences for facial masculinity. These results highlight the importance of employing large sample sizes and rigorous assessments of hormonal status (e.g., measures of salivary hormone levels) to test hypotheses concerning links between hormonal status and mate preferences.
Supplemental Material
JonesOpenPracticesDisclosure – Supplemental material for No Compelling Evidence That Preferences for Facial Masculinity Track Changes in Women’s Hormonal Status
Supplemental material, JonesOpenPracticesDisclosure for No Compelling Evidence That Preferences for Facial Masculinity Track Changes in Women’s Hormonal Status by Benedict C. Jones, Amanda C. Hahn, Claire I. Fisher, Hongyi Wang, Michal Kandrik, Chengyang Han, Vanessa Fasolt, Danielle Morrison, Anthony J. Lee, Iris J. Holzleitner, Kieran J. O’Shea, S. Craig Roberts, Anthony C. Little and Lisa M. DeBruine in Psychological Science
Supplemental Material
JonesSupplementalMaterial – Supplemental material for No Compelling Evidence That Preferences for Facial Masculinity Track Changes in Women’s Hormonal Status
Supplemental material, JonesSupplementalMaterial for No Compelling Evidence That Preferences for Facial Masculinity Track Changes in Women’s Hormonal Status by Benedict C. Jones, Amanda C. Hahn, Claire I. Fisher, Hongyi Wang, Michal Kandrik, Chengyang Han, Vanessa Fasolt, Danielle Morrison, Anthony J. Lee, Iris J. Holzleitner, Kieran J. O’Shea, S. Craig Roberts, Anthony C. Little and Lisa M. DeBruine in Psychological Science
Footnotes
Acknowledgements
We thank Sean Murphy, Ruben Arslan, Hans IJzerman, Steve Gangestad, and Lawrence Barsalou for helpful discussion and comments.
Action Editor
Steven W. Gangestad served as action editor for this article.
Author Contributions
The study was designed by B. C. Jones, L. M. DeBruine, A. C. Hahn, C. I. Fisher, S. C. Roberts, and A. C. Little. Data were collected by A. C. Hahn, C. I. Fisher, H. Wang, M. Kandrik, V. Fasolt, D. Morrison, I. J. Holzleitner, and C. Han. The data were analyzed by B. C. Jones, L. M. DeBruine, H. Wang, M. Kandrik, A. C. Hahn, A. J. Lee, I. J. Holzleitner, and K. J. O’Shea. The manuscript was drafted by B. C. Jones and L. M. DeBruine and revised by all of the authors.
Declaration of Conflicting Interests
The author(s) declared that there were no conflicts of interest with respect to the authorship or the publication of this article.
Funding
This research was supported by European Research Council grants awarded to B. C. Jones (OCMATE) and L. M. DeBruine (KINSHIP).
Open Practices
All data and materials have been made publicly available via the Open Science Framework and can be accessed at https://osf.io/9b4y7/. The design and analysis plans were not preregistered. The complete Open Practices Disclosure for this article can be found at http://journals.sagepub.com/doi/suppl/10.1177/0956797618760197. This article has received badges for Open Data and Open Materials. More information about the Open Practices badges can be found at
.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
