Abstract
Objective:
This study explores the effectiveness of tibolone and combined hormone therapy (HT) in alleviating sleep disorders in women aged 40–65 who are in menopausal transition and postmenopause.
Methods:
The ambidirectional cohort study (retrospective 2011–2015; prospective 2016–2021), conducted at Shanghai Sixth People’s Hospital’s specialized menopause clinic. Participants were given tibolone or combined HT based on their STRAW + 10 stratification and preference for continued menstruation. Demographic information, baseline, and reevaluated performance of the Pittsburgh Sleep Quality Index (PSQI) were collected for analysis of their differentiations and impact factors. Sleep quality was reassessed 12 months after treatment initiation via PSQI.
Results:
The cohort study involved 285 participants—184 in the tibolone cohort and 101 in the combination HT cohort. After 12 months, PSQI scores improved by −3.76 ± 4.12 with tibolone and –3.66 ± 4.23 with combined HT. The adjusted between group difference was –0.359 (95% confidence interval = 0.577; p = 0.534).
Conclusions:
The study suggests that tibolone and combined HT improved sleep to a similar extent after adjustment. Further comprehensive research is necessary to corroborate these observations.
Introduction
Menopausal symptoms can be highly disruptive for women during the transition period, which typically spans approximately 4 years before the final menstrual period. 1 The Study of Women’s Health Across the Nation divides the Stages of Reproductive Aging Workshop (STRAW + 10) broadly into three categories, namely, reproductive, menopausal transition, and postmenopause in order to guide the diagnosis and management of menopausal symptoms. 2 Sleep disturbances affect 40%–60% of menopausal women compared to 10.5% for hot flashes in Chinese women. 3 Statistically, sleep disturbances are present in 40%–60% women with menopausal symptoms, rendering it one of the most irritating symptoms. 4
Menopausal symptoms primarily result from declining estrogen production, disrupting hypothalamic pathways. Elevated follicle-stimulating hormone levels principally reflect ovarian aging rather than directly causing psychological symptoms. 5 Randomized trials and meta-analyses show that menopausal hormone therapy (MHT) remains the most effective treatment for vasomotor symptoms and confers ancillary benefits for bone, sleep, and urogenital health.6–8 Previous high-quality randomized controlled trial (RCT) and meta-analysis, such as National Institute for Health and Care Excellence, The North American Menopause Society, American College of Obstetricians and Gynecologists, have demonstrated the effectiveness of MHT on attenuating menopausal symptoms, yet head-to-head data comparing individual regimens are limited—an evidence gap addressed by the present study.9–11
While MHT reliably relieves vasomotor symptoms, 12 its sleep effects show scenario-dependent efficacy, with benefits mainly seen in women reporting hot flashes. 13 The 2016 Cochrane review systematically compared tibolone and combined hormone therapy (HT), demonstrating similar but slightly reduced efficacy in alleviating vasomotor symptoms (odds ratio [OR] = 1.57, 95% confidence interval [CI]: 1.18–2.10) after adjusting for attrition bias. 14 However, in contrast to what is hypothesized that sleeping disorders should have been significantly improved after vasomotor symptoms being controlled by MHT, Geiger’s RCT on transdermal estrogen plus progesterone for its palliation on sleep disorders found that only sleep latency (estimate = −0.12, p = 0.002) and sleep disturbance (estimate = −0.24, p = 0.04) had been demonstrated signs of improvements in comparison of placebo groups, and these effects persisted after adjustment for vasomotor intensity, implying a mechanism beyond hot-flush relief. 15 In addition, previous studies on the effect of MHT on sleep disorders have several limitations that hinder the credibility of the MHT, including but not limited to heterogeneity in trial population, the differences of pharmacokinetics and drug metabolisms on individuals, MHT formulation variations, and so on. 12 To address this gap, we compared tibolone (2.5 mg daily) with combined HT (oral estradiol + dydrogesterone) in peri- and postmenopausal women followed for 12 months. We hypothesized that tibolone would produce improvements in global sleep quality equivalent to those achieved with combined MHT.
Materials and Methods
Study participants
The study, spanning from October 2010 to July 31, 2021, at Shanghai Sixth People’s Hospital’s specialized menopause clinic, executed an ambidirectional (retrospective–prospective) cohort study involving menopausal patients: routine-care records from October 1, 2010, to May 31, 2016, were abstracted retrospectively, and eligible women seen from June 1, 2016, to July 31, 2021, were enrolled and followed prospectively. Participants were chosen through convenience sampling and included women aged 40–65 years in the peri- or postmenopausal stages as defined by STRAW + 10 criteria, without contraindications to MHT, and expressing willingness for participation. Exclusions were made for primary sleep disorders, including restless legs syndrome, obstructive sleep apnea, and narcolepsy, and for those who underwent estrogen therapy within the previous 6 months. During the study period, 1091 women attended the clinic; 766 met eligibility criteria. The Institutional Review Board (IRB) granted a waiver of consent for retrospective chart review through May 31, 2016; from June 1, 2016, onward, all participants provided written informed consent. In total, 285 women were enrolled and included in the analysis. Ethical approval for this study was granted by the Ethics Committee of Shanghai Jiao Tong University’s Sixth People’s Hospital (2016-R07; Registration Number: ChiCTR-IPR-16008754).
Study design
This ambidirectional (retrospective–prospective) cohort study, conducted at Shanghai Sixth People’s Hospital’s specialized menopause clinic, divided participants into two treatment groups based on their STRAW + 10 stratification and preference for continued menstruation. The tibolone group received a dosage of 2.5 mg/day in the morning, while dydrogesterone 10 mg was administered once daily on days 15–28 of each 28-day cycle, with continuous oral 17β-estradiol 2 mg in the combined HT group. Both regimens were standardized for all women in this group throughout the 12-month follow-up. No participant used an alternative schedule. For participants in stages −2 and −1, combined HT was recommended. Those in stage 0 or +1 were given the choice; if they preferred to continue menstruating, combined HT was administered, otherwise, tibolone was prescribed. Participants in stages +1b, +1c, and +2 were advised to undergo tibolone therapy. This approach ensured that treatment was tailored to the individual needs and stages of each participant.
Data collection
Demographic information including name, age, height, weight, marital status, occupation, educational level, monthly income, contraceptive history, smoking and alcohol intake history, and previous estrogen treatments was collected. Sleep disorders were assessed using the Pittsburgh Sleep Quality Index (PSQI), a standardized questionnaire administered by trained researchers, conducted twice: once before treatment initiation and again at the 1-year follow-up.
Parameters such as subjective sleep quality, sleep latency, sleep duration, habitual sleep efficiency, sleep disturbance, use of sleeping medication, daytime dysfunction, and global PSQI score were separately evaluated based on the participant’s responses to the PSQI questionnaire, which typically asks about sleep habits over the previous months. The PSQI assesses sleep quality across seven components: subjective sleep quality (overall perception of sleep, rated 0–3), sleep latency (time to fall asleep, scaled 0–3), sleep duration (hours of actual sleep, rated 0–3), habitual sleep efficiency (percentage of time spent asleep while in bed, rated 0–3), sleep disturbance (frequency of disturbances like waking up at night, rated 0–3), use of sleeping medication (frequency of medication use, scaled 0–3), and daytime dysfunction (impact of sleep on daily activities, rated 0–3). The Global PSQI Score, ranging from 0 to 21, is a summation of these components, with higher scores indicating poorer sleep quality. Overall score surpassing 5 was considered as the cutoff value to indicate sleep disorders. Patients were evaluated at baseline and posttreatment evaluation occurred at 12 months to maximize drug effect observation.
Statistical method
The data processing involved preprocessing, statistical description, and both one-way and multivariate analyses, conducted using R (version 4.3.0). The measurement data conformed to a normal distribution and were expressed using the mean and standard deviation (SD). In cases where the data did not follow a normal distribution, the median, maximum, and minimum values were used instead. The statistical analysis included two-sample t-tests and analysis of variance for datasets that followed a normal distribution, and nonparametric tests were utilized for data that did not meet the assumptions of normality. The categorical data were presented as frequency and percentage, and χ2 tests were employed for the differences between categorical groups, as well as demographic variables, relevant treatment history, baseline PSQI scores, and changes in PSQI levels after treatment. A linear regression model was used to examine the impact of sleep alleviation of two types of drugs after adjusting for covariates that were the baseline demographic variables that differed between treatment groups. A p-value of <0.05 was considered statistically significant.
Results
Participants’ description
A total of 285 participants were recruited for this trial, 184 in the tibolone group and 101 in the combined HT group. The mean age of the participants was 53.2 years (SD = 3.8) in the tibolone group and 47.9 years (SD = 4.6) in the combined HT group. The difference in age between the two groups was statistically significant (p < 0.001). The mean body mass index (BMI) in the tibolone and combined HT groups was 22.15 (SD = 3.29) and 21.48 (SD = 2.41), respectively, and the difference was not significant at the 0.05 level (p = 0.059). Most participants (97% or more) in both groups were married. Participants in the combined HT group were more likely to be less educated and unemployed with lower income than their counterparts in the tibolone group (p < 0.05). Contraceptive use, smoking habits, and alcohol consumption were largely similar between the two groups. See Table 1.
Basic Characteristics
*p < 0.05; **p < 0.01.
Mean (SD); n (%).
Welch’s two-sample t-test; Fisher’s exact test; Pearson’s chi-squared test; Wilcoxon rank-sum test.
HT, hormone therapy; SD, standard deviation; IUD, intrauterine device.
Comparison of pre- and posttreatment changes in sleep parameters
The results presented in Figure 1 are based on univariate analysis of differences in PSQI scores before and after treatment. At the 12-month follow-up visit, PSQI scores improved in both the combined HT and tibolone groups in all sleep parameter scores posttreatment (p < 0.001). Women in the tibolone group experienced a greater reduction in sleep disturbance scores compared to the combined HT group (mean difference: 0.45 vs. 0.26, p = 0.012). However, differences in other PSQI components between the two groups were not statistically significant (p > 0.05). Further multivariate analysis is required to confirm these findings and adjust for potential confounders.

Difference in PSQI score pre- and posttreatment by treatment groups. PSQI, Pittsburgh Sleep Quality Index.
Univariate analysis of differences in overall scores between pretreatment and posttreatment
The univariate analysis assessed the association between potential influencing factors and the differences in overall scores (Δd) between pretreatment and posttreatment. This analysis did not identify statistically significant relationships between Δd and variables such as marital status, occupational status, education level, monthly income, contraception method, age, BMI, smoking history, or alcohol intake (all p > 0.05). Although some trends were observed—for instance, a positive association with oral contraception use (β = 2.941, p = 0.335) and a potential negative association with “Unemployed” occupational status (β = −0.905, p = 0.493)—none reached statistical significance. The results, based on the Δd as the dependent variable, are detailed in Table 2.
Comparison of Differences in Overall Scores Between Pretreatment and Posttreatment: Results of Univariate Analysis
BMI, body mass index; CI, confidence interval; SE, standard error; IUD, intrauterine device.
Both tibolone and combined HT produced significant improvements across all PSQI domains after 12 months of treatment (Table 3). Table 4 compares sleep-related outcomes pre- and postintervention across the overall sample (n = 285), tibolone group (n = 184), and combined HT group (n = 101). Sleep disturbance pre-intervention significantly differed between the groups (p < 0.001), with higher scores in the tibolone group. Postintervention, daytime dysfunction showed a significant difference between groups (p = 0.024), with the combined HT group reporting higher scores. Change scores (post–pre) indicate an overall reduction in sleep-related issues across both groups, though only changes in sleep disturbance (p = 0.012) reached statistical significance between groups. No significant differences were observed for the change in overall sleep scores (p = 0.599) or other subdomains such as sleep quality, latency, efficiency, or the use of sleep medication. These results suggest that both interventions improved sleep outcomes, but specific aspects such as sleep disturbance may differ in their impact between the treatments. See Table 4.
Comparison of the Effectiveness of Sleep Alleviation Within the Tibolone and Combined Hormone Therapy Groups
*p < 0.05; **p < 0.01.
Comparison of the Effectiveness of Sleep Alleviation Between the Tibolone and Hormone Therapy Groups
Comparison of the effectiveness between the tibolone and HT groups: Results of multivariate linear regression analyses
The data revealed the indistinct sleep alleviation effects in terms of PSQI scores differences between the tibolone group and the combined HT group. After adjustment for age, occupation, education, and income, between-group differences became nonsignificant. The total score, sleep quality, latency, duration, efficiency, use of sleeping medication, and daytime dysfunction showed no significant differences between the groups as well (p > 0.05). See Table 5.
Comparison of the Effect of Sleep Alleviation Between the Two Drug Groups: Results of Multivariate Linear Regression Analyses a
Combined HT and tibolone groups were controlled in the above linear regression analyses for age, occupation, education, and income.
Discussion
Both tibolone and cyclical estradiol plus dydrogesterone produced clinically meaningful improvements in global PSQI and in every component score. In univariate analyses tibolone yielded a slightly larger reduction in the “sleep disturbance” component than combined HT (−0.45 ± 0.63 vs. −0.26 ± 0.61, p = 0.012); however, after adjustment for age, STRAW stage, and socioeconomic covariates, this difference was no longer significant (β = −0.165, 95% CI = 0.086, p = 0.057). The convergence of posttreatment scores indicates that the baseline disturbance—rather than an intrinsic pharmacological superiority—accounts for the unadjusted change.
Tibolone was taken once daily in the morning, consistent with its mild androgenic profile, whereas 17β-estradiol was continuous and dydrogesterone was periodical (days 15–28 of each 28-day cycle). Although androgen receptor activation has been hypothesized to stabilize arousal thresholds, 16 current evidence does not demonstrate a distinct sleep pathway for tibolone beyond relief of vasomotor symptoms. Accordingly, our data do not support preferential use of tibolone solely for insomnia when combined HT is otherwise suitable.
The sleep quality during menopause is also predominantly influenced by other endocrine and chronobiological factors. 17 Recent studies associate elevated 24-hour urinary free cortisol, 18 declining nocturnal melatonin,19,20 and age-related phase shifts in core body temperature21,22 with poorer sleep quality in menopausal women. Unfortunately, we could not test these pathways due to the lack of biomarker data. However, their potential contribution underscores the need for future trials that integrate objective sleep metrics with serial measures of cortisol, melatonin, and vasomotor intensity. Such multimodal designs would clarify whether adjunct circadian-based interventions—light therapy, timed exercise, or melatonin agonists—can augment the sleep benefit we observed with tibolone and combined MHT.
This study offers a rare, head-to-head comparison of tibolone and cyclical estradiol plus dydrogesterone, delivered in protocol-standardized doses and evaluated with the PSQI at baseline and 12 months. The ambidirectional design combines a decade of research data with uniform outcome measurement, with over 200 participants. The analysis demonstrates that both regimens comparably improve menopausal sleep, providing credible evidence to inform guideline-recommended, patient-centered therapy.
This study has several limitations that warrant consideration. First, treatment allocation was non-randomized and preference-based, raising the possibility of selection bias. Second, key manifestations such as hot-flush severity and mood status (e.g., GAD-7, PHQ-9) were not recorded for the entire ambidirectional cohort because unmeasured vasomotor changes would likely bias toward the null, given similar efficacy between regimens, 12 yet its absence remains a constraint. Third, recruitment from a single specialized menopause clinic restricts ethnic and socioeconomic diversity and therefore limits external validity. Fourth, the 12-month follow-up may be insufficient to characterize longer-term benefits and risks of either regimen. Last, sleep was assessed exclusively with the PSQI, a subjective instrument susceptible to recall bias. 23 Future investigations should combine patient-reported measures with objective methodologies such as actigraphy or polysomnography. Collectively, these constraints highlight the need for multicenter, randomized trials with extended follow-up to delineate the comparative effectiveness of tibolone and combined HT as well as other MHT.
Conclusions
The study indicates that tibolone and combined HT are equally effective in improving sleep quality after adjustment for baseline differences. Further researches are warranted to substantiate these findings.
Authors’ Contributions
Z.S.: Conceptualization (equal), data curation (lead), formal analysis (lead), writing—original draft (lead), and writing—review and editing (equal). C.L.: Data curation (equal), formal analysis (equal), and writing—review and editing (equal). F.J.: Interpretation of results (equal) and writing—review and editing (equal). D.S.: Writing—review and editing (lead), supervision (equal), and validation (equal). M.T.: Conceptualization (lead), study design (lead), supervision (lead), writing—review and editing (equal), and project administration (lead).
Footnotes
Acknowledgments
The authors express their sincere gratitude to the participants of this study for their invaluable contributions. They also appreciate the support provided by the Menopause and Elderly Women’s Bone Health Management Project, funded by the Shanghai Municipal Health Commission. Additionally, the authors acknowledge the contributions of the medical and research staff at Shanghai Sixth People’s Hospital for their assistance in data collection and patient follow-up. Finally, the authors thank their colleagues and mentors for their insightful feedback, which has helped refine their analysis and interpretation.
Author Disclosure Statement
The authors declare that there are no conflicts of interest regarding the publication of this article.
Funding Information
The source of funding comes from the Menopause and Elderly Women’s Bone Health Management Project sponsored by the Shanghai Municipal Health Commission.
