Abstract
Two-phase sampling designs, including nested case-control and case-cohort designs, are widely used in large cohort studies involving expensive biomarkers. Data from two-phase designs with a binary outcome are often analyzed with parametric models such as logistic regression. However, when the model assumptions do not hold, parametric models may lead to biased estimation and biased risk evaluation. In this paper, we propose a robust semiparametric regression model for binary outcomes and an easy-to-implement computational procedure that combines the pool-adjacent-violators algorithm with inverse probability weighting. We establish asymptotic properties, including consistency and the convergence rate. Simulation studies show that the proposed method performs well and is more robust than logistic regression. We apply the proposed method to real data from the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening Trial.
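As a rough illustration of how the pool-adjacent-violators algorithm can be combined with inverse probability weighting, the sketch below fits a non-decreasing risk curve to binary outcomes along a one-dimensional score, weighting each phase-two subject by the inverse of its sampling probability. It is a minimal sketch under stated assumptions, not the procedure proposed in the paper: the function names (`weighted_pava`, `ipw_isotonic_risk`), the use of a single known score, and the assumption that sampling probabilities are known and positive are all hypothetical choices made for illustration.

```python
from typing import List, Sequence


def weighted_pava(y: Sequence[float], w: Sequence[float]) -> List[float]:
    """Weighted non-decreasing least-squares fit to y (y already in score order).

    Assumes all weights are strictly positive.
    """
    # Each block holds [weighted mean, total weight, number of points].
    blocks: List[List[float]] = []
    for yi, wi in zip(y, w):
        blocks.append([float(yi), float(wi), 1])
        # Pool adjacent blocks while they violate monotonicity.
        while len(blocks) > 1 and blocks[-2][0] > blocks[-1][0]:
            m2, w2, n2 = blocks.pop()
            m1, w1, n1 = blocks.pop()
            total = w1 + w2
            blocks.append([(m1 * w1 + m2 * w2) / total, total, n1 + n2])
    fitted: List[float] = []
    for mean, _, n in blocks:
        fitted.extend([mean] * int(n))
    return fitted


def ipw_isotonic_risk(
    score: Sequence[float],
    outcome: Sequence[int],
    sampling_prob: Sequence[float],
) -> List[float]:
    """Monotone risk estimate for phase-two subjects, returned in input order.

    Hypothetical helper: weights are 1 / sampling_prob (inverse probability
    weighting). Tied scores are not pooled specially in this sketch.
    """
    order = sorted(range(len(score)), key=lambda i: score[i])
    y_sorted = [outcome[i] for i in order]
    w_sorted = [1.0 / sampling_prob[i] for i in order]
    fit_sorted = weighted_pava(y_sorted, w_sorted)
    fitted = [0.0] * len(score)
    for rank, i in enumerate(order):
        fitted[i] = fit_sorted[rank]
    return fitted


if __name__ == "__main__":
    # Toy data (made up for illustration): two subjects sampled with
    # probability 0.5 receive weight 2, the others weight 1.
    risk = ipw_isotonic_risk(
        score=[0.3, 0.1, 0.4, 0.2],
        outcome=[0, 0, 1, 1],
        sampling_prob=[0.5, 1.0, 0.5, 1.0],
    )
    print(risk)
```

In this toy example the outcomes at scores 0.2 and 0.3 violate monotonicity, so they are pooled into a single weighted mean; the subject with the smaller sampling probability contributes more to that mean.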