Abstract
Longitudinal data are often available in cohort studies and clinical settings, such as covariates collected at cohort follow-up visits or prescriptions captured in electronic health records. Such longitudinal information, if correlates with the health event of interest, may be incorporated to dynamically predict the probability of a health event with better precision. Landmarking is a popular approach to dynamic prediction. There are well-established methods for landmarking using full cohort data, but collecting data on all cohort members may not be feasible when resource is limited. Instead, one may select a subset of the cohort using subsampling designs, and only collect data on this subset. In this work, we present conditional likelihood and inverse-probability weighted methods for landmarking using data from cohort subsampling designs, and discuss considerations for choosing a particular method. Simulations are conducted to evaluate the applicability of the methods and their predictive performance in different scenarios. Results show that our methods have similar predictive performance to the full cohort analysis but only use small fractions of the full cohort data. We use real nested case-control data from the Prostate, Lung, Colorectal and Ovarian (PLCO) Cancer Screening Trial to illustrate the methods.
Keywords
Get full access to this article
View all access options for this article.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
