
Abstract

Keywords

Alzheimer's disease, neurobehavioral markers, passive sensing, naturalistic driving, longitudinal modeling

Introduction

Dementia is prevalent among older adults (age ≥65 years) globally,¹ with Alzheimer's disease (AD) being the most common etiology; AD cases are estimated to reach nearly 14 million by 2050 in the United States.² AD can be detected 10–25 years before clinical onset using gold-standard biomarkers; however, these are costly, invasive, and hard to scale at the population level.^3–5 Conventional biomarkers create high individual-test costs, patient burden, and limited access outside major metropolitan areas.^6–8 Plasma biomarkers obtained via blood draws offer a more accessible and efficient alternative⁵; however, many assays are still undergoing validation in large, diverse cohorts.⁹

Passive body-worn sensors, smartphones, or in-vehicle devices offer a complementary, continuous, and low-burden signal of everyday function.^10,11 These data streams can quantify subtle changes in basic activities of daily living, like sleep efficiency, gait speed, and heart rate, but can also assess instrumental activities of daily living, like managing finances,¹² cognitive fluctuations,¹³ and driving behavior.¹⁴ Over the past decade, our team deployed in-vehicle telematics sensors to study driving, linking behavior to biofluid, imaging, and clinical symptoms as markers of AD.^15,16 Across multiple cohorts, we showed that preclinical AD pathology, depression, and day-to-day cognitive variability are associated with—and can forecast—changes in route, complexity, and adverse behaviors years before clinical impairment or driving cessation.^17–23 We complemented these discoveries with methodological work (e.g., disease-specific modeling, feature development, machine learning), attention to disparities, geriatrics, and translational science.^24–29

We aver that digital markers will serve patients only if infrastructure, governance, and ethics are treated as intentional outcomes. Passive sensing can complement molecular markers, but it is not a plug-and-play solution. The field advances when we privilege (1) interpretability over volume, (2) consent that evolves with declining cognitive capacity, and (3) vendor-agnostic, versioned pipelines. We posit that these commitments are the gatekeepers between research and equitable clinical utility. Operational choices matter clinically because interpretability, fairness, and longitudinal stability are prerequisites for robust, equitable, and generalizable results and for precision medicine. Dual timestamping prevents day-night misclassification, which would bias sleep and circadian metrics. Schema versioning and contract tests ensure reproducibility, which is crucial for regulatory review. Transparent handling of missing data guards against subgroup drift, which could undermine equity. These choices determine whether a digital marker can forecast meaningful outcomes (e.g., independence, safety, disease progression, change-point detection), be understood by clinicians and families, and ultimately justify its use alongside, or in advance of, molecular biomarkers.

We distill five lessons from large, longitudinal deployments, covering signal gaps, missing GPS, time handling, data volume, and vendor dependencies, and we couple each with pragmatic standard operating procedures that researchers can adopt. We use driving as a contextual exemplar, but the lessons generalize to passive sensing broadly. This perspective targets clinical and digital health researchers and decision-makers and provides the technical details that affect study validity, equity, and translation.

Lessons from deployment

The challenges and remedies apply broadly to passive sensing, including wearables (e.g., PPG/actigraphy), smartphones (e.g., GPS/IMU), home/ambient sensors (e.g., Wi-Fi localization, PIR), and environmental monitors (e.g., air quality). This section summarizes five common challenges inherent to sensor data, regardless of the commercial-off-the-shelf (COTS) platform used for passive measurement (Figure 1). The remainder of this perspective will use COTS dataloggers as a benchmark for passively collecting driving data from vehicles for behavioral analysis. These dataloggers record per-trip driving metrics in separate data streams (30- and 1-s intervals), convert those metrics into comma-separated value (CSV) files using a vendor's pipeline, and upload those CSV files to a cloud repository for researchers to access.

Figure 1.

Prospective data ecosystem of datalogger from a third-party vendor spanning participant use to research output, with five key challenges overlaid onto the pipeline.

Lesson 1: Incomplete data stream (applies to any passive stream)

Sensor-based longitudinal studies are vulnerable to interruptions in data transfer from devices to the cloud or local storage due to device malfunctions, connectivity issues, weather, and electromagnetic or solar conditions. In-vehicle telematics are particularly sensitive because signal capture depends on external infrastructure (e.g., cellular networks, internet service providers [ISP], Global Navigation Satellite System [GNSS]); dead zones, poor satellite connectivity, and tampering can result in incomplete trips or missing key fields (e.g., speed, GPS). Digital health researchers should deploy automated quality-control pipelines that validate file schemas, monitor completeness in near-real time, flag gaps in data transmission against expected volume (file size), and trigger re-ingestion when feasible. If recoverable variables are absent, derive them from raw data (e.g., compute total trip time by summing the recorded 1-s or 30-s epochs). For irrecoverable gaps, avoid single-value methods such as mean imputation; instead, use time-series approaches (e.g., Kalman/state-space smoothing, multiple imputation, model-based estimation) with prespecified sensitivity analyses. Sensor datasets also contain identifiers (e.g., device/participant, vehicle, latitude/longitude). Missing or incomplete identifiers typically indicate improper installation/pairing or vendor-side redaction. These fields should not be imputed but instead resolved at the source via documented communication with participants (to confirm use) and vendors (to reconcile data versions/backend processing). Researchers should specify governance upfront, including which identifiers are collected, how they are anonymized, and how linkage keys are protected, to preserve data integrity, privacy, and reproducibility, consistent with FAIR principles and digital health reporting standards (Table 1).

Table 1.

Challenges and insights for processing sensor data.

Challenge	Example	Potential Cause(s)	Quality Assurance	Quality Control
Incomplete data stream	Missing GPS/GNSS coordinates, date-times, IDs	Degraded GPS/ISP/GNSS data quality or latency period of file upload	Compute SHA-256 checksums for files and account for all missing or mis-formatted data	All imputations (e.g., Kalman, model-based estimation) should be flagged with uncertainty metadata
Missing GPS data	Missing start and end coordinates of trips	Poor quality GPS/GNSS connection	Check trips for missing start and end coordinates	Impute missing start and end coordinates from prior and subsequent trips, respectively
Specific data transformations	Date-times are recorded in UTC and local time zones are ambiguous	Vendor best practices enforce uniform data management and schema	Identify trips affected by different time zones or Daylight Saving Time	Filter trips based on locale, and perform logical conversions to local time zone
Processing large volumes of data	High frequency data necessitates large computational resources	Duration of study, frequency of observation, specific research question	Prior to analysis, estimate runtime for processing data	Use file compression, implement parallel processing, acquire dedicated hardware
Communication considerations	Files can be downloaded but not parsed	Vendor updated their pipeline and altered file format	Create, enforce, and update a data transfer agreement	Maintain communication with vendor support team(s)
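The quality-control steps described for Lesson 1 and in the first row of Table 1 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the pipeline used in the studies cited here: the column names in `REQUIRED_COLUMNS`, the function names, and the sample rows are all hypothetical, and a real vendor export will define its own schema.

```python
import csv
import hashlib
import io

# Hypothetical column names; a real vendor export defines its own schema.
REQUIRED_COLUMNS = {"trip_id", "timestamp_utc", "speed_kmh", "latitude", "longitude"}


def sha256_of(data: bytes) -> str:
    """Return the SHA-256 hex digest, used to detect corrupted or altered files."""
    return hashlib.sha256(data).hexdigest()


def check_trip_file(data: bytes, epoch_seconds: int, min_bytes: int = 1) -> dict:
    """Run basic completeness checks on the bytes of one per-trip CSV file."""
    report = {"sha256": sha256_of(data), "size_ok": len(data) >= min_bytes}
    reader = csv.DictReader(io.StringIO(data.decode("utf-8")))
    rows = list(reader)
    header = set(reader.fieldnames or [])
    # Schema check: which required columns are absent from the header?
    report["missing_columns"] = sorted(REQUIRED_COLUMNS - header)
    # Completeness check: count rows with an empty value in any required column.
    present = REQUIRED_COLUMNS & header
    report["rows_with_gaps"] = sum(
        1 for row in rows if any(not (row.get(col) or "").strip() for col in present)
    )
    # Recoverable variable: total trip time derived from the recorded epochs.
    report["trip_seconds"] = len(rows) * epoch_seconds
    return report


# Illustrative 30-s stream with one row that lost its GPS fix.
sample = (
    "trip_id,timestamp_utc,speed_kmh,latitude,longitude\n"
    "t1,2020-01-15T02:30:00+00:00,0,38.63,-90.20\n"
    "t1,2020-01-15T02:30:30+00:00,25,,\n"
    "t1,2020-01-15T02:31:00+00:00,40,38.64,-90.21\n"
).encode("utf-8")
report = check_trip_file(sample, epoch_seconds=30)
```

In practice the report would be written to a log and compared against expected file counts and sizes per participant and day, so that gaps trigger re-ingestion or follow-up rather than silent imputation.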

Lesson 2: Missing GPS data (location-aware sensors)

Telematics sensor data often lack GPS coordinates at trip start or end, which prevents accurate origin–destination assignment. This typically reflects degraded cellular/ISP service or slow GNSS acquisition (e.g., weak signal at ignition, shielding), rather than participant nonadherence, and is beyond the control of researchers or vendors. For digital-health workflows, we recommend a prespecified hierarchical imputation: (1) when the temporal gap between consecutive trips is short and the terminal speed is ∼0 km/h, carry forward the prior trip's end point to impute the next start (and vice versa), enforcing distance thresholds (e.g., ≤ 100–200 meters) and time limits (e.g., ≤ 3 h); (2) if intermediate coordinates exist, apply map-matching or state-space/Kalman smoothing to infer the most probable origin/destination on the road network; (3) when criteria are not met, label the endpoints as missing rather than forcing values. All imputations should be flagged with uncertainty metadata, and sensitivity analyses should confirm that endpoint reconstruction does not change primary inferences. Runs of consecutive trips with missing start/end coordinates point to a problem with the device or with reception at a specific location. Build automated quality checks to detect such runs and trigger participant/vendor follow-up (e.g., installation check, firmware update, device replacement). Finally, do not impute or manufacture core identifiers (vehicle/participant IDs, latitude/longitude) but resolve these at the source to preserve linkage integrity, privacy, and reproducibility.
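The first and third tiers of the hierarchy above (carry-forward with thresholds, otherwise label as missing) might look like the following sketch. The `Trip` structure, the field names, and the default thresholds are illustrative assumptions. In particular, the text does not specify which two points the distance threshold compares; here it is assumed to be the prior trip's end point and the first valid GPS fix recorded within the next trip, and the check is skipped when no such fix exists. Tier 2 (map-matching or Kalman smoothing) is not shown.

```python
import math
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional, Tuple

Coordinate = Tuple[float, float]  # (latitude, longitude) in decimal degrees


@dataclass
class Trip:  # hypothetical structure for illustration
    start_time: datetime
    end_time: datetime
    start: Optional[Coordinate]
    end: Optional[Coordinate]
    first_fix: Optional[Coordinate] = None  # first valid GPS fix within the trip
    terminal_speed_kmh: float = 0.0  # speed at the last recorded epoch


def haversine_m(a: Coordinate, b: Coordinate) -> float:
    """Great-circle distance in meters between two coordinates."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371000.0 * math.asin(math.sqrt(h))


def impute_start(prev: Trip, nxt: Trip,
                 max_gap: timedelta = timedelta(hours=3),
                 max_dist_m: float = 200.0,
                 max_speed_kmh: float = 1.0) -> Tuple[Optional[Coordinate], str]:
    """Return (start coordinate, provenance flag) for the later trip."""
    if nxt.start is not None:
        return nxt.start, "observed"
    if prev.end is None:
        return None, "missing"
    gap = nxt.start_time - prev.end_time
    if gap < timedelta(0) or gap > max_gap:
        return None, "missing"  # time limit not met
    if prev.terminal_speed_kmh > max_speed_kmh:
        return None, "missing"  # vehicle was not stationary at the prior end
    # Assumed interpretation of the distance threshold (see lead-in).
    if nxt.first_fix is not None and haversine_m(prev.end, nxt.first_fix) > max_dist_m:
        return None, "missing"
    return prev.end, "imputed_carry_forward"


prev = Trip(datetime(2020, 1, 15, 9, 40), datetime(2020, 1, 15, 10, 0),
            (38.6290, -90.2000), (38.6300, -90.2000))
nxt = Trip(datetime(2020, 1, 15, 10, 45), datetime(2020, 1, 15, 11, 5),
           None, (38.6500, -90.2100), first_fix=(38.6305, -90.2000))
coord, flag = impute_start(prev, nxt)
```

Returning the provenance flag alongside the coordinate keeps imputed endpoints distinguishable from observed ones, which is what allows the sensitivity analyses described above.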

Lesson 3: Transforming time values for use (all time-stamped sensors)

Timestamping is foundational in sensor-based research; however, the choice of time standard determines what questions can be answered. Storing observations in Coordinated Universal Time (UTC) creates a single, consistent reference that simplifies aggregation across regions. However, UTC obscures individual behavior that depends on local clock time, such as the ratio of day-to-night driving, rush-hour exposure, or alignment with circadian/sleep–wake cycles. A trip that occurs at night for a driver on the East Coast can appear to fall on a different calendar day and at a different hour when viewed only through UTC, leading to misclassification of circadian patterns and biased subgroup comparisons. Recording both a canonical UTC timestamp and a location-aware local time removes this obstacle. The local time should be computed based on the geographic location of the event and must incorporate the correct historical rules for standard time and daylight saving time. For trips that cross time-zone boundaries, each event should be assigned the local time of the location where it occurred, not that of the trip's origin or destination. Alongside these fields, store the human-readable name of the time zone and the numeric offset from UTC in minutes at the moment of the observation. This dual representation preserves global consistency while enabling analyses meaningful to an individual's behavior. Over the course of 10 years, our participants drove across the contiguous US as well as parts of Canada and Mexico (Figure 2). Analytically, researchers should derive day–night indicators, day-of-week indicators, and holiday or school-calendar flags from local time and, when relevant, classify light conditions using sunrise and sunset for the event's GPS location and date rather than a fixed temporal threshold.
Quality control should include checks for temporal anomalies (e.g., skipped or repeated hours around daylight saving transitions), negative or implausible trip durations, and discrepancies between reported location and time-zone metadata. All transformations must be reproducible, documenting the time-zone database and software versions used, retaining the original device timestamps, and logging each conversion in the data-processing provenance. Adopting these practices enables investigators to compare participants across regions equitably and to draw valid inferences about diurnal patterns, mobility, and health behaviors throughout a longitudinal study.
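The dual representation described in Lesson 3 can be illustrated with Python's standard `zoneinfo` module, which reads the IANA time-zone database and therefore applies historical standard-time and daylight-saving rules. This is a sketch under assumptions: the function name and output field names are hypothetical, and the time-zone name is taken as an input, since resolving it from GPS coordinates needs a separate lookup that is not shown.

```python
from datetime import datetime, timezone
from zoneinfo import ZoneInfo


def add_local_time(utc_dt: datetime, tz_name: str) -> dict:
    """Return a record holding both the canonical UTC and the local timestamp."""
    if utc_dt.tzinfo is None:
        # Refuse naive timestamps rather than guessing their reference frame.
        raise ValueError("timestamp must be timezone-aware")
    utc_dt = utc_dt.astimezone(timezone.utc)
    local_dt = utc_dt.astimezone(ZoneInfo(tz_name))
    return {
        "timestamp_utc": utc_dt.isoformat(),
        "timestamp_local": local_dt.isoformat(),
        "tz_name": tz_name,
        # Offset from UTC in minutes at the moment of the observation.
        "utc_offset_min": int(local_dt.utcoffset().total_seconds() // 60),
        "local_hour": local_dt.hour,
    }


# The same UTC clock time in winter and in summer for an East Coast driver.
winter = add_local_time(datetime(2020, 1, 15, 2, 30, tzinfo=timezone.utc), "America/New_York")
summer = add_local_time(datetime(2020, 7, 15, 2, 30, tzinfo=timezone.utc), "America/New_York")
```

In this example the winter event maps to 21:30 on the previous local calendar day with an offset of -300 minutes, and the summer event to 22:30 with an offset of -240 minutes, so day, hour, and day-of-week features all differ from what the UTC value alone would suggest. The version of the time-zone database in use should be recorded with the output, as the provenance requirements above describe.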

Figure 2.

Starting and ending destinations of trips made by participants enrolled from July 1, 2015, to July.

Lesson 4: Processing large volumes of data (high-frequency data streams)

High-fidelity sensors can generate event-level data at one-second intervals, producing a separate file for each vehicular trip and, over longitudinal follow-up, hundreds of thousands of raw objects. This “small-files problem” quickly exceeds the capacity of local machines and processing scripts. A digital health-ready architecture should leverage cloud storage (e.g., AWS S3, Azure Blob, or Google Cloud Storage) as a centralized data lake for raw objects, with scheduled consolidation into open archival or analytics-optimized formats (e.g., tarball, Parquet) that incorporate compression and partitioning by participant and date. Store derived features and metadata in a relational warehouse, maintain schema versioning and a metadata catalog, and capture full extract-transform-load (ETL) provenance. For processing, deploy parallel and distributed computing (e.g., Dask, Spark, or Ray) to ingest, validate, and extract features from files at scale, and containerize workflows with orchestration (e.g., Airflow/Prefect) for reproducibility. This stack enables near-real-time completeness checks, rapid architecture development, and scalable modeling. Implementing role-based access control, encryption at rest/in transit, audit logging, and cost monitoring can meet privacy and governance requirements while keeping budgets predictable. Together, these practices turn high-volume sensor streams into an efficient, secure, and analyzable pipeline suitable for multi-year, multi-site digital health studies.
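As a small-scale illustration of the consolidation step in Lesson 4, the sketch below reads many per-trip CSV files in parallel and writes one compressed summary file. It uses only the Python standard library (a thread pool and gzip) as a stand-in for the distributed engines and columnar formats named above, so it shows the pattern rather than a production design. The function names, the `speed_kmh` column, and the summary fields are hypothetical.

```python
import csv
import gzip
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path

SUMMARY_FIELDS = ["file", "n_rows", "max_speed_kmh"]


def summarize_trip(path: Path) -> dict:
    """Reduce one per-trip CSV file to a single summary row."""
    with path.open(newline="") as f:
        rows = list(csv.DictReader(f))
    # Skip empty speed values instead of failing on them.
    speeds = [float(r["speed_kmh"]) for r in rows if r.get("speed_kmh")]
    return {
        "file": path.name,
        "n_rows": len(rows),
        "max_speed_kmh": max(speeds, default=float("nan")),
    }


def compact(raw_dir: Path, out_file: Path, workers: int = 8) -> int:
    """Summarize every CSV in raw_dir in parallel and write one gzip CSV."""
    paths = sorted(raw_dir.glob("*.csv"))
    # Threads suit this I/O-bound read; map() preserves the input order.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        summaries = list(pool.map(summarize_trip, paths))
    with gzip.open(out_file, "wt", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=SUMMARY_FIELDS)
        writer.writeheader()
        writer.writerows(summaries)
    return len(summaries)
```

A call such as `compact(Path("raw/participant_001"), Path("summary.csv.gz"))` (paths are placeholders) would replace thousands of small reads in later analyses with a single read. Timing this function on a sample of files is also one way to estimate total runtime before analysis, as Table 1 suggests.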

Lesson 5: Managing expectations and communication (any vendor-mediated data)

When partnering with vendors to store or process high-volume sensor data, treat the relationship as part of the informatics pipeline. Establish a Data Transfer Agreement and change-control plan that specifies versioned schemas, machine-readable data dictionaries, units/time zones, file formats (e.g., CSV/Parquet plus JSON metadata), upload cadence, and provenance fields. Since vendors often modify internal pipelines without notice, it is imperative to request advance release notes, backward-compatible exports, and a staging area where researchers can run automated contract tests (e.g., schema validation, type checks) before data ingestion. Request explicit pipeline documentation describing storage structure, feature definitions, and update frequency, and mandate prompt notification of any changes with dated version tags. Maintain regular, bidirectional communication and keep governance explicit, especially regarding how identifiers are handled/pseudonymized and what logs and audit trails are retained. The goal is to ensure that longitudinal datasets remain interoperable, reproducible, and analysis-ready throughout a study.
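A contract test of the kind mentioned in Lesson 5 can be quite small. The sketch below checks an incoming CSV export against an agreed column order and value types before ingestion. The `CONTRACT` dictionary, its version string, and the column names are hypothetical placeholders for whatever the Data Transfer Agreement actually specifies.

```python
import csv
import io

# Hypothetical contract; in practice this mirrors the machine-readable data dictionary.
CONTRACT = {
    "schema_version": "1.0",
    "columns": {
        "trip_id": str,
        "timestamp_utc": str,
        "speed_kmh": float,
        "latitude": float,
        "longitude": float,
    },
}


def contract_violations(csv_text: str, contract: dict = CONTRACT) -> list:
    """Return a list of human-readable violations; an empty list means pass."""
    reader = csv.DictReader(io.StringIO(csv_text))
    expected = list(contract["columns"])
    found = list(reader.fieldnames or [])
    if found != expected:
        # A changed header is reported once; row checks would only add noise.
        return [f"header mismatch: expected {expected}, found {found}"]
    problems = []
    for line_no, row in enumerate(reader, start=2):  # line 1 is the header
        for name, caster in contract["columns"].items():
            value = row[name]
            if value is None or value == "":
                continue  # missing values are left to the completeness checks
            try:
                caster(value)
            except ValueError:
                problems.append(f"line {line_no}: {name}={value!r} is not {caster.__name__}")
    return problems


header = "trip_id,timestamp_utc,speed_kmh,latitude,longitude\n"
good = header + "t1,2020-01-15T02:30:00+00:00,25.0,38.63,-90.20\n"
bad_type = header + "t1,2020-01-15T02:30:00+00:00,fast,38.63,-90.20\n"
renamed = good.replace("speed_kmh", "speed_mph")

ok_result = contract_violations(good)
type_result = contract_violations(bad_type)
header_result = contract_violations(renamed)
```

Running such a check in a staging area means that a silent vendor-side change, like the renamed column in the last example, is caught before it reaches the analysis dataset.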

Ethics, trust, and lived realities

Digital health in dementia raises questions beyond data pipelines. Consent evolves as decisional capacity declines. Protocols should anticipate re-consent, caregiver assent, and clear opt-out paths that do not compromise care. Participant burden persists even in “passive” sensing: installation, charging, and perceived surveillance can cause distress. Minimizing burden and offering plain-language dashboards that return value (e.g., mobility summaries) builds trust. Privacy should be contextual and flexible, not absolute. Location and routine patterns are re-identifiable; adopt data minimization, on-device pre-processing where feasible, privacy budgets for sharing, and strict role-based access controls. Caregiver mediation complicates attribution and responsibility. As a result, document who installs, who monitors, and who acts on alerts. Finally, equity requires calibration across devices and geographies, participatory design with diverse older adults and caregivers, and transparent communication about what the data cannot say. Ethical guardrails are not optional; they determine scalability and legitimacy.

Conclusion

Passive sensing has clear value for longitudinal phenotyping and forecasting outcomes relevant for our aging population. Yet clinical readiness hinges on three unresolved areas: 1) interpretable features that clinicians and families can understand, 2) equitable performance across diverse hardware/operating systems and populations, and 3) governance that supports evolving consent and regulatory review. The short-term goal is not an AD diagnosis but rather support for aging in place by flagging changes, informing safety planning, and enriching trials. With transparent methods, shared dictionaries, and ethically grounded deployment, passive sensing can transition from promising pilots to tools that assist clinicians and patients at the population level.

Footnotes

Acknowledgment

We acknowledge the altruism of our participants and their families for their often decade-long follow-up in our research studies.

ORCID iDs

Ganesh M Babulal

Subrata Pal

Ethical approval

N/A

Contributorship

GMB designed the study. All co-authors contributed to interpreting the results and critically reviewed the manuscript.

Funding

This work was funded by the National Institutes of Health (NIH)/National Institute on Aging (NIA) grants R01AG089700 (GMB), R01AG068183 (GMB), R01AG067428 (GMB), and R13AG096982 (GMB).

Conflicting interests

All authors report no conflict of interest.

Guarantor

N/A

References

1. Livingston, Huntley, Liu, et al. Dementia prevention, intervention, and care: 2024 report of the Lancet standing Commission. Lancet 2024; 404: 572–628.

2. Alzheimer's Association. Alzheimer's disease facts and figures. Alzheimer's & Dementia 2025; 2025: 3708–3821.

3. Sperling, Aisen, Beckett, et al. Toward defining the preclinical stages of Alzheimer's disease: recommendations from the National Institute on Aging-Alzheimer's Association workgroups on diagnostic guidelines for Alzheimer's disease. Alzheimers Dement 2011; 7: 280–292.

4. Sperling, Karlawish, Johnson. Preclinical Alzheimer disease—the challenges ahead. Nature Reviews Neurology 2013; 9: 54–58.

5. Jack CR Jr, Bennett, Blennow, et al. NIA-AA research framework: toward a biological definition of Alzheimer's disease. Alzheimers Dement 2018; 14: 535–562.

6. Shaw, Arias, Blennow, et al. Appropriate use criteria for lumbar puncture and cerebrospinal fluid testing in the diagnosis of Alzheimer's disease. Alzheimers Dement 2018; 14: 1505–1521.

7. Fortea, García-Arcelay, Terrancle, et al. Attitudes of neurologists toward the use of biomarkers in the diagnosis of early Alzheimer's disease. J Alzheimer's Dis 2023; 93: 275–282.

8. van Maurik, Altomare, Collij, et al. Utility, costs and cost-utility of amyloid-PET in the diagnostic process of memory clinic patients: a trial-based economic evaluation from AMYPAD-DPMS. Eur J Neurol 2025; 32: 54–58.

9. Babulal. Inclusion of ethnoracial populations and diversity remains a key challenge in Alzheimer's disease biofluid-based biomarker studies. J Neurol Sci 2020; 421: 117269–117269.

10. Hoff, Kitsakos, Silva. A scoping review of the patient experience with wearable technology. Digital Health 2024; 10: 1–23.

11. Kourtis, Regele, Wright, et al. Digital biomarkers for Alzheimer's disease: the mobile/wearable devices opportunity. NPJ Digital Medicine 2019; 2: 1–9.

12. Han, Barnes, Leurgans, et al. Susceptibility to scams in older black and white adults. Front Psychol 2021; 12: 685258–685258.

13. Öhman, Hassenstab, Berron, et al. Current advances in digital cognitive assessment for preclinical Alzheimer's disease. Alzheimers Dement 2021; 13: 1–19.

14. Babulal, Johnson, Fagan, et al. Identifying preclinical Alzheimer's disease using everyday driving behavior: proof of concept. J Alzheimer's Dis 2021; 79: 1009–1014.

15. Blake, Brown, Chen, et al. A combined naturalistic driving, clinical, and neurobehavioral data set for investigating aging and dementia. Scientific Data 2025: 1–15.

16. Babulal, Traub, Webb, et al. Creating a driving profile for older adults using GPS devices and naturalistic driving methodology. F1000Res 2016: 1–18. DOI: 10.12688/f1000research.9608.1.

17. Hafezifar, Alizadeh, Dickerson, et al. The neighbourhood built environment affects driving behaviours of older adults: a combined geographic information systems and machine learning method. Cities & Health 1–16. In Press.

18. Long, Babulal, Bayat. Characterizing navigational changes in preclinical Alzheimer's disease: a route complexity metric derived from naturalistic driving data. IEEE J Transl Eng Health Med 2025: 471–479. DOI: 10.1109/JTEHM.2025.3619802.

19. Shacham, Brown, et al. Association of environmental exposome and cognitive function among older adults with and without preclinical Alzheimer's disease. Alzheimers Dement 2025; 21: 1–11.

20. Babulal, Chen, Murphy, et al. Predicting driving cessation among cognitively normal older drivers: the role of Alzheimer disease biomarkers and clinical assessments. Neurology 2024; 102: 1–9.

21. Carr, Beyene, Doherty, et al. Medication and road test performance among cognitively healthy older adults. JAMA Network Open 2023; 6: e2335651–e2335651.

22. Babulal, Chen, Carr, et al. Cortical atrophy and leukoaraiosis, imaging markers of cerebrovascular small vessel disease, are associated with driving behavior changes among cognitively normal older adults. J Neurol Sci 2023; 448: 1–7.

23. Wisch, Roe, Babulal, et al. Naturalistic driving measures of route selection associate with resting state networks in older adults. Sci Rep 2022; 12: 6486–6486.

24. Wisch, Kianfar, Carr, et al. Differential impacts of road diets on driving behavior among older adults with and without preclinical Alzheimer's pathology. Transportation Research Part F: Traffic Psychology and Behaviour 2023; 98: 18–28.

25. Carr, Babulal. Addressing the complex driving needs of an aging population. JAMA 2023; 330: 1187–1188.

26. Vivoda, Cao, Koumoutzis, et al. Planning for driving retirement: the effect of driving perceptions, driving events, and assessment of driving alternatives. Transportation Research Part F: Traffic Psychology and Behaviour 2021; 76: 193–201.

27. Stout, Babulal, Johnson, et al. Recruitment of African American and non-Hispanic white older adults for Alzheimer disease research via traditional and social media: a case study. J Cross Cult Gerontol 2020; 35: 329–339.

28. Babulal, Stout, Williams, et al. Differences in driving outcomes among cognitively normal African American and Caucasian older adults. J Racial Ethn Health Disparities 2020; 7: 269–280.

29. Babulal, Kolady, Stout, et al. A systematic review examining associations between cardiovascular conditions and driving outcomes among older drivers. Geriatrics 2020; 5: 1–10.

Operationalizing passive sensors into scalable, reproducible, neurobehavioral digital markers for Alzheimer's disease: Lessons learned over 10 years
