Dynamic Benchmarks: Spatial and Temporal Alignment for Automated Driving Systems Performance Evaluation

Abstract

Deployed SAE level 4 automated driving systems (ADS) without a human driver are currently operating in ride-hailing fleets on surface streets in the U. S. This current use case, and future applications of this technology, will determine where and when the fleets operate, potentially resulting in a divergence from the distribution of driving of the human benchmark population within a given locality. Existing benchmarks for evaluating ADS performance have only done county-level geographical matching of ADS and benchmark driving exposure in crash rates. This study presents a novel methodology for constructing dynamic human benchmarks that adjust for spatial and temporal variations in driving distribution between ADS and the overall human-driven fleet. Dynamic benchmarks were generated using human police-reported crash data, human vehicle miles traveled data, and over 20 million miles of operational data collected from an active ADS deployment across three U. S. counties. The spatial adjustment revealed significant differences across various severity levels in adjusted crash rates compared with unadjusted benchmarks with these differences ranging from 10% to 47% higher in San Francisco, 12% to 20% higher in Maricopa, and 7% lower to 34% higher in Los Angeles counties. The time-of-day adjustment in San Francisco, limited to this region because of data availability, resulted in adjusted crash rates 2% lower to 16% higher than unadjusted rates, depending on severity level. The findings underscore the importance of adjusting for spatial and temporal confounders in benchmarking analysis, which ultimately contributes to a more equitable benchmark for ADS performance evaluations.

Keywords

autonomous driving system dynamic human benchmarks safety impact

Get full access to this article

View all access options for this article.

References

Chen

J. J.

Shladover

S. E.

Initial Indications of Safety of Driverless Automated Driving Systems. Transportation Research Board 103rd Annual Meeting, 2024.

Di Lillo

Gode

Zhou

Atzei

Chen

Victor

Comparative Safety Performance of Autonomous-and Human Drivers: A Real-World Case Study of the Waymo One Service. arXiv Preprint arXiv:2309.01206, 2023.

Flannagan

Leslie

Kiefer

Bogard

Chi-Johnston

Freeman

Huang

Walsh

Joseph

Establishing a Crash Rate Benchmark Using Large-Scale Naturalistic Human Ridehail Data. Technical Report, UMTRI, 2023.

Kusano

K. D.

Scanlon

J. M.

Chen

Y. H.

McMurry

T. L.

Chen

Gode

Victor

Comparison of Waymo Rider-Only Crash Data to Human Benchmarks at 7.1 Million Miles. Traffic Injury Prevention, Vol. 25, No. sup1, 2024, pp. S66–S77.

Scanlon

Kusano

K. D.

Fraade-Blanar

L. A.

McMurry

T. L.

Chen

Y. H.

Victor

Benchmarks for Retrospective Automated Driving System Crash Rate Analysis Using Police-Reported Crash Data. Traffic Injury Prevention, Vol. 25, No. sup1, 2024, pp. S51–S65.

Victor

Kusano

Gode

Chen

Schwall

Safety Performance of the Waymo Rider-Only Automated Driving System at One Million Miles. Technical Report. 2023. https://waymo.com/research/safety-performance-of-the-waymo-rider-only-automat/. Accessed December 1, 2025.

Schwall

Daniel

Victor

Favaro

Hohnhold

Waymo Public Road Safety Performance Data. arXiv Preprint arXiv:2011.00038, 2020.

Nees

M. A.

Safer than the Average Human Driver (who is Less Safe than Me)? Examining a Popular Safety Benchmark for Self-Driving Cars. Journal of Safety Research, Vol. 69, 2019, pp. 61–68.

Favaro

F.M.

Victor

Hohnhold

Schnelle

Interpreting Safety Outcomes: Waymo’s Performance Evaluation in the Context of a Broader Determination of Safety Readiness. arXiv Preprint arXiv:2306.14923, 2023.

10.

Scanlon

Teoh

E. R.

Kidd

D. G.

Kusano

K. D.

Bärgman

Chi-Johnston

Di Lillo

, et al. RAVE Checklist: Recommendations for Overcoming Challenges in Retrospective Safety Studies of Automated Driving Systems. arXiv Preprint arXiv:2408.07758, 2024.

11.

Blanco

Atwood

Russell

S. M.

Trimble

T. E.

McClafferty

J. A.

Perez

M. A.

Automated Vehicle Crash Rate Comparison Using Naturalistic Data. Virginia Tech Transportation Institute, 2016.

12.

Teoh

E. R.

Kidd

D. G.

Rage against the Machine? Google’s Self-Driving Cars versus Human Drivers. Journal of Safety Research, Vol. 63, 2017, pp. 57–60.

13.

Goodall

N. J.

Comparison of Automated Vehicle Struck-from-Behind Crash Rates with National Rates Using Naturalistic Data. Accident Analysis & Prevention, Vol. 154, 2021, p. 106056.

14.

Qin

Ivan

J. N.

Ravishanker

Selecting Exposure Measures in Crash Rate Prediction for Two-Lane Highway Segments. Accident Analysis & Prevention, Vol. 36, No. 2, 2004, pp. 183–191.

15.

Martin

J-L.

Relationship between Crash Rate and Hourly Traffic Flow on Interurban Motorways. Accident Analysis & Prevention, Vol. 34, No. 5, 2002, pp. 619–629.

16.

Rice

T. M.

Peek-Asa

Kraus

J. F.

Nighttime Driving, Passenger Transport, and Injury Crash Rates of Young Drivers. Injury Prevention, Vol. 9, No. 3, 2003, pp. 245–250.

17.

Arévalo-Támara

Orozco-Fontalvo

Cantillo

Factors Influencing Crash Frequency on Colombian Rural Roads. Promet-Traffic & Transportation, Vol. 32, No. 4, 2020, pp. 449–460.

18.

Kopelias

Papadimitriou

Papandreou

Prevedouros

Urban Freeway Crash Analysis: Geometric, Operational, and Weather Effects on Crash Number and Severity. Transportation Research Record: Journal of the Transportation Research Board, 2007. 2015: 123–131.

19.

Sharmin

Ivan

J. N.

Zhao

Wang

Hossain

M. J.

Ravishanker

Jackson

Incorporating Demographic Proportions into Crash Count Models by Quasi-Induced Exposure Method. Transportation Research Record: Journal of the Transportation Research Board, 2020. 2674: 548–560.

20.

Arizona Department of Transportation. 2022 Motor Vehicle Crash Facts for the State of Arizona. Phoenix, Arizona: The Arizona Department of Transportation, 2023.

21.

Arizona Department of Transportation. Extent and Travel Dashboard, 2023. https://experience.arcgis.com/experience/ac0948fc05224aa8a80313f59a634fde. Accessed July 22, 2024.

22.

Arizona Department of Transportation. Records Center, 2023. https://azdot.govqa.us/WEBAPP/_rs/supporthome.aspx. Accessed July 22, 2024.

23.

California Highway Patrol. SWITRS 2019 Report: Annual Report of Fatal and Injury Motor Vehicle Traffic Collisions (Preface). California Highway Patrol, 2021.

24.

San Francisco Department of Public Health-Program on Health, Equity and Sustainability. Vision Zero High Injury Network: 2017 Update – A Methodology for San Francisco, California. Technical Report, 2017.

25.

Federal Highway Administration. U.S. Department of Transportation Feature Server, 2022. https://geo.dot.gov/server/rest/services/Hosted/. Accessed July 22, 2024.

26.

Federal Highway Administration. Highway Performance Monitoring System Field Manual, 2018. https://www.fhwa.dot.gov/policyinformation/hpms/fieldmanual/page01.cfm. Accessed July 22, 2024.

27.

Federal Highway Administration. Highway Performance Monitoring System: State Practices, 2014. https://www.fhwa.dot.gov/policyinformation/hpms/statepractices.cfm. Accessed April 14 2025.

28.

Federal Highway Administration. Highway Statistics, 2021. https://www.fhwa.dot.gov/policyinformation/statistics/2021/. Accessed July 22, 2024.

29.

San Francisco County Transportation Authority. TNCs and Congestion Report, 2018. https://www.sfcta.org/sites/default/files/2019-05/TNCs_Congestion_Report_181015_Finals.pdf. Accessed December 1, 2025.

30.

S2 Geometry. S2 Cells Developer’s Guide. S2 Geometry, 2017. http://s2geometry.io/devguide/s2cell_hierarchy.html. Accessed December 1, 2025.

31.

Chamandy

, et al. Estimating Uncertainty for Massive Data Streams. Google, 2012. https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/43157.pdf. Accessed December 1, 2025.

32.

Blincoe

Miller

Wang

J.-S.

Swedler

Coughlin

Lawrence

Guo

Klauer

Dingus

The Economic and Societal Impact of Motor Vehicle Crashes, 2019 (Revised). Technical Report, DOT HS 813 403, National Highway Traffic Safety Administration, 2023.

33.

Theofilatos

Yannis

A Review of the Effect of Traffic and Weather Characteristics on Road Safety. Accident Analysis & Prevention, Vol. 72, 2014, pp. 244–256.

34.

Edwards

J. B.

The Relationship between Road Accident Severity and Recorded Weather. Journal of Safety Research, Vol. 29, No. 4, 1998, pp. 249–262.

35.

Doherty

S. T.

Andrey

J. C.

MacGregor

The Situational Risks of Young Drivers: The Influence of Passengers, Time of Day and Day of Week on Accident Rates. Accident Analysis & Prevention, Vol. 30, No. 1, 1998, pp. 45–52.

36.

Pawlovich

M. D.

Carriquiry

Welch

Iowa’s Experience With Road Diet Measures: Use of Bayesian Approach to Assess Impacts on Crash Frequencies and Crash Rates. Transportation Research Record: Journal of the Transportation Research Board, 2006. 1953: 163–171.