Sage Journals: Discover world-class research

Abstract

Surrogate safety measures (SSMs) have been used extensively in traffic safety studies for crash risk estimation. Most SSM-based studies employing extreme value theory (EVT) use the peak over threshold (POT) approach to detect anomalies or extreme events during safety-critical situations. This study investigated the efficacy of unsupervised machine learning (ML)-based anomaly detection methods as an extreme event sampling approach, compared with the conventional POT sampling approach, by developing bivariate EVT models for rear-end crash risk estimation on a freeway segment. Three widely used SSMs, namely time-to-collision (TTC), modified time-to-collision (MTTC), and deceleration rate to avoid crash (DRAC), were considered for the bivariate EVT modeling. Video data were collected from the selected segment of the I-40 expressway in Memphis, Tennessee. Among the three SSMs, the bivariate EVT models combining MTTC and DRAC provided the most accurate crash risk estimates (within the 99% confidence interval of the observed crashes) under all three sampling approaches: the traditional POT approach and the ML-based isolation forest (iForest) and one-class support vector machine (OCSVM) approaches. The ML-based OCSVM sampling method improved crash estimation accuracy by 21% over the POT and iForest sampling methods. These findings indicate that unsupervised ML anomaly detection can be an effective sampling approach that reduces the subjectivity of threshold selection in the POT sampling method. Safety improvement programs aim to maximize outcomes with limited resources, and an accurate estimation of the expected number of crashes helps engineers prioritize high-impact improvement locations.
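As a rough illustration of the sampling step described above, the following Python sketch contrasts POT sampling with iForest and OCSVM sampling on a pair of conflict indicators. Everything specific in it is an assumption made for illustration and not the study's setup: the data are synthetic, the 95th-percentile POT threshold and the 5% contamination and nu values are arbitrary, and the risk-side filter applied to the ML anomalies is a hypothetical safeguard against flagging unusually safe interactions. The bivariate EVT fit that would follow the sampling is not shown.

```python
import numpy as np
from sklearn.ensemble import IsolationForest
from sklearn.preprocessing import StandardScaler
from sklearn.svm import OneClassSVM

# Synthetic stand-in for car-following observations (NOT the study's data).
rng = np.random.default_rng(0)
n = 2000
severity = rng.gamma(shape=2.0, scale=1.0, size=n)  # hypothetical latent risk
# MTTC in seconds: small values are risky.
mttc = np.clip(6.0 / (1.0 + severity) + rng.normal(0.0, 0.2, n), 0.1, None)
# DRAC in m/s^2: large values are risky.
drac = np.clip(0.8 * severity + rng.normal(0.0, 0.3, n), 0.0, None)

# Negate MTTC so that larger values mean higher risk on both axes.
X = np.column_stack([-mttc, drac])

# 1) POT sampling: keep observations that exceed an analyst-chosen threshold
#    on at least one margin. Picking this threshold is the subjective step.
thresholds = np.quantile(X, 0.95, axis=0)
pot_mask = (X[:, 0] > thresholds[0]) | (X[:, 1] > thresholds[1])

# Standardize so both indicators contribute on a comparable scale.
Xs = StandardScaler().fit_transform(X)

# Assumed filter: only anomalies with above-average combined risk count as
# extremes, because anomaly detectors also flag unusually safe observations.
risk_side = Xs.sum(axis=1) > 0

# 2) Isolation forest sampling: anomalies are labeled -1 by fit_predict.
iforest = IsolationForest(n_estimators=100, contamination=0.05, random_state=0)
if_mask = (iforest.fit_predict(Xs) == -1) & risk_side

# 3) One-class SVM sampling: nu bounds the fraction of training outliers.
ocsvm = OneClassSVM(kernel="rbf", nu=0.05, gamma="scale")
oc_mask = (ocsvm.fit_predict(Xs) == -1) & risk_side

for name, mask in [("POT", pot_mask), ("iForest", if_mask), ("OCSVM", oc_mask)]:
    print(f"{name}: {int(mask.sum())} extreme observations sampled")
```

In this sketch the POT sample depends directly on the chosen quantile, whereas the two ML samplers learn a boundary from the joint distribution of the indicators; their own hyperparameters (contamination, nu, kernel width) still have to be chosen or tuned.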

Keywords

operations, surrogate safety measures, safety, safety performance and analysis, crash prediction models


Application of a Machine Learning–Based Sampling Method in Extreme Value Theory for Crash Risk Estimation of a Freeway Segment
