A Simple Yet Efficient K-Nearest Neighbor-Based Method for High-Resolution Traffic Time–Space Diagram Imputation

Abstract

A time–space diagram (TSD) is an efficient tool for traffic analysis and visualization, representing the macroscopic traffic state as a set of cells. However, its application is often hampered by data sparsity, which obscures high-resolution traffic dynamics. This study proposes a modified K-nearest neighbors method, characterized by an adaptive iterative process, to impute missing TSD data. To support the method’s design, analytical bounds on error propagation motivated by Green’s function-based theory are established, and a practical empirical formula for the optimal K parameter is derived. The framework’s performance was rigorously validated on diverse data sets from China (Ubiquitous Traffic Eyes), the US (Next Generation Simulation), and Germany (HighD) across 30 distinct experimental conditions. Compared against four baseline models, the proposed model demonstrates a compelling balance between high imputation accuracy and exceptional computational efficiency. Further analyses confirm the influence of neighborhood order and the systematic performance bias. The model’s potential for knowledge transfer is also demonstrated via a cross-data set imputation scheme.

Keywords

traffic time–space diagram; K-nearest neighbors; traffic data imputation; traffic dynamics; traffic state analysis



