Enhancing Taxi Demand Prediction with Limited Data using a Spatial-Temporal Large Language Model

Abstract

Taxi demand prediction is essential for intelligent transportation systems. Accurate prediction results help address the issue of supply–demand imbalances and enable more efficient traffic management. Significant advances have been made in traffic demand prediction, particularly through the use of deep learning models. However, these models heavily rely on a large amount of data. Data scarcity remains a significant challenge because of high acquisition and storage costs, as well as data sparsity in certain locations and times. Thus, this study proposes a novel taxi demand prediction model that leverages the large language model GPT-2 to capture complex spatio-temporal dependencies. By integrating spatial correlations through a graph attention network and incorporating temporal dependencies at multiple scales, the proposed spatio-temporal taxi demand prediction large model (STTDP-LM) is capable of achieving accurate prediction with limited training data. Extensive experiments validate its effectiveness across two districts in Xi’an. Compared to the baseline method, the STTDP-LM reduces the root mean square error (RMSE), mean absolute percentage error (MAPE), and mean absolute error (MAE) by an average of 12.25%, 12.55%, and 18.33%, respectively, across the two districts. When trained with only 1% of the data, the model still shows significant improvement, with average reductions of 33.83%, 34.12%, and 17.03% in the RMSE, MAE, and MAPE, respectively. The prediction accuracy of the model is more prominent in multi-step prediction with a total duration of 60 min. In summary, this study offers a promising solution for taxi demand prediction with limited historical data, providing a valuable insight for real-world applications in intelligent transportation systems.

Keywords

taxi demand prediction spatio-temporal modeling large language models graph attention networks intelligent transportation

Get full access to this article

View all access options for this article.

References

Liu

Qiu

Wang

Ouyang

Lin

Contextualized Spatial–Temporal Network for Taxi Origin-Destination Demand Prediction. IEEE Transactions on Intelligent Transportation Systems, Vol. 20, No. 10, 2019, pp. 3875–3887.

Yao

Tang

Jia

Gong

Li.

Deep Multi-View Spatial-Temporal Network for Taxi Demand Prediction. Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32, No. 316, 2018, pp. 2588–2595.

Zhang

Zhu

Wang

F.-Y.

MLRNN: Taxi Demand Prediction Based on Multi-Level Deep Learning and Regional Heterogeneity Analysis. IEEE Transactions on Intelligent Transportation Systems, Vol. 23, No. 7, 2021, pp. 8412–8422.

Saxena

Cao

Multimodal Spatio-Temporal Prediction with Stochastic Adversarial Networks. ACM Transactions on Intelligent Systems and Technology (TIST), Vol. 13, No. 2, 2022, pp. 1–23.

Ren

Chen

Liu

Wang

Cui

TPLLM: A Traffic Prediction Framework Based on Pretrained Large Language Models. arXiv Preprint arXiv:2403.02221, 2024.

Xia

Tang

Shi

Xia

Yin

Huang

UrbanGPT: Spatio-Temporal Large Language Models. Proc., 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Barcelona, Spain, Association for Computing Machinery, New York, 2024, pp. 5351–5362.

Huang

Enhancing Traffic Prediction with Textual Data Using Large Language Models. arXiv Preprint arXiv:2405.06719, 2024.

Guo

Zhang

Jiang

Peng

Zhu

Yang

H. F.

Towards Explainable Traffic Flow Prediction with Large Language Models. Communications in Transportation Research, Vol. 4, 2024, p. 100150.

Moreira-Matias

Gama

Ferreira

Mendes-Moreira

Damas

Predicting Taxi–Passenger Demand Using Streaming Data. IEEE Transactions on Intelligent Transportation Systems, Vol. 14, No. 3, 2013, pp. 1393–1402.

10.

Kim

Sharda

Zhou

Pendyala

R. M.

A Stepwise Interpretable Machine Learning Framework Using Linear Regression (LR) and Long Short-Term Memory (LSTM): City-Wide Demand-Side Prediction of Yellow Taxi and For-Hire Vehicle (FHV) Service. Transportation Research Part C: Emerging Technologies, Vol. 120, 2020, p. 102786.

11.

Zhang

Short-Term Traffic Flow Prediction Based on Incremental Support Vector Regression. Proc., Third International Conference on Natural Computation (ICNC 2007), Vol. 1, IEEE, New York, 2007, pp. 640–645.

12.

Castro-Neto

Jeong

Y.-S.

Jeong

M.-K.

Han

L. D.

Online-SVR for Short-Term Traffic Flow Prediction Under Typical and Atypical Traffic Conditions. Expert Systems with Applications, Vol. 36, No. 3, 2009, pp. 6164–6173.

13.

Alvarez-Garcia

J. A.

Ortega

J. A.

Gonzalez-Abril

Velasco

Trip Destination Prediction Based on Past GPS Log Using a Hidden Markov Model. Expert Systems with Applications, Vol. 37, No. 12, 2010, pp. 8166–8171.

14.

Zhe

Taxi Demand Prediction Model Based on Spark and Improved BP Neural Network. Frontiers of Data and Domputing, Vol. 5, No. 4, 2023, pp. 112–126.

15.

Rahmatizadeh

Bölöni

Turgut

Real-Time Prediction of Taxi Demand Using Recurrent Neural Networks. IEEE Transactions on Intelligent Transportation Systems, Vol. 19, No. 8, 2017, pp. 2572–2581.

16.

Fathi

Balali

A Ride-Hailing Company Supply Demand Prediction Using Recurrent Neural Networks, GRU and LSTM. Proc., Science and Information Conference, Springer, Cham, 2024, pp. 123–133.

17.

Zhou

Chen

A Spatiotemporal Attention Mechanism-Based Model for Multi-Step Citywide Passenger Demand Prediction. Information Sciences, Vol. 513, 2020, pp. 372–385.

18.

Fang

Liu

Efficient Multi-Step Prediction Model That Considers the Influence of Spatial and Temporal Factors on Ride-Hailing Demand. Transportation Research Record: Journal of the Transportation Research Board, 2025. 2679: 03611981241287192.

19.

Wang

Xie

Zhao

Quick Taxi Route Assignment via Real-Time Intersection State Prediction with a Spatial-Temporal Graph Neural Network. Transportation Research Part C: Emerging Technologies, Vol. 158, 2024, p. 104414.

20.

Liu

Yang

Long

Zhao

Spatial-Temporal Large Language Model for Traffic Prediction. arXiv Preprint arXiv:2401.10134, 2024.

21.

Rong

Mao

Chen

Large-Scale Traffic Flow Forecast with Lightweight LLM in Edge Intelligence. IEEE Internet of Things Magazine. https://doi.org/10.1109/IOTM.001.2400047

22.

de Zarzà i Cubero

de Curtò i Díaz

Roig

Calafate

C. T.

LLM Multimodal Traffic Accident Forecasting. https://doi.org/10.3390/s23229225

23.

Peng

Guo

Chen

Zhu

Chen

Wang

, et al. LC-LLM: Explainable Lane-Change Intention and Trajectory Predictions with Large Language Models. arXiv Preprint arXiv:2403.18344, 2025.

24.

Eddy

S. R.

Hidden Markov Models. Current Opinion in Structural Biology, Vol. 6, No. 3, 1996, pp. 361–365.

25.

Zhang

Wang

Shan

Zhou

Wang

CMT-Net: A Mutual Transition Aware Framework for Taxicab Pick-Ups and Drop-Offs Co-prediction. Proc., Fifteenth ACM International Conference on Web Search and Data Mining, Association for Computing Machinery, New York, 2022, pp. 1406–1414.

26.

Goto

Matsumoto

Rizk

Yanai

Yamaguchi

Privacy-Preserving Taxi-Demand Prediction Using Federated Learning. Proc., 2023 IEEE International Conference on Smart Computing (SMARTCOMP), IEEE, New York, 2023, pp. 297–302.