Abstract
With the rapid development of autonomous driving technology, pedestrian trajectory prediction from a first-person perspective still suffers from insufficient long-term prediction accuracy and incomplete interaction modeling. To address these problems, this paper proposes the Fusion-Hierarchical Graph Trajectory Network (FHGT-Net). The network first uses a Multi-Layer Perceptron (MLP) to generate prior future trajectories, which are concatenated with the historical trajectories to form fused features. These fused features are then processed by an encoder-decoder structure. The encoder, composed of local sub-graphs and a global graph, extracts features from the fused data: the local sub-graphs process each pedestrian's trajectory through stacked MLP layers with layer normalization and Rectified Linear Unit (ReLU) activations, while the global graph models interactions among pedestrians via a multi-head attention mechanism. Finally, a decoder based on a Long Short-Term Memory (LSTM) network iteratively decodes the encoder output into the most likely future trajectory for each pedestrian. Experimental results show that, on the PIE (Pedestrian Intention Estimation) dataset, the model reduces the Average Displacement Error (ADE) by 9.9% and the Final Displacement Error (FDE) by 7.5% relative to the state of the art, significantly improving accuracy. The model also performs well in real-vehicle experiments. These results provide a reliable solution for first-person pedestrian trajectory prediction, and are of particular significance for prediction in multi-pedestrian interaction scenarios.
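The pipeline described above can be illustrated with a minimal NumPy sketch of the data flow. All dimensions, weight initializations, and variable names here are assumptions for illustration only; a single attention head stands in for the paper's multi-head attention, and a plain tanh recurrence stands in for the LSTM decoder.

```python
import numpy as np

rng = np.random.default_rng(0)

def mlp(x, w1, b1, w2, b2):
    # Two-layer MLP with ReLU, loosely mirroring the sub-graph blocks.
    h = np.maximum(x @ w1 + b1, 0.0)
    return h @ w2 + b2

# Assumed shapes: N pedestrians, T_obs observed steps, T_pred future steps,
# 2-D (x, y) positions per step.
N, T_obs, T_pred, D = 4, 15, 45, 2
history = rng.normal(size=(N, T_obs * D))          # flattened past trajectories

# 1) MLP generates a prior future trajectory per pedestrian.
w1 = rng.normal(size=(T_obs * D, 64)); b1 = np.zeros(64)
w2 = rng.normal(size=(64, T_pred * D)); b2 = np.zeros(T_pred * D)
prior = mlp(history, w1, b1, w2, b2)               # (N, T_pred * D)

# 2) Concatenate history and prior into fused features.
fused = np.concatenate([history, prior], axis=1)   # (N, (T_obs + T_pred) * D)

# 3) Local sub-graph: per-pedestrian MLP followed by layer normalization.
w3 = rng.normal(size=(fused.shape[1], 64)); b3 = np.zeros(64)
w4 = rng.normal(size=(64, 64)); b4 = np.zeros(64)
local = mlp(fused, w3, b3, w4, b4)
local = (local - local.mean(-1, keepdims=True)) / (local.std(-1, keepdims=True) + 1e-6)

# 4) Global graph: attention across pedestrians (one head here for brevity;
#    the paper uses multi-head attention).
wq, wk, wv = (rng.normal(size=(64, 64)) for _ in range(3))
q, k, v = local @ wq, local @ wk, local @ wv
scores = q @ k.T / np.sqrt(64)
attn = np.exp(scores - scores.max(-1, keepdims=True))
attn /= attn.sum(-1, keepdims=True)
global_feat = attn @ v                             # (N, 64) interaction-aware features

# 5) Decoder: iteratively roll out future positions (a simple tanh recurrence
#    stands in for the paper's LSTM decoder).
w_h = rng.normal(size=(64, 64)) * 0.1
w_out = rng.normal(size=(64, D)) * 0.1
h = global_feat
future = []
for _ in range(T_pred):
    h = np.tanh(h @ w_h)
    future.append(h @ w_out)
future = np.stack(future, axis=1)                  # (N, T_pred, 2) predicted trajectories
print(future.shape)
```

The sketch shows only shapes and data flow: random weights replace trained parameters, so the outputs are not meaningful predictions.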
Understanding how people move on the road is critical for self-driving cars to make safe decisions. This study introduces a new artificial intelligence model called FHGT-Net, designed to predict the walking paths of pedestrians as seen from a car's front camera. Traditional models often struggle to make accurate long-term predictions and to correctly capture how multiple pedestrians influence each other's movements. FHGT-Net addresses these problems through a new fusion-hierarchical design. It first combines the car's observations of past pedestrian movements with early guesses of where they might go next. It then uses a graph network to model both individual movement patterns and group interactions among pedestrians. Finally, a recurrent neural network predicts where each pedestrian is most likely to move in the next few seconds. When tested on a public dataset called PIE (Pedestrian Intention Estimation), FHGT-Net outperformed existing methods, reducing prediction errors by about 10%. It also showed strong performance in real driving tests. In simple terms, this model helps self-driving systems “read” pedestrian behavior more accurately and respond safely in busy environments. The findings bring autonomous driving one step closer to understanding human motion in real-world traffic.
