Abstract

Pedestrian trajectory prediction plays a pivotal role in real-world applications such as autonomous driving, unmanned delivery, and intelligent surveillance. However, existing deep learning approaches still face critical challenges, including mode collapse and the generation of unrealistic trajectories in complex environments. To address these limitations, we propose the Phase Fusion Network (PFNet), a novel trajectory prediction framework designed to enhance prediction accuracy in intricate digital media scenarios. PFNet introduces a Graph Encoder (GE) that incorporates a probabilistic modeling strategy to better capture spatial features and pedestrian interactions. To mitigate mode collapse, a common limitation of GAN-based methods, PFNet employs a dual-discriminator mechanism that improves both the realism and the diversity of predicted trajectories. Additionally, PFNet adopts a two-phase architecture in which the generation phase strengthens spatial representation and the prediction phase refines temporal consistency. Extensive experiments on standard benchmarks, including ETH, UCY, and the Stanford Drone Dataset, demonstrate that PFNet consistently outperforms state-of-the-art methods on both Average Displacement Error (ADE) and Final Displacement Error (FDE).
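The two metrics named in the abstract can be illustrated with a minimal sketch. It assumes the commonly used definitions: ADE is the mean Euclidean distance between predicted and ground-truth positions over all predicted time steps, and FDE is that distance at the final time step. The function name `displacement_errors` and the toy coordinates are hypothetical, and the abstract does not state the exact evaluation protocol used for PFNet (for example, whether the best of several sampled trajectories is scored), so this is not a description of the authors' evaluation code.

```python
import math


def displacement_errors(predicted, ground_truth):
    """Return (ADE, FDE) for a single pedestrian's trajectory.

    Both arguments are equal-length sequences of (x, y) positions,
    one entry per predicted time step. Hypothetical helper for illustration.
    """
    if len(predicted) != len(ground_truth) or not predicted:
        raise ValueError("trajectories must be non-empty and equal in length")
    # Euclidean distance between prediction and ground truth at each step.
    distances = [math.dist(p, g) for p, g in zip(predicted, ground_truth)]
    ade = sum(distances) / len(distances)  # mean error over all time steps
    fde = distances[-1]                    # error at the final time step
    return ade, fde


# Toy data (assumed, not from the paper): the prediction stays on the x-axis
# while the true path drifts upward by one unit per step.
predicted = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
ground_truth = [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0)]
ade, fde = displacement_errors(predicted, ground_truth)
```

Here the per-step errors are 0, 1, and 2, so ADE is their mean and FDE is the last value. Lower is better for both metrics.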

Keywords

Pedestrian trajectory prediction, phase fusion network, GAN, graph encoder, dual-discriminator

PFNet: A Phase Fusion Network for Pedestrian Trajectory Prediction