Abstract
Transformer-based policy networks have shown promise in Portfolio Selection (PS). However, constrained by high computational complexity, they focus mainly on short-term price series (e.g., the past 12 h) and overlook valuable long-term features. Moreover, the inherent volatility and noise of prices tend to make such models overfit to low-level information. To tackle these challenges, we propose an approach that leverages a pre-trained model to extract high-level patterns from extended price series (e.g., the past two weeks) and uses them to enhance decision-making in the policy network. The pre-trained model is built on discrete representations to achieve better generalization and interpretability: we tokenize the price series into discrete tokens with a Vector-Quantization Variational AutoEncoder (VQ-VAE) and train the model to reconstruct these tokens from the masked price series. We then introduce a compact encoder-only policy network, the Portfolio Transformer Encoder (PTE), which receives the high-level patterns from the pre-trained model to make more comprehensive decisions. We term the whole approach the Vector-Quantization Portfolio Transformer Encoder (VQ-PTE). VQ-PTE achieves superior performance on real-world currency and S&P 500 datasets, improving returns by at least 35%, and visualization results highlight its interpretability.
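To make the tokenization step concrete: the VQ step of a VQ-VAE maps each encoder latent to the id of its nearest codebook vector. The sketch below illustrates only this lookup; the codebook size, embedding dimension, window length, and the encoder producing the latents are placeholder assumptions, not the paper's actual configuration.

```python
import numpy as np

# Hypothetical sizes; the abstract does not specify the codebook
# size, embedding dimension, or encoder architecture.
CODEBOOK_SIZE = 512
EMBED_DIM = 64

rng = np.random.default_rng(0)
# In a real VQ-VAE the codebook is learned jointly with the encoder.
codebook = rng.normal(size=(CODEBOOK_SIZE, EMBED_DIM))

def tokenize(latents: np.ndarray) -> np.ndarray:
    """Map encoder latents of shape (T, EMBED_DIM) to discrete token
    ids of shape (T,) by nearest-neighbor lookup in the codebook."""
    # Squared L2 distance from each latent to every codebook vector.
    dists = ((latents[:, None, :] - codebook[None, :, :]) ** 2).sum(-1)
    return dists.argmin(axis=1)

# Stand-in latents for a two-week window, e.g. 24 h x 14 days = 336 steps.
latents = rng.normal(size=(336, EMBED_DIM))
tokens = tokenize(latents)
print(tokens[:10])  # discrete tokens the pre-trained model learns to reconstruct
```

In the masked pre-training described above, these token ids serve as reconstruction targets: spans of the input price series are masked, and the model predicts the corresponding tokens.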
