Abstract
Massive multiple-input multiple-output (MIMO) systems are central to next-generation wireless communications because they promise high spectral efficiency, improved signal quality, and power savings. However, optimal beamforming design for such systems is extremely compute-intensive, particularly when channel conditions vary and the channel state information (CSI) is imperfect. This paper presents a reinforcement learning (RL) approach to beamforming that sidesteps these limitations through an adaptive, data-driven policy-learning paradigm for efficient transmission. The Q-learning-based RL algorithm was trained for 500 episodes and converged steadily, with the training loss decreasing from 0.87 to 0.04 and the validation loss from 0.90 to 0.05. Key performance figures were a mean signal-to-interference-plus-noise ratio (SINR) of 22.9 dB, a spectral efficiency of 14.8 bits/s/Hz, and a bit error rate (BER) of 3.1 × 10⁻⁴ at 15 dB signal-to-noise ratio (SNR). The achieved SINR improved on maximum ratio transmission (MRT) and zero-forcing (ZF) by 50% and 22.5%, respectively, while consuming 9.6 W. Robustness tests with 5% CSI error and 100 Hz Doppler fading showed only minimal performance degradation. These results indicate that RL-based beamforming is a promising approach for real-time systems and for future adaptive, scalable MIMO deployments. This work can be extended to multi-user and multi-cell settings for broader applicability.
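The abstract does not detail the Q-learning formulation, so the following is only a minimal illustrative sketch of tabular Q-learning applied to beam selection. The codebook size, number of quantized channel states, step count per episode, and the SINR-as-reward model are all assumptions introduced here for illustration; only the 500-episode budget comes from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

N_BEAMS = 8          # hypothetical beamforming codebook size (assumption)
N_STATES = 4         # hypothetical quantized channel states (assumption)
EPISODES = 500       # episode budget reported in the abstract
STEPS = 50           # steps per episode (assumption)
ALPHA, GAMMA, EPS = 0.1, 0.9, 0.1  # learning rate, discount, exploration

# Hypothetical mean SINR (dB) per (state, beam) pair; hidden from the agent.
true_sinr = rng.uniform(5.0, 25.0, size=(N_STATES, N_BEAMS))

Q = np.zeros((N_STATES, N_BEAMS))

def step(state, beam):
    """Environment stub: noisy SINR observation as reward, random next state."""
    reward = true_sinr[state, beam] + rng.normal(0.0, 1.0)
    return reward, int(rng.integers(N_STATES))

state = int(rng.integers(N_STATES))
for _ in range(EPISODES):
    for _ in range(STEPS):
        if rng.random() < EPS:
            beam = int(rng.integers(N_BEAMS))   # explore
        else:
            beam = int(np.argmax(Q[state]))     # exploit
        reward, nxt = step(state, beam)
        # Standard Q-learning temporal-difference update
        Q[state, beam] += ALPHA * (reward + GAMMA * Q[nxt].max() - Q[state, beam])
        state = nxt

# Greedy beam choice per channel state after training
learned = Q.argmax(axis=1)
```

Under these toy assumptions, the greedy policy recovered from the Q-table selects beams whose true SINR is close to the per-state optimum; a real system would replace the environment stub with measured SINR feedback.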
