Abstract
Multi-zone sound field reproduction aims to create personal sound zones within distinct spatial regions of a shared environment using a loudspeaker array. This paper proposes a wave-domain acoustic energy difference maximization (WDAEDM) method for reproducing multi-zone sound field. The proposed method first represents sound fields and transfer functions with spatial harmonic expansion, then constructs an acoustic energy difference maximization (AEDM) model in the wave domain using sound field coefficients and their relationship with loudspeaker weights, and finally solves this model through eigenvalue decomposition. Simulation results of free-field and automotive cabin acoustic environments have demonstrated that the proposed WDAEDM method presents a more uniform energy distribution within zones compared with the existing spatial-domain methods, and also eliminates the parameter selection issues encountered in existing wave-domain methods.
Introduction
Multi-zone sound field reproduction utilizes a loudspeaker array to deliver well-isolated sound zones in different regions of a shared space,1–5 which has broad application prospects in smart car cabins, smart mobile devices, and advanced audio-visual systems. The essence of this technology is to design loudspeaker weights. With the effects of loudspeaker weights and transfer functions, bright zones and dark zones are formed in the space. In bright zones, the high-energy signal is reproduced, while in dark zones, the signal is attenuated as much as possible.
The existing reproduction methods are classified into the energy-based methods and the matching methods. The representatives are acoustic contrast control (ACC)6–8 and pressure matching (PM),9–11 respectively. ACC obtains the loudspeaker weights by maximizing the ratio of the energy between bright zones and dark zones, that is, maximizing acoustic contrast (AC). PM achieves weight estimation by minimizing the difference between the reproduced and target signal. ACC method shows higher AC but involves matrix inversion, which leads to high sensitivity to noise when the inverted matrix is severely ill-conditioned.12–16 To mitigate the effects caused by the inversion of ill-conditioned matrices, regularization methods have been adopted.17,18 It is found that ACC is highly sensitive to the regularization parameter, and the optimal regularization parameter varies significantly across different frequencies, system configurations, and noise conditions. Therefore, it is a challenging task to select an appropriate regularization parameter. Shin et al. proposed the acoustic energy difference maximization (AEDM) method, which avoids matrix inversion and thereby circumvents regularization. 19 Unlike ACC, AEDM obtains weights by maximizing the energy difference between bright and dark zones and also provides sufficiently high AC.
Both ACC and AEDM are based on discrete spatial point control. However, the discreteness and the limited number of control points lead to performance degradation away from these points, resulting in uneven energy distribution within bright and dark zones. To overcome this limitation, Han et al. proposed wave-domain acoustic contrast control (WDACC),20,21 which represents sound fields through spatial harmonic expansion and uses sound field coefficients to establish the ACC model in the wave domain. Recently, Wen et al. 22 also exploited harmonic expansion to formulate a wave-domain multi-objective optimization problem to achieve a tradeoff between AC, signal distortion and array effort. Compared with spatial-domain methods, these wave-domain methods present a more uniform energy distribution within zones. However, both methods require regularization parameters and/or Lagrange multipliers, whose values strongly depend on frequency and substantially affect performance, thereby making their determination time-consuming and labor-intensive.
Inspired by the wave-domain methods and AEDM, this paper proposes wave-domain acoustic energy difference maximization (WDAEDM). The proposed method first performs spatial harmonic expansion on the sound field and transfer functions with cylindrical harmonic functions. It then uses sound field coefficients and their relationship with loudspeaker weights to construct a model that maximizes the acoustic energy difference between bright zones and dark zones, and finally applies eigenvalue decomposition to obtain the solutions. By incorporating harmonic expansion and the AEDM concept into the model formulation, the proposed method inherits the advantages of both wave-domain approaches and AEDM, namely, achieving a more uniform energy distribution within zones while avoiding the need to select regularization parameters and Lagrange multipliers. In this paper, the environment is assumed to be stable. The paper is organized as follows: In Theory, the proposed WDAEDM method is elaborated. In Simulation, the performance is evaluated. The conclusions are summerized in the last section.
Theory
Model construction
Figure 1 shows a multi-zone sound field reproduction system. A circular loudspeaker array with a radius of Multi-zone sound field reproduction system.
The weight of the
Acoustic energy difference maximization
AEDM method estimates the loudspeaker weights by maximizing the energy difference between the bright zone and the dark zone. The energy within the bright zone and the dark zone is
To obtain
Obviously, the extremum of
Wave-domain acoustic energy difference maximization
AEDM achieves multi-zone sound reproduction based on discrete spatial control points. However, due to the discreteness and the limited number of these points, the performance of AEDM degrades in the regions away from the control points, resulting in uneven energy distribution within the zones. To overcome this limitation, spatial harmonic expansion is introduced into AEDM, leading to WDAEDM.
As shown in Figure 1, there is no source within the zones. Therefore, the sound pressure at any point within these zones can be expanded with respect to the zone center
Previous studies16,22–24 have shown that, when n exceeds
Similarly, the extremum of
Simulations
Free-field simulation
This subsection examines the performance of the proposed method based on the free-field simulation. This simulation is performed in MATLAB 2022. Figure 2 shows the layout of a multi-zone sound reproduction system. A circular array with 30 loudspeakers and a radius of 2 m is placed at the origin. Inside the circular array, two circular zones with a radius of 0.5 m are defined as the bright zone and the dark zone. The centers of the two zones are located at (0.6 m, 0.5 m) and (−0.5 m, −0.7 m), respectively. 22 control points are evenly distributed along the circumferences of the bright zone and the dark zone. Multiple evaluation points are uniformly distributed within two zones, with a spacing of 0.01 m between the adjacent evaluation points. The layout of the multi-zone sound reproduction system used in the simulation. (▼ loudspeakers,● control points, █ evaluation points).
Evaluation metrices
We use the pressure at the evaluation points to calculate acoustic contrast (AC) and audible gain (AG). AC is defined as the ratio of the energy between the bright zone and the dark zone,
A high AC indicates strong isolation between the bright zone and the dark zone, while a high AG reflects efficient sound energy radiation by the loudspeaker array. The larger the two metrices, the better the performance.
The influence of the tuning factor
According to equation (14), the tuning factor AC and AG under different tuning factors and frequencies.
As shown in Figure 3(a), when the frequency is fixed, AC increases with
Performance comparison
This subsection compares the performance of the AEDM,
19
WDACC,20,21 WDAEDM, and Lagrange method (LM).
22
Multi-zone sound field reproduction is performed at 200 Hz and 1200 Hz. For WDAEDM and AEDM, The energy distribution maps at 200 Hz and 1200 Hz.
In this part, we further compare the performance of WDACC and WDAEDM. The frequency ranges from 20 Hz to 1200 Hz. For WDAEDM,
Figure 5 shows AC, AG, and sound pressure levels (SPLs) in the dark zone of each method. It is interesting to find that: (1) WDAEDM and WDACC (with the optimal regularization parameter) show comparable AC, AG, and SPL in the dark zone. However, it is worth noting that the optimal regularization parameter for WDACC varies significantly across frequencies, as shown in Figure 6. To achieve an optimal AC at each frequency, WDACC requires substantial manual effort and time to determine the optimal parameter frequency-by-frequency. In contrast, WDAEDM avoids this issue, as a fixed tuning factor can yield performance equivalent to WDACC across most frequencies, which is a key advantage of WDAEDM. (2) The WDACC method using the SV-based regularization fails to effectively suppress energy in the dark zone at several frequencies, resulting in SPLs exceeding 50 dB, which further leads to a marked decline in AC at those frequencies. The reason is that the empirically chosen regularization parameter does not reliably mitigate the impact of ill-conditioned matrix inversion on the performance. Therefore, WDAEDM has better performance than WDACC. AC, AG, and SPL in dark zone versus frequency. The optimal regularization parameter of WDACC versus frequency.

To summarize, compared with the other three methods, the proposed WDAEDM achieves higher AC and AG, resolves the issue of uneven energy distribution, and avoid spending extensive effort and time on regularization. Overall, WDAEDM delivers the best performance among all approaches discussed above.
Finite element-based simulation
This subsection conducts finite element-based simulations to evaluate the performance of the proposed method in a car cabin acoustic environment. The acoustic model of the car cabin, as shown in Figure 7, is built on the COMSOL Multiphysics platform. The car windows, dashboards, and doors are modeled with constant absorption coefficients ( The finite element acoustic model of the car cabin.
The comparison of performance metrics.
(Note. “Control points” refer to results calculated based on the sound pressure at control points, while “Evaluation points” refer to results calculated based on the sound pressure at evaluation points).

The energy distribution maps of the reproduced sound field in the car cabin.
Conclusion
This paper proposes the WDAEDM method for multi-zone sound field reproduction based on spatial harmonic expansion and acoustic energy difference maximization. Its performance is examined through the free-field and the car cabin acoustic environment simulations. Results show that compared with spatial-domain AEDM, WDAEDM yields a more uniform sound energy distribution. In comparison with other wave-domain methods, WDAEDM outperforms LM, and attains performance comparable to WDACC, while avoiding selecting a regularization parameter.
It is worth mentioning that the computational complexity of the proposed method increases with frequency because a higher order is required. Consequently, more loudspeakers are needed, resulting in increased cost. Therefore, the proposed method is recommended for use primarily at low frequencies. In addition, this paper does not account for environmental variations, and thus WDAEDM is currently suitable only for relatively stable acoustic environments. Future work will focus on developing robust WDAEDM against dynamic environments, but this lies beyond the scope of this paper.
Footnotes
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by the National Natural Science Foundation of China, grant number 12304519, the New Chongqing Youth Innovation Talent Project, grant number CSTB2024NSCQ-QCXMX0068, and the Science and Technology Research Program of Chongqing Municipal Education Commission, grant number KJZD-K202303202.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
