Wave-domain acoustic energy difference maximization method for multi-zone sound field reproduction

Abstract

Multi-zone sound field reproduction aims to create personal sound zones within distinct spatial regions of a shared environment using a loudspeaker array. This paper proposes a wave-domain acoustic energy difference maximization (WDAEDM) method for reproducing multi-zone sound field. The proposed method first represents sound fields and transfer functions with spatial harmonic expansion, then constructs an acoustic energy difference maximization (AEDM) model in the wave domain using sound field coefficients and their relationship with loudspeaker weights, and finally solves this model through eigenvalue decomposition. Simulation results of free-field and automotive cabin acoustic environments have demonstrated that the proposed WDAEDM method presents a more uniform energy distribution within zones compared with the existing spatial-domain methods, and also eliminates the parameter selection issues encountered in existing wave-domain methods.

Keywords

sound zone control wave domain acoustic energy difference harmonic expansion acoustic contrast

Introduction

Multi-zone sound field reproduction utilizes a loudspeaker array to deliver well-isolated sound zones in different regions of a shared space,^1–5 which has broad application prospects in smart car cabins, smart mobile devices, and advanced audio-visual systems. The essence of this technology is to design loudspeaker weights. With the effects of loudspeaker weights and transfer functions, bright zones and dark zones are formed in the space. In bright zones, the high-energy signal is reproduced, while in dark zones, the signal is attenuated as much as possible.

The existing reproduction methods are classified into the energy-based methods and the matching methods. The representatives are acoustic contrast control (ACC)^6–8 and pressure matching (PM),^9–11 respectively. ACC obtains the loudspeaker weights by maximizing the ratio of the energy between bright zones and dark zones, that is, maximizing acoustic contrast (AC). PM achieves weight estimation by minimizing the difference between the reproduced and target signal. ACC method shows higher AC but involves matrix inversion, which leads to high sensitivity to noise when the inverted matrix is severely ill-conditioned.^12–16 To mitigate the effects caused by the inversion of ill-conditioned matrices, regularization methods have been adopted.^17,18 It is found that ACC is highly sensitive to the regularization parameter, and the optimal regularization parameter varies significantly across different frequencies, system configurations, and noise conditions. Therefore, it is a challenging task to select an appropriate regularization parameter. Shin et al. proposed the acoustic energy difference maximization (AEDM) method, which avoids matrix inversion and thereby circumvents regularization.¹⁹ Unlike ACC, AEDM obtains weights by maximizing the energy difference between bright and dark zones and also provides sufficiently high AC.

Both ACC and AEDM are based on discrete spatial point control. However, the discreteness and the limited number of control points lead to performance degradation away from these points, resulting in uneven energy distribution within bright and dark zones. To overcome this limitation, Han et al. proposed wave-domain acoustic contrast control (WDACC),^20,21 which represents sound fields through spatial harmonic expansion and uses sound field coefficients to establish the ACC model in the wave domain. Recently, Wen et al.²² also exploited harmonic expansion to formulate a wave-domain multi-objective optimization problem to achieve a tradeoff between AC, signal distortion and array effort. Compared with spatial-domain methods, these wave-domain methods present a more uniform energy distribution within zones. However, both methods require regularization parameters and/or Lagrange multipliers, whose values strongly depend on frequency and substantially affect performance, thereby making their determination time-consuming and labor-intensive.

Inspired by the wave-domain methods and AEDM, this paper proposes wave-domain acoustic energy difference maximization (WDAEDM). The proposed method first performs spatial harmonic expansion on the sound field and transfer functions with cylindrical harmonic functions. It then uses sound field coefficients and their relationship with loudspeaker weights to construct a model that maximizes the acoustic energy difference between bright zones and dark zones, and finally applies eigenvalue decomposition to obtain the solutions. By incorporating harmonic expansion and the AEDM concept into the model formulation, the proposed method inherits the advantages of both wave-domain approaches and AEDM, namely, achieving a more uniform energy distribution within zones while avoiding the need to select regularization parameters and Lagrange multipliers. In this paper, the environment is assumed to be stable. The paper is organized as follows: In Theory, the proposed WDAEDM method is elaborated. In Simulation, the performance is evaluated. The conclusions are summerized in the last section.

Theory

Model construction

Figure 1 shows a multi-zone sound field reproduction system. A circular loudspeaker array with a radius of $r_{L}$ is used to create a bright zone and a dark zone within the region. The number of loudspeakers is L, and the radii of the bright zone and the dark zone are $r_{B}$ and $r_{D}$ , respectively. $r_{B}$ and $r_{D}$ are smaller than $r_{L}$ . $M_{B}$ and $M_{D}$ control points are placed in the bright zone and the dark zone. Each loudspeaker is regarded as a point source and placed in a free field, and thus the transfer function between the $m th (m = 1, 2, \dots, M_{B (D)})$ control point and the $l th (l = 1, 2, \dots, L)$ loudspeaker is

g_{l, m}^{B (D)} (ω) = \frac{e^{- i k | r_{m} - r_{l} |}}{4 π | r_{m} - r_{l} |},

(1)

where

r_{m}

and

r_{l}

are positions of the

m th

control point and the

l th

loudspeaker,

i = \sqrt{- 1}

is the imaginary unit,

k = ω / c

is the wave number,

ω

is the circular frequency, and

c

is the speed of sound.

Figure 1.

Multi-zone sound field reproduction system.

The weight of the $l th$ loudspeaker is denoted as $q_{l}$ , and the pressure at each control point within the bright zone and the dark zone is expressed as

p_{m}^{B (D)} = \sum_{l}^{L} g_{l, m}^{B (D)} q_{l} .

(2)

For brevity,

ω

is omitted in this and subsequent equations. Let

p^{B} = {[p_{1}^{B}, p_{2}^{B}, \dots, p_{M_{B}}^{B}]}^{T}

\in C^{M_{B} \times 1}

and

p^{D} = {[p_{1}^{D}, p_{2}^{D}, \dots, p_{M_{D}}^{D}]}^{T} \in C^{M_{D} \times 1}

denote the pressure vectors of the bright zone and the dark zone, respectively. The superscript “^T” represents the transport operator. According to equation (2),

p^{B}

and

p^{D}

can be written as

{\begin{cases} p^{B} = G^{B} q \\ p^{D} = G^{D} q \end{cases},

(3)

where

q = {[q_{1}, q_{2}, \dots, q_{L}]}^{T} \in C^{L \times 1}

is the weight vector, and

G^{B} \in C^{M_{B} \times L}

and

G^{D} \in C^{M_{D} \times L}

are the transfer function matrices which take equation (1) as elements.

Acoustic energy difference maximization

AEDM method estimates the loudspeaker weights by maximizing the energy difference between the bright zone and the dark zone. The energy within the bright zone and the dark zone is

{\begin{cases} E^{B} = \frac{{(p^{B})}^{H} p^{B}}{M_{B}} = q^{H} R^{B} q \\ E^{D} = \frac{{(p^{D})}^{H} p^{D}}{M_{D}} = q^{H} R^{D} q \end{cases},

(4)

where

R^{B} = {(G^{B})}^{H} G^{B} / M_{B} \in C^{L \times L}

and

R^{D} = {(G^{D})}^{H} G^{D} / M_{D} \in C^{L \times L}

are the spatial auto-correlation matrices. The superscript “^H” represents the Hermitian operator. To maximize the energy difference between the bright zone and the dark zone, the cost function of AEDM is formulated as

\max_{q} J (q) = \frac{E^{B} - ξ E^{D}}{q^{H} q} = \frac{q^{H} (R^{B} - ξ R^{D}) q}{q^{H} q},

(5)

where

E^{L} = q^{H} q

is the energy of the array input, which is used to normalize the energy difference.

ξ (ξ > 0)

is a tuning factor. With a small

ξ

, the cost function tends to maximize the energy within the bright zone. On the contrary, when

ξ

is relatively large, the cost function primarily aims to suppress the energy in the dark zone.

To obtain $\hat{q}$ that maximizes equation (5), we first multiply $q^{H} q$ on the both sides of equation (5), and the cost function becomes

J (q) q^{H} q = q^{H} (R^{B} - ξ R^{D}) q .

(6)

Then, by taking the partial derivatives of equation (6) with regard to

q

, we obtain

\frac{\partial J (q)}{\partial q} q^{H} q + J (q) \frac{\partial (q^{H} q)}{\partial q} = \frac{\partial (q^{H} (R^{B} - ξ R^{D}) q)}{\partial q} .

(7)

When

J (q)

takes its extremum,

\partial J (q) / \partial q = 0

holds. Thus, equation (7) becomes

J (q) \frac{\partial (q^{H} q)}{\partial q} = \frac{\partial (q^{H} (R^{B} - ξ R^{D}) q)}{\partial q} .

(8)

Since

\partial (q^{H} q) / \partial q = 2 q

, and

\partial (q^{H} (R^{B} - ξ R^{D}) q) / \partial q = 2 (R^{B} - ξ R^{D}) q

J (q) q = (R^{B} - ξ R^{D}) q .

(9)

Obviously, the extremum of $J (q)$ is the largest eigenvalue of the matrix $R^{B} - ξ R^{D}$ , and the optimal loudspeaker weights ${\hat{q}}_{AEDM}$ is the eigenvector corresponding to the largest eigenvalue.

Wave-domain acoustic energy difference maximization

AEDM achieves multi-zone sound reproduction based on discrete spatial control points. However, due to the discreteness and the limited number of these points, the performance of AEDM degrades in the regions away from the control points, resulting in uneven energy distribution within the zones. To overcome this limitation, spatial harmonic expansion is introduced into AEDM, leading to WDAEDM.

As shown in Figure 1, there is no source within the zones. Therefore, the sound pressure at any point within these zones can be expanded with respect to the zone center $O^{B (D)}$ using cylindrical harmonic functions,

p^{B (D)} (\overset{⌢}{r}) = \sum_{n = - \infty}^{\infty} a_{n}^{B (D)} J_{n} (k \overset{⌢}{r}) e^{i n \overset{⌢}{ϕ}},

(10)

where

\overset{⌢}{r} = (\overset{⌢}{r}, ϕ)

denotes the position of this point measured from

O^{B (D)}

J_{n} (\cdot)

is the first Bessel function with the order of n, and

a_{n}^{B (D)}

are the sound field coefficients. Similarly, transfer functions represent the responses of unit-amplitude sources at given points, and can be also represented by

g_{l}^{B (D)} (\overset{⌢}{r}) = \sum_{n = - \infty}^{\infty} b_{n, l}^{B (D)} J_{n} (k \overset{⌢}{r}) e^{i n \overset{⌢}{ϕ}},

(11)

where

g_{l}^{B (D)} (\overset{⌢}{r})

is the transfer function between the

l th

loudspeaker and the position

\overset{⌢}{r}

b_{n}^{B (D)}

are the sound field coefficients of transfer functions.

b_{n}^{B (D)}

can be obtained by sampling the sound pressures at evenly spaced control points on a circle,²²

b_{n}^{B (D)} = \frac{1}{J_{n} (k r_{B (D)})} \sum_{m = 1}^{M_{B (D)}} \frac{g_{l}^{B (D)} ({\overset{⌢}{r}}_{m})}{M_{B (D)}} e^{- i 2 π n m / M_{B (D)}} .

(12)

where

g_{l}^{B (D)} ({\overset{⌢}{r}}_{m})

denotes the transfer function between the

l th

loudspeaker and the

m th

control point. Combining equation (2), equation (10), and equation (11), we could build the relationship between

a_{n}^{B (D)}

and

b_{n, l}^{B (D)}

a_{n}^{B (D)} = \sum_{l = 1}^{L} b_{n, l}^{B (D)} q_{l} .

(13)

In the wave domain, the energy in the bright zone and the dark zone is expressed as

{\begin{cases} E^{B} = \frac{1}{S_{B}} \iint_{S_{B}} {| p^{B} (\overset{⌢}{r}) |}^{2} \overset{⌢}{r} d \overset{⌢}{r} d \overset{⌢}{ϕ} = \frac{2}{r_{B}^{2}} \sum_{n = - \infty}^{\infty} {(a_{n}^{B})}^{2} \int_{0}^{r_{B}} J_{n}^{2} (k \overset{⌢}{r}) \overset{⌢}{r} d \overset{⌢}{r} = \sum_{n = - \infty}^{\infty} {(a_{n}^{B})}^{2} w_{n} (k r_{B}) \\ E^{D} = \frac{1}{S_{D}} \iint_{S_{D}} {| p^{D} (\overset{⌢}{r}) |}^{2} \overset{⌢}{r} d \overset{⌢}{r} d \overset{⌢}{ϕ} = \frac{2}{r_{D}^{2}} \sum_{n = - \infty}^{\infty} {(a_{n}^{D})}^{2} \int_{0}^{r_{D}} J_{n}^{2} (k \overset{⌢}{r}) \overset{⌢}{r} d \overset{⌢}{r} = \sum_{n = - \infty}^{\infty} {(a_{n}^{D})}^{2} w_{n} (k r_{D}) \end{cases},

(14)

where

S_{B}

and

S_{D}

are the areas of the bright zone and the dark zone, and

w_{n} (k r_{B (D)}) = 2 / r_{B (D)}^{2} \int_{0}^{r_{B (D)}} J_{n}^{2} (k \overset{⌢}{r}) \overset{⌢}{r} d \overset{⌢}{r} = J_{n}^{2} (k r_{B (D)}) - J_{n - 1} (r_{B (D)}) J_{n + 1} (r_{B (D)})

is the weight.²¹ Equation (4) indicates that, in the spatial domain, the energy within a sound zone is estimated using the sound pressure at discrete control points. Equation (14) shows that, in the wave domain, the energy is obtained by integrating over the entire continuous region. This is the key to the WDAEDM method to overcome the problem of uneven energy distribution.

Previous studies^16,22–24 have shown that, when n exceeds $N \approx ⌈ k r_{B (D)} ⌉$ , where $⌈ \cdot ⌉$ is the ceiling function, $w_{n} (k r_{B (D)})$ drops to zeros. That is, only the basis functions $J_{n} (k \overset{⌢}{r}) e^{i n \overset{⌢}{ϕ}}$ with indices in the set [-N, N] contribute significant energy to the sound zones. Therefore, we truncate the order to $N \approx ⌈ k r_{B (D)} ⌉$ . Define $a^{B (D)} = {[a_{- N}^{B (D)}, \dots, a_{N}^{B (D)}]}^{T} \in C^{(2 N + 1) \times 1}$ , $b_{l}^{B (D)} = {[b_{- N, l}^{B (D)}, \dots, b_{N, l}^{B (D)}]}^{T} \in C^{(2 N + 1) \times 1}$ , $B^{B (D)} = [b_{1}^{B (D)}, b_{2}^{B (D)}, \dots, b_{L}^{B (D)}] \in C^{(2 N + 1) \times L}$ , $W^{B (D)} =$ $Diag ([w_{- N} (k r_{B (D)}), \dots, w_{N} (k r_{B (D)})]) \in C^{(2 N + 1) \times (2 N + 1)}$ , where $Diag (\cdot)$ constructs a diagonal matrix using the vector in parenesis as its diagonal. We rewrite equation (13) and equation (14) into a matrix form,

a^{B (D)} = B^{B (D)} q .

(15)

{\begin{cases} E^{B} = {(a^{B})}^{H} W^{B} a^{B} \\ E^{D} = {(a^{D})}^{H} W^{D} a^{D} \end{cases} .

(16)

By using the energy expression in equation (16) and the relationship in equation (15), we construct the AEDM model,

\max_{q} J (q) = \frac{E_{B} - ξ E_{D}}{q^{H} q} = \frac{{(a^{B})}^{H} W^{B} a^{B} - ξ {(a^{D})}^{H} W^{D} a^{D}}{q^{H} q} = \frac{q^{H} ({\bar{R}}^{B} - ξ {\bar{R}}^{D}) q}{q^{H} q},

(17)

where

{\bar{R}}^{B} = {(B^{B})}^{H} W^{B} B^{B}

and

{\bar{R}}^{D} = {(B^{D})}^{H} W^{D} B^{D}

are the auto-correlation matrices in the wave domain. The steps used to solve equation (5) are used here to solve equation (17), and we get

J (q) q = ({\bar{R}}^{B} - ξ {\bar{R}}^{D}) q

(18)

Similarly, the extremum of $J (q)$ is the largest eigenvalue of the matrix ${\bar{R}}^{B} - ξ {\bar{R}}^{D}$ , and the optimal loudspeaker weights ${\hat{q}}_{WDAEDM}$ is the eigenvector corresponding to the largest eigenvalue.

Simulations

Free-field simulation

This subsection examines the performance of the proposed method based on the free-field simulation. This simulation is performed in MATLAB 2022. Figure 2 shows the layout of a multi-zone sound reproduction system. A circular array with 30 loudspeakers and a radius of 2 m is placed at the origin. Inside the circular array, two circular zones with a radius of 0.5 m are defined as the bright zone and the dark zone. The centers of the two zones are located at (0.6 m, 0.5 m) and (−0.5 m, −0.7 m), respectively. 22 control points are evenly distributed along the circumferences of the bright zone and the dark zone. Multiple evaluation points are uniformly distributed within two zones, with a spacing of 0.01 m between the adjacent evaluation points.

Figure 2.

The layout of the multi-zone sound reproduction system used in the simulation. (▼ loudspeakers,● control points, █ evaluation points).

Evaluation metrices

We use the pressure at the evaluation points to calculate acoustic contrast (AC) and audible gain (AG). AC is defined as the ratio of the energy between the bright zone and the dark zone,

AC = 10 \log_{10} \frac{E^{B}}{E^{D}} \approx 10 \log_{10} (\frac{{\tilde{M}}_{D}}{{\tilde{M}}_{B}} \frac{q^{H} {({\tilde{G}}^{B})}^{H} {\tilde{G}}^{B} q}{q^{H} {({\tilde{G}}^{D})}^{H} {\tilde{G}}^{D} q}),

(19)

where

{\tilde{M}}_{B}

and

{\tilde{M}}_{D}

are the number of evaluation points in the bright zone and the dark zone, and

{\tilde{G}}^{B}

and

{\tilde{G}}^{D}

represent the matrices whose elements are the transfer functions between the loudspeakers and the evaluation points. AG is defined as the ratio of the energy between the bright zone and the loudspeaker array input,

AG = 10 \log_{10} \frac{E^{B}}{E^{L}} \approx 10 \log_{10} (\frac{1}{{\tilde{M}}_{B}} \frac{q^{H} {({\tilde{G}}^{B})}^{H} {\tilde{G}}^{B} q}{q^{H} q})

(20)

A high AC indicates strong isolation between the bright zone and the dark zone, while a high AG reflects efficient sound energy radiation by the loudspeaker array. The larger the two metrices, the better the performance.

The influence of the tuning factor

According to equation (14), the tuning factor $ξ$ simultaneously affects AC and AG. This subsection analyzes the influence of $ξ$ on the performance of the proposed WDAEDM method. $ξ$ is logarithmically sampled within the range [10⁻², 10⁵], with nine data points selected per order of magnitude, resulting in a total of 63 sampling points. The frequency ranges from 20 Hz to 1200 Hz. Figure 3 shows the AC and AG at different tuning factors across various frequencies.

Figure 3.

AC and AG under different tuning factors and frequencies.

As shown in Figure 3(a), when the frequency is fixed, AC increases with $ξ$ . For each frequency, AC reaches its peak and stops increasing once $ξ$ exceeds 1000. The black dashed curve in this figure indicates the variation of $ξ$ with frequency when AC is fixed at 20 dB. This curve shows that, within the frequency range of 20–1100 Hz, $ξ$ fluctuates between 0 and 60 to maintain an AC of 20 dB. When $ξ$ is set to 60, the WDAEDM method achieves an AC above 20 dB across the entire 20–1100 Hz frequency range. Figure 3(b) shows that, below 200 Hz, AG decreases as $ξ$ increases. However, above 200 Hz, $ξ$ has little impact on AG.

Performance comparison

This subsection compares the performance of the AEDM,¹⁹ WDACC,^20,21 WDAEDM, and Lagrange method (LM).²² Multi-zone sound field reproduction is performed at 200 Hz and 1200 Hz. For WDAEDM and AEDM, $ξ$ is set to 60. For WDACC, the regularization parameter corresponding to the optimal AC is selected using the traversal method (hereafter referred to as the optimal regularization parameter). For LM, its Lagrange multipliers are selected to satisfy the constraints on the energy in the dark zone and array effort. Figure 4 shows the energy distribution maps of the reproduced sound fields. At 200 Hz (Figure 4(a)–(d)), all these methods exhibit good sound zone separation. AEDM, WDACC, and WDAEDM achieve an AC above 28 dB and exhibit comparable performance in suppressing energy in the dark zone, whereas LM yields an AC slightly below 28 dB. The AG of the four methods is −18.61 dB, −17.15 dB, −16.77 dB, and −19.49 dB, respectively, indicating that the WDACC and WDAEDM methods achieve higher radiation efficiency in the bright zone compared to AEDM and LM. In addition, the maps reveal that WDACC, WDAEDM, and LM do not exhibit the dropouts present in the AEDM method within the bright zone, resulting in a more uniform energy distribution. As frequency increases to 1200 Hz, LM fails, and the performance of the other three methods decreases to varying degrees. The AC of AEDM drops to 12.12 dB, while that of WDACC and WDAEDM decreases to approximately 19.5 dB, indicating that the latter two methods perform similarly in terms of AC control and outperform AEDM. The underlying reason is that AEDM fails to sufficiently suppress energy in the dark zone, leading to uneven energy distribution. At this frequency, the AG of the three methods is −24.34 dB, −23.55 dB, and −23.70 dB, respectively. The WDACC and WDAEDM methods demonstrate higher radiation efficiency in the bright zone compared to AEDM. In summary, WDACC and WDAEDM outperform AEDM and LM.

Figure 4.

The energy distribution maps at 200 Hz and 1200 Hz.

In this part, we further compare the performance of WDACC and WDAEDM. The frequency ranges from 20 Hz to 1200 Hz. For WDAEDM, $ξ$ is set to 1000 at each frequency to achieve optimal AC. For WDACC, two strategies are adopted to determine the regularization parameter: (1) the traversal method and (2) the empirical maximal singular value-based method (SV).²⁵

Figure 5 shows AC, AG, and sound pressure levels (SPLs) in the dark zone of each method. It is interesting to find that: (1) WDAEDM and WDACC (with the optimal regularization parameter) show comparable AC, AG, and SPL in the dark zone. However, it is worth noting that the optimal regularization parameter for WDACC varies significantly across frequencies, as shown in Figure 6. To achieve an optimal AC at each frequency, WDACC requires substantial manual effort and time to determine the optimal parameter frequency-by-frequency. In contrast, WDAEDM avoids this issue, as a fixed tuning factor can yield performance equivalent to WDACC across most frequencies, which is a key advantage of WDAEDM. (2) The WDACC method using the SV-based regularization fails to effectively suppress energy in the dark zone at several frequencies, resulting in SPLs exceeding 50 dB, which further leads to a marked decline in AC at those frequencies. The reason is that the empirically chosen regularization parameter does not reliably mitigate the impact of ill-conditioned matrix inversion on the performance. Therefore, WDAEDM has better performance than WDACC.

Figure 5.

AC, AG, and SPL in dark zone versus frequency.

Figure 6.

The optimal regularization parameter of WDACC versus frequency.

To summarize, compared with the other three methods, the proposed WDAEDM achieves higher AC and AG, resolves the issue of uneven energy distribution, and avoid spending extensive effort and time on regularization. Overall, WDAEDM delivers the best performance among all approaches discussed above.

Finite element-based simulation

This subsection conducts finite element-based simulations to evaluate the performance of the proposed method in a car cabin acoustic environment. The acoustic model of the car cabin, as shown in Figure 7, is built on the COMSOL Multiphysics platform. The car windows, dashboards, and doors are modeled with constant absorption coefficients ( $α$ = 0.005, 0.01, and 0.01, respectively).²⁶ The complex impedance of leather seats is set based on experimental data,²⁷ and the ceiling and floor are defined using porous layers under impedance boundary conditions. The depths of ceiling and carpet are 0.7 cm and 0.5 cm. The flow resistance rates are 20,000 $Pa \cdot s / m^{2}$ and 10,000 $Pa \cdot s / m^{2}$ . After completing the mesh independence verification, a grid size of approximately 6 cm was finalized for the finite element model. The head positions of the driver and front passenger are defined as the bright zone and the dark zone, respectively. The linear loudspeaker array is used. The control points (microphones) are positioned at ear level, with the center located 0.55 m away from the loudspeaker array along the normal direction. 30 evaluation points (microphones) are densely distributed within the bright zone and the dark zone to assess performance.

Figure 7.

The finite element acoustic model of the car cabin.

AEDM, WDACC, WDAEDM, and LM are used for multi-zone reproduction. AC, AG, and SPLs in the dark zone (SPL_d) at 500 Hz for each method are listed in Table 1. The corresponding energy distribution maps of the reproduced sound fields are shown in Figure 8. For AEDM, the performance metrics calculated using the pressure at the evaluation points are generally inferior to those calculated from the control points. This is because AEDM is based on discrete point control, providing good performance only near those control points, with degradation observed farther away. For the WDACC and WDAEDM, they show comparable performance, and their performance metrics based on evaluation points are better than those based on control points, due to its ability to control the sound field over continuous regions. In addition, WDACC and WDAEDM outperform LM in term of AC and SPLd. However, the proposed WDAEDM does not need regularization parameters.

Table 1.

The comparison of performance metrics.

	AC/(dB)		AG/(dB)		SPL_d/(dB)
Method	Control point	Evaluation point	Control point	Evaluation point	Control point	Evaluation point
AEDM	19.75	17.08	−34.19	−34.39	38.54	40.09
WDACC	21.85	22.21	−31.02	−31.01	39.71	39.39
WDAEDM	19.90	21.98	−35.05	−34.37	38.07	36.40
LM	17.13	17.10	−39.94	−34.93	40.21	40.12

(Note. “Control points” refer to results calculated based on the sound pressure at control points, while “Evaluation points” refer to results calculated based on the sound pressure at evaluation points).

Figure 8.

The energy distribution maps of the reproduced sound field in the car cabin.

Conclusion

This paper proposes the WDAEDM method for multi-zone sound field reproduction based on spatial harmonic expansion and acoustic energy difference maximization. Its performance is examined through the free-field and the car cabin acoustic environment simulations. Results show that compared with spatial-domain AEDM, WDAEDM yields a more uniform sound energy distribution. In comparison with other wave-domain methods, WDAEDM outperforms LM, and attains performance comparable to WDACC, while avoiding selecting a regularization parameter.

It is worth mentioning that the computational complexity of the proposed method increases with frequency because a higher order is required. Consequently, more loudspeakers are needed, resulting in increased cost. Therefore, the proposed method is recommended for use primarily at low frequencies. In addition, this paper does not account for environmental variations, and thus WDAEDM is currently suitable only for relatively stable acoustic environments. Future work will focus on developing robust WDAEDM against dynamic environments, but this lies beyond the scope of this paper.

Footnotes

ORCID iD

Zhigang Chu

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by the National Natural Science Foundation of China, grant number 12304519, the New Chongqing Youth Innovation Talent Project, grant number CSTB2024NSCQ-QCXMX0068, and the Science and Technology Research Program of Chongqing Municipal Education Commission, grant number KJZD-K202303202.

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

References

Nelson

Elliott

. Active control of sound. Academic, 1992.

Elliott

Jones

. An active headrest for personal audio. J Acoust Soc Am 2006; 119(5): 2702–2709.

Druyvesteyn

Garas

. Personal sound. J Audio Eng Soc 1997; 45(9): 685–701.

Cheer

Elliott

. Design and implementation of a personal audio system in a car cabin. J Acoust Soc Am 2013; 133(1): 3251.

Liao

Elliott

Cheer

, et al. Design array of loudspeakers for personal audio in a car cabin system. In: Proceedings of 23rd International Congress on Sound and Vibration, Athens, Greece, 2016.

Choi

Kim

. Generation of an acoustically bright zone with an illuminated region using multiple sources. J Acoust Soc Am 2002; 111(4): 1695–1700.

Chang

Lee

Park

, et al. A realization of sound focused personal audio system using acoustic contrast control. J Acoust Soc Am 2009; 125(4): 2091–2097.

Zhu

Coleman

, et al. Robust acoustic contrast control with reduced in-situ measurement by acoustic modeling. J Audio Eng Soc 2017; 65(6): 460–473.

Zhang

Shi

Christensen

, et al. CGMM-based sound zone generation using robust pressure matching with ATF perturbation constraints. IEEE/ACM Trans Audio Speech Lang Process 2023; 31: 3331–3345.

10.

Koyama

Kimura

Ueno

. Weighted pressure and mode matching for sound field reproduction: theoretical and experimental comparisons. J Audio Eng Soc 2023; 71(4): 173–185.

11.

Moles-cases

Elliott

Cheer

, et al. Weighted pressure matching with windowed targets for personal sound zones. J Acoust Soc Am 2022; 151(1): 334–345.

12.

Demirbilek

Tedjani

Seadawy

. Analytical solutions of the combined Kairat-Ⅱ-Ⅹ equation: a dynamical perspective on bifurcation, chaos, energy, and sensitivity. AIMS Math 2025; 10(6): 13664–13691.

13.

Rizvi

STR

Batool

Seadawy

, et al. Multiple rational solutions for simplified modified Camassa–Holm dynamical model with applications in modern physics. Mod Phys Lett A 2025; 40(23): 2550080.

14.

Ahmed

Hashem

Rizvi

STR

, et al. Characterizing the physical and dynamical properties of lump, rogue waves and their interactions for a cascaded system with spatio‐temporal dispersion and Kerr nonlinearity. AIMS Math 2025; 10(7): 16498–16525.

15.

Jlali

Rizvi

STR

Shabbir

, et al. Study of optical solitons and quasi-periodic behaviour for the fractional cubic quintic nonlinear pulse propagation model. Mathematics 2025; 13: 2117.

16.

Venini

Nascimbene

. A new fixed-point algorithm for hardening plasticity based on non-linear mixed variational inequalities. Int J Numer Methods Eng 2023; 57(1): 83–102.

17.

Zhu

Coleman

, et al. Robust reproduction of sound zones with local sound orientation. J Acoust Soc Am 2017; 142(1): EL118–EL122.

18.

Elliott

Cheer

Choi

Kim

. Robustness and Regularization of Personal Audio Systems. IEEE Transactions on Audio, Speech, and Language processing. 2012; 20(7): 2123–2133.

19.

Shin

Lee

Fazi

, et al. Maximization of acoustic energy difference between two spaces. J Acoust Soc Am 2010; 128(1): 121–131.

20.

Han

Zhu

, et al. Two-dimensional multizone sound field reproduction using a wave-domain method. J Acoust Soc Am 2018; 144(3): EL185–EL190.

21.

Han

Zhu

, et al. Three-dimensional wave-domain acoustic contrast control using a circular loudspeaker array. J Acoust Soc Am 2019; 145(6): EL488–EL493.

22.

Wen

Fan

, et al. A multizone sound field reproduction method with constrained zone acoustic energy in the modal domain. Appl Acoust 2024; 220: 109959.

23.

Betlehem

Abhayapala

. Theory and design of sound field reproduction in reverberant rooms. J Acoust Soc Am 2005; 117(4): 2100–2111.

24.

Kennedy

Sadeghi

Abhayapala

, et al. Intrinsic limits of dimensionality and richness in random multipath fields. IEEE Trans Signal Process 2007; 55(6): 2542–2556.

25.

Stewart

. Introduction to matrix computations. Academic, 1973.

26.

Busch

Cox

Antonio

. Acoustic absorbers and diffusers: theory, design and application. Noise Control Eng J 2010; 58: 467–468.

27.

Didier

. In situ estimation of the acoustic properties of vehicle interiors. Technical University of Denmark, 2019.