Abstract
Two-dimensional grid-free compressive beamforming with rectangular planar microphone arrays has attracted considerable attention due to its wide source identification region (the hemispherical space in front of arrays), high accuracy (overcoming basis mismatch), and robust performance. However, the uniform distribution of microphones tends to result in spatial aliasing, i.e., the existing single-frequency methods fail to achieve accurate localization for the sources with wavelengths shorter than twice the minimum spacing between microphones. This paper proposes a two-dimensional multi-frequency synchronous grid-free compressive beamforming method, aimed at mitigating spatial aliasing and extending its applicability beyond the sampling limit imposed by the array geometry. First, a two-dimensional multi-frequency synchronous atomic norm minimization problem is formulated and then converted into an equivalent dual problem. Subsequently, a spatial spectral function is constructed from the solution of the dual problem to extract source positions and strengths. Simulation and experiment results show that when the spatial Nyquist criterion is satisfied, both the proposed multi-frequency synchronous method and existing single-frequency methods can accurately identify sources, with the proposed method reducing strength quantification error by over 80% under strong interference. When the spatial Nyquist criterion is violated, the proposed method can effectively suppress spatial aliasing inherent in the existing single-frequency methods and accurately identify common engineering noise sources with high-frequency components, such as electric motor noise, gear whine, and bearing noise.
Keywords
1. Introduction
Compressive beamforming (CB)1,2 enables clear and intuitive acoustic imaging, shows strong anti-interference capability and applicability to both coherent and incoherent sources, and thus has great application value in engineering fields such as aerospace,3–6 automobile,7,8 wind power generation9–11 and so on.
Based on the mathematical models, CB can be categorized into four types: on-fixed-grid methods, off-fixed-grid methods, dynamic-grid methods, and grid-free methods. 12 On-fixed-grid CB discretizes the imaging region into a set of fixed grid points, constructs a sensing matrix using the transfer functions between the grid points and microphones, and then formulates an underdetermined equation system relating the measured pressures to the sensing matrix and the source distribution. Sparse constraints and sparse recovery algorithms are applied to solve this equation system. When acoustic sources fall on the preset grid points, on-fixed-grid CB performs well. However, when sources deviate from the grid points, these methods localize sources to the nearby grid points, resulting in significant performance degradation. This is known as basis mismatch problem. 2 The other three types of methods are established to overcome this problem. Off-fixed-grid CB also discretizes the imaging area but mitigates basis mismatch through initial on-grid localization and off-grid offset compensation.13–15 Dynamic-grid CB discretizes the imaging region into dynamic grids and drives these grids to move toward true source positions, which performs well with an appropriate initial grid setting. 16 Different from the above two types of CB methods, grid-free CB treats the source distribution as a continuum, thereby avoiding basis mismatch at the root. 17
Many researchers have made efforts to develop two-dimensional grid-free CB methods based on planar microphone arrays, which enables the identification of the sources lying in the hemispherical space in front of the arrays. In 2017, Yang et al. proposed a two-dimensional grid-free CB method based on atomic norm minimization (ANM), 17 and later introduced iterative reweighted ANM to improve spatial resolution.18,19 In 2019, Yang et al. exploited multiple-snapshot measurements to enhance the performance 20 and developed an efficient solver based on the alternating direction method of multipliers (ADMM). 21 Tang et al. solved the ANM problem in the dual domain by defining a spatial spectral function related to the dual polynomial. 22 In 2020, Liu et al. proposed an iterative Vandermonde decomposition and shrinkage thresholding-based two-dimensional grid-free CB method, 23 which solves its model using accelerated proximal mapping and alternating projections. This method avoids performance degradation caused by inaccurate SNR estimation.
The core of the above two-dimensional grid-free CB methods is to formulate and solve the ANM model, for which the construction of a two-level Toeplitz matrix is essential. Therefore, the existing two-dimensional grid-free CB methods can be only applied to regular array configurations, such as uniform or sparse rectangular microphone arrays. 24 For a given number of microphones and equivalent array diameters, regular microphone arrays have a larger minimum spacing between microphones. According to the spatial Nyquist criterion,25,26 when the minimum spacing between microphones exceeds half the wavelength, DOA estimation may become ambiguous, i.e., high-frequency spatial aliasing occurs. The existing two-dimensional grid-free CB methods process signals at a single frequency, and therefore, their upper applicable frequency limit is constrained by the spatial Nyquist criterion.
Spatial aliasing is frequency-dependent, meaning that aliasing artifacts appear at different locations across frequencies, while the true source locations remain frequency-independent. Therefore, by establishing a multi-frequency model in which the frequency separation (measured in wavelength) exceeds twice the minimum microphone spacing, the intersection of the source distribution sparsity profile across frequencies contains exclusively the true DOAs, enabling aliasing-free DOA estimation. 27 Based on that, some researchers developed a joint sparse multi-frequency model with one-dimensional linear arrays28–30 to integrate acoustic information at different frequencies into a unified optimization framework, achieving spatial aliasing suppression in one-dimensional line spectral estimation. Inspired by these reports, this paper proposes a two-dimensional multi-frequency synchronous grid-free CB method. The proposed method aims to suppress high-frequency spatial aliasing caused by the use of regular microphone arrays and effectively identify sources whose wavelengths are smaller than twice the minimum microphone spacing, thereby extending its applicability to identifying engineering noise containing high-frequency components, such as electric motor noise, gear whine, and bearing noise. The proposed method first formulates a two-dimensional multi-frequency model and defines the atomic norm. Then, it constructs an infinite-dimensional ANM problem and converts it to a dual problem with finite-dimensional variables for solving. Finally, a spatial spectral function related to the dual polynomial is used to recover the primal solution, from which DOAs and source strengths are extracted.
The main contributions of this article are as follows: ● A two-dimensional multi-frequency synchronous grid-free CB model is established based on atomic norm minimization for source identification in two-dimensional space. ● A multi-frequency spatial spectral function is proposed to extract source information. ● Compared with the existing two-dimensional grid-free CB methods, the proposed method can effectively suppress spatial aliasing and accurately identify the sources with wavelengths shorter than twice the minimum spacing between microphones due to its multi-frequency synchronous model. Compared with the existing multi-frequency CB methods, the proposed method is the two-dimensional extension of these methods, and can achieve two-dimensional source localization in continuous space.
The remainder of this paper is organized as follows: Section 2 introduces the theory of the proposed two-dimensional multi-frequency synchronous grid-free CB, Section 3 evaluates the performance of the proposed method via simulations and Monte Carlo trials, Section 4 validates the correctness of simulations and the effectiveness of the proposed method via loudspeaker identification experiments, and Section 5 summarizes the paper.
2. Theory of two-dimensional multi-frequency synchronous grid-free compressive beamforming
2.1 Two-dimensional multi-frequency synchronous model
Figure 1 shows the measurement model. Sources are located in the far field and radiate plane waves. Measurement model.
2.2 Primal problem
In the continuous spatial domain, the atomic norm of
2.3 Dual problem
Firstly, the Lagrange function of Eq. (7) is constructed,
Then, we minimize
Therefore, the optimal noise matrix is
It is known that
We construct a Hermitian matrix
Each block
Let
For an arbitrary
According to the Schur complement condition, we have
Overall, the primal minimization problem (Eq. (7)) is equivalent to the maximization of the dual function (Eq (14).), and it can be rewritten as
This positive semidefinite programming problem can be solved by the SDPT3 solver in the CVX toolbox.
2.4 DOA and strength estimation
Define the spatial spectral function as
With the estimated DOAs, we obtain
3. Simulation
A rectangular microphone array consisting of 8 rows and 8 columns (a total of 64 microphones) is used, with a spacing of
We also define the probability of accurate identification (P), the DOA estimation root mean square error (RMSE), and the strength quantification mean error (ME) as
3.1 Aliasing-free case
Assume three sources. They radiate sound waves with a frequency of 3000 Hz. The DOAs ( Identification results of the proposed method (single-frequency modelling) and the method in Ref.
17
under aliasing-free condition.
We vary SNR and perform Monte Carlo trials. The frequency is set to 3000 Hz. The SNR ranges from 5 dB to 30 dB with a step of 5 dB. At each SNR, 1000 trials are considered. In each trial, four sources are assumed, whose DOAs are randomly generated while satisfying the minimum separation condition.
18
The strengths are set to unit amplitude with random phases. Figure 3 shows P, RMSE, ME, and the average runtime for the two methods. As shown in Figure 3(a), the proposed method achieves a higher probability of accurate identification than the method in Ref.
17
across different SNRs. From Figure 3(b) and 3(c), it can be observed that both methods yield comparable DOA estimation accuracy, whereas the proposed method significantly outperforms the method in Ref.
17
in terms of strength quantification. The advantage becomes more pronounced as the SNR decreases. At SNRs of 10 dB and 5 dB, the strength quantification error is reduced by over 80% and 90%, respectively. In Figure 3(d), the runtimes of the two methods are nearly identical, since they operate on matrices of the same dimension and employ the same solver. Performance comparison between the proposed method (single-frequency modelling) and the method in Ref.
17
under aliasing-free condition at different SNRs.
3.2 Spatial aliasing case
In the case described in Section 3.1, the source frequency is changed to 7000 Hz, while all other conditions remain unchanged. The results are shown in Figure 4. Figure 4(a) presents the spatial spectral function, which produces peaks with high amplitudes at the locations where there is no source due to spatial aliasing. The imaging maps are shown in Figure 4(b) and 4(c). The proposed method identifies some spurious sources, whereas the method in Ref.
17
could only localize a part of sources. The presence of these spurious sources adversely affects the subsequent strength quantification. Identification results of the proposed method (single-frequency modelling) and the method in Ref.
17
under spatial aliasing.
Assume that the three sources contain spectral components not only at 7000 Hz but also at 7500 Hz and 8000 Hz. The source strengths at the frequencies are [100, 94, 94] dB, [94, 100, 94] dB, and [97, 97, 94] dB (referenced to Identification results of the proposed method under spatial aliasing with multi-frequency synchronous modeling.
SNR estimation is required by CVX toolbox to solve the dual problem. To investigate its influence on the performance of the proposed method, we underestimate and overestimate SNR to 10 dB and 30 dB (true SNR is 20 dB), and examine the performance based on the case in Figure 5. The results under inaccurate SNR estimation are shown in Figure 6. Comparing Figure 5(b) and Figure 6, we observe that underestimation of SNR has negligible effect on both DOA estimation and strength quantification, whereas overestimation slightly increases DOA estimation error, with minimal impact on strength quantification. Identification results of the proposed method under spatial aliasing with multi-frequency synchronous modeling and inaccurate SNR estimation.
We vary frequency and perform Monte Carlo trials. The fundamental frequencies are set to 5000 Hz, 6000 Hz, 7000 Hz, 8000 Hz, and 9000 Hz, respectively. For each fundamental frequency, four additional frequency components are defined, each increasing by 100 Hz from the fundamental one (e.g., when the fundamental frequency is 5000 Hz, the frequency components are [5000, 5100, 5200, 5300, 5400] Hz). The method in Ref.
17
only processes the pressures at the fundamental frequencies, whereas the proposed method utilizes both fundamental and additional frequency components. At each frequency, 1000 trials are conducted with four sources. DOAs are randomly generated while satisfying the minimum separation condition. Each source is assigned a unit amplitude with a random phase, and white Gaussian noise is added to achieve an SNR of 20 dB. Figure 7 shows P, RMSE, ME, and the average runtime for the two methods. Obviously, when frequency increases, the probability of accurately identifying sources of the single-frequency method in Ref.
17
drops sharply due to spatial aliasing. In contrast, the proposed method maintains a high probability and accuracy in source identification. A slight performance degradation is observed for the proposed method at higher frequencies, which may be attributed to the reduced relative frequency spacing that causes overlapping peaks at aliasing positions. Since multi-frequency synchronous processing increases the dimensionality of matrices, the runtime correspondingly rises. Performance comparison between the proposed method (multi-frequency synchronous modelling) and the method in Ref.
17
under spatial aliasing at different frequencies.
4. Experiments
To validate the effectiveness of the proposed method and the correctness of the simulation results, a loudspeaker source identification experiment is conducted in a semi-anechoic chamber using a Experimental layout.
Figure 9(a) shows the spatial spectral function of the proposed method when processing the pressures at 6500 Hz and 7000 Hz. It can be observed that many spurious peaks caused by spatial aliasing are effectively suppressed. Figure 9(b) and 9(c) present the imaging results of the proposed method and the method in Ref.
17
at 6500 Hz. The method in Ref.
17
regards the positions where spatial aliasing occurs as sources. The proposed method successfully locates the sources close to their true positions, though there is a weak spurious source within the dynamic range. The experiment draws a similar conclusion to the simulations. In summary, the proposed two-dimensional multi-frequency synchronous grid-free CB method effectively suppresses spatial aliasing and accurately identifies sources whose wavelengths are shorter than twice the minimum microphone spacing. Experimental results of the proposed method (multi-frequency synchronous modelling) and the method in Ref.
17
under spatial aliasing.
5. Conclusion
To overcome high-frequency spatial aliasing problem of the two-dimensional single-frequency grid-free CB method, this paper proposes a two-dimensional multi-frequency synchronous grid-free CB method. The approach formulates a two-dimensional multi-frequency synchronous atomic norm minimization problem, then converts it into a dual problem for solving, and finally extracts DOAs and strengths by using a spatial spectral function related to the dual polynomial. Simulations and experiments demonstrate that when the spatial Nyquist criterion is satisfied, both the proposed method and the existing single-frequency method can achieve accurate source identification, while the proposed method reduces strength quantification error by over 80% under strong noise interference. When the spatial Nyquist criterion is violated, the proposed method effectively suppresses spatial aliasing and accurately identifies sources whose wavelengths are shorter than twice the minimum microphone spacing. The dual problem is currently solved using the CVX toolbox, which suffers from low computational efficiency. Future work will focus on developing a more efficient solver to reduce computational cost, as well as extending the method to non-uniform array geometries.
Footnotes
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by the National Natural Science Foundation of China, grant number 12304519, the Natural Science Foundation of Chongqing, grant number CSTB2023NSCQ-MSX0548, and the New Chongqing Youth Innovation Talent Project, grant number CSTB2024NSCQ-QCXMX0068.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
