Spherical microphone arrays are an attractive solution for bandwidth of the array. However, as with all discretely sampled aliasing occurs when the spacing between adjacent sensors is too large. This paper investigates alternative approaches to mitigate the spatial 2. MODAL BEAMFORMING arrays based on MODAL BEAMFORMING. The alternative approaches are based on: spatial anti-aliasing filters, exploiting the natural diffraction.
Original Description:
Original Title
HANDLING SPATIAL ALIASING IN SPHERICAL ARRAY APPLICATIONS
Spherical microphone arrays are an attractive solution for bandwidth of the array. However, as with all discretely sampled aliasing occurs when the spacing between adjacent sensors is too large. This paper investigates alternative approaches to mitigate the spatial 2. MODAL BEAMFORMING arrays based on MODAL BEAMFORMING. The alternative approaches are based on: spatial anti-aliasing filters, exploiting the natural diffraction.
Spherical microphone arrays are an attractive solution for bandwidth of the array. However, as with all discretely sampled aliasing occurs when the spacing between adjacent sensors is too large. This paper investigates alternative approaches to mitigate the spatial 2. MODAL BEAMFORMING arrays based on MODAL BEAMFORMING. The alternative approaches are based on: spatial anti-aliasing filters, exploiting the natural diffraction.
HANDLING SPATIAL ALIASING IN SPHERICAL ARRAY APPLICATIONS
Jens Meyer and Gary W Elko
mh acoustics 25A Summit Ave. Summit, NJ 07901, USA ABSTRACT Another characteristic that determines the number of re- quired sensors is the upper frequency limit of the operating Spherical microphone arrays are an attractive solution for bandwidth of the array. As with conventional arrays this up- many audio applications where flexible beamforming in all per frequency limit is determined by spatial aliasing. Spatial directions is desired. However, as with all discretely sampled aliasing occurs when the spacing between adjacent sensors array systems, spatial aliasing puts a constraint on the upper becomes too large. The topic has been treated by other au- operating frequency range and therefore limits some poten- thors before, e.g. [4], [6]. However, no practical solution tial applications. Simply increasing the number of sensors has been suggested for non-spatially-bandlimited soundfields. to overcome the effect of spatial aliasing is effective but can This work takes a closer look at the spatial aliasing and de- be expensive, especially if very high quality microphones scribe several solutions to reduce it. elements are used. This paper reviews the origin of spatial aliasing for spherical arrays based on modal beamforming and investigates alternative approaches to mitigate the spatial 2. MODAL BEAMFORMING aliasing problem. The alternative approaches are based on: spatial anti-aliasing filters, exploiting the natural diffraction Spherical array beamforming is motivated by the fact that ev- of the spherical baffle and compromising the directivity at ery soundfield on a spherical surface can be expressed as a higher frequencies. All approaches are practical and can be series of orthonormal spatial spherical harmonic modes implemented without increasing the system cost significantly. 00 n Index Terms- Microphone array, spherical microphone P(ka, i9, (o) = bn(ka) E3 AnmymYA(t, ()), (1) array, spatial aliasing where bn describes the modal response, Ynm are the spherical 1. INTRODUCTION harmonics of order n and degree m (see [7] for more details), k is the wavenumber and a is the radius of the sphere. bn Spherical microphone arrays have gained attention over the can be determined analytically once the array setup is known past few years ([1], [2], [3], [4], [5]). Due to their inherent (surface impedance, radial microphone position, nearfield vs. characteristics spherical arrays are an attractive solution for farfield sources. etc.). The coefficients bn are discussed later applications such as conferencing systems or room acoustic in more detail. All soundfield spatial information is contained measurement tools, to namejust a few. Among others the ma- in the Anm coefficients. The idea of the modal beamforming jor characteristics are (a) a coverage of the three-dimensional is to decompose the soundfield into its modal components and (3D) space independent of direction and (b) the beamformer therefore gain access to the the soundfield information Anmm associated with the array can be implemented in an efficient For an ideal continuous microphone aperture the isolation manner as modular blocks allowing for independent control of modes can be done by applying a pressure sensitive mate- of the beam pattern and steering direction. rial on the spherical surface where the sensitivity M(i', 0) is A potential drawback for spherical arrays is the relatively equivalent to the complex conjugate spherical harmonic: large number of microphones that are required. It can be shown that at least (N + 1)2 sensors are required where N Fnm in ] P(ka, V, ~o)M(i(, o) dQ is the highest order mode that the array uses for beamform- X ing. The higher the order N the higher the spatial resolution f m , of the array. For real-time arrays a maximum order of N =4 = jyP(kat,t0,c,)Ynm, (t$O,co) dQ is a reasonable compromise between good spatial resolution X and cost for the array (hardware as well as processing power). = bn' (ka)An',m' (2) 978-1-4244-2338-5/08/$25.00 C2008 IEEE 1 HSCMA 2008 Since bn are known, the soundfield information Anm,Tn' can be + easily computed. M stands for the aperture sensitivity func- n=1 tion and F is the array factor (note that at this point the aper- x n=2 n=3 ture is continuous and the term array is technically not cor- -* rect). The result given in Equ. 2 is derived simply by applying the orthonormality characteristic of spherical harmonics: 5 -20 0 n=6 E A 9 Q5 -35/------------>/- -yv)1yvrJ42........ -30 3. THE ORIGIN OF THE SPATIAL ALIASING 0.5 1 2 5 10 ka Although solutions exist that allow one to realize a continu- ous spherical aperture, it is not practical since one would only Fig. 2. Modal response of a spherical array with sensors have access to one mode at a time. To overcome this limita- mounted flush in an acoustically rigid sphere. tion one can sample the aperture at positions [a, Is, (s] and then apply weights to the sampled positions to extract multi- ple modes in parallel. This approach basically leads to mul- However.tsearnements t a ve. tiple weight-and-add beamformers, one for each mode. The performance outside the designed range. tiple~~~~ wegtan,d bafresonfrechmd. Te Up to this point the modal response bn have been ne- array factor for mode in, m' for the sampled aperture iS ex- U oti on h oa epneb aebe e press fas:or for mode n',m' glected. For a spherical array mounted on the surface of a rigid sphere bn becomes [7]: S-1 Fn',m' S P(ka, t, Wo)M(i.s, ( bn(ka) = jn(ka) h( )(ka) (5) s=O h$P2/(k a)n 00 5 bn (ka) x and is plotted in Fig. 2. Combining the modal response with n=O the aliasing error one can see that the 6th-order aliasing only n S-I starts for about ka > 4. Below this the 6th-order mode is E AnmT Ynm(,s, os)Yn7 (,s, fos) (4) virtually not present. For an array with a radius of 4cm this m=-n s=O implies that the theoretical upper frequency limit for 4th-order an_ni,m/e(in,m,in',m') modes is about 5.5kHz. Above this frequency the 4th-order modes will be seriously aliased by 6th order modes (and more where F is the array factor of the discretely sampled aper- higher order modes as the frequency continues to increase). ture. In cases where the last sum in Equ. 4 fulfills the or- thonormality constraint (e = 0), the array output agrees with 4. APPROACHES TO OVERCOME SPATIAL the continuous solution: F = F. However, due to the typi- ALIASING cally non-spatially-bandlimited nature of soundfields aliasing (e :t 0) will occur for all but very simple soundfields. From The most obvious solution to spatial aliasing is to employ a Equ. 4 it is also obvious that aliasing depends to a great ex- more dense sensor arrangement. This would either mean an tend on the sensor positions. This makes it difficult to present increase in the number of sensors or a reduction in radius. The a general closed-form mathematical description. In the fol- first approach can become prohibitively expensive while the lowing we focus on the characteristics of a 32-element array second one is not desirable due to poorer low frequency noise geometry where the microphones are placed at the center of performance [8]. Next, three alternative approaches are pro- the faces of a truncated icosahedron. posed to reduce spatial aliasing in spherical arrays. The solu- Figure 1 shows the level of the aliasing error e(n, m, n', tions require little or no additional resources and are therefore m') as a function of the order and degree of the spherical of practical relevance. harmonics. For easier visualization the order n and degree m are transformed to an index nm = (n + 1)2 - n + m. White 4.1. Limit spatial resolution at high frequencies lines are added to help separate the different orders n visually. The figure shows that the array is capable of extracting modes The first approach is motivated by a closer view at Fig. 1. It is up to 4th-order with little or no aliasing. Significant aliasing obvious that the higher order modes are more prone to spatial starts to with 6th-order soundfield modes leaking into the 4th- aliasing than the lower order modes. For example the 1st- order beam. Note that other configurations with 32 sensors order mode experiences its first aliasing from the 9th-order exist (e.g. [4]) that have no aliasing error up to 4th-order. mode. FromFig. 2 it canbe seenthat the 9th-order mode only 2 20 25 10 20 30 40 50 60 70 80 90 100 nm Fig. 1. Aliasing for the 32 element truncated icosahedron geometry. contributes significant energy from about ka > 7 onwards. Therefore, by scaling back in directivity from a forth order 10 + n=0 pattern to a first order pattern at high ka one can extend the 5 n=1 * n=2 frequency range from about ka = 4 to about ka = 7. o n-3 Note that the ka values and mode orders in this section x n=4 are specific to the truncated icosahedron sampling. However, | n=6 the general trend should hold for other sampling schemes as LA well. -20- 4.2. Use of spatial anti-aliasing filter -25- Time domain processing commonly employs anti-aliasing 10 20 30 40 60 80 low-pass filters. Unfortunately, spatial anti-aliasing filter are [ ] not as easy to deploy. The following development of an extended sensor covering an area of the spherical array sur- Fig. 3. Sensitivity of a conformal patch sensor to modes 0 to face gives a qualitative analysis that can be used to compute 6 depending on the size of the sensor. the spatial filter operation of a larger microphone. Spatial anti-aliasing filters are also explored in [6]. To gain some insight in the modal low pass filter we start by looking at a continuous circular conformal sensor aperture Tmoa2 w(1-cosido) ,n = o that covers the pole of a sphere. To keep the analysis simple M 2 / x we assume a constant sensitivity mo. The sensor output can 1 2n+1 then be computed according to: (Pn_2(costo)- Pn+(costo)) n>O 'Of (7) F mo ] P(w, r, io)dQC The result from Equ. 7 is plotted in Fig. 3. Note that the modal sensitivity is shown relative to the mode 0. The ordi- nate indicates the size of the sensor represented by the open- S n b1(w, r)AAn (i, 0) x ing angle 00. To the left side the sensor is small, to the ex- n=O treme it is a point source or a spatial Dirac function. Com- 2-r do paring the values towards the left of Fig. 3 one finds that moa2 f n Y(ji,cc)sin(c)ddfdc (6) they agree with the spherical harmonic transform of the spa- JJ tial Dirac function. Towards the right side of the figure the > ' ~~~~~~sensor increases in size and its sensitivity to the higher or- Mn (Wo mo ,a) der modes starts to drop. The uniform weighing of the sen- Equation 6 introduces the new measure Mn2. This measure sor aperture results in rather large sidelobes. These can be can be seen as the sensitivity of the sensor to a mode of order reduced by optimizing the aperture weighting function. The 3 result shows that a rather large sensor (do = 30) will be re- quired to attenuate modes of order 6 and higher. ka = 0.5 90 To apply 32 sensors of the required size onto a sphere - - ka=8.0 is not practical since the sensors would overlap significantly. 1soD ka = 30 A solution is to sample the continuous aperture with small sensors, making sure to place small sensors dense enough to avoid spatial aliasing up to the desired frequency limit. Small 180 O sensors in the overlapping region can be used by all channels covering this space. Sampling the large sensor area with small V sensors also allows for easier implementation of an aperture 21 330 weighting function. At a first glance this solution seems to increase the sys- 24 0 tem cost significantly since the number of sensors required becomes much larger. However, (a) the number of input chan- nels stays the same (in our example 32) and (b) since each input channel is now made up of many small microphones Fig. 4. Directivity pattern for an omnidirectional sensor one can use inexpensive capsules and still obtain good noise mounted flush on the surface of an acoustically rigid sphere. performance. One can also expect that the sensor tolerances - typically a major concern for array beamforming - are less 6 REFERENCES important since they will tend to average out among the many sensors that make up each input channel. [1] Meyer J. and Elko G. W., "A highly scalable spherical microphone array based on an orthonormal decomposi- 4.3. Using the natural diffraction of the sphere tion of the soundfield," Proc ofIEEE ICASSP, vol. II, pp. 1781-1784,2002. Utilizing the diffraction of the rigid spherical baffle is suit- able for spherical arrays that are mounted on the surface of an [2] Abhayapala T. D. and Ward D. B., "Theory and design acoustically rigid sphere. The sphere acts as an acoustic scat- of higher order sound field microphones using spherical terer which results in a direction dependent sensitivity for an microphone arrays," Proc of IEEE ICASSP, vol. II, pp. omnidirectional microphone that is positioned at or close to 19491952,2002. the surface of the sphere. This means that an omnidirectional [3] Daniel J. and Moreau S., "Further study of sound field sensor actually becomes a directional sensor. This effect if coding with higher order ambisonics," Proc. 116th AES more prominent towards higher frequencies. Fig. 4 shows the Convention, 2004. resulting directivity pattern for an omnidirectional point sen- sor positioned flush on the surface of a rigid sphere for several [4] Li Z., Duraiswami R., Grassi E., and Davies L. S., "Flex- frequencies ka. It can be seen that for ka = 2 and above the ible layout and optimal cancelation of the orthonormal- sensor exhibits directional properties. The maximum Direc- ity error for spherical microphone arrays," Proc ofIEEE tivity Index that is achieved towards higher ka is about 3.2dB. ICASSP, pp. 41-44, 2004. The Directivity Index will increase further if one uses mico- phone with larger membranes. [5] Rafaely B., "Analysis and design of spherical microphone To overcome the spatial aliasing one can exploit the scat- arrays," IEEE Trans. Speech Audio Process., vol. 13, no. tering and diffraction and use the output of the sensor closest 1, pp. 135-143,2005. to the current look-direction at high frequencies. Since only a [6] Rafaely B., Weiss B., and Bachmat E., "Spatial alias- single sensor is used no spatial aliasing will occur. ing in spherical microphone arrays," IEEE Trans. Signal Process., vol. 55, no. 3, pp. 1003-1010,2007. 5. CONCLUSION [7] E.G. Williams, Fourier Acoustics, Academic Press, San Diego, 1999. This paper presented three practical approaches to reduce the effect of spatial aliasing. The solutions emphasize the practi- [8] Meyer J. and Elko G. W., "Spherical microphone ar- cality of an implementation. The first (sec. 4.1) and last (sec. rays for 3d sound recording," in Audio Signal Process- 4.3) approach can be used in most current systems since it re- ingfor Next-Generation Multimedia Communication Sys- quires only a modification in the processing algorithm while tems, J. Benesty Y. (A.) Huang, Ed. Kluwer Academic the second approach (sec. 4.2) would require a new physical Publishers, Boston, 2004. array setup. 4