SOUND IMAGE LOCALIZATION DEVICE, SOUND IMAGE LOCALIZATION METHOD, AND PROGRAM
20220157292 · 2022-05-19
Inventors
- Kenta IMAIZUMI (Musashino-shi, Tokyo, JP)
- Kimitaka TSUTSUMI (Musashino-shi, Tokyo, JP)
- Atsushi NAKADAIRA (Musashino-shi, Tokyo, JP)
Cpc classification
H04S2420/11
ELECTRICITY
G10K11/34
PHYSICS
International classification
Abstract
Provided is a sound image localization device capable of flexibly controlling directivity with a short calculation time. A sound image localization device that reflects, on a reflector 50, a sound signal radiated from a speaker array 40 arranged with a plurality of speakers SP.sub.1 to SP.sub.Q on a straight line to localize a sound image includes an expansion coefficient calculation unit 10 configured to analytically calculate expansion coefficients by performing a spherical harmonic function expansion on a window function representing desired directivity, a filter coefficient generation unit 20 configured to convert the expansion coefficients into filter coefficients corresponding to each of the speakers SP.sub.1 to SP.sub.Q, and a speaker drive unit 30 configured to generate a speaker drive signal for driving each of the speakers SP.sub.1 to SP.sub.Q by convolving the filter coefficients in a voice signal.
Claims
1. A sound image localization device that reflects, on a reflector, a sound signal radiated from a speaker array arranged with a plurality of speakers on a straight line to localize a sound image, the sound image localization device comprising: an expansion coefficient calculation unit, including one or more processors, configured to analytically calculate expansion coefficients by performing a spherical harmonic function expansion on a window function representing desired directivity; a filter coefficient generation unit, including one or more processors, configured to generate filter coefficients corresponding to each of the speakers from the expansion coefficients; and a speaker drive unit, including one or more processors, configured to generate a speaker drive signal for driving each of the speakers by convolving the filter coefficients in a voice signal.
2. The sound image localization device according to claim 1, wherein the filter coefficient generation unit is configured to generate the filter coefficient by multiplying each of the expansion coefficients by a corresponding weight of each of the speakers based on spherical harmonic functions being reproduced by the speakers.
3. A sound image localization method to be executed by a sound image localization device that reflects, on a reflector, a sound signal radiated from a speaker array arranged with a plurality of speakers on a straight line to localize a sound image, the sound image localization method comprising: analytically calculating expansion coefficients by performing a spherical harmonic function expansion on a window function representing desired directivity; generating filter coefficients corresponding to each of the speakers from the expansion coefficients; and generating a speaker drive signal for driving each of the speakers by convolving the filter coefficients in a voice signal.
4. A recording medium storing a program, wherein execution of the program causes one or more computers to perform operations comprising: analytically calculating expansion coefficients by performing a spherical harmonic function expansion on a window function representing desired directivity; generating filter coefficients corresponding to each of speakers from the expansion coefficients; and generating a speaker drive signal for driving each of the speakers by convolving the filter coefficients in a voice signal.
5. The recording medium according to claim 4, wherein generating the filter coefficients further comprises generating the filter coefficient by multiplying each of the expansion coefficients by a corresponding weight of each of the speakers based on spherical harmonic functions being reproduced by the speakers.
Description
BRIEF DESCRIPTION OF DRAWINGS
[0013]
[0014]
[0015]
[0016]
[0017]
[0018]
[0019]
DESCRIPTION OF EMBODIMENTS
[0020] Embodiments of the present invention will be described below with reference to the drawings. In the plurality of drawings, the same components are denoted by the same reference numerals, and will not be repeatedly described.
[0021]
[0022] A sound image localization device 100 shown in
[0023]
[0024] As shown in
[0025] The expansion coefficient calculation unit 10 analytically calculates an expansion coefficient by performing a spherical harmonic function expansion on a window function representing desired directivity. The desired directivity is given from the outside by a beam width θ.sub.ω (0<θ.sub.ω≤π).
[0026] The window function will be described by taking a cosine window (Expression (1)) as an example. An example of another window function includes a rectangular window.
[0027] (Spherical Harmonic Function)
[0028] Here, a polar coordinate system shown in
[0029] Here, Y.sup.m.sub.n(θ, ϕ) represents a spherical harmonic function, and A.sup.m.sub.n(ω) represents an expansion coefficient thereof, which can be expressed by the following expression, respectively.
[0030] Here, Pmn(•) represents an associated Legendre function, and Expression (4) is called a spherical harmonic function expansion.
[0031]
[0032] When a spherical harmonic function expansion is performed in a state where a desired characteristic d(θ) modeled in Expression (1) is substituted into S(r,θ,ϕ,ω) of Expression (2) and the order m of the spherical harmonic function is set to 0, an expansion coefficient A.sup.0.sub.n corresponding to the multi-pole sound source can be obtained.
[0033] An expansion coefficient for degrees up to n=2 are shown below.
[0034] An expansion coefficient can be analytically derived for degrees after n=2 as well.
[0035] The filter coefficient generation unit 20 generates a filter coefficient corresponding to each of the speakers forming the speaker array 40 from the expansion coefficient A.sup.m.sub.n by the following expression (step S2 (
[0036] (Directivity Control Technology Using Multi-Pole Sound Source)
[0037] A method is known in which desired directivity is developed by a spherical harmonic function and the obtained expansion coefficient A.sup.0.sub.n is applied to a multi-pole sound source to form directivity (for example, Reference Literature: Yoichi HANEDA et al., “Directivity synthesis using multipole sources based on spherical harmonic function expansion”, The Journal of the Acoustical Society of Japan, 69.11, 2013, 577-588).
[0038] The multi-pole sound source is a sound source in which point sound sources having the same amplitude are distributed in anti-phases as positions as close as possible to the origin. For example, when point sound sources are arranged at minute distances d in a z-axis direction, a sound pressure distribution M.sup.0.sub.n(r,θ,ϕ,ω) of the multi-pole sound source can be expressed by the following expression.
[0039] The approximation is z=cos θ established when 1<<kr. A symbol Q represents an intensity of the point sound source. A symbol k represents a wavenumber (k=ω/c). In addition, the multi-pole sound source has directivity very similar to the spherical harmonic function, and the speaker array 40 arranged in the z-axis direction can reproduce directivity similar to the spherical harmonic function when the order m is 0.
[0040] In other words, the application to the multi-pole sound source can be expressed by the following expression.
[0041] The filter coefficient generation unit 20 generates a filter coefficient w(ω) by multiplying each expansion coefficient A.sup.m.sub.n by a corresponding weight D.sup.0.sub.n(ω) of each of the speakers when the spherical harmonic functions are reproduced by the speakers SP.sub.1 to SP.sub.Q (Expression (11)).
[0042] The weight D.sup.0.sub.n(ω) can be expressed by the following expression when the number of speakers corresponding to the spherical harmonic functions for the degrees up to n=2 is five, for example.
[0043] Here, a symbol d represents a distance between the speakers SP.sub.1 to SP.sub.Q (the above-described minute distance). In addition, a symbol k represents the wavenumber (k=ω/c), and a symbol c represents a speed of light.
[0044] The speaker drive unit convolves the filter coefficient w(ω) in the voice signal input from the outside to generate speaker drive signals for driving the speakers SP.sub.1 to SP.sub.Q, respectively. As is clear from Expression (12), the speaker drive signal for degree n=0 is input only to the speaker SP3 with A.sup.0.sub.n(¼π).sup.0.5. The speaker drive signal for degree n=1 is input to the speakers SP.sub.2 and SP.sub.4. The speaker drive signal for degree n=2 is input to the speakers SP.sub.2, SP.sub.3, and SP.sub.4.
[0045] When such speaker drive signals are input to the speaker array 40, a sound signal corresponding to the desired directivity can be reproduced.
[0046] As described above, the sound image localization device 100 according to the embodiment is a sound image localization device that reflects, on the reflector 50, the sound signal radiated from the speaker array 40 arranged with the plurality of speakers in the straight line to localize the sound image, and includes the expansion coefficient calculation unit 10, the filter coefficient generation unit 20, and the speaker drive unit 30. The expansion coefficient calculation unit 10 performs the spherical harmonic function expansion on the window function indicating the desired directivity to analytically calculate the expansion coefficient. The filter coefficient generation unit 20 generates, from the expansion coefficient A.sup.m.sub.n, the filter coefficient w(ω) corresponding to each of the speakers SP.sub.1 to SP.sub.Q. The speaker drive unit 30 convolves the filter coefficient w(ω) in the voice signal to generate the speaker drive signals for driving the speakers SP.sub.1 to SP.sub.Q, respectively.
[0047] Thus, it is possible to provide the sound image localization device 100 that can flexibly control the directivity with a short calculation time.
[0048] (Sound Image Localization Method)
[0049] A sound image localization method executed by the sound image localization device 100 will be described below.
[0050]
[0051] First, the sound image localization device 100 is set with a beam width representing desired directivity (step S1). The beam width θ.sub.w (Expression (1)) is input to the expansion coefficient calculation unit 10 from the outside (step S1).
[0052] Next, the expansion coefficient calculation unit 10 performs the spherical harmonic function expansion on the window function representing the desired directivity d(θ) to analytically calculate the expansion coefficient A.sup.m.sub.n (step S2).
[0053] Next, the filter coefficient generation unit 20 generates a filter coefficient w(ω) corresponding to each of the speakers SP.sub.1 to SP.sub.Q forming the speaker array 40 from the expansion coefficient A.sup.m.sub.n (step S3). The filter coefficient generation unit 20 generates a filter coefficient w(ω) by multiplying each expansion coefficient A.sup.m.sub.n by a corresponding weight D.sup.0.sub.n(ω) of each of the speakers SP.sub.1 to SP.sub.Q when the spherical harmonic functions are reproduced by the speakers SP.sub.1 to SP.sub.Q (Expression (11)).
[0054] The speaker drive unit 30 convolves the filter coefficient w(ω) in the voice signal input from the outside to generate speaker drive signals for driving the speakers SP.sub.1 to SP.sub.Q, respectively (step S4).
[0055] As described above, the sound image localization method according to the embodiment is a sound image localization method to be executed by the sound image localization device 100 that reflects, on the reflector 50, the sound signal radiated from the speaker array 40 arranged with the plurality of speakers SP.sub.1 to SP.sub.Q on the straight line to localize the sound image. The sound image localization method according to the embodiment includes: expansion coefficient calculation step S2 of analytically calculating expansion coefficients A.sup.m.sub.n by performing a spherical harmonic function expansion on a window function representing desired directivity; filter coefficient generation step S3 of generating filter coefficients w(ω) corresponding to each of the speakers SP.sub.1 to SP.sub.Q from the expansion coefficients A.sup.m.sub.n; and speaker drive step S4 of generating a speaker drive signal for driving each of the speakers SP.sub.1 to SP.sub.Q by convolving the filter coefficients w(ω) in a voice signal. Thus, it is possible to provide the sound image localization method capable of flexibly controlling the directivity with a short calculation time.
[0056]
[0057] Reference numeral 103 indicates a direct sound, reference numeral 104 indicates a reflected sound, and reference numeral 105 indicates a listening point. According to the sound image localization device 100, the listener located at the listening point 105 can perceive the upward sound image localization without using many control points.
Comparative Example
[0058]
[0059] From the directivity control by the least-squares method, a filter coefficient is obtained to minimize the sum of squares of an error between the desired directivity and the directivity observed at the control point. Accordingly, a calculation quantity increases. The directivity control by the least-squares method is well known, and thus will not be described by expressions.
[0060] Further, according to the method based on Non-Patent Literature 1, sound is reflected on the ceiling due to directivity reproduction of a regular polyhedron speaker and upward sound image localization is realized. In such a method, the directivity is formed using a normalized matched filter.
[0061] The normalized matched filter is obtained by providing a filter that matches the observed sound signal when the sound signal radiated from the speaker is observed at the observation point with the sound signal emitted by the speaker. Therefore, a transfer function to the target observation point is required for all of the speakers, resulting in an increase in calculation quantity.
[0062] In the sound image localization method according to the embodiment contrary to the comparative example, the expansion coefficient is analytically calculated by performing the spherical harmonic function expansion on the window function representing the desired directivity, and the filter coefficient corresponding to each of the speakers is generated from the expansion coefficient, thereby the calculation quantity can be reduced. In other words, it is possible to provide the sound image localization method capable of flexibly controlling the directivity with a short calculation time.
[0063] The characteristic function units of the sound image localization device 100 according to the embodiment can be realized by the computer including the ROM, the RAM, and the CPU. In such a case, the content of the function to be processed by each of the function units is described by the program. Such a program can be distributed via a recording medium such as a CD-ROM or a transmission medium such as the Internet.
[0064] It goes without saying that the present invention includes various embodiments and the like not described herein. Therefore, the technical scope of the present invention is defined only by the matters specifying the invention relating to the reasonable claims from the above description.
REFERENCE SIGNS LIST
[0065] 10 Expansion coefficient calculation unit [0066] 20 Filter coefficient generation unit [0067] 30 Speaker drive unit [0068] 40 Speaker array [0069] 50 Reflector (ceiling) [0070] 100 Sound image localization device [0071] 103 Direct sound [0072] 104 Reflected sound [0073] 105 Listening point