METHOD, COMPUTER READABLE STORAGE MEDIUM, AND APPARATUS FOR MULTICHANNEL AUDIO PLAYBACK ADAPTATION FOR MULTIPLE LISTENING POSITIONS
20170245083 · 2017-08-24
Inventors
- Sven Kordon (Wunstorf, DE)
- Jithin ZACHARIAS (Darmstadt, DE)
- Peter STEINBORN (Lehrte, DE)
- Ulrich Gries (Hannover, DE)
- Johannes Boehm (Goettingen, DE)
- Achim FREIMANN (Hannover, DE)
Cpc classification
H04S7/302
ELECTRICITY
H04S2400/11
ELECTRICITY
H04S3/002
ELECTRICITY
H04S2420/05
ELECTRICITY
G06F3/165
PHYSICS
International classification
Abstract
A method, a computer readable medium, and an apparatus for multichannel audio playback adaptation. A user input defining a listening region relative to a loudspeaker arrangement is received via an input. A distance calculating unit determines minimal distances between the listening region and the loudspeakers. A comparator compares the determined minimal distances with a critical distance. A parameter setting unit then adapts output parameters for those loudspeakers for which the determined minimal distance is below the critical distance.
Claims
1. A method for multichannel audio playback adaptation, the method comprising: receiving a user input defining a listening region relative to a loudspeaker arrangement; determining minimal distances between the listening region and the loudspeakers; comparing the determined minimal distances with a critical distance; and adapting output parameters for those loudspeakers for which the determined minimal distance is below the critical distance.
2. The method according to claim 1, wherein the user input is obtained by a touch pad of an input device.
3. The method according to claim 1, further comprising checking whether the user input defines an invalid listening region.
4. The method according to claim 1, wherein the output parameters for those loudspeakers for which the determined minimal distance is below the critical distance are adapted such that sound produced by those loudspeakers is less dominant relative to sound produced by the other loudspeakers.
5. The method according to claim 1, wherein the output parameters that are adapted are gain parameters and delay parameters.
6. The method according to claim 1, wherein the critical distance is determined based on the precedence or Haas effect.
7. The method according to claim 1, further comprising receiving a further user input selecting a listening region among one or more defined listening regions and adapting output parameters of the loudspeakers such that audio playback is optimized for the selected listening region.
8. A non-transitory program storage device, readable by a computer, tangibly embodying a program of instructions executable by the computer to perform a method according to claim 1.
9. An apparatus for multichannel audio playback adaptation, the apparatus comprising: an input configured to receive a user input defining a listening region relative to a loudspeaker arrangement; a distance calculating unit configured to determine minimal distances between the listening region and the loudspeakers; a comparator configured to compare the determined minimal distances with a critical distance; and a parameter setting unit configured to adapt output parameters for those loudspeakers for which the determined minimal distance is below the critical distance.
10. An apparatus for multichannel audio playback adaptation, the apparatus comprising a processing device and a memory device having stored therein instructions, which, when executed by the processing device, cause the apparatus to: receive a user input defining a listening region relative to a loudspeaker arrangement; determine minimal distances between the listening region and the loudspeakers; compare the determined minimal distances with a critical distance; and adapt output parameters for those loudspeakers for which the determined minimal distance is below the critical distance.
11. The apparatus of claim 9, wherein the user input is obtained by a touch pad of an input device.
12. The apparatus of claim 9, further comprising checking whether the user input defines an invalid listening region.
13. The apparatus of claim 9, wherein the output parameters for those loudspeakers for which the determined minimal distance is below the critical distance are adapted such that sound produced by those loudspeakers is less dominant relative to sound produced by the other loudspeakers.
14. The apparatus of claim 9, wherein the output parameters that are adapted are gain parameters and delay parameters.
15. The apparatus of claim 9, wherein the critical distance is determined based on the precedence or Haas effect.
16. The apparatus of claim 9, further comprising receiving a further user input selecting a listening region among one or more defined listening regions and adapting output parameters of the loudspeakers such that audio playback is optimized for the selected listening region.
17. The apparatus of claim 10, wherein the user input is obtained by a touch pad of an input device.
18. The apparatus of claim 10, wherein the output parameters for those loudspeakers for which the determined minimal distance is below the critical distance are adapted such that sound produced by those loudspeakers is less dominant relative to sound produced by the other loudspeakers.
19. The apparatus of claim 10, wherein the output parameters that are adapted are gain parameters and delay parameters.
20. The apparatus of claim 10, wherein the critical distance is determined based on the precedence or Haas effect.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0029]
[0030]
[0031]
[0032]
[0033]
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS
[0034] For a better understanding the principles of embodiments of the invention shall now be explained in more detail in the following description with reference to the figures. It is understood that the invention is not limited to these exemplary embodiments and that specified features can also expediently be combined and/or modified without departing from the scope of the present invention as defined in the appended claims. In the drawings, the same or similar types of elements or respectively corresponding parts are provided with the same reference numbers in order to prevent the item from needing to be reintroduced.
[0035]
[0036]
[0037] In
[0038] For example, the processing device 32 can be a processor adapted to perform the steps according to one of the described methods. In an embodiment said adaptation comprises that the processor is configured, e.g. programmed, to perform steps according to one of the described methods.
[0039] A processor as used herein may include one or more processing units, such as microprocessors, digital signal processors, or combination thereof.
[0040] The storage unit 26 and the memory device 31 may include volatile and/or non-volatile memory regions and storage devices such as hard disk drives, DVD drives, and solid-state storage devices. A part of the memory is a non-transitory program storage device readable by the processing device 32, tangibly embodying a program of instructions executable by the processing device 32 to perform program steps as described herein according to the principles of the invention.
[0041] In the following further implementation details shall be described.
[0042] The proposed solution consists basically of three steps. In the first step the user creates multiple listening regions, e.g. using a user interface of a playback device. In the second step one set of loudspeaker correction gain and delay values is computed for each listening region. Both steps need to be performed only once because the gain and delay values are constant for each listening region. In the third step the gain and delay values for a selected listening region are applied to the signals of the corresponding loudspeakers.
[0043] In order to handle different listener arrangements, the user defines different listening regions for typical arrangements. A listening region specifies the area (region) where the listeners are located in the room. In the simplest case a listening region is described by one or more polygons or concatenated rectangles in a plane that is parallel to the floor and at the height of the listener's ears. This assumes that the ears of all listeners are approximately on the same level. If the height of the ears should also be variable a listening region can alternatively be defined by a three dimensional mesh grid or by concatenated cuboids. In the following for simplicity the 2D plane is used to describe the further processing.
[0044] The listening regions are preferably specified by the user relative to the loudspeaker positions using a top view of the listening room. It has to be assured that the distances between the loudspeakers and the borders of the listening region match the real distances in the room. Therefore, it is assumed that the absolute position of each loudspeaker, for example given in Cartesian coordinates, is known by the playback device. A grid of a variable scale, for example of 20 cm×20 cm, can be used to help the user to approximate the right dimensions of the listening region in relation to the positions of the loudspeakers.
[0045] One example of a listening region for one listener is illustrated in
[0046] For each listening region the following processing is applied. First the minimal distance between the listening region and each loudspeaker is computed. For all loudspeakers that have a minimal distance to the listening region that is smaller than a critical distance, correction gain and delay values are computed. The critical distance defines the distance for which the delay and level of the loudspeaker signal has to be adapted because it would otherwise be annoying for the closest listener. It is preferably computed from the well-known precedence or Haas effect. This effect describes the dominance of one sound source over another with respect to the relative delay and level differences between the sources. Thus a delay and a gain value are obtained to make the dominant sound from the closest loudspeaker less dominant relative to other loudspeaker signals.
[0047] In the following an exemplary solution for the computation of the critical distance r.sub.crit, the gain value α.sub.i, and the time delay value Δt.sub.i for the adaptation of the i-th loudspeaker signal x.sub.i at a minimal distance r.sub.i between the listener area and the loudspeaker shall be described. In a first step the vector vpointing to the geometric centroid of the loudspeaker setup is computed:
where L is the total number of loudspeakers and x.sub.i describes the position of the i-th loudspeaker by a vector in Cartesian coordinates. Also computed is the average distance of the given loudspeaker setup
where |v−x.sub.1| defines the Euclidean distance between the two vectors.
[0048] The critical distance can be computed from the precedence effect. The precedence effect says that if the time delay between two identical signals that arrive from two different directions at the listener is less than 5 ms, the listener will perceive only one source at a position in between the two impinging directions. Therefore, one source is not perceived as dominant over the other if the precedence effect is valid. As a compromise for all listeners it is assumed that a loudspeaker is perceived as dominant if its signal arrives Δt.sub.max=5 ms earlier at the position v than a signal of a loudspeaker that has a distance of r.sub.mean to v. The critical distance between a loudspeaker and the listening array is then defined by
r.sub.crit=max(r.sub.mean−c.Math.Δt.sub.max,0)
with the speed of sound
[0049] The pressure level of a spherical source, which is used to model the loudspeaker, is inversely proportional to the distance from the source. The idea is to correct the pressure level of the loudspeaker to the level at the distance r.sub.crit by:
[0050] Under consideration of some boundary values the gain values are computed from:
[0051] For example, the gain value may be corrected to the level at the average loudspeaker distance if the critical distance is equal to zero.
[0052] The signal of a loudspeaker that has a minimal distance smaller than the critical distance is delayed in a way that the total runtime from the loudspeaker to the listening array is equal to
Therefore, the additional time delay values are determined by:
[0053] Finally, the corrected speaker signals {circumflex over (x)}.sub.i(t) are determined from
{circumflex over (x)}.sub.i(t)=α.sub.i.Math.x.sub.i(t−Δt.sub.i).
[0054] For playback the user selects the listening region that best matches the actual positioning of the listeners. The gain and delay values that have been computed for the selected listening region are then applied to the corresponding loudspeaker signals.