Apparatus and method for processing soundfield data

10433093 ยท 2019-10-01

Assignee

Inventors

Cpc classification

International classification

Abstract

An apparatus for processing soundfield data is provided. The soundfield data defines a soundfield within a spatial reproduction region comprising at least one bright zone and at least one quiet zone. The apparatus comprises an applicator configured to apply a spatially continuously varying weighting function to the soundfield data in order to obtain weighted soundfield data defining a weighted soundfield, wherein the spatially continuously varying weighting function is configured to enhance the soundfield in at least one of the bright zone and the quiet zone.

Claims

1. An apparatus for processing soundfield data, the soundfield data defining a soundfield within a spatial reproduction region comprising an at least one bright zone and an at least one quiet zone, the apparatus comprising: an applicator that applies a spatially continuously varying weighting function to the soundfield data to obtain a weighted soundfield data defining a weighted soundfield, wherein the spatially continuously varying weighting function enhances the soundfield in at least one of the group consisting of: the at least one bright zone and the at least one quiet zone; and a compressor that compresses the soundfield data based on a performance measure associated with the weighted soundfield.

2. The apparatus of claim 1, wherein the compressor compresses the soundfield data, in a case where the performance measure associated with the weighted soundfield differs from a predefined performance measure threshold.

3. The apparatus of claim 1, wherein the performance measure associated with the weighted soundfield is an acoustical contrast between the at least one bright zone and the at least one quiet zone of the weighted soundfield.

4. The apparatus of claim 3, wherein the acoustical contrast between the bright zone and the quiet zone is obtained based on a ratio between an average of the weighted soundfield in the at least one bright zone and an average of the weighted soundfield in the at least one quiet zone.

5. The apparatus of claim 3, wherein the acoustical contrast between the at least one bright zone and the at least one quiet zone is obtained based on the following: ( t ) = 10 log 10 b .Math. S ( x , t ) w ( x ) .Math. 2 dx / D b q .Math. S ( x , t ) w ( x ) .Math. 2 dx / D q , wherein (t) denotes the acoustical contrast as a function of time (t), S(x, t) denotes the soundfield data defining the soundfield as a function of a space and a time, w(x) denotes the spatially continuously varying weighting function and D.sub.b and D.sub.q denote a size of the at least one bright zone and a size of the at least one quiet zone, respectively.

6. The apparatus of according to claim 1, wherein the spatially continuously varying weighting function is a smoothly changing function that enhances the soundfield associated with the soundfield data in the at least one bright zone and the at least one quiet zone relative to a portion of the spatial reproduction region outside of the at least one bright zone and the at least one quiet zone.

7. The apparatus according to claim 1, wherein the spatially continuously varying weighting function is a linear combination of a first normal distribution centered at a center of the at least one bright zone and a second normal distribution centered at a center of the at least one quiet zone.

8. The apparatus according to claim 1, wherein the soundfield data is encoded in a Higher Order Ambisonic (HOA) B-Format.

9. The apparatus according to claim 1, wherein the apparatus further comprises a memory that stores the soundfield data to be weighted by the spatially continuously varying weighting function.

10. The apparatus according to claim 1 further comprising a renderer that renders the weighted soundfield based on the weighted soundfield data.

11. The apparatus of claim 1 further comprising: a soundfield reproduction apparatus that receives the weighted soundfield data; and a renderer that renders the weighted soundfield based on the weighted soundfield data.

12. The apparatus of claim 11, wherein the soundfield reproduction apparatus further comprises a performance measure determiner that determines the performance measure based on the weighted soundfield and feeds back the performance measure associated with the weighted soundfield to the compressor.

13. A method for processing a soundfield data, the soundfield data defining a soundfield within a spatial reproduction region comprising an at least one bright zone and an at least one quiet zone, the method comprising: applying a spatially continuously varying weighting function to the soundfield data to obtain a weighted soundfield data defining a weighted soundfield, wherein the spatially continuously varying weighting function enhances the soundfield in the at least one of the group consisting of: the at least one bright zone and the at least one quiet zone; and compressing the soundfield data based on a performance measure associated with the weighted soundfield.

14. The method of claim 13, wherein the performance measure associated with the weighted soundfield is an acoustical contrast between the at least one bright zone and the at least one quiet zone of the weighted soundfield.

15. The method of claim 14, wherein the acoustical contrast between the bright zone and the quiet zone is obtained based on a ratio between an average of the weighted soundfield in the at least one bright zone and an average of the weighted soundfield in the at least one quiet zone.

16. The method of claim 14, wherein the acoustical contrast between the at least one bright zone and the at least one quiet zone is obtained based on the following: ( t ) = 10 log 10 b .Math. S ( x , t ) w ( x ) .Math. 2 dx / D b q .Math. S ( x , t ) w ( x ) .Math. 2 dx / D q , wherein (t) denotes the acoustical contrast as a function of time (t), S(x, t) denotes the soundfield data defining the soundfield as a function of a space and a time, w(x) denotes the spatially continuously varying weighting function and D.sub.b and D.sub.q denote a size of the at least one bright zone and a size of the at least one quiet zone, respectively.

17. A non-transitory computer readable storage medium having a computer-executable instructions that, when executed by a processor, facilitate carrying out a method for processing a soundfield data, the soundfield data defining a soundfield within a spatial reproduction region comprising an at least one bright zone and an at least one quiet zone, the method comprising: applying a spatially continuously varying weighting function to the soundfield data to obtain a weighted soundfield data defining a weighted soundfield, wherein the spatially continuously varying weighting function enhances the soundfield in the at least one of the group consisting of: the at least one bright zone and the at least one quiet zone; and compressing the soundfield data based on a performance measure associated with the weighted soundfield.

18. The non-transitory computer-readable medium of claim 17, wherein the performance measure associated with the weighted soundfield is an acoustical contrast between the at least one bright zone and the at least one quiet zone of the weighted soundfield.

19. The non-transitory computer-readable medium of claim 18, wherein the acoustical contrast between the bright zone and the quiet zone is obtained based on a ratio between an average of the weighted soundfield in the at least one bright zone and an average of the weighted soundfield in the at least one quiet zone.

20. The non-transitory computer-readable medium of claim 18, wherein the acoustical contrast between the at least one bright zone and the at least one quiet zone is obtained based on the following: ( t ) = 10 log 10 b .Math. S ( x , t ) w ( x ) .Math. 2 dx / D b q .Math. S ( x , t ) w ( x ) .Math. 2 dx / D q , wherein (t) denotes the acoustical contrast as a function of time (t), S(x, t) denotes the soundfield data defining the soundfield as a function of a space and a time, w(x) denotes the spatially continuously varying weighting function and D.sub.b and D.sub.q denote a size of the at least one bright zone and a size of the at least one quiet zone, respectively.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

(1) Further embodiments of the disclosure will described with respect to the following figures, wherein:

(2) FIG. 1 shows a schematic diagram of an apparatus for processing soundfield data according to an embodiment;

(3) FIG. 2 shows a schematic diagram of a method for processing soundfield data according to an embodiment;

(4) FIG. 3 shows a schematic diagram of a soundfield reproduction system according to an embodiment comprising an apparatus for processing soundfield data according to an embodiment;

(5) FIG. 4 shows a diagram illustrating the dependence of the averaged acoustic contrast performance as a function of a transmission bitrate for a plurality of different compression techniques that can be implemented in a soundfield reproduction system shown in FIG. 3;

(6) FIG. 5 shows a schematic diagram of an apparatus for processing soundfield data according to an embodiment;

(7) FIG. 6 shows a schematic diagram illustrating different aspects of embodiments of the disclosure; and

(8) FIG. 7 shows a schematic diagram illustrating different aspects of embodiments of the disclosure.

(9) In the various figures, identical reference signs will be used for identical or at least functionally equivalent features.

DETAILED DESCRIPTION OF THE EMBODIMENTS

(10) In the following description, reference is made to the accompanying drawings, which form part of the disclosure, and in which are shown, by way of illustration, specific aspects in which the present disclosure may be placed. It is understood that other aspects may be utilized and structural or logical changes may be made without departing from the scope of the present disclosure. The following detailed description, therefore, is not to be taken in a limiting sense, as the scope of the present disclosure is defined be the appended claims.

(11) For instance, it is understood that a disclosure in connection with a described method may also hold true for a corresponding device or system configured to perform the method and vice versa. For example, if a specific method step is described, a corresponding device may include a unit to perform the described method step, even if such unit is not explicitly described or illustrated in the figures. Further, it is understood that the features of the various exemplary aspects described herein may be combined with each other, unless specifically noted otherwise.

(12) FIG. 1 shows a schematic diagram of an apparatus 100 for processing soundfield data. As schematically indicated on the right hand side of FIG. 1, the soundfield data defines a soundfield within a spatial reproduction region 101 comprising at least one bright zone 101a and at least one quiet zone 101b.

(13) The term soundfield data is used herein to refer to any data which includes information relating to directional characteristics of the sound it represents. Soundfield data can be represented in a variety of different formats, each of which has a defined number of audio channels, and requires a different interpretation in order to reproduce the sound represented. Examples of such formats include stereo, 5.1 surround sound and formats such Higher Order Ambisonic (HOA) formats, in particular HOA B-format.

(14) The spatial reproduction region of the soundfield defined by the soundfield data can have a plurality of different shapes. In an implementation form the soundfield can be three-dimensional or two-dimensional with the spatial reproduction region, the bright zone and the quiet zone lying in a two-dimensional plane. In an implementation form the bright zone and the quiet zone can have spherical, cylindrical or circular shapes. Other shapes are possible.

(15) The apparatus 100 comprises an applicator 103 configured to apply a spatially continuously varying weighting function to the soundfield data in order to obtain weighted soundfield data defining a weighted soundfield. The spatially continuously varying weighting function is configured to enhance the soundfield in the bright zone 101a and/or the quiet zone 101b of the spatial reproduction region 101.

(16) In an embodiment, the apparatus 100 further comprises a compressor 105 configured to compress the soundfield data on the basis of a performance measure associated with the weighted soundfield.

(17) In an embodiment, the compressor 105 is configured to compress the soundfield data, in case the performance measure associated with the weighted soundfield differs from a predefined performance measure threshold.

(18) In an embodiment, the performance measure associated with the weighted soundfield is an acoustical contrast between the at least one bright zone 101a and the at least one quiet zone 101b of the weighted soundfield.

(19) In an embodiment, the acoustical contrast between the bright zone 101a and the quiet zone 101b is based on a ratio between an average of the weighted soundfield in the bright zone 101a and an average of the weighted soundfield in the quiet zone 101b.

(20) In an embodiment, the acoustical contrast between the bright zone 101a and the quiet zone 101b is based on the following equation:

(21) ( t ) = 10 log 10 b .Math. S ( x , t ) w ( x ) .Math. 2 dx / D b q .Math. S ( x , t ) w ( x ) .Math. 2 dx / D q , ( 1 )
wherein (t) denotes the acoustical contrast as a function of time, S(x,t) denotes the soundfield associated with the soundfield data as a function of space and time, w(x) denotes the spatially continuously varying weighting function and D.sub.b and D.sub.q denote the size of the bright region 101a and the size of the quiet region 101b, respectively.

(22) In an embodiment, the spatially continuously varying weighting function is a smoothly changing function configured to enhance the soundfield associated with the soundfield data in the bright region 101a and the quiet region 101b relative to the portions of the spatial reproduction region 101 outside of the bright region 101a and the quiet region 101b.

(23) In an embodiment, the spatially continuously varying weighting function is a linear combination of a first normal distribution centered at a center of the bright zone 101a and a second normal distribution centered at a center of the quiet zone 101b. This preferred choice of the spatially continuously varying weighting function is based on the finding that, in practice, the position of the listener's head (ears) is not guaranteed to be stationary within the bright region and/or quiet region due to the movement of its body. Rather, the distribution of listener's head position can be modelled as a Gaussian distribution function of its distance to the center of the bright zone and the quiet zone, respectively. Thus, in an embodiment, the spatially continuously varying weighting function can be defined by the following equation:

(24) w ( x ) = a a 2 e - ( .Math. x - O b .Math. ) 2 2 a 2 + b b 2 e - ( .Math. x - O q .Math. ) 2 2 b 2 , ( 2 )
wherein w(x) denotes the spatially continuously varying weighting function, O.sub.b denotes the center of the bright zone, O.sub.q denotes the center of the quiet zone and a, b, .sub.a and .sub.b denote predefined weighting function parameters.

(25) With the above preferred choice for the weighting function the probability that the listener's head is positioned within a circle of radius r/2 from the center of the bright zone (or equivalently the center of the quiet zone) is 68.3%. With this choice of the weighting function, the system will distribute the importance of the reproduction accuracy over different zones in a more flexible and efficient manner due to the introduction of the smoothly and continuously changing weighting function. More emphasis will be attached to the region where the listener' ears are more likely to appear (e.g. the central region of the bright and quiet zone), while the reproduction effort might be distracted in some region (e.g. the edge of the bright and quiet zone) in order to alleviate the occurrence of spurious sound outside of the bright zone and the quiet zone.

(26) FIG. 2 shows a schematic diagram of a method 200 for processing soundfield data according to an embodiment, for instance, the soundfield data defining a soundfield within the spatial reproduction region 101 shown in FIG. 1, comprising the acoustically bright zone 101a and the acoustically quiet zone 101b.

(27) The method 200 comprises the step 201 of applying a spatially continuously varying weighting function to the soundfield data, for instance, the spatially continuously varying weighting function defined in equation (2) above, in order to obtain weighted soundfield data defining a weighted soundfield, wherein the spatially continuously varying weighting function is configured to enhance the soundfield in the bright zone 101a and/or the quiet zone 101b.

(28) Further implementation forms, embodiments and aspects of the apparatus 100 for processing soundfield data and the method 200 for processing soundfield data will be described in the following.

(29) FIG. 3 shows a schematic diagram of a soundfield reproduction system 300 according to an embodiment comprising an apparatus 100 for processing soundfield data according to an embodiment.

(30) In the embodiment of the apparatus 100 for processing soundfield data shown in FIG. 3, the applicator 103 shown in FIG. 1 is referred to as a Multizone HOA format converter 103 and the compressor 105 shown in FIG. 1 is referred to as Compression. In addition to the applicator 103 and the compressor 105 the embodiment of the apparatus 100 for processing soundfield data shown in FIG. 3 comprises an acquisition device 107 configured to acquire the original, i.e. non-weighted, soundfield data. In an embodiment, the acquisition device 107 can comprise one or more microphones, such as a 32-channel Eigenmike. In an embodiment, the acquisition device 107 can be a communication interface configured to receive the original, i.e. non-weighted, soundfield data from another device.

(31) In an embodiment, the acquisition device 107 is configured to provide the original, i.e. non-weighted, soundfield data in HOA B-format to a HOA format converter 109 configured to perform a plane wave decomposition of the HOA B-format soundfield data into the spherical/circular harmonic domain resulting in the soundfield data S(x,k), wherein x denotes the position vector and k denotes the wave number, or equivalently the soundfield data S(x,t), wherein t denotes time.

(32) The HOA format converter 109 of the embodiment of the apparatus 100 for processing soundfield data shown in FIG. 3 is configured to provide the soundfield data S(x,k) (or equivalently S(x,t)) to the applicator 103, which, as already mentioned above, in the embodiment shown in FIG. 8 is referred to as the Multizone HOA format converter 103. As already described in the context of the embodiment shown in FIG. 1, the applicator 103 is configured to apply a spatially continuously varying weighting function to the soundfield data provided by the HOA format converter 109 in order to obtain weighted soundfield data defining a weighted soundfield. The spatially continuously varying weighting function used by the applicator 103 is configured to enhance the soundfield in the bright zone 101a and/or the quiet zone 101b of the spatial reproduction region 101. In an embodiment, the applicator 103 is configured to provide the weighted soundfield data as HOA-B format weighted soundfield data. As schematically indicated in FIG. 3, in order to be able to perform this conversion to the HOA-B format, the applicator 103 requires as input some information about the soundfield and the weighting function, such as the location of the bright zone and/or the quit zone.

(33) In the embodiment shown in FIG. 3, the apparatus 100 for processing soundfield data comprises in addition an electronic storage or memory 111 configured to store soundfield data to be processed by the applicator 103, i.e. to be weighted by the spatially continuously varying weighting function. Thus, in embodiments, the applicator 103 can be configured to process soundfield data provided by either one or by both of the HOA format converter 109 or the storage 111.

(34) In the embodiment shown in FIG. 3 the weighted soundfield data generated by the applicator 103 is provided to the compressor 105, which is configured to compress the weighted soundfield data using one or more conventional compression techniques. As will be described in more detail further below, in an embodiment, the compressor 105 is configured to adapt its compression rate for compressing the weighted soundfield data on the basis of a performance measure, which is being fed back to the compressor 105 from the soundfield reproduction apparatus 310 shown in FIG. 3.

(35) In the embodiment shown in FIG. 3 the apparatus 100 for processing soundfield data and the soundfield reproduction apparatus 310 are part of the soundfield reproduction system 300. In other embodiment, the apparatus 100 for processing soundfield data and the soundfield reproduction apparatus 310 can be separated in space and/or time. For instance, the apparatus 100 for processing soundfield data could be implemented as a web server providing the compressed weighted soundfield data over the Internet to the soundfield reproduction apparatus 310 implemented as a web client. In such a scenario the apparatus 100 for processing soundfield data can be considered to be an encoder, whereas the soundfield reproduction apparatus 310 can be considered to be a corresponding decoder.

(36) In the embodiment shown in FIG. 3, the soundfield reproduction apparatus 310 comprises a decompressor 312 configured to decompress the compressed weighted soundfield data provided by the apparatus 100 for processing soundfield data. In case the compressor 105 and the decompressor 312 are implemented to use lossless compression techniques the decompressor 312 can fully restore the weighted soundfield data. Furthermore, the soundfield reproduction apparatus 310 comprises a renderer 313 configured to render, i.e. reproduce the weighted soundfield on the basis of the weighted soundfield data. In an embodiment, the renderer 313 can comprise one or more appropriately arranged transducers, in particular loudspeakers.

(37) Finally, in the embodiment shown in FIG. 3, the soundfield reproduction apparatus 310 comprises a performance measure determiner 315 configured to determine a performance measure on the basis of the weighted soundfield. To this end, in an embodiment, the performance measure determiner 315 can comprise one or more microphones, such as a 32-channel Eigenmike, for measuring the weighted soundfield reproduced by the renderer 313 as well as a processing unit configured to determine a performance measure on the basis of the measured weighted soundfield, for instance, the performance measure defined in equation (1) above.

(38) In an embodiment, the soundfield reproduction apparatus 310 is configured to feedback the performance measure determined by the performance measure determiner 315 to the compressor 105 of the apparatus 100. In an embodiment, the compressor 105 is configured to adjust its compression rate on the basis of the performance measure provided by the performance measure determiner 315. For instance, in an embodiment the compressor 105 can check, whether the performance measure provided by the performance measure determiner 315 is larger than a predefined performance measure threshold, e.g. whether the acoustical contrast between the bright region 101a and the quiet region is larger than a predefined minimal acoustical contrast, and, if this is the case, can increase the compression rate applied to the weighted soundfield data.

(39) In an embodiment, the compressor 105 can implement a compression strategy based on the pre-calculated graphs shown in FIG. 4, which shows the dependence of the averaged acoustic contrast performance as a function of a transmission bitrate for a plurality of different compression techniques, such as different versions of EVS and different versions of AAC. For instance, in an embodiment, the compressor 105 could be configured to increase its compression rate, in case for a given previously chosen bitrate the performance measure provided by the performance measure determiner 315, i.e. the averaged acoustic contrast performance, falls below the curve show in FIG. 4 for the compression strategy adopted by the compressor 105.

(40) FIG. 5 shows a schematic diagram of a further embodiment of an apparatus 100 for processing soundfield data. As the embodiment of the apparatus 100 for processing soundfield data shown in FIG. 1, the further embodiment of the apparatus 100 for processing soundfield data shown in FIG. 5 comprises an applicator 103 (referred to as Multizone HOA format converter in FIG. 5) configured to apply a spatially continuously varying weighting function to soundfield data, for instance, the spatially continuously varying weighting function defined in equation (2) above, in order to obtain weighted soundfield data defining a weighted soundfield, wherein the spatially continuously varying weighting function is configured to enhance the soundfield in the bright zone 101a and/or the quiet zone 101b. In the embodiment shown in FIG. 5, the soundfield data is taken from an electronic storage or memory 111, for instance a DVD player, a CD player or a Flash memory, configured to store the soundfield data to be weighted by the spatially continuously varying weighting function. In an embodiment, the applicator 103 is configured to provide the weighted soundfield data as HOA-B format weighted soundfield data. As schematically indicated in FIG. 5, in order to be able to perform this conversion to the HOA-B format, the applicator 103 requires as input some information about the soundfield and the weighting function, such as the location of the bright zone and/or the quit zone.

(41) As in the embodiment shown in FIG. 5, the weighted soundfield data is provided from the applicator 103 directly to a renderer 113 configured to render, i.e. reproduce, the weighted soundfield on the basis of the weighted soundfield data, the apparatus 100 shown in FIG. 5 does not comprise a compressor, such as the compressor 105 of the apparatus shown in FIG. 1.

(42) FIGS. 6 and 7 show schematic diagrams illustrating different aspects of embodiments of the disclosure in the context of an unrestricting illustrative example. In this illustrative example it is assumed that the bright zone of the weighted soundfield has the size of a circle with diameter 2*Ro (outer zone) as shown in the FIG. 6, which generally is much larger than the size of an average human head. As already described above, according to embodiments of the disclosure, a bitrate reduction can be achieved by having a smooth weighting function/model corresponding to some criteria such as the possible user movement within the region of diameter 2*Ri (inner zone) inside the outer zone.

(43) In multizone applications, it is practically desirable to have the size of outer zone as large as possible. One may choose to focus on the reproduction inside a smaller region denoted by the inner zone. This will make the system to be inferior due to a smaller area of coverage and reprocessing of the multizone HOA B-format signals due to a change in the multizone arrangement input, resulting in an undesired quality as the user moves away from the inner zone. Embodiments of the disclosure on the other hand, guarantee a smooth transition in quality as highlighted in FIG. 7.

(44) While a particular feature or aspect of the disclosure may have been disclosed with respect to only one of several implementations or embodiments, such feature or aspect may be combined with one or more other features or aspects of the other implementations or embodiments as may be desired and advantageous for any given or particular application. Furthermore, to the extent that the terms include, have, with, or other variants thereof are used in either the detailed description or the claims, such terms are intended to be inclusive in a manner similar to the term comprise. Also, the terms exemplary, for example and e.g. are merely meant as an example, rather than the best or optimal. The terms coupled and connected, along with derivatives may have been used. It should be understood that these terms may have been used to indicate that two elements cooperate or interact with each other regardless whether they are in direct physical or electrical contact, or they are not in direct contact with each other.

(45) Although specific aspects have been illustrated and described herein, it will be appreciated by those of ordinary skill in the art that a variety of alternate and/or equivalent implementations may be substituted for the specific aspects shown and described without departing from the scope of the present disclosure. This application is intended to cover any adaptations or variations of the specific aspects discussed herein.

(46) Although the elements in the following claims are recited in a particular sequence with corresponding labeling, unless the claim recitations otherwise imply a particular sequence for implementing some or all of those elements, those elements are not necessarily intended to be limited to being implemented in that particular sequence.

(47) Many alternatives, modifications, and variations will be apparent to those skilled in the art in light of the above teachings. Of course, those skilled in the art readily recognize that there are numerous applications of the disclosure beyond those described herein. While the present disclosure has been described with reference to one or more particular embodiments, those skilled in the art recognize that many changes may be made thereto without departing from the scope of the present disclosure. It is therefore to be understood that within the scope of the appended claims and their equivalents, the disclosure may be practiced otherwise than as specifically described herein.