Apparatus and methods for adapting audio information in spatial audio object coding
10497375 · 2019-12-03
Assignee
Inventors
- Thorsten Kastner (Erlangen, DE)
- Juergen Herre (Erlangen, DE)
- Leon Terentiv (Erlangen, DE)
- Oliver Hellmuth (Erlangen, DE)
- Jouni Paulus (Erlangen, DE)
- Falko Ridderbusch (Erlangen, DE)
Cpc classification
H04S2420/03
ELECTRICITY
G10L19/008
PHYSICS
International classification
Abstract
An apparatus for adapting input audio information, encoding one or more audio objects, to obtain adapted audio information is provided. The input audio information includes two or more input audio downmix channels and further includes input parametric side information. The adapted audio information includes one or more adapted audio downmix channels and further includes adapted parametric side information. The apparatus includes a downmix signal modifier for adapting, depending on adaptation information, the two or more input audio downmix channels to obtain the one or more adapted audio downmix channels. Moreover, the apparatus includes a parametric side information adapter for adapting, depending on the adaptation information, the input parametric side information to obtain the adapted parametric side information.
Claims
1. An audio encoder for encoding one or more audio object signals to obtain one or more second downmix channels and second parametric side information, wherein the apparatus comprises: a first audio signal encoding unit configured for downmixing the one or more audio object signals to obtain two or more first audio downmix channels and to obtain first parametric side information, a downmix signal modifier configured for applying an adaptation matrix on the two or more first audio downmix channels to acquire the one or more second audio downmix channels, wherein the adaptation matrix comprises at least two rows, and wherein the adaptation matrix comprises at least two columns, and a parametric side information adapter configured for applying said adaptation matrix on the first parametric side information to acquire the second parametric side information, wherein the audio encoder is configured for outputting the one or more second audio downmix channels and the second parametric side information so that the one or more audio object signals are decodable using the one or more second audio downmix channels, and using the second parametric side information, wherein the apparatus is implemented using a hardware apparatus or using a computer or using a combination of a hardware apparatus and a computer.
2. An audio encoder according to claim 1, wherein the first parametric side information indicates an initial downmix matrix, such that by applying the initial downmix matrix on the one or more audio object signals, the two or more first audio downmix channels are acquired, and wherein the parametric side information adapter is configured to determine an adapted downmix matrix as the second parametric side information, such that by applying the adapted downmix matrix on the one or more audio object signals, the one or more second audio downmix channels are acquired.
3. An audio encoder according to claim 1, wherein the downmix signal modifier is configured to adapt the two or more first audio downmix channels using the adaptation matrix, such that the number of the one or more second audio downmix channels is smaller than the number of the two or more first audio downmix channels.
4. An audio encoder according to claim 1, wherein the adaptation matrix depends on a decoder instance, and wherein the downmix signal modifier is configured to adapt the two or more first audio downmix channels depending on the decoder instance.
5. An audio encoder according to claim 4, wherein the decoder instance is capable of decoding at most a maximum number of downmix channels, wherein the adaptation matrix depends on said maximum number of downmix channels, and wherein the downmix signal modifier is configured to adapt the two or more first audio downmix channels depending on the adaptation matrix to acquire the one or more second audio downmix channels, such that the number of the one or more second audio downmix channels is equal to said maximum number of downmix channels.
6. An audio encoder according to claim 1, wherein the downmix signal modifier is configured to adapt, depending on the adaptation matrix D.sub.dmx.sup.DSM, the two or more first audio downmix channels X.sub.dmx.sup.ENC to acquire the one or more second audio downmix channels X.sub.dmx.sup.DSM by applying the formula:
X.sub.dmx.sup.DSM=D.sub.dmx.sup.DSMX.sub.dmx.sup.ENC.
7. An audio encoder according to claim 1, wherein the parametric side information adapter is configured to adapt, depending on the adaptation matrix D.sub.dmx.sup.DSM, the first parametric side information D.sub.dmx.sup.ENC to acquire the second parametric side information D.sub.dmx.sup.PSI by applying the formula:
D.sub.dmx.sup.PSI=D.sub.dmx.sup.DSMD.sub.dmx.sup.ENC.
8. A system for generating one or more audio channels from first audio information encoding one or more audio object signals, wherein the apparatus comprises: an audio encoder according to claim 1 for adapting the first audio information to acquire second audio information, wherein the first audio information comprises two or more first audio downmix channels and further comprises first parametric side information, wherein the second audio information comprises one or more second audio downmix channels and further comprises second parametric side information, and an audio decoder for decoding, depending on the second parametric side information, the one or more second audio downmix channels to acquire the one or more audio channels.
9. A system according to claim 8, wherein the parametric side information adapter of the apparatus according to claim 1 is configured to adapt the first parametric side information to acquire the second parametric side information, and to feed the second parametric side information into the audio decoder, and wherein the audio decoder is configured to decode the one or more second audio downmix channels depending on the second parametric side information.
10. A system according to claim 8, wherein the parametric side information adapter of the apparatus according to claim 1 is configured to feed a bit stream comprising the second parametric side information into the audio decoder, and wherein the audio decoder is configured to decode the one or more second audio downmix channels depending on the bit stream.
11. A method for audio encoding for encoding one or more audio object signals to obtain one or more second downmix channels and second parametric side information, wherein the method comprises: downmixing the one or more audio object signals to obtain two or more first audio downmix channels and to obtain first parametric side information, applying an adaptation matrix on the two or more first audio downmix channels to acquire the one or more second audio downmix channels, wherein the adaptation matrix comprises at least two rows, and wherein the adaptation matrix comprises at least two columns, and applying said adaptation matrix on the first parametric side information to acquire the second parametric side information, outputting the one or more second audio downmix channels and the second parametric side information so that the one or more audio object signals are decodable using the one or more second audio downmix channels, and using the second parametric side information, wherein the method is performed using a hardware apparatus or using a computer or using a combination of a hardware apparatus and a computer.
12. A method according to claim 11, wherein the first parametric side information indicates an initial downmix matrix, such that by applying the initial downmix matrix on the one or more audio object signals, the two or more first audio downmix channels are acquired, and wherein adapting the first parametric side information comprises determining an adapted downmix matrix as the second parametric side information, such that by applying the adapted downmix matrix on the one or more audio object signals, the one or more second audio downmix channels are acquired.
13. A non-transitory computer-readable medium comprising a computer program for implementing, when being executed by a computer or signal processor, a method for audio encoding for encoding one or more audio object signals to obtain one or more second downmix channels and second parametric side information, wherein the method comprises: downmixing the one or more audio object signals to obtain two or more first audio downmix channels and to obtain first parametric side information, applying an adaptation matrix on the two or more first audio downmix channels to acquire the one or more second audio downmix channels, wherein the adaptation matrix comprises at least two rows, and wherein the adaptation matrix comprises at least two columns, and applying said adaptation matrix on the first parametric side information to acquire the second parametric side information, outputting the one or more second audio downmix channels and the second parametric side information so that the one or more audio object signals are decodable using the one or more second audio downmix channels, and using the second parametric side information.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) Embodiments of the present invention will be detailed subsequently referring to the appended drawings, in which:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
DETAILED DESCRIPTION OF THE INVENTION
(11) Before describing embodiments of the present invention, more background on state-of-the-art-SAOC systems is provided.
(12)
(13) In the case of a stereo downmix, the channels of the downmix signal 18 are denoted L0 and R0, in case of a mono downmix same is simply denoted L0. In order to enable the SAOC decoder 12 to recover the individual objects s.sub.1 to s.sub.N, side-information estimator 17 provides the SAOC decoder 12 with side information including SAOC-parameters. For example, in case of a stereo downmix, the SAOC parameters comprise object level differences (OLD), inter-object correlations (IOC) (inter-object cross correlation parameters), downmix gain values (DMG) and downmix channel level differences (DCLD). The side information 20, including the SAOC-parameters, along with the downmix signal 18, forms the SAOC output data stream received by the SAOC decoder 12.
(14) The SAOC decoder 12 comprises an up-mixer which receives the downmix signal 18 as well as the side information 20 in order to recover and render the audio signals .sub.1 and .sub.N onto any user-selected set of channels .sub.1 to .sub.M, with the rendering being prescribed by rendering information 26 input into SAOC decoder 12.
(15) The audio signals s.sub.1 to s.sub.N may be input into the encoder 10 in any coding domain, such as, in time or spectral domain. In case the audio signals s.sub.1 to s.sub.N are fed into the encoder 10 in the time domain, such as PCM coded, encoder 10 may use a filter bank, such as a hybrid QMF bank, in order to transfer the signals into a spectral domain, in which the audio signals are represented in several sub-bands associated with different spectral portions, at a specific filter bank resolution. If the audio signals s.sub.1 to s.sub.N are already in the representation expected by encoder 10, same does not have to perform the spectral decomposition.
(16)
(17) As outlined above, side information extractor 17 of
(18) The side information extractor 17 depicted in
(19)
wherein the sums and the indices n and k, respectively, go through all temporal indices 34, and all spectral indices 30 which belong to a certain time/frequency tile 42, referenced by the indices l for the SAOC frame (or processing time slot) and m for the parameter band. Thereby, the energies of all sub-band values x.sub.i of an audio signal or object i are summed up and normalized to the highest energy value of that tile among all objects or audio signals. x.sub.i.sup.n,k* denotes the complex conjugate of x.sub.i.sup.n,k.
(20) Further, the SAOC side information extractor 17 is able to compute a similarity measure of the corresponding time/frequency tiles of pairs of different input objects s.sub.1 to s.sub.N. Although the SAOC side information extractor 17 may compute the similarity measure between all the pairs of input objects s.sub.1 to s.sub.N, side information extractor 17 may also suppress the signaling of the similarity measures or restrict the computation of the similarity measures to audio objects s.sub.1 to s.sub.N which form left or right channels of a common stereo channel. In any case, the similarity measure is called the inter-object cross-correlation parameter IOC.sub.i,j.sup.l,m. The computation is as follows
(21)
with again indices n and k going through all sub-band values belonging to a certain time/frequency tile 42, i and j denoting a certain pair of audio objects s.sub.1 to s.sub.N, and Re{ } denoting the operation of discarding the imaginary part of the complex argument.
(22) The downmixer 16 of
(23) This downmix prescription is signaled to the decoder side by means of downmix gains DMG.sub.i and, in case of a stereo downmix signal, downmix channel level differences DCLD.sub.i.
(24) The downmix gains are calculated according to:
DMG.sub.i=20 log.sub.10(d.sub.i+),(mono downmix),
DMG.sub.i=10 log.sub.10(d.sub.1,i.sup.2+d.sub.2,i.sup.2+),(stereo downmix),
where is a small number such as 10.sup.9.
(25) For the DCLDs the following formula applies:
(26)
In the normal mode, downmixer 16 generates the downmix signal according to:
(27)
for a mono downmix, or
(28)
for a stereo downmix, respectively.
(29) Thus, in the abovementioned formulas, parameters OLD and IOC are a function of the audio signals and parameters DMG and DCLD are a function of d. By the way, it is noted that d may be varying in time and in frequency.
(30) Thus, in the normal mode, downmixer 16 mixes all objects s.sub.1 to s.sub.N with no preferences, i.e., with handling all objects s.sub.1 to s.sub.N equally.
(31) At the decoder side, the upmixer performs the inversion of the downmix procedure and the implementation of the rendering information 26 represented by a matrix R (in the literature sometimes also called A) in one computation step, namely, in case of a two-channel downmix
(32)
where matrix E is a function of the parameters OLD and IOC, and the matrix D contains the downmixing coefficients as
(33)
(34) The matrix E is an estimated covariance matrix of the audio objects s.sub.1 to s.sub.N. In current SAOC implementations, the computation of the estimated covariance matrix E is typically performed in the spectral/temporal resolution of the SAOC parameters, i.e., for each (l,m), so that the estimated covariance matrix may be written as E.sup.l,m. The estimated covariance matrix E.sup.l,m is of size NN with its coefficients being defined as
e.sub.i,j.sup.l,m={square root over (OLD.sub.i.sup.l,mOLD.sub.j.sup.l,m)}IOC.sub.i,j.sup.l,m.
Thus, the matrix E.sup.l,m with
(35)
has along its diagonal the object level differences, i.e., e.sub.i,j.sup.l,m=OLD.sub.i.sup.l,m for i=j, since OLD.sub.i.sup.l,m=OLD.sub.j.sup.l,m and IOC.sub.i,j.sup.l,m=1 for i=j. Outside its diagonal the estimated covariance matrix E has matrix coefficients representing the geometric mean of the object level differences of objects i and j, respectively, weighted with the inter-object cross correlation measure IOC.sub.i,j.sup.l,m.
(36)
(37) In the following, embodiments of the present invention are described.
(38)
(39) The input audio information comprises two or more input audio downmix channels and further comprises input parametric side information. The adapted audio information comprises one or more adapted audio downmix channels and further comprises adapted parametric side information.
(40) The apparatus comprises a downmix signal modifier (DSM) 110 for adapting, depending on adaptation information, the two or more input audio downmix channels to obtain the one or more adapted audio downmix channels.
(41) Moreover, the apparatus comprises a parametric side information adapter (PSIA) 120 for adapting, depending on the adaptation information, the input parametric side information to obtain the adapted parametric side information.
(42)
(43) In an embodiment, the adaptation information may depend on a decoder instance, and the downmix signal modifier 110 may be configured to adapt the two or more input audio downmix channels depending on the decoder instance.
(44) For example, the downmix signal modifier 110 of
(45) According to an embodiment, the downmix signal modifier 110 may be configured to adapt the two or more input audio downmix channels depending on the adaptation information, such that the number of the one or more adapted audio downmix channels is smaller than the number of the two or more input audio downmix channels.
(46) For example, in the embodiment of
(47) E.g., 22.2 input audio downmix channels (=24 input audio downmix channels) may be reduced to 7.1 adapted audio downmix channels (=8 adapted audio downmix channels).
(48) Or, for example, 5.1 input audio downmix channels (=6 input audio downmix channels) are reduced to 2.0 adapted audio downmix channels (=2 adapted audio downmix channels).
(49) Or, for example, 2 input audio downmix channels are reduced to 1 adapted audio downmix channel.
(50) Various other combinations of input audio downmix channels and adapted audio downmix channels are possible
(51) According to an embodiment, the decoder instance may be capable of decoding at most a maximum number of downmix channels. The adaptation information may depend on said maximum number of downmix channels. Moreover, the downmix signal modifier 110 may be configured to adapt the two or more input audio downmix channels depending on the adaptation information to obtain the one or more adapted audio downmix channels, such that the number of the one or more adapted downmix channels is equal to said maximum number of downmix channels.
(52) For example, the downmix signal modifier 110 of
(53) According to an embodiment, the adaptation information may, for example, comprise an adaptation matrix (D.sub.dmx.sup.DSM).
(54) The parametric side information adapter 120 may, e.g., adapt the PSI to correspond to the modified downmix in order to decrease the computational complexity for the decoder, and to reduce the corresponding data bitstream size/bitrate without producing negative influence on the decoder output audio quality.
(55) For example, the PSIA 120 modifies the corresponding PSI bitstream substituting the information representing the initial downmix matrix by the updated information describing the resulting downmix (accounting for the DSM modifications) to correspond to the particular specification of the decoder.
(56) For example, an SAOC encoder provides the stereo downmix signal X.sub.dmx.sup.ENC resulting from application of the encoder downmix matrix D.sub.dmx.sup.ENC to the input audio object signals S:
X.sub.dmx.sup.ENC=D.sub.dmx.sup.ENCS.
(57) According to an embodiment, the downmix signal modifier 110 may be configured to adapt, depending on the adaptation matrix D.sub.dmx.sup.DSM, the two or more input audio downmix channels X.sub.dmx.sup.ENC to obtain the one or more adapted audio downmix channels X.sub.dmx.sup.DSM. In an embodiment, this is realized, for example, by applying the formula X.sub.dmx.sup.DSM=D.sub.dmx.sup.DSMX.sub.dmx.sup.ENC.
(58) For example, in an embodiment, where it is assumed that the particular SAOC decoder instance supports only mono downmix (e.g. SAOC Low Delay profile/Level 1). In this case, the DSM 110 converts the stereo downmix X.sub.dmx.sup.ENC to the mono signal X.sub.dmx.sup.DSM using a predefined downmix matrix D.sub.dmx.sup.DSM as follows:
X.sub.dmx.sup.DSM=D.sub.dmx.sup.DSMX.sub.dmx.sup.ENC.
(59) According to an embodiment, the parametric side information adapter 120 may be configured to adapt, depending on the adaptation matrix D.sub.dmx.sup.DSM, the input parametric side information D.sub.dmx.sup.ENC to obtain the adapted parametric side information D.sub.dmx.sup.PSI. In an embodiment, this may, for example, be realized by applying the formula: D.sub.dmx.sup.PSI=D.sub.dmx.sup.DSMD.sub.dmx.sup.ENC.
(60) For example, according to an embodiment, the PSIA 120 parses the corresponding PSI bitstream; extracts information that describes the downmix matrix D.sub.dmx.sup.ENC; substitutes these data by updated information that describes the new downmix matrix D.sub.dmx.sup.PSI:
D.sub.dmx.sup.PSI=D.sub.dmx.sup.DSMD.sub.dmx.sup.ENC.
Thus, according to an embodiment, the input parametric side information (D.sub.dmx.sup.enc) may indicate an initial downmix matrix, such that by applying the initial downmix matrix (D.sub.dmx.sup.enc) on the one or more audio objects (S), the two or more input audio downmix channels (X.sub.dmx.sup.enc) are obtained. The parametric side information adapter may be configured to determine an adapted downmix matrix (D.sub.dmx.sup.PSI) as the adapted parametric side information, such that by applying the adapted downmix matrix (D.sub.dmx.sup.PSI) on the one or more audio objects (S), the one or more adapted audio downmix channels (X.sub.dmx.sup.DSM) are obtained.
(61) In an embodiment, the PSIA formats the new modified bitstream or directly passes these parameters to the decoder.
(62) This encoding and decoding process performed by the PSIA can also include conversion of different downmix matrix representation formats (e.g. polar- to Cartesian-coordinate system, etc.).
(63) This described function of the PSIA can solve potential compatibility issues and reduce the size of the corresponding bitstream.
(64)
(65) The apparatus 700 for generating the one or more audio channels comprises an apparatus 710 according to one of the above-described embodiments for adapting the input audio information to obtain adapted audio information. The input audio information comprises two or more input audio downmix channels and further comprises input parametric side information. The adapted audio information comprises one or more adapted audio downmix channels and further comprises adapted parametric side information.
(66) The apparatus 710 according to one of the above-described embodiments for adapting the input audio information comprises a downmix signal modifier 110 and a parametric side information adapter 120.
(67) Moreover, the apparatus 700 for generating the one or more audio channels comprises a decoder instance 720, for decoding, depending on the adapted parametric side information, the one or more adapted audio downmix channels to obtain the one or more audio channels.
(68) According to an embodiment, the parametric side information adapter 120 of the apparatus 710 for adapting the input audio information may be configured to receive an input bit stream comprising the input parametric side information. The parametric side information adapter 120 of the apparatus 710 for adapting the input audio information may be configured to adapt the input parametric side information to obtain the adapted parametric side information, and to feed the adapted parametric side information into the decoder instance 720. The decoder instance 720 may be configured to decode the one or more adapted audio downmix channels depending on the adapted parametric side information.
(69) In another embodiment, the parametric side information adapter 120 of the apparatus 710 for adapting the input audio information may be configured to receive an input bit stream comprising the input parametric side information. The parametric side information adapter 120 of the apparatus 710 for adapting the input audio information may be configured to substitute the input parametric side information within the input bit stream by the adapted parametric side information to obtain a modified bit stream. The parametric side information adapter 120 of the apparatus 710 for adapting the input audio information may be configured to feed the modified bit stream into the decoder instance 720. Moreover, the decoder instance 720 may be configured to decode the one or more adapted audio downmix channels depending on the modified bit stream.
(70)
(71) In particular,
(72)
(73) The joint (integrated) implementation of the apparatus for adapting input audio information can be realized in order to reduce computational complexity for decoding (see
(74)
(75) In particular,
(76) The disjoint (separated) implementation of the apparatus for adapting input audio information can be realized in order to reduce the corresponding data bitstream size/bitrate, see
(77) Although some aspects have been described in the context of an apparatus, it is clear that these aspects also represent a description of the corresponding method, where a block or device corresponds to a method step or a feature of a method step. Analogously, aspects described in the context of a method step also represent a description of a corresponding block or item or feature of a corresponding apparatus.
(78) The inventive decomposed signal can be stored on a digital storage medium or can be transmitted on a transmission medium such as a wireless transmission medium or a wired transmission medium such as the Internet.
(79) Depending on certain implementation requirements, embodiments of the invention can be implemented in hardware or in software. The implementation can be performed using a digital storage medium, for example a floppy disk, a DVD, a CD, a ROM, a PROM, an EPROM, an EEPROM, or a FLASH memory, having electronically readable control signals stored thereon, which cooperate (or are capable of cooperating) with a programmable computer system such that the respective method is performed.
(80) Some embodiments according to the invention comprise a non-transitory data carrier having electronically readable control signals, which are capable of cooperating with a programmable computer system, such that one of the methods described herein is performed.
(81) Generally, embodiments of the present invention can be implemented as a computer program product with a program code, the program code being operative for performing one of the methods when the computer program product runs on a computer. The program code may for example be stored on a machine readable carrier.
(82) Other embodiments comprise the computer program for performing one of the methods described herein, stored on a machine readable carrier.
(83) In other words, an embodiment of the inventive method is, therefore, a computer program having a program code for performing one of the methods described herein, when the computer program runs on a computer.
(84) A further embodiment of the inventive methods is, therefore, a data carrier (or a digital storage medium, or a computer-readable medium) comprising, recorded thereon, the computer program for performing one of the methods described herein.
(85) A further embodiment of the inventive method is, therefore, a data stream or a sequence of signals representing the computer program for performing one of the methods described herein. The data stream or the sequence of signals may for example be configured to be transferred via a data communication connection, for example via the Internet.
(86) A further embodiment comprises a processing means, for example a computer, or a programmable logic device, configured to or adapted to perform one of the methods described herein.
(87) A further embodiment comprises a computer having installed thereon the computer program for performing one of the methods described herein.
(88) In some embodiments, a programmable logic device (for example a field programmable gate array) may be used to perform some or all of the functionalities of the methods described herein. In some embodiments, a field programmable gate array may cooperate with a microprocessor in order to perform one of the methods described herein. Generally, the methods are performed by any hardware apparatus.
(89) While this invention has been described in terms of several advantageous embodiments, there are alterations, permutations, and equivalents which fall within the scope of this invention. It should also be noted that there are many alternative ways of implementing the methods and compositions of the present invention. It is therefore intended that the following appended claims be interpreted as including all such alterations, permutations, and equivalents as fall within the true spirit and scope of the present invention.
REFERENCES
(90) [MPS] ISO/IEC 23003-1:2007, MPEG-D (MPEG audio technologies), Part 1: MPEG Surround, 2007 [BCC] C. Faller and F. Baumgarte, Binaural Cue CodingPart II: Schemes and applications, IEEE Trans. on Speech and Audio Proc., vol. 11, no. 6, November 2003 [JSC] C. Faller, Parametric Joint-Coding of Audio Sources, 120th AES Convention, Paris, 2006 [SAOC1] J. Herre, S. Disch, J. Hilpert, O. Hellmuth: From SAC To SAOCRecent Developments in Parametric Coding of Spatial Audio, 22nd Regional UK AES Conference, Cambridge, UK, April 2007 [SAOC2] J. Engdegrd, B. Resch, C. Falch, O. Hellmuth, J. Hilpert, A. Hlzer, L. Terentiev, J. Breebaart, J. Koppens, E. Schuijers and W. Oomen: Spatial Audio Object Coding (SAOC)The Upcoming MPEG Standard on Parametric Object Based Audio Coding, 124th AES Convention, Amsterdam 2008 [SAOC] ISO/IEC, MPEG audio technologiesPart 2: Spatial Audio Object Coding (SAOC), ISO/IEC JTC1/SC29/WG11 (MPEG) International Standard 23003-2. [ISS1] M. Parvaix and L. Girin: Informed Source Separation of underdetermined instantaneous Stereo Mixtures using Source Index Embedding, IEEE ICASSP, 2010 [ISS2] M. Parvaix, L. Girin, J.-M. Brossier: A watermarking-based method for informed source separation of audio signals with a single sensor, IEEE Transactions on Audio, Speech and Language Processing, 2010 [ISS3] A. Liutkus and J. Pinel and R. Badeau and L. Girin and G. Richard: Informed source separation through spectrogram coding and data embedding, Signal Processing Journal, 2011 [ISS4] A. Ozerov, A. Liutkus, R. Badeau, G. Richard: Informed source separation: source coding meets source separation, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011 [ISS5] Shuhua Zhang and Laurent Girin: An Informed Source Separation System for Speech Signals, INTERSPEECH, 2011 [ISS6] L. Girin and J. Pinel: Informed Audio Source Separation from Compressed Linear Stereo Mixtures, AES 42nd International Conference: Semantic Audio, 2011