Apparatus and method for generating a frequency enhanced signal using shaping of the enhancement signal
09640189 ยท 2017-05-02
Assignee
Inventors
- Sascha Disch (Fuerth, DE)
- Ralf Geiger (Erlangen, DE)
- Christian Helmrich (Erlangen, DE)
- Markus Multrus (Nuremberg, DE)
- Konstantin Schmidt (Nuremberg, DE)
Cpc classification
G10L19/06
PHYSICS
G10L19/12
PHYSICS
G10L25/18
PHYSICS
International classification
G10L19/02
PHYSICS
G10L19/06
PHYSICS
G10L19/12
PHYSICS
G10L25/18
PHYSICS
Abstract
An apparatus for generating a frequency enhancement signal has: a calculator for calculating a value describing an energy distribution with respect to frequency in a core signal; and a signal generator for generating an enhancement signal having an enhancement frequency range not included in the core signal, from the core signal, wherein the signal generator is configured for shaping the enhancement signal or the core signal so that a spectral envelope of the enhancement signal or of the core signal depends on the value describing the energy distribution with respect to frequency in the core signal.
Claims
1. An apparatus for generating a frequency enhancement signal, comprising: a calculator for calculating a value describing an energy distribution with respect to frequency in a core signal; a signal generator for generating an enhancement signal comprising an enhancement frequency range not comprised in the core signal, from the core signal, and wherein the signal generator is configured for shaping the enhancement signal or the core signal so that a spectral envelope of the enhancement signal or of the core signal depends on the value describing the energy distribution with respect to frequency in the core signal, wherein the signal generator is configured to shape the enhancement signal or the core signal so that a first spectral envelope decrease from a first frequency in the enhancement frequency range to a second higher frequency in the enhancement frequency range is acquired for a first value describing a first energy distribution, and so that a second spectral envelope decrease from the first frequency in the enhancement range to the second frequency in the enhancement range is acquired for a second value describing a second energy distribution, wherein the second frequency is greater than the first frequency, wherein the second spectral envelope decrease is greater than the first spectral envelope decrease, and wherein the first value indicates that the core signal comprises an energy concentration at a higher frequency of the core signal compared to the second value.
2. The apparatus of claim 1, further comprising a combiner for combining the enhancement signal and the core signal to acquire the frequency enhancement signal.
3. The apparatus of claim 1, wherein the calculator is configured to calculate a measure for a spectral centroid of a current frame as the value on the energy distribution, wherein the signal generator is configured to shape, in accordance with the value for the spectral centroid, so that the spectral centroid at a higher frequency results in a more shallow slope of the spectral envelope than a spectral centroid at a lower frequency.
4. The apparatus in accordance with claim 1, wherein the calculator is configured to calculate the information on the energy distribution using only a frequency portion of the core signal, the frequency portion of the core signal starting at a first frequency and ending at a second frequency higher than the first frequency, wherein the first frequency is higher than a lowest frequency of the core signal or the second frequency is the highest frequency of the core signal.
5. The apparatus in accordance with claim 1, wherein the value describing an energy distribution is calculated using the following equation:
6. The apparatus in accordance with claim 1, wherein the signal generator is configured for applying a shaping factor to an input signal, wherein the shaping factor is calculated based on the following equation:
att=p(sp); wherein att is a value influencing a shaping factor, and p is a polynomial, and sp is the value on the frequency distribution calculated by the calculator.
7. The apparatus in accordance with claim 1, wherein the signal generator is configured for performing the shaping using the following equation:
(t,xover+f)=Qr(t,xover+f)*att.sup.f; f=1 . . . nBands, or
(t,xover+f)=Qi(t,xover+f)*att.sup.f; f=1 . . . nBands wherein
is a real part of a shaped subband sample, t is a time index, xover is a crossover frequency, f is a frequency index and att is a constant derived from the value on the spectral distribution, Q.sub.r is a real part of a subband sample before shaping, and Q.sub.i is an imaginary part of a subband sample before shaping.
8. The apparatus in accordance with claim 1, wherein the core signal comprises a plurality of core signal subbands, wherein the calculator is configured to calculate individual energies of core signal bands and to calculate the information on the energy distribution using the individual energies.
9. The apparatus in accordance with claim 1, wherein the core signal comprises a plurality of core signal bands, wherein the signal generator is configured to copy-up or to mirror one or a plurality of core signal bands to acquire a plurality of enhancement signal bands forming the enhancement frequency range.
10. The apparatus in accordance with claim 1, wherein the calculator is configured to calculate the value based on the following equation:
11. The apparatus in accordance with claim 1, wherein the signal generator is configured to perform, subsequent to or concurrent to the shaping of the enhancement signal or the core signal, a temporal smoothing operation, the temporal smoothing operation comprising finding a decision about a smoothing intensity and applying the smoothing operation to the enhancement frequency range or the core signal based on the decision.
12. The apparatus in accordance with claim 1, wherein the signal generator is configured to apply a band-wise energy limitation subsequent to the shaping or the temporal smoothing or concurrent to the shaping or the temporal smoothing.
13. A method of generating a frequency enhancement signal, comprising: calculating a value describing an energy distribution with respect to frequency in a core signal; generating an enhancement signal comprising an enhancement frequency range not comprised in the core signal, from the core signal, and wherein the generating comprises shaping the enhancement signal or the core signal so that a spectral envelope of the enhancement signal or of the core signal depends on the value describing the energy distribution with respect to frequency in the core signal, wherein the generating comprises shaping the enhancement signal or the core signal so that a first spectral envelope decrease from a first frequency in the enhancement frequency range to a second higher frequency in the enhancement frequency range is acquired for a first value describing a first energy distribution, and so that a second spectral envelope decrease from the first frequency in the enhancement range to the second frequency in the enhancement range is acquired for a second value describing a second energy distribution, wherein the second frequency is greater than the first frequency, wherein the second spectral envelope decrease is greater than the first spectral envelope decrease, and wherein the first value indicates that the core signal comprises an energy concentration at a higher frequency of the core signal compared to the second value.
14. A system for processing audio signals, comprising: an encoder for generating an encoded core signal; and an apparatus for generating a frequency enhancement signal, the apparatus comprising: a calculator for calculating a value describing an energy distribution with respect to frequency in a core signal; a signal generator for generating an enhancement signal comprising an enhancement frequency range not comprised in the core signal, from the core signal, and wherein the signal generator is configured for shaping the enhancement signal or the core signal so that a spectral envelope of the enhancement signal or of the core signal depends on the value describing the energy distribution with respect to frequency in the core signal, wherein the signal generator is configured to shape the enhancement signal or the core signal so that a first spectral envelope decrease from a first frequency in the enhancement frequency range to a second higher frequency in the enhancement frequency range is acquired for a first value describing a first energy distribution, and so that a second spectral envelope decrease from the first frequency in the enhancement range to the second frequency in the enhancement range is acquired for a second value describing a second energy distribution, wherein the second frequency is greater than the first frequency, wherein the second spectral envelope decrease is greater than the first spectral envelope decrease, and wherein the first value indicates that the core signal comprises an energy concentration at a higher frequency of the core signal compared to the second value.
15. A method for processing audio signals, comprising: generating an encoded core signal; and generating a frequency enhancement signal, the generating comprising: calculating a value describing an energy distribution with respect to frequency in a core signal; generating an enhancement signal comprising an enhancement frequency range not comprised in the core signal, from the core signal, and wherein the generating comprises shaping the enhancement signal or the core signal so that a spectral envelope of the enhancement signal or of the core signal depends on the value describing the energy distribution with respect to frequency in the core signal, wherein the generating comprises shaping the enhancement signal or the core signal so that a first spectral envelope decrease from a first frequency in the enhancement frequency range to a second higher frequency in the enhancement frequency range is acquired for a first value describing a first energy distribution, and so that a second spectral envelope decrease from the first frequency in the enhancement range to the second frequency in the enhancement range is acquired for a second value describing a second energy distribution, wherein the second frequency is greater than the first frequency, wherein the second spectral envelope decrease is greater than the first spectral envelope decrease, and wherein the first value indicates that the core signal comprises an energy concentration at a higher frequency of the core signal compared to the second value.
16. A non-transitory storage medium having stored thereon a computer program for performing, when running on a computer or a processor, a method of generating a frequency enhancement signal, the method comprising: calculating a value describing an energy distribution with respect to frequency in a core signal; generating an enhancement signal comprising an enhancement frequency range not comprised in the core signal, from the core signal, and wherein the generating comprises shaping the enhancement signal or the core signal so that a spectral envelope of the enhancement signal or of the core signal depends on the value describing the energy distribution with respect to frequency in the core signal, wherein the generating comprises shaping the enhancement signal or the core signal so that a first spectral envelope decrease from a first frequency in the enhancement frequency range to a second higher frequency in the enhancement frequency range is acquired for a first value describing a first energy distribution, and so that a second spectral envelope decrease from the first frequency in the enhancement range to the second frequency in the enhancement range is acquired for a second value describing a second energy distribution, wherein the second frequency is greater than the first frequency, wherein the second spectral envelope decrease is greater than the first spectral envelope decrease, and wherein the first value indicates that the core signal comprises an energy concentration at a higher frequency of the core signal compared to the second value.
17. A non-transitory storage medium having stored thereon a computer program for performing, when running on a computer or a processor, a method for processing audio signals, the method comprising: generating an encoded core signal; and generating a frequency enhancement signal, the generating comprising: calculating a value describing an energy distribution with respect to frequency in a core signal; generating an enhancement signal comprising an enhancement frequency range not comprised in the core signal, from the core signal, and wherein the generating comprises shaping the enhancement signal or the core signal so that a spectral envelope of the enhancement signal or of the core signal depends on the value describing the energy distribution with respect to frequency in the core signal, wherein the generating comprises shaping the enhancement signal or the core signal so that a first spectral envelope decrease from a first frequency in the enhancement frequency range to a second higher frequency in the enhancement frequency range is acquired for a first value describing a first energy distribution, and so that a second spectral envelope decrease from the first frequency in the enhancement range to the second frequency in the enhancement range is acquired for a second value describing a second energy distribution, wherein the second frequency is greater than the first frequency, wherein the second spectral envelope decrease is greater than the first spectral envelope decrease, and wherein the first value indicates that the core signal comprises an energy concentration at a higher frequency of the core signal compared to the second value.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) Embodiments of the present invention are subsequently described with respect to the accompanying drawings, in which:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
DETAILED DESCRIPTION OF THE INVENTION
(18)
(19) Advantageously, the apparatus for generating the frequency enhanced signal 140 of
(20) The individual subband signals of the core signal 110 which are available at 120 are then input into a signal generator 200 and the output of the signal generator 200 is an enhancement signal 130. This enhancement signal 130 comprises an enhancement frequency range which is not included in the core signal 110 and the signal generator generates this enhancement signal not e.g. by (only) shaping noise or so, but using the core signal 110 or advantageously the core signal subbands 120. The synthesis filterbank then combines the core signal subbands 120 and the frequency enhancement signal 130, and the synthesis filterbank 300 then outputs the frequency enhanced signal.
(21) Basically, the signal generator 200 comprises a signal generation block 202 which is indicated as HF generation where HF stands for high frequency. However, the frequency enhancement in
(22) Furthermore, the signal generator comprises a shaping functionality 204, which is controlled by the calculation for calculating a value indicating the energy distribution with respect to frequency in the core signal 120. This shaping may be a shaping of the signal generated by block 202 or alternatively the shaping of the low frequency, when the order between functionality 202 and 204 is reversed as discussed in the context of
(23) A further functionality is the temporal smoothing functionality 206, which is controlled by a smoothing controller 800. An energy limitation 208 may be performed at the end of the procedure, but the energy limitation can also be placed at any other position in the chain of processing functionalities 202 to 208 as long as it is made sure that the combined signal output by the synthesis filterbank 300 fulfills the energy limitation criterion such as that a higher frequency band must not have more energy than the adjacent lower frequency band or that the higher frequency band must not have more energy compared to the adjacent lower frequency band, where the increment is limited, at the most, to a predefined threshold such as 3 dB
(24)
(25)
(26) Furthermore, it is to be emphasized that the functionalities of shaping, temporal smoothing and energy limiting may all be performed by applying certain factors to a subband signal as, for example, illustrated in
(27) Furthermore, the temporal smoothing is performed by multipliers 1402b, 1401b and 1400b. Additionally, the energy limitation is performed by limitation factors 1402c, 1401c and 1400c for the individual bands i+2, i+1 and i. Due to the fact that all of these functionalities are implemented in this embodiment by multiplication factors, it is to be noted that all these functionalities can also be applied to the individual subband signals by a single multiplication factor 1402, 1401, 1400 for each individual band, and this single master multiplication factor would then be a product of the individual factors 1402a, 1402b and 1402c for a band i+2, and the situation would be analogous to the other bands i+1 and i. Thus, the real/imaginary subband samples values for the subbands are then multiplied by this single master multiplication factor and the output is obtained as multiplied real/imaginary subband sample values at the output of block 1402, 1401 or 1400, which are then introduced into the synthesis filterbank 300 of
(28)
(29)
(30) Subsequently, an implementation of the generation 202 of the enhancement frequency range not included in the core signal using the core signal is illustrated.
(31) In order to generate the artificial signal above the crossover frequency, typically QMF values from the frequency range below the crossover frequency are copied (patched) up into the high band. This copy-operation can be done by just shifting QMF samples from the lower frequency range up to the area above the crossover frequency or by additionally mirroring these samples. The advantage of the mirroring is that the signal just below the crossover frequency and the artificial generated signal will have a very similar energy and harmonic structure at the crossover frequency. The mirroring or copy up can be applied to a single subband of the core signal or to a plurality of subbands of the core signal.
(32) In the case of said QMF filterbank, the mirrored patch advantageously consists of the negative complex conjugate of the base band in order to minimize subband aliasing in the transition region:
Qr(t,xover+f1)=Qr(t,xoverf); f=1 . . . nBands
Qi(t,xover+f1)=Qi(t,xoverf); f=1 . . . nBands
(33) Here, Qr(t,f) is the real value of the QMF at time-index t and subband-index f and Qi(t,f) is the imaginary value; xover is the QMF subband referring to the crossover frequency; nBands is the integer number of bands to be extrapolated. The minus sign in the real part denotes the negative conjugate complex operation.
(34) Advantageously, the HF generation 202 or generally the generation of the enhancement frequency range relies on a subband representation provided by block 100. Advantageously, the inventive apparatus for generating a frequency enhanced signal should be a multi-bandwidth decoder which is able to resample the decoded signal 110 to vary sampling frequencies, to support, for example narrow band, wideband and super-wideband output. Therefore, the QMF filterbank 100 takes the decoded time domain signal as input. By padding zeroes in the frequency domain, the QMF filterbank can be used to resample the decoded signal, and the same QMF filterbank may also be used to create the high band signal.
(35) Advantageously, the apparatus for generating a frequency enhanced signal is operative to perform all operations in the frequency domain. Thus, an existing system already having an internal frequency domain representation at a decoder side is extended as illustrated in
(36) This representation is simply re-used for additional tasks like sampling rate conversion and other signal manipulations which may be done in the frequency domain (e.g. insertion of shaped comfort noise, high-pass/low-pass filtering). Thus, no additional time-frequency transformation needs to be calculated.
(37) Instead of using noise for the HF content, the high-band signal is generated based on the low-band signal only in this embodiment. This can be done by means of a copy-up or folding-up (mirroring) operation in the frequency domain. Thus, a high band signal with the same harmonic and temporal fine-structure as the low band signal is assured. This avoids a computationally costly folding of the time-domain signal and additional delay.
(38) Subsequently, the functionality of the shaping 204 technology of
(39)
(40) Advantageously, the apparatus additionally comprises a combiner 300 for combining the enhancement signal 130 output by block 200 and the core signal 120 to obtain the frequency enhanced signal 140. Additional operations such as temporal smoothing 206 or energy limitation 208 may be of advantage to further process the shaped signal, but are not necessarily necessitated in certain implementations.
(41) The signal generator 200 is configured to shape the enhancement signal so that a first spectral envelope decrease from a first frequency in the enhancement frequency range to a second higher frequency in the enhancement frequency range is obtained for a first value describing the energy distribution. Furthermore, a second spectral envelope decrease from the first frequency in the enhancement range to the second frequency in the enhancement range is obtained for a second value describing a second energy distribution. If the second frequency is greater than the first frequency, and the second spectral envelope decrease is greater than the first spectral envelope decrease, then the first value indicates that the core signal has an energy concentration at a higher frequency range of the core signal compared to the second value describing an energy concentration at a lower frequency range of the core signal.
(42) Advantageously, the calculator 500 is configured to calculate a measure for a spectral centroid of a current frame as the information value on the energy distribution. Then, the signal generator 200 shapes in accordance with this measure for the spectral centroid so that a spectral centroid at a higher frequency results in a more shallow slope of the spectral envelope compared to a spectral centroid at a lower frequency.
(43) The information on the energy distribution calculated by the energy distribution calculator 500 is calculated on a frequency portion of the core signal starting at the first frequency and ending at the second frequency being higher than the first frequency. The first frequency is lower than a lowest frequency in the core signal, as for example illustrated at 410 in
(44) In an embodiment, the procedure of
(45) Then, in step 608 performed by the signal generator 208, the weighting factors are applied to real and imaginary parts of the subband samples.
(46) Fricative sounds are detected by calculating the spectral centroid of the current frame in the QMF domain. The spectral centroid is a measure that has a range of 0.0 to 1.0. A high spectral centroid (a value close to one) means that the spectral envelope of the sound has a rising slope. For speech signals this means that the current frame most likely contains a fricative. The closer the value of the spectral centroid approaches one, the steeper is the slope of the spectral envelope or the more energy is concentrated in the higher frequency range.
(47) The spectral centroid is calculated according to:
(48)
where E(i) is the energy of QMF subband i and start is the QMF subband-index referring to 1 kHz. The copied QMF subbands are weighted with the factor att.sup.f:(t,xover+f)=Qr(t,xover+f)*att.sup.f; f=1 . . . nBands
where att=0.5*sp+0.5. Generally, att can be calculated using the following equation:
att=p(sp),
wherein p is a polynomial. Advantageously, the polynomial has degree 1:
att=a*sp+b,
wherein a, b or generally the polynomial coefficients are all between 0 and 1.
(49) Apart from the above equation, other equations having a comparable performance can be applied. Such other equations are as follows:
(50)
(51) In particular, the value a.sub.i should be so that the value is higher for higher i and, importantly, the values b.sub.i are lower than the values a.sub.i at least for the index i>1. Thus, a similar result, but with a different equation compared to the above equation, is obtained. Generally, ai, bi are monotonically increasing or decreasing values with i.
(52) Furthermore, reference is made to
(53) Courses of shaping factors over frequency indicated at 702 and 704 are for correspondingly increasing spectral distribution values. Thus, for item 704, the energy distribution value is greater than 0 but smaller than the energy distribution value for item 702 as indicated by parametric arrow 708.
(54)
(55) A controller 800 is for calculating the same smoothing information 802 for the plurality of subband signals of the enhancement frequency range or the core signal. Furthermore, the signal generator 200 is configured for smoothing the plurality of subband signals of the enhancement frequency range using the same smoothing information 802 or for smoothing the plurality of subband signals of the core signal using the same smoothing information 802. The output of the signal generator 200 is, in
(56) The controller 800 may be configured to calculate the smoothing information using a combined energy of the plurality of subband signals the core signal and the frequency enhancement signal or using only the frequency enhancement signal of the time portion. Furthermore, an average energy of the plurality of subband signals of the core signal and the frequency enhancement signal or of the core signal only of one or more earlier time portions preceding the current time portion is used. The smoothing information is a single correction factor for the plurality of subband signals of the enhancement frequency range in all bands and therefore the signal generator 200 is configured to apply the correction factor to the plurality of subband signals of the enhancement frequency range.
(57) As discussed in the context of
(58) The controller 800 is configured to calculate a smoothing intensity control value based on the core signal or the frequency enhanced signal of the current time portion and based on one or more preceding time portions and the controller 800 is then configured to calculate the smoothing information using the smoothing control value such that the smoothing intensity varies depending on a difference between an energy of the core signal or the frequency enhancement signal of the current time portion and the average energy of the core signal or the frequency enhancement signal of the one or more preceding time portions.
(59) Reference is made to
(60) Then, in step 902, a smoothing information is calculated based on the decision of the smoothing intensity of the step 900. This step 902 is also performed by the controller 800. Then, the signal generator 200 performs 904 comprising the application of the smoothing information to several bands, where one and the same smoothing information 802 is applied to these several bands either in the core signal or in the enhancement frequency range.
(61)
(62) In an embodiment, the temporal smoothing is performed in two steps:
(63) Decision about Smoothing Intensity.
(64) For the decision about the smoothing intensity, the stationary of the signal over time is evaluated. A possible way to perform this evaluation is to compare the energy of the current short-term window or QMF time-slot with averaged energy values of previous short-term windows or QMF time-slots. To save on complexity, this might be evaluated for the high-band portion only. The closer the compared energy values are, the lower should be the intensity of smoothing. This is reflected in a smoothing coefficient a, where 0<a1. The greater a, the higher is the intensity of smoothing.
(65) Application of Smoothing to the High-Band.
(66) The smoothing is applied for the high-band portion on a QMF time-slot base. Therefore, the high-band energy of the current time-slot Ecurr.sub.t is adapted to an averaged high-band energy Eavg.sub.t of one or multiple previous QMF time-slots:=aEcurr.sub.t+(1a)Eavg.sub.t
(67) Ecurr is calculated as the sum of high-band QMF energies in one timeslot:
(68)
(69) Eavg is the moving average over time of the energies:
(70)
where start and stop are the borders of the interval used for calculating the moving average.
(71) The real and imaginary QMF values used for synthesis are multiplied with a correction factor currFac:=currFacQr.sub.t,f
=currFacQi.sub.t,f which is derived from Ecurr and Eavg:
(72)
(73) The factor a may be fixed or dependent on the difference of the energy of Ecurr and Eavg.
(74) As already discussed in
(75) It is of advantage to use the same smoothing information such as the correction factor for all subbands in the enhancement range. However, it can also be an implementation, in which the same smoothing information is applied not for all bands but for a group of bands wherein such a group has at least two subbands.
(76)
(77) In order to implement the energy limitation procedure, the signal generator 200 is configured for performing an energy limitation in order to make sure that the frequency enhanced signal 140 obtained by the synthesis filterbank 300 is so that an energy of a higher band is, at the most, equal to an energy in a lower band or greater than the energy in a lower band, at the most, by a predefined threshold.
(78) The signal generator may be implemented to make sure that a higher QMF subband k must not exceed the energy at a QMF subband k1. Nevertheless, the signal generator 200 can also be implemented to allow a certain incremental increase which may advantageously be a threshold of 3 dB and a threshold may advantageously be 2 dB and even more advantageously 1 dB or even smaller. The predetermined threshold may be a constant for each band or dependent on the spectral centroid calculated previously. An advantageous dependence is that the threshold becomes lower, when the centroid approaches lower frequencies, i.e. becomes smaller, while the threshold can become greater the closer the centroid approaches higher frequencies or sp approaches 1.
(79) In a further implementation, the signal generator 200 is configured to examine a first subband signal in a first subband and to examine a subband signal in a second subband being adjacent in frequency to the first subband and having a center frequency being higher than a center frequency of the first subband and the signal generator will not limit the second subband signal, when an energy of the second subband signal is equal to an energy of the first subband signal or when the energy of the second subband signal is greater than the energy of the first subband signal by less than the predefined threshold.
(80) Furthermore, the signal generator is configured to form a plurality of processing operations in a sequence as illustrated, for example, in
(81) Furthermore, the signal generator is configured to perform spectral shaping 204 or temporal smoothing 206 before the energy limitation.
(82) In one embodiment, the signal generator 200 is configured to generate the plurality of subband signals of the enhancement signal by mirroring a plurality of subbands of the core signal.
(83) For the mirroring, advantageously the procedure of negating either the real part or the imaginary part is performed as discussed earlier.
(84) In a further embodiment, the signal generator is configured for calculating a correction factor limFac and this limitation factor limFac is then applied to the subband signals of the core or the enhancement frequency range as follows:
(85) Let E.sub.f be the energy of one band averaged over a time span stop-start:
(86)
(87) If this energy exceeds the average energy of the previous band by some level, the energy of this band is multiplied by a correction/limitation factor limFac:
(88)
and the real and imaginary QMF values are corrected by:=limFacQr.sub.t,f
=limFacQi.sub.t,f
(89) The factor or predetermined threshold fac may be a constant for each band or dependent on the spectral centroid calculated previously.
(90) {circumflex over (Q)}r.sub.t,f is the energy limited real part of subband signal at the subband indicated by f. {circumflex over (Q)}i.sub.t,f is the corresponding imaginary part of a subband signal subsequent to energy limitation in a subband f. Qr.sub.t,f and Qi.sub.t,f are corresponding real and imaginary parts of the subband signals before energy limitation such as the subband signals directly when any shaping or temporal smoothing is not performed or the shaped and temporally smoothed subband signals.
(91) In another implementation, the limitation factor limFac is calculated using the following equation:
(92)
(93) In this equation, E.sub.lim is the limitation energy, which is typically the energy of the lower band or the energy of the lower band incremented by the certain threshold fac. E.sub.f(i) is the energy of the current band f or i.
(94) Reference is made to
(95) As outlined,
(96) The energy limitation may apply for a single extension band. Then, the comparison or energy limitation is done using the energy of the highest core band. This may also apply for a plurality of extension bands. Then a lowest extension band is energy limited using the highest core band, and a highest extension band is energy limited with respect to the second to highest extension band.
(97)
(98) Although the present invention has been described in the context of block diagrams where the blocks represent actual or logical hardware components, the present invention can also be implemented by a computer-implemented method. In the latter case, the blocks represent corresponding method steps where these steps stand for the functionalities performed by corresponding logical or physical hardware blocks.
(99) Although some aspects have been described in the context of an apparatus, it is clear that these aspects also represent a description of the corresponding method, where a block or device corresponds to a method step or a feature of a method step. Analogously, aspects described in the context of a method step also represent a description of a corresponding block or item or feature of a corresponding apparatus. Some or all of the method steps may be executed by (or using) a hardware apparatus, like for example, a microprocessor, a programmable computer or an electronic circuit. In some embodiments, some one or more of the most important method steps may be executed by such an apparatus.
(100) The inventive transmitted or encoded signal can be stored on a digital storage medium or can be transmitted on a transmission medium such as a wireless transmission medium or a wired transmission medium such as the Internet.
(101) Depending on certain implementation requirements, embodiments of the invention can be implemented in hardware or in software. The implementation can be performed using a digital storage medium, for example a floppy disc, a DVD, a Blu-Ray, a CD, a ROM, a PROM, and EPROM, an EEPROM or a FLASH memory, having electronically readable control signals stored thereon, which cooperate (or are capable of cooperating) with a programmable computer system such that the respective method is performed. Therefore, the digital storage medium may be computer readable.
(102) Some embodiments according to the invention comprise a data carrier having electronically readable control signals, which are capable of cooperating with a programmable computer system, such that one of the methods described herein is performed.
(103) Generally, embodiments of the present invention can be implemented as a computer program product with a program code, the program code being operative for performing one of the methods when the computer program product runs on a computer. The program code may, for example, be stored on a machine readable carrier.
(104) Other embodiments comprise the computer program for performing one of the methods described herein, stored on a machine readable carrier.
(105) In other words, an embodiment of the inventive method is, therefore, a computer program having a program code for performing one of the methods described herein, when the computer program runs on a computer.
(106) A further embodiment of the inventive method is, therefore, a data carrier (or a non-transitory storage medium such as a digital storage medium, or a computer-readable medium) comprising, recorded thereon, the computer program for performing one of the methods described herein. The data carrier, the digital storage medium or the recorded medium are typically tangible and/or non-transitory.
(107) A further embodiment of the invention method is, therefore, a data stream or a sequence of signals representing the computer program for performing one of the methods described herein. The data stream or the sequence of signals may, for example, be configured to be transferred via a data communication connection, for example, via the internet.
(108) A further embodiment comprises a processing means, for example, a computer or a programmable logic device, configured to, or adapted to, perform one of the methods described herein.
(109) A further embodiment comprises a computer having installed thereon the computer program for performing one of the methods described herein.
(110) A further embodiment according to the invention comprises an apparatus or a system configured to transfer (for example, electronically or optically) a computer program for performing one of the methods described herein to a receiver. The receiver may, for example, be a computer, a mobile device, a memory device or the like. The apparatus or system may, for example, comprise a file server for transferring the computer program to the receiver.
(111) In some embodiments, a programmable logic device (for example, a field programmable gate array) may be used to perform some or all of the functionalities of the methods described herein. In some embodiments, a field programmable gate array may cooperate with a microprocessor in order to perform one of the methods described herein. Generally, the methods may be performed by any hardware apparatus.
(112) While this invention has been described in terms of several embodiments, there are alterations, permutations, and equivalents which will be apparent to others skilled in the art and which fall within the scope of this invention. It should also be noted that there are many alternative ways of implementing the methods and compositions of the present invention. It is therefore intended that the following appended claims be interpreted as including all such alterations, permutations, and equivalents as fall within the true spirit and scope of the present invention.