Transition from a transform coding/decoding to a predictive coding/decoding

Abstract

Methods and apparatus are provided for coding and decoding a digital audio signal. Decoding includes: decoding according to an inverse transform decoding of a previous frame of samples of the digital signal, which is received and coded according to a transform coding; and decoding according to a predictive decoding of a current frame of samples of the digital signal, which is received and coded according to a predictive coding. The predictive decoding of the current frame is a transition predictive decoding which does not use any adaptive dictionary arising from the previous frame. At least one state of the predictive decoding is reinitialized to a predetermined default value, and an add-overlap step combines a signal segment synthesized by predictive decoding of the current frame and a signal segment synthesized by inverse transform decoding, corresponding to a stored segment of the decoding of the previous frame.

Claims

1. A decoding method for decoding a digital audio signal, comprising the following acts performed by a decoding device: receiving the digital audio signal; decoding according to an inverse transform decoding of a previous frame of samples of the digital signal, received and coded according to a transform coding; decoding according to a predictive decoding of a current frame of samples of the digital signal, received and coded according to a predictive coding, wherein the predictive decoding of the current frame is a transition predictive decoding which does not use any adaptive dictionary arising from the previous frame; reinitializing at least one state of the predictive decoding to a predetermined default value; and an overlap-add act, which combines a signal segment synthesized by the predictive decoding of the current frame and a signal segment synthesized by inverse transform decoding, corresponding to a stored segment of the decoding of the previous frame.

2. The decoding method as claimed in claim 1, wherein the inverse transform decoding has a smaller processing delay than that of the predictive decoding and wherein a first segment of the current frame decoded by the predictive decoding is replaced with a segment arising from the inverse transform decoding of the previous frame, wherein a size of the segment arising from the inverse transform decoding of the previous frame corresponds to a delay shift between the predictive decoding and the inverse transform decoding, and wherein the segment arising from the inverse transform decoding of the previous frame is stored in memory during the decoding of the previous frame.

3. The decoding method as claimed in claim 1, wherein the signal segment synthesized by inverse transform decoding is corrected before the overlap-add act by application of an inverse window compensating a window previously applied to the signal segment synthesized by inverse transform decoding.

4. The decoding method as claimed in claim 1, wherein the signal segment synthesized by inverse transform decoding is resampled beforehand at a sampling frequency corresponding to the synthesized signal segment of the current frame.

5. The decoding method as claimed in claim 1, wherein a state of the predictive decoding is in a list of the following states: a state memory for a filter for resampling at an internal frequency of the predictive decoding; state memories for pre-emphasis/de-emphasis filters; coefficients of a linear prediction filter; a state memory of a synthesis filter; a memory of an adaptive dictionary; a state memory of a low-frequency post-filter; a quantization memory for fixed dictionary gain.

6. The decoding method as claimed in claim 5, wherein a calculation of coefficients of a linear prediction filter for the predictive decoding of the current frame is performed by decoding coefficients of a unique filter and by allotting identical coefficients to an end-of-frame linear prediction filter, a middle-of-frame linear prediction filter and a start-of-frame linear prediction filter.

7. The decoding method as claimed in claim 5, further comprising calculation of coefficients of a linear prediction filter for the predictive decoding of the current frame, which comprises the following acts: determination of decoded values of coefficients of a middle-of-frame filter by using decoded values of coefficients of an end-of-frame filter and predetermined reinitialization values of coefficients of a start-of-frame filter; replacement of the predetermined reinitialization values of coefficients of the start-of-frame filter by the determined decoded values of the coefficients of the middle-of-frame filter; determination of coefficients of a linear prediction filter for the predictive decoding of the current frame by using the determined decoded values of the coefficients of the end-of-frame filter, the middle-of-frame filter and the start-of-frame filter.

8. The decoding method as claimed in claim 5, wherein coefficients of a start-of-frame linear prediction filter are reinitialized to predetermined values corresponding to average values of long-term prediction filter coefficients and wherein linear prediction coefficients of a linear prediction filter for the predictive decoding of the current frame are determined by using the predetermined values and decoded values of coefficients of an end-of-frame filter.

9. A method for coding a digital audio signal, comprising the following acts performed by a coding device: coding a previous frame of samples of the digital signal according to a transform coding; reception of a current frame of samples of the digital signal to be coded according to a predictive coding, wherein the predictive coding of the current frame is a transition predictive coding which does not use any adaptive dictionary arising from the previous frame; and reinitializing at least one state of the predictive coding to a predetermined default value.

10. The coding method as claimed in claim 9, wherein coefficients of a linear prediction filter form part of at least one state of the predictive coding and calculation of coefficients of a linear prediction filter for the predictive coding of the current frame is performed by determination of values of coefficients of a single prediction filter, either of middle or of end of frame prediction filter and of allotting of identical values for coefficients of the start-of-frame prediction filter and end-or middle-of-frame prediction filter.

11. The coding method as claimed in claim 10, wherein at least one state of the predictive coding is coded in a direct manner.

12. The coding method as claimed in claim 9, wherein coefficients of a linear prediction filter form part of at least one state of the predictive coding and calculation of coefficients of a linear prediction filter for predictive coding of the current frame comprises the following acts: determination of coded values of coefficients of a middle-of-frame filter by using coded values of coefficients of an end-of-frame filter and predetermined reinitialization values of coefficients of a start-of-frame filter; replacement of the predetermined reinitialization values of coefficients of the start-of-frame filter by the determined coded values of the coefficients of the middle-of-frame filter; determination of the coefficients of the linear prediction filter for the predictive coding of the current frame by using the determined coded values of the coefficients of the end-of-frame filter, the middle-of-frame filter and the start-of-frame filter.

13. The coding method as claimed in claim 9, wherein coefficients of a linear prediction filter form part of at least one state of the predictive coding, coefficients of a start-of-frame linear prediction filter are reinitialized to predetermined values corresponding to average values of long-term prediction filter coefficients and wherein linear prediction coefficients of a linear prediction filter for predictive coding of the current frame are determined by using the predetermined values and coded values of coefficients of an end-of-frame filter.

14. A digital audio signal decoder, comprising: a processor; and a non-transitory computer-readable medium comprising instructions stored thereon, which when executed by the processor configure the digital audio signal decoder to perform acts comprising: an inverse transform decoding a previous frame of samples of the digital signal, received and coded according to a transform coding; predictive decoding a current frame of samples of the digital signal, received and coded according to a predictive coding, wherein the predictive decoding of the current frame is a transition predictive decoding which does not use any adaptive dictionary arising from the previous frame; reinitializing at least one state of the predictive decoding by a predetermined default value; and performing an overlap-add which combines a signal segment synthesized by predictive decoding of the current frame and a signal segment synthesized by inverse transform decoding, corresponding to a stored segment of the decoding of the previous frame.

15. A digital audio signal coder, comprising: a processor; and a non-transitory computer-readable medium comprising instructions stored thereon, which when executed by the processor configure the digital audio signal coder to perform acts comprising: transform coding a previous frame of samples of the digital signal; predictive coding a current frame of samples of the digital signal, wherein the predictive coding of the current frame is a transition predictive coding which does not use any adaptive dictionary arising from the previous frame; and reinitializing at least one state of the predictive coding by a predetermined default value.

16. A non-transitory computer-readable medium comprising a computer program stored thereon having instructions for execution of a decoding method when the instructions are executed by a processor of a decoding device, wherein the instructions configure the decoding device to perform acts of: receiving a digital audio signal; decoding according to an inverse transform decoding of a previous frame of samples of the digital audio signal, received and coded according to a transform coding; decoding according to a predictive decoding of a current frame of samples of the digital signal, received and coded according to a predictive coding, wherein the predictive decoding of the current frame is a transition predictive decoding which does not use any adaptive dictionary arising from the previous frame; reinitializing at least one state of the predictive decoding to a predetermined default value; and an overlap-add act, which combines a signal segment synthesized by the predictive decoding of the current frame and a signal segment synthesized by inverse transform decoding, corresponding to a stored segment of the decoding of the previous frame.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

(1) Other characteristics and advantages of the invention will become apparent on examining the description detailed hereinafter, and the appended figures among which:

(2) FIG. 1 illustrates a process of transition, between a transform coding and a predictive coding, of the state of the art and described previously;

(3) FIG. 2 illustrates the transition at the coder between a frame coded according to a transform coding and a frame coded according to a predictive coding, according to an implementation of the invention;

(4) FIG. 3 illustrates an embodiment of the coding method and of the coder according to the invention;

(5) FIG. 4 illustrates in the form of a flowchart the steps implemented in a particular embodiment, to determine the coefficients of the linear prediction filter during the predictive coding of the current frame, the previous frame having been coded according to a transform coding;

(6) FIG. 5 illustrates the transition at the decoder between a frame decoded according to an inverse transform decoding and a frame decoded according to a predictive decoding, according to an implementation of the invention;

(7) FIG. 6 illustrates an embodiment of the decoding method and of the decoder according to the invention;

(8) FIG. 7 illustrates in the form of a flowchart the steps implemented in an embodiment of the invention, to determine the coefficients of the linear prediction filter during the predictive decoding of the current frame, the previous frame having been decoded according to an inverse transform decoding;

(9) FIG. 8 illustrates the overlap-add step implemented during decoding according to an embodiment of the invention;

(10) FIG. 9 illustrates a particular mode of implementation of the transition between transform decoding and predictive decoding when they have different delays; and

(11) FIG. 10 illustrates a hardware embodiment of the coder or of the decoder according to the invention.

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS

(12) FIG. 2 illustrates in a schematic manner, the principle of coding during a transition between a transform coding and a predictive coding according to the invention. Considered here is a succession of audio frame to be coded either with a transform coder (FD) for example of MDCT type or with a predictive coder (LPD) for example of ACELP type; it will be noted that additional coding modes are possible without affecting the invention. In this example the transform coder (FD) uses windows with small delay of Tukey type (the invention is independent of the type of window used) and whose total length is equal to two frames (zero values inclusive) as represented in the figure.

(13) During coding, the windows of the FD coder are synchronized in such a way that the last non-zero part of the window (on the right) corresponds with the end of a new frame of the input signal. Note that the splitting into frames illustrated in FIG. 2 includes the lookahead (or future signal) and the frame actually coded is therefore typically shifted in time (delayed) as explained further on in relation to FIG. 5. When there is no transition, the coder performs the aliasing and DCT transformation procedure such as described in the state of the art (MDCT). Upon the arrival of the frame having to be coded by a coder of LPD type, the window is not applied, the states or memories corresponding to the filters of the LPD coder are reinitialized to predetermined values.

(14) It is considered here that the LPD coder is derived from the UIT-T G.718 coder whose CELP coding operates at an internal frequency of 12.8 kHz. The LPD coder according to the invention can operate at two internal frequencies 12.8 kHz or 16 kHz according to the bitrate.

(15) By state of the predictive coding (LPD), at least the following states are implied: The state memory of the resampling filter for the input frequency fs at the internal frequency of the CELP coding (12.8 or 16 kHz). It is considered here that the resampling can be performed as a function of the input frequency and internal frequency by FIR filter, filter bank or IIR filter, knowing that an embodiment of FIR type simplifies the use of the state memory which corresponds to the past input signal. The state memories of the pre-emphasis filter (1??z.sup.?1 with typically ?=0.68) and de-emphasis filter (1/(1??z.sup.?1)). The coefficients of the linear prediction filter at the end of the previous frame or their equivalent version in the domains such as the LSF (Line Spectral Frequencies) or ISF (Imittance Spectral Frequencies) domains. The state memory of the LPC synthesis filter typically of order 16 (in the preaccentuated domain). The memory of the adaptive dictionary (past CELP excitation). The state memory of the low-frequency post-filter (LPF) as defined in the standard UIT-G.718 (see clause 7.14.1.1 of the standard UIT-T G.718). The quantization memory for the fixed dictionary gain (when this quantization is performed with memory).

(16) FIG. 3 illustrates an embodiment of a coder and of a coding method according to the invention.

(17) The particular embodiment lies within the framework of transition between an FD transform codec using an MDCT and a predictive codec of ACELP type.

(18) After a first conventional step of placement in frame (E301) by a module 301, a decision module (dec.) determines whether the frame to be processed should be coded by ACELP predictive coding or by FD transform coding.

(19) In the case of the transform coding, a complete step of MDCT transform is performed (E302) by the transform coding entity 302. This step comprises inter alia a windowing with a low-lag window aligned as illustrated in FIG. 2, a step of aliasing and a step of transformation in the DCT domain. The frame FD is thereafter quantized in a step (E303) by a quantization module 303 and then the data thus encoded are written in the bitstream at E305, by the bitstream construction module 305.

(20) The case of the transition from a predictive coding to a transform coding is not dealt with in this example since it does not form the subject of the present invention.

(21) If the decision step (dec.) chooses the ACELP predictive coding, then: Either the previous frame (last ACELP) had also been encoded by the ACELP coding entity 304, the ACELP coding (E304) then continues while updating the memories or states of the predictive coding. We do not deal here with the problem of switching of internal sampling frequencies of the CELP coding (from 12.8 to 16 kHz and vice-versa). The coded and quantized information is written in the bitstream in a step E305. Or the previous frame (last MDCT) had been encoded by the transform coding entity 302, at E302, in this case, the memories or states of the ACELP predictive coding are reinitialized in a step (E306) to default values (not necessarily zero) predetermined in advance. This reinitialization step is implemented by the reinitialization module 306, for at least one state of the predictive coding.

(22) A step of predictive coding for the current frame is then implemented at E308 by a predictive coding entity 308.

(23) The coded and quantized information is written in the bitstream in step E305.

(24) This predictive coding E308 can, in a particular embodiment, be a transition coding such as defined by the name TC mode in the standard UIT-T G.718, in which the coding of the excitation is direct and does not use any adaptive dictionary arising from the previous frame. A coding, which is independent of the previous frame, of the excitation is then carried out. This embodiment allows the predictive coders of LPD type to stabilize much more rapidly (with respect to a conventional CELP coding which would use an adaptive dictionary which would be set to zero). This further simplifies the implementation of the transition according to the invention.

(25) In a variant of the invention, it will be possible for the coding of the excitation not to be in a transition mode but for it to use a CELP coding in a manner similar to G.718 and possibly using an adaptive dictionary (without forcing or limiting the classification) or a conventional CELP coding with adaptive and fixed dictionaries. This variant is however less advantageous since, the adaptive dictionary not having been recalculated and having been set to zero, the coding will be sub-optimal.

(26) In another variant, the CELP coding in the transition frame by TC mode will be able to be replaced with any other type of coding which is independent of the previous frame, for example by using the coding model of iLBC type.

(27) In a particular embodiment, a step E307 of calculating the coefficients of the linear prediction filter for the current frame is performed by the calculation module 307.

(28) Several modes of calculation of the coefficients of the linear prediction filter are possible for the current frame. It is considered here that the predictive coding (block 304) performs two linear prediction analyses per frame as in the standard G.718, with a coding of the LPC coefficients in the form of ISF (or LSF in an equivalent manner) obtained at the end of frame (NEW) and a very reduced bitrate coding of the LPC coefficients obtained in the middle of the frame (MID), with an interpolation by sub-frame between the LPC coefficients of the end of previous frame (OLD), and those of the current frame (MID and NEW).

(29) In a first embodiment, the prediction coefficients in the previous frame (OLD) of FD type are not known since no LPC coefficient is coded in the FD coder. One then chooses to code a single coefficient set of the linear prediction filter which corresponds either to the middle of the frame (MID) or else to the end of the frame (NEW). This choice may be for example made according to a classification of the signal to be coded. For a stable signal, it will be possible to choose the middle-of-frame filter. An arbitrary choice can also be made; in the case where the choice pertains to the LPC coefficients in the middle of the frame, in a variant, the interpolation of the LPC coefficients (in the ISP (Imittance Spectral Pairs) domain or LSP (Line Spectral Pairs) domain) will be able to be modified in the second LPD frame which follows the transition LPD frame.

(30) On the basis of these coded values obtained, identical coded values are allotted for the prediction filter coefficients for frame start (OLD) and for frame end or middle according to the choice which has been made. Indeed, the LPC coefficients of the previous frame (OLD) not being known, it is not possible to code the frame middle (MID) LPC coefficients as in G.718. It will be noted that in this variant the reinitialization of the LPC coefficients (OLD) is not absolutely necessary, since these coefficients are not used. In this case, the coefficients used in each sub-frame are fixed in a manner identical to the value coded in the frame.

(31) Advantageously, the bits which could be reserved for the coding of the set of frame middle (MID) or frame start LPC coefficients are used for example to code in a direct manner at least one state of the predictive coding, for example the memory of the de-emphasis filter.

(32) In a second possible embodiment, the steps illustrated in FIG. 4 are implemented. A first step E401 is the initialization of the coefficients of the prediction filter and of the equivalent ISF or LSF representations according to the implementation of step E306 of FIG. 3, that is to say to predetermined values, for example according to the long-term average value over an a priori learning base for the LSP coefficients. Step E402 codes the coefficients of the end-of-frame filter (LSP NEW) and the coded values obtained (LEP NEW Q) as well as the predetermined reinitialization values of the coefficients of the start-of-frame filter (LSP OLD) are used in E403 to code the coefficients of the middle-of-frame prediction filter (LSP MID). A step of replacement E404 of the values of start-of-frame coefficients (LSP OLD) by the coded values of the middle-of-frame coefficients (LSP MID Q), is performed. Step E405 makes it possible to determine the coefficients of the linear prediction filter for the current frame on the basis of these values thus coded (LSP OLD, LSP MID Q, LSP NEW Q).

(33) In a third possible embodiment, the coefficients of the linear prediction filter for the previous frame (LSP OLD) are initialized to a value which is already available free of charge in an FD coder variant using a spectral envelope of LPC type. In this case, it will be possible to use a normal coding such as used in G.718, the sub-frame-based linear prediction coefficients being calculated as an interpolation between the values of the prediction filters OLD, MID and NEW, this operation thus allows the LPD coder to obtain without additional analysis a good estimation of the LPC coefficients in the previous frame.

(34) In other variants of the invention, the coding LPD will be able by default to code just a set of LPC coefficients (NEW), the previous variant embodiments are simply adapted to take into account that no set of coefficients is available in the frame middle (MID).

(35) In a variant embodiment of the invention, the initialization of the states of the predictive coding can be performed with default values predetermined in advance which can for example correspond to various types of frame to be encoded (for example the initialization values can be different if the frame comprises a signal of voiced or unvoiced type).

(36) FIG. 5 illustrates in a schematic manner, the principle of decoding during a transition between a transform decoding and a predictive decoding according to the invention.

(37) Considered here is a succession of audio frame to be decoded either with a transform decoder (FD) for example of MDCT type or with a predictive decoder (LPD) for example of ACELP type. In this example the transform decoder (FD) uses small-delay synthesis windows of Tukey type (the invention is independent of the type of window used) and whose total length is equal to two frames (zero values inclusive) as represented in the figure.

(38) Within the meaning of the invention, after the decoding of a frame coded with an FD coder, an inverse DCT transformation is applied to the decoded frame. The latter is de-aliased and then the synthesis window is applied to the de-aliased signal. The synthesis windows of the FD coder are synchronized in such a way that the non-zero part of the window (on the left) corresponds with a new frame. Thus, the frame can be decoded up to the point A since the signal does not have any temporal aliasing before this point.

(39) At the moment of the arrival of the LPD frame, as at the coder, the states or memories of the predictive decoding are reinitialized to predetermined values.

(40) By state of the predictive decoding (LPD), at least the following states are implied: The state memory of the resampling filter for the internal frequency of the CELP decoding (12.8 or 16 kHz) at the output frequency fs. It is considered here that the resampling can be performed as a function of the input frequency and internal frequency by FIR filter, filter bank or IIR filter, knowing that an embodiment of FIR type simplifies the use of the state memory which corresponds to the past input signal. The state memories of the de-emphasis filter (1/(1??z.sup.?1)). The coefficients of the linear prediction filter at the end of the previous frame or their equivalent version in the domains such as the LSF (Line Spectral Frequencies) or ISF (Imittance Spectral Frequencies) domains. The state memory of the LPC synthesis filter typically of order 16 (in the preaccentuated domain). The memory of the adaptive dictionary (past excitation). The state memory of the low-frequency post-filter (LPF) as defined in the standard UIT-G.718 (see clause 7.14.1.1 of the standard UIT-T G.718). The quantization memory for the fixed dictionary gain (when this quantization is performed with memory).

(41) FIG. 6 illustrates an embodiment of a decoder and of a decoding method according to the invention.

(42) The particular embodiment lies within the framework of transition between an FD transform codec using an MDCT and a predictive codec of ACELP type.

(43) After a first conventional step of reading in the binary train (E601) by a module 601, a decision module (dec.) determines whether the frame to be processed should be decoded by ACELP predictive decoding or by FD transform decoding.

(44) In the case of an MDCT transform decoding, a step of decoding E602 by the transform decoding entity 602, makes it possible to obtain the frame in the transformed domain. The step can also contain a step of resampling at the sampling frequency of the ACELP decoder. This step is followed by an inverse MDCT transformation E603 comprising an inverse DCT transformation, a temporal de-aliasing, and the application of a synthesis window and of a step of overlap-add with the previous frame, as described subsequently with reference to FIG. 8.

(45) The part for which the temporal aliasing has been canceled is placed in a frame in a step E605 by the frame placement module 605. The part which comprises a temporal aliasing is kept in memory (MDCT Mem.) to carry out a step of overlap-add at E609 by the processing module 609 with the next frame, if any, decoded by the FD core. In a variant, the stored part of the MDCT decoding which is used for the overlap-add step, does not comprise any temporal aliasing, for example in the case where a sufficiently significant temporal shift exists between the MDCT decoding and the CELP decoding.

(46) This step is illustrated in FIG. 8. It is seen in this figure that a temporal discontinuity exists between the decoding arising from the FD and that from the LPD. Step E609 uses the memory of the transform coder (MDCT Mem.), such as described hereinabove, that is to say the signal decoded after the point A but which comprises aliasing (in the case illustrated).

(47) Preferentially, the signal is used up to the point B which is the point of aliasing of the transform. In a particular embodiment, this signal is compensated beforehand by the inverse of the window previously applied over the segment AB. Thus, before the overlap-add step the segment AB is corrected by the application of an inverse window compensating the windowing previously applied to the segment. The segment is therefore no longer windowed and its energy is close to that of the original signal.

(48) The two segments AB, that arising from the transform decoding and that arising from the predictive decoding, are thereafter weighted and summed so as to obtain the final signal AB. The weighting functions preferentially have a sum equal to 1 (of the quadratic sinusoidal or linear type for example). Thus, the overlap-add step combines a signal segment synthesized by predictive decoding of the current frame and a signal segment synthesized by inverse transform decoding, corresponding to a stored segment of the decoding of the previous frame.

(49) In another particular embodiment, in the case where the resampling has not yet been performed (at E602 for example), the signal segment synthesized by inverse transform decoding of FD type is resampled beforehand at the sampling frequency corresponding to the decoded signal segment of the current frame of LPD type. This resampling of the MDCT memory will be able to be done with or without delay with conventional techniques by filter of FIR type, filter bank, IIR filter or indeed by using splines.

(50) In the converse case, if the FD and LPD coding modes operate at different internal sampling frequencies, it will be possible in an alternative to resample the synthesis of the CELP coding (optionally post-processed with in particular the addition of an estimated or coded high band) and to apply the invention. This resampling of the synthesis of the LPD coder will be able to be done with or without delay with conventional techniques by filter of FIR type, filter bank, IIR filter or indeed by using splines.

(51) This makes it possible to perform a transition without defect in the case where the sampling frequency of the transform decoding is different from that of the predictive decoding.

(52) In a particular embodiment, it is possible to apply an intermediate delay step (E604) so as to temporally align the two decoders if the FD decoder has less lag than the CELP (LPD) decoder. A signal part whose size corresponds to the lag between the two decoders is then stored in memory (Mem.delay).

(53) FIG. 9 depicts this illustrative case. The embodiment here proposes to advantageously exploit this difference in lag D so as to replace the first segment D arising from the LPD predictive decoding with that arising from the FD transform decoding and then to undertake the overlap-add step (E609) such as described previously, on the segment AB. Thus, when the inverse transform decoding has a smaller processing delay than that of the predictive decoding, the first segment of current frame decoded by predictive decoding is replaced with a segment arising from the decoding of the previous frame corresponding to the delay shift and placement in memory during the decoding of the previous frame.

(54) In FIG. 6, if the decision (dec.) indicates that it is necessary to do an ACELP predictive decoding, then: Either the last decoded frame, previous frame (last ACELP), was also decoded according to an ACELP predictive decoding by the ACELP decoding entity 603, the predictive decoding then continues in a step (E603), the audio frame is thus produced at E605. Or the previous frame (last MDCT) has been decoded by the transform decoding entity 602, at E602, in this case, a step (E606) of reinitialization of the states of the ACELP predictive decoding is applied. This reinitialization step is implemented by the reinitialization module 606, for at least one state of the predictive decoding. The reinitialization values are default values predetermined in advance (not necessarily zero). The initialization of the states of the LPD decoding can be done with default values predetermined in advance which may for example correspond to various types of frame to be decoded as a function of what was done during the encoding.

(55) A step of predictive decoding for the current frame is then implemented at E608 by a predictive decoding entity 608, before the overlap-add step (E609) described previously. The step can also contain a step of resampling at the sampling frequency of the MDCT decoder.

(56) This predictive coding E608 can, in a particular embodiment, be a transition predictive decoding, if this solution has been chosen at the encoder, in which the decoding of the excitation is direct and does not use any adaptive dictionary. In this case, the memory of the adaptive dictionary does not need to be reinitialized.

(57) A non-predictive decoding of the excitation is then carried out. This embodiment allows predictive decoders of LPD type to stabilize much more rapidly since in this case it does not use the memory of the adaptive dictionary which had been previously reinitialized. This further simplifies the implementation of the transition according to the invention. When decoding the current frame, the predictive decoding of the long-term excitation is replaced with a non-predictive decoding of the excitation.

(58) In a particular embodiment, a step E607 of calculating the coefficients of the linear prediction filter for the current frame is performed by the calculation module 607.

(59) Several modes of calculation of the coefficients of the linear prediction filter are possible for the current frame.

(60) In a first embodiment, the prediction coefficients in the previous frame (OLD) of FD type are not known since no LPC coefficient is coded in the FD coder and the values have been reinitialized to zero. One then chooses to decode coefficients of a unique linear prediction filter, i.e. that corresponding to the end-of-frame prediction filter (NEW), or that corresponding to the middle-of-frame prediction filter (MID). Identical coefficients are thereafter allotted to the end-, middle- and start-of-frame linear prediction filter.

(61) In a second possible embodiment, the steps illustrated in FIG. 7 are implemented. A first step E701 is the initialization of the coefficients of the prediction filter (LSP OLD) according to the implementation of step E606 of FIG. 6. Step E702 decodes the coefficients of the end-of-frame filter (LSP NEW) and the decoded values obtained (LSP NEW) as well as the predetermined reinitialization values of the coefficients of the start-of-frame filter (LSP OLD) are used jointly at E703 to decode the coefficients of the middle-of-frame prediction filter (LSP MID). A step E704 of replacement of the values of start-of-frame coefficients (LSP OLD) by the decoded values of the middle-of-frame coefficients (LSP MID) is performed. Step E705 makes it possible to determine the coefficients of the linear prediction filter for the current frame on the basis of these values thus decoded (LSP OLD, LSP MID, LSP NEW).

(62) In a third possible embodiment, the coefficients of the linear prediction filter for the previous frame (LSP OLD) are initialized to a predetermined value, for example according to the long-term average value of the LSP coefficients. In this case, it will be possible to use a normal decoding such as used in G.718, the sub-frame-based linear prediction coefficients being calculated as an interpolation between the values of the prediction filters OLD, MID and NEW. This operation thus allows the LPD coder to stabilize more rapidly.

(63) With reference to FIG. 10, a hardware device adapted to embody a coder or a decoder according to an embodiment of the present invention is described.

(64) This coder or decoder can be integrated into a communication terminal, a communication gateway or any type of equipment such as a set top box type decoder, or audio stream reader.

(65) This device DISP comprises an input for receiving a digital signal which in the case of the coder is an input signal x(n) and in the case of the decoder, the binary train bst.

(66) The device also comprises a digital signals processor PROC adapted for carrying out coding/decoding operations in particular on a signal originating from the input E.

(67) This processor is linked to one or more memory units MEM adapted for storing information necessary for driving the device in respect of coding/decoding. For example, these memory units comprise instructions for the implementation of the decoding method described hereinabove and in particular for implementing the steps of decoding according to an inverse transform decoding of a previous frame of samples of the digital signal, received and coded according to a transform coding, of decoding according to a predictive decoding of a current frame of samples of the digital signal, received and coded according to a predictive coding, a step of reinitialization of at least one state of the predictive decoding to a predetermined default value and an overlap-add step which combines a signal segment synthesized by predictive decoding of the current frame and a signal segment synthesized by inverse transform decoding, corresponding to a stored segment of the decoding of the previous frame.

(68) When the device is of coder type, these memory units comprise instructions for the implementation of the coding method described hereinabove and in particular for implementing the steps of coding a previous frame of samples of the digital signal according to a transform coding, of receiving a current frame of samples of the digital signal to be coded according to a predictive coding, a step of reinitialization of at least one state of the predictive coding to a predetermined default value.

(69) These memory units can also comprise calculation parameters or other information.

(70) More generally, a storage means, readable by a processor, possibly integrated into the coder or into the decoder, optionally removable, stores a computer program implementing a decoding method and/or a coding method according to the invention. FIGS. 3 and 6 may for example illustrate the algorithm of such a computer program.

(71) The processor is also adapted for storing results in these memory units. Finally, the device comprises an output S linked to the processor so as to provide an output signal which in the case of the coder is a signal in the form of a binary train bst and in the case of the decoder, an output signal {circumflex over (x)}(n).

(72) Although the present disclosure has been described with reference to one or more examples, workers skilled in the art will recognize that changes may be made in form and detail without departing from the scope of the disclosure and/or the appended claims.

Transition from a transform coding/decoding to a predictive coding/decoding

Assignee

Inventors

Cpc classification

Classification Explorer

G10L19/04

PHYSICS

Classification Explorer

G10L19/26

PHYSICS

Classification Explorer

G10L19/20

PHYSICS

Classification Explorer

G10L19/0212

PHYSICS

Classification Explorer

G10L19/173

PHYSICS

Classification Explorer

G10L19/022

PHYSICS

International classification

Classification Explorer

G10L19/20

PHYSICS

Classification Explorer

G10L19/02

PHYSICS

Classification Explorer

G10L19/022

PHYSICS

Classification Explorer

G10L19/16

PHYSICS

Classification Explorer

G10L19/26

PHYSICS

Classification Explorer

G10L19/04

PHYSICS

Abstract

Claims

Description