Audio decoding device, audio coding device, audio decoding method, audio coding method, audio decoding program, and audio coding program

Abstract

An objective of the present invention is to correct a temporal envelope shape of a decoded signal with a small information volume and to reduce perceptible distortions. An audio decoding device which decodes a coded audio signal and outputs an audio signal comprises: a coded series analysis unit that analyzes a coded series which contains the coded audio signal; an audio decoding unit that receives from the coded series analysis unit the coded series which contains the coded audio signal and decodes same, obtaining an audio signal; a temporal envelope shape establishment unit that receives information from the coded series analysis unit and/or the audio decoding unit, and, on the basis of the information, establishes a temporal envelope shape of the decoded audio signal; and a temporal envelope correction unit that, on the basis of the temporal envelope shape which is established with the temporal envelope shape establishment unit, corrects the temporal envelope shape of the decoded audio signal and outputs same.

Claims

1. A speech encoding device that encodes an input speech signal to output a code sequence, the speech encoding device comprising: a speech encoder that encodes the speech signal; a temporal envelope information encoder that calculates and encodes temporal envelope information of the speech signal; and a code sequence multiplexer that multiplexes a code sequence including the speech signal obtained by the speech encoder and a code sequence of the temporal envelope information obtained by the temporal envelope information encoder to generate an encoded sequence for output by the speech encoding device, wherein the temporal envelope information is generated based on a ratio between an arithmetic mean and geometric mean of a temporal envelope of a high frequency signal of the speech signal, and the temporal envelope information included in the encoded sequence indicates whether a temporal envelope shape is flat or not, wherein the temporal envelope information is represented by flag.

2. The speech encoding device according to claim 1 wherein the temporal envelope information is represented by one bit.

3. A speech encoding method executed by a speech encoding device that encodes an input speech signal to output a code sequence, the speech encoding method comprising: a speech encoding step of encoding the speech signal; a temporal envelope information encoding step of calculating and encoding temporal envelope information of the speech signal; and a code sequence multiplexing step of multiplexing a code sequence including the speech signal obtained in the speech encoding step and a code sequence of the temporal envelope information obtained in the temporal envelope information encoding step to generate an encoded sequence; and outputting the encoded sequence for receipt by a decoder, wherein the temporal envelope information is generated based on a ratio between an arithmetic mean and geometric mean of a temporal envelope of a high frequency signal of the speech signal, and the temporal envelope information output in the encoded sequence indicating whether a temporal envelope shape is flat or not, wherein the temporal envelope information is represented by flag.

4. The speech encoding method according to claim 3 wherein the temporal envelope information is represented by one bit.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

(1) FIG. 1 is a figure showing the configuration of the speech decoding device 1 according to a first embodiment.

(2) FIG. 2 is a flow chart showing the operation of the speech decoding device according to the first embodiment.

(3) FIG. 3 is a figure showing the configuration of the speech encoding device 2 according to the first embodiment.

(4) FIG. 4 is a flow chart showing the operation of the speech encoding device 2 according to the first embodiment.

(5) FIG. 5 is a figure showing the configuration of the speech decoding device 100 according to a second embodiment.

(6) FIG. 6 is a flow chart showing the operation of the speech decoding device according to the second embodiment.

(7) FIG. 7 is a figure showing the configuration of the speech encoding device 200 according to the second embodiment.

(8) FIG. 8 is a flow chart showing the operation of the speech encoding device 200 according to the second embodiment.

(9) FIG. 9 is a figure showing the configuration of the first modification 100A of the speech decoding device according to the second embodiment.

(10) FIG. 10 is a flow chart showing the operation of the first modification 100A of the speech decoding device according to the second embodiment.

(11) FIG. 11 is a figure showing the configuration of the first modification 100A of the speech encoding device according to the second embodiment.

(12) FIG. 12 is a figure showing the configuration of the speech decoding device 110 according to a third embodiment.

(13) FIG. 13 is a flow chart showing the operation of the speech decoding device according to the third embodiment.

(14) FIG. 14 is a figure showing the configuration of the speech encoding device 210 according to the third embodiment.

(15) FIG. 15 is a flow chart showing the operation of the speech encoding device 210 according to the third embodiment.

(16) FIG. 16 is a figure showing the configuration of the speech decoding device 120 according to a fourth embodiment.

(17) FIG. 17 is a flow chart showing the operation of the speech decoding device 120 according to the fourth embodiment.

(18) FIG. 18 is a figure showing the configuration of the speech encoding device 220 according to the fourth embodiment.

(19) FIG. 19 is a flow chart showing the operation of the speech encoding device 220 according to the fourth embodiment.

(20) FIG. 20 is a figure showing the configuration of the first modification 120A of the speech decoding device according to the fourth embodiment.

(21) FIG. 21 is a flow chart showing the operation of the first modification 120A of the speech decoding device according to the fourth embodiment.

(22) FIG. 22 is a figure showing the configuration of the second modification 120B of the speech decoding device according to the fourth embodiment.

(23) FIG. 23 is a flow chart showing the operation of the second modification 120B of the speech decoding device according to the fourth embodiment.

(24) FIG. 24 is a figure showing the configuration of the 3rd modification 120C of the speech decoding device according to the fourth embodiment.

(25) FIG. 25 is a flow chart showing the operation of the 3rd modification 120C of the speech decoding device according to the fourth embodiment.

(26) FIG. 26 is a figure showing the configuration of the 4th modification 120D of the speech decoding device according to the fourth embodiment.

(27) FIG. 27 is a flow chart showing the operation of the 4th modification 120D of the speech decoding device according to the fourth embodiment.

(28) FIG. 28 is a figure showing the configuration of the fifth modification 120E of the speech decoding device according to the fourth embodiment.

(29) FIG. 29 is a flow chart showing the operation of the fifth modification 120E of the speech decoding device according to the fourth embodiment.

(30) FIG. 30 is a figure showing the configuration of the sixth modification 120F of the speech decoding device according to the fourth embodiment.

(31) FIG. 31 is a flow chart showing the operation of the sixth modification 120F of the speech decoding device according to the fourth embodiment.

(32) FIG. 32 is a figure showing the configuration of the seventh modification 120G of the speech decoding device according to the fourth embodiment.

(33) FIG. 33 is a flow chart showing the operation of the seventh modification 120G of the speech decoding device according to the fourth embodiment.

(34) FIG. 34 is a figure showing the configuration of the eighth modification 120H of the speech decoding device according to the fourth embodiment.

(35) FIG. 35 is a flow chart showing the operation of the eighth modification 120H of the speech decoding device according to the fourth embodiment.

(36) FIG. 36 is a figure showing the configuration of the ninth modification 120I of the speech decoding device according to the fourth embodiment.

(37) FIG. 37 is a flow chart showing the operation of the ninth modification 120I of the speech decoding device according to the fourth embodiment.

(38) FIG. 38 is a figure showing the configuration of the tenth modification 120J of the speech decoding device according to the fourth embodiment.

(39) FIG. 39 is a flow chart showing the operation of the tenth modification 120J of the speech decoding device according to the fourth embodiment.

(40) FIG. 40 is a figure showing the configuration of the 11th modification 120K of the speech decoding device according to the fourth embodiment.

(41) FIG. 41 is a flow chart showing the operation of the 11th modification 120K of the speech decoding device according to the fourth embodiment.

(42) FIG. 42 is a figure showing the configuration of the 12th modification 120L of the speech decoding device according to the fourth embodiment.

(43) FIG. 43 is a flow chart showing the operation of the 12th modification 120L of the speech decoding device according to the fourth embodiment.

(44) FIG. 44 is a figure showing the configuration of the 13th modification 120M of the speech decoding device according to the fourth embodiment.

(45) FIG. 45 is a flow chart showing the operation of the 13th modification 120M of the speech decoding device according to the fourth embodiment.

(46) FIG. 46 is a figure showing the configuration of the 14th modification 120N of the speech decoding device according to the fourth embodiment.

(47) FIG. 47 is a flow chart showing the operation of the 14th modification 120N of the speech decoding device according to the fourth embodiment.

(48) FIG. 48 is a figure showing the configuration of the speech decoding device 130 according to a fifth embodiment.

(49) FIG. 49 is a flow chart showing the operation of the speech decoding device according to the fifth embodiment.

(50) FIG. 50 is a figure showing the configuration of the speech encoding device 230 according to the fifth embodiment.

(51) FIG. 51 is a flow chart showing the operation of the speech encoding device 230 according to the fifth embodiment.

(52) FIG. 52 is a figure showing the configuration of the speech decoding device 140 according to the sixth embodiment.

(53) FIG. 53 is a flow chart showing the operation of the speech decoding device according to the sixth embodiment.

(54) FIG. 54 is a figure showing the configuration of the speech encoding device 240 according to the sixth embodiment.

(55) FIG. 55 is a flow chart showing the operation of the speech encoding device 240 according to the sixth embodiment.

(56) FIG. 56 is a figure showing the configuration of the first modification 140A of the speech decoding device according to the sixth embodiment.

(57) FIG. 57 is a flow chart showing the operation of the first modification 140A of the speech decoding device according to the sixth embodiment.

(58) FIG. 58 is a figure showing the configuration of the second modification 140B of the speech decoding device according to the sixth embodiment.

(59) FIG. 59 is a figure showing the configuration of the 3rd modification 140C of the speech decoding device according to the sixth embodiment.

(60) FIG. 60 is a flow chart showing the operation of the 3rd modification 140C of the speech decoding device according to the sixth embodiment.

(61) FIG. 61 is a figure showing the configuration of the 4th modification 140D of the speech decoding device according to the sixth embodiment.

(62) FIG. 62 is a flow chart showing the operation of the 4th modification 140D of the speech decoding device according to the sixth embodiment.

(63) FIG. 63 is a figure showing the configuration of the fifth modification 140E of the speech decoding device according to the sixth embodiment.

(64) FIG. 64 is a flow chart showing the operation of the fifth modification 140E of the speech decoding device according to the sixth embodiment.

(65) FIG. 65 is a figure showing the configuration of the sixth modification 140F of the speech decoding device according to the sixth embodiment.

(66) FIG. 66 is a flow chart showing the operation of the sixth modification 140F of the speech decoding device according to the sixth embodiment.

(67) FIG. 67 is a figure showing the configuration of the seventh modification 140G of the speech decoding device according to the sixth embodiment.

(68) FIG. 68 is a flow chart showing the operation of the seventh modification 140G of the speech decoding device according to the sixth embodiment.

(69) FIG. 69 is a figure showing the configuration of the eighth modification 140H of the speech decoding device according to the sixth embodiment.

(70) FIG. 70 is a flow chart showing the operation of the eighth modification 140H of the speech decoding device according to the sixth embodiment.

(71) FIG. 71 is a figure showing the configuration of the ninth modification 140I of the speech decoding device according to the sixth embodiment.

(72) FIG. 72 is a flow chart showing the operation of the ninth modification 140I of the speech decoding device according to the sixth embodiment.

(73) FIG. 73 is a figure showing the configuration of the tenth modification 140J of the speech decoding device according to the sixth embodiment.

(74) FIG. 74 is a flow chart showing the operation of the tenth modification 140J of the speech decoding device according to the sixth embodiment.

(75) FIG. 75 is a figure showing the configuration of the 11th modification 140K of the speech decoding device according to the sixth embodiment.

(76) FIG. 76 is a flow chart showing the operation of the 11th modification 140K of the speech decoding device according to the sixth embodiment.

(77) FIG. 77 is a figure showing the configuration of the 12th modification 140L of the speech decoding device according to the sixth embodiment.

(78) FIG. 78 is a flow chart showing the operation of the 12th modification 140L of the speech decoding device according to the sixth embodiment.

(79) FIG. 79 is a figure showing the configuration of the 13th modification 140M of the speech decoding device according to the sixth embodiment.

(80) FIG. 80 is a flow chart showing the operation of the 13th modification 140M of the speech decoding device according to the sixth embodiment.

(81) FIG. 81 is a figure showing the configuration of the 14th modification 140N of the speech decoding device according to the sixth embodiment.

(82) FIG. 82 is a flow chart showing the operation of the 14th modification 140N of the speech decoding device according to the sixth embodiment.

(83) FIG. 83 is a figure showing the configuration of the speech decoding device 150 according to a seventh embodiment.

(84) FIG. 84 is a flow chart showing the operation of the speech decoding device according to the seventh embodiment.

(85) FIG. 85 is a figure showing the configuration of the speech encoding device 250 according to the seventh embodiment.

(86) FIG. 86 is a flow chart showing the operation of the speech encoding device 250 according to the seventh embodiment.

(87) FIG. 87 is a figure showing the configuration of the first modification 150A of the speech decoding device according to the seventh embodiment.

(88) FIG. 88 is a flow chart showing the operation of the first modification 150A of the speech decoding device according to the seventh embodiment.

(89) FIG. 89 is a figure showing the configuration of the second modification 150B of the speech decoding device according to the seventh embodiment.

(90) FIG. 90 is a figure showing the configuration of the 3rd modification 150C of the speech decoding device according to the seventh embodiment.

(91) FIG. 91 is a flow chart showing the operation of the 3rd modification 150C of the speech decoding device according to the seventh embodiment.

(92) FIG. 92 is a figure showing the configuration of the 4th modification 150D of the speech decoding device according to the seventh embodiment.

(93) FIG. 93 is a flow chart showing the operation of the 4th modification 150D of the speech decoding device according to the seventh embodiment.

(94) FIG. 94 is a figure showing the configuration of the fifth modification 150E of the speech decoding device according to the seventh embodiment.

(95) FIG. 95 is a flow chart showing the operation of the fifth modification 150E of the speech decoding device according to the seventh embodiment.

(96) FIG. 96 is a figure showing the configuration of the sixth modification 150F of the speech decoding device according to the seventh embodiment.

(97) FIG. 97 is a flow chart showing the operation of the sixth modification 150F of the speech decoding device according to the seventh embodiment.

(98) FIG. 98 is a figure showing the configuration of the seventh modification 150G of the speech decoding device according to the seventh embodiment.

(99) FIG. 99 is a flow chart showing the operation of the seventh modification 150G of the speech decoding device according to the seventh embodiment.

(100) FIG. 100 is a figure showing the configuration of the eighth modification 150H of the speech decoding device according to the seventh embodiment.

(101) FIG. 101 is a flow chart showing the operation of the eighth modification 150H of the speech decoding device according to the seventh embodiment.

(102) FIG. 102 is a figure showing the configuration of the ninth modification 150I of the speech decoding device according to the seventh embodiment.

(103) FIG. 103 is a flow chart showing the operation of the ninth modification 150I of the speech decoding device according to the seventh embodiment.

(104) FIG. 104 is a figure showing the configuration of the tenth modification 150J of the speech decoding device according to the seventh embodiment.

(105) FIG. 105 is a flow chart showing the operation of the tenth modification 150J of the speech decoding device according to the seventh embodiment.

(106) FIG. 106 is a figure showing the configuration of the 11th modification 150K of the speech decoding device according to the seventh embodiment.

(107) FIG. 107 is a flow chart showing the operation of the 11th modification 150K of the speech decoding device according to the seventh embodiment.

(108) FIG. 108 is a figure showing the configuration of the 12th modification 150L of the speech decoding device according to the seventh embodiment.

(109) FIG. 109 is a flow chart showing the operation of the 12th modification 150L of the speech decoding device according to the seventh embodiment.

(110) FIG. 110 is a figure showing the configuration of the 13th modification 150M of the speech decoding device according to the seventh embodiment.

(111) FIG. 111 is a flow chart showing the operation of the 13th modification 150M of the speech decoding device according to the seventh embodiment.

(112) FIG. 112 is a figure showing the configuration of the 14th modification 150N of the speech decoding device according to the seventh embodiment.

(113) FIG. 113 is a flow chart showing the operation of the 14th modification 150N of the speech decoding device according to the seventh embodiment.

(114) FIG. 114 is a figure showing the configuration of the speech decoding device 160 according to an eighth embodiment.

(115) FIG. 115 is a flow chart showing the operation of the speech decoding device according to the eighth embodiment.

(116) FIG. 116 is a figure showing the configuration of the speech encoding device 260 according to the eighth embodiment.

(117) FIG. 117 is a flow chart showing the operation of the speech encoding device 260 according to the eighth embodiment.

(118) FIG. 118 is a figure showing the configuration of the first modification 160A of the speech decoding device according to the eighth embodiment.

(119) FIG. 119 is a flow chart showing the operation of the first modification 160A of the speech decoding device according to the eighth embodiment.

(120) FIG. 120 is a figure showing the configuration of the second modification 160B of the speech decoding device according to the eighth embodiment.

(121) FIG. 121 is a figure showing the configuration of the 3rd modification 160C of the speech decoding device according to the eighth embodiment.

(122) FIG. 122 is a flow chart showing the operation of the 3rd modification 160C of the speech decoding device according to the eighth embodiment.

(123) FIG. 123 is a figure showing the configuration of the 4th modification 160D of the speech decoding device according to the eighth embodiment.

(124) FIG. 124 is a flow chart showing the operation of the 4th modification 160D of the speech decoding device according to the eighth embodiment.

(125) FIG. 125 is a figure showing the configuration of the fifth modification 160E of the speech decoding device according to the eighth embodiment.

(126) FIG. 126 is a flow chart showing the operation of the fifth modification 160E of the speech decoding device according to the eighth embodiment.

(127) FIG. 127 is a figure showing the configuration of the sixth modification 160F of the speech decoding device according to the eighth embodiment.

(128) FIG. 128 is a flow chart showing the operation of the sixth modification 160F of the speech decoding device according to the eighth embodiment.

(129) FIG. 129 is a figure showing the configuration of the seventh modification 160G of the speech decoding device according to the eighth embodiment.

(130) FIG. 130 is a flow chart showing the operation of the seventh modification 160G of the speech decoding device according to the eighth embodiment.

(131) FIG. 131 is a figure showing the configuration of the eighth modification 160H of the speech decoding device according to the eighth embodiment.

(132) FIG. 132 is a flow chart showing the operation of the eighth modification 160H of the speech decoding device according to the eighth embodiment.

(133) FIG. 133 is a figure showing the configuration of the ninth modification 160I of the speech decoding device according to the eighth embodiment.

(134) FIG. 134 is a flow chart showing the operation of the ninth modification 160I of the speech decoding device according to the eighth embodiment.

(135) FIG. 135 is a figure showing the configuration of the tenth modification 160J of the speech decoding device according to the eighth embodiment.

(136) FIG. 136 is a flow chart showing the operation of the tenth modification 160J of the speech decoding device according to the eighth embodiment.

(137) FIG. 137 is a figure showing the configuration of the 11th modification 160K of the speech decoding device according to the eighth embodiment.

(138) FIG. 138 is a flow chart showing the operation of the 11th modification 160K of the speech decoding device according to the eighth embodiment.

(139) FIG. 139 is a figure showing the configuration of the 12th modification 160L of the speech decoding device according to the eighth embodiment.

(140) FIG. 140 is a flow chart showing the operation of the 12th modification 160L of the speech decoding device according to the eighth embodiment.

(141) FIG. 141 is a figure showing the configuration of the 13th modification 160M of the speech decoding device according to the eighth embodiment.

(142) FIG. 142 is a flow chart showing the operation of the 13th modification 160M of the speech decoding device according to the eighth embodiment.

(143) FIG. 143 is a figure showing the configuration of the 14th modification 160N of the speech decoding device according to the eighth embodiment.

(144) FIG. 144 is a flow chart showing the operation of the 14th modification 160N of the speech decoding device according to the eighth embodiment.

(145) FIG. 145 is a figure showing the configuration of the speech decoding device 380 according to a ninth embodiment.

(146) FIG. 146 is a flow chart showing the operation of the speech decoding device 380 according to the ninth embodiment.

(147) FIG. 147 is a figure showing the configuration of the first modification 380A of the speech decoding device according to the ninth embodiment.

(148) FIG. 148 is a flow chart showing the operation of the first modification 380A of the speech decoding device according to the ninth embodiment.

(149) FIG. 149 is a figure showing the configuration of the speech decoding device 390 according to a tenth embodiment.

(150) FIG. 150 is a flow chart showing the operation of the speech decoding device 390 according to the tenth embodiment.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

(151) Various embodiments will be described with reference to the accompanying drawings. The same parts are denoted with the same reference signs, if possible, and an overlapping description will be omitted.

First Embodiment

(152) FIG. 1 is a diagram showing the configuration of a speech decoding device 1 according to a first embodiment. A communication device of the speech decoding device 1 receives a multiplexed code sequence output from a speech encoding device 2 described below and outputs a decoded speech signal to the outside. As shown in FIG. 1, the speech decoding device 1 functionally includes a code sequence analyzer 1a, a speech decoder 1b, a temporal envelope shape determiner 1c, and a temporal envelope modifier 1d.

(153) FIG. 2 is a flowchart showing the operation of the speech decoding device 1 according to the first embodiment.

(154) The code sequence analyzer 1a analyzes a code sequence and divides the code sequence into a speech encoded part and information about the temporal envelope shape (step S1-1).

(155) The speech decoder 1b decodes the speech encoded part of the code sequence to obtain a decoded signal (step S1-2).

(156) The temporal envelope shape determiner 1c determines the temporal envelope shape of the decoded signal, based on at least one of the information about the temporal envelope shape divided by the code sequence analyzer 1a and the decoded signal obtained by the speech decoder 1b (step S1-3).

(157) For example, the temporal envelope shape determiner 1c determines whether the temporal envelope shape of the decoded signal is flat. In one example, parameters representing the power of the decoded signal, or parameters similar thereto, are calculated. Thereafter, the dispersion of the parameters, or a parameter similar thereto, is calculated. The calculated parameter is compared with a predetermined threshold to determine whether the temporal envelope shape is flat or to determine the degree of flatness. In another example, the ratio of an arithmetic mean to a geometric mean of the parameters representing the power of the decoded signal (or of parameters similar thereto), or a parameter similar to that ratio, is calculated and compared with a predetermined threshold to determine whether the temporal envelope shape is flat or to determine the degree of flatness. The method of determining that the temporal envelope shape of the decoded signal is flat is not limited to the above examples.
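The two flatness measures described above can be sketched as follows. This is a minimal illustration, not the method fixed by the embodiment: the function names, the use of per-frame mean power as the "parameter representing the power", the small constant eps, and the threshold value 1.5 are all assumptions made for the example.

```python
import math

def frame_powers(samples, frame_len):
    """Mean power of each consecutive, non-overlapping frame (assumed power parameter)."""
    return [
        sum(x * x for x in samples[s:s + frame_len]) / frame_len
        for s in range(0, len(samples) - frame_len + 1, frame_len)
    ]

def variance(powers):
    """Dispersion of the power parameters (first example in the text)."""
    mean = sum(powers) / len(powers)
    return sum((p - mean) ** 2 for p in powers) / len(powers)

def am_gm_ratio(powers, eps=1e-12):
    """Ratio of arithmetic mean to geometric mean (second example in the text).

    The ratio is 1 for a perfectly flat envelope and grows as the envelope
    becomes less flat. eps is a hypothetical guard against log(0).
    """
    am = sum(powers) / len(powers)
    gm = math.exp(sum(math.log(p + eps) for p in powers) / len(powers))
    return am / gm

def is_flat(powers, ratio_threshold=1.5):
    """Compare the measure with a predetermined (here: arbitrary) threshold."""
    return am_gm_ratio(powers) < ratio_threshold
```

Under these assumptions, a constant power sequence yields a ratio of about 1 and is classified as flat, whereas a sequence containing a single strong burst yields a much larger ratio. The ratio itself can also serve as a degree of flatness instead of a binary decision.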

(158) For example, the temporal envelope shape determiner 1c determines whether the temporal envelope shape of the decoded signal is onset. In one example, parameters representing the power of the decoded signal, or parameters similar thereto, are determined, differential values of the parameters in the time direction are calculated, and the maximum value of the differential values in an arbitrary time segment is calculated. The maximum value is compared with a predetermined threshold to determine whether the temporal envelope shape is onset or to determine the degree of onset. The method of determining that the temporal envelope shape of the decoded signal is onset is not limited to the above examples.

(159) For example, the temporal envelope shape determiner 1c determines whether the temporal envelope shape of the decoded signal is offset. In one example, parameters representing the power of the decoded signal, or parameters similar thereto, are determined, differential values of the parameters in the time direction are calculated, and the minimum value of the differential values in an arbitrary time segment is calculated. The minimum value is compared with a predetermined threshold to determine whether the temporal envelope shape is offset or to determine the degree of offset. The method of determining that the temporal envelope shape of the decoded signal is offset is not limited to the above examples.
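The onset and offset determinations described above both rely on time differences of a power parameter, and can be sketched together. The function names, the first-order difference, the string labels, and the rule that onset is tested before offset are assumptions for illustration only; the text does not prescribe them.

```python
def power_differences(powers):
    """First-order differences of the power parameters in the time direction."""
    return [later - earlier for earlier, later in zip(powers, powers[1:])]

def classify_transient(powers, onset_threshold, offset_threshold):
    """Classify one time segment as "onset", "offset", or "neither".

    onset_threshold is a positive value compared with the maximum difference;
    offset_threshold is a negative value compared with the minimum difference.
    Testing onset first is an arbitrary tie-break chosen for this sketch.
    """
    diffs = power_differences(powers)
    if not diffs:
        return "neither"
    if max(diffs) > onset_threshold:
        return "onset"
    if min(diffs) < offset_threshold:
        return "offset"
    return "neither"
```

The values max(diffs) and min(diffs) could equally be returned as a degree of onset or offset rather than being reduced to a label.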

(160) The above examples can also be applied to a case where the decoded signal is output as a time domain signal from the speech decoder 1b, and can also be applied to a case where the decoded signal is output as a plurality of subband signals.

(161) The temporal envelope modifier 1d modifies the shape of the temporal envelope of the decoded signal output from the speech decoder 1b, based on the temporal envelope shape determined by the temporal envelope shape determiner 1c (step S1-4).

(162) For example, if the decoded signal is expressed by a plurality of subband signals, the temporal envelope modifier 1d uses a predetermined function F(X.sub.dec(k,i)) for a plurality of subband signals X.sub.dec(k,i) (0≤k<k.sub.h, t(l)≤i<t(l+1)) of the decoded signal within an arbitrary time segment to calculate X′.sub.dec(k,i) using the following equation (1):
X′.sub.dec(k,i)=F(X.sub.dec(k,i)) [Eq. 1]

(163) X′.sub.dec(k,i) is calculated as the subband signals of the decoded signal whose temporal envelope shape is modified. The temporal envelope modifier 1d synthesizes a time domain signal from the subband signals and outputs the synthesized signal.

(164) For example, when it is determined that the temporal envelope shape of the decoded signal is flat, the temporal envelope shape of the decoded signal can be modified by the following process. For example, the subband signals X.sub.dec(k,i) are divided into M.sub.dec frequency bands having boundaries represented by B.sub.dec(m) (m=0, . . . , M.sub.dec, M.sub.dec≥1) (B.sub.dec(0)≥0, B.sub.dec(M.sub.dec)<k.sub.h) and, using a predetermined function F(X.sub.dec(k,i)) expressed by the equations below for the subband signals X.sub.dec(k,i) (B.sub.dec(m)≤k<B.sub.dec(m+1), t(l)≤i<t(l+1)) included in the m-th frequency band,

(165) $\begin{matrix} F(X_{dec}(k,i)) = \sqrt{\frac{\sum_{n=t_{E}(l)}^{t_{E}(l+1)-1} \sum_{j=B_{dec}(m)}^{B_{dec}(m+1)-1} |X_{dec}(j,n)|^{2}}{(t_{E}(l+1)-t_{E}(l)) \cdot (B_{dec}(m+1)-B_{dec}(m))}} \, \frac{X_{dec}(k,i)}{\sqrt{|X_{dec}(k,i)|^{2}}} & [Eq. 2] \\ \text{or} \\ F(X_{dec}(k,i)) = \sqrt{\frac{\sum_{n=t_{E}(l)}^{t_{E}(l+1)-1} \sum_{j=B_{dec}(m)}^{B_{dec}(m+1)-1} |X_{dec}(j,n)|^{2}}{t_{E}(l+1)-t_{E}(l)}} \, \frac{X_{dec}(k,i)}{\sqrt{\sum_{j=B_{dec}(m)}^{B_{dec}(m+1)-1} |X_{dec}(j,n)|^{2}}} \end{matrix}$

(166) X′.sub.dec(k,i) is calculated as the subband signals of the decoded signal whose temporal envelope shape is modified. In another example, a predetermined function F(X.sub.dec(k,i)) defined by the following equation is used to perform a smoothing filter process on the subband signals X.sub.dec(k,i).

(167) $\begin{matrix} F(X_{dec}(k,i)) = \sum_{p=0}^{N_{filt}-1} a(p) \, X_{dec}(k,i-p) & [Eq. 3] \end{matrix}$

(168) Here, N.sub.filt≥1, and X′.sub.dec(k,i) is calculated as the subband signals of the decoded signal whose temporal envelope shape is modified. The process can be performed such that the powers of the subband signals before and after the filter process are matched in each frequency band having the boundaries represented by B.sub.dec(m).
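The smoothing filter process of Eq. 3, combined with the per-band power matching mentioned above, can be sketched as follows. The layout of a band as a list of subbands, each a list of complex samples over time, is an assumption, as are the function names and the choice to treat samples before the start of the segment as zero.

```python
import math

def smooth_subband(x, a):
    """Eq. 3 along time for one subband: y(i) = sum_p a(p) * x(i - p).

    Samples with i - p < 0 are treated as zero (an assumption of this sketch).
    """
    return [
        sum(a[p] * x[i - p] for p in range(len(a)) if i - p >= 0)
        for i in range(len(x))
    ]

def band_power(band):
    """Total power of all samples in one frequency band."""
    return sum(abs(v) ** 2 for row in band for v in row)

def smooth_band(band, a):
    """Smooth every subband of a band, then rescale so the band power is unchanged."""
    smoothed = [smooth_subband(row, a) for row in band]
    p_in, p_out = band_power(band), band_power(smoothed)
    gain = math.sqrt(p_in / p_out) if p_out > 0 else 0.0
    return [[gain * v for v in row] for row in smoothed]
```

With the two-tap averaging coefficients a = [0.5, 0.5], an isolated peak is spread over two time slots, so its maximum magnitude drops while the total power of the band is preserved by the gain.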

(169) In another example, the subband signals X.sub.dec(k,i) are linearly predicted in the frequency direction in each frequency band having the boundaries represented by B.sub.dec(m) to obtain linear prediction coefficients α.sub.p(m) (m=0, . . . , M.sub.dec-1), and a predetermined function F(X.sub.dec(k,i)) defined by the following equation is used to perform a linear prediction inverse filter process on the subband signals X.sub.dec(k,i).

(170) $\begin{matrix} F(X_{dec}(k,i)) = X_{dec}(k,i) + \sum_{p=1}^{N_{pred}} \alpha_{p}(m) X_{dec}(k-p,i) & [Eq. 4] \end{matrix}$

(171) Here N.sub.pred≥1, and X′.sub.dec(k,i) are calculated as subband signals of the decoded signal whose temporal envelope shape is modified.
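Applying the inverse filter of Eq. 4 within one frequency band and one time slot can be sketched as below. The coefficients are taken as given; the function name `lp_inverse_filter` is hypothetical, and treating subbands below the band as zero is an assumption of this sketch, since the source does not state the boundary handling.

```python
def lp_inverse_filter(band, alpha):
    """Eq. 4 for one time slot i inside one band m:
    F(X(k)) = X(k) + sum_{p=1..N_pred} alpha[p-1] * X(k - p).
    `band` holds X(k, i) for consecutive k of the band; values below the
    band are treated as zero (an assumption of this sketch)."""
    out = []
    for k in range(len(band)):
        acc = band[k]
        for p in range(1, len(alpha) + 1):
            if k - p >= 0:
                acc += alpha[p - 1] * band[k - p]
        out.append(acc)
    return out

# Values that halve from one subband to the next are fully predicted by
# alpha = [-0.5], so only the first value survives in the residual.
band = [8.0, 4.0, 2.0, 1.0]
residual = lp_inverse_filter(band, [-0.5])
```

The same loop works for complex subband samples, because only addition and multiplication are used.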

(172) The process of modifying the temporal envelope into a flat shape can be carried out in any combination of the above examples.

(173) The processes performed by the temporal envelope modifier 1d to modify the temporal envelope of the decoded signal into a flat shape are not limited to the above examples.
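The first form of Eq. 2 can be read as: keep the phase of every subband sample and replace its magnitude by the root mean square value of the band over the time segment. A minimal sketch of that reading follows; the name `flatten_band` and the handling of zero-valued samples are assumptions.

```python
import math

def flatten_band(X):
    """X[k][i]: complex subband samples of one band m over one segment.
    Each sample keeps its phase and receives the RMS magnitude of the
    whole band and segment (first form of Eq. 2)."""
    n_sub = len(X)
    n_slots = len(X[0])
    total = sum(abs(v) ** 2 for row in X for v in row)
    rms = math.sqrt(total / (n_sub * n_slots))
    out = []
    for row in X:
        new_row = []
        for v in row:
            mag = abs(v)
            # A zero sample has no phase; it is left at zero (assumption).
            new_row.append(rms * v / mag if mag > 0 else 0j)
        out.append(new_row)
    return out

X = [[3 + 4j, 0.5j, 1 + 0j], [-2 + 0j, 1j, 0.1 + 0j]]
Y = flatten_band(X)
```

When no sample is zero, the total power of the band is unchanged, because every output sample carries the mean power.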

(174) For example, when it is determined that the temporal envelope shape of the decoded signal is onset, the temporal envelope shape of the decoded signal can be modified by the following process.

(175) For example, a predetermined function F(X.sub.dec(k,i)) set forth below is defined using a function incr(i) that increases monotonically with respect to i.

(176) $\begin{matrix} F(X_{dec}(k,i)) = \mathrm{incr}(i) \frac{X_{dec}(k,i)}{\sqrt{|X_{dec}(k,i)|^{2}}} & [Eq. 5] \end{matrix}$

(177) X′.sub.dec(k,i) are calculated as the subband signals of the decoded signal whose temporal envelope shape is modified. A process can be performed such that the powers of the subband signals before and after modification of the temporal envelope shape are matched in each frequency band having the boundaries represented by the B.sub.dec(m).
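A sketch of Eq. 5 with the power matching described above is given here for one subband. The ramp `incr(i) = i + 1`, the function name `shape_onset`, and matching the power over a single subband instead of a whole band are all assumptions made for illustration.

```python
import math

def shape_onset(x, incr):
    """Eq. 5 for one subband: the magnitude of each sample is replaced by
    incr(i) while its phase is kept; one gain then restores the original
    power of the segment."""
    shaped = []
    for i, v in enumerate(x):
        mag = abs(v)
        shaped.append(incr(i) * v / mag if mag > 0 else 0)
    p_in = sum(abs(v) ** 2 for v in x)
    p_out = sum(abs(v) ** 2 for v in shaped)
    if p_out == 0:
        return shaped
    g = math.sqrt(p_in / p_out)
    return [g * v for v in shaped]

# Hypothetical ramp incr(i) = i + 1 applied to four complex samples.
x = [1 + 1j, -2j, 0.5 + 0j, 1j]
y = shape_onset(x, lambda i: i + 1)
```

Passing a decreasing function instead of the ramp gives the corresponding falling shape.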

(178) The temporal envelope modifier 1d carries out a process of modifying the temporal envelope shape of a plurality of subband signals of the decoded signal when it is onset, and the process is not limited to the above examples.

(179) For example, when it is determined that the temporal envelope shape of the decoded signal is offset, the temporal envelope shape of the decoded signal can be modified by the following process.

(180) For example, a predetermined function F(X.sub.dec(k,i)) set forth below is defined using a function decr(i) that decreases monotonically with respect to i.

(181) $\begin{matrix} F(X_{dec}(k,i)) = \mathrm{decr}(i) \frac{X_{dec}(k,i)}{\sqrt{|X_{dec}(k,i)|^{2}}} & [Eq. 6] \end{matrix}$

(182) X′.sub.dec(k,i) are calculated as subband signals of the decoded signal whose temporal envelope shape is modified. A process can be performed such that the powers of the subband signals before and after modification of the temporal envelope shape are matched in each frequency band having the boundaries represented by B.sub.dec(m).

(183) The temporal envelope modifier 1d performs a process of modifying the temporal envelope shape of a plurality of subband signals of the decoded signal when it is offset, and the process is not limited to the above examples.

(184) For example, if the decoded signal can be represented as a time domain signal, as shown below, the temporal envelope modifier 1d applies a predetermined function F.sub.t(x.sub.dec(i)) for the decoded signal x.sub.dec(i) (t(l)≤i<t(l+1)) in an arbitrary time segment to obtain x′.sub.dec(i).
x.sub.dec′(i)=F.sub.t(x.sub.dec(i)) [Eq. 7]

(185) x′.sub.dec(i) is output as a decoded signal whose temporal envelope shape is modified.

(186) For example, when it is determined that the temporal envelope shape of the decoded signal is flat, the temporal envelope shape of the decoded signal can be modified by the following process. For example, a predetermined function F.sub.t(x.sub.dec(i)) set forth below for the decoded signal x.sub.dec(i) is used.

(187) $\begin{matrix} F_{t}(x_{dec}(i)) = \sqrt{\frac{\sum_{n=t_{E}(l)}^{t_{E}(l+1)-1} |x_{dec}(n)|^{2}}{t_{E}(l+1)-t_{E}(l)}} \frac{x_{dec}(i)}{\sqrt{|x_{dec}(i)|^{2}}} & [Eq. 8] \end{matrix}$

(188) x′.sub.dec(i) is then output as a decoded signal whose temporal envelope shape is modified.
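For a real-valued time domain signal, the factor x.sub.dec(i)/√(|x.sub.dec(i)|²) in Eq. 8 reduces to the sign of the sample, so the equation assigns the segment's root mean square value to every sample while keeping its sign. The sketch below follows that literal reading; the name `flatten_time_segment` and leaving zero samples at zero are assumptions.

```python
import math

def flatten_time_segment(x):
    """Eq. 8 for real samples: each sample keeps its sign and takes the
    RMS value of the segment as its magnitude."""
    rms = math.sqrt(sum(v * v for v in x) / len(x))
    # Zero samples have no sign and are left at zero (assumption).
    return [math.copysign(rms, v) if v != 0 else 0.0 for v in x]

segment = [0.1, -0.2, 3.0, -4.0]
flat = flatten_time_segment(segment)
```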

(189) In another example, a predetermined function F.sub.t(x.sub.dec(i)) set forth below is used to perform a smoothing filter process on the decoded signal x.sub.dec(i).

(190) $\begin{matrix} F_{t}(x_{dec}(i)) = \sum_{p=0}^{N_{filt}-1} a(p) x_{dec}(i-p) & [Eq. 9] \end{matrix}$

(191) Here N.sub.filt≥1, and x′.sub.dec(i) is output as a decoded signal whose temporal envelope shape is modified.

(192) The process of modifying the temporal envelope into a flat shape can be carried out in any combination of the above examples.

(193) For example, when it is determined that the temporal envelope shape of the decoded signal is onset, the temporal envelope shape of the decoded signal can be modified by the following process.

(194) For example, a predetermined function F.sub.t(x.sub.dec(i)) set forth below uses a function incr(i) that increases monotonically with respect to i.

(195) $\begin{matrix} F_{t}(x_{dec}(i)) = \mathrm{incr}(i) \frac{x_{dec}(i)}{\sqrt{|x_{dec}(i)|^{2}}} & [Eq. 10] \end{matrix}$

(196) x′.sub.dec(i) is output as a decoded signal whose temporal envelope shape is modified.

(197) The temporal envelope modifier 1d carries out a process of modifying the temporal envelope of the decoded signal when it is onset, and the process is not limited to the above examples.

(198) For example, when it is determined that the temporal envelope shape of the decoded signal is offset, the temporal envelope shape of the decoded signal can be modified by the following process.

(199) For example, a predetermined function F.sub.t(x.sub.dec(i)) set forth below uses a function decr(i) that decreases monotonically with respect to i.

(200) $\begin{matrix} F_{t}(x_{dec}(i)) = \mathrm{decr}(i) \frac{x_{dec}(i)}{\sqrt{|x_{dec}(i)|^{2}}} & [Eq. 11] \end{matrix}$

(201) x′.sub.dec(i) is output as a decoded signal whose temporal envelope shape is modified. The temporal envelope modifier 1d carries out a process of modifying the temporal envelope of the decoded signal when it is offset, and the process is not limited to the above examples.

(202) For example, if the decoded signal is expressed by frequency domain transform coefficients X.sub.dec(k) (0≤k<k.sub.h) obtained by a time-frequency transform, such as the discrete Fourier transform, the discrete cosine transform, or the modified discrete cosine transform, a predetermined function F.sub.f(X.sub.dec(k)) is used as in the following equation (12).
X.sub.dec′(k)=F.sub.f(X.sub.dec(k)) [Eq. 12]

(203) X′.sub.dec(k) are calculated as frequency domain transform coefficients of the decoded signal whose temporal envelope shape is modified, and then transformed into a time domain signal by a predetermined frequency transform to be output.

(204) For example, when it is determined that the temporal envelope shape of the decoded signal is flat, the temporal envelope shape of the decoded signal can be modified by the following process.

(205) In each of M.sub.dec arbitrary frequency bands having boundaries represented by B.sub.dec(m) (m=0, . . . , M.sub.dec, M.sub.dec≥1) (B.sub.dec(0)≥0, B.sub.dec(M.sub.dec)<k.sub.h), linear prediction coefficients α.sub.p(m) (m=0, . . . , M.sub.dec−1) are obtained by linear prediction in the frequency direction, and a predetermined function F.sub.f(X.sub.dec(k)) set forth below is used to perform a linear prediction inverse filter process on the transform coefficients X.sub.dec(k).

(206) $\begin{matrix} F_{f}(X_{dec}(k)) = X_{dec}(k) + \sum_{p=1}^{N_{pred}} \alpha_{p}(m) X_{dec}(k-p) & [Eq. 13] \end{matrix}$

(207) Here N.sub.pred≥1, and X′.sub.dec(k) are calculated as transform coefficients of the decoded signal whose temporal envelope shape is modified.

(208) The temporal envelope modifier 1d performs a process of modifying the temporal envelope of the decoded signal into a flat shape, and the process is not limited to the above examples.
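The source does not state how the linear prediction coefficients are obtained. One common choice, shown here purely as an assumption, is the autocorrelation method solved with the Levinson-Durbin recursion. The sketch assumes real-valued transform coefficients (as with a DCT or MDCT), and the name `lpc_autocorr` is hypothetical. The returned values follow the sign convention of Eq. 13, in which the coefficients are added to the current value.

```python
def lpc_autocorr(x, order):
    """Return alpha[0..order-1] such that the residual
    x(k) + sum_p alpha[p-1] * x(k - p) has minimum energy
    (autocorrelation method, real-valued x)."""
    n = len(x)
    r = [sum(x[k] * x[k - lag] for k in range(lag, n))
         for lag in range(order + 1)]
    a = [1.0] + [0.0] * order
    err = r[0]
    if err == 0:
        return a[1:]  # silent band: no prediction
    for m in range(1, order + 1):
        acc = r[m] + sum(a[j] * r[m - j] for j in range(1, m))
        refl = -acc / err            # reflection coefficient of stage m
        prev = a[:]
        for j in range(1, m):
            a[j] = prev[j] + refl * prev[m - j]
        a[m] = refl
        err *= 1.0 - refl * refl     # remaining prediction error energy
        if err <= 0:
            break
    return a[1:]

# Coefficients that decay by a factor of 0.9 along k give alpha_1 near -0.9.
coeffs = [0.9 ** k for k in range(200)]
alpha = lpc_autocorr(coeffs, 1)
```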

(209) FIG. 3 is a diagram showing the configuration of a speech encoding device 2 according to the first embodiment. A communication device of the speech encoding device 2 receives a speech signal to be encoded from the outside and outputs the encoded code sequence to the outside. As shown in FIG. 3, the speech encoding device 2 functionally includes a speech coder 2a, a temporal envelope information encoder 2b, and a code sequence multiplexer 2c.

(210) FIG. 4 is a flowchart showing the operation of the speech encoding device 2 according to the first embodiment.

(211) The speech coder 2a encodes an input speech signal (step S2-1).

(212) The temporal envelope information encoder 2b calculates and encodes temporal envelope information, based on at least one of the input speech signal and information obtained in the encoding process including the encoding result of the input speech signal in the speech coder 2a (step S2-2).

(213) For example, the temporal envelope E.sub.t(i) of the input speech signal x(i), which is a time domain signal, in an arbitrary time segment t(l)≤i<t(l+1) can be calculated as the power of the input speech signal normalized in the time segment.

(214) $\begin{matrix} E_{t}(i) = \frac{|x(i)|^{2}}{\sum_{n=t(l)}^{t(l+1)-1} |x(n)|^{2}} & [Eq. 14] \end{matrix}$

(215) For example, if the input speech signal is calculated as a plurality of subband signals X(k,i) in the speech coder 2a, the subband signals are divided into M frequency bands having boundaries represented by B(m) (m=0, . . . , M, M≥1) (B(0)≥0, B(M)<k.sub.h). As the temporal envelope of the input speech signal, the temporal envelope E(k,i) of the subband signals X(k,i) (B(m)≤k<B(m+1), t(l)≤i<t(l+1)) included in the m-th frequency band in an arbitrary time segment t(l)≤i<t(l+1) can be calculated as the power of the subband signals of the input speech signal normalized in the time segment.

(216) $\begin{matrix} E(k,i) = \frac{\sum_{j=B(m)}^{B(m+1)-1} |X(j,i)|^{2}}{\sum_{n=t(l)}^{t(l+1)-1} \sum_{j=B(m)}^{B(m+1)-1} |X(j,n)|^{2}} & [Eq. 15] \end{matrix}$
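The normalized band envelope can be sketched as follows, reading the numerator of Eq. 15 as the band power at the current time slot i and the denominator as the band power over the whole segment. The function name `band_envelope` and the all-zero fallback are assumptions.

```python
def band_envelope(X):
    """X[j][i]: subband samples of band m (rows j) over one time segment
    (columns i). Returns E[i], the band power at slot i divided by the
    band power of the whole segment."""
    n_slots = len(X[0])
    per_slot = [sum(abs(row[i]) ** 2 for row in X) for i in range(n_slots)]
    total = sum(per_slot)
    if total == 0:
        return [0.0] * n_slots  # silent band (assumption: envelope of zeros)
    return [p / total for p in per_slot]

# Two subbands, three time slots.
X = [[1 + 0j, 0j, 2j], [1j, 0j, 0j]]
E = band_envelope(X)
```

Because of the normalization, the envelope values of a non-silent band sum to one over the segment.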

(217) The temporal envelope of the input speech signal is not limited to the above examples as long as it is a parameter indicating variations of the magnitude of the input speech signal in the time direction.

(218) For example, the decoded signal x.sub.dec(i) is calculated based on the encoding result of the input speech signal in the speech coder 2a, and the temporal envelope E.sub.dec,t(i) of the decoded signal x.sub.dec(i) in an arbitrary time segment t(l)≤i<t(l+1) can be calculated as the power of the decoded signal normalized in the time segment.

(219) $\begin{matrix} E_{dec,t}(i) = \frac{|x_{dec}(i)|^{2}}{\sum_{n=t(l)}^{t(l+1)-1} |x_{dec}(n)|^{2}} & [Eq. 16] \end{matrix}$

(220) For example, if the subband signals X.sub.dec(k,i) of the decoded signal are calculated during the process of encoding the input speech signal in the speech coder 2a or based on the encoding result, the subband signals are divided into M frequency bands having boundaries represented by B(m) (m=0, . . . , M, M≥1) (B(0)≥0, B(M)<k.sub.h). As the temporal envelope of the decoded signal, the temporal envelope E.sub.dec(k,i) of the subband signals X.sub.dec(k,i) (B(m)≤k<B(m+1), t(l)≤i<t(l+1)) included in the m-th frequency band in an arbitrary time segment t(l)≤i<t(l+1) can be calculated as the power of the subband signals of the decoded signal normalized in the time segment.

(221) $\begin{matrix} E_{dec}(k,i) = \frac{\sum_{j=B(m)}^{B(m+1)-1} |X_{dec}(j,i)|^{2}}{\sum_{n=t(l)}^{t(l+1)-1} \sum_{j=B(m)}^{B(m+1)-1} |X_{dec}(j,n)|^{2}} & [Eq. 17] \end{matrix}$

(222) For example, the temporal envelope information encoder 2b calculates information representing the degree of flatness as the temporal envelope information. For example, at least one of a parameter representing the dispersion of the temporal envelope of the input speech signal and the decoded signal, and a parameter similar thereto, is calculated. In another example, at least one of the ratio of the arithmetic mean to the geometric mean of the temporal envelope of the input speech signal and the decoded signal, and a parameter similar thereto, is calculated. In this case, the temporal envelope information encoder 2b may calculate any information representing the flatness of the temporal envelope of the input speech signal as the temporal envelope information, and the calculation is not limited to the above examples. The parameter is then encoded. For example, the differential value between the parameter of the input speech signal and that of the decoded signal, or the absolute value of the differential value, is encoded. For example, at least one of the value of the parameter of the input speech signal and the absolute value thereof is encoded. For example, if the flatness of the temporal envelope is expressed by information indicating whether the envelope is flat or not, the information can be encoded by one bit. For example, for the time domain input speech signal, the information can be encoded by one bit in the arbitrary time segment. For example, when the information is encoded for each of the M frequency bands of the subband signals of the input speech signal, it can be encoded by M bits. The method of encoding the temporal envelope information is not limited to the above examples.
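The ratio of the arithmetic mean to the geometric mean, and its reduction to a one-bit flag, can be sketched as follows. The threshold value 1.5, the small constant `eps` that guards the logarithm, and the function names are assumptions of this sketch and do not come from the source.

```python
import math

def flatness_ratio(envelope, eps=1e-12):
    """Arithmetic mean divided by geometric mean of a temporal envelope.
    The ratio is close to 1 for a flat envelope and grows as the energy
    concentrates in a few time slots."""
    n = len(envelope)
    arith = sum(envelope) / n
    geo = math.exp(sum(math.log(e + eps) for e in envelope) / n)
    return arith / geo

def flat_flag(envelope, threshold=1.5):
    """One-bit decision: 1 if the envelope is judged flat, else 0.
    The threshold is a hypothetical tuning value."""
    return 1 if flatness_ratio(envelope) < threshold else 0

flag_flat = flat_flag([0.25, 0.25, 0.25, 0.25])
flag_peaky = flat_flag([0.97, 0.01, 0.01, 0.01])
```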

(223) For example, the temporal envelope information encoder 2b calculates information representing the degree of onset as the temporal envelope information. For example, in an arbitrary time segment t(l)≤i<t(l+1), the maximum value of the differential value of the temporal envelope of the input speech signal in time direction is calculated.
d.sub.Et,max(k)=max(E.sub.t(k,i)−E.sub.t(k,i−1))
d.sub.Edec,t,max(k)=max(E.sub.dec,t(k,i)−E.sub.dec,t(k,i−1))
or
d.sub.E max(k)=max(E(k,i)−E(k,i−1))
d.sub.Edec,max(k)=max(E.sub.dec(k,i)−E.sub.dec(k,i−1)) [Eq. 18]

(224) In these equations, the maximum value of the differential value of a parameter in time direction, the parameter being obtained by smoothing the temporal envelope in time direction, can be calculated in place of the temporal envelope.
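The maximum of the time-direction difference, with the optional smoothing mentioned above, can be sketched as follows. The causal moving average used for smoothing and the names `max_rise` and `smooth_taps` are assumptions; the envelope is assumed to contain at least two values.

```python
def max_rise(envelope, smooth_taps=1):
    """Largest increase E(i) - E(i-1) within one time segment.
    With smooth_taps > 1 the envelope is first smoothed by a causal
    moving average (one possible smoothing, chosen for this sketch)."""
    if smooth_taps > 1:
        smoothed = []
        for i in range(len(envelope)):
            window = envelope[max(0, i - smooth_taps + 1): i + 1]
            smoothed.append(sum(window) / len(window))
        envelope = smoothed
    return max(envelope[i] - envelope[i - 1] for i in range(1, len(envelope)))

env = [0.05, 0.05, 0.8, 0.1]
rise_raw = max_rise(env)
rise_smooth = max_rise(env, smooth_taps=2)
```

Smoothing lowers the measured rise, so any decision threshold would have to be chosen together with the smoothing length.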

(225) In this case, the temporal envelope information encoder 2b may calculate any information representing the degree of onset of the temporal envelope of the input speech signal as the temporal envelope information, and the calculation is not limited to the above examples. The parameter is then encoded. For example, at least one of the differential value between the parameter of the input speech signal and that of the decoded signal, and the absolute value of the differential value, is encoded. For example, if the rise of the temporal envelope is represented by information indicating whether the envelope is onset or not, the information can be encoded by one bit. For example, for the time domain input speech signal, the information can be encoded by one bit in the arbitrary time segment. For example, when the information is encoded for each of the M frequency bands of the subband signals of the input speech signal, it can be encoded by M bits. The method of encoding the temporal envelope information is not limited to the above examples.

(226) For example, the temporal envelope information encoder 2b calculates information representing the degree of offset as the temporal envelope information. For example, in the arbitrary time segment t(l)≤i<t(l+1), the minimum value of the differential value in time direction of the temporal envelope of the input speech signal is calculated.
d.sub.Et,min(k)=min(E.sub.t(k,i)−E.sub.t(k,i−1))
d.sub.Edec,t,min(k)=min(E.sub.dec,t(k,i)−E.sub.dec,t(k,i−1))
or
d.sub.Emin(k)=min(E(k,i)−E(k,i−1))
d.sub.Edec,min(k)=min(E.sub.dec(k,i)−E.sub.dec(k,i−1)) [Eq. 19]

(227) In these equations, the minimum value of the differential value of a parameter in time direction, the parameter being obtained by smoothing the temporal envelope in time direction, can be calculated in place of the temporal envelope. In this case, the temporal envelope information encoder 2b may calculate any information representing the degree of offset of the temporal envelope of the subband signals of the input speech signal as the temporal envelope information, and the calculation is not limited to the above examples. The parameter is then encoded. For example, at least one of the differential value between the parameter of the input speech signal and that of the decoded signal, and the absolute value of the differential value, is encoded. For example, if the fall of the temporal envelope is represented by information indicating whether the envelope is offset or not, the information can be encoded by one bit. For example, for the time domain input speech signal, the information can be encoded by one bit in the arbitrary time segment. For example, when the information is encoded for each of the M frequency bands of the subband signals of the input speech signal, it can be encoded by M bits. The method of encoding the temporal envelope information is not limited to the above examples.
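Encoding one flag per frequency band in M bits can be sketched as simple bit packing. The bit order (band 0 in the most significant bit) and the function names are assumptions; the source does not define a bitstream layout.

```python
def pack_flags(flags):
    """Pack one 0/1 flag per frequency band into an integer of M bits,
    band 0 first (most significant bit)."""
    word = 0
    for f in flags:
        word = (word << 1) | (1 if f else 0)
    return word

def unpack_flags(word, m):
    """Recover the M per-band flags from the packed integer."""
    return [(word >> (m - 1 - b)) & 1 for b in range(m)]

packed = pack_flags([1, 0, 1, 1])      # M = 4 bands
restored = unpack_flags(packed, 4)
```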

(228) In the above examples, in the arbitrary time segment t(l)≤i<t(l+1), an encoding parameter (for example, the gain of a codebook in CELP encoding) having a correlation to the power of a time segment shorter than the time segment can be used in the speech coder 2a, in place of the temporal envelope of the input speech signal.

(229) The code sequence multiplexer 2c receives the code sequence of the input speech signal from the speech coder 2a, receives the temporal envelope shape information encoded by the temporal envelope information encoder 2b and outputs a multiplexed code sequence (step S2-3).

Second Embodiment

(230) FIG. 5 is a diagram showing the configuration of a speech decoding device 100 according to a second embodiment. A communication device of the speech decoding device 100 receives a multiplexed code sequence output from a speech encoding device 200 described below and outputs a decoded speech signal to the outside. As shown in FIG. 5, the speech decoding device 100 functionally includes a code sequence demultiplexer 100a, a low frequency decoder 100b, a low frequency temporal envelope shape determiner 100c, a low frequency temporal envelope modifier 100d, a high frequency decoder 100e, and a low frequency/high frequency signal combiner 100f.

(231) FIG. 6 is a flowchart showing the operation of the speech decoding device according to the second embodiment.

(232) The code sequence demultiplexer 100a divides a code sequence into a low frequency encoded part, which is the encoded low frequency signal, and a high frequency encoded part, which is the encoded high frequency signal (step S100-1).

(233) The low frequency decoder 100b decodes the low frequency encoded part divided by the code sequence demultiplexer 100a to obtain a low frequency signal (step S100-2).

(234) The low frequency temporal envelope shape determiner 100c determines the temporal envelope shape of the low frequency signal, based on at least one of information about the low frequency temporal envelope shape divided by the code sequence demultiplexer 100a and the low frequency signal obtained by the low frequency decoder 100b (step S100-3).

(235) Examples include a case where it is determined that the temporal envelope shape of the low frequency signal is flat, a case where it is determined that the temporal envelope shape of the low frequency signal is onset, and a case where it is determined that the temporal envelope shape of the low frequency signal is offset.

(236) The temporal envelope shape of the low frequency signal is determined, for example, by replacing the decoded signal obtained by the speech decoder 1b with the low frequency signal obtained by the low frequency decoder 100b in the process of determining the temporal envelope shape of the decoded signal by the temporal envelope shape determiner 1c.

(237) The low frequency temporal envelope modifier 100d modifies the shape of the temporal envelope of the low frequency signal output from the low frequency decoder 100b, based on the temporal envelope shape determined by the low frequency temporal envelope shape determiner 100c (step S100-4).

(238) The temporal envelope shape of the low frequency signal can be modified, for example, by replacing the decoded signal obtained by the speech decoder 1b with the low frequency signal obtained by the low frequency decoder 100b in the process of modifying the temporal envelope shape of the decoded signal in the temporal envelope modifier 1d.

(239) The high frequency decoder 100e decodes the high frequency encoded part divided by the code sequence demultiplexer 100a to obtain a high frequency signal (step S100-5).

(240) The decoding of the high frequency signal in the high frequency decoder 100e can be performed by a method of decoding a code sequence in which a high frequency signal is encoded as at least one of a time domain signal, a subband signal, and a frequency domain signal.

(241) For example, in some speech decoding devices, a high frequency signal can be generated by a bandwidth extension technique that generates a high frequency signal using the decoding result obtained by the low frequency decoder. In such speech decoding devices, if information required to generate a high frequency signal by a bandwidth extension technique is included in the code sequence, the part of the code sequence that includes the information is the high frequency encoded part. A high frequency signal is then generated by decoding the high frequency encoded part divided by the code sequence demultiplexer 100a and obtaining the information required for the bandwidth extension technique. By contrast, if information required to generate a high frequency signal by a bandwidth extension technique is not included in the code sequence, the code sequence demultiplexer 100a inputs nothing to the high frequency decoder 100e, and the high frequency decoder 100e generates a high frequency signal through a predetermined process or a process using the decoding result obtained by the low frequency decoder.

(242) The low frequency/high frequency signal combiner 100f combines the low frequency signal having the temporal envelope shape modified by the low frequency temporal envelope modifier 100d and the high frequency signal obtained by the high frequency decoder 100e to output a speech signal including a low frequency component and a high frequency component (step S100-6).

(243) FIG. 7 is a diagram showing the configuration of the speech encoding device 200 according to the second embodiment. A communication device of the speech encoding device 200 receives a speech signal to be encoded from the outside and outputs the encoded code sequence to the outside. As shown in FIG. 7, the speech encoding device 200 functionally includes a low frequency encoder 200a, a high frequency encoder 200b, a low frequency temporal envelope information encoder 200c, and a code sequence multiplexer 200d.

(244) FIG. 8 is a flowchart showing the operation of the speech encoding device 200 according to the second embodiment.

(245) The low frequency encoder 200a encodes a low frequency signal corresponding to the low frequency component of the input speech signal (step S200-1).

(246) The high frequency encoder 200b encodes a high frequency signal corresponding to the high frequency component of the input speech signal (step S200-2).

(247) The low frequency temporal envelope information encoder 200c calculates and encodes low frequency temporal envelope shape information, based on at least one of the input speech signal and information obtained in the encoding process including the encoding result of the input speech signal in the low frequency encoder 200a (step S200-3).

(248) The process of calculating and encoding low frequency temporal envelope shape information can be performed in the same manner, for example, by using the low frequency signal of the input speech signal in place of the input speech signal and using the low frequency decoded signal obtained by decoding the encoding result in the low frequency encoder 200a in place of the decoded signal, in the process of calculating and encoding temporal envelope information on the input speech signal in the temporal envelope information encoder 2b.

(249) The code sequence multiplexer 200d receives the code sequence of the low frequency speech signal from the low frequency encoder 200a, receives the code sequence of the high frequency speech signal from the high frequency encoder 200b, receives the low frequency temporal envelope shape information encoded by the low frequency temporal envelope information encoder 200c and outputs a multiplexed code sequence (step S200-4).

(250) [First Modification of Speech Decoding Device of Second Embodiment]

(251) FIG. 9 is a diagram showing the configuration of a first modification 100A of the speech decoding device according to the second embodiment.

(252) FIG. 10 is a flowchart showing the operation of the first modification 100A of the speech decoding device according to the second embodiment.

(253) A high frequency decoder 100eA decodes the high frequency encoded part divided by the code sequence demultiplexer 100a to obtain a high frequency signal (step S100-5A).

(254) The high frequency decoder 100eA differs from the high frequency decoder 100e in that the low frequency signal having the temporal envelope shape modified by the low frequency temporal envelope modifier 100d is used when the low frequency decoded signal obtained by the low frequency decoder is used in decoding of the high frequency signal.

(255) [Second Modification of Speech Decoding Device of Second Embodiment]

(256) FIG. 11 is a diagram showing the configuration of a second modification of the speech decoding device according to the second embodiment.

(257) The difference from the first modification of the speech decoding device in the second embodiment is that the low frequency signal input to the low frequency/high frequency signal combiner 100f is not output from the low frequency temporal envelope modifier 100d but output from the low frequency decoder 100b.
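The data flow of the second embodiment decoder and its two modifications can be summarized in a short sketch. The stage functions are placeholders supplied by the caller, and the variant labels "base", "mod1" and "mod2" are names chosen for this sketch only. The variants differ solely in which low frequency signal is passed to the high frequency decoder and to the combiner.

```python
def decode_frame(code_sequence, stages, variant="base"):
    """variant: "base", "mod1" or "mod2" (labels of this sketch only)."""
    low_part, high_part, shape_info = stages["demux"](code_sequence)
    low = stages["decode_low"](low_part)
    shape = stages["determine_shape"](shape_info, low)
    low_mod = stages["modify_envelope"](low, shape)
    # Base form: the high band uses the unmodified low frequency signal.
    # Both modifications: it uses the envelope-modified signal.
    low_for_high = low if variant == "base" else low_mod
    high = stages["decode_high"](high_part, low_for_high)
    # Second modification: the combiner receives the unmodified signal.
    low_out = low if variant == "mod2" else low_mod
    return stages["combine"](low_out, high)

# Toy stand-ins for the processing blocks, for illustration only.
stages = {
    "demux": lambda cs: cs,
    "decode_low": lambda part: list(part),
    "determine_shape": lambda info, low: info,
    "modify_envelope": lambda low, shape:
        [sum(low) / len(low)] * len(low) if shape == "flat" else low,
    "decode_high": lambda part, low_ref: [part * v for v in low_ref],
    "combine": lambda low, high: (low, high),
}
frame = ([1.0, 3.0], 0.5, "flat")
base_out = decode_frame(frame, stages, "base")
mod1_out = decode_frame(frame, stages, "mod1")
mod2_out = decode_frame(frame, stages, "mod2")
```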

Third Embodiment

(258) FIG. 12 is a diagram showing the configuration of a speech decoding device 110 according to a third embodiment. A communication device of the speech decoding device 110 receives a multiplexed code sequence output from a speech encoding device 210 described below and outputs a decoded speech signal to the outside. As shown in FIG. 12, the speech decoding device 110 functionally includes a code sequence demultiplexer 110a, a low frequency decoder 100b, a high frequency decoder 100e, a high frequency temporal envelope shape determiner 110b, a high frequency temporal envelope modifier 110c, and a low frequency/high frequency signal combiner 100f.

(259) FIG. 13 is a flowchart showing the operation of the speech decoding device according to the third embodiment.

(260) The code sequence demultiplexer 110a divides a code sequence into a low frequency encoded part, a high frequency encoded part and information about the high frequency temporal envelope shape (step S110-1).

(261) The high frequency temporal envelope shape determiner 110b determines the temporal envelope shape of the high frequency signal, based on at least one of information about the high frequency temporal envelope shape divided by the code sequence demultiplexer 110a, the high frequency signal obtained by the high frequency decoder 100e and the low frequency signal obtained by the low frequency decoder 100b (step S110-2).

(262) Examples include a case where it is determined that the temporal envelope shape of the high frequency signal is flat, a case where it is determined that the temporal envelope shape of the high frequency signal is onset, and a case where it is determined that the temporal envelope shape of the high frequency signal is offset.

(263) The temporal envelope shape of the high frequency signal is determined, for example, by replacing the decoded signal obtained by the speech decoder 1b with the high frequency signal obtained by the high frequency decoder 100e in the process of determining the temporal envelope shape of the decoded signal in the temporal envelope shape determiner 1c. Similarly, the decoded signal obtained by the speech decoder 1b can be replaced with the low frequency signal obtained by the low frequency decoder 100b.

(264) The high frequency temporal envelope modifier 110c modifies the shape of the temporal envelope of the high frequency signal output from the high frequency decoder 100e, based on the temporal envelope shape determined by the high frequency temporal envelope shape determiner 110b (step S110-3). For example, when it is determined that the temporal envelope shape of the high frequency signal is flat, the temporal envelope shape of the high frequency signal can be modified by the following process.

(265) The temporal envelope shape of the high frequency signal can be modified, for example, by replacing the decoded signal obtained by the speech decoder 1b with the high frequency signal obtained by the high frequency decoder 100e in the process of modifying the temporal envelope shape of the decoded signal in the temporal envelope modifier 1d.

(266) FIG. 14 is a diagram showing the configuration of the speech encoding device 210 according to the third embodiment. A communication device of the speech encoding device 210 receives a speech signal to be encoded from the outside and outputs the encoded code sequence to the outside. As shown in FIG. 14, the speech encoding device 210 functionally includes a low frequency encoder 200a, a high frequency encoder 200b, a high frequency temporal envelope information encoder 210a, and a code sequence multiplexer 210b.

(267) FIG. 15 is a flowchart showing the operation of the speech encoding device 210 according to the third embodiment.

(268) The high frequency temporal envelope information encoder 210a calculates and encodes high frequency temporal envelope shape information, based on at least one of the input speech signal, information obtained in the encoding process including the encoding result of the input speech signal in the low frequency encoder 200a, and information obtained in the encoding process including the encoding result of the input speech signal in the high frequency encoder 200b (step S210-1).

(269) The process of calculating and encoding high frequency temporal envelope shape information can be performed in the same manner as the process of calculating and encoding the temporal envelope information on the input speech signal in the temporal envelope information encoder 2b, for example, by using the high frequency signal of the input speech signal in place of the input speech signal and using the high frequency decoded signal obtained by decoding the encoding result in the high frequency encoder 200b in place of the decoded signal.

(270) The code sequence multiplexer 210b receives the code sequence of the low frequency speech signal from the low frequency encoder 200a, receives the code sequence of the high frequency speech signal from the high frequency encoder 200b, receives the encoded high frequency temporal envelope shape information from the high frequency temporal envelope information encoder 210a and outputs a multiplexed code sequence (step S210-2).

Fourth Embodiment

(271) FIG. 16 is a diagram showing the configuration of a speech decoding device 120 according to a fourth embodiment. A communication device of the speech decoding device 120 receives a multiplexed code sequence output from a speech encoding device 220 described below and outputs a decoded speech signal to the outside. As shown in FIG. 16, the speech decoding device 120 functionally includes a code sequence demultiplexer 120a, a low frequency decoder 100b, a low frequency temporal envelope shape determiner 100c, a low frequency temporal envelope modifier 100d, a high frequency decoder 100e, a high frequency temporal envelope shape determiner 120b, a high frequency temporal envelope modifier 110c, and a low frequency/high frequency signal combiner 100f.

(272) FIG. 17 is a flowchart showing the operation of the speech decoding device 120 according to the fourth embodiment.

(273) The code sequence demultiplexer 120a divides a code sequence into a low frequency encoded part, a high frequency encoded part, information about the low frequency temporal envelope shape and information about the high frequency temporal envelope shape (step S120-1).

(274) In doing so, the information about the low frequency temporal envelope shape and the information about the high frequency temporal envelope shape can be divided, for example, from a code sequence in which the two pieces of information are separately encoded, or from a code sequence in which the information about the low frequency temporal envelope shape and the information about the high frequency temporal envelope shape are encoded in combination. For example, they can be divided from a code sequence in which the information about the low frequency temporal envelope shape and the information about the high frequency temporal envelope shape are represented by a single piece of information and encoded.
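The case in which both pieces of shape information are represented by a single piece of information can be sketched as follows. The example assumes that each band takes one of the three shapes named in this description (flat, onset, offset) and that the pair is mapped to a single index from 0 to 8. The mapping and the function names are hypothetical and only illustrate the idea of combined encoding.

```python
SHAPES = ("flat", "onset", "offset")  # shape labels used as examples in the text

def pack_shapes(low_shape, high_shape):
    """Encoder side: map the (low, high) shape pair to one index (assumed mapping)."""
    return SHAPES.index(low_shape) * len(SHAPES) + SHAPES.index(high_shape)

def unpack_shapes(index):
    """Decoder side: recover the (low, high) shape pair from the single index."""
    if not 0 <= index < len(SHAPES) ** 2:
        raise ValueError("index out of range")
    low, high = divmod(index, len(SHAPES))
    return SHAPES[low], SHAPES[high]
```

With this assumed mapping, the demultiplexer reads one index and obtains both shapes, instead of reading two separately encoded fields.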

(275) The high frequency temporal envelope shape determiner 120b determines the temporal envelope shape of the high frequency signal, based on at least one of the information about the high frequency temporal envelope shape divided by the code sequence demultiplexer 120a, the low frequency signal obtained by the low frequency decoder 100b, and the low frequency signal having the temporal envelope shape modified by the low frequency temporal envelope modifier 100d (step S120-2).

(276) Examples include a case where it is determined that the temporal envelope shape of the high frequency signal is flat, a case where it is determined that the temporal envelope shape of the high frequency signal is onset, and a case where it is determined that the temporal envelope shape of the high frequency signal is offset.

(277) If the high frequency temporal envelope shape determiner 120b determines the high frequency temporal envelope shape based on the low frequency signal having the temporal envelope shape modified by the low frequency temporal envelope modifier 100d, the determination can be performed in the same manner as the determination of the temporal envelope shape of the decoded signal in the temporal envelope shape determiner 1c, with the decoded signal obtained by the speech decoder 1b replaced by that modified low frequency signal.

(278) FIG. 18 is a diagram showing the configuration of the speech encoding device 220 according to the fourth embodiment. A communication device of the speech encoding device 220 receives a speech signal to be encoded from the outside and outputs the encoded code sequence to the outside. As shown in FIG. 18, the speech encoding device 220 functionally includes a low frequency encoder 200a, a high frequency encoder 200b, a low frequency temporal envelope information encoder 200c, a high frequency temporal envelope information encoder 220a, and a code sequence multiplexer 220b.

(279) FIG. 19 is a flowchart showing the operation of the speech encoding device 220 according to the fourth embodiment.

(280) The high frequency temporal envelope information encoder 220a calculates and encodes high frequency temporal envelope shape information, based on at least one of the input speech signal, information obtained in the encoding process including the encoding result of the input speech signal in the low frequency encoder 200a, information obtained in the encoding process including the encoding result of the input speech signal in the high frequency encoder 200b, and information obtained in the encoding process including the encoding result of the low frequency temporal envelope information in the low frequency temporal envelope information encoder 200c (step S220-1).

(281) The high frequency temporal envelope shape information can be calculated and encoded, for example, in the same manner as the temporal envelope information on the high frequency signal is calculated and encoded by the high frequency temporal envelope information encoder 210a. The process may also be based on the encoding result of the low frequency temporal envelope information. For example, whether the high frequency temporal envelope is flat can be encoded as the high frequency temporal envelope information only when the encoding result of the low frequency temporal envelope information indicates that the low frequency temporal envelope is flat.
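The conditional encoding in the last example can be sketched as follows, assuming one-bit flags and a bit list as the container. Both assumptions, and the function names, are illustrative only. The high frequency flag is written only when the low frequency flag indicates a flat envelope.

```python
def encode_envelope_flags(low_is_flat, high_is_flat):
    """Write the low frequency flag, then the high frequency flag only if low is flat."""
    bits = [1 if low_is_flat else 0]
    if low_is_flat:
        bits.append(1 if high_is_flat else 0)
    return bits

def decode_envelope_flags(bits):
    """Read the flags back; high_is_flat is None when it was not transmitted."""
    low_is_flat = bits[0] == 1
    high_is_flat = (bits[1] == 1) if low_is_flat else None
    return low_is_flat, high_is_flat
```

When the low frequency envelope is not flat, only one bit is spent, and the decoder has to determine the high frequency shape from other available information.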

(282) The code sequence multiplexer 220b receives the code sequence of the low frequency speech signal from the low frequency encoder 200a, receives the code sequence of the high frequency speech signal from the high frequency encoder 200b, receives the encoded low frequency temporal envelope shape information from the low frequency temporal envelope information encoder 200c, receives the encoded high frequency temporal envelope shape information from the high frequency temporal envelope information encoder 220a, and outputs a multiplexed code sequence (step S220-2).

(283) In doing so, the information about the low frequency temporal envelope shape and the information about the high frequency temporal envelope shape may be received, for example, as separately encoded information or as jointly encoded information. For example, information about the low frequency temporal envelope shape and information about the high frequency temporal envelope shape, both being represented by a single piece of information and encoded, may be received.

(284) [First Modification of Speech Decoding Device of Fourth Embodiment]

(285) FIG. 20 is a diagram showing the configuration of a first modification 120A of the speech decoding device according to the fourth embodiment. The difference from the speech decoding device 120 in the fourth embodiment is that the low frequency signal having the temporal envelope shape modified by the low frequency temporal envelope modifier 100d is used in decoding a high frequency signal in the high frequency decoder 100eA.

(286) FIG. 21 is a flowchart showing the operation of the first modification 120A of the speech decoding device according to the fourth embodiment. In step S100-5A in FIG. 21, when the low frequency decoded signal obtained by the low frequency decoder 100b is used in decoding a high frequency signal, the low frequency signal having the temporal envelope shape modified by the low frequency temporal envelope modifier 100d is used.

(287) [Second Modification of Speech Decoding Device of Fourth Embodiment]

(288) FIG. 22 is a diagram showing the configuration of a second modification 120B of the speech decoding device according to the fourth embodiment. The difference from the first modification of the speech decoding device in the fourth embodiment is that the low frequency signal input to the low frequency/high frequency signal combiner 100f is not output from the low frequency temporal envelope modifier 100d but output from the low frequency decoder 100b.

(289) FIG. 23 is a flowchart showing the operation of the second modification 120B of the speech decoding device according to the fourth embodiment. In step S100-6 in FIG. 23, the low frequency signal from the low frequency decoder 100b and the high frequency signal from the high frequency temporal envelope modifier 110c are combined.

(290) [Third Modification of Speech Decoding Device of Fourth Embodiment]

(291) FIG. 24 is a diagram showing the configuration of a third modification 120C of the speech decoding device according to the fourth embodiment.

(292) FIG. 25 is a flowchart showing the operation of the third modification 120C of the speech decoding device according to the fourth embodiment.

(293) The present modification differs from the speech decoding device 120 according to the fourth embodiment in that it includes a low frequency temporal envelope shape determiner 120c and a high frequency temporal envelope modifier 120d in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope modifier 110c.

(294) In the present modification, the low frequency temporal envelope shape determiner 120c differs from the low frequency temporal envelope shape determiner 100c in that it also notifies the high frequency temporal envelope modifier 120d of the determined temporal envelope shape.

(295) The high frequency temporal envelope modifier 120d differs from the high frequency temporal envelope modifier 110c in that the shape of the temporal envelope of the high frequency signal output from the high frequency decoder 100e is modified, based on at least one of the temporal envelope shape determined by the high frequency temporal envelope shape determiner 120b and the temporal envelope shape determined by the low frequency temporal envelope shape determiner 120c (S120-3).

(296) For example, if the low frequency temporal envelope shape determiner 120c determines that the temporal envelope shape is flat, the temporal envelope of the high frequency signal output from the high frequency decoder 100e is modified into a flat shape, irrespective of the temporal envelope shape determined by the high frequency temporal envelope shape determiner 120b. For example, if the low frequency temporal envelope shape determiner 120c determines that the temporal envelope shape is not flat, the temporal envelope of the high frequency signal output from the high frequency decoder 100e is not modified into a flat shape, irrespective of the temporal envelope shape determined by the high frequency temporal envelope shape determiner 120b. This is applicable to the cases of onset and offset and is not limited to any specific temporal envelope shape.
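The example policy of the present modification can be sketched as follows: the decision made for the low frequency band alone controls whether the high frequency signal is flattened, and the decision made for the high frequency band is ignored. The flattening method shown here (rescaling each time slot to the frame-wide RMS value), the slot length, and the function names are assumptions made for illustration. This description does not prescribe them at this point.

```python
import math

def flatten_envelope(signal, slot_len):
    """Assumed flattening: rescale every time slot to the RMS of the whole frame."""
    if not signal:
        return []
    frame_rms = math.sqrt(sum(x * x for x in signal) / len(signal))
    out = []
    for start in range(0, len(signal), slot_len):
        slot = signal[start:start + slot_len]
        slot_rms = math.sqrt(sum(x * x for x in slot) / len(slot))
        gain = frame_rms / slot_rms if slot_rms > 0.0 else 0.0
        out.extend(x * gain for x in slot)
    return out

def modify_high_band(high_signal, low_shape, high_shape, slot_len=4):
    """Flatten the high band if and only if the low band was determined to be flat.

    high_shape is accepted but deliberately unused, mirroring the
    "irrespective of" wording of the example.
    """
    if low_shape == "flat":
        return flatten_envelope(high_signal, slot_len)
    return list(high_signal)
```

The same structure would apply to onset and offset shapes by replacing the flattening step with the corresponding envelope modification.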

(297) [Fourth Modification of Speech Decoding Device of Fourth Embodiment]

(298) FIG. 26 is a diagram showing the configuration of a fourth modification 120D of the speech decoding device according to the fourth embodiment.

(299) FIG. 27 is a flowchart showing the operation of the fourth modification 120D of the speech decoding device according to the fourth embodiment.

(300) The present modification differs from the speech decoding device 120 according to the fourth embodiment in that it includes a high frequency temporal envelope shape determiner 120bA and a low frequency temporal envelope modifier 120e in place of the high frequency temporal envelope shape determiner 120b and the low frequency temporal envelope modifier 100d.

(301) In the present modification, the high frequency temporal envelope shape determiner 120bA differs from the high frequency temporal envelope shape determiner 120b in that it also notifies the low frequency temporal envelope modifier 120e of the determined temporal envelope shape.

(302) The determination of the temporal envelope shape in the high frequency temporal envelope shape determiner 120bA can be based, for example, on the frequency power distribution of the low frequency signal, in addition to the above examples. For example, the frame length in the decoding of the high frequency signal obtained from the code sequence demultiplexer 120a can be used. For example, it can be determined that the shape is flat if the frame length is long, and it can be determined that the shape is onset or offset if the frame length is short. The high frequency temporal envelope shape determiner 120b can also make the determination in the same manner.
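The frame length rule can be sketched as follows. The threshold of 1024 samples and the function name are hypothetical. This description states only that a long frame suggests a flat shape and a short frame suggests an onset or offset, so the sketch does not attempt to distinguish onset from offset.

```python
def shape_from_frame_length(frame_length, long_frame_threshold=1024):
    """Classify the temporal envelope shape from the decoding frame length.

    long_frame_threshold is an assumed value, not one given in the text.
    """
    if frame_length <= 0:
        raise ValueError("frame_length must be positive")
    if frame_length >= long_frame_threshold:
        return "flat"
    # Telling onset from offset would need further information.
    return "onset_or_offset"
```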

(303) The low frequency temporal envelope modifier 120e differs from the low frequency temporal envelope modifier 100d in that the shape of the temporal envelope of the low frequency signal output from the low frequency decoder 100b is modified, based on at least one of the temporal envelope shape determined by the low frequency temporal envelope shape determiner 100c and the temporal envelope shape determined by the high frequency temporal envelope shape determiner 120bA (S120-4).

(304) For example, if the high frequency temporal envelope shape determiner 120bA determines that the temporal envelope shape is flat, the temporal envelope of the low frequency signal output from the low frequency decoder 100b is modified into a flat shape, irrespective of the temporal envelope shape determined by the low frequency temporal envelope shape determiner 100c. For example, if the high frequency temporal envelope shape determiner 120bA determines that the temporal envelope shape is not flat, the temporal envelope of the low frequency signal output from the low frequency decoder 100b is not modified into a flat shape, irrespective of the temporal envelope shape determined by the low frequency temporal envelope shape determiner 100c. This is applicable to the cases of onset and offset and is not limited to any specific temporal envelope shape.

(305) [Fifth Modification of Speech Decoding Device of Fourth Embodiment]

(306) FIG. 28 is a diagram showing the configuration of a fifth modification 120E of the speech decoding device according to the fourth embodiment.

(307) FIG. 29 is a flowchart showing the operation of the fifth modification 120E of the speech decoding device according to the fourth embodiment.

(308) The present modification includes the low frequency temporal envelope shape determiner 120c, the high frequency temporal envelope modifier 120d, the high frequency temporal envelope shape determiner 120bA, and the low frequency temporal envelope modifier 120e.

(309) [Sixth Modification of Speech Decoding Device of Fourth Embodiment]

(310) FIG. 30 is a diagram showing the configuration of a sixth modification 120F of the speech decoding device according to the fourth embodiment.

(311) FIG. 31 is a flowchart showing the operation of the sixth modification 120F of the speech decoding device according to the fourth embodiment.

(312) The present modification differs from the speech decoding device 120 according to the fourth embodiment in that it includes a temporal envelope shape determiner 120f in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope shape determiner 120b.

(313) The temporal envelope shape determiner 120f determines the temporal envelope shape, based on at least one of information about the low frequency temporal envelope shape from the code sequence demultiplexer 120a, information about the high frequency temporal envelope shape, the low frequency signal from the low frequency decoder 100b, and the high frequency signal from the high frequency decoder 100e (S120-5). The low frequency temporal envelope modifier 100d and the high frequency temporal envelope modifier 110c are notified of the determined temporal envelope shape.

(314) For example, it may be determined that the temporal envelope shape is flat. For example, it may be determined that the temporal envelope shape is onset. For example, it may be determined that the temporal envelope shape is offset. The determined temporal envelope shape is not limited to the above examples.

(315) The temporal envelope shape determiner 120f can determine the temporal envelope shape, for example, as performed by the low frequency temporal envelope shape determiners 100c and 120c, and the high frequency temporal envelope shape determiners 120b and 120bA. The method of determining the temporal envelope shape is not limited to the above examples.

(316) [Seventh Modification of Speech Decoding Device of Fourth Embodiment]

(317) FIG. 32 is a diagram showing the configuration of a seventh modification 120G of the speech decoding device according to the fourth embodiment.

(318) FIG. 33 is a flowchart showing the operation of the seventh modification 120G of the speech decoding device according to the fourth embodiment.

(319) The present modification differs from the first modification 120A of the speech decoding device according to the fourth embodiment in that it includes a low frequency temporal envelope shape determiner 120c and a high frequency temporal envelope modifier 120d in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope modifier 110c.

(320) [Eighth Modification of Speech Decoding Device of Fourth Embodiment]

(321) FIG. 34 is a diagram showing the configuration of an eighth modification 120H of the speech decoding device according to the fourth embodiment.

(322) FIG. 35 is a flowchart showing the operation of the eighth modification 120H of the speech decoding device according to the fourth embodiment.

(323) The present modification differs from the first modification 120A of the speech decoding device according to the fourth embodiment in that it includes a high frequency temporal envelope shape determiner 120bA and a low frequency temporal envelope modifier 120e in place of the high frequency temporal envelope shape determiner 120b and the low frequency temporal envelope modifier 100d.

(324) [Ninth Modification of Speech Decoding Device of Fourth Embodiment]

(325) FIG. 36 is a diagram showing the configuration of a ninth modification 120I of the speech decoding device according to the fourth embodiment.

(326) FIG. 37 is a flowchart showing the operation of the ninth modification 120I of the speech decoding device according to the fourth embodiment.

(327) The present modification includes the low frequency temporal envelope shape determiner 120c, the high frequency temporal envelope modifier 120d, the high frequency temporal envelope shape determiner 120bA, and the low frequency temporal envelope modifier 120e.

(328) [Tenth Modification of Speech Decoding Device of Fourth Embodiment]

(329) FIG. 38 is a diagram showing the configuration of a tenth modification 120J of the speech decoding device according to the fourth embodiment.

(330) FIG. 39 is a flowchart showing the operation of the tenth modification 120J of the speech decoding device according to the fourth embodiment.

(331) The present modification differs from the first modification 120A of the speech decoding device according to the fourth embodiment in that it includes a temporal envelope shape determiner 120f in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope shape determiner 120b.

(332) [Eleventh Modification of Speech Decoding Device of Fourth Embodiment]

(333) FIG. 40 is a diagram showing the configuration of an eleventh modification 120K of the speech decoding device according to the fourth embodiment.

(334) FIG. 41 is a flowchart showing the operation of the eleventh modification 120K of the speech decoding device according to the fourth embodiment.

(335) The present modification differs from the second modification 120B of the speech decoding device according to the fourth embodiment in that it includes a low frequency temporal envelope shape determiner 120c and a high frequency temporal envelope modifier 120d in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope modifier 110c.

(336) [Twelfth Modification of Speech Decoding Device of Fourth Embodiment]

(337) FIG. 42 is a diagram showing the configuration of a twelfth modification 120L of the speech decoding device according to the fourth embodiment.

(338) FIG. 43 is a flowchart showing the operation of the twelfth modification 120L of the speech decoding device according to the fourth embodiment.

(339) The present modification differs from the second modification 120B of the speech decoding device according to the fourth embodiment in that it includes a high frequency temporal envelope shape determiner 120bA and a low frequency temporal envelope modifier 120e in place of the high frequency temporal envelope shape determiner 120b and the low frequency temporal envelope modifier 100d.

(340) [Thirteenth Modification of Speech Decoding Device of Fourth Embodiment]

(341) FIG. 44 is a diagram showing the configuration of a thirteenth modification 120M of the speech decoding device according to the fourth embodiment.

(342) FIG. 45 is a flowchart showing the operation of the thirteenth modification 120M of the speech decoding device according to the fourth embodiment.

(343) The present modification includes the low frequency temporal envelope shape determiner 120c, the high frequency temporal envelope modifier 120d, the high frequency temporal envelope shape determiner 120bA, and the low frequency temporal envelope modifier 120e.

(344) [Fourteenth Modification of Speech Decoding Device of Fourth Embodiment]

(345) FIG. 46 is a diagram showing the configuration of a fourteenth modification 120N of the speech decoding device according to the fourth embodiment.

(346) FIG. 47 is a flowchart showing the operation of the fourteenth modification 120N of the speech decoding device according to the fourth embodiment.

(347) The present modification differs from the second modification 120B of the speech decoding device according to the fourth embodiment in that it includes a temporal envelope shape determiner 120f in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope shape determiner 120b.

Fifth Embodiment

(348) FIG. 48 is a diagram showing the configuration of a speech decoding device 130 according to a fifth embodiment. A communication device of the speech decoding device 130 receives a multiplexed code sequence output from a speech encoding device 230 described below and outputs a decoded speech signal to the outside. As shown in FIG. 48, the speech decoding device 130 functionally includes a code sequence demultiplexer 110a, a low frequency decoder 100b, a high frequency temporal envelope shape determiner 110b, a high frequency temporal envelope modifier 130a, a high frequency decoder 130b, and a low frequency/high frequency signal combiner 100f.

(349) FIG. 49 is a flowchart showing the operation of the speech decoding device 130 according to the fifth embodiment.

(350) The high frequency temporal envelope modifier 130a modifies the shape of the temporal envelope of the low frequency signal input to the high frequency decoder 130b, based on the temporal envelope shape determined by the high frequency temporal envelope shape determiner 110b (step S130-1). The modification of the temporal envelope shape in the high frequency temporal envelope modifier 130a can be performed, for example, in the same manner as the modification of the temporal envelope shape of the decoded signal in the temporal envelope modifier 1d, with the decoded signal obtained by the speech decoder 1b replaced by the low frequency signal obtained by the low frequency decoder 100b.

(351) The high frequency decoder 130b decodes the high frequency encoded part divided by the code sequence demultiplexer 110a to obtain a high frequency signal (step S130-2).

(352) The high frequency decoder 130b differs from the high frequency decoder 100e in that the low frequency signal having the temporal envelope shape modified by the high frequency temporal envelope modifier 130a is used when the low frequency decoded signal obtained by the low frequency decoder is used in decoding the high frequency signal.
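The signal flow of the speech decoding device 130 described above can be sketched as follows. Each processing unit is passed in as a callable, because this description does not fix its internal processing here. The function name, the parameter names, and the inputs given to the shape determiner are illustrative assumptions.

```python
def decode_frame_fifth_embodiment(low_decode, determine_high_shape,
                                  modify_envelope, high_decode, combine,
                                  low_code, high_code, shape_info):
    """Data flow sketch: the low band is reshaped before it feeds the high band decoder."""
    # Low frequency decoder 100b.
    low_signal = low_decode(low_code)
    # High frequency temporal envelope shape determiner 110b
    # (the inputs used here are an assumption).
    high_shape = determine_high_shape(shape_info, low_signal)
    # High frequency temporal envelope modifier 130a (step S130-1):
    # it reshapes the low frequency signal, not the high frequency signal.
    shaped_low = modify_envelope(low_signal, high_shape)
    # High frequency decoder 130b (step S130-2) uses the reshaped low band.
    high_signal = high_decode(high_code, shaped_low)
    # Low frequency/high frequency signal combiner 100f receives the
    # unmodified low frequency signal.
    return combine(low_signal, high_signal)
```

The point illustrated is the ordering: the envelope modification is applied to the low frequency signal that feeds the high frequency decoder, whereas the low frequency signal passed to the combiner is left unchanged.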

(353) FIG. 50 is a diagram showing the configuration of the speech encoding device 230 according to the fifth embodiment. A communication device of the speech encoding device 230 receives a speech signal to be encoded from the outside and outputs the encoded code sequence to the outside. As shown in FIG. 50, the speech encoding device 230 functionally includes a low frequency encoder 200a, a high frequency encoder 200b, a high frequency temporal envelope information encoder 230a, and a code sequence multiplexer 210b.

(354) FIG. 51 is a flowchart showing the operation of the speech encoding device 230 according to the fifth embodiment.

(355) The high frequency temporal envelope information encoder 230a calculates and encodes the high frequency temporal envelope shape information, based on at least one of the input speech signal, information obtained in the encoding process including the encoding result of the input speech signal in the low frequency encoder 200a, and information obtained in the encoding process including the encoding result of the input speech signal in the high frequency encoder 200b (step S230-1).

(356) The high frequency temporal envelope shape information can be calculated and encoded, for example, in the same manner as the temporal envelope information on the low frequency signal is calculated and encoded by the low frequency temporal envelope information encoder 200c. However, unlike the process of calculating and encoding the temporal envelope information on the low frequency signal using the low frequency decoded signal of the input speech signal, the process of calculating and encoding the high frequency temporal envelope shape information can additionally use the information obtained in the encoding process including the encoding result of the input speech signal in the high frequency encoder 200b.

Sixth Embodiment

(357) FIG. 52 is a diagram showing the configuration of a speech decoding device 140 according to a sixth embodiment. A communication device of the speech decoding device 140 receives a multiplexed code sequence output from a speech encoding device 240 described below and outputs a decoded speech signal to the outside. As shown in FIG. 52, the speech decoding device 140 functionally includes a code sequence demultiplexer 120a, a low frequency decoder 100b, a low frequency temporal envelope shape determiner 100c, a low frequency temporal envelope modifier 100d, a high frequency temporal envelope shape determiner 120b, a high frequency temporal envelope modifier 130a, a high frequency decoder 130b, and a low frequency/high frequency signal combiner 100f.

(358) FIG. 53 is a flowchart showing the operation of the speech decoding device according to the sixth embodiment. The code sequence demultiplexer 120a and the high frequency temporal envelope shape determiner 120b perform the same operation as the code sequence demultiplexer 120a and the high frequency temporal envelope shape determiner 120b in the fourth embodiment (steps S120-1, S120-2). The high frequency temporal envelope modifier 130a and the high frequency decoder 130b perform the same operation as the high frequency temporal envelope modifier 130a and the high frequency decoder 130b in the fifth embodiment (steps S130-1, S130-2).

(359) FIG. 54 is a diagram showing the configuration of the speech encoding device 240 according to the sixth embodiment. A communication device of the speech encoding device 240 receives a speech signal to be encoded from the outside and outputs the encoded code sequence to the outside. As shown in FIG. 54, the speech encoding device 240 functionally includes a low frequency encoder 200a, a high frequency encoder 200b, a low frequency temporal envelope information encoder 200c, a high frequency temporal envelope information encoder 220a, and a code sequence multiplexer 220b.

(360) FIG. 55 is a flowchart showing the operation of the speech encoding device 240 according to the sixth embodiment.

(361) [First Modification of Speech Decoding Device of Sixth Embodiment]

(362) FIG. 56 is a diagram showing the configuration of a first modification 140A of the speech decoding device according to the sixth embodiment.

(363) FIG. 57 is a flowchart showing the operation of the first modification 140A of the speech decoding device according to the sixth embodiment.

(364) A high frequency temporal envelope modifier 140a modifies the shape of the temporal envelope of the low frequency signal having the temporal envelope shape modified by the low frequency temporal envelope modifier 100d, based on the temporal envelope shape determined by the high frequency temporal envelope shape determiner 120b (step S140-1). The difference from the high frequency temporal envelope modifier 130a is that the input signal is the low frequency signal having the temporal envelope shape modified by the low frequency temporal envelope modifier 100d.

(365) [Second Modification of Speech Decoding Device of Sixth Embodiment]

(366) FIG. 58 is a diagram showing the configuration of a second modification 140B of the speech decoding device according to the sixth embodiment.

(367) The difference from the first modification of the speech decoding device in the present embodiment is that the low frequency signal to be used in the combining process by the low frequency/high frequency signal combiner 100f is not the low frequency signal having the temporal envelope shape modified by the low frequency temporal envelope modifier 100d but the low frequency signal decoded by the low frequency decoder 100b.

(368) [Third Modification of Speech Decoding Device of Sixth Embodiment]

(369) FIG. 59 is a diagram showing the configuration of a third modification 140C of the speech decoding device according to the sixth embodiment.

(370) FIG. 60 is a flowchart showing the operation of the third modification 140C of the speech decoding device according to the sixth embodiment.

(371) The present modification differs from the speech decoding device 140 according to the sixth embodiment in that it includes a low frequency temporal envelope shape determiner 120c and a high frequency temporal envelope modifier 140b in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope modifier 130a.

(372) The high frequency temporal envelope modifier 140b differs from the high frequency temporal envelope modifier 130a in that the shape of the temporal envelope of the low frequency signal input to the high frequency decoder 130b is modified based on at least one of the temporal envelope shape determined by the high frequency temporal envelope shape determiner 120b and the temporal envelope shape determined by the low frequency temporal envelope shape determiner 120c (S140-2).

(373) For example, if the low frequency temporal envelope shape determiner 120c determines that the temporal envelope shape is flat, the temporal envelope of the low frequency signal input to the high frequency decoder 130b is modified into a flat shape, irrespective of the temporal envelope shape determined by the high frequency temporal envelope shape determiner 120b. For example, if the low frequency temporal envelope shape determiner 120c determines that the temporal envelope shape is not flat, the temporal envelope of the low frequency signal input to the high frequency decoder 130b is not modified into a flat shape, irrespective of the temporal envelope shape determined by the high frequency temporal envelope shape determiner 120b. This is applicable to the cases of onset and offset and is not limited to any specific temporal envelope shape.

(374) [Fourth Modification of Speech Decoding Device of Sixth Embodiment]

(375) FIG. 61 is a diagram showing the configuration of a fourth modification 140D of the speech decoding device according to the sixth embodiment.

(376) FIG. 62 is a flowchart showing the operation of the fourth modification 140D of the speech decoding device according to the sixth embodiment.

(377) The present modification differs from the speech decoding device 140 according to the sixth embodiment in that it includes a high frequency temporal envelope shape determiner 120bA and a low frequency temporal envelope modifier 120e in place of the high frequency temporal envelope shape determiner 120b and the low frequency temporal envelope modifier 100d.

(378) [Fifth Modification of Speech Decoding Device of Sixth Embodiment]

(379) FIG. 63 is a diagram showing the configuration of a fifth modification 140E of the speech decoding device according to the sixth embodiment.

(380) FIG. 64 is a flowchart showing the operation of the fifth modification 140E of the speech decoding device according to the sixth embodiment.

(381) The present modification includes the low frequency temporal envelope shape determiner 120c, the high frequency temporal envelope modifier 140b, the high frequency temporal envelope shape determiner 120bA, and the low frequency temporal envelope modifier 120e.

(382) [Sixth Modification of Speech Decoding Device of Sixth Embodiment]

(383) FIG. 65 is a diagram showing the configuration of a sixth modification 140F of the speech decoding device according to the sixth embodiment.

(384) FIG. 66 is a flowchart showing the operation of the sixth modification 140F of the speech decoding device according to the sixth embodiment.

(385) The present modification differs from the speech decoding device 140 according to the sixth embodiment in that it includes a temporal envelope shape determiner 120f in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope shape determiner 120b.

(386) [Seventh Modification of Speech Decoding Device of Sixth Embodiment]

(387) FIG. 67 is a diagram showing the configuration of a seventh modification 140G of the speech decoding device according to the sixth embodiment.

(388) FIG. 68 is a flowchart showing the operation of the seventh modification 140G of the speech decoding device according to the sixth embodiment.

(389) The present modification differs from the first modification 140A of the speech decoding device according to the sixth embodiment in that it includes a low frequency temporal envelope shape determiner 120c and a high frequency temporal envelope modifier 140b in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope modifier 140a.

(390) In the present modification, the high frequency temporal envelope modifier 140b receives the low frequency signal whose temporal envelope shape has already been modified and which is to be input to the high frequency decoder 130b, and modifies the shape of the temporal envelope of that signal based on at least one of the temporal envelope shape determined by the high frequency temporal envelope shape determiner 120b and the temporal envelope shape determined by the low frequency temporal envelope shape determiner 120c (S140-2).

(391) [Eighth Modification of Speech Decoding Device of Sixth Embodiment]

(392) FIG. 69 is a diagram showing the configuration of an eighth modification 140H of the speech decoding device according to the sixth embodiment.

(393) FIG. 70 is a flowchart showing the operation of the eighth modification 140H of the speech decoding device according to the sixth embodiment.

(394) The present modification differs from the first modification 140A of the speech decoding device according to the sixth embodiment in that it includes a high frequency temporal envelope shape determiner 120bA and a low frequency temporal envelope modifier 120e in place of the high frequency temporal envelope shape determiner 120b and the low frequency temporal envelope modifier 100d.

(395) [Ninth Modification of Speech Decoding Device of Sixth Embodiment]

(396) FIG. 71 is a diagram showing the configuration of a ninth modification 140I of the speech decoding device according to the sixth embodiment.

(397) FIG. 72 is a flowchart showing the operation of the ninth modification 140I of the speech decoding device according to the sixth embodiment.

(398) The present modification includes the low frequency temporal envelope shape determiner 120c, the high frequency temporal envelope modifier 140b, the high frequency temporal envelope shape determiner 120bA, and the low frequency temporal envelope modifier 120e.

(399) [Tenth Modification of Speech Decoding Device of Sixth Embodiment]

(400) FIG. 73 is a diagram showing the configuration of a tenth modification 140J of the speech decoding device according to the sixth embodiment.

(401) FIG. 74 is a flowchart showing the operation of the tenth modification 140J of the speech decoding device according to the sixth embodiment.

(402) The present modification differs from the first modification 140A of the speech decoding device according to the sixth embodiment in that it includes a temporal envelope shape determiner 120f in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope shape determiner 120b.

(403) [Eleventh Modification of Speech Decoding Device of Sixth Embodiment]

(404) FIG. 75 is a diagram showing the configuration of an eleventh modification 140K of the speech decoding device according to the sixth embodiment.

(405) FIG. 76 is a flowchart showing the operation of the eleventh modification 140K of the speech decoding device according to the sixth embodiment.

(406) The present modification differs from the second modification 140B of the speech decoding device according to the sixth embodiment in that it includes a low frequency temporal envelope shape determiner 120c and a high frequency temporal envelope modifier 140b in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope modifier 140a.

(407) [Twelfth Modification of Speech Decoding Device of Sixth Embodiment]

(408) FIG. 77 is a diagram showing the configuration of a twelfth modification 140L of the speech decoding device according to the sixth embodiment.

(409) FIG. 78 is a flowchart showing the operation of the twelfth modification 140L of the speech decoding device according to the sixth embodiment.

(410) The present modification differs from the second modification 140B of the speech decoding device according to the sixth embodiment in that it includes a high frequency temporal envelope shape determiner 120bA and a low frequency temporal envelope modifier 120e in place of the high frequency temporal envelope shape determiner 120b and the low frequency temporal envelope modifier 100d.

(411) [Thirteenth Modification of Speech Decoding Device of Sixth Embodiment]

(412) FIG. 79 is a diagram showing the configuration of a thirteenth modification 140M of the speech decoding device according to the sixth embodiment.

(413) FIG. 80 is a flowchart showing the operation of the thirteenth modification 140M of the speech decoding device according to the sixth embodiment.

(414) The present modification includes the low frequency temporal envelope shape determiner 120c, the high frequency temporal envelope modifier 140b, the high frequency temporal envelope shape determiner 120bA, and the low frequency temporal envelope modifier 120e.

(415) [Fourteenth Modification of Speech Decoding Device of Sixth Embodiment]

(416) FIG. 81 is a diagram showing the configuration of a fourteenth modification 140N of the speech decoding device according to the sixth embodiment.

(417) FIG. 82 is a flowchart showing the operation of the fourteenth modification 140N of the speech decoding device according to the sixth embodiment.

(418) The present modification differs from the second modification 140B of the speech decoding device according to the sixth embodiment in that it includes a temporal envelope shape determiner 120f in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope shape determiner 120b.

Seventh Embodiment

(419) FIG. 83 is a diagram showing the configuration of a speech decoding device 150 according to a seventh embodiment. A communication device of the speech decoding device 150 receives a multiplexed code sequence output from a speech encoding device 250 described below and outputs a decoded speech signal to the outside. As shown in FIG. 83, the speech decoding device 150 functionally includes a code sequence demultiplexer 150a, switches 150b, a low frequency decoder 100b, a low frequency temporal envelope shape determiner 100c, a low frequency temporal envelope modifier 100d, a high frequency decoder 100e, a high frequency temporal envelope shape determiner 120b, a high frequency temporal envelope modifier 110c, and a low frequency/high frequency signal combiner 150c.

(420) FIG. 84 is a flowchart showing the operation of the speech decoding device according to the seventh embodiment.

(421) The code sequence demultiplexer 150a divides a code sequence into high frequency signal generation control information, a low frequency encoded part, and information about the temporal envelope shape (step S150-1).

(422) It is determined whether to generate a high frequency signal, based on the high frequency signal generation control information obtained in the code sequence demultiplexer 150a (step S150-2).

(423) If a high frequency signal is to be generated, the code sequence demultiplexer 150a extracts a high frequency encoded part from the code sequence (step S150-3). A high frequency signal is then generated using the high frequency encoded part of the code sequence, the temporal envelope shape of the high frequency signal is determined, and the temporal envelope shape of the high frequency signal is modified.

(424) The order in which the processing in steps S150-2 and S150-3 is performed is not limited to the order illustrated in the flowchart in FIG. 84, as long as these steps are performed before the determination of the high frequency temporal envelope shape and the decoding of the high frequency encoded part.

(425) If it is determined to generate a high frequency signal based on the high frequency signal generation control information, the low frequency/high frequency signal combiner 150c synthesizes an output speech signal from the low frequency signal whose temporal envelope shape is modified and the high frequency signal whose temporal envelope shape is modified. If it is determined not to generate a high frequency signal based on the high frequency signal generation control information, the low frequency/high frequency signal combiner 150c synthesizes an output speech signal from the low frequency signal whose temporal envelope shape is modified (step S150-4). However, even when it is determined not to generate a high frequency signal, if the low frequency signal whose temporal envelope shape is modified is input to the low frequency/high frequency signal combiner 150c in a state ready for output, the input low frequency signal can optionally be output as it is.
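The decoding flow of steps S150-1 to S150-4 can be sketched as follows. This is a simplified illustration under stated assumptions: the demultiplexed frame is modeled as a small record, the decoders and modifiers are passed in as callables, and the combiner is assumed to add the two signals sample by sample. The names `DemuxedFrame` and `decode_frame` are hypothetical, and the actual decoding and combining methods are not limited to this form.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class DemuxedFrame:
    """Hypothetical result of the code sequence demultiplexer (S150-1)."""
    generate_high: bool          # high frequency signal generation control information
    low_part: bytes              # low frequency encoded part
    envelope_info: bytes         # information about the temporal envelope shape
    high_part: Optional[bytes]   # extracted only if generate_high is True (S150-3)


def decode_frame(frame: DemuxedFrame,
                 decode_low: Callable[[bytes], List[float]],
                 modify_low: Callable[[List[float], bytes], List[float]],
                 decode_high: Callable[[bytes], List[float]],
                 modify_high: Callable[[List[float], bytes], List[float]]) -> List[float]:
    # The low frequency path always runs.
    low = modify_low(decode_low(frame.low_part), frame.envelope_info)

    # S150-2: the switch on the control information.
    if not frame.generate_high:
        # S150-4 without a high frequency signal: output the low band as it is.
        return list(low)

    # High frequency path: decode, then modify the temporal envelope shape.
    high = modify_high(decode_high(frame.high_part), frame.envelope_info)

    # S150-4: combine both bands (assumed here to be sample-wise addition).
    return [l + h for l, h in zip(low, high)]


# Usage with trivial stand-in decoders and pass-through modifiers.
frame = DemuxedFrame(True, b"low", b"env", b"high")
print(decode_frame(frame,
                   lambda part: [1.0, 2.0], lambda sig, env: sig,
                   lambda part: [0.5, 0.5], lambda sig, env: sig))
```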

(426) FIG. 85 is a diagram showing the configuration of the speech encoding device 250 according to the seventh embodiment. A communication device of the speech encoding device 250 receives a speech signal to be encoded from the outside and outputs the encoded code sequence to the outside. As shown in FIG. 85, the speech encoding device 250 functionally includes a high frequency signal generation control information encoder 250a, a low frequency encoder 200a, a high frequency encoder 200b, a low frequency temporal envelope information encoder 200c, a high frequency temporal envelope information encoder 220a, and a code sequence multiplexer 250b.

(427) FIG. 86 is a flowchart showing the operation of the speech encoding device 250 according to the seventh embodiment.

(428) The high frequency signal generation control information encoder 250a determines whether to generate a high frequency signal based on at least one of an input speech signal and a high frequency signal generation control instruction signal and encodes high frequency signal generation control information (step S250-1). For example, if the input speech signal includes a signal in a frequency band to be encoded by the high frequency encoder 200b, it can be determined to generate a high frequency signal. As another example, if the high frequency signal generation control instruction signal instructs generation of a high frequency signal, it can be determined to generate a high frequency signal. These two methods can also be combined; for example, it can be determined to generate a high frequency signal if at least one of the two methods decides to generate a high frequency signal.

(429) The high frequency signal generation control information can be encoded, for example, by one bit representing whether to generate a high frequency signal.

(430) The method of determining whether to generate a high frequency signal and the method of encoding the high frequency signal generation control information are not limited.
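As one possible illustration of step S250-1, the combined decision and the one-bit encoding described above can be sketched as follows. The function names, the boolean inputs, and the bit polarity (1 meaning "generate") are assumptions made for this sketch; as stated above, neither the determination method nor the encoding method is limited.

```python
from typing import Optional


def decide_generate_high(has_high_band_content: bool,
                         instruction: Optional[bool] = None) -> bool:
    """Combine the two example criteria with a logical OR.

    has_high_band_content: True if the input speech signal contains a signal
        in the band handled by the high frequency encoder (assumed to be
        detected elsewhere).
    instruction: the control instruction signal, or None if none is given.
    """
    return has_high_band_content or bool(instruction)


def encode_control_bit(generate_high: bool) -> int:
    """Encode the control information as a single bit (assumed: 1 = generate)."""
    return 1 if generate_high else 0


# A signal without high band content, but with an explicit instruction.
print(encode_control_bit(decide_generate_high(False, instruction=True)))
```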

(431) If the high frequency signal generation control information encoder 250a determines to generate a high frequency signal, the high frequency encoder 200b encodes a high frequency signal corresponding to the high frequency component of the input speech signal, and the high frequency temporal envelope information encoder 220a calculates and encodes high frequency temporal envelope shape information. By contrast, if the high frequency signal generation control information encoder 250a determines not to generate a high frequency signal, the encoding of the high frequency signal and the calculation and encoding of high frequency temporal envelope shape information are not carried out (step S250-2).

(432) The code sequence multiplexer 250b receives the encoded high frequency signal generation control information from the high frequency signal generation control information encoder 250a, the code sequence of the low frequency speech signal from the low frequency encoder 200a, and the encoded low frequency temporal envelope shape information from the low frequency temporal envelope information encoder 200c. If the high frequency signal generation control information encoder 250a determines to generate a high frequency signal, the code sequence multiplexer 250b additionally receives the code sequence of the high frequency speech signal from the high frequency encoder 200b and the encoded high frequency temporal envelope shape information from the high frequency temporal envelope information encoder 220a. The code sequence multiplexer 250b then outputs a multiplexed code sequence (step S250-3).

(433) If the high frequency signal generation control information encoder 250a determines to generate a high frequency signal, the information about the low frequency temporal envelope shape and the information about the high frequency temporal envelope shape may be received, for example, as separately encoded pieces of information or as jointly encoded information. For example, the information about the low frequency temporal envelope shape and the information about the high frequency temporal envelope shape may be represented by a single piece of information, encoded, and received in that form.
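The conditional multiplexing of step S250-3 can be sketched as follows. The field names and the representation of the multiplexed code sequence as an ordered list of named payloads are hypothetical choices made for this illustration; the specification does not define a concrete bitstream layout here. The sketch assumes the separately encoded case, in which the low frequency and high frequency temporal envelope shape information are distinct fields.

```python
from typing import List, Optional, Tuple

Field = Tuple[str, bytes]


def multiplex(control_bit: int,
              low_code: bytes,
              low_env: bytes,
              high_code: Optional[bytes] = None,
              high_env: Optional[bytes] = None) -> List[Field]:
    """Assemble the fields of one multiplexed code sequence.

    Assumption: control_bit == 1 means a high frequency signal is generated.
    """
    # Fields that are always present.
    fields: List[Field] = [
        ("hf_control", bytes([control_bit])),
        ("lf_code", low_code),
        ("lf_env", low_env),
    ]
    # High frequency fields are added only when generation is enabled.
    if control_bit == 1:
        if high_code is None or high_env is None:
            raise ValueError("high frequency fields are required when control_bit is 1")
        fields.append(("hf_code", high_code))
        fields.append(("hf_env", high_env))
    return fields


# Usage: print the field order with and without the high frequency part.
print([name for name, _ in multiplex(1, b"L", b"l", b"H", b"h")])
print([name for name, _ in multiplex(0, b"L", b"l")])
```

In the jointly encoded case, the two envelope fields would instead be replaced by a single field carrying both pieces of information.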

(434) [First Modification of Speech Decoding Device of Seventh Embodiment]

(435) FIG. 87 is a diagram showing the configuration of a first modification 150A of the speech decoding device according to the seventh embodiment.

(436) FIG. 88 is a flowchart showing the operation of the first modification 150A of the speech decoding device according to the seventh embodiment. The difference from the speech decoding device 150 in the seventh embodiment is that the low frequency signal having the temporal envelope shape modified by the low frequency temporal envelope modifier 100d is used in decoding a high frequency signal by the high frequency decoder 100eA. In step S100-5A in FIG. 88, when the low frequency decoded signal obtained by the low frequency decoder 100b is used in decoding a high frequency signal, the low frequency signal having the temporal envelope shape modified by the low frequency temporal envelope modifier 100d is used.

(437) The order in which the processing in steps S150-2 and S150-3 is performed is not limited to the order illustrated in the flowchart in FIG. 88, as long as these steps are performed before the determination of the high frequency temporal envelope shape and the decoding of the high frequency encoded part.

(438) [Second Modification of Speech Decoding Device of Seventh Embodiment]

(439) FIG. 89 is a diagram showing the configuration of a second modification 150B of the speech decoding device according to the seventh embodiment. The difference from the first modification of the speech decoding device in the seventh embodiment is that the low frequency signal input to the low frequency/high frequency signal combiner 150c is not output from the low frequency temporal envelope modifier 100d but output from the low frequency decoder 100b.

(440) [Third Modification of Speech Decoding Device of Seventh Embodiment]

(441) FIG. 90 is a diagram showing the configuration of a third modification 150C of the speech decoding device according to the seventh embodiment.

(442) FIG. 91 is a flowchart showing the operation of the third modification 150C of the speech decoding device according to the seventh embodiment.

(443) The present modification differs from the speech decoding device 150 according to the seventh embodiment in that it includes a low frequency temporal envelope shape determiner 120c and a high frequency temporal envelope modifier 120d in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope modifier 110c.

(444) [Fourth Modification of Speech Decoding Device of Seventh Embodiment]

(445) FIG. 92 is a diagram showing the configuration of a fourth modification 150D of the speech decoding device according to the seventh embodiment.

(446) FIG. 93 is a flowchart showing the operation of the fourth modification 150D of the speech decoding device according to the seventh embodiment.

(447) The present modification differs from the speech decoding device 150 according to the seventh embodiment in that it includes a high frequency temporal envelope shape determiner 120bA and a low frequency temporal envelope modifier 120e in place of the high frequency temporal envelope shape determiner 120b and the low frequency temporal envelope modifier 100d.

(448) [Fifth Modification of Speech Decoding Device of Seventh Embodiment]

(449) FIG. 94 is a diagram showing the configuration of a fifth modification 150E of the speech decoding device according to the seventh embodiment.

(450) FIG. 95 is a flowchart showing the operation of the fifth modification 150E of the speech decoding device according to the seventh embodiment.

(451) The present modification includes the low frequency temporal envelope shape determiner 120c, the high frequency temporal envelope modifier 120d, the high frequency temporal envelope shape determiner 120bA, and the low frequency temporal envelope modifier 120e.

(452) [Sixth Modification of Speech Decoding Device of Seventh Embodiment]

(453) FIG. 96 is a diagram showing the configuration of a sixth modification 150F of the speech decoding device according to the seventh embodiment.

(454) FIG. 97 is a flowchart showing the operation of the sixth modification 150F of the speech decoding device according to the seventh embodiment.

(455) The present modification differs from the speech decoding device 150 according to the seventh embodiment in that it includes a temporal envelope shape determiner 120f in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope shape determiner 120b.

(456) [Seventh Modification of Speech Decoding Device of Seventh Embodiment]

(457) FIG. 98 is a diagram showing the configuration of a seventh modification 150G of the speech decoding device according to the seventh embodiment.

(458) FIG. 99 is a flowchart showing the operation of the seventh modification 150G of the speech decoding device according to the seventh embodiment.

(459) The present modification differs from the first modification 150A of the speech decoding device according to the seventh embodiment in that it includes a low frequency temporal envelope shape determiner 120c and a high frequency temporal envelope modifier 120d in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope modifier 110c.

(460) [Eighth Modification of Speech Decoding Device of Seventh Embodiment]

(461) FIG. 100 is a diagram showing the configuration of an eighth modification 150H of the speech decoding device according to the seventh embodiment.

(462) FIG. 101 is a flowchart showing the operation of the eighth modification 150H of the speech decoding device according to the seventh embodiment.

(463) The present modification differs from the first modification 150A of the speech decoding device according to the seventh embodiment in that it includes a high frequency temporal envelope shape determiner 120bA and a low frequency temporal envelope modifier 120e in place of the high frequency temporal envelope shape determiner 120b and the low frequency temporal envelope modifier 100d.

(464) [Ninth Modification of Speech Decoding Device of Seventh Embodiment]

(465) FIG. 102 is a diagram showing the configuration of a ninth modification 150I of the speech decoding device according to the seventh embodiment.

(466) FIG. 103 is a flowchart showing the operation of the ninth modification 150I of the speech decoding device according to the seventh embodiment.

(467) The present modification includes the low frequency temporal envelope shape determiner 120c, the high frequency temporal envelope modifier 120d, the high frequency temporal envelope shape determiner 120bA, and the low frequency temporal envelope modifier 120e.

(468) [Tenth Modification of Speech Decoding Device of Seventh Embodiment]

(469) FIG. 104 is a diagram showing the configuration of a tenth modification 150J of the speech decoding device according to the seventh embodiment.

(470) FIG. 105 is a flowchart showing the operation of the tenth modification 150J of the speech decoding device according to the seventh embodiment.

(471) The present modification differs from the first modification 150A of the speech decoding device according to the seventh embodiment in that it includes a temporal envelope shape determiner 120f in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope shape determiner 120b.

(472) [Eleventh Modification of Speech Decoding Device of Seventh Embodiment]

(473) FIG. 106 is a diagram showing the configuration of an eleventh modification 150K of the speech decoding device according to the seventh embodiment.

(474) FIG. 107 is a flowchart showing the operation of the eleventh modification 150K of the speech decoding device according to the seventh embodiment.

(475) The present modification differs from the second modification 150B of the speech decoding device according to the seventh embodiment in that it includes a low frequency temporal envelope shape determiner 120c and a high frequency temporal envelope modifier 120d in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope modifier 110c.

(476) [Twelfth Modification of Speech Decoding Device of Seventh Embodiment]

(477) FIG. 108 is a diagram showing the configuration of a twelfth modification 150L of the speech decoding device according to the seventh embodiment.

(478) FIG. 109 is a flowchart showing the operation of the twelfth modification 150L of the speech decoding device according to the seventh embodiment.

(479) The present modification differs from the second modification 150B of the speech decoding device according to the seventh embodiment in that it includes a high frequency temporal envelope shape determiner 120bA and a low frequency temporal envelope modifier 120e in place of the high frequency temporal envelope shape determiner 120b and the low frequency temporal envelope modifier 100d.

(480) [Thirteenth Modification of Speech Decoding Device of Seventh Embodiment]

(481) FIG. 110 is a diagram showing the configuration of a thirteenth modification 150M of the speech decoding device according to the seventh embodiment.

(482) FIG. 111 is a flowchart showing the operation of the thirteenth modification 150M of the speech decoding device according to the seventh embodiment.

(483) The present modification includes the low frequency temporal envelope shape determiner 120c, the high frequency temporal envelope modifier 120d, the high frequency temporal envelope shape determiner 120bA, and the low frequency temporal envelope modifier 120e.

(484) [Fourteenth Modification of Speech Decoding Device of Seventh Embodiment]

(485) FIG. 112 is a diagram showing the configuration of a fourteenth modification 150N of the speech decoding device according to the seventh embodiment.

(486) FIG. 113 is a flowchart showing the operation of the fourteenth modification 150N of the speech decoding device according to the seventh embodiment.

(487) The present modification differs from the second modification 150B of the speech decoding device according to the seventh embodiment in that it includes a temporal envelope shape determiner 120f in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope shape determiner 120b.

Eighth Embodiment

(488) FIG. 114 is a diagram showing the configuration of a speech decoding device 160 according to an eighth embodiment. A communication device of the speech decoding device 160 receives a multiplexed code sequence output from a speech encoding device 260 described below and outputs a decoded speech signal to the outside. As shown in FIG. 114, the speech decoding device 160 functionally includes a code sequence demultiplexer 150a, switches 150b, a low frequency decoder 100b, a low frequency temporal envelope shape determiner 100c, a low frequency temporal envelope modifier 100d, a high frequency temporal envelope shape determiner 120b, a high frequency temporal envelope modifier 130a, a high frequency decoder 130b, and a low frequency/high frequency signal combiner 150c.

(489) FIG. 115 is a flowchart showing the operation of the speech decoding device according to the eighth embodiment. The order in which the processing in steps S150-2 and S150-3 is performed is not limited to the order illustrated in the flowchart in FIG. 115, as long as these steps are performed before the determination of the high frequency temporal envelope shape and the decoding of the high frequency encoded part.

(490) FIG. 116 is a diagram showing the configuration of the speech encoding device 260 according to the eighth embodiment. A communication device of the speech encoding device 260 receives a speech signal to be encoded from the outside and outputs the encoded code sequence to the outside. As shown in FIG. 116, the speech encoding device 260 functionally includes a high frequency signal generation control information encoder 250a, a low frequency encoder 200a, a high frequency encoder 200b, a low frequency temporal envelope information encoder 200c, a high frequency temporal envelope information encoder 220a, and a code sequence multiplexer 250b.

(491) FIG. 117 is a flowchart showing the operation of the speech encoding device 260 according to the eighth embodiment.

(492) [First Modification of Speech Decoding Device of Eighth Embodiment]

(493) FIG. 118 is a diagram showing the configuration of a first modification 160A of the speech decoding device according to the eighth embodiment.

(494) FIG. 119 is a flowchart showing the operation of the first modification 160A of the speech decoding device according to the eighth embodiment.

(495) The difference from the speech decoding device 160 of the present embodiment is that the high frequency temporal envelope modifier 140a described in the first modification of the speech decoding device in the sixth embodiment is used in place of the high frequency temporal envelope modifier 130a.

(496) The order in which the processing in steps S150-2 and S150-3 is performed is not limited to the order illustrated in the flowchart in FIG. 119, as long as these steps are performed before the determination of the high frequency temporal envelope shape and the decoding of the high frequency encoded part.

(497) [Second Modification of Speech Decoding Device of Eighth Embodiment]

(498) FIG. 120 is a diagram showing the configuration of a second modification 160B of the speech decoding device according to the eighth embodiment.

(499) The difference from the first modification 160A of the speech decoding device of the present embodiment is that the low frequency signal to be used in the combining process by the low frequency/high frequency signal combiner 150c is the low frequency signal decoded by the low frequency decoder 100b, not the low frequency signal having the temporal envelope shape modified by the low frequency temporal envelope modifier 100d, as in the second modification of the speech decoding device of the sixth embodiment.

(500) [Third Modification of Speech Decoding Device of Eighth Embodiment]

(501) FIG. 121 is a diagram showing the configuration of a third modification 160C of the speech decoding device according to the eighth embodiment.

(502) FIG. 122 is a flowchart showing the operation of the third modification 160C of the speech decoding device according to the eighth embodiment.

(503) The present modification differs from the speech decoding device 160 according to the eighth embodiment in that it includes a low frequency temporal envelope shape determiner 120c and a high frequency temporal envelope modifier 140b in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope modifier 130a.

(504) [Fourth Modification of Speech Decoding Device of Eighth Embodiment]

(505) FIG. 123 is a diagram showing the configuration of a fourth modification 160D of the speech decoding device according to the eighth embodiment.

(506) FIG. 124 is a flowchart showing the operation of the fourth modification 160D of the speech decoding device according to the eighth embodiment.

(507) The present modification differs from the speech decoding device 160 according to the eighth embodiment in that it includes a high frequency temporal envelope shape determiner 120bA and a low frequency temporal envelope modifier 120e in place of the high frequency temporal envelope shape determiner 120b and the low frequency temporal envelope modifier 100d.

(508) [Fifth Modification of Speech Decoding Device of Eighth Embodiment]

(509) FIG. 125 is a diagram showing the configuration of a fifth modification 160E of the speech decoding device according to the eighth embodiment.

(510) FIG. 126 is a flowchart showing the operation of the fifth modification 160E of the speech decoding device according to the eighth embodiment.

(511) The present modification includes the low frequency temporal envelope shape determiner 120c, the high frequency temporal envelope modifier 140b, the high frequency temporal envelope shape determiner 120bA, and the low frequency temporal envelope modifier 120e.

(512) [Sixth Modification of Speech Decoding Device of Eighth Embodiment]

(513) FIG. 127 is a diagram showing the configuration of a sixth modification 160F of the speech decoding device according to the eighth embodiment.

(514) FIG. 128 is a flowchart showing the operation of the sixth modification 160F of the speech decoding device according to the eighth embodiment.

(515) The present modification differs from the speech decoding device 160 according to the eighth embodiment in that it includes a temporal envelope shape determiner 120f in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope shape determiner 120b.

(516) [Seventh Modification of Speech Decoding Device of Eighth Embodiment]

(517) FIG. 129 is a diagram showing the configuration of a seventh modification 160G of the speech decoding device according to the eighth embodiment.

(518) FIG. 130 is a flowchart showing the operation of the seventh modification 160G of the speech decoding device according to the eighth embodiment.

(519) The present modification differs from the first modification 160A of the speech decoding device according to the eighth embodiment in that it includes a low frequency temporal envelope shape determiner 120c and a high frequency temporal envelope modifier 140b in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope modifier 140a.

(520) In the present modification, the high frequency temporal envelope modifier 140b modifies the temporal envelope shape of the low frequency signal that has already had its temporal envelope shape modified and that is to be input to the high frequency decoder 130b. The modification is based on at least one of the temporal envelope shape determined by the high frequency temporal envelope shape determiner 120b and the temporal envelope shape determined by the low frequency temporal envelope shape determiner 120c (S140-2).

(521) [Eighth Modification of Speech Decoding Device of Eighth Embodiment]

(522) FIG. 131 is a diagram showing the configuration of an eighth modification 160H of the speech decoding device according to the eighth embodiment.

(523) FIG. 132 is a flowchart showing the operation of the eighth modification 160H of the speech decoding device according to the eighth embodiment.

(524) The present modification differs from the first modification 160A of the speech decoding device according to the eighth embodiment in that it includes a high frequency temporal envelope shape determiner 120bA and a low frequency temporal envelope modifier 120e in place of the high frequency temporal envelope shape determiner 120b and the low frequency temporal envelope modifier 100d.

(525) [Ninth Modification of Speech Decoding Device of Eighth Embodiment]

(526) FIG. 133 is a diagram showing the configuration of a ninth modification 160I of the speech decoding device according to the eighth embodiment.

(527) FIG. 134 is a flowchart showing the operation of the ninth modification 160I of the speech decoding device according to the eighth embodiment.

(528) The present modification includes the low frequency temporal envelope shape determiner 120c, the high frequency temporal envelope modifier 140b, the high frequency temporal envelope shape determiner 120bA, and the low frequency temporal envelope modifier 120e.

(529) [Tenth Modification of Speech Decoding Device of Eighth Embodiment]

(530) FIG. 135 is a diagram showing the configuration of a tenth modification 160J of the speech decoding device according to the eighth embodiment.

(531) FIG. 136 is a flowchart showing the operation of the tenth modification 160J of the speech decoding device according to the eighth embodiment.

(532) The present modification differs from the first modification 160A of the speech decoding device according to the eighth embodiment in that it includes a temporal envelope shape determiner 120f in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope shape determiner 120b.

(533) [Eleventh Modification of Speech Decoding Device of Eighth Embodiment]

(534) FIG. 137 is a diagram showing the configuration of an eleventh modification 160K of the speech decoding device according to the eighth embodiment.

(535) FIG. 138 is a flowchart showing the operation of the eleventh modification 160K of the speech decoding device according to the eighth embodiment.

(536) The present modification differs from the second modification 160B of the speech decoding device according to the eighth embodiment in that it includes a low frequency temporal envelope shape determiner 120c and a high frequency temporal envelope modifier 140b in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope modifier 140a.

(537) [Twelfth Modification of Speech Decoding Device of Eighth Embodiment]

(538) FIG. 139 is a diagram showing the configuration of a twelfth modification 160L of the speech decoding device according to the eighth embodiment.

(539) FIG. 140 is a flowchart showing the operation of the twelfth modification 160L of the speech decoding device according to the eighth embodiment.

(540) The present modification differs from the second modification 160B of the speech decoding device according to the eighth embodiment in that it includes a high frequency temporal envelope shape determiner 120bA and a low frequency temporal envelope modifier 120e in place of the high frequency temporal envelope shape determiner 120b and the low frequency temporal envelope modifier 100d.

(541) [Thirteenth Modification of Speech Decoding Device of Eighth Embodiment]

(542) FIG. 141 is a diagram showing the configuration of a thirteenth modification 160M of the speech decoding device according to the eighth embodiment.

(543) FIG. 142 is a flowchart showing the operation of the thirteenth modification 160M of the speech decoding device according to the eighth embodiment.

(544) The present modification includes the low frequency temporal envelope shape determiner 120c, the high frequency temporal envelope modifier 140b, the high frequency temporal envelope shape determiner 120bA, and the low frequency temporal envelope modifier 120e.

(545) [Fourteenth Modification of Speech Decoding Device of Eighth Embodiment]

(546) FIG. 143 is a diagram showing the configuration of a fourteenth modification 160N of the speech decoding device according to the eighth embodiment.

(547) FIG. 144 is a flowchart showing the operation of the fourteenth modification 160N of the speech decoding device according to the eighth embodiment.

(548) The present modification differs from the second modification 160B of the speech decoding device according to the eighth embodiment in that it includes a temporal envelope shape determiner 120f in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope shape determiner 120b.

(549) [Speech Decoding Device of Ninth Embodiment]

(550) FIG. 145 is a diagram showing the configuration of a speech decoding device 380 according to a ninth embodiment.

(551) FIG. 146 is a flowchart showing the operation of the speech decoding device 380 according to the ninth embodiment.

(552) The temporal envelope modifier 380a modifies the shape of the temporal envelope of the low frequency signal output from the low frequency decoder 100b and the high frequency signal output from the high frequency decoder 100e, based on at least one of the temporal envelope shape determined by the low frequency temporal envelope shape determiner 100c and the temporal envelope shape determined by the high frequency temporal envelope shape determiner 110b (S380-1).

(553) The temporal envelope shape determined by the low frequency temporal envelope shape determiner 100c and the temporal envelope shape determined by the high frequency temporal envelope shape determiner 110b may be the same or different.
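As an illustration of S380-1, the following minimal Python sketch applies one envelope decision to both decoded bands. It is not the method of this specification: the representation of a signal as time slots of sub-band samples, the shape labels `"flat"` and `None`, the rule that either determiner reporting `"flat"` triggers flattening, and the names `flatten_envelope` and `modify_380a` are all assumptions made for the example.

```python
import math

def flatten_envelope(slots, eps=1e-12):
    """Scale every time slot to the frame's mean power (assumed flattening rule)."""
    powers = [sum(x * x for x in slot) for slot in slots]
    if not powers:
        return []
    mean_power = sum(powers) / len(powers)
    flattened = []
    for slot, power in zip(slots, powers):
        # Leave (near-)silent slots untouched to avoid dividing by zero.
        gain = math.sqrt(mean_power / power) if power > eps else 1.0
        flattened.append([gain * x for x in slot])
    return flattened

def modify_380a(low, high, low_shape=None, high_shape=None):
    """Hypothetical stand-in for modifier 380a: one decision, applied to both bands."""
    # "At least one of" the two determined shapes is consulted; here either
    # determiner reporting "flat" is enough (an assumed policy).
    if "flat" in (low_shape, high_shape):
        return flatten_envelope(low), flatten_envelope(high)
    return low, high

# Two time slots per band; only the low frequency determiner reports a shape.
low = [[1.0], [3.0]]
high = [[2.0, 0.0], [0.0, 4.0]]
low_out, high_out = modify_380a(low, high, low_shape="flat")
```

Because the two determined shapes may be the same or different, a real modifier would need a rule for combining them. The sketch simply treats either flag as sufficient, which is only one of several possible policies.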

(554) [First Modification of Speech Decoding Device of Ninth Embodiment]

(555) FIG. 147 is a diagram showing the configuration of a first modification 380A of the speech decoding device according to the ninth embodiment.

(556) FIG. 148 is a flowchart showing the operation of the first modification 380A of the speech decoding device according to the ninth embodiment.

(557) The present modification differs from the speech decoding device 380 according to the ninth embodiment in that it includes a temporal envelope shape determiner 120f in place of the low frequency temporal envelope shape determiner 100c and the high frequency temporal envelope shape determiner 110b, and a temporal envelope modifier 380aA in place of the temporal envelope modifier 380a.

(558) The temporal envelope modifier 380aA modifies the shape of the temporal envelope of the low frequency signal output from the low frequency decoder 100b and the high frequency signal output from the high frequency decoder 100e, based on the temporal envelope shape determined by the temporal envelope shape determiner 120f (S380-1a).

(559) [Speech Decoding Device of Tenth Embodiment]

(560) FIG. 149 is a diagram showing the configuration of a speech decoding device 390 according to a tenth embodiment.

(561) FIG. 150 is a flowchart showing the operation of the speech decoding device 390 according to the tenth embodiment.

(562) In the present embodiment, the temporal envelope modifier 380aA modifies the shape of the temporal envelope of the low frequency signal output from the low frequency decoder 100b, based on the temporal envelope shape determined by the temporal envelope shape determiner 120f. If it is determined, based on the high frequency signal generation information, that a high frequency signal is to be generated, the temporal envelope modifier 380aA additionally modifies the shape of the temporal envelope of the high frequency signal output from the high frequency decoder 100e (S380-1a).
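The conditional flow of S380-1a described above can be sketched in Python as follows. This is an assumption-laden illustration, not the implementation of the specification: the function names `flatten` and `modify_390`, the boolean `generate_high` (standing for the decision derived from the high frequency signal generation information), the `"flat"` label, and the flattening rule itself are all hypothetical.

```python
import math

def flatten(slots):
    """Assumed flattening rule: give every non-silent time slot the frame's mean power."""
    powers = [sum(x * x for x in slot) for slot in slots]
    mean_power = sum(powers) / len(powers) if powers else 0.0
    return [
        [math.sqrt(mean_power / p) * x for x in slot] if p > 0.0 else list(slot)
        for slot, p in zip(slots, powers)
    ]

def modify_390(low, high, shape, generate_high):
    """Hypothetical flow: always process the low band, the high band only on demand."""
    apply = flatten if shape == "flat" else (lambda s: s)
    low_out = apply(low)
    # The high band is touched only when a high frequency signal is generated.
    high_out = apply(high) if generate_high else None
    return low_out, high_out

low_only, no_high = modify_390([[1.0], [3.0]], [[2.0], [4.0]], "flat", False)
both_low, both_high = modify_390([[1.0], [3.0]], [[2.0], [4.0]], "flat", True)
```

Returning `None` for the high band when no high frequency signal is generated is a design choice of the sketch. It makes the skipped branch explicit to the caller.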
