Device and method for generating and decoding a side channel signal transmitted with a main channel signal
09659566 ยท 2017-05-23
Assignee
Inventors
- Gerald Schuller (Erfurt, DE)
- Stefan Wabnik (Oldenburg, DE)
- Bernhard Grill (Lauf, DE)
- Alexander Zink (Stegaurach, DE)
Cpc classification
H04L5/0007
ELECTRICITY
H04N21/44209
ELECTRICITY
H04N21/2383
ELECTRICITY
H04N21/4382
ELECTRICITY
G10L19/008
PHYSICS
H04N21/631
ELECTRICITY
International classification
G10L19/008
PHYSICS
H04L27/34
ELECTRICITY
H04N21/2343
ELECTRICITY
H04N21/2383
ELECTRICITY
H04N21/63
ELECTRICITY
H04N21/442
ELECTRICITY
H04N21/438
ELECTRICITY
Abstract
For generating a signal to be transmitted original information is encoded into a main channel and a side channel, wherein the side channel is more robust against channel influences than the main channel. On the receiver side, when the receive quality is above a threshold, which is necessitated to execute a successful decoding of the main channel, the main channel is reproduced. If the receive quality falls below this threshold, however, the side channel is reproduced which may have less bits than the main channel and which is a correspondingly lower quality representation of the original information than the main channel.
Claims
1. A device for generating a signal to be transmitted, comprising: an encoder for generating an encoded signal from an information signal, wherein the encoder is implemented to generate a main channel and a side channel which are decodable separate from each other and represent the information signal, and to generate the main channel and the side channel so that the side channel is more robust against transmission-channel influences than the main channel; wherein the encoder is a source encoder which is implemented to execute a time-domain/spectral-domain conversion and which provides a description of a spectral envelope for generating the side-channel coefficients using fine quantization; wherein, on a decoder side with a predetermined signal/noise ratio, a speech intelligibility is achievable; and wherein the side channel uses half or less than half as many bits as the main channel.
2. The device according to claim 1, wherein the encoder is implemented to generate the side channel so that it is a qualitatively worse representation of the information signal than the main channel.
3. The device according to claim 1, wherein the encoder is implemented to execute a lossy data compression with the information signal, wherein a compression factor in a generation of the side channel is higher than in a generation of the main channel, or wherein for the main channel a lossless data compression may be executed, while for the side channel a lossy data compression may be executed.
4. The device according to claim 1, wherein the encoder is implemented to generate the side channel such that it necessitates a lower bit rate for transmission than the main channel.
5. The device according to claim 1, wherein the encoder comprises a source encoder and a downstream redundancy encoder, wherein the redundancy encoder is implemented to generate the main channel with a first code rate and to generate the side channel with a second code rate which is smaller than the first code rate, wherein the code rates are smaller than 1 and are defined by the ratio of a number of input bits with regard to a number of output bits derived from the input bits.
6. The device according to claim 1, wherein the encoder comprises a mapper which is implemented to generate the main channel using a first mapping rule and to generate the side channel using a second mapping rule, wherein a signal/noise ratio for decoding the main channel is higher than a signal/noise ratio for decoding the side channel.
7. The device according to claim 6, wherein the second mapping rule defines more different mapping states than the first mapping rule.
8. The device according to claim 6, wherein a mapping state of the second mapping rule represents more bits than a mapping state of the first mapping rule.
9. The device according to claim 1, comprising: a modulator for modulating information of the main channel and the side channel to a plurality of signal carriers, wherein the signal carriers are frequency carriers, time carriers and/or code carriers in an FDMA, TDMA and/or CDMA system.
10. The device according to claim 1, wherein the encoder comprises a source encoder comprising a speech encoder for generating the side channel and an audio encoder for generating the main channel, wherein the audio encoder necessitates more bits for a given time length of the information signal than the speech encoder.
11. The device according to claim 1, wherein the coefficients are LPC coefficients or derived from LPC coefficients.
12. The device according to claim 1, wherein the encoder comprises a source encoder which is implemented to generate both the main channel and also the side channel, wherein the source encoder further comprises a selector to select a part of the data generated by the source encoder to form the side channel using this data.
13. The device according to claim 1, which is implemented to feed a DAB or DRM transmitter or to feed a package-oriented data network.
14. The device according to claim 1, wherein the encoder is implemented to generate a further side channel with a different time offset and/or different robustness against channel influences which may also be decoded separately from the main channel and from the side channel.
15. A method for generating a signal to be transmitted, comprising: generating an encoded signal from an information signal wherein the generating the encoded signal includes: generating a main channel and a side channel which are decodable separate from each other and represent the information signal, and wherein the main channel and the side channel are generated such that the side channel is more robust against transmission-channel influences than the main channel; and wherein the generating the encoded signal includes executing a time-domain/spectral-domain conversion and providing a description of a spectral envelope for generating the side-channel coefficients using fine quantization; wherein, on a decoder side with a predetermined signal/noise ratio, a speech intelligibility is achievable; and wherein the side channel uses half or less than half as many bits as the main channel.
16. A non-transitory storage medium having stored thereon a computer program comprising a program code for executing, when the computer program runs on a computer, a method of generating a signal to be transmitted, the method comprising: generating an encoded signal from an information signal, wherein the generating the encoded signal includes: generating a main channel and a side channel which are decodable separate from each other and represent the information signal, and wherein the main channel and the side channel are generated such that the side channel is more robust against transmission-channel influences than the main channel; and wherein the generating the encoded signal includes executing a time-domain/spectral-domain conversion and providing a description of a spectral envelope for generating the side-channel coefficients using fine quantization; wherein, on a decoder side with a predetermined signal/noise ratio, a speech intelligibility of more than 50% of the total transmitted words is achievable; and wherein the side channel uses half or less than half as many bits as the main channel.
17. A device for generating a signal to be transmitted, comprising: an encoder for generating an encoded signal from an information signal, wherein the encoder is implemented to generate a main channel and a side channel which are decodable separately from each other and represent the information signal, and to generate the main channel and the side channel so that the side channel is more robust against transmission-channel influences than the main channel; wherein the encoder comprises a source encoder which is implemented to generate both the main channel and also the side channel, wherein the source encoder further comprises a selector to select a part of the data generated by the source encoder to form the side channel using this data; and wherein the encoder comprises a subband-based audio encoder and wherein the selector is implemented to select existing scale factors for each band, or wherein the encoder comprises a subband-based audio encoder which is implemented to encode quantized spectral values using different code books, wherein the code books represent value ranges of quantized spectral values of different sizes, and wherein code book indices are generated which each represent used code books for the quantized spectral values, and wherein the code book indices are generated representing the code books used for the quantized spectral values, and wherein the selector is implemented to select the code book indices.
18. A method for generating a signal to be transmitted, comprising: generating an encoded signal from an information signal, wherein the encoded signal is implemented, by generating a main channel and a side channel which are decodable separately from each other and represent the information signal, and wherein the main channel and the side channel are generated such that the side channel is more robust against transmission-channel influences than the main channel; wherein the generating the encoded signal comprises generating both the main channel and also the side channel, wherein the generating the encoded signal further comprises selecting a part of the data generated by the generating the encoded signal to form the side channel using this data; and wherein the generating the encoded signal comprises a subband-based audio encoding and wherein the selecting is implemented to select existing scale factors for each band, or wherein the generating the encoded signal comprises subband-based audio encoding which is implemented to encode quantized spectral values using different code books, wherein the code books represent value ranges of quantized spectral values of different sizes, and wherein code book indices are generated which each represent used code books for the quantized spectral values, and wherein the code book indices are generated representing the code books used for the quantized spectral values, and wherein the selecting comprises selecting the code book indices.
19. A non-transitory storage medium having stored thereon a computer program comprising a program code for executing, when the computer program runs on a computer, a method of generating a signal to be transmitted, the method comprising: generating an encoded signal from an information signal, wherein the encoded signal is implemented, by generating a main channel and a side channel which are decodable separately from each other and represent the information signal, and wherein the main channel and the side channel are generated such that the side channel is more robust against transmission-channel influences than the main channel; wherein the generating the encoded signal comprises generating both the main channel and also the side channel, wherein the generating the encoded signal further comprises selecting a part of the data generated by the generating the encoded signal to form the side channel using this data; and wherein the generating the encoded signal comprises a subband-based audio encoding and wherein the selecting is implemented to select existing scale factors for each band, or wherein the generating the encoded signal comprises subband-based audio encoding which is implemented to encode quantized spectral values using different code books, wherein the code books represent value ranges of quantized spectral values of different sizes, and wherein code book indices are generated which each represent used code books for the quantized spectral values, and wherein the code book indices are generated representing the code books used for the quantized spectral values, and wherein the selecting comprises selecting the code book indices.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) In the following, embodiments of the present invention are explained in more detail with reference to the accompanying drawings, in which:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
DETAILED DESCRIPTION OF THE INVENTION
(9) According to the invention, a so-called graceful degradation is achieved, i.e. a radio broadcasting reproduction of audio and/or video data below the threshold needed for the main channel. Thus, the side channel, e.g., contains the data of a low bit rate coder having the same content as the main channel. It is the purpose to generate a substitute signal when failures occur which may be implemented, e.g. based on the technology of generating comfort noise in speech encoders, i.e. using data transmitted in the side channel. This data is used in the receiver-side signal synthesis, in order not to leave signal synthesis only with noise generation but to control the signal synthesis at least in such a detailed way that speech intelligibility is, for example, produced in such a way that e.g. news can still be understood.
(10) Further, in one embodiment, a time shift between the main channel and the side channel is executed, so that the main channel is delayed with regard to the side channel. According to the invention, further a more robust modulation is executed for the side channel than for the main channel with the main audio data.
(11) A modulus in DRM technology for the main channel, which is referred to there as a media service channel (MSC), has a bit rate of 14.5 kb/s, for example. In the side channel, in this embodiment additional data are transmitted with 1 kb/s in the SDC (service description channel) in a more robust way and, e.g., encoded with an advance of 30 seconds. The main channel is, therefore, delayed by 30 seconds as compared to the additional channel. When the connection is interrupted for up to 30 seconds, the side channel is reproduced. If the SNR is reduced below the threshold of the MSC, but above the threshold of the SDC, the side channel is also reproduced. The invention may thus be used in a transmission system for digital media transmission in which the main channel exists which comprises main media data and in which further the side channel exists in which the same media data is represented with a higher compression and a lower bit rate, wherein the side channel has a more robust modulation than the main channel, which necessitates a lower minimal SNR for the transmission and the encoding of the side channel as compared to the main channel. Depending on the implementation, the main channel may be delayed in time with regard to the side channel.
(12) In the following, detailed reference is made to
(13) In particular, the encoder is implemented to generate the main channel 18a and the side channel 18b which may be decoded separate from each other and thus both represent the information signal.
(14) To obtain a decoded version of the information, according to the invention, either the side channel or the main channel is sufficient. In other words, the data of the side channel alone without the data of the main channel is sufficient to provide at least a lower-quality representation of the information on the encoder side.
(15) According to the invention, the encoder 14 is further implemented to generate the main channel 18a and the side channel 18b such that the side channel is more robust against transmission channel influences than the main channel. This robustness may be generated in different ways, for example by the use of a more robust modulation for the side-channel data or by introducing a higher redundancy into the side-channel data than into the main-channel data.
(16)
(17) Alternatively, the mapper 24 may also generate one single output data stream and supply the same to the modulator 26, which, e.g., executes an FDMA, TDMA or CDMA modulation method, i.e. one of the known frequency multiplex, time multiplex or code multiplex methods or a combination of those methods, as it is known in the art. Depending on the implementation, the modulator may provide one single signal on the output side, which would then include both the main channel and also the side channel already in one single data stream, or the modulator may provide the side channel and the main channel as separate data streams which are then combined with each other in the transmitter 12 before one single antenna signal is emitted which includes both channels.
(18) In principle, it is sufficient that in one of blocks 20, 22, 24, 26 different channels with different robustnesses are generated. However, also different robustnesses may be cumulated. Thus, for example the side channel generated by the redundancy encoder 22, which is per se already more error-resistant due to the greater redundancy, may additionally be subjected to a mapping rule in the mapper which is more robust than another mapping rule with which the main channel may be provided, wherein additionally in the modulator for the side channel a more error-resistant modulation method may be used than for the main channel.
(19) According to the invention it is advantageous, however, for the source encoder 20 to generate two different output data streams, wherein the side channel has a low bit rate and the main channel has a high bit rate which may then be processed by a combination of blocks 22, 24, 26, or which, e.g., only receive a different robustness by the redundancy encoder 22, wherein the redundancy encoder then already combines both data streams according to a certain regulation on the output side so that only one data stream enters the mapper 24 and one data stream exits the same. Alternatively, however, the redundancy encoder may simply process both input-side data streams with the same code rate, wherein the mapper or the modulator would then generate the more different robustness of the two data channels.
(20) Different robustnesses may be achieved in the redundancy encoder according to the invention, e.g. by using a Reed-Solomon code or an FEC code, e.g. with a feedback shift registers, comprising a certain generator polynomial and operating with or without puncturing. The code rate describes the number of output bits for a certain number of input bits and is smaller than 1 due to the redundancy adding. For the side channel a code rate smaller than 0.5 may be used, while for the main channel a code rate higher than or equal to 0.5 may be used.
(21) With regard to the mapper 24, for the main channel and the side channel different mapping rules may be used. A mapping rule has a certain number of symbols in the complex plane, wherein for a QPSK mapping only four symbols exist in the complex plane, while for a 16-QAM mapping for example 16 symbols exist in the complex plane. This means that a decoder with QPSK only has to differentiate between four different symbols, while a decoder with 16 QAM already has to differentiate between 16 different symbols. The minimal SNR for a QPSK mapping is thus substantially lower than the minimal SNR for a 16-QAM mapping. However, for every modulation symbol in a 16-QAM mapping four data bits are transmitted, while with OPSK for each symbol only two data bits have to be transmitted. Mapping has a great influence on robustness against transmission channel influences.
(22) Alternative modulation methods may also be used, like, for example, DPSK or 8 QAM. Also hierarchical modulation methods in which, e.g., a 16 QAM is overlaid onto a QPSK, may be implemented for the different channels. Thus, also for the main channel 64 QAM may be used, and for the side channel 16 QAM may be used.
(23) Also in the modulator 26 different robustnesses may be generated, when, e.g., for a CDMA modulation code sequences of a different length are used for the side channel and/or for the main channel, or when in an FDMA modulation for the different channels different frequency bandwidths are used, or in a CDMA modulation time slots of a different length are used.
(24) In the following, reference is made to implementations of the source encoder 20 of
(25) Other data rates are also possible, wherein in particular ratios between the MSC and the SCD and/or between the main channel and the side channel of <2, in particular <5 and again in particular <10 are advantageous.
(26) In the embodiment illustrated in
(27) Alternatively, the source encoder 20 of
(28) The spectrum output by the filter bank 41 and/or the spectral illustration output by the filter bank 41 is quantized in the quantizer 43. The quantizer 43 is controlled by a psychoacoustic model 44 which is implemented to calculate the psychoacoustic masking threshold for each band and to make the quantization so coarse that the quantization noise is below the masking threshold. The quantized spectral values output by the quantizer 43 are supplied to a Huffman encoder 45. It is noted that the quantizer 43 not only calculates quantized spectral values but also scale factors which represent the spectral coarse structure of the spectral illustration. In contrast, the spectral fine structure is contained in the quantized spectral values.
(29) For Huffman encoding, the Huffman encoder 45 uses a plurality of predefined code books, wherein according to the MPEG-AAC standard twelve different code tables are used which are all different in the value range of the elements and/or spectral values or groups of spectral values encoded by the code table. Every code table is identified by its code table number, which is, just like the scale factors, supplied to a bit stream formatter 46 and necessitated on the decoder side to execute a decoding using the correct code table.
(30) The output data stream generated by the bit stream formatter 46 then represents the main channel, while the side channel is generated using a side channel selector 47. The side channel selector is implemented to select a certain portion of the data coming into the main channel to occupy the side channel using these data. The less data is selected, the lower the data rate will be in the side channel, which is desirable for reasons of a responsible handling of the transmission bandwidth. However, a certain minimum measure of data is needed not only to generate a pink noise on the receiver side but to be better, for example to cause speech intelligibility. For this purpose, scale factors and/or code table numbers are supplied depending on the suitability and necessity of a delta encoding 40 and a subsequent Huffman encoding 42. For the code table numbers, a delta encoding will not be as suitable. However, by a delta encoding of scale factors a further redundancy reduction may be achieved. The quantized spectral values are not transmitted in the side channel. I.e., the spectral fine structure is not transmitted in the side channel. Here, only the spectral coarse structure exists.
(31) Depending on the implementation, the side-channel data may thus come from a low-rate speech encoder or from a low-rate audio encoder. Thus, even a part of or also all coefficients of the side channel may come from coefficients of the encoder of the main channel. In particular, when the main encoder is a subband-based audio encoder, it is advantageous to select the coefficients of the encoder of the main channel, representing the scale factors, into the side channel. Depending on the implementation, the indices for Huffman code books may also be used as selection data.
(32) It is in particular to be noted, as is also indicated in
(33)
(34) The receive signal which is received by the receiver stage 50 includes a main channel and a side channel which may be decoded separately from each other. In particular, a minimal signal/noise ratio which is needed for decoding the side channel is smaller than a minimal signal/noise ratio which is needed for decoding the main channel. I.e., the side channel is more robust against transmission characteristics of the transmission channel than the main channel.
(35) Downstream from the receiver, depending on the implementation, a channel separation stage 51 is connected to separate the side channel from the main channel already on the HF side. Depending on the implementation, this functionality may, however, also be integrated in a decoder 52 which is directly coupled to the receiver and which generates the main channel separate from the side channel. The inventive broadcasting subscriber device, as is illustrated in
(36) The quality observation means 53 is implemented to provide a switchover signal when a receive quality is determined which is less than a receive quality source, wherein the signal is, in the most general case, supplied to the decoder 52 which may then switch over from a reproduction of the main channel to a reproduction of the side channel.
(37) The changeover switch which is, e.g., contained in the decoder 52 or which may also be implemented separately may thus be controlled by the quality observer to provide the side channel as the decoded signal when the receive quality is less than a threshold quality, and to provide the main channel as a decoded signal when the receive quality is greater than or equal to the threshold quality.
(38) In the following, with reference to
(39) In
(40) If the source decoder 63 receives main-channel data, it comprises all data needed for decoding including the spectral fine structure, and a high-quality output is achieved. If, however, only side-channel data is transmitted, like, for example, the spectral envelope of the original information signal, the source decoder will, for example, execute a signal synthesis, wherein the spectral fine structure is synthesized and weighted using the transmitted data of the spectral coarse structure, such that a synthesized spectrum is generated which is then supplied to the synthesis filter bank to generate a decoded audio signal which at least comprises speech intelligibility. The same procedure may be used when scale factors and/or code table indices are transmitted. In this case, all transmitted data are used according to their original purpose, while non-transmitted data are synthesized, for example by the synthetic generation of spectral values which are, regarded together in a scale factor band, weighted such that a certain energy distribution is achieved in the band, wherein the absolute energy is basically determined anyway by the scale factor which was directly transmitted in the side-channel data stream.
(41) Alternatively, the side-channel data stream may also be a band-limited representation (e.g. up to 4 kHz) of the original data, so that the side channel and the main channel are only different regarding their bandwidth. In this case, the decoder for the side-channel data would not synthesize any further spectral values, but would only generate the narrow-banded signal the way it is.
(42) If the side-channel data stream is the output signal of a speech encoder, however, like, for example that of a CELP encoder, as it is used in GSM, the inventive broadcast/radio receiver will also include a GSM speech decoder to generate the side-channel data when the level of the receive quality threshold is fallen below.
(43) The quality observer 53 in
(44) In the illustration illustrated in
(45) With regard to the quality observation 53 it is to be noted that, when the channel estimator 64 is not accessed or when additional data on the receive quality are desired, the main channel or the combined main-channel/side-channel signal may be accessed in any location to obtain an impression on the current receive quality and in particular on the receive quality of the main channel.
(46) In the following, with reference to
(47) Only for reasons of clarity, in
(48) When the signal is a video signal, the side channel may be a downsampled version of the main channel. The decimation of data is executed by the spatial decimation of, e.g., every second pixel per image vertically and horizontally and/or by the time decimation of, e.g., every second image of a sequence or by any other decimation measures.
(49) Depending on the circumstances, the inventive method may be implemented in hardware or in software. The implementation may be on a digital storage medium, in particular a floppy disc or a DC having electronically readable control signals which may cooperate with a programmable computer system so that the method is executed. In general, the invention thus also consists in a computer program product having a program code stored on a machine-readable carrier for executing the inventive method when the computer program product runs on a computer. In other words, the invention may thus be realized as a computer program having a program code for executing the method, when the computer program runs on a computer.
(50) While this invention has been described in terms of several embodiments, there are alterations, permutations, and equivalents which fall within the scope of this invention. It should also be noted that there are many alternative ways of implementing the methods and compositions of the present invention. It is therefore intended that the following appended claims be interpreted as including all such alterations, permutations and equivalents as fall within the true spirit and scope of the present invention.