Patent classifications
G10L19/173
Concept for switching of sampling rates at audio processing devices
Audio decoder device for decoding a bitstream, the audio decoder device including: a predictive decoder for producing a decoded audio frame from the bitstream, wherein the predictive decoder includes a parameter decoder for producing one or more audio parameters for the decoded audio frame from the bitstream and wherein the predictive decoder includes a synthesis filter device for producing the decoded audio frame by synthesizing the one or more audio parameters for the decoded audio frame; a memory device including one or more memories, wherein each of the memories is configured to store a memory state for the decoded audio frame, wherein the memory state for the decoded audio frame of the one or more memories is used by the synthesis filter device for synthesizing the one or more audio parameters for the decoded audio frame; and a memory state resampling device configured to determine the memory state for synthesizing the one or more audio parameters for the decoded audio frame, which has a sampling rate, for one or more of the memories by resampling a preceding memory state for synthesizing one or more audio parameters for a preceding decoded audio frame, which has a preceding sampling rate being different from the sampling rate of the decoded audio frame, for one or more of the memories and to store the memory state for synthesizing of the one or more audio parameters for the decoded audio frame for one or more of the memories into the respective memory.
Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder
A multi-channel signal encoding method includes determining a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, and reverberation gain parameters corresponding to different subbands of the first channel signal and the second channel signal, where the obtained reverberation gain parameters are belonging to at least two reverberation gain parameter groups. The method further includes selecting, from the at least two reverberation gain parameter groups, a target reverberation gain parameter group. The method further includes generating parameter indication information, where the parameter indication information indicates the target reverberation gain parameter group. The method further includes encoding reverberation gain parameters corresponding to the target reverberation gain parameter group, the parameter indication information, and the downmixed signal to obtain a bitstream.
Multichannel Audio Speech Classification
Examples of the present disclosure describe systems and methods for multichannel audio speech classification. In examples, an audio signal comprising multiple audio channels is received at a processing device. Each of the audio channels in the audio signal is transcoded to a predefined audio format. For each of the transcoded audio channels, an average power value is calculated for one or more data windows in the audio signal. A correlation value is calculated between the average power value for each audio channel and the combined average power value of the other audio channels in the audio signal. Each of the correlation values (or an aggregated correlation value for the audio channels) is then compared against a threshold value to determine whether the audio signal is to be classified as a speech-based communication. Based on the classification, an action associated with the audio signal may be performed.
LOW BITRATE AUDIO ENCODING/DECODING SCHEME HAVING CASCADED SWITCHES
An audio encoder has a first information sink oriented encoding branch such as a spectral domain encoding branch, a second information source or SNR oriented encoding branch such as an LPC-domain encoding branch, and a switch for switching between the first and second encoding branches, the second encoding branch having a converter into a specific domain different from the spectral domain such as an LPC analysis stage generating an excitation signal, and the second encoding branch having a specific domain coding branch such as LPC domain processing branch, and a specific spectral domain coding branch such as LPC spectral domain processing branch, and an additional switch for switching between the specific domain coding branch and the specific spectral domain coding branch. An audio decoder has a first domain decoder, a second domain decoder, and a third domain decoder as well as two cascaded switches for switching between the decoders.
LOW BITRATE AUDIO ENCODING/DECODING SCHEME HAVING CASCADED SWITCHES
An audio encoder has a first information sink oriented encoding branch such as a spectral domain encoding branch, a second information source or SNR oriented encoding branch such as an LPC-domain encoding branch, and a switch for switching between the first and second encoding branches, the second encoding branch having a converter into a specific domain different from the spectral domain such as an LPC analysis stage generating an excitation signal, and the second encoding branch having a specific domain coding branch such as LPC domain processing branch, and a specific spectral domain coding branch such as LPC spectral domain processing branch, and an additional switch for switching between the specific domain coding branch and the specific spectral domain coding branch. An audio decoder has a first domain decoder, a second domain decoder, and a third domain decoder as well as two cascaded switches for switching between the decoders.
Audio Encryption in a Media Playback System
A first playback device is configured to obtain audio content from a cloud-based media service, the audio content encrypted according to a first encryption scheme. The first playback device is configured to communicate with a cloud-based license service to obtain information for decrypting the audio content and use the obtained information to decrypt the audio content. The first playback device is also configured to re-encrypt the decrypted audio content in accordance with a second encryption scheme and transmit the re-encrypted audio content to at least a second playback device that is grouped with the first playback device for synchronous audio playback. The first playback device is also configured to play back the audio content in synchrony with playback of the audio content by the second playback device.
Concept for switching of sampling rates at audio processing devices
Audio decoder device for decoding a bitstream, the audio decoder device including: a predictive decoder for producing a decoded audio frame from the bitstream, wherein the predictive decoder includes a parameter decoder for producing one or more audio parameters for the decoded audio frame from the bitstream and wherein the predictive decoder includes a synthesis filter device for producing the decoded audio frame by synthesizing the one or more audio parameters for the decoded audio frame; a memory device including one or more memories, wherein each of the memories is configured to store a memory state for the decoded audio frame, wherein the memory state for the decoded audio frame of the one or more memories is used by the synthesis filter device for synthesizing the one or more audio parameters for the decoded audio frame; and a memory state resampling device configured to determine the memory state for synthesizing the one or more audio parameters for the decoded audio frame, which has a sampling rate, for one or more of the memories by resampling a preceding memory state for synthesizing one or more audio parameters for a preceding decoded audio frame, which has a preceding sampling rate being different from the sampling rate of the decoded audio frame, for one or more of the memories and to store the memory state for synthesizing of the one or more audio parameters for the decoded audio frame for one or more of the memories into the respective memory.
AUDIO DECODER, AUDIO ENCODER, METHOD FOR PROVIDING A DECODED AUDIO SIGNAL, METHOD FOR PROVIDING AN ENCODED AUDIO SIGNAL, AUDIO STREAM, AUDIO STREAM PROVIDER AND COMPUTER PROGRAM USING A STREAM IDENTIFIER
An audio decoder for providing a decoded audio signal representation on the basis of an encoded audio signal representation is configured to adjust decoding parameters in dependence on a configuration information, to decode one or more audio frames using a current configuration information, to compare a configuration information in a configuration structure associated with one or more frames to be decoded by the current configuration information, and to make a transition to perform decoding using the configuration information in the configuration structure associated with the one or more frames to be decoded as a new configuration information if the configuration information in the configuration structure associated with the one or more frames to be decoded, or a relevant portion thereof, is different from the current configuration information, and to consider a stream identifier information included in the configuration structure when comparing the configuration information.
Method and system for parallel audio transcoding
A parallel audio transcoding method includes splitting audio into segments of a certain length; performing parallel transcoding by allocating the split segments to a plurality of encoders; and concatenating the segments encoded through the parallel transcoding and merging the same into a single encoded file. Performing parallel transcoding includes inserting additional regions, which overlap and neighbor each of the split segments, and sending the same to the encoders, and merging includes cutting out the additional regions from the encoded stream to create a stream corresponding to the split segments.
Methods and apparatus to perform audio watermarking and watermark detection and extraction
Example methods and apparatus to audio watermarking and watermark detection and extraction are disclosed herein. Example methods disclosed herein include determining a first watermark symbol encoded in encoded audio samples and storing the first watermark symbol in tangible memory. Disclosed example methods also include determining a second watermark symbol encoded in the encoded audio samples and storing the second watermark symbol in the tangible memory. Disclosed example methods further include, in response to determining that the first watermark symbol matches the second watermark symbol, outputting the first watermark symbol.