Patent classifications
G10L19/24
Audio Transcoding Method and Apparatus, Audio Transcoder, Device, and Storage Medium
Provided is an audio transcoding method, including: (301) performing entropy decoding on a first audio stream with a first bitrate, to obtain an audio feature parameter and an excitation signal of the first audio stream, the excitation signal being a quantized audio signal; (302) obtaining a time-domain audio signal corresponding to the excitation signal based on the audio feature parameter and the excitation signal; (303) re-quantizing the excitation signal and the audio feature parameter based on the time-domain audio signal and a target transcoding bitrate, to obtain a target excitation signal and a target audio feature parameter; and (304) performing entropy coding on the target audio feature parameter and the target excitation signal, to obtain a second audio stream with a second bitrate, the second bitrate being lower than the first bitrate.
METHOD AND APPARATUS FOR DETERMINING WEIGHTING FACTOR DURING STEREO SIGNAL ENCODING
Various embodiments provide a method and an apparatus for determining a weighting factor during stereo signal encoding. In those embodiments, a parameter value is determined based on the encoding mode of a to-be-encoded signal in a stereo signal and on a correspondence between encoding modes and parameter values. Based on the determined parameter value and an energy spectrum of a linear prediction filter corresponding to an original line spectral frequency parameter of the to-be-encoded signal, a weighting factor for calculating a distance between the original line spectral frequency parameter and a target original line spectral frequency parameter is calculated.
Systems and Methods for Selective Storing of Data Included in a Corrupted Data Packet
An exemplary hearing device is configured to receive, from a source, a data packet, the data packet including a plurality of frames including a first frame and a second frame. The hearing device determines that the data packet has an invalid checksum. The hearing device accesses, in response to the determining that the data packet has the invalid checksum, a first frame checksum for the first frame and a second frame checksum for the second frame. The hearing device determines that the first frame checksum is invalid and that the second frame checksum is valid. The hearing device discards, based on the first frame checksum being invalid, the first frame and stores, based on the second frame checksum being valid, the second frame.
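The selective-storing logic in this abstract can be sketched as follows. This is a minimal illustration only: the abstract does not say which checksum is used or how the packet is laid out, so CRC-32 (via Python's `zlib.crc32`), the `Frame` and `Packet` types, and the `frames_to_store` helper are all assumptions made for the example.

```python
import zlib
from dataclasses import dataclass
from typing import List


@dataclass
class Frame:
    payload: bytes
    checksum: int  # assumed: CRC-32 of this frame's payload


@dataclass
class Packet:
    frames: List[Frame]
    checksum: int  # assumed: CRC-32 over all frame payloads concatenated


def crc(data: bytes) -> int:
    """CRC-32 of data (stand-in for whatever checksum the device uses)."""
    return zlib.crc32(data) & 0xFFFFFFFF


def frames_to_store(packet: Packet) -> List[bytes]:
    """Return the payloads that should be stored for this packet."""
    body = b"".join(f.payload for f in packet.frames)
    if crc(body) == packet.checksum:
        # Packet checksum is valid: every frame can be stored.
        return [f.payload for f in packet.frames]
    # Packet checksum is invalid: fall back to the per-frame checksums,
    # discarding frames that fail and storing frames that pass.
    return [f.payload for f in packet.frames if crc(f.payload) == f.checksum]
```

With a two-frame packet whose first frame is corrupted in transit, only the second frame's payload is returned, which matches the discard-and-store behavior the abstract describes.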
METHODS, APPARATUS AND SYSTEMS FOR 6DOF AUDIO RENDERING AND DATA REPRESENTATIONS AND BITSTREAM STRUCTURES FOR 6DOF AUDIO RENDERING
The present disclosure relates to methods, apparatus and systems for encoding an audio signal into a bitstream, in particular at an encoder, comprising: encoding or including audio signal data associated with 3DoF audio rendering into one or more first bitstream parts of the bitstream, and encoding or including metadata associated with 6DoF audio rendering into one or more second bitstream parts of the bitstream. The present disclosure further relates to methods, apparatus and systems for decoding an audio signal and audio rendering based on the bitstream.
Audio Transmission Method and Electronic Device
An audio transmission method includes: a second device sends noise energy data for the second device end and transmission efficiency data to a first device; the first device determines a first bit rate based on the noise energy data and a second bit rate based on the transmission efficiency data; the first device encodes an audio stream at the lower of the first bit rate and the second bit rate to obtain audio data; and the first device sends the encoded audio data to the second device.
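The bit rate selection in this abstract reduces to taking the minimum of two candidate rates. The sketch below assumes hypothetical mappings from the reported data to a bit rate; the thresholds, the units (noise in dB, efficiency as a fraction from 0 to 1), and the function names are illustrative and do not come from the abstract.

```python
def bitrate_from_noise(noise_db: float) -> int:
    """Hypothetical mapping: a noisier listening environment gets a lower rate."""
    if noise_db >= 70:
        return 96_000
    if noise_db >= 50:
        return 192_000
    return 320_000


def bitrate_from_efficiency(efficiency: float) -> int:
    """Hypothetical mapping: a less efficient link gets a lower rate."""
    if efficiency < 0.5:
        return 96_000
    if efficiency < 0.8:
        return 192_000
    return 320_000


def select_bitrate(noise_db: float, efficiency: float) -> int:
    """Encode at the lower of the two candidate bit rates."""
    first = bitrate_from_noise(noise_db)
    second = bitrate_from_efficiency(efficiency)
    return min(first, second)
```

Taking the minimum means the stream never exceeds what either condition supports: a quiet room on a weak link is limited by the link, and a strong link in a noisy room is limited by the noise-based rate.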
DATA TRANSMISSION METHOD AND APPARATUS, STORAGE MEDIUM, AND TERMINAL DEVICE
A data transmission method and apparatus, a non-transitory computer storage medium, and a terminal device are provided in implementations of the present disclosure. The method includes the following. A scan parameter of a Bluetooth scan and/or a data packet of data transmission is adjusted under a condition. Data is transmitted to a target Bluetooth device via Bluetooth according to the adjusted scan parameter and/or the adjusted data packet.
Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element
Embodiments relate to an audio processing unit that includes a buffer, bitstream payload deformatter, and a decoding subsystem. The buffer stores at least one block of an encoded audio bitstream. The block includes a fill element that begins with an identifier followed by fill data. The fill data includes at least one flag identifying whether enhanced spectral band replication (eSBR) processing is to be performed on audio content of the block. A corresponding method for decoding an encoded audio bitstream is also provided.
Methods of encoding and decoding speech signal using neural network model recognizing sound sources, and encoding and decoding apparatuses for performing the same
Methods of encoding and decoding a speech signal using a neural network model that recognizes sound sources, and encoding and decoding apparatuses for performing the methods are provided. A method of encoding a speech signal includes identifying an input signal for a plurality of sound sources; generating a latent signal by encoding the input signal; obtaining a plurality of sound source signals by separating the latent signal for each of the plurality of sound sources; determining a number of bits used for quantization of each of the plurality of sound source signals according to a type of each of the plurality of sound sources; quantizing each of the plurality of sound source signals based on the determined number of bits; and generating a bitstream by combining the plurality of quantized sound source signals.
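The last three steps of this abstract (choosing a bit depth per sound source type, quantizing, and combining) can be sketched as below. This is not the neural network model the abstract describes: the latent encoding and source separation are skipped, a plain uniform quantizer is swapped in, and the `BITS_BY_TYPE` table, the type names, and the assumption that samples lie in [-1, 1] are all hypothetical.

```python
from typing import List, Tuple

# Hypothetical bit budget per sound source type (not from the abstract).
BITS_BY_TYPE = {"speech": 8, "music": 6, "noise": 2}


def quantize(signal: List[float], bits: int) -> List[int]:
    """Uniformly quantize samples in [-1, 1] to integers in [0, 2**bits - 1]."""
    levels = 2 ** bits
    out = []
    for x in signal:
        x = max(-1.0, min(1.0, x))             # clip to the assumed range
        index = int((x + 1.0) / 2.0 * levels)  # map [-1, 1] onto [0, levels]
        out.append(min(levels - 1, index))     # keep x == 1.0 in range
    return out


def encode_sources(
    sources: List[Tuple[str, List[float]]]
) -> List[Tuple[str, int, List[int]]]:
    """Quantize each separated source with its type's bit depth and combine them."""
    stream = []
    for kind, signal in sources:
        bits = BITS_BY_TYPE[kind]
        stream.append((kind, bits, quantize(signal, bits)))
    return stream
```

In this sketch the "bitstream" is simply a list of (type, bits, codes) records, which is enough to show how a source type such as speech can be given more quantization bits than background noise.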