G10L19/16

ELECTRONIC DEVICE FOR TRANSMITTING AND RECEIVING DATA PACKET AND OPERATING METHOD THEREOF

According to various embodiments, an electronic device includes a communication circuit for supporting a Bluetooth™ communication, and at least one processor functionally connected to the communication circuit. The at least one processor is configured to establish, via the communication circuit, a Bluetooth™ low energy (BLE) communication link with an external electronic device, to generate a first data packet from first audio data using a first coding scheme, and generate a second data packet from first audio data using a second coding scheme, and through the BLE communication link, to transmit the first data packet to the external electronic device in a first time interval of a predetermined time interval, and transmit the second data packet to the external electronic device in a second time interval of the predetermined time interval.

Signal encoding method and apparatus and signal decoding method and apparatus

A spectrum coding method includes quantizing spectral data of a current band based on a first quantization scheme, generating a lower bit of the current band using the spectral data and the quantized spectral data, quantizing a sequence of lower bits including the lower bit of the current band based on a second quantization scheme, and generating a bitstream based on a upper bit excluding N bits, where N is 1 or greater, from the quantized spectral data and the quantized sequence of lower bits.

Methods for parametric multi-channel encoding

The present document relates to audio coding systems. In particular, the present document relates to efficient methods and systems for parametric multi-channel audio coding. An audio encoding system configured to generate a bitstream indicative of a downmix signal and spatial metadata for generating a multi-channel upmix signal from the downmix signal is described. The system comprises a downmix processing unit configured to generate the downmix signal from a multi-channel input signal; wherein the downmix signal comprises m channels and wherein the multi-channel input signal comprises n channels; n, m being integers with m<n. Furthermore, the system comprises a parameter processing unit configured to determine the spatial metadata from the multi-channel input signal. In addition, the system comprises a configuration unit configured to determine one or more control settings for the parameter processing unit based on one or more external settings; wherein the one or more external settings comprise a target data-rate for the bitstream and wherein the one or more control settings comprise a maximum data-rate for the spatial metadata.

LOW-FREQUENCY EMPHASIS FOR LPC-BASED CODING IN FREQUENCY DOMAIN

The invention provides an audio encoder including a combination of a linear predictive coding filter having a plurality of linear predictive coding coefficients and a time-frequency converter, wherein the combination is configured to filter and to convert a frame of the audio signal into a frequency domain in order to output a spectrum based on the frame and on the linear predictive coding coefficients; a low frequency emphasizer configured to calculate a processed spectrum based on the spectrum, wherein spectral lines of the processed spectrum representing a lower frequency than a reference spectral line are emphasized; and a control device configured to control the calculation of the processed spectrum by the low frequency emphasizer depending on the linear predictive coding coefficients of the linear predictive coding filter.

LOW-FREQUENCY EMPHASIS FOR LPC-BASED CODING IN FREQUENCY DOMAIN

The invention provides an audio encoder including a combination of a linear predictive coding filter having a plurality of linear predictive coding coefficients and a time-frequency converter, wherein the combination is configured to filter and to convert a frame of the audio signal into a frequency domain in order to output a spectrum based on the frame and on the linear predictive coding coefficients; a low frequency emphasizer configured to calculate a processed spectrum based on the spectrum, wherein spectral lines of the processed spectrum representing a lower frequency than a reference spectral line are emphasized; and a control device configured to control the calculation of the processed spectrum by the low frequency emphasizer depending on the linear predictive coding coefficients of the linear predictive coding filter.

AUDIO AND VIDEO TRANSMISSION DEVICES AND AUDIO AND VIDEO TRANSMISSION SYSTEMS
20230078451 · 2023-03-16 ·

Audio and video transmission device and audio and video transmission system are provided. The audio and video transmission system includes an audio and video transmission device and at least one wireless microphone transmitting device. Each of the at least one wireless microphone transmitting device is configured to send an audio signal acquired by a wireless microphone to the audio and video transmission device. The audio and video transmission device is configured to be respectively connected to the wireless microphone transmitting device and an external video acquisition device and configured to: receive the audio signal from the wireless microphone transmitting device and transmit the audio signal to the video acquisition device, obtain a mixture signal generated by the video acquisition device from the audio signal and a video signal, and process and output the mixture signal.

Methods, apparatus and systems for unified speech and audio decoding and encoding decorrelation filter improvements

The present disclosure relates to an apparatus for decoding an encoded Unified Audio and Speech stream. The apparatus comprises a core decoder for decoding the encoded Unified Audio and Speech stream. The core decoder includes an upmixing unit adapted to perform mono to stereo upmixing. The upmixing unit includes a decorrelator unit D adapted to apply a decorrelation filter to an input signal. The decorrelator unit is adapted to determine filter coefficients for the decorrelation filter by referring to pre-computed values. The present disclosure further relates to a an apparatus for encoding a Unified Audio and Speech stream, as well as to corresponding methods and storage media.

AUTOMATIC DISCOVERY AND LOCALIZATION OF VOICE DEGRADATION FAULTS USING ULTRASOUND TECHNIQUES
20220337442 · 2022-10-20 ·

A method comprises, at a local participant device, establishing audio connections with remote participant devices over a network for an online voice conference. The method includes generating ultrasound signals for corresponding ones of the remote participant devices, and transmitting the ultrasound signals over corresponding ones of the audio connections. The method further includes collecting indications, transmitted by corresponding ones of the remote participant devices over the network, that indicate whether the corresponding ones of the remote participant devices detected the ultrasound signals. The method includes identifying which of the remote participant devices detected the ultrasound signals based on the indications, and localizing degraded voice quality to particular ones of the local participant device and the remote participant devices based, at least in part, on results of identifying.

Audio Transcoding Method and Apparatus, Audio Transcoder, Device, and Storage Medium

Provided is an audio transcoding method, including: (301) performing entropy decoding on a first audio stream with a first bitrate, to obtain an audio feature parameter and an excitation signal of the first audio stream, the excitation signal being a quantized audio signal; (302) obtaining a time-domain audio signal corresponding to the excitation signal based on the audio feature parameter and the excitation signal; (303) re-quantizing the excitation signal and the audio feature parameter based on the time-domain audio signal and a target transcoding bitrate, to obtain a target excitation signal and a target audio feature parameter; and (304) performing entropy coding on the target audio feature parameter and the target excitation signal, to obtain a second audio stream with a second bitrate, the second bitrate being lower than the first bitrate.

Low bitrate audio encoding/decoding scheme having cascaded switches

An audio encoder has a first information sink oriented encoding branch such as a spectral domain encoding branch, a second information source or SNR oriented encoding branch such as an LPC-domain encoding branch, and a switch for switching between the first and second encoding branches, the second encoding branch having a converter into a specific domain different from the spectral domain such as an LPC analysis stage generating an excitation signal, and the second encoding branch having a specific domain coding branch such as LPC domain processing branch, and a specific spectral domain coding branch such as LPC spectral domain processing branch, and an additional switch for switching between the specific domain coding branch and the specific spectral domain coding branch. An audio decoder has a first domain decoder, a second domain decoder, and a third domain decoder as well as two cascaded switches for switching between the decoders.