G10L2019/0001

REAL TIME DIGITAL VOICE COMMUNICATION METHOD
20210074304 · 2021-03-11 ·

A communication system includes at least one first device and at least one second device which are linked in a manner that enables data transfer with each other. The first device enables the speech signal that it receives as the input to be expressed in terms of the energy functions representing the energy patterns, information functions representing the information patterns and the noise functions of the frames of the real speech samples; and transfers the indexes of these functions in the database and the frame gain factor of each frame to the second device. The second device finds the functions via the indexes from the copy database which is a copy of the database and reconstructs the speech signal by these functions and the frame gain factor, enabling it to be provided as the voice output.

METHOD AND APPARATUS FOR CALCULATING DOWNMIXED SIGNAL AND RESIDUAL SIGNAL
20210082442 · 2021-03-18 ·

A method and an apparatus for calculating a downmixed signal and a residual signal are provided. According to the method, if a first target frame (a current frame or a previous frame of the current frame) is a switching frame, a to-be-encoded downmixed signal and a to-be-encoded residual signal of the subband corresponding to the preset frequency band in the current frame is calculated based on a switch fade-in/fade-out factor of a second target frame, an initial downmixed signal and an initial residual signal of the preset frequency band.

METHODS, APPARATUS AND ARTICLES OF MANUFACTURE TO IDENTIFY SOURCES OF NETWORK STREAMING SERVICES
20210027792 · 2021-01-28 ·

Methods, apparatus and articles of manufacture to identify sources of network streaming services are disclosed. An example apparatus includes a coding format identifier to identify, from a received first audio signal representing a decompressed second audio signal, an audio compression configuration used to compress a third audio signal to form the second audio signal, and a source identifier to identify a source of the second audio signal based on the identified audio compression configuration.

QUANTIZATION OF SPATIAL AUDIO PARAMETERS
20210020185 · 2021-01-21 ·

There is disclosed inter alia an apparatus for spatial audio signal encoding which determines at least one spatial audio parameter comprising a direction parameter with an elevation component and an azimuth component. The elevation component and azimuth component of the direction parameter are then converted to an index value.

Characteristic-based speech codebook selection
10878831 · 2020-12-29 · ·

An apparatus includes a speech processing engine configured to receive data corresponding to speech and to determine whether a first characteristic associated with the speech differs from a reference characteristic by at least a threshold amount. The apparatus further includes a selection circuit responsive to the speech processing engine. The selection circuit is configured to select a particular speech codebook from among a plurality of speech codebooks based on the first characteristic differing from the reference characteristic by at least the threshold amount. The particular speech codebook is associated with the first characteristic.

Multi-stream audio coding

A method includes receiving, at an audio encoder, multiple streams of audio data, where N is the number of the received multi streams. The method includes determining a similarity value for each stream of the multiple streams and comparing the similarity value for each stream of the multiple streams with a threshold. The method also includes identifying, based on the comparison, L (L<N) number of streams to be encoded among the N number of the multiple streams. The method includes encoding the identified L number of streams to generate an encoded bitstream.

METHODS AND DEVICES FOR VECTOR SEGMENTATION FOR CODING

A method for partitioning of input vectors for coding is presented. The method comprises obtaining of an input vector. The input vector is segmented, in a non-recursive manner, into an integer number, N.sup.SEG, of input vector segments. A representation of a respective relative energy difference between parts of the input vector on each side of each boundary between the input vector segments is determined, in a recursive manner. The input vector segments and the representations of the relative energy differences are provided for individual coding. Partitioning units and computer programs for partitioning of input vectors for coding, as well as positional encoders, are presented.

METHOD FOR COMMUNICATING A NON-SPEECH MESSAGE AS AUDIO
20200251088 · 2020-08-06 ·

A method is provided for communicating a non-speech message as audio from a first device to a second device such that information can be passed between the first and second device. The method includes: encoding the non-speech message as a dissimilar speech message having a plurality of phonemes; transmitting the speech message over one or more audio communications channels from the first device; receiving the speech message at the second device; recognizing the speech message; and decoding the dissimilar speech message to the non-speech message. By using existing audio functionality, and the increasingly more reliable voice recognition applications, an improved method is provided for sharing complex data messages using commonly available communication channels.

Methods, apparatus and articles of manufacture to identify sources of network streaming services

Methods, apparatus and articles of manufacture to identify sources of network streaming services are disclosed. An example apparatus includes a coding format identifier to identify, from a received first audio signal representing a decompressed second audio signal, an audio compression configuration used to compress a third audio signal to form the second audio signal, and a source identifier to identify a source of the second audio signal based on the identified audio compression configuration.

Methods and systems for enhancing audio signals corrupted by noise

Systems and methods for audio signal processing including an input interface to receive a noisy audio signal including a mixture of target audio signal and noise. An encoder to map each time-frequency bin of the noisy audio signal to one or more phase-related value from one or more phase quantization codebook of phase-related values indicative of the phase of the target signal. Calculate, for each time-frequency bin of the noisy audio signal, a magnitude ratio value indicative of a ratio of a magnitude of the target audio signal to a magnitude of the noisy audio signal. A filter to cancel the noise from the noisy audio signal based on the phase-related values and the magnitude ratio values to produce an enhanced audio signal. An output interface to output the enhanced audio signal.