Patent classifications
G10L21/0316
SYSTEMS AND METHODS FOR SPEECH RECOGNITION
A speech recognition method is provided. The method may include: obtaining speech data and a speech recognition result of the speech data, the speech data including speech of a plurality of speakers, and the speech recognition result including a plurality of words; determining speaking time of each of the plurality of speakers by processing the speech data; determining, based on the speaking times of the plurality of speakers and the speech recognition result, a corresponding relationship between the plurality of words and the plurality of speakers; determining, based on the corresponding relationship, at least one conversion word from the plurality of words, each of the at least one conversion word corresponding to at least two of the plurality of speakers; and re-determining the corresponding relationship between the plurality of words and the plurality of speakers based on the at least one conversion word.
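The word-to-speaker steps in this abstract can be sketched roughly as follows. This is only an illustration under stated assumptions: the `Word` and `Turn` types, the use of timestamp overlap to build the corresponding relationship, and the largest-overlap rule used to re-determine it are all hypothetical choices. The abstract does not say how the re-determination is carried out.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    start: float  # seconds
    end: float


@dataclass
class Turn:
    speaker: str
    start: float  # seconds
    end: float


def overlap(word: Word, turn: Turn) -> float:
    """Length in seconds of the time shared by a word and a speaking turn."""
    return max(0.0, min(word.end, turn.end) - max(word.start, turn.start))


def map_words(words, turns):
    """Return {word index: {speaker: overlap seconds}}, keeping positive overlaps only."""
    mapping = {}
    for i, word in enumerate(words):
        per_speaker = {}
        for turn in turns:
            shared = overlap(word, turn)
            if shared > 0:
                per_speaker[turn.speaker] = per_speaker.get(turn.speaker, 0.0) + shared
        mapping[i] = per_speaker
    return mapping


def conversion_words(mapping):
    """Indices of words that correspond to at least two speakers."""
    return [i for i, per_speaker in mapping.items() if len(per_speaker) >= 2]


def resolve(mapping):
    """Assumed rule: give each word to the speaker with the largest overlap."""
    return {
        i: (max(per_speaker, key=per_speaker.get) if per_speaker else None)
        for i, per_speaker in mapping.items()
    }


words = [Word("hello", 0.0, 0.4), Word("there", 0.4, 1.1), Word("hi", 1.1, 1.5)]
turns = [Turn("A", 0.0, 1.0), Turn("B", 1.0, 2.0)]
mapping = map_words(words, turns)
print(conversion_words(mapping))  # [1]
print(resolve(mapping))           # {0: 'A', 1: 'A', 2: 'B'}
```

Here "there" straddles the boundary between the two speaking turns, so it is the only conversion word; the assumed rule assigns it to speaker A, who holds most of its duration.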
MULTI-SOURCE AUDIO PROCESSING SYSTEMS AND METHODS
A conferencing system includes a plurality of microphones and an audio processing system that performs blind source separation operations on audio signals to identify different audio sources. The system processes the separated audio sources to identify or classify the sources and generates an output stream including the source separated content.
Loudspeaker with transmitter
A speaker device includes an electroacoustic transducer configured to convert an audio signal into a set of sound waves and a transmitter configured to transmit an electromagnetic signal that carries the audio signal for receipt at distances limited to an audibility range of the set of sound waves. The audibility range of the set of sound waves corresponds to a distance at which the set of sound waves is estimated to be below a predetermined sound level.
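One way to estimate such an audibility range is sketched below. It assumes free-field spherical spreading from a point source, where the sound pressure level falls by 20·log10 of the distance ratio; the abstract does not state which estimation model is used, and the function name and parameters are hypothetical.

```python
def audibility_range_m(level_db_at_ref: float,
                       threshold_db: float,
                       ref_distance_m: float = 1.0) -> float:
    """Distance at which the level falls to threshold_db.

    Assumes L(d) = level_db_at_ref - 20*log10(d / ref_distance_m),
    i.e. free-field spreading with no absorption or reflections.
    """
    return ref_distance_m * 10 ** ((level_db_at_ref - threshold_db) / 20.0)


# 80 dB SPL measured at 1 m, threshold of 40 dB SPL
print(audibility_range_m(80.0, 40.0))  # 100.0
```

A transmitter could then be limited to roughly this distance. Real rooms add reflections and absorption, so the free-field figure is only a rough estimate.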
SIGNAL PROCESSING APPARATUS AND METHOD, AND PROGRAM
The present technology relates to a signal processing apparatus and method, and a program that make it possible to obtain high-sound-quality signals even with a small processing amount. A signal processing apparatus includes a selecting section that is supplied with a plurality of audio signals and selects an audio signal to be subjected to a sound quality enhancement process, and a sound-quality-enhancement processing section that performs the sound quality enhancement process on the audio signal selected by the selecting section. The present technology may be applied to a portable terminal.
PRE-CONDITIONING AUDIO FOR MACHINE PERCEPTION
An apparatus and method of pre-conditioning audio for machine perception. Machine perception differs from human perception, and different processing parameters are used for machine perception applications (e.g., speech to text processing) as compared to those used for human perception applications (e.g., voice communications). These different parameters may result in pre-conditioned audio that is worsened for human perception yet improved for machine perception.
NETWORKED AUTOMIXER SYSTEMS AND METHODS
Systems and methods are disclosed for networked audio automixing using array microphones and an aggregator unit that participate in making a common gating decision to determine which channels to gate on and off. Through the use of such a network of array microphones having the capability to generate submix audio signals and reduced bandwidth metrics, as well as AEC processing capability, array microphone lobe selection can be enhanced while maximizing signal-to-noise ratio, increasing intelligibility, and increasing user satisfaction.
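A common gating decision of this kind might look like the sketch below. The details are assumptions made for illustration: each array microphone is taken to report one level metric in dB per channel, and the aggregator gates on the loudest channels above a threshold. The names `common_gating`, `max_open`, and `threshold_db` are hypothetical, and the abstract does not specify the metrics or the decision rule.

```python
def common_gating(metrics, max_open=1, threshold_db=-50.0):
    """Pick the channels to gate on across all microphones.

    metrics: {(mic_id, channel): level_db}, the reduced-bandwidth
    metrics reported to the aggregator (assumed format).
    Returns the set of (mic_id, channel) keys that are gated on;
    every other channel is gated off.
    """
    candidates = [(level, key) for key, level in metrics.items()
                  if level >= threshold_db]
    candidates.sort(reverse=True)  # loudest first
    return {key for _, key in candidates[:max_open]}


metrics = {("mic1", 0): -30.0, ("mic1", 1): -55.0, ("mic2", 0): -25.0}
print(common_gating(metrics))  # {('mic2', 0)}
```

Because the decision is made once over the metrics from every microphone, two arrays that pick up the same talker do not both open a channel.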
Centrally controlling communication at a venue
One example may include a method that includes receiving, at a presentation server, an audio data signal from a mobile device located in a presentation space, identifying a mobile device identification characteristic of the mobile device based on the received audio data signal, determining a mobile device location via a location determination procedure, and playing the audio signal via a loudspeaker.