Patent classifications
G10L21/057
AUTOMATED AUDIO TUNING AND COMPENSATION PROCEDURE
An example may include detecting, via a controller, one or more microphones and one or more speakers in an area, measuring, via the one or more microphones, an initial frequency response of an audio signal generated by the one or more speakers inside the area and generating an initial room performance rating, comparing the initial frequency response to a target frequency response, creating audio compensation values to apply to the one or more speakers based on the comparison, and applying the audio compensation values to the one or more speakers.
AUTOMATED AUDIO TUNING AND COMPENSATION PROCEDURE
An example may include detecting, via a controller, one or more microphones and one or more speakers in an area, measuring, via the one or more microphones, an initial frequency response of an audio signal generated by the one or more speakers inside the area and generating an initial room performance rating, comparing the initial frequency response to a target frequency response, creating audio compensation values to apply to the one or more speakers based on the comparison, and applying the audio compensation values to the one or more speakers.
CONTENT-BASED AUDIO PLAYBACK SPEED CONTROLLER
A method includes receiving, by a computer, a video file containing audio content and or visual content. The method further involves, based on a determination of the rates at which information is communicated by different temporal segments of the video file when played back at a default playback speed, assigning a respective target playback speed to each the different temporal segments of the video file. The video file is edited and marked in a format to be played back on a video player so that each of the different temporal segments of the video file can be played back at its respective assigned target playback speed.
AUDIO OUTPUT MODULE FOR USE IN ARTIFICIAL VOICE SYSTEMS
The invention disclosed is an improved audio output module for use with an artificial voice generation device, having a housing separated into a sound system chamber, an interface chamber, and a power source chamber. The interface and power source chambers may be combined. The sound cham-ber is isolated from external air by the housing, the cover plate, and a separating wall, which separates it from other chambers of the module. Volumetric parameters based on speaker characteristics and design re-quirements can thus be implemented independent from the choice of interface type. The module is configurable to be mounted to an external structure or to a speech generating system. It may likewise be detachable from a quick release cradle and receive wireless audio signals from the speech generating system.
SPATIAL REGION BASED AUDIO SEPARATION
A method includes receiving target audio data captured by a first audio input device, the target audio data comprising a target audio signal and a first version of an interfering audio signal, and receiving reference audio data captured by a second audio input device different from the first audio input device, the reference audio data comprising a second version of the interfering audio signal. The method also includes processing, using a trained neural network, the target audio data and the reference audio data to generate enhanced audio data, the neural network attenuating the interfering audio signal in the enhanced audio data.
SPATIAL REGION BASED AUDIO SEPARATION
A method includes receiving target audio data captured by a first audio input device, the target audio data comprising a target audio signal and a first version of an interfering audio signal, and receiving reference audio data captured by a second audio input device different from the first audio input device, the reference audio data comprising a second version of the interfering audio signal. The method also includes processing, using a trained neural network, the target audio data and the reference audio data to generate enhanced audio data, the neural network attenuating the interfering audio signal in the enhanced audio data.
Audio output module for use in artificial voice systems
The invention disclosed is an improved audio output module for use with an artificial voice generation device, having a housing separated into a sound system chamber, an interface chamber, and a power source chamber. The interface and power source chambers may be combined. The sound chamber is isolated from external air by the housing, the cover plate, and a separating wall, which separates it from other chambers of the module. Volumetric parameters based on speaker characteristics and design requirements can thus be implemented independent from the choice of interface type. The module is configurable to be mounted to an external structure or to a speech generating system. It may likewise be detachable from a quick release cradle and receive wireless audio signals from the speech generating system.
Electronic device and control method thereof
The disclosure relates to an electronic device and a control method thereof. The electronic device includes a memory, and a processor configured to: obtain first feature data for estimating a waveform by inputting acoustic data of a first quality to a first encoder model; and obtain waveform data of a second quality that is a higher quality than the first quality by inputting the first feature data to a decoder model to.
Audio playback and captioning
A method, a structure, and a computer system for playback optimization. The exemplary embodiments may include transcribing an audio stream at normal and at high playback speed, as well as determining whether portions of the audio stream are unintelligible at the high playback speed. The exemplary embodiments may further include reducing a speed of the high playback speed for portions determined unintelligible, and transcribing the audio stream with the unintelligible portions played at the reduced high playback speed. The exemplary embodiments may further include determining whether the unintelligible portions are still unintelligible at the reduced high playback speed and, if not, identifying the reduced high playback speed as an optimal playback speed of the unintelligible portions. The exemplary embodiments may additionally include, in response to receiving a request to play the audio stream at the high playback speed, playing the unintelligible portions at the optimal playback speed.
Audio playback and captioning
A method, a structure, and a computer system for playback optimization. The exemplary embodiments may include transcribing an audio stream at normal and at high playback speed, as well as determining whether portions of the audio stream are unintelligible at the high playback speed. The exemplary embodiments may further include reducing a speed of the high playback speed for portions determined unintelligible, and transcribing the audio stream with the unintelligible portions played at the reduced high playback speed. The exemplary embodiments may further include determining whether the unintelligible portions are still unintelligible at the reduced high playback speed and, if not, identifying the reduced high playback speed as an optimal playback speed of the unintelligible portions. The exemplary embodiments may additionally include, in response to receiving a request to play the audio stream at the high playback speed, playing the unintelligible portions at the optimal playback speed.