G10L25/84

Headset sound leakage mitigation
11711645 · 2023-07-25

An audio system for a headset includes a plurality of speakers and an audio controller. The plurality of speakers may be in a dipole configuration that cancels sound leakage into a local area of the headset. The controller filters audio content presented by the plurality of speakers to further mitigate leakage of audio content into the local area. The audio controller determines sound filters based on environmental conditions, such as ambient noise levels, as well as on the audio content being presented.

INFORMATION PROCESSOR, INFORMATION PROCESSING METHOD, AND PROGRAM
20230005481 · 2023-01-05

An information processor includes an operation control unit that controls the motion of an autonomous mobile body acting on the basis of recognition processing. When a target sound, that is, a target voice for voice recognition processing, is detected, the operation control unit moves the autonomous mobile body to a position around an approach target where the input level of non-target sound, which is not the target voice, becomes lower. The approach target is determined on the basis of the target sound.
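
The position selection described in this abstract can be illustrated with a minimal sketch. The abstract does not say how candidate positions are generated or how the non-target input level is estimated, so everything here is an assumption made for illustration: the names `candidate_positions`, `quietest_position`, and `noise_level_from` are hypothetical, candidates are sampled on a circle around the approach target, and the noise model is a single point source with inverse-square falloff.

```python
import math
from typing import Callable, List, Tuple

Point = Tuple[float, float]


def candidate_positions(target: Point, radius: float, count: int) -> List[Point]:
    """Evenly spaced points on a circle around the approach target (assumed layout)."""
    return [
        (
            target[0] + radius * math.cos(2 * math.pi * i / count),
            target[1] + radius * math.sin(2 * math.pi * i / count),
        )
        for i in range(count)
    ]


def quietest_position(
    candidates: List[Point], noise_level: Callable[[Point], float]
) -> Point:
    """Pick the candidate where the estimated non-target input level is lowest."""
    return min(candidates, key=noise_level)


def noise_level_from(source: Point) -> Callable[[Point], float]:
    """Toy noise model (assumption): level falls off with squared distance."""
    def level(p: Point) -> float:
        d2 = (p[0] - source[0]) ** 2 + (p[1] - source[1]) ** 2
        return 1.0 / max(d2, 1e-9)  # guard against division by zero
    return level


if __name__ == "__main__":
    spots = candidate_positions(target=(0.0, 0.0), radius=1.0, count=4)
    print(quietest_position(spots, noise_level_from((5.0, 0.0))))
```

With a noise source to one side of the target, the sketch selects the candidate on the opposite side, which is the behaviour the abstract describes in general terms.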

METHOD AND SYSTEM FOR SPEECH DETECTION AND SPEECH ENHANCEMENT
20230005469 · 2023-01-05

A method of speech detection and speech enhancement in a speech detection and speech enhancement unit of a Multipoint Conferencing Node (MCN), and a method of training the same. The method comprises receiving input audio segments, determining an acoustic environment based on input audio auxiliary information, extracting T-F domain features from the received input audio segments, determining whether each of the received input audio segments is speech by inputting the T-F domain features into a speech detection classifier trained for the determined acoustic environment, determining, when one of the received input audio segments is speech, whether the received audio segment is noisy speech by inputting the T-F domain features into a noise classifier using a statistical generative model representing the probability distributions of the T-F domain features of noisy speech trained for the determined acoustic environment, and applying a noise reduction mask to the received input audio segments according to the determination of whether the received audio segment is noisy speech.
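
The control flow of the method above can be sketched as follows. This is a hedged illustration rather than the claimed implementation: the function `enhance_segment` and its parameters are hypothetical, the classifiers are passed in as plain callables keyed by environment name instead of trained models, the mask is a per-feature gain, and passing non-speech segments through unchanged is an assumption the abstract does not confirm.

```python
from typing import Callable, Dict, List, Sequence

Features = Sequence[float]
Classifier = Callable[[Features], bool]


def enhance_segment(
    features: Features,
    environment: str,
    speech_detectors: Dict[str, Classifier],
    noise_detectors: Dict[str, Classifier],
    mask: Sequence[float],
) -> List[float]:
    """Apply a noise reduction mask only to segments classified as noisy speech.

    `environment` stands in for the acoustic environment determined from
    auxiliary information; it selects which pair of classifiers is used.
    """
    if len(mask) != len(features):
        raise ValueError("mask and features must have the same length")

    is_speech = speech_detectors[environment](features)
    if not is_speech:
        return list(features)  # assumption: non-speech is left untouched

    is_noisy = noise_detectors[environment](features)
    if not is_noisy:
        return list(features)  # clean speech needs no mask

    return [f * m for f, m in zip(features, mask)]


if __name__ == "__main__":
    # Toy stand-ins for trained classifiers (assumptions for illustration).
    speech = {"meeting_room": lambda f: sum(f) / len(f) > 0.5}
    noisy = {"meeting_room": lambda f: min(f) > 0.2}
    print(enhance_segment([1.0, 2.0], "meeting_room", speech, noisy, [0.5, 0.5]))
```

Keying the classifiers by environment mirrors the abstract's point that both the speech detector and the noise classifier are trained for the determined acoustic environment.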

SPEECH RECOGNITION SYSTEM AND A METHOD FOR PROVIDING A SPEECH RECOGNITION SERVICE
20230238020 · 2023-07-27

Provided are a speech recognition system and a method for providing a speech recognition service that may map and register a tap signal, generated by tapping an object around a user of a vehicle, to a specific command, and replace an utterance for the specific command with the simple action of tapping the nearby object, to improve user convenience. The speech recognition system includes: a speech processing module configured to extract information from a voice signal of the user in a vehicle; a control module configured to generate a control signal for performing the control intended by the user; and a memory configured to map and store a tap signal and a command corresponding to the tap signal, wherein, when the tap signal is included in an audio signal input, the control module is configured to generate the control signal based on the command corresponding to the stored tap signal.
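
The mapping between tap signals and commands can be sketched as a small lookup structure. The sketch assumes that tap detection has already reduced the audio to a label such as "double_tap"; the class `TapCommandMemory`, the function `generate_control_signal`, the label strings, and the fallback to the spoken command are all hypothetical choices for illustration, not details taken from the abstract.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class TapCommandMemory:
    """Stores registered tap signatures and the command each one stands for."""
    _commands: Dict[str, str] = field(default_factory=dict)

    def register(self, tap_signature: str, command: str) -> None:
        self._commands[tap_signature] = command

    def lookup(self, tap_signature: str) -> Optional[str]:
        return self._commands.get(tap_signature)


def generate_control_signal(
    memory: TapCommandMemory,
    detected_tap: Optional[str],
    spoken_command: Optional[str],
) -> Optional[str]:
    """Prefer a registered tap command; otherwise fall back to the utterance."""
    if detected_tap is not None:
        command = memory.lookup(detected_tap)
        if command is not None:
            return command
    return spoken_command


if __name__ == "__main__":
    memory = TapCommandMemory()
    memory.register("double_tap", "open_window")
    print(generate_control_signal(memory, "double_tap", None))
```

An unregistered tap falls through to whatever the speech processing module recognized, so a stray knock does not trigger a command in this sketch.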

Encoding parameter adjustment method and apparatus, device, and storage medium

An encoding parameter adjustment method is performed at a computer device. The method includes: obtaining a first audio signal, and determining a psychoacoustic masking threshold within a service frequency band in the first audio signal; obtaining a second audio signal, and determining a background environmental noise estimation value of the frequency within the service frequency band in the second audio signal; determining a masking tag corresponding to the service frequency band according to the psychoacoustic masking threshold of the first audio signal and the background environmental noise estimation value of the second audio signal; determining a masking rate of the service frequency band according to the masking tag corresponding to the frequency within the service frequency band; determining a first reference bit rate according to the masking rate of the service frequency band; and configuring an encoding bit rate of an audio encoder based on the first reference bit rate.
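
The chain from masking tags to a reference bit rate can be sketched as below. The abstract does not give the comparison rule or the mapping to a bit rate, so this sketch assumes both: a frequency is tagged as masked when the noise estimate exceeds the masking threshold, the masking rate is the fraction of tagged frequencies in the band, and the reference bit rate is a linear interpolation between two hypothetical limits (`low_bps`, `high_bps`). All function names are illustrative.

```python
from typing import List, Sequence


def masking_tags(
    masking_threshold_db: Sequence[float], noise_estimate_db: Sequence[float]
) -> List[int]:
    """Tag each frequency 1 if the noise estimate exceeds the threshold (assumed rule)."""
    if len(masking_threshold_db) != len(noise_estimate_db):
        raise ValueError("inputs must cover the same frequencies")
    return [
        1 if noise > threshold else 0
        for threshold, noise in zip(masking_threshold_db, noise_estimate_db)
    ]


def masking_rate(tags: Sequence[int]) -> float:
    """Fraction of frequencies in the service band that are tagged as masked."""
    if not tags:
        raise ValueError("service band contains no frequencies")
    return sum(tags) / len(tags)


def reference_bit_rate(rate: float, low_bps: int = 16000, high_bps: int = 32000) -> int:
    """Assumed mapping: the more of the band is masked, the fewer bits are spent."""
    return round(high_bps - rate * (high_bps - low_bps))


if __name__ == "__main__":
    tags = masking_tags([30.0, 30.0, 30.0, 30.0], [40.0, 20.0, 35.0, 10.0])
    print(tags, masking_rate(tags), reference_bit_rate(masking_rate(tags)))
```

The resulting value would then be used to configure the audio encoder's bit rate, which is the final step the abstract names.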

USER ADJUSTMENT INTERFACE USING REMOTE COMPUTING RESOURCE
20230232173 · 2023-07-20

Disclosed herein, among other things, are systems and methods for a user adjustment interface using remote computing resources. Specifically, a system can include a mobile device in communication with a hearing assistance device or a remote server. The mobile device can interpret an acoustic environment and send information about the environment to a remote server. The remote server can determine and send information to the mobile device for use in a user interface. The mobile device can receive a user selection of hearing assistance parameter information to be sent to the hearing assistance device.