G10L21/034

Audio signal
11694709 · 2023-07-04

A computer device (100) for processing audio signals is described. The computer device (100) includes at least a processor and a memory. The computer device (100) is configured to receive a bitstream comprising a combined audio signal, the combined audio signal comprising a first audio signal including speech and a second audio signal. The computer device (100) is configured to compress the combined audio signal to provide a compressed audio signal. The computer device (100) is configured to control a dynamic range of the compressed audio signal to provide an output audio signal. In this way, a quality of the speech included in the output audio signal is improved.
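The two processing stages named in the abstract above (compress the combined signal, then control its dynamic range) can be illustrated with a minimal sketch. The abstract gives no algorithmic detail, so everything here is an assumption: "compress" is read as a static downward compressor with a hypothetical threshold and ratio, and "control a dynamic range" is read as a simple peak ceiling. The function names, parameter values, and sample data are illustrative only and are not taken from the patent.

```python
import math

def compress(samples, threshold_db=-20.0, ratio=4.0):
    """Static downward compressor (assumed interpretation of 'compress').

    Levels above threshold_db grow by only 1/ratio dB per input dB.
    """
    out = []
    for x in samples:
        mag = abs(x)
        if mag == 0.0:
            out.append(0.0)
            continue
        level_db = 20.0 * math.log10(mag)
        if level_db > threshold_db:
            level_db = threshold_db + (level_db - threshold_db) / ratio
        # Restore the original sign after working on the magnitude
        out.append(math.copysign(10.0 ** (level_db / 20.0), x))
    return out

def limit_peak(samples, ceiling):
    """Hypothetical dynamic-range control: scale so the peak stays at or below ceiling."""
    peak = max((abs(x) for x in samples), default=0.0)
    if peak <= ceiling:
        return list(samples)
    gain = ceiling / peak
    return [x * gain for x in samples]

# Made-up samples standing in for a decoded combined signal (speech plus a second signal)
combined = [0.01, -0.05, 0.8, -1.0, 0.1]
compressed = compress(combined)
output = limit_peak(compressed, ceiling=0.15)
```

In this sketch the quiet samples pass through the compressor unchanged while the loud ones are pulled toward the threshold, which narrows the gap between soft and loud passages before the final ceiling is applied.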

AUTONOMOUS MOBILE BODY, INFORMATION PROCESSING METHOD, PROGRAM, AND INFORMATION PROCESSING APPARATUS
20220413795 · 2022-12-29

The present technology relates to an autonomous mobile body, an information processing method, a program, and an information processing apparatus capable of improving user experience by an output sound of the autonomous mobile body.

The autonomous mobile body includes: a recognition unit that recognizes a motion of its own device; and a sound control unit that controls an output sound output from the own device. The sound control unit controls output of a plurality of operation sounds, which are the output sounds corresponding to a plurality of motions of the own device, and changes the operation sound in a case where the plurality of motions has been recognized. The present technology can be applied to, for example, a robot.

SELECTIVE FINE-TUNING OF SPEECH

Speech conveyed over a network, such as during an electronic conference, may be more difficult to understand if the recipient has difficulty understanding the speech of users having a particular speech attribute. However, other recipients may have no difficulty understanding the speech. As provided herein, speech provided by a user may have phonemes comprising accents or other speech patterns that, if removed, make the speech more readily understood by a particular user. Such alterations are provided only to the users that require them, such as by a server or a specific user's communication device, without affecting the speech concurrently presented to other users.
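The per-recipient behavior described above can be sketched as a small routing step. This is only an illustration under stated assumptions: the abstract does not say how speech is altered, so `alter` is a hypothetical placeholder for whatever transformation is used, and `route_speech`, the recipient names, and the frame data are invented for the example.

```python
from typing import Callable, Dict, Iterable, List, Set

Frame = List[float]  # assumed representation of one chunk of speech samples

def route_speech(
    frame: Frame,
    recipients: Iterable[str],
    needs_alteration: Set[str],
    alter: Callable[[Frame], Frame],
) -> Dict[str, Frame]:
    """Return one frame per recipient, altered only for those who need it."""
    routed = {}
    for recipient in recipients:
        if recipient in needs_alteration:
            routed[recipient] = alter(frame)
        else:
            # Other recipients receive a copy of the unmodified speech
            routed[recipient] = list(frame)
    return routed

def placeholder_alter(frame: Frame) -> Frame:
    """Stand-in for the speech alteration; it only marks the frame as changed."""
    return [round(x * 0.5, 6) for x in frame]

frame = [0.2, -0.4, 0.6]
routed = route_speech(frame, ["ana", "ben", "chen"], {"ben"}, placeholder_alter)
```

The point of the sketch is the branching: the source frame is never modified in place, so recipients who did not request the alteration hear the original speech.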

AUDIOMETRIC RECEIVER SYSTEM TO DETECT AND PROCESS AUDIO SIGNALS

In an approach for detecting and processing multiple audio signals simultaneously, an audiometric receiver system comprises a transmitter, wherein the transmitter comprises a digital signal processor, and wherein the digital signal processor comprises a quality check component, an amplifier or attenuator component, a mixer component, a modulator component, and an encrypter component; and a receiver, wherein the receiver comprises a decrypter component, a demodulator component, a splitter component, and a second amplifier or attenuator component.

Ratio mask post-filtering for audio enhancement

Post-filtering may be performed for ratio masks as part of audio enhancement. Audio data may be received. A machine learning model may be applied to generate gain values for different spectrum bands of the audio data. The gain values may then be modified using an envelope post-filter according to a monotonically increasing function applied to the gain values to produce modified gain values used to generate an enhanced version of the audio data.
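A minimal sketch of the post-filtering step described above follows. The abstract does not name the monotonically increasing function, so the curve used here, g * sin(pi/2 * g) on gains clamped to [0, 1], is an assumption chosen only because it is monotonic and maps 0 to 0 and 1 to 1. The machine learning model is replaced by a hard-coded list of gains, and the function names and band values are hypothetical.

```python
import math

def envelope_post_filter(gains):
    """Reshape per-band gains with an assumed monotonically increasing curve.

    Low gains (bands judged mostly noise) are pushed further down,
    while gains near 1 are left almost unchanged.
    """
    filtered = []
    for g in gains:
        g = min(1.0, max(0.0, g))  # keep each gain inside [0, 1]
        filtered.append(g * math.sin(0.5 * math.pi * g))
    return filtered

def apply_gains(band_magnitudes, gains):
    """Scale each spectrum band by its gain to form the enhanced spectrum."""
    return [m * g for m, g in zip(band_magnitudes, gains)]

# Stand-in for the gains a model would predict, one per spectrum band
model_gains = [0.0, 0.2, 0.5, 0.9, 1.0]
band_magnitudes = [1.0, 1.0, 2.0, 4.0, 3.0]

modified_gains = envelope_post_filter(model_gains)
enhanced = apply_gains(band_magnitudes, modified_gains)
```

Because the curve is monotonic, the ordering of bands by gain is preserved; only the amount of attenuation in low-gain bands changes.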
