Patent classifications
G10L21/0316
Metadata for loudness and dynamic range control
An audio normalization gain value is applied to an audio signal to produce a normalized signal. The normalized signal is processed to compute dynamic range control (DRC) gain values in accordance with a selected one of several pre-defined DRC characteristics. The audio signal is encoded, and the DRC gain values are provided as metadata associated with the encoded audio signal. Several other embodiments are also described and claimed.
VOICE DRIVEN DYNAMIC MENUS
Disclosed are systems, methods, and computer-readable storage media to provide voice driven dynamic menus. One aspect disclosed is a method including receiving, by an electronic device, video data and audio data, displaying, by the electronic device, a video window, determining, by the electronic device, whether the audio data includes a voice signal, displaying, by the electronic device, a first menu in the video window in response to the audio data including a voice signal, displaying, by the electronic device, a second menu in the video window in response to a voice signal being absent from the audio data, receiving, by the electronic device, input from the displayed menu, and writing, by the electronic device, to an output device based on the received input.
CONTROL METHOD, CONTROL DEVICE, AND STORAGE MEDIUM
A method for controlling a frequency response having a bell-shaped amplitude characteristic of a filter that is configured to process an audio signal includes preparing a high-frequency parameter related to a high-frequency side characteristic on a high-frequency side of the frequency response, and a low-frequency parameter related to a low-frequency side characteristic on a low-frequency side of the frequency response, independently changing the high-frequency parameter or the low-frequency parameter in accordance with a first change instruction, changing both of the high-frequency parameter and the low-frequency parameter in conjunction with each other in accordance with a second change instruction, and controlling a slope on the high-frequency side and a slope on the low-frequency side of the bell-shaped amplitude characteristic of the frequency response, by using the high-frequency parameter and the low-frequency parameter.
SYSTEMS AND METHODS FOR DISTINGUISHING VALID VOICE COMMANDS FROM FALSE VOICE COMMANDS IN AN INTERACTIVE MEDIA GUIDANCE APPLICATION
Systems and methods for distinguishing valid voice commands from false voice commands in an interactive media guidance application. In some aspects, the interactive media guidance application receives, at a user device, a signature sound sequence. The interactive media guidance application determines, using control circuitry, based on the signature sound sequence, a threshold gain for the current location of the user device. The interactive media guidance application receives, at the user device, a voice command. The interactive media guidance application determines, using the control circuitry, based on the voice command, a gain for the voice command. The interactive media guidance application determines, using the control circuitry, whether the gain for the voice command is different from the threshold gain. Based on determining that the gain for the voice command is different from the threshold gain, the interactive media guidance application executes, using the control circuitry, the voice command.
SYSTEMS AND METHODS FOR DISTINGUISHING VALID VOICE COMMANDS FROM FALSE VOICE COMMANDS IN AN INTERACTIVE MEDIA GUIDANCE APPLICATION
Systems and methods for distinguishing valid voice commands from false voice commands in an interactive media guidance application. In some aspects, the interactive media guidance application receives, at a user device, a signature sound sequence. The interactive media guidance application determines, using control circuitry, based on the signature sound sequence, a threshold gain for the current location of the user device. The interactive media guidance application receives, at the user device, a voice command. The interactive media guidance application determines, using the control circuitry, based on the voice command, a gain for the voice command. The interactive media guidance application determines, using the control circuitry, whether the gain for the voice command is different from the threshold gain. Based on determining that the gain for the voice command is different from the threshold gain, the interactive media guidance application executes, using the control circuitry, the voice command.
AI-assisted detection and prevention of unwanted noise
A signal representing a sound can be received. A machine learning model can be run to identify that the sound triggers a reaction in a user hearing the sound. A preventive action can be automatically activated to mitigate the reaction. The user's reactions can be monitored. Responsive to determining that the user's reaction has been mitigated or suppressed, the preventive action can be deactivated. The machine learning model can be retrained using at least the signal as new training data.
Post-processing gains for signal enhancement
A method, an apparatus, and logic to post-process raw gains determined by input processing to generate post-processed gains, comprising using one or both of delta gain smoothing and decision-directed gain smoothing. The delta gain smoothing comprises applying a smoothing filter to the raw gain with a smoothing factor that depends on the gain delta: the absolute value of the difference between the raw gain for the current frame and the post-processed gain for a previous frame. The decision-directed gain smoothing comprises converting the raw gain to a signal-to-noise ratio, applying a smoothing filter with a smoothing factor to the signal-to-noise ratio to calculate a smoothed signal-to-noise ratio, and converting the smoothed signal-to-noise ratio to determine the second smoothed gain, with smoothing factor possibly dependent on the gain delta.
Post-processing gains for signal enhancement
A method, an apparatus, and logic to post-process raw gains determined by input processing to generate post-processed gains, comprising using one or both of delta gain smoothing and decision-directed gain smoothing. The delta gain smoothing comprises applying a smoothing filter to the raw gain with a smoothing factor that depends on the gain delta: the absolute value of the difference between the raw gain for the current frame and the post-processed gain for a previous frame. The decision-directed gain smoothing comprises converting the raw gain to a signal-to-noise ratio, applying a smoothing filter with a smoothing factor to the signal-to-noise ratio to calculate a smoothed signal-to-noise ratio, and converting the smoothed signal-to-noise ratio to determine the second smoothed gain, with smoothing factor possibly dependent on the gain delta.
Audio signal
A computer device (100) for processing audio signals is described. The computer device (100) includes at least a processor and a memory. The computer device (100) is configured to receive a bitstream comprising a combined audio signal, the combined audio signal comprising a first audio signal including speech and a second audio signal. The computer device (100) is configured to compress the combined audio signal to provide a compressed audio signal. The computer device (100) is configured to control a dynamic range of the compressed audio signal to provide an output audio signal. In this way, a quality of the speech included in the output audio signal is improved.
Method, apparatus and device for processing sound signal
The present disclosure provides a method, an apparatus and a device for processing a sound signal, wherein the method comprises: acquiring a transmitted signal spectrum of a target sound signal sent out by a loudspeaker and a received signal spectrum of the target sound signal received by a microphone; detecting whether there is a signal distortion frequency band with signal distortion in the target sound signal according to the transmitted signal spectrum and the received signal spectrum, and when detecting that the signal distortion frequency band exists, performing compression processing on the target sound signal according to the signal distortion frequency band during a current signal processing cycle, and transmitting a compressed target sound signal through the loudspeaker.