Patent classifications
G10L21/0316
System for deliverables versioning in audio mastering
Some implementations of the disclosure relate to using a model trained on mixing console data of sound mixes to automate the process of sound mix creation. In one implementation, a non-transitory computer-readable medium has executable instructions stored thereon that, when executed by a processor, causes the processor to perform operations comprising: obtaining a first version of a sound mix; extracting first audio features from the first version of the sound mix obtaining mixing metadata; automatically calculating with a trained model, using at least the mixing metadata and the first audio features, mixing console features; and deriving a second version of the sound mix using at least the mixing console features calculated by the trained model.
WEARABLE APPARATUS FOR ACTIVE SUBSTITUTION
A hearing aid and related systems and methods. In one implementation, a hearing aid system may comprise a wearable camera configured to capture images from an environment of a user, a microphone configured to capture sounds from the environment of the user, and a processor. The processor may be programmed to receive images captured by the camera; receive audio signals representative of sounds captured by the microphone; operate in a first mode to cause a first selective conditioning of a first audio signal; determine, based on analysis of at least one of the images or the audio signals, to switch to a second mode to cause a second selective conditioning of the first audio signal; and cause transmission of the first audio signal selectively conditioned in the second mode to a hearing interface device configured to provide sound to an ear of the user.
Systems and methods for distinguishing valid voice commands from false voice commands in an interactive media guidance application
Systems and methods for distinguishing valid voice commands from false voice commands in an interactive media guidance application. In some aspects, the interactive media guidance application receives, at a user device, a signature sound sequence. The interactive media guidance application determines, using control circuitry, based on the signature sound sequence, a threshold gain for the current location of the user device. The interactive media guidance application receives, at the user device, a voice command. The interactive media guidance application determines, using the control circuitry, based on the voice command, a gain for the voice command. The interactive media guidance application determines, using the control circuitry, whether the gain for the voice command is different from the threshold gain. Based on determining that the gain for the voice command is different from the threshold gain, the interactive media guidance application executes, using the control circuitry, the voice command.
Systems and methods for distinguishing valid voice commands from false voice commands in an interactive media guidance application
Systems and methods for distinguishing valid voice commands from false voice commands in an interactive media guidance application. In some aspects, the interactive media guidance application receives, at a user device, a signature sound sequence. The interactive media guidance application determines, using control circuitry, based on the signature sound sequence, a threshold gain for the current location of the user device. The interactive media guidance application receives, at the user device, a voice command. The interactive media guidance application determines, using the control circuitry, based on the voice command, a gain for the voice command. The interactive media guidance application determines, using the control circuitry, whether the gain for the voice command is different from the threshold gain. Based on determining that the gain for the voice command is different from the threshold gain, the interactive media guidance application executes, using the control circuitry, the voice command.
AUDIO ENCODER AND DECODER WITH DYNAMIC RANGE COMPRESSION METADATA
An audio processing unit (APU) is disclosed. The APU includes a buffer memory configured to store at least one frame of an encoded audio bitstream, where the encoded audio bitstream includes audio data and a metadata container. The metadata container includes a header and one or more metadata payloads after the header. The one or more metadata payloads include dynamic range compression (DRC) metadata, and the DRC metadata is or includes profile metadata indicative of whether the DRC metadata includes dynamic range compression (DRC) control values for use in performing dynamic range compression in accordance with at least one compression profile on audio content indicated by at least one block of the audio data.
Device for outputting sound and method therefor
A device for outputting sound and a method therefor are provided. The sound output method includes predicting external sound to be received from an external environment, variably adjusting sound to be output from the device, based on the predicted external sound, and outputting the adjusted sound.
PACKET LOSS RECOVERY METHOD FOR AUDIO DATA PACKET, ELECTRONIC DEVICE AND STORAGE MEDIUM
The disclosure provides a packet loss recovery method for an audio data packet an electronic device and a storage medium. The method includes: receiving an audio data packet sent by a vehicle-mounted terminal, and identifying a discarded first sampling point set in response to detecting packet loss; obtaining a second sampling point set and a third sampling point set each adjacent to the first sampling point set, in which the second sampling point set is prior to the first sampling point set, the third sampling point set is behind the first sampling point set; and generating target audio data of the first sampling points based on first audio data sampled at the second sampling points and second audio data sampled at the third sampling points, and inserting the target audio data at sampling positions of the first sampling points.
PACKET LOSS RECOVERY METHOD FOR AUDIO DATA PACKET, ELECTRONIC DEVICE AND STORAGE MEDIUM
The disclosure provides a packet loss recovery method for an audio data packet an electronic device and a storage medium. The method includes: receiving an audio data packet sent by a vehicle-mounted terminal, and identifying a discarded first sampling point set in response to detecting packet loss; obtaining a second sampling point set and a third sampling point set each adjacent to the first sampling point set, in which the second sampling point set is prior to the first sampling point set, the third sampling point set is behind the first sampling point set; and generating target audio data of the first sampling points based on first audio data sampled at the second sampling points and second audio data sampled at the third sampling points, and inserting the target audio data at sampling positions of the first sampling points.
Audio communication in a vehicle
An audio communication system for communication between vehicle occupants in a vehicle, including an image capturing device configured to monitor a first vehicle occupant, a processor configured to receive an image of the first vehicle occupant from the image capturing device and determine whether the first vehicle occupant is attracting attention from a second vehicle occupant, a first microphone associated to the first vehicle occupant configured to receive an audio input from the first vehicle occupant in response to the determination of the first vehicle occupant attracting the second vehicle occupant's attention, and a first speaker associated to the second vehicle occupant configured to activate an audio augmentation of the received audio input in the first speaker and output the augmented audio input.
Metadata for loudness and dynamic range control
An audio normalization gain value is applied to an audio signal to produce a normalized signal. The normalized signal is processed to compute dynamic range control (DRC) gain values in accordance with a selected one of several pre-defined DRC characteristics. The audio signal is encoded, and the DRC gain values are provided as metadata associated with the encoded audio signal. Several other embodiments are also described and claimed.