Patent classifications
G10L19/00
PYRAMID VECTOR QUANTIZER SHAPE SEARCH
An encoder and a method therein for Pyramid Vector Quantizer, PVQ, shape search, the PVQ taking a target vector x as input and deriving a vector y by iteratively adding unit pulses in an inner dimension search loop. The method comprises, before entering a next inner dimension search loop for unit pulse addition, determining, based on the maximum pulse amplitude, maxamp.sub.y, of a current vector y, whether more than a current bit word length is needed to represent enloop.sub.y, in a lossless manner in the upcoming inner dimension loop. The variable enloop.sub.y is related to an accumulated energy of the vector y. The performing of this method enables the encoder to keep the complexity of the search at a reasonable level.
PYRAMID VECTOR QUANTIZER SHAPE SEARCH
An encoder and a method therein for Pyramid Vector Quantizer, PVQ, shape search, the PVQ taking a target vector x as input and deriving a vector y by iteratively adding unit pulses in an inner dimension search loop. The method comprises, before entering a next inner dimension search loop for unit pulse addition, determining, based on the maximum pulse amplitude, maxamp.sub.y, of a current vector y, whether more than a current bit word length is needed to represent enloop.sub.y, in a lossless manner in the upcoming inner dimension loop. The variable enloop.sub.y is related to an accumulated energy of the vector y. The performing of this method enables the encoder to keep the complexity of the search at a reasonable level.
Embedded audio sensor system and methods
An embedded sensor can include an audio detector, a digital signal processor, a library, and a rules engine. The digital signal processor can be configured to receive signals from the audio detector and to identify the environment in which the embedded sensor is located. The library can store statistical models associated with specific environments, and the digital signal processor can be configured identify specific events based on detected sounds within the particular environment by utilizing the statistical model associated with the particular environment. The DSP can associate a probability of accuracy for the identified audible event. A rules engine can be configured to receive the probability and transmit a report of the detected audible event.
WIRELESS COMMUNICATION DEVICE USING VOICE RECOGNITION AND VOICE SYNTHESIS
Disclosed is a wireless communication device including a voice recognition portion configured to convert a voice signal input through a microphone into a syllable information stream using voice recognition, an encoding portion configured to encode the syllable information stream to generate digital transmission data, a transmission portion configured to modulate from the digital transmission data to a transmission signal and transmit the transmission signal through an antenna, a reception portion configured to demodulate from a reception signal received through the antenna to a digital reception data and output the digital reception data, a decoding portion configured to decode the digital reception data to generate the syllable information stream and a voice synthesis portion configured to convert the syllable information stream into the voice signal using voice synthesis and output the voice signal through a speaker.
AUDIO SIGNAL PROCESSING METHOD, DEVICE AND STORAGE MEDIUM
An audio signal processing method, device and storage medium, are provided. The method includes performing sub-band filtering on a to-be-processed audio signal to obtain a plurality of sub-band signals, wherein the number of the sub-band signals is determined according to a lowest frequency of a band-pass filter and a cut-off frequency of an audio apparatus, and the sub-band signals comprise sub-band band-pass signals; and obtaining a target audio signal according to each of the sub-band band-pass signals and a processing algorithm of virtual bass enhancement signal.
AUDIO SIGNAL PROCESSING METHOD, DEVICE AND STORAGE MEDIUM
An audio signal processing method, device and storage medium, are provided. The method includes performing sub-band filtering on a to-be-processed audio signal to obtain a plurality of sub-band signals, wherein the number of the sub-band signals is determined according to a lowest frequency of a band-pass filter and a cut-off frequency of an audio apparatus, and the sub-band signals comprise sub-band band-pass signals; and obtaining a target audio signal according to each of the sub-band band-pass signals and a processing algorithm of virtual bass enhancement signal.
DETERMINATION OF SPATIAL AUDIO PARAMETER ENCODING AND ASSOCIATED DECODING
An apparatus comprising means configured to: generate spatial audio signal directional metadata parameters for a block of time-frequencies; generate encoded spatial audio signal directional metadata parameters (108) for a block of time-frequencies based on a first quantization resolution (203); compare a number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution against a determined number of bits; output or store the encoded spatial audio signal directional metadata parameters for a block of time-frequencies (108) based on a first quantization resolution when the number of bits used for the encoded spatial audio signal directional parameters for the block of time-frequencies (108) based on the first quantization resolution is less than a determined number of bits (217); generate encoded spatial audio signal directional metadata parameters (108) for the block of time-frequencies based on a second quantization resolution when the number of bits used for the encoded spatial audio signal directional parameters for the block of time-frequencies (108) based on the first quantization resolution is more than the determined number of bits and a difference between the determined number of bits and the number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution is less than a determined number of bits is within a determined threshold (217); generate encoded spatial audio signal directional metadata parameters (108) for the block of time-frequencies based on a third quantization resolution when the number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution is more than the determined number of bits and the difference between the determined number of bits and the number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution is greater than the determined threshold, wherein the third quantization resolution is determined such that a number of bits used for the encoded spatial audio signal directional parameters for the block of time-frequencies based on the third quantization resolution is always equal to or less than the determined number of bits (217).
METHOD AND SYSTEM FOR PLAYING AUDIOS
Provided a method for playing audios. The method includes: acquiring vibration control information corresponding to a target audio, wherein at least one vibration period and vibration attribute information corresponding to the at least one vibration period are recorded in the vibration control information, and each vibration period corresponds to a beat period of a target percussive instrument in the target audio; synchronously playing the target audio and the vibration control information; and when any vibration period of the at least one vibration period is played, controlling a terminal to vibrate based on vibration attribute information corresponding to the vibration period.
METHOD AND SYSTEM FOR PLAYING AUDIOS
Provided a method for playing audios. The method includes: acquiring vibration control information corresponding to a target audio, wherein at least one vibration period and vibration attribute information corresponding to the at least one vibration period are recorded in the vibration control information, and each vibration period corresponds to a beat period of a target percussive instrument in the target audio; synchronously playing the target audio and the vibration control information; and when any vibration period of the at least one vibration period is played, controlling a terminal to vibrate based on vibration attribute information corresponding to the vibration period.
Post filter for audio signals
In some embodiments, a pitch filter for filtering a preliminary audio signal generated from an audio bitstream is disclosed. The pitch filter has an operating mode selected from one of either: (i) an active mode where the preliminary audio signal is filtered using filtering information to obtain a filtered audio signal, and (ii) an inactive mode where the pitch filter is disabled. The preliminary audio signal is generated in an audio encoder or audio decoder having a coding mode selected from at least two distinct coding modes, and the pitch filter is capable of being selectively operated in either the active mode or the inactive mode while operating in the coding mode based on control information.