Patent classifications
G10L21/0216
Multi-user personalization at a voice interface device
A method at an electronic device with one or more microphones and a speaker includes receiving a first voice input; comparing the first voice input to one or more voice models; based on the comparing, determining whether the first voice input corresponds to any of a plurality of occupants, and according to the determination, authenticating an occupant and presenting a response, or restricting functionality of the electronic device.
Multi-user personalization at a voice interface device
A method at an electronic device with one or more microphones and a speaker includes receiving a first voice input; comparing the first voice input to one or more voice models; based on the comparing, determining whether the first voice input corresponds to any of a plurality of occupants, and according to the determination, authenticating an occupant and presenting a response, or restricting functionality of the electronic device.
Method and an audio processing unit for detecting a tone
A method for detecting a prominent tone of an input audio includes establishing a first analysis audio signal based on the input audio signal, establishing a second analysis audio signal based on the input audio signal, wherein an analysis audio signal of the first analysis audio signal and the second analysis audio signal is established by applying an analysis audio filter to the input audio signal, comparing the first analysis audio signal and the second analysis audio signal to obtain an energy level contrast, and determining a representation of the prominent tone by converting the energy level contrast by a contrast-to-frequency mapping function.
Method and an audio processing unit for detecting a tone
A method for detecting a prominent tone of an input audio includes establishing a first analysis audio signal based on the input audio signal, establishing a second analysis audio signal based on the input audio signal, wherein an analysis audio signal of the first analysis audio signal and the second analysis audio signal is established by applying an analysis audio filter to the input audio signal, comparing the first analysis audio signal and the second analysis audio signal to obtain an energy level contrast, and determining a representation of the prominent tone by converting the energy level contrast by a contrast-to-frequency mapping function.
Voice Wakeup Method and System, and Device
A voice wakeup method includes receiving a plurality of voice wakeup messages sent by a plurality of electronic devices, where each voice wakeup message includes a distance and a wakeup energy value; determining, based on distances and wakeup energy values in the plurality of voice wakeup messages from the plurality of electronic devices, whether energy attenuation of the wakeup word emitted by the sound source complies with an attenuation law of sound energy radiated by a point source; and when determining that the energy attenuation of the wakeup word emitted by the sound source does not comply with the attenuation law of the sound energy radiated by the point source, sending a wakeup forbidding instruction to the plurality of electronic devices.
Voice Wakeup Method and System, and Device
A voice wakeup method includes receiving a plurality of voice wakeup messages sent by a plurality of electronic devices, where each voice wakeup message includes a distance and a wakeup energy value; determining, based on distances and wakeup energy values in the plurality of voice wakeup messages from the plurality of electronic devices, whether energy attenuation of the wakeup word emitted by the sound source complies with an attenuation law of sound energy radiated by a point source; and when determining that the energy attenuation of the wakeup word emitted by the sound source does not comply with the attenuation law of the sound energy radiated by the point source, sending a wakeup forbidding instruction to the plurality of electronic devices.
AUDIO SIGNAL PROCESSING DEVICE, AUDIO SIGNAL PROCESSING METHOD, AND STORAGE MEDIUM
An audio signal processing device comprises: a determination unit that determines a first voice segment for a target speaker linked to a host device on the basis of an externally acquired first audio signal; a sharing unit that transmits the first audio signal and the first voice segment to another device linked to a non-target speaker and receives a second audio signal and a second voice segment associated with the non-target speaker from the other device; an estimation unit that estimates the voice of the non-target speaker mixed in the first audio signal on the basis of the second audio signal and the second voice segment that are received and an estimation parameter associated with the target speaker that is acquired; and a removal unit that removes the voice of the non-target speaker from the first audio signal.
AUDIO SIGNAL PROCESSING DEVICE, AUDIO SIGNAL PROCESSING METHOD, AND STORAGE MEDIUM
An audio signal processing device comprises: a determination unit that determines a first voice segment for a target speaker linked to a host device on the basis of an externally acquired first audio signal; a sharing unit that transmits the first audio signal and the first voice segment to another device linked to a non-target speaker and receives a second audio signal and a second voice segment associated with the non-target speaker from the other device; an estimation unit that estimates the voice of the non-target speaker mixed in the first audio signal on the basis of the second audio signal and the second voice segment that are received and an estimation parameter associated with the target speaker that is acquired; and a removal unit that removes the voice of the non-target speaker from the first audio signal.
ACTION IDENTIFICATION METHOD, ACTION IDENTIFICATION DEVICE, AND NON-TRANSITORY COMPUTER-READABLE RECORDING MEDIUM RECORDING ACTION IDENTIFICATION PROGRAM
An action identification device acquires sound data from a microphone, calculates a feature amount of the sound data, determines whether or not a user is present in a space in which the microphone is installed, calculates a noise feature amount indicating a feature amount of noise based on the calculated feature amount and stores the calculated noise feature amount in a noise feature amount storage unit in a case where the user is not present in the space, subtracts the noise feature amount stored in the noise feature amount storage unit from the calculated feature amount to extract an action sound feature amount indicating a feature amount of an action sound generated by an action of the user in a case where the user is present in the space, and identifies an action of the user by using the action sound feature amount.
SOUND SIGNAL PROCESSING APPARATUS AND METHOD OF PROCESSING SOUND SIGNAL
A sound signal processing apparatus may include: a directional microphone configured to detect a user voice signal including a user's voice by arranging the directional microphone to face an utterance point of the user's voice; a non-directional microphone configured to detect a mixed sound signal comprising the user voice and an external sound; and a processor configured to generate an external sound signal by attenuating the user's voice from the mixed sound signal, by differentially calculating the user voice signal from the mixed sound signal.