Patent classifications
G10L21/034
VOICE COMMAND RECOGNITION SYSTEM
Disclosed herein are system, apparatus, article of manufacture, method and/or computer program product embodiments, and/or combinations and sub-combinations thereof, for a voice command recognition system (VCR). An example embodiment operates by receiving a voice command directed to controlling a device, the voice command including a wake command and an action command. An amplitude of the wake command is determined. A gain adjustment for the voice command is calculated based on a comparison of the amplitude of the wake command to a target amplitude. An amplitude of the action command is adjusted based on the calculated gain adjustment for the voice command based on the comparison of the amplitude of the wake command to the target amplitude. A device command for controlling the device is identified based on the action command comprising the adjusted amplitude. The device command is provided to the device.
VOICE COMMAND RECOGNITION SYSTEM
Disclosed herein are system, apparatus, article of manufacture, method and/or computer program product embodiments, and/or combinations and sub-combinations thereof, for a voice command recognition system (VCR). An example embodiment operates by receiving a voice command directed to controlling a device, the voice command including a wake command and an action command. An amplitude of the wake command is determined. A gain adjustment for the voice command is calculated based on a comparison of the amplitude of the wake command to a target amplitude. An amplitude of the action command is adjusted based on the calculated gain adjustment for the voice command based on the comparison of the amplitude of the wake command to the target amplitude. A device command for controlling the device is identified based on the action command comprising the adjusted amplitude. The device command is provided to the device.
SPEECH PROCESSING APPARATUS AND METHOD FOR ACOUSTIC ECHO REDUCTION
A speech processing apparatus applied in a communication device having a mechanical defect is disclosed. The apparatus comprises an acoustic echo cancellation (AEC) unit, a multiplier and a processor. The AEC unit cancels an echo in a first audio signal from a microphone using a known AEC algorithm to generate a second audio signal. The multiplier multiplies corresponding M frames of a downlink audio signal by a gain to provide a gained downlink signal for a speaker. The processor performs operations comprising: muting an uplink audio signal when a first power level for M frames of a first input signal is less than a first threshold value; and, reducing the gain when the first power level and a second power level for M frames of a second input signal are respectively greater than the first threshold value and a second threshold value.
SPEECH PROCESSING APPARATUS AND METHOD FOR ACOUSTIC ECHO REDUCTION
A speech processing apparatus applied in a communication device having a mechanical defect is disclosed. The apparatus comprises an acoustic echo cancellation (AEC) unit, a multiplier and a processor. The AEC unit cancels an echo in a first audio signal from a microphone using a known AEC algorithm to generate a second audio signal. The multiplier multiplies corresponding M frames of a downlink audio signal by a gain to provide a gained downlink signal for a speaker. The processor performs operations comprising: muting an uplink audio signal when a first power level for M frames of a first input signal is less than a first threshold value; and, reducing the gain when the first power level and a second power level for M frames of a second input signal are respectively greater than the first threshold value and a second threshold value.
INFORMATION PROCESSING APPARATUS, NON-TRANSITORY COMPUTER READABLE MEDIUM, AND INFORMATION PROCESSING METHOD
An information processing apparatus includes: a processor configured to instantaneously acquire quality information indicative of quality of utterer's voice on a listener's side; and instantaneously present improvement information for improving the quality to the utterer in a case where the quality indicated by the acquired quality information does not satisfy a predetermined condition.
INFORMATION PROCESSING APPARATUS, NON-TRANSITORY COMPUTER READABLE MEDIUM, AND INFORMATION PROCESSING METHOD
An information processing apparatus includes: a processor configured to instantaneously acquire quality information indicative of quality of utterer's voice on a listener's side; and instantaneously present improvement information for improving the quality to the utterer in a case where the quality indicated by the acquired quality information does not satisfy a predetermined condition.
INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND NON-TRANSITORY COMPUTER READABLE MEDIUM
An information processing device includes a processor configured to output, in a case where a service is being used in which at least speech is exchanged among multiple users such that a conversation takes places among all of the multiple users, a speech of a separate conversation distinctly from a speech of the conversation taking place among all of the multiple users to a device of a user who is engaged in the separate conversation with a specific user from among the multiple users, and output the speech of the conversation taking place among all of the multiple users without outputting the speech of the separate conversation to a device of a user who is not engaged in the separate conversation.
INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND NON-TRANSITORY COMPUTER READABLE MEDIUM
An information processing device includes a processor configured to output, in a case where a service is being used in which at least speech is exchanged among multiple users such that a conversation takes places among all of the multiple users, a speech of a separate conversation distinctly from a speech of the conversation taking place among all of the multiple users to a device of a user who is engaged in the separate conversation with a specific user from among the multiple users, and output the speech of the conversation taking place among all of the multiple users without outputting the speech of the separate conversation to a device of a user who is not engaged in the separate conversation.
DATA PROCESSING APPARATUS, METHOD FOR PROCESSING DATA, AND STORAGE MEDIUM
A data processing apparatus includes one or more processors, and one or more memories including instructions stored thereon that, when executed by the one or more processors, cause the data processing apparatus to function as a copy unit configured to generate second sound data by copying first sound data, and a processing unit configured to apply a first gain to at least one of the first sound data and the second sound data.
AI-BASED DJ SYSTEM AND METHOD FOR DECOMPOSING, MISING AND PLAYING OF AUDIO DATA
The present invention relates to a method for processing and playing audio data comprising the steps of receiving mixed input data and playing recombined output data. Furthermore, the invention relates to a device 10 for processing and playing audio data, preferably DJ equipment, comprising an audio input unit for receiving a mixed input signal, a recombination unit 32 and a playing unit 34 for playing recombined output data. In addition, the present invention relates to a method and a device for representing audio data, i.e. on a display.