Patent classifications
G10L15/30
METHOD FOR PROCESSING AN AUDIO STREAM AND CORRESPONDING SYSTEM
A method and a system for processing an audio stream are described, wherein at least one database of classified voices and at least one database of classified background sounds are provided and a comparison between these classified voices and background sounds with the voices and the sounds extrapolated from a suitably re-processed audio stream is carried out in order to identify possible matches.
METHOD FOR PROCESSING AN AUDIO STREAM AND CORRESPONDING SYSTEM
A method and a system for processing an audio stream are described, wherein at least one database of classified voices and at least one database of classified background sounds are provided and a comparison between these classified voices and background sounds with the voices and the sounds extrapolated from a suitably re-processed audio stream is carried out in order to identify possible matches.
Voice controlled assistant with coaxial speaker and microphone arrangement
A voice controlled assistant has a housing to hold one or more microphones, one or more speakers, and various computing components. The housing has an elongated cylindrical body extending along a center axis between a base end and a top end. The microphone(s) are mounted in the top end and the speaker(s) are mounted proximal to the base end. The microphone(s) and speaker(s) are coaxially aligned along the center axis. The speaker(s) are oriented to output sound directionally toward the base end and opposite to the microphone(s) in the top end. The sound may then be redirected in a radial outward direction from the center axis at the base end so that the sound is output symmetric to, and equidistance from, the microphone(s).
Voice controlled assistant with coaxial speaker and microphone arrangement
A voice controlled assistant has a housing to hold one or more microphones, one or more speakers, and various computing components. The housing has an elongated cylindrical body extending along a center axis between a base end and a top end. The microphone(s) are mounted in the top end and the speaker(s) are mounted proximal to the base end. The microphone(s) and speaker(s) are coaxially aligned along the center axis. The speaker(s) are oriented to output sound directionally toward the base end and opposite to the microphone(s) in the top end. The sound may then be redirected in a radial outward direction from the center axis at the base end so that the sound is output symmetric to, and equidistance from, the microphone(s).
Methods and systems for detecting and processing speech signals
Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.
Methods and systems for detecting and processing speech signals
Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.
Voice control method and apparatus, and computer storage medium
A voice control method can be applied to a first terminal, and include: receiving a user's voice operation instruction after the first terminal is activated, the voice operation instruction being used for controlling the first terminal to perform a target operation; sending an instruction execution request to a server after the voice operation instruction is received, the instruction execution request being used for requesting the server to determine whether the first terminal is to respond to the voice operation instruction according to device information of the terminal in a device network, wherein the first terminal is located in the device network; and performing the target operation in a case where a response message is received from the server, the response message indicating that the first terminal is to respond to the voice operation instruction.
Electronic device and operation method thereof
Provided are an electronic device and an operation method thereof. The electronic device includes: a first sound receiver configured to receive a sound input while power is supplied to the first sound receiver in a standby state; a trigger word/phrase recognizer configured to recognize whether the sound input received by the first sound receiver corresponds to a trigger word or phrase; a second sound receiver configured to receive a sound input by receiving supply of power based on the trigger word or phrase being recognized by the trigger word/phrase recognizer; and a data transceiver configured to output a first sound input signal supplied from the first sound receiver and a second sound input signal supplied from the second sound receiver.
Electronic device and operation method thereof
Provided are an electronic device and an operation method thereof. The electronic device includes: a first sound receiver configured to receive a sound input while power is supplied to the first sound receiver in a standby state; a trigger word/phrase recognizer configured to recognize whether the sound input received by the first sound receiver corresponds to a trigger word or phrase; a second sound receiver configured to receive a sound input by receiving supply of power based on the trigger word or phrase being recognized by the trigger word/phrase recognizer; and a data transceiver configured to output a first sound input signal supplied from the first sound receiver and a second sound input signal supplied from the second sound receiver.
Interactive media system using audio inputs
An interactive media system enables creation, editing, and presentation of voice-driven interactive media content. The interactive media content may include prompts for user input via voice, manual input, or gestures. In the case of an audio input, the interactive media player application obtains a text string representing the spoken phrases and matches the text string against a set of expected values corresponding to different predefined responses and each associated with a different possible action. Based on the matching of the phrase to an expected value, the interactive media player application dynamically selects and performs the action associated with the matching response. The action may comprise, for example, transitioning to playback of a different media object (e.g., a second video segment) and/or causing some other functionality programmatically accessible by the interactive media player application to occur.