Patent classifications
G10L25/00
Concurrent reception of multiple user speech input for translation
An improved translation experience is provided using an auxiliary device, such as a pair of earbuds, and a wirelessly coupled mobile device. Microphones on both the auxiliary device and the mobile device simultaneously capture input from, respectively, a primary user (e.g., wearing the auxiliary device) and a secondary user (e.g., a foreign language speaker providing speech that the primary user desires to translate). Both microphones continually listen, rather than alternating between the mobile device and the auxiliary device. Each device may determine when to endpoint and send a block of speech for translation, for example based on pauses in the speech. Each device may accordingly send the received speech for translation and output, such that it is provided in a natural flow of communication.
Determining an effect on dissemination of information related to an event based on a dynamic confidence level associated with the event
Methods and apparatus related to determining an effect on dissemination of information related to an event based on a dynamic confidence level associated with the event. For example, an event and an event confidence level of the event may be determined based on a message of a user. An effect on dissemination of information related to the event may be determined based on the confidence level. A new confidence level may be determined based on additional data associated with the event and the effect on dissemination of information may be adjusted based on the new confidence level. In some implementations, the additional data may be based on a new message that is related to the message, such as a reply to the message.
Method and voice input apparatus for converting voice input to text input
A method includes: acquiring a voice signal; sending the voice signal to a host; receiving a key code from the host; converting the key code to an identification code that is identified by the host; and sending the identification code to the host.
Voice assistant devices
A voice assistant device comprises an input to receive data defining an audio command; and processing circuitry to perform an operation defined by the audio command responsive to an activation of the voice assistant device; wherein the activation comprises determining presence of an activation source within an activation region from one or more sensors.
Multilingual wakeword detection
A system and method performs multilingual wakeword detection by determining a language corresponding to the wakeword. A first wakeword-detection component, which may execute using a digital-signal processor, determines that audio data includes a representation of the wakeword and determines a language corresponding to the wakeword. A second, more accurate wakeword-detection component may then process the audio data using the language to confirm that it includes the representation of the wakeword. The audio data may then be sent to a remote system for further processing.
Expandable dialogue system
A method for training a dialogue learning model includes presenting, via a user interface of a computing device, an utterance and a list of actions based on the utterance. A selection of an action from the list of actions is received via the user interface. A designated span of the utterance is received via the user interface. The selected action and the designated span of the utterance is provided to a computing system for training the dialogue learning model.
Expandable dialogue system
A method for training a dialogue learning model includes presenting, via a user interface of a computing device, an utterance and a list of actions based on the utterance. A selection of an action from the list of actions is received via the user interface. A designated span of the utterance is received via the user interface. The selected action and the designated span of the utterance is provided to a computing system for training the dialogue learning model.
Method and Apparatus for Detecting Correctness of Pitch Period
A method and an apparatus for detecting correctness of a pitch period, where the method for detecting correctness of a pitch period includes determining, according to an initial pitch period of an input signal in a time domain, a pitch frequency bin of the input signal, where the initial pitch period is obtained by performing open-loop detection on the input signal, determining, based on an amplitude spectrum of the input signal in a frequency domain, a pitch period correctness decision parameter, associated with the pitch frequency bin, of the input signal, and determining correctness of the initial pitch period according to the pitch period correctness decision parameter.
Method and Apparatus for Detecting Correctness of Pitch Period
A method and an apparatus for detecting correctness of a pitch period, where the method for detecting correctness of a pitch period includes determining, according to an initial pitch period of an input signal in a time domain, a pitch frequency bin of the input signal, where the initial pitch period is obtained by performing open-loop detection on the input signal, determining, based on an amplitude spectrum of the input signal in a frequency domain, a pitch period correctness decision parameter, associated with the pitch frequency bin, of the input signal, and determining correctness of the initial pitch period according to the pitch period correctness decision parameter.
Speech enhancement method and apparatus for same
A speech enhancement method is provided. The speech enhancement method includes: estimating a direction of a speaker by using an input signal, generating direction information indicating the estimated direction, detecting speech of a speaker based on a result of the estimating the direction, and enhancing the speech of the speaker by using the direction information based on a result of the detecting the speech.