G10L21/00

Routing natural language commands to the appropriate applications
09734839 · 2017-08-15 · ·

A device is configured with multiple applications that each respond to various commands. The correct application to receive a natural language command is identified by consideration of how well the command matches functions of the application. A target application to receive the command may additionally be selected by consideration of which application is most likely to receive a command. The likelihood of an application to receive a command may be determined by considering context. The command may be a voice input that is analyzed by speech recognition technology to determine word strings representing possible commands. Thus, the selection of a target application to receive the command may be based on any or all of the word strings from the natural language input, a closeness of fit between the command and an application, and the likelihood an application is the target for the next incoming command.

Feedback based beamformed signal selection

Features are disclosed for improving the accuracy and stability of beamformed signal selection. The selection may consider processing feedback information to identify when the current beam selection may need to be re-evaluated. The feedback information may further be used to select a beamformed signal for processing. For example, beams which detect wake-words or yield high confidence speech recognition may be favored over beams which fail to detect or recognize at a lower confidence level.

Cancelling noise in an open ear system
11432067 · 2022-08-30 · ·

System and methods for selectively amplifying audio signals are disclosed. In one implementation, a method includes receiving at least one image of a plurality of images captured by a wearable camera; receiving a first audio signal representative of the sounds captured by a microphone; determining a looking direction of a user; and processing the first audio signal by amplifying audio coming from the looking direction of the user and attenuating audio coming from at least one other direction; receiving a second audio signal representative of the sounds captured by a hearing interface device; transmitting the second audio signal to a speaker associated with the hearing interface device; transmitting an additional audio signal to the speaker, wherein the transmission of the additional audio signal at least partially overlaps the transmission of the second audio signal; and transmitting the processed first audio signal to the speaker.

Script compliance and agent feedback

Systems and methods are provided for using automatic speech recognition to analyze a voice interaction and verify compliance of an agent reading a script to a client during the voice interaction. In one aspect of the invention, a method may include a voice interaction, wherein the agent follows the script via a plurality of panels. The voice interaction is evaluated via the plurality of panels employing panel-by-panel playback with an automatic speech recognition component adapted to analyze the voice interaction. As such, it may be determined, via generating a score using confidence level thresholds of an automatic speech recognition component such that confidence level thresholds are assigned to each of the plurality of panels and evaluating the score against at least one of a static standard and a varying standard, whether the agent has adequately followed the script.

Expandable lead jacket

Methods, devices and systems for separating an implanted object, such as a lead attached to a cardiac conduction device, from formed tissue within a blood vessel are provided. The methods, devices and systems for separating a lead from the tissue relate to dilating the tissue surrounding the lead from underneath the tissue and/or between the lead and the tissue.

Expandable lead jacket

Methods, devices and systems for separating an implanted object, such as a lead attached to a cardiac conduction device, from formed tissue within a blood vessel are provided. The methods, devices and systems for separating a lead from the tissue relate to dilating the tissue surrounding the lead from underneath the tissue and/or between the lead and the tissue.

Audio decoder for interleaving signals

A method for decoding an encoded audio bitstream in an audio processing system is disclosed. The method includes extracting from the encoded audio bitstream a first waveform-coded signal including spectral coefficients corresponding to frequencies up to a first cross-over frequency and performing parametric decoding at a second cross-over frequency to generate a reconstructed signal. The second cross-over frequency is above the first cross-over frequency and the parametric decoding uses reconstruction parameters derived from the encoded audio bitstream to generate the reconstructed signal. The method further includes extracting from the encoded audio bitstream a second waveform-coded signal including spectral coefficients corresponding to a subset of frequencies above the first cross-over frequency and interleaving the second waveform-coded signal with the reconstructed signal to produce an interleaved signal. The interleaved signal is then combined with the first waveform-coded signal.

Playback apparatus, setting apparatus, playback method, and program
09728201 · 2017-08-08 · ·

A playback apparatus includes: an acquiring unit that acquires auditory language data including data to be played back as a spoken voice; an analyzing unit that analyzes the auditory language data to output an analysis result; a setting unit that sets at least a portion of the auditory language data to a control portion to be played back at a set playback speed, based on the analysis result; and a voice playback unit that plays back the control portion as a spoken voice at the set playback speed.

Playback apparatus, setting apparatus, playback method, and program
09728201 · 2017-08-08 · ·

A playback apparatus includes: an acquiring unit that acquires auditory language data including data to be played back as a spoken voice; an analyzing unit that analyzes the auditory language data to output an analysis result; a setting unit that sets at least a portion of the auditory language data to a control portion to be played back at a set playback speed, based on the analysis result; and a voice playback unit that plays back the control portion as a spoken voice at the set playback speed.

Decoupled audio and video codecs

Various of the disclosed embodiments present systems and methods for improving improve audio and video quality in a Voice Over Internet Protocol (VOIP) connection including that includes both audio and video. Particularly, different audio and video codecs may be used and parameters assigned based upon the context in which the communication occurs. For example, audio quality may take precedence to video quality when discussing a matter in a chatroom. Conversely, video quality may take precedence to audio quality when playing a collaborative video game. VP9 may be used to encode video while a combination of ISAC and SPEEX may be used to encode audio. Bandwidth determinations for each channel may also influence the respective codec selections.