Patent classifications
H04M2203/352
Background audio identification for speech disambiguation
Implementations relate to techniques for providing context-dependent search results. A computer-implemented method includes receiving an audio stream at a computing device during a time interval, the audio stream comprising user speech data and background audio, separating the audio stream into a first substream that includes the user speech data and a second substream that includes the background audio, identifying concepts related to the background audio, generating a set of terms related to the identified concepts, influencing a speech recognizer based on at least one of the terms related to the background audio, and obtaining a recognized version of the user speech data using the speech recognizer.
A SYSTEM AND METHOD FOR PROVIDING CONTEXTUAL INFORMATION AND ACTIONS TO MAKE A CONVERSATION MEANINGFUL AND ENGAGING
The invention relates to a system (100) and method (200) for providing contextual information and actions to make a conversation meaningful and engaging. The method (200) comprises the steps of identifying a contact from various data sources (101) and collecting the relevant information from one or more web-based applications and databases, wherein the collected information is mapped to create one or more discussion points by one or more prediction servers (102). Post-conversation suggestions are provided by one or more suggestion servers (103), wherein the discussion points and the post-conversation suggestions are displayed on a user interface device (104) for allowing the user to have meaningful and engaging conversations.
METHOD AND SYSTEM FOR VOLUME CONTROL
A method performed by a first electronic device, the method includes, while engaged in a call with a second electronic device, initiating a joint media playback session in which the first and second electronic devices independently stream media content for synchronous playback; driving a speaker with a mix of a downlink signal of the call and an audio signal of the media content at an overall volume level; receiving a user-adjustment at a single volume control for the first electronic device to reduce the overall volume level; in response to the user adjustment, applying a first gain adjustment to the downlink signal and a second gain adjustment to the audio signal; and driving the speaker with a mix of the downlink signal and the audio signal at the reduced volume level.
BACKGROUND AUDIO IDENTIFICATION FOR SPEECH DISAMBIGUATION
Implementations relate to techniques for providing context-dependent search results. A computer-implemented method includes receiving an audio stream at a computing device during a time interval, the audio stream comprising user speech data and background audio, separating the audio stream into a first substream that includes the user speech data and a second substream that includes the background audio, identifying concepts related to the background audio, generating a set of terms related to the identified concepts, influencing a speech recognizer based on at least one of the terms related to the background audio, and obtaining a recognized version of the user speech data using the speech recognizer.
INITIATE "SEND INFORMATION VIA TEXT" BUTTON DURING A BRANDED CALL
Systems and methods for providing voice assisted data during a branded voice session via a telecommunication network include a provider device including an in-call dialer interface, a cell site, and message management circuitry communicatively coupled to a call center. The provider device is structured to receive an indication of an in-call message during a branded voice session, determine a call address responsive to receiving the indication of the in-call message, generate a message interface including the call address, receive, by the message interface, the voice assisted data, and based on the call address, provide the voice assisted data during the branded voice session.
Ambient sound rendering for online meetings
Techniques of conducting an online meeting involve outputting ambient sound to a participant of an online meeting. Along these lines, in an online meeting during which a participant wears headphones, the participant's computer receives microphone input that contains both speech from the participant and ambient sound that the participant may wish to hear. In response to receiving the microphone input, the participant's computer separates low-volume sounds from high-volume sounds. However, instead of suppressing this low-volume sound from the microphone input, the participant's computer renders this low-volume sound. In most cases, this low-volume sound represents ambient sound generated in the vicinity of the meeting participant. The participant's computer then mixes the low-volume sound with speech received from other conference participants to form output in such a way that the participant may distinguish this sound from the received speech. The participant's computer then provides the output to the participant's headphones.
Silence signatures of audio signals
A method performed by a processing system. The method includes generating silence signatures of audio signals from a plurality of device based on energy levels of the audio signals, providing the silence signatures to an interaction service, and outputting interaction information corresponding to the devices.
VOICE TALLYING SYSTEM
The present invention relates to a voice tallying system to determine the relative participation of individual participants in a meeting. The voice tallying system according to the present invention comprises at least one voice recording device, a communication path from the voice recording device to a computing device having a voice analysis module. The voice tallying system and the method of the present invention include the capability to receive audio signals from each of the participants in a meeting and determine the identity of the speaker for each of the audio stream using voice profile information of the participants previously obtained and stored in the voice analysis module. The voice tallying system and the method further include the capability to tally the relative participation of a participant in a meeting in real time and as a result it is possible to display contemporaneously a voice tally for a participant with reference to that of other participants in the meeting.
Background audio identification for speech disambiguation
Implementations relate to techniques for providing context-dependent search results. A computer-implemented method includes receiving an audio stream at a computing device during a time interval, the audio stream comprising user speech data and background audio, separating the audio stream into a first substream that includes the user speech data and a second substream that includes the background audio, identifying concepts related to the background audio, generating a set of terms related to the identified concepts, influencing a speech recognizer based on at least one of the terms related to the background audio, and obtaining a recognized version of the user speech data using the speech recognizer.
Methods and apparatus for conducting internet protocol telephony communications
IP telephony communications are conducted by sending both audio data produced by a CODEC that represents received spoken audio input, and a textual representation of the spoken audio input. A receiving device utilizes the textual representation of the spoken audio input to help recreate the spoken audio input when a portion of the CODEC data is missing. The textual representation can be generated by a speech-to-text function. Alternatively, the textual representation can be a notation of extracted phonemes.