Patent classifications
H04M2201/39
AUTOMATED SYSTEMS AND METHODS FOR NATURAL LANGUAGE PROCESSING WITH SPEAKER INTENTION INFERENCE
A computerized method of managing a robotic telemarketing call includes calling, by an automated robotic telemarketing system, a customer selected from a customer list. The method includes parsing, by a real-time speech recognition module of the automated robotic telemarketing system, a customer statement received from the customer. The method includes determining, by a language intention determining module, a customer purchase intention according to the parsed customer statement. The method includes selecting a sales pitch response corresponding to the determined customer purchase intention. The method includes providing an audio signal including the selected sales pitch response to the customer.
Real-time transcription and interaction with a caller based on the transcription
Receiving a telephone call to an auto-attendant, artificial intelligence, or person takes place. While this phone call is being conducted, a speech to text transcription is created and sent in real-time to another person at another network node. This person can read the transcript and interact with the phone call by sending his or her own commands, text, or speech to be made part of the phone call.
Method and system for a multitenancy telephone network
A method and system for operating a multitenancy telephony system including receiving a plurality of call requests associated with an application server; selecting a subset of the received call requests using a load balancer; assigning each selected call request to one of a plurality of resources creating, on the assigned resource, a call session for each selected call request; and coordinating the created call sessions resulting from the selected subset of the received call requests associated with the application server.
Maintaining Audio Communication in a Congested Communication Channel
The invention relates to a communication system and a method of maintaining audio communication in a congested communication channel currently bearing the transmission of speech in audio communication between a sender side and a receiver side, the communication channel having at least one signaling channel and at least one payload channel having a quality of service. During the audio communication the quality of service of the payload channel is monitored. If the quality of service of the payload channel is below a threshold the speech at the respective sender side is converted to text; and transmitted over the retained communication channel to the respective receiver side. The text may be converted back to speech at the receiver side.
Personal voice-based information retrieval system
The present invention relates to a system for retrieving information from a network such as the Internet. A user creates a user-defined record in a database that identifies an information source, such as a web site, containing information of interest to the user. This record identifies the location of the information source and also contains a recognition grammar based upon a speech command assigned by the user. Upon receiving the speech command from the user that is described within the recognition grammar, a network interface system accesses the information source and retrieves the information requested by the user.
SYSTEMS AND METHODS FOR PROVIDING NOTIFICATIONS WITHIN A MEDIA ASSET WITHOUT BREAKING IMMERSION
Systems and methods for providing notifications without breaking media immersion. A notification delivery application receives notification data while a media device provides a media asset. In response to receiving the notification data while the media device provides the media asset, the notification delivery application generates a voice model based on a voice detected in the media asset. The notification delivery application converts the notification data to synthesized speech using the voice model and generates, by the media device, the synthesized speech for output at an appropriate point in the media asset based on contextual features of the media asset.
SPEECH COMMUNICATION SYSTEM AND METHOD WITH HUMAN-MACHINE COORDINATION
The present disclosure relates to a field of intelligent communications, and discloses a speech communication system and method with human-machine coordination, which resolve a problem of bad client experience because great differences occur after a switchover in a call through a prior human-machine coordination and time of a client is wasted. Key points of the technical solutions of the present disclosure include a communications module; an answering and calling-out module, allocating a human agent; a speech intention understanding module for understanding an intention in a call process; a voice cloning module, synthesizing, from a reply text, voice of a corresponding attendant; a human-machine interaction module, to communicate with a client by a speech robot; an intervention prompting module, monitoring call content to obtain an intervention willing probability, and prompting the human agent according to a predefined rule; and a manual intervention module, through which the client is replied by a person, and the manual intervention module displays a call intention and prompts the call during the call. In this way, during the call, a seamless switchover may be performed between the human agent and the speech robot. After the manual intervention, the attendant can quickly understand the call intention, to ensure good call experience of the client.
TECHNOLOGIES FOR ENHANCING AUDIO QUALITY DURING LOW-QUALITY CONNECTION CONDITIONS
Techniques for teleconferencing with enhanced audio during low-quality connection conditions are disclosed. In the illustrative embodiment, a user of a compute device is teleconferencing with users of one or more remote compute devices. The compute device monitors a connection quality with a remote compute device. If the connection quality drops below a threshold, risking gaps in the audio data, the compute device generates speech code data that can be used to fill in the gaps in the audio data. The remote compute device can use the speech code data to augment the audio data by using a voice model to create additional audio data based on the speech code data.
Remote access control
A system that allows a user to access a secured area by confirming location of the user near the secured area and receipt of an access word by the system. A user in possession of a mobile device may be detected at the entrance to a secured location. The system can select a question asking for a passcode corresponding to certain access to the secured location. The question can be spoken to a user through a loudspeaker at the secured location. The user's spoken response can be processed by the system, for example using keyword spotting, to determine if the proper access word is included. If it is, the system can grant access to the secured area.
THREE-WAY CALLING TERMINAL FOR MOBILE HUMAN-MACHINE COORDINATION CALLING ROBOT
A three-way calling terminal for a mobile human-machine coordination calling robot. Technical solutions include: a first speech interface, configured to transfer call audio between a call object and a back-end processing module; a CODEC1 module, configured to encode and decode the call audio between the call object and the back-end processing module; a second speech interface, configured to transfer call audio between the human agent and the call object; a CODEC2 module, configured to encode and decode the call audio between the human agent and the call object; a call control module, configured to process a control signal, and automatically make, answer, and hang up a call; a data processing submodule, configured to process speech data and perform data transfer between the data processing submodule and the back-end processing module; and a networking submodule, configured to be connected to the back-end processing module.