IPIQ

H04M3/42221

Recording gap detection and remediation

11716416 · 2023-08-01 ·

Zoom Video Communications, Inc.

A disconnection of a client device is disconnected during a multi-participant communication, such as a call or a conference. An indication of the disconnection is transmitted to the client device to cause an agent at the client device to record media locally at the client device. The media recorded by the agent at the client device based on the indication of the disconnection is later received and included within a recording of the communication. For example, a gap of the recording in which the disconnection occurred may be identified, such as by performing a comparison of media within the recording to identify a start time of the gap and an end time of the gap. The media is then inserted within a portion of the recording of the multi-participant communication corresponding to the gap.

DETECTING AND RESOLVING FRAUDULENT CALLS

20230239400 · 2023-07-27 ·

Computer-implemented methods, computer program products, and computer systems for detecting and resolving fraudulent calls. The computer-implemented methods include one or more processors configured for receiving voice call data corresponding to an incoming telephone call placed to a user device, wherein the voice call data comprises caller voice data. Further, the computer-implemented methods may include one or more processors configured for converting the caller voice data to caller text data comprising one or more text phrases. Further embodiments of the computer-implemented method include processors configured for determining that the one or more text phrases satisfies a first condition, and responsive to determining that the one or more text phrases satisfies the first condition, transmitting a user alert to the user device.

Intelligent Voice Interface for Handling Out-of-Context Dialog

20230028693 · 2023-01-26 ·

In a method for handling out-of-sequence caller dialog, an intelligent voice interface is configured to lead callers through pathways of an algorithmic dialog that includes available voice prompts for requesting different types of caller information. The method may include, during a voice communication with a caller via a caller device, receiving from the caller device caller input data indicative of a voice input of the caller, without having first provided to the caller device any voice prompt that requests a first type of caller information, and determining, by processing the caller input data, that the voice input includes caller information of the first type. The method also includes after determining that the voice input includes the caller information of the first type, bypassing one or more voice prompts, of the available voice prompts, that request the first type of caller information.

SYSTEM AND METHOD FOR AUTOMATED PROCESSING OF NATURAL LANGUAGE USING DEEP LEARNING MODEL ENCODING

20230025114 · 2023-01-26 ·

Rijul MAGU

Automated systems and methods are provided for processing natural language, comprising obtaining first and second digitally-encoded speech representations, respectively corresponding to an agent script for and a voice recording of a telecommunication interaction; generating a similarity structure based on the speech representations, the similarity structure representing a degree of semantic similarity between the speech representations; matching markers in the first speech representation to markers in the second speech representation based on the similarity structure; and dividing the telecommunication interaction into a plurality of sections based on the matching.

Switch controller for separating multiple portions of call

11563780 · 2023-01-24 ·

Tevnos LLC

Stuart Alexander Jacobson

Disclosed herein are systems, methods, and non-transitory computer-readable storage media for collecting call data, feeding call data to applications, and providing advanced call features.

Online Interview Method and System

20230222449 · 2023-07-13 ·

One or more embodiments of the present specification relates to an online interview method and system. The method includes: establishing communication connection between an interviewing terminal and an interviewed terminal through a network; and acquiring communication information between the interviewing terminal and the interviewed terminal. The communication information includes one or more types of audio information, video information, and text information. The interviewing terminal includes one or more of a first host terminal, a second host terminal, and a text processing terminal. The first host terminal is configured to host an interview, and the first host terminal displays an interview outline and/or information of the interviewed terminal. The second host terminal is configured to host the interview and/or participate in the consultation of interview questions. The text processing terminal converts the audio information and/or the audio information in the video into corresponding text information.

Proximity based audio collaboration

11558208 · 2023-01-17 ·

International Business Machines Corporation

A method includes: defining, by a computer device, an audio collaborative environment; defining, by the computer device, an access control of the audio collaborative environment, wherein the access control includes a geofence; receiving, by the computer device, a request from at least one user device to connect to the audio collaborative environment; determining, by the computer device, the at least one user device satisfies the access control; connecting, by the computer device, the at least one user device to an audio channel of the audio collaborative environment; recording, by the computer device, audio data transmitted on the audio channel by the at least one user device; storing, by the computer device, the audio data in a record; tagging, by the computer device, respective portions of the audio data in the record; and presenting one of the respective portions of the audio data to a user based on the tagging.

Machine learning for improving quality of voice biometrics

11699446 · 2023-07-11 ·

Capital One Services, Llc

Methods and systems are disclosed herein for improving the quality of audio for use in a biometric. A biometric system may use machine learning to determine whether audio or a portion of the audio should be used as a biometric for a user. A sample of the user's voice may be used to generate a voice signature of the user. Portions of the audio that do not meet a similarity threshold when compared with the voice signature may be removed from the audio. Additionally or alternatively, interfering noises may be detected and removed from the audio to improve the quality of a voice biometric generated from the audio.

VOICE APPLICATION NETWORK PLATFORM

20230216945 · 2023-07-06 ·

A distributed voice applications system includes a voice application rendering agent and at least one voice applications agent that is configured to provide voice applications to an individual user. A management system may control and direct the voice applications rendering agent to create voice applications that are personalized for individual users based on user characteristics, information about the environment in which the voice applications will be performed, prior user interactions and other information. The voice applications agent and components of customized voice applications may be resident on a local user device which includes a voice browser and speech recognition capabilities. The local device, voice applications rendering agent and management system may be interconnected via a communications network.

Automated Robocall Detection

20230216955 · 2023-07-06 ·

Novel tools and techniques are provided for implementing automated robocall detection. In various embodiments, a computing system may compare first abstracted raw data, obtained from a portion of call data from a first call from a first originating party, with each of a plurality of abstracted raw data, obtained from portions of call data from a plurality of calls from a corresponding plurality of originating parties. In some instances, the plurality of abstracted raw data and the first abstracted raw data may each include at least one of word count data, phoneme count data, inter-word timing data, voice pitch estimation data, and/or background noise data. The computing system may determine whether the first abstracted raw data is indicative of the first call being a suspected unsolicited or unwanted communication, based at least in part on the comparison. If so, the computing system may perform one or more tasks.

Patent classifications

H04M3/42221