G06F16/687

AUTOMATED SPEECH-TO-TEXT PROCESSING AND ANALYSIS OF CALL DATA APPARATUSES, METHODS AND SYSTEMS

The present invention discloses a system, apparatus, and method that obtains audio and metadata information from voice calls, generates textual transcripts from those calls, and makes the resulting data searchable via a user interface. The system converts audio data from one or more sources (such as a telecommunications provider) into searchable usable text transcripts. One use of which is law enforcement and intelligence work. Another use relates to call centers to improve quality and track customer service history. Searches can be performed for callers, callees, keywords, and/or other information in calls across the system. The system can also generate automatic alerts based on callers, callees, keywords, phone numbers, and/or other information. Further the system generates and provides analytic information on the use of the phone system, the semantic content of the calls, and the connections between callers and phone numbers called, which can aid analysts in detecting patterns of behavior, and in looking for patterns of equipment use or failure.

AUTOMATED SPEECH-TO-TEXT PROCESSING AND ANALYSIS OF CALL DATA APPARATUSES, METHODS AND SYSTEMS

The present invention discloses a system, apparatus, and method that obtains audio and metadata information from voice calls, generates textual transcripts from those calls, and makes the resulting data searchable via a user interface. The system converts audio data from one or more sources (such as a telecommunications provider) into searchable usable text transcripts. One use of which is law enforcement and intelligence work. Another use relates to call centers to improve quality and track customer service history. Searches can be performed for callers, callees, keywords, and/or other information in calls across the system. The system can also generate automatic alerts based on callers, callees, keywords, phone numbers, and/or other information. Further the system generates and provides analytic information on the use of the phone system, the semantic content of the calls, and the connections between callers and phone numbers called, which can aid analysts in detecting patterns of behavior, and in looking for patterns of equipment use or failure.

Geo-visual search

Performing a geo-visual search is disclosed. A query feature vector associated with a query tile is obtained. A lookup is performed at least in part by using a key derived from the query feature vector. A list of candidate feature vectors is obtained based at least in part on the lookup. Based at least in part on a comparison of the query feature vector against at least some of the candidate feature vectors in the obtained list, a tile that is visually similar to the query tile is determined. The determined tile is provided as output.

Audio Content Search in a Media Playback System
20230273955 · 2023-08-31 ·

Embodiments are described herein that provide searches, including a multi-dimensional search, a cross-source search, or both in a media playback system. The search can be initiated by way of a selection of a location on user interface of a controller. The location corresponds to one or more metadata that is used in the search. Results are sorted and displayed. In some embodiments, metadata is used to filter and/or sort the results.

COMPUTER SYSTEM FOR REALIZING CUSTOMIZED BEING-THERE IN ASSOCATION WITH AUDIO AND METHOD THEREOF

A method by a computer system including generating audio files based on respective audio signals, the audio signals having been respectively generated from a plurality of objects at a venue, generating metadata including spatial features at the venue that are respectively set for the objects, and transmitting the audio files and the metadata for the objects to a first electronic device to cause the first electronic device to realize a being-there at the venue by rendering the audio files based on the spatial features in the metadata may be provided.

COMPUTER SYSTEM FOR REALIZING CUSTOMIZED BEING-THERE IN ASSOCATION WITH AUDIO AND METHOD THEREOF

A method by a computer system including generating audio files based on respective audio signals, the audio signals having been respectively generated from a plurality of objects at a venue, generating metadata including spatial features at the venue that are respectively set for the objects, and transmitting the audio files and the metadata for the objects to a first electronic device to cause the first electronic device to realize a being-there at the venue by rendering the audio files based on the spatial features in the metadata may be provided.

COMPUTER SYSTEM FOR PRODUCING AUDIO CONTENT FOR REALIZING CUSOMIZED BEING-THERE AND METHOD THEREOF

Provided are a computer system for producing audio content for realizing a user-customized being-there and a method thereof. The computer system may be configured to generate audio files based on respective audio signals that are respectively generated from a plurality of objects at a venue, set spatial features at the venue for the objects, respectively, using a production tool, and generate metadata for the audio files based on the spatial features. An electronic device may realize a being-there at the venue by rendering the audio files based on the spatial features in the metadata. That is, a user of the electronic device may feel a user-customized being-there as if the user directly listens to audio signals generated from corresponding objects at a venue in which the objects are provided.

COMPUTER SYSTEM FOR PRODUCING AUDIO CONTENT FOR REALIZING CUSOMIZED BEING-THERE AND METHOD THEREOF

Provided are a computer system for producing audio content for realizing a user-customized being-there and a method thereof. The computer system may be configured to generate audio files based on respective audio signals that are respectively generated from a plurality of objects at a venue, set spatial features at the venue for the objects, respectively, using a production tool, and generate metadata for the audio files based on the spatial features. An electronic device may realize a being-there at the venue by rendering the audio files based on the spatial features in the metadata. That is, a user of the electronic device may feel a user-customized being-there as if the user directly listens to audio signals generated from corresponding objects at a venue in which the objects are provided.

Prioritizing delivery of location-based personal audio

The technology described in this document can be embodied in a computer-implemented method of controlling a wearable audio device configured to provide an audio output. The method includes receiving data indicating the wearable audio device is proximate a geographic location associated with a localized audio message, and determining that the localized audio message has a higher priority compared to at least one other localized audio message associated with another geographical location proximate to the wearable audio device. The method also includes, responsive to determining that the localized audio message has the higher priority, providing a prompt to initiate playback of the localized audio message with the higher priority to a user of the wearable audio device, and initiating playback of the localized audio message with the higher priority at the wearable audio device in response to actuation of the prompt by the user.

Prioritizing delivery of location-based personal audio

The technology described in this document can be embodied in a computer-implemented method of controlling a wearable audio device configured to provide an audio output. The method includes receiving data indicating the wearable audio device is proximate a geographic location associated with a localized audio message, and determining that the localized audio message has a higher priority compared to at least one other localized audio message associated with another geographical location proximate to the wearable audio device. The method also includes, responsive to determining that the localized audio message has the higher priority, providing a prompt to initiate playback of the localized audio message with the higher priority to a user of the wearable audio device, and initiating playback of the localized audio message with the higher priority at the wearable audio device in response to actuation of the prompt by the user.