G06F16/685

SYSTEMS AND METHODS FOR INTERPRETING NATURAL LANGUAGE SEARCH QUERIES
20230214382 · 2023-07-06 ·

Systems and methods are described herein for interpreting natural language search queries that account for contextual relevance of words of the search query that would ordinarily not be processed, including, for example, processing each word of the query. Each term or phrase is associated with a respective part of speech, and a frequency of occurrence of a combination of adjacent terms or phrases public domain is determined. A relevance of each term is then determined based on its respective type of term and frequency of occurrence in the public domain. The natural language search query is then interpreted based on the importance or relevance of each term.

Media content processing techniques for rights and clearance management
11550878 · 2023-01-10 ·

Systems and methods in accordance with various embodiments of the present disclosure provide improved techniques to process and manage media content and associated intellectual property rights associated with the media content. Intellectual property rights associated with media content can include copyright, trademarks, licenses to composition, synchronization, performance, recordings, etc. In particular, various embodiments provide media licensing management and monetization based on media licensing using a centralized registry of media content and associated asset rights.

BREAKOUT OF PARTICIPANTS IN A CONFERENCE CALL
20230007063 · 2023-01-05 ·

Systems and methods for creating and managing a breakout conference for a primary conference are disclosed. The system monitors communications between participants of a primary conference to determine if a) participants have a disagreement that needs to be resolved or b) if a topic from the meeting agenda requires additional time for discussion. Participant language, including negations and repetitive word usage, job profiles, body language, overlapping voice signals, among other factors, are monitored to determine if a disagreement exists. If a disagreement exists or additional time is required, the system automatically creates a virtual breakout session, determines the topic that created the disagreement, determines participants associated with the disagreed topic, and moves them to the breakout session. The system also provides meeting tools such that participants in the primary conference may communicate and alert participants in the breakout session, and vice versa, without leaving their respective sessions.

SYSTEMS AND METHODS FOR INSERTING EMOTICONS WITHIN A MEDIA ASSET
20230007359 · 2023-01-05 ·

Systems and methods are described herein for inserting emoticons within a media asset based on an audio portion of the media asset. Each audio portion of a media asset is associated with a respective part of speech, and an emotion corresponding to the audio portion for the media asset is determined. A corresponding emoticon is identified based on the determined emotion in the audio portion and causing to be presented at the location within the media asset.

System and method of automated model adaptation

Methods, systems, and computer readable media for automated transcription model adaptation includes obtaining audio data from a plurality of audio files. The audio data is transcribed to produce at least one audio file transcription which represents a plurality of transcription alternatives for each audio file. Speech analytics are applied to each audio file transcription. A best transcription is selected from the plurality of transcription alternatives for each audio file. Statistics from the selected best transcription are calculated. An adapted model is created from the calculated statistics.

Systems and methods for embedding data in media content

An electronic device modifies a first media content item by superimposing a first set of data over a first accented musical event. The first accented musical event has a first audio profile. The first set of data has a second audio profile configured to be masked by the first audio profile during playback of the first media content item. The electronic device transmits, to a second electronic device, the modified first media content item.

Processing audio and video
11546690 · 2023-01-03 · ·

A wearable device may include an image sensor configured to capture a plurality of images from an environment, a microphone configured to capture sounds from the environment, and at least one processor. The at least one processor may be programmed to receive audio signals representative of the sounds captured by the at least one microphone, and receive a first image including a representation of a first individual from among the plurality of images captured by the image sensor. The at least one processor may also be programmed to obtain a first audio segment from the audio signals using the first image. The first audio segment may include a first portion of the audio signals in which the first individual is speaking. The at least one processor may also be programmed to receive a second image including a representation of a second individual from among the plurality of images captured by the image sensor, and obtain a second audio segment from the audio signals using the second image. The second audio segment may include a second portion of the audio signals in which the second individual is speaking. The at least one processor may also be programmed to receive a third image including a representation of the first individual from among the plurality of images captured by the image sensor, and using the third image, obtain a third audio segment from the audio signals. The audio segment may include a third portion of the audio signals in which the first individual is speaking. The at least one processor may also associate the first and third audio segments with the first individual and associate the second audio segment with the second individual.

SYSTEM FOR PROVIDING CUSTOMIZED VIDEO PRODUCING SERVICE USING CLOUD-BASED VOICE COMBINING
20220415362 · 2022-12-29 ·

A system for providing a customized video producing service using a cloud based voice combination of the present invention comprises a customized video production service providing server including: a user terminal that is input and uploads utterance of a user by voice data, selects any one category among at least one type of category to select content including an image or a video, selects a subtitle or background music, and plays a customized video including the content, the uploaded voice data, and the subtitle or background music; a database unit classifying and storing text, image, video, and background music by the at least one type of category; an upload unit receiving the voice data corresponding to the utterance of the user uploaded from the user terminal; a conversion unit that converts the uploaded voice data into text data using STT (Speech to Text) and stores the converted text data; a provision unit that provides an image or video previously mapped and stored in the selected category to the user terminal when any one category among the at least one type of category is selected from the user terminal; a creation unit that creates the customized video including the content, the uploaded voice, and the subtitles or background music when receiving subtitle data or selection of background music from the user terminal by the user terminal's selection of the subtitle or background music.

SYSTEM FOR STORING VOICE RECORDING INFORMATION BASED ON BLOCKCHAIN
20220415329 · 2022-12-29 ·

Provided is a storage system for voice recording information based on a blockchain. The storage system for voice recording information comprises: a voice-to-text conversion device for converting a voice recording file into a text file using a preset voice-to-text conversion algorithm, and outputting the converted text file and information on the voice-to-text conversion algorithm; a blockchain network consisting of a plurality of participating nodes, and configured to generate blocks including the text file and information on the voice-to-text conversion algorithm output from the voice-to-text conversion device according to a preset consensus algorithm, and to store the generated blocks on the blockchain; and a data storage device for storing the original voice recording file, wherein the storage system reliably stores the text file for the voice recording file.

System and method for using multimedia content as search queries

There is provided a method for searching a plurality of information sources using a multimedia element, the method may include receiving at least one multimedia element; generating, by a signature generator, for the at least one multimedia element at least one signature that is unidirectional, and yields compression; generating at least one textual search query using the at least one signature; wherein the generating of the textual search query comprises: (a) searching for at least one matching stored signature that matches one or more of the at least one signature; and (b) using a mapping between stored signatures and textual search queries, selecting at least one textual search query mapped to at least one matching stored signature; searching the plurality of information sources using the at least one textual search query; and causing a display of search results retrieved from the plurality of information sources.