IPIQ

G06F16/685

METHODS AND SYSTEMS FOR VOICE RECOGNITION IN AUTONOMOUS FLIGHT OF AN ELECTRIC AIRCRAFT

20230053811 · 2023-02-23 ·

BETA AIR, LLC

A system for voice recognition in autonomous flight of an electric aircraft that includes a computing device communicatively connected to the electric aircraft configured to receive at least a voice datum from a remote device, wherein the voice datum is configured to include at least an expression datum, generate, using a first machine-learning process, a transcription datum as a function of the at least a voice datum, extract at least a query as a function of the transcription datum, generate, using a second machine-learning process, a communication output as a function of the at least a query, and adjust a flight plan as a function of the communication output.

System and method for combining phonetic and automatic speech recognition search

11587549 · 2023-02-21 ·

Nice Ltd.

A text search query including one or more words may be received. An ASR index created for an audio recording may be searched over using the query to produce ASR search results including words, each word associated with a confidence score. For each of the words in the ASR search results associated with a confidence score below a threshold (and in some cases having one or more preceding words in the ASR index and one or more subsequent words in the ASR index), a phonetic representation of the audio recording may be searched for the word having the confidence score below the threshold, where it occurs in the audio recording, possibly after the one or more preceding words and in the audio recording before the one or more subsequent words, to produce phonetic search results. Search results may be returned include ASR and phonetic results.

Device, system, and method for multimodal recording, processing, and moderation of meetings

11501780 · 2022-11-15 ·

AUDIOCODES LTD.

Devices, systems, and methods for automatic real-time moderation of meetings, by a computerized or automated moderation unit able to manage, steer and guide the meeting in real-time and able to selectively generate and convey real-time differential notifications and advice to particular participants. A Meeting Moderator Bot monitors audio conversations in a meeting, and analyzes their textual equivalent; detects topics that were skipped or that should be discussed, and notifies participants; detects double-talk or interferences and generates warnings accordingly; detects absence of participants that are relevant to particular topics; detects that the conversation should shift to another topic on the agenda; generates other meeting steering notifications; and monitors compliance of the meeting participants with such steering notifications.

SYSTEMS AND METHODS FOR REMOTELY INTERACTING WITH PERFORMERS AND INFLUENCING LIVE EVENTS

20230039768 · 2023-02-09 ·

A computer-implemented method of remotely influencing a performer at a live event via a customer mobile device is disclosed herein. The method includes: displaying a graphical user interface configured to receive user inputs; receiving a first user input including a user request for the performer at the live event; presenting predetermined terms and conditions associated with the user request; receiving a second user input including a user acceptance of the terms and conditions associated with the user request; transmitting the user request to a host server upon receiving the user acceptance of the terms and conditions associated with the user request; receiving a confirmation of the terms and conditions associated with the user request from the host server; and transmitting the user request for receipt by a performer mobile device of the performer during the live event.

Method, apparatus and computer device for searching audio, and storage medium

11574009 · 2023-02-07 ·

Guangzhou Kugou Computer Technology Co., Ltd.

Chaogang Zhang

The present disclosure relates to a method for searching an audio, pertaining to the technical field of electronics. The method includes: detecting a predetermined trigger event in response to receiving a trigger instruction for searching an audio; recording a time point when a detected trigger event occurs each time upon detecting the predetermined trigger event once until a predetermined end event is detected, and acquiring recorded time points to obtain a time point sequence; selecting a target reference time sequence matching the time point sequence from pre-stored reference time sequences; and determining target audio data corresponding to the target reference time sequence based on a pre-stored corresponding relationship between audio data and the reference time sequence.

Lyric search service

11573998 · 2023-02-07 ·

Apple Inc.

This application relates to a client-server architecture that enables search queries to be applied to transcription information for multimedia files. A server device implements a service configured to query a search platform to retrieve results associated with a plurality of multimedia files stored in a content database. The results are ordered according to a plurality of heuristic values calculated based on a text relevance analysis. The service is configured to modify the heuristic values to adjust an order of the results, and generate a response to a search request that includes a representation of at least a portion of the transcription information of the multimedia files referenced by the results. The heuristic values are modified based on at least one of a popularity score for a corresponding multimedia file, a weight associated with a particular field, or a relevance score based on feedback signals.

AUDIO FILE ANNOTATION

20230094828 · 2023-03-30 ·

Hans-Martin Ramsl

Text-to-speech translation is used to generate a transcript for an audio file. Text segments are associated with time segments in the transcript. A trained machine learning model determines, based on the text in the transcript, one or more topics for the audio file. The transcript is modified to include the determined one or more topics. A user interface may be presented that allows a user to search for portions of an audio file that relate to a particular topic. In response to the selected or entered topic, the user interface presents segments having a matching topic. The user may use voice or other user interface commands to modify the annotation of the audio file. User commands may also be used to extract data from the transcript and copy the data to a clipboard or to another application.

AUDIO RECOMMENDATION BASED ON TEXT INFORMATION AND VIDEO CONTENT

20230031056 · 2023-02-02 ·

An electronic device and method for audio recommendation and generation are disclosed. The electronic device receives textual information that indicates a plurality of scenes for video content, and determines a first plurality of features for the plurality of scenes. The electronic device determines a set of positions in the textual information based on the determined first plurality of features. A set of audio files are to be inserted at the set of positions related to a set of scenes of the plurality of scenes. The electronic device determines, by an artificial intelligent (AI) engine, the set of audio files for the set of scenes, based on a second plurality of features and the first plurality of features related to the set of scenes. The electronic device controls a display device to display first information corresponding to the set of positions and second information corresponding to the set of audio files.

Audio track determination based on identification of performer-of-interest at live event

11487815 · 2022-11-01 ·

Sony Corporation

An electronic device includes circuitry, firmware, and software that determines identification information associated with a first performer-of-interest at a live event and retrieves a first set of audio tracks from a plurality of audio tracks based on the determined identification information. The circuitry receives a first audio segment associated with the first performer-of-interest from an audio capturing device. The circuitry compares a first audio characteristic of the first audio segment with a second audio characteristic of a first audio portion of each of the first set of audio tracks. The circuitry determines a first audio track based on the comparison between the first audio characteristic and the second audio characteristic. The circuitry identifies a start position of the first audio track based on the first audio segment associated with the first audio track. The circuitry controls a display of the first lyrics information of the first audio track.

METHOD AND APPARATUS FOR DISPLAYING LYRIC EFFECTS, ELECTRONIC DEVICE, AND COMPUTER READABLE MEDIUM

20220351454 · 2022-11-03 ·

The present disclosure provides a method and an apparatus for displaying lyric effects, an electronic device, and a computer-readable medium. The method includes: obtaining, based on a lyric effect display operation of a user, an image sequence and music data to be displayed, the music data including audio data and lyrics; determining a target time point, playing at least one target image corresponding to the target time point in the image sequence, and determining target lyrics corresponding to the target time point in the lyrics, and adding animation effects on the at least one target image, displaying the target lyrics on the at least one target image, and playing a part of the audio data corresponding to the target lyrics.

Patent classifications

G06F16/685