H04N21/4394

Systems And Methods For Recording Relevant Portions Of A Media Asset
20230217062 · 2023-07-06 ·

Systems and methods are presented herein for recording portions of a media asset relevant to recording criteria. A media application receives input indicating the recording criteria and identifying a first keyword. The media application accesses a data structure to identify a first node associated with the first keyword. The data structure includes the first node and a plurality of nodes connected to the first node via a plurality of paths. The media application receiving audio component data for a portion of the media asset extracts a term from the audio component data, and identifies a second node in the data structure that is associated with the extracted term. The media application calculates a path score for the portion of the media asset based on a path size in the data structure between the first node and the second node. When the score is high enough, the portion of the media asset is recorded.

Media Presentation Device with Voice Command Feature
20230217079 · 2023-07-06 ·

A media presentation device determines a voice command associated with media content presented by the media presentation device. The media presentation device then listens for and detects utterance of the determined voice command during presentation of the media content, and the media presentation device responds to the detected utterance by performing an action that facilitates user purchase of the good or service associated with the media content segment.

DEVICE FOR DETECTING MUSIC DATA FROM VIDEO CONTENTS, AND METHOD FOR CONTROLLING SAME

A data processing method according to the present invention comprises the steps of: receiving an input of video contents including a video stream and an audio stream; detecting music data from the audio stream; and filtering the audio stream so that the music data detected from the audio stream is removed.

Video synthesis method terminal and computer storage medium

The disclosure provides a video synthesis method, a terminal and a storage medium. The method includes acquiring at least one video clip. The method includes acquiring a target audio suitable to video content based on the video content and the number of the at least one video clip. T number of the audio change points of the target audio is greater than or equal to the number of at least one video clip minus one, and the audio change points comprise time points at which change in audio feature satisfies a preset condition; and obtaining a video file by synthesizing the at least one video clip and the target audio based on the audio change points included in the target audio.

Applications for decoder-side modeling of objects identified in decoded video data

Techniques are disclosed for coding and decoding video data using object recognition and object modeling as a basis of coding and error recovery. A video decoder may decode coded video data received from a channel. The video decoder may perform object recognition on decoded video data obtained therefrom, and, when an object is recognized in the decoded video data, the video decoder may generate a model representing the recognized object. It may store data representing the model locally. The video decoder may communicate the model data to an encoder, which may form a basis of error mitigation and recovery. The video decoder also may monitor deviation patterns in the object model and associated patterns in audio content; if/when video decoding is suspended due to operational errors, the video decoder may generate simulated video data by analyzing audio data received during the suspension period and developing video data from the data model and deviation(s) associated with patterns detected from the audio data.

SYSTEMS, METHOD, AND MEDIA FOR REMOVING OBJECTIONABLE AND/OR INAPPROPRIATE CONTENT FROM MEDIA
20230216909 · 2023-07-06 ·

Mechanisms for removing objectionable and/or inappropriate content from media content items are provided. In some embodiments, the method comprises: receiving a first media content item and a dictionary, wherein the first media content item includes an audio component and a video component; identifying a plurality of scenes and a plurality of scene breaks associated with the first media content item; transcribing the audio component of the first media content item to produce transcribed audio; comparing the transcribed audio to entries in the dictionary and storing matches between the transcribed audio and the entries; and generating a second media content item by removing at least a portion of at least one of the audio component and the video component based on the matches.

METHODS AND APPARATUS FOR MEASURING ENGAGEMENT DURING MEDIA EXPOSURE
20230217071 · 2023-07-06 ·

Methods, apparatus, systems, and articles of manufacture are disclosed for measuring engagement during media exposure. An example apparatus includes at least one memory, machine readable instructions, and processor circuitry to at least one of instantiate or execute the machine readable instructions to identify media presented via a media device in a media presentation environment, identify ambient audio detected in the media presentation environment, determine whether the ambient audio is distractive to presentation of the media in the media presentation environment, and adjust a media exposure report based on a determination that the ambient audio is distractive.

System and method for dual mode presentation of content in a target language to improve listening fluency in the target language
11551568 · 2023-01-10 · ·

Embodiments of a language learning system and method for implementing or assisting in self-study for improving listening fluency in a target language are disclosed. Such embodiments may simultaneously present the same piece of content in an auditory presentation and a corresponding visual presentation of a transcript of the auditory presentation, where the two presentations are adapted to work in tandem to increase the effectiveness of language learning for users.

Using non-audio data embedded in an audio signal
11553237 · 2023-01-10 · ·

Embodiments included herein generally relate to measuring a latency of a playback device. For example, a method includes: determining a first latency of a playback device; determining a second latency of the playback device; comparing the second latency to the first latency to determine whether an event occurred at the playback device; and in response to detecting a latency change between the second latency and the first latency indicating the occurrence of the event, adjusting a timing of a data stream provided to the playback device based on the latency change.

Systems and methods for facilitating configuration of an audio system
11553277 · 2023-01-10 · ·

A system for facilitating configuration of an audio system tests the existing audio system configuration by playing reference audio clips of various different genres of programming, the audio of which is received by a microphone of a remote control device or other handheld mobile device at various different listening points in the room. The system then compares the audio signal representing audio of the reference audio clip received at the microphone to a reference audio signal representing the reference audio clip and determines other audio characteristics. The system then determines, based on the comparison and other characteristics of the room (e.g., furniture layout, room construction materials, wall treatments and current speaker positioning) suggested changes to a configuration of the audio system to increase audio quality of the audio system for playing audio associated with the various different genres of programming.