H04N21/4394

Systems and methods for audio adaptation of content items to endpoint media devices
11540008 · 2022-12-27 · ·

Methods, systems, and non-transitory, machine-readable media are disclosed for audio adaption of content items to device operations of an endpoint media device. First observation data corresponding to media device operations associated with a first media device and mapped to first content items may be processed. A first content composite including an adaptable content item may be received. The first content composite may be adapted with a first audio segment. Based on the first observation data, the first audio segment may be selected. The first content composite may be configured with the first audio segment so that the adapted first content composite plays the first audio segment when the adapted first content composite is presented. The adapted first content composite may be output for presentation, where the first endpoint media device or the second endpoint media device performs at least one operation relating to the adapted first content composite.

Caption modification and augmentation systems and methods for use by hearing assisted user

A system and method for facilitating communication between an assisted user (AU) and a hearing user (HU) includes receiving an HU voice signal as the AU and HU participate in a call using AU and HU communication devices, transcribing HU voice signal segments into verbatim caption segments, processing each verbatim caption segment to identify an intended communication (IC) intended by the HU upon uttering an associated one of the HU voice signal segments, for at least a portion of the HU voice signal segments (i) using an associated IC to generate an enhanced caption different than the associated verbatim caption, (ii) for each of a first subset of the HU voice signal segments, presenting the verbatim captions via the AU communication device display for consumption, and (iii) for each of a second subset of the HU voice signal segments, presenting enhanced captions via the AU communication device display for consumption.

MEDIA PLAYBACK SYNCHRONIZATION OF MULTIPLE PLAYBACK SYSTEMS

A system includes a primary playback system and a secondary playback system. The primary playback system plays back selected content. The secondary playback system plays back supplemental media associated with the content played back on the primary playback system. A media playback function (such as associated with the secondary playback system) monitors playback of the content on the primary playback system. For example, a first processing thread of the media playback function initially synchronizes playback of supplemental media on the secondary playback system with respect to playback of the content on the primary playback system. Based on further monitoring of playing back the content on the secondary playback system, a second processing thread of the media playback function verifies synchronization (and, when needed, initiates re-synchronization) of playback of the supplemental media on the secondary playback system with respect to playback of the content on the primary playback system.

METHOD AND ELECTRONIC DEVICE FOR NAVIGATING APPLICATION SCREEN

Provided are an electronic device for navigating an application screen, and an operating method thereof. The method may include receiving a user input; determining, based on the user input, a user intent for controlling the electronic device; determining a command for performing a control operation corresponding to the user intent as a goal; identifying elements of a user interface on the screen of the application; determining, based on the user intent and the elements of the user interface, at least one sub-goal for executing the command; and executing the command by performing at least one task corresponding to the at least one sub-goal, wherein the at least one sub-goal is changeable based on a validation of an operation of navigating the application for executing the command, and the at least one task includes units of action for navigating the application.

DIARISATION AUGMENTED REALITY AIDE

An image of a real-world environment including one or more users, is received from an image capture device. A mask status of a first user of is determined by a processor based on the image. A stream of audio including speech from one or more users is captured from one or more audio transceivers. A first user speech from the stream of audio identified by the processor. The stream of audio is parsed, by the processor and based on the first user speech and based on an audio processing technique, to create a first user speech element. An augmented view that includes the first user speech element is generated, for a wearable computing device, based on the first user speech and based on the mask status.

Methods and systems for dynamic content modification

An example method can comprise receiving content for presentation at a user device. The content can comprise a plurality of sections, and each section can comprise a video portion and an audio portion. The user device can also receive content metadata regarding one or more features of the content, where the features of the content comprise one or more candidate sections of the content for modification. The user device can apply one or more rules to the received content based on the content metadata to modify one or more of the audio portion and the video portion of at least one section of the content, creating modified content, and can cause presentation of the modified content on a display device.

Computer-implemented method for telling a Story through sequential layers by an Artist
20220400319 · 2022-12-15 ·

The present invention relates to a computer-implemented method for Story-telling by The Artist. The method comprising uploading and/or creating media content to and/or by a first computer device by The Artist; sending one or more medial content from the first computer device to a remote server through one or more methods of communication, wherein said one or more media content forms a unit of The Story provided by The Artist; dividing said unit Story into at least three layers, wherein each layer is configured to represent a moment in The Story; and wherein the first layer comprises a first part of the media content, wherein the first part comprises an introduction of The Story; and the second layer comprises a second part of the media content; and the third layer comprises a third part of the media content, wherein the third layer comprises at least one final scene of The Story; and accessing to the divided content by a Visitor by means of a second computer, wherein the second computer is accessing the media content comprised in at least the first layer from the remote server; and presenting the first part of the media content on the second computer.

Automated audio mapping using an artificial neural network

According to one implementation, an automated audio mapping system includes a computing platform having a hardware processor and a system memory storing an audio mapping software code including an artificial neural network (ANN) trained to identify multiple different audio content types. The hardware processor is configured to execute the audio mapping software code to receive content including multiple audio tracks, and to identify, without using the ANN, a first music track and a second music track of the multiple audio tracks. The hardware processor is further configured to execute the audio mapping software code to identify, using the ANN, the audio content type of each of the multiple audio tracks except the first music track and the second music track, and to output a mapped content file including the multiple audio tracks each assigned to a respective one predetermined audio channel based on its identified audio content type.

METHOD FOR GENERATING TARGET VIDEO, APPARATUS, SERVER, AND MEDIUM
20220385996 · 2022-12-01 ·

A method for generating a target video, an apparatus, a server, and a medium are provided. The method includes: obtaining live broadcast stream data, wherein the live broadcast stream data comprises at least one among voice data and live broadcast interaction data, and video data; performing processing on the live broadcast stream data, and generating at least one among a corresponding voice metric value and interaction metric value, and a corresponding video metric value according to a target object included in a processing result; generating an overall metric value for the live broadcast stream data according to the generated metric values; and in response to determining that the comprehensive metric value for the live broadcast stream data satisfies a preset condition, generating a target video on the basis of the live broadcast stream data.

Intelligent automated assistant for TV user interactions

Systems and processes are disclosed for controlling television user interactions using a virtual assistant. In an example process, a virtual assistant can interact with a television set-top box to control content shown on a television display. Speech input for the virtual assistant can be received from a device with a microphone. The speech input can comprise a query associated with content shown on the television display. A user intent of the query can be determined based on one or more of the content shown on the television display and a viewing history of media content. A result of the query can be caused to be displayed based on the determined user intent.