H04N21/8106

APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING

An apparatus for generating loudspeaker signals includes an object metadata processor configured to receive metadata, to calculate a second position of the audio object depending on the first position of the audio object and on a size of a screen if the audio object is indicated in the metadata as being screen-related, to feed the first position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being not screen-related, and to feed the second position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being screen-related. The apparatus further includes an object renderer configured to receive an audio object and to generate the loudspeaker signals depending on the audio object and on position information.
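The routing decision made by the object metadata processor can be sketched as follows. This is only an illustration: the abstract does not give the remapping formula, so the linear azimuth scaling, the `AudioObject` type, and the parameters `ref_half_width_deg` and `screen_half_width_deg` are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class AudioObject:
    azimuth_deg: float    # first position (azimuth only, for brevity)
    screen_related: bool  # flag carried in the object metadata


def position_for_renderer(obj: AudioObject,
                          ref_half_width_deg: float,
                          screen_half_width_deg: float) -> float:
    """Return the position information to feed into the object renderer."""
    if not obj.screen_related:
        # Not screen-related: pass the first position through unchanged.
        return obj.azimuth_deg
    # Screen-related: derive a second position from the first position and
    # the screen size. Linear scaling is an assumption of this sketch.
    scale = screen_half_width_deg / ref_half_width_deg
    return obj.azimuth_deg * scale
```

An object flagged as screen-related is rescaled to the reproduction screen, while any other object keeps its original position.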

AUDIO IMPROVEMENT USING CLOSED CAPTION DATA
20230164400 · 2023-05-25 ·

Methods and systems are described herein for improving audio for hearing-impaired content consumers. An example method may comprise determining a content asset. Closed caption data associated with the content asset may be determined. At least a portion of the closed caption data may be determined based on a user setting associated with a hearing impairment. Compensating audio comprising a frequency translation associated with at least the portion of the closed caption data may be generated. The content asset may be caused to be output with audio content comprising the compensating audio and the original audio.

System and method for identifying social trends

A method and system for identifying social trends are provided. The method includes collecting multimedia content from a plurality of data sources; gathering environmental variables related to the collected multimedia content; extracting visual elements from the collected multimedia content; generating at least one signature for each extracted visual element; generating at least one cluster of visual elements by clustering at least similar signatures generated for the extracted visual elements; correlating environmental variables related to visual elements in the at least one cluster; and determining at least one social trend by associating the correlated environmental variables with the at least one cluster.

TECHNIQUES FOR IMPROVING THE POWER EFFICIENCY OF A PLAYBACK DEVICE
20230111696 · 2023-04-13 ·

A playback device including a processor that executes program instructions such that the playback device is configured to receive first audio data representing audio content, generate and output second audio data based on the first audio data, and at least in part while generating and outputting the second audio data, generate and output a control signal associated with the second audio data to vary a supply voltage for an audio amplifier. The playback device also includes a switch-mode power supply (SMPS) that varies the supply voltage for the audio amplifier based on the control signal. The playback device also includes amplifier circuitry comprising the audio amplifier powered by the supply voltage from the SMPS. The amplifier circuitry is configured to receive the second audio data and generate an analog audio signal to drive a speaker based on the second audio data.
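One way a control signal could track the audio data is sketched below. The abstract does not say how the control signal is derived, so the peak-based rule, the function name, and every numeric default (gain, headroom, voltage limits) are assumptions made purely for illustration.

```python
def supply_voltage_for_block(samples,
                             amp_gain_v: float = 20.0,
                             headroom_v: float = 2.0,
                             min_v: float = 5.0,
                             max_v: float = 24.0) -> float:
    """Pick a supply voltage target for one block of normalized samples.

    All parameters are hypothetical; samples are assumed to lie in [-1, 1].
    """
    # Largest magnitude in the block (0.0 for an empty block).
    peak = max((abs(s) for s in samples), default=0.0)
    # Voltage swing the amplifier would need, plus a safety margin.
    needed = peak * amp_gain_v + headroom_v
    # Clamp to the range the power supply is assumed to support.
    return min(max(needed, min_v), max_v)
```

Quiet blocks settle at the minimum rail and loud blocks raise it, which is the general idea behind varying the amplifier supply to save power.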

Video reader with music word learning feature
11470365 · 2022-10-11 ·

Reading material on video gives the reader a seamless reading experience by displaying on a device of their choice a series of segments containing letters, words, phrases, sentences and/or paragraphs on a background of the drafter's choice. One segment flows into the other until the reading material is completed. These sequential segments are set to be viewed seamlessly with audio accompaniment. Words, sentences or paragraphs are set to music, where recognizable features of the music are played at the appearance of a certain word or the beginning of a sentence or paragraph. The appearance of a word, sentence or paragraph may be accompanied by the appearance of an image representing the word, sentence or paragraph, along with a recognizable designated musical element.

System and method for overlaying content on a multimedia content element based on user interest

A method and system for overlaying content on a multimedia content element. The method includes: partitioning the multimedia content element into a plurality of partitions; generating at least one signature for each partition of the multimedia content element, wherein each generated signature represents a concept; determining, based on the generated at least one signature, at least one link to content; identifying, based on the generated at least one signature, at least one of the plurality of partitions as a target area of user interest; and adding, as an overlay to the multimedia content element, the determined at least one link to content, wherein the at least one link is overlaid on the at least one target area.

PUBLISHING A DISPARATE LIVE MEDIA OUTPUT STREAM THAT COMPLIES WITH DISTRIBUTION FORMAT REGULATIONS

A system and a method are provided for publishing a disparate live media output stream that complies with distribution format regulations. The system includes a memory for storing instructions and a processor that executes the instructions. Based on the instructions, the processor manipulates a manifest of a live input stream based on a media segment identified for an edit. The manipulation of the manifest corresponds to removal of references to the media segment prior to a live event start indicator and after a live event end indicator, maintenance of indicators that mark locations of a non-programming content, and removal of duration information and referenced media segment that corresponds to originally scheduled non-programming content. A pre-encoded media asset is generated for a repeated playback based on the manipulation of the manifest of the live input stream.
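The manifest manipulation described above can be sketched roughly as follows. The manifest is modeled as a list of `(kind, value)` tuples, and the kinds `event_start`, `event_end`, `ad_marker`, `ad_segment`, and `segment` are hypothetical names chosen for this illustration, not tags from any real manifest format.

```python
def trim_manifest(entries):
    """Keep only in-event programming segments and non-programming markers."""
    trimmed = []
    inside_event = False
    for kind, value in entries:
        if kind == "event_start":
            inside_event = True      # live event start indicator
        elif kind == "event_end":
            inside_event = False     # live event end indicator
        elif not inside_event:
            continue                 # drop references outside the event
        elif kind == "ad_segment":
            continue                 # drop originally scheduled non-programming media
        else:
            trimmed.append((kind, value))  # keep segments and ad markers
    return trimmed
```

In this sketch the start and end indicators themselves are consumed rather than copied to the output; whether they should be retained is not stated in the abstract.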

MATCHING VIDEO CONTENT TO PODCAST EPISODES
20230105830 · 2023-04-06 ·

Systems and methods for matching videos to podcast episodes are provided. A data store comprising podcast episode identifiers is accessed. The podcast episode identifiers are associated with one or more podcast episode attributes. A video content item is identified. The video content item includes one or more video content item attributes. A matching podcast episode identifier that matches the video content item is determined based on the one or more podcast episode attributes and the one or more video content item attributes. A ranking of one of the video content item or the matching podcast episode identifier is caused to be adjusted to reflect the correspondence between the video content item and the matching podcast episode identifier. Information associated with the matching podcast episode identifier is provided to a first user device.
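A minimal sketch of attribute-based matching is given below. The abstract does not specify the matching criterion, so counting shared attributes is an assumption, as are the function name and the representation of attributes as sets of strings.

```python
def match_episode(video_attrs: set, episodes: dict):
    """Return the episode identifier sharing the most attributes, or None.

    `episodes` maps a podcast episode identifier to its set of attributes.
    """
    best_id, best_score = None, 0
    for episode_id, attrs in episodes.items():
        # Score is the number of attributes the video and episode share.
        score = len(video_attrs & attrs)
        if score > best_score:
            best_id, best_score = episode_id, score
    return best_id
```

A caller could then use the returned identifier to adjust ranking or to send episode information to a user device; those steps are outside this sketch.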

SYSTEMS AND METHODS FOR REPLAYING A CONTENT ITEM

Systems and methods for replaying a portion of a content item based on the user’s language proficiency level in a secondary language are disclosed. The system accesses a user profile comprising a user’s proficiency level in at least one secondary language, the secondary language being their non-native language. A command to replay a first portion of a content item is received and, in response to receiving the replay command, the system generates for display the first portion of the content item at a level below the user’s proficiency level in the secondary language.

System and method for determining a contextual insight and generating an interface with recommendations based thereon

A system and method for generating an interface for providing recommendations based on contextual insights, the method including: generating at least one signature for at least one multimedia content element identified within an interaction between a plurality of users; generating at least one contextual insight based on the generated at least one signature and user interests of the plurality of users, wherein each contextual insight indicates a current user preference; searching for at least one content item that matches the at least one contextual insight; and generating an interface for providing the at least one content item within the interaction between the plurality of users.