H04N21/4396

METADATA FOR DUCKING CONTROL

An audio encoding device and an audio decoding device are described herein. The audio encoding device may examine a set of audio channels/channel groups representing a piece of sound program content and produce a set of ducking values to associate with one of the channels/channel groups. During playback of the piece of sound program content, the ducking values may be applied to all other channels/channel groups. Application of these ducking values may cause (1) the reduction in dynamic range of ducked channels/channel groups and/or (2) movement of channels/channel groups in the sound field. This ducking may improve intelligibility of audio in the non-ducked channel/channel group. For instance, a narration channel/channel group may be more clearly heard by listeners through the use of selective ducking of other channels/channel groups during playback.
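The gain-reduction side of this abstract can be sketched as follows. This is a minimal illustration only: the function name `apply_ducking`, the dictionary layout, and the use of one ducking value (in dB) per sample are assumptions, not details from the abstract. It also omits the abstract's second effect, movement of channels in the sound field.

```python
from typing import Dict, List


def apply_ducking(
    channels: Dict[str, List[float]],
    protected: str,
    ducking_db: List[float],
) -> Dict[str, List[float]]:
    """Attenuate every channel except `protected` by the ducking values.

    Assumption: `ducking_db` holds one gain value in dB per sample
    (0.0 means no ducking, negative values attenuate).
    """
    ducked = {}
    for name, samples in channels.items():
        if name == protected:
            # The channel the ducking values are associated with is left untouched.
            ducked[name] = list(samples)
        else:
            # Convert each dB value to a linear gain and scale the sample.
            ducked[name] = [
                s * 10 ** (g / 20.0) for s, g in zip(samples, ducking_db)
            ]
    return ducked


mix = {"narration": [1.0, 1.0], "music": [1.0, 1.0]}
result = apply_ducking(mix, protected="narration", ducking_db=[0.0, -20.0])
```

In this sketch the narration channel passes through unchanged while the music channel is reduced by 20 dB on the second sample.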

METHODS AND SYSTEMS FOR SELECTIVE PLAYBACK AND ATTENUATION OF AUDIO BASED ON USER PREFERENCE

Systems and methods are presented for filtering unwanted sounds from a media asset. Voice profiles of a first character and a second character are generated based on a first voice signal and a second voice signal received from the media device during a presentation. The user provides a selection to avoid a certain sound or voice associated with the second character. During a presentation of the media asset, a second audio segment is analyzed to determine, based on the voice profile of the second character, whether the second voice signal includes the voice of the second character. If so, the output characteristics of the second voice signal are adjusted to reduce the sound.

IN-VEHICLE MULTI-OCCUPANT MEDIA MANAGEMENT

Various embodiments also include a computer-implemented method comprising: determining a communications mode for a plurality of consoles operating within a vehicle; based on the determined communications mode, initiating an in-vehicle communication between a set of consoles included in the plurality of consoles, where the set of consoles includes at least a first console and a second console; in response to the in-vehicle communication, causing each of the set of consoles to attenuate volumes of a set of content items playing on each of the set of consoles; receiving a speech signal generated by a first user of the set of consoles; and causing the set of consoles to reproduce the speech signal.

SYSTEMS AND METHODS FOR HIGHLIGHTING CONTENT WITHIN MEDIA ASSETS
20230007336 · 2023-01-05

Systems and methods are described herein for highlighting objects within primary content that are likely to be of interest to a user viewing the primary content. More particularly, when the system receives a segment of primary content to be displayed on a user equipment device for consumption, the system analyzes the received segment to identify an object within the received segment. The system then checks a database storing supplemental content to determine whether supplemental content associated with the identified object is available. When supplemental content associated with the identified object is available within the database, the system modifies the received segment of the primary content to highlight the identified object and displays the modified segment of the primary content on the user equipment device for consumption.

Voice Control Device with Push-To-Talk (PTT) and Mute Controls
20220406300 · 2022-12-22

Disclosed herein are system, apparatus, article of manufacture, method and/or computer program product embodiments, and/or combinations and sub-combinations thereof, for a voice control device including a microphone, a mute control, and a push-to-talk (PTT) control. An example embodiment operates by: entering a mute state from an always-listening state when the device receives a mute control signal; entering a PTT state from the mute state when the device is in the mute state and receives a first PTT control signal; activating the microphone when the device is in the PTT state; and entering the mute state from the PTT state when the device is in the PTT state and receives a second PTT control signal.
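The state transitions listed in this abstract can be sketched as a small state machine. The class and enum names below are hypothetical. Two behaviors are assumptions rather than statements from the abstract: signals with no listed transition leave the state unchanged, and the microphone is treated as active in the always-listening state.

```python
from enum import Enum, auto


class State(Enum):
    ALWAYS_LISTENING = auto()
    MUTE = auto()
    PTT = auto()


class Signal(Enum):
    MUTE = auto()
    PTT = auto()


# Only the three transitions named in the abstract are listed.
TRANSITIONS = {
    (State.ALWAYS_LISTENING, Signal.MUTE): State.MUTE,
    (State.MUTE, Signal.PTT): State.PTT,   # first PTT control signal
    (State.PTT, Signal.PTT): State.MUTE,   # second PTT control signal
}


class VoiceControlDevice:
    def __init__(self) -> None:
        self.state = State.ALWAYS_LISTENING

    @property
    def microphone_active(self) -> bool:
        # Assumption: the microphone is off only while muted.
        return self.state is not State.MUTE

    def handle(self, signal: Signal) -> State:
        # Assumption: unlisted (state, signal) pairs keep the current state.
        self.state = TRANSITIONS.get((self.state, signal), self.state)
        return self.state


device = VoiceControlDevice()
history = [
    device.handle(Signal.MUTE),  # always-listening -> mute
    device.handle(Signal.PTT),   # mute -> PTT
    device.handle(Signal.PTT),   # PTT -> mute
]
```

Keeping the transitions in a lookup table makes it easy to see that the sketch covers exactly the cases the abstract describes and nothing more.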

ON DEMAND SHARED MEDIA STREAMING WITH LOCATIONAL AND TEMPORAL LIMITATIONS
20220377428 · 2022-11-24

A shared media streaming system is configured to allow audience members of a movie screening or other live event to view that event via a user device such as a personal smartphone if they exit the area of the live event to visit a concession area or restroom. The shared media stream begins displaying the film on the user device starting at the same moment that the request to stream was made so that the user may leave and return to the screening room without missing substantial portions of the film, and without re-watching already viewed portions of the film. Viewing is limited to particular geofenced areas within a theater, and is limited in time so that users must return to the screening room if they wish to continue watching the film beyond the limited time. Other limitations are also enforced to prevent misuse of the system.
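The playback offset and the two limits described here can be sketched roughly as below. The `StreamSession` class, its field names, and the use of plain seconds for all times are illustrative assumptions. The abstract does not say how the geofence test is performed, so it is passed in as a boolean.

```python
from dataclasses import dataclass


@dataclass
class StreamSession:
    screening_start: float  # time the film started, in seconds (assumed unit)
    request_time: float     # time the user asked to stream
    time_limit: float       # maximum streaming duration, in seconds

    def start_offset(self) -> float:
        """Position in the film at which the stream begins."""
        return max(0.0, self.request_time - self.screening_start)

    def may_stream(self, now: float, inside_geofence: bool) -> bool:
        """Allow streaming only inside the geofence and within the time limit."""
        within_time = (now - self.request_time) <= self.time_limit
        return inside_geofence and within_time


session = StreamSession(screening_start=1000.0, request_time=1600.0, time_limit=300.0)
offset = session.start_offset()
```

Here the stream would start 600 seconds into the film, matching the moment the request was made.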

Content-modification system with volume level adjustment feature

In one aspect, a method includes receiving first content at a content-presentation device and presenting the first content, the first content comprising a first audio-content component. The content-presentation device may receive second content comprising a second audio-content component. The content-presentation device may determine a switch time at which to switch from presenting the first content to presenting the second content. During a first time interval prior to the switch time and ending at the switch time, the volume of the first audio-content component may be decreased to zero. At the switch time, the content-presentation device may switch from presenting the first content to presenting the second content. During a second time interval beginning at the switch time and ending at a second time after the switch time, the volume of the second audio-content component may be increased from zero to a non-zero volume level.
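The two volume ramps around the switch time can be sketched as a function of time. The linear ramp shape, the function name `volume_at`, and the parameter names are assumptions. The abstract only states that the first volume decreases to zero before the switch and the second increases from zero after it.

```python
from typing import Tuple


def volume_at(
    t: float,
    switch_time: float,
    fade_out: float,
    fade_in: float,
    level: float = 1.0,
) -> Tuple[float, float]:
    """Return (first_content_volume, second_content_volume) at time t.

    Assumptions: linear ramps, positive fade durations, and the same
    nominal `level` for both audio-content components.
    """
    if t < switch_time - fade_out:
        return (level, 0.0)  # before the first interval: first content at full level
    if t < switch_time:
        remaining = (switch_time - t) / fade_out
        return (level * remaining, 0.0)  # first interval: ramp down to zero
    if t < switch_time + fade_in:
        progress = (t - switch_time) / fade_in
        return (0.0, level * progress)  # second interval: ramp up from zero
    return (0.0, level)  # after the second interval: second content at full level


samples = [volume_at(t, switch_time=10.0, fade_out=2.0, fade_in=2.0) for t in (0.0, 9.0, 10.0, 11.0, 13.0)]
```

At the switch time itself both volumes are zero in this sketch, so the change of content happens at silence.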

Identifying and removing restricted information from videos
11587591 · 2023-02-21

A video is provided to viewers using a web-based platform without restricted audio, such as a copyrighted soundtrack. To do so, a video comprising at least two audio layers is received. The audio layers can include separate and distinct audio layers or a mix of audio from separate sources. A restricted audio element is identified in a first audio layer and a speech element is identified in a second audio layer. A stitched text string can be generated by performing speech-to-text on both audio layers and removing the text corresponding to the restricted audio element of the second audio layer. When playing back the video, a portion of the video is muted based on the restricted audio element. A voice synthesizer is employed to generate audible sound during the muted portion using the stitched text string.

Display Device and Volume Control Method

A display apparatus includes a display, a remote controller, and a controller. The controller is configured to: receive a first command for fitness training; cause the display to present one or more modes, including a follow-up mode and an exercising-while-watching mode, for selection; in response to selection of the follow-up mode, cause the display to show a first window and a second window disposed on a first user interface for receiving a focus move command from the remote controller, with the first window displaying a first video associated with fitness training and the second window displaying a second video associated with image data from a camera; receive a volume adjustment command; and, in response to the focus being on a volume control for the first or second video, adjust a volume of the first or second video in response to the volume adjustment command. A method for controlling the apparatus is also disclosed.

METHODS AND SYSTEMS TO PROVIDE A PLAYLIST FOR SIMULTANEOUS PRESENTATION OF A PLURALITY OF MEDIA ASSETS

Systems and methods are described herein for generating a playlist for a simultaneous presentation of a plurality of media assets. The system retrieves a user preference associated with a user profile and receives a selection of a first media asset and a second media asset from the plurality of media assets for presentation on a user device. The system parses the respective audio streams of the first media asset and the second media asset to identify one or more preferred audio segments based on the user preference and generates the playlist of the identified one or more preferred audio segments. Based on a generated audio playlist, the system generates, for presentation on the user device, the video stream for each of the first media asset and the second media asset and the playlist of the identified one or more preferred audio segments.