Patent classifications
H04N21/23439
Incorporating visual objects into video material
This disclosure relates to the embedding of visual objects into the image content of a video by a visual embed specialist, whilst maintaining the security of the video. A low-resolution version of the video content is sent by the video owner to the specialist for analysis to identify parts of the video that are suitable for visual object insertion. A high resolution version of those identified parts of the video is then sent to the specialist for visual object insertion. The specialist may then return the modified parts of the video and the content owner create a final version of the high-resolution video by replacing the relevant parts of the high-resolution video with the modified parts.
Rendering video stream in sub-area of visible display area
A system and method for rendering a video stream for display on a display device, which may comprise receiving the video stream by streaming and generating display data defining a visible display area to be displayed by the display device, wherein generating of display data comprises rendering the video stream in a sub-area of the visible display area. A selection mechanism may be provided by which a version of the video stream may be selected for streaming having a spatial resolution and/or a bitrate which is selected based on a size of the sub-area.
Rendering stream controller
A system is provided for a rendering stream distributor controller for use with a plurality of content sources, a HTML code repository, a plurality of video rendering engines and a distribution network. The rendering stream distributor controller includes an outbound IP address inventory system, a video rendering engine and network elements inventory system and a rendering stream controller. The rendering stream controller is able to provide a stream instruction, based on one of a plurality of outbound IP addresses, one of a plurality of HTML content identification data, and one of a plurality of sets of HTML code so as to instruct one of the plurality of video rendering engines to output an MPEG transport stream.
Dynamic video background responsive to environmental cues
Aspects of the subject disclosure may include, for example, generating a video scene adapted to establish an ambiance setting and providing the video scene to equipment of a user for presentation at a display device. A visual focus is detected of the user towards presentation of the video scene displayed proximate to the user and the video scene is modified responsive to the visual focus. The modifying of the video scene is adapted to discourage visual focus of the user towards the presentation of the video scene. Other embodiments are disclosed.
Systems and methods for adapting content items to endpoint media devices
Systems, methods, and machine-readable media for adapting content items to device operations of an endpoint media device are disclosed. A first content composite may be created by one or more processing devices for delivery in a packet stream from a content provider system via one or more networks, where the first content composite may include an adaptable content item corresponding to a set of one or more audio and/or video packets corresponding to audio and/or video content. The first content composite may be transmitted, where, consequent to delivery of the first content composite to a first endpoint media device or a second endpoint media device, the first endpoint media device or the second endpoint media device may perform one operation relating to a second content item of the first content composite, where the adaptable content item may be modified based on the second content item.
Power aware video decoding and streaming
Methods and systems are disclosed for a mobile device to decode video based on available power and/or energy. For example, the mobile device may receive a media description file (MDF) from for a video stream from a video server. The MDF may include complexity information associated with a plurality of video segments. The complexity information may be related to the amount of processing power to be utilized for decoding the segment at the mobile device. The mobile device may determine at least one power metric for the mobile device. The mobile device may determine a first complexity level to be requested for a first video segment based on the complexity information from the MDF and the power metric. The mobile device may dynamically alter the decoding process to save energy based on the detected power/energy level.
Method and device for transmitting information on three-dimensional content including multiple view points
Provided is a method for transmitting metadata for omnidirectional content including a plurality of viewpoints. The method comprises identifying the metadata for the omnidirectional content including the plurality of viewpoints; and transmitting the identified metadata, wherein the metadata includes information about an identifier (ID) of a viewpoint group including at least one viewpoint of the plurality of viewpoints, and wherein the at least one viewpoint in the viewpoint group shares a common reference coordinate system.
Live streaming with live video production and commentary
A method comprises receiving from each of a plurality of commentator applications respective distinct streams of media content comprising commentary information, combining at least portions of selected ones of the distinct streams of media content comprising commentary information in a mixer associated with a media server to generate a composite media content stream, and providing the composite media content stream generated by the mixer to one or more servers of a content delivery network for delivery to one or more viewer devices. The commentary information of a given one of the distinct streams of media content received from a corresponding one of the commentator applications illustratively comprises at least one of audio content, video content, image content, social media posting content, chat text and closed caption text. The mixer may comprise a post-mixer coupled to the media server.
Network protocol for enabling enhanced features for media content
A web server provides media content for playback on client devices that is associated with a set of enhanced features. When enabled, each enhanced feature provides functionality associated with and supplemental to the media content. The web server performs a handshake with each of a plurality of client devices to authenticate a communication channel. The web server enables one or more of the enhanced features while delivering media content for playback on a first device if information exchanged in the handshake with the first device satisfies a first criterion. The web server delivers the media content for playback on a second device, without enabling any of the enhanced features, if information exchanged in the handshake with the second device does not satisfy the first criterion.
Model-based dubbing to translate spoken audio in a video
Model-based dubbing techniques are implemented to generate a translated version of a source video. Spoken audio portions of a source video may be extracted and semantic graphs generated that represent the spoken audio portions. The semantic graphs may be used to produce translations of the spoken portions. A machine learning model may be implemented to generate replacement audio for the spoken portions using the translation of the spoken portion. A machine learning model may be implemented to generate modifications to facial image data for a speaker of the replacement audio.