H04N21/234345

Systems and methods to insert supplemental content into presentations of two-dimensional video content
11704882 · 2023-07-18

Systems and methods for inserting supplemental content into presentations of two-dimensional video content are disclosed. Exemplary implementations may: obtain two-dimensional video content depicting a three-dimensional space; obtain supplemental content; obtain a model of the three-dimensional space defining one or more visible physical features within the three-dimensional space; determine the camera position of the two-dimensional video content; identify a presentation location within the two-dimensional video content; determine integration information; modify the two-dimensional video content to include the supplemental content at the identified presentation location in accordance with the integration information; and/or perform other operations.

Imaging system, server device, control method for server device, and storage medium
11706534 · 2023-07-18

An imaging system including an imaging device 501 and a recording server 502 communicatively connected to the imaging device 501, wherein the imaging device 501 includes an imaging unit 503 that generates a video with a plurality of resolutions, a dividing unit 504 that performs a division process of dividing the video generated by the imaging unit 503 into one or a plurality of tile areas and generates a tile image, and a transmission unit 506 that transmits the video to the recording server 502, and wherein the recording server 502 includes a division control unit 507 that outputs, to the imaging device 501, an instruction to change a division method for the division process according to a designation frequency of an area designated on the video transmitted from the imaging device 501.
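
The frequency-driven division control described in this abstract can be illustrated with a minimal sketch. The abstract does not state how the designation frequency maps to a division method, so the policy here (double the grid when one tile attracts a large share of designations) and the names `tile_index`, `choose_division`, and `hot_share` are hypothetical and not taken from the patent.

```python
from collections import Counter


def tile_index(x, y, frame_w, frame_h, cols, rows):
    """Return the (col, row) of the tile that contains pixel (x, y)."""
    col = min(x * cols // frame_w, cols - 1)
    row = min(y * rows // frame_h, rows - 1)
    return col, row


def choose_division(designations, frame_w, frame_h, cols, rows, hot_share=0.5):
    """Pick a tile grid from how often each tile has been designated.

    Assumed policy (not from the patent): if a single tile receives at
    least `hot_share` of all designations, double the grid in both
    directions so the popular area is covered by smaller tiles.
    Otherwise keep the current grid.
    """
    if not designations:
        return cols, rows
    counts = Counter(
        tile_index(x, y, frame_w, frame_h, cols, rows) for x, y in designations
    )
    _, top_count = counts.most_common(1)[0]
    if top_count / len(designations) >= hot_share:
        return cols * 2, rows * 2
    return cols, rows
```

In this sketch the server side would call `choose_division` on the logged designation points and send the resulting grid size to the imaging device as the instruction to change the division method.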

VISUAL ASSETS OF AUDIOVISUAL SIGNALS
20230013557 · 2023-01-19

In some examples, an electronic device includes a network interface and a processor. The processor is to analyze an audiovisual signal received via the network interface to identify a topic, identify information related to the topic, and cause a display device to display a visual asset for the information in a video representing the audiovisual signal.

METHOD AND SYSTEM FOR LIVE VIDEO STREAMING WITH INTEGRATED ENCODING AND TRANSMISSION SEMANTICS

This disclosure relates generally to a method and system for live video streaming with integrated encoding and transmission semantics. The system receives a set of frames associated with a live video stream encoded to generate a set of data fragments using a reference encoder and a delta encoder. A transmitter unit of the live video streaming protocol transmits each packet of the set of full frames and the set of delta frames in sequence with a payload-specific header based on a packet mode. Further, a receiver unit receives each packet of the full frames and each packet of the delta frames based on the packet mode to reconstruct an original sequence from the foreground pixels by estimating a total number of packets expected at each frame interval and the loss incurred in each packet of the set of full frames and the set of delta frames.

Method and apparatus for generating media data
11700434 · 2023-07-11

The present invention concerns a method for generating media files from video sequences, the method comprising, by a server: obtaining, from the video sequences, video data composed of a plurality of samples; generating video tracks based on the obtained video data, wherein each video track comprises samples of a video sequence and is associated with descriptive metadata, the descriptive metadata comprising: spatial information related to one or more samples of the associated video track; and composition information for organizing the generated video tracks to get a full picture when displayed by a client; and generating media files including the generated video tracks.

Automated video cropping

The disclosed computer-implemented method may include receiving, as an input, segmented video scenes, where each video scene includes a specified length of video content. The method may further include scanning the video scenes to identify objects within the video scene and also determining a relative importance value for the identified objects. The relative importance value may include an indication of which objects are to be included in a cropped version of the video scene. The method may also include generating a video crop that is to be applied to the video scene such that the resulting cropped version of the video scene includes those identified objects that are to be included based on the relative importance value. The method may also include applying the generated video crop to the video scene to produce the cropped version of the video scene. Various other methods, systems, and computer-readable media are also disclosed.
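
The crop-generation step in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the disclosed method: the abstract does not say how the relative importance value is computed or thresholded, so the `DetectedObject` type, the `min_importance` cutoff, and the choice of the smallest rectangle enclosing all retained objects are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class DetectedObject:
    label: str
    box: tuple  # (left, top, right, bottom) in pixels
    importance: float  # assumed to lie in [0, 1]


def crop_for_scene(objects, frame_w, frame_h, min_importance=0.5):
    """Return a crop (left, top, right, bottom) covering the important objects.

    Objects whose importance is below `min_importance` are ignored.
    If no object qualifies, the full frame is returned unchanged.
    """
    keep = [o for o in objects if o.importance >= min_importance]
    if not keep:
        return (0, 0, frame_w, frame_h)
    # Smallest rectangle enclosing every retained box, clamped to the frame.
    left = max(0, min(o.box[0] for o in keep))
    top = max(0, min(o.box[1] for o in keep))
    right = min(frame_w, max(o.box[2] for o in keep))
    bottom = min(frame_h, max(o.box[3] for o in keep))
    return (left, top, right, bottom)
```

A production system would likely also enforce a target aspect ratio and smooth the crop across frames. Both are omitted here to keep the sketch short.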

Method and apparatus for processing video

A method and an apparatus for processing a video are provided. The method may include: in response to acquiring a target video stream, separating a foreground image and a background image from a video frame in the target video stream; adding to-be-displayed content at a target display position in the background image to obtain a processed background image; and combining the foreground image and the processed background image to obtain a target video frame. The present disclosure may directly render the to-be-displayed content in the background, so that the content displayed in the background does not block a body in the foreground, such as a person.
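
The three steps in this abstract (separate, add content to the background, recombine) can be sketched on a toy frame. This sketch assumes the foreground/background separation has already produced a boolean mask; how the mask is obtained is not covered. The function name `composite_frame` and the list-of-lists frame representation are illustrative choices, not part of the disclosure.

```python
def composite_frame(frame, fg_mask, content, pos):
    """Draw `content` behind the foreground of `frame`.

    frame   : 2D list of pixels (any value type)
    fg_mask : 2D list of bools, True where the pixel is foreground
    content : 2D list of pixels to display in the background
    pos     : (top, left) target display position of the content
    """
    rows, cols = len(frame), len(frame[0])
    top, left = pos

    # Step 1: start the processed background from a copy of the frame.
    background = [row[:] for row in frame]

    # Step 2: add the to-be-displayed content at the target position,
    # skipping any part that falls outside the frame.
    for r, content_row in enumerate(content):
        for c, pixel in enumerate(content_row):
            rr, cc = top + r, left + c
            if 0 <= rr < rows and 0 <= cc < cols:
                background[rr][cc] = pixel

    # Step 3: recombine, keeping original pixels wherever the mask marks
    # foreground so the content never covers the foreground body.
    return [
        [frame[r][c] if fg_mask[r][c] else background[r][c] for c in range(cols)]
        for r in range(rows)
    ]
```

Because the foreground pixels are taken from the original frame in the last step, the inserted content appears to sit behind the person rather than on top of them.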

Re-encoding predicted picture frames in live video stream applications
11700419 · 2023-07-11

In various examples, a media stream may be received by a re-encode system that may leverage a recode engine to convert (e.g., at an interval, based on a request, etc.) an inter-frame associated with the media stream to an intra-frame. The intra-frame may be converted from the inter-frame using parameters or other information associated with and received with the media stream. The converted intra-frame may be merged into an updated segment of the media stream in place of the original inter-frame to enable storage of the updated segment—or a portion thereof—for later use.

AUTOMATED CONTENT IDENTIFICATION FOR BINGE WATCHING OF DIGITAL MEDIA
20230217073 · 2023-07-06

“Binge watching” of multiple episodes of a program is improved by the player device automatically skipping repeated portions of the program. Opening and closing credit scenes, for example, can be automatically skipped to thereby allow the viewer to progress through the entire season of programming at an even faster rate than was previously thought possible. Programming to be skipped may be identified by detecting audio or other digital fingerprints in the content itself, for example. Content to be skipped may be identified to the playback device according to presentation time stamp (PTS) or other time markers.
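
The PTS-based skipping described in this abstract can be sketched as a small player-side helper. This assumes the skip ranges have already been identified (for example by fingerprint matching, which is not shown) and delivered as `(start_pts, end_pts)` pairs. The function name and the half-open range convention are hypothetical.

```python
def next_playback_pts(current_pts, skip_ranges):
    """Return the PTS the player should present next.

    `skip_ranges` is an iterable of (start_pts, end_pts) pairs, treated
    as half-open intervals [start, end). If `current_pts` falls inside
    a range, playback jumps to the end of that range. Ranges are visited
    in sorted order so adjacent or overlapping ranges are skipped in a
    single call.
    """
    pts = current_pts
    for start, end in sorted(skip_ranges):
        if start <= pts < end:
            pts = end
    return pts
```

As a usage illustration, MPEG-2 transport streams express PTS on a 90 kHz clock, so a 30-second opening credit sequence starting at PTS 0 would be the range `(0, 2_700_000)`. A player calling `next_playback_pts(0, [(0, 2_700_000)])` would then resume at PTS 2,700,000.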

Low latency wireless virtual reality systems and methods

Virtual Reality (VR) processing devices and methods are provided for transmitting user feedback information comprising at least one of user position information and user orientation information, receiving encoded audio-video (A/V) data, which is generated based on the transmitted user feedback information, separating the A/V data into video data and audio data corresponding to a portion of a next frame of a sequence of frames of the video data to be displayed, decoding the portion of a next frame of the video data and the corresponding audio data, providing the audio data for aural presentation and controlling the portion of the next frame of the video data to be displayed in synchronization with the corresponding audio data.