H04N21/23439

Prioritizing encoding of video data received by an online system to maximize visual quality while accounting for fixed computing capacity

An online system receives video data items from users and encodes the video data items using various codecs. To account for different computational resources used for encoding using different codecs, the online system ranks combinations of video data items by ratios of encoding video data items with different codecs to computational costs of encoding different video data items with different codecs. The benefit of encoding a video data item with a codec is based on a compression efficiency of the codec and a predicted aggregate amount of the video data item displayed to various users of the online system. Encoding video data items with codecs based on the determined ratios allows the online system to optimize a duration of video data having at least a threshold video quality to users.

Foveation-based image encoding and decoding

An encoding method and a decoding method. The encoding method includes generating curved image by creating projection of visual scene onto inner surface of imaginary 3D geometric shape that is curved in at least one dimension; dividing curved image into input portion and plurality of input rings; encoding input portion and input rings into first planar image and second planar image, respectively, such that input portion is stored into first planar image, and input rings are packed into corresponding rows of second planar image; and communicating, to display apparatus, first and second planar images and information indicative of sizes of input portion and input rings.

Media content presentation

A method of presenting media content is disclosed. A plurality of assets is received at a mobile device comprising a display and an orientation sensor. The plurality of assets comprises a first video asset associated with a first aspect ratio, and a second video asset associated with a second aspect ratio, different from the first aspect ratio. A desired aspect ratio is determined based on an output of the orientation sensor. In accordance with a determination that the desired aspect ratio is closer to the first aspect ratio than to the second aspect ratio, the first video asset is selected. In accordance with a determination that the desired aspect ratio is closer to the second aspect ratio than to the first aspect ratio, the second video asset is selected. The selected video is presented at the desired aspect ratio via the display.

Method and apparatus for automatic HLS bitrate adaptation

A media player system is provided for receiving and processing a media program that uses a time interval interval t.sub.D required to decode N.sub.D frames of the media program segment. The media system receives the requested media program segment, processes the segment and determines if the throughput of the media program differs from the desired presentation throughput by a tolerance amount. Both decoding and rendering performance are determined and used to determine presentation throughput, and to determine if heavier or lighter variants of the media program should be requested for subsequent media program segments.

Random access at resync points of dash segments

A device for retrieving media includes a memory configured to store media data of a media presentation; and one or more processors implemented in circuitry and configured to: retrieve a manifest file for a media presentation indicating that container parsing of media data of a bitstream can be started at a resync point of a segment of a representation of the media presentation, the resync point being at a position other than a start of the segment and representing a point at which the container parsing of the media data of the bitstream can be started; use the manifest file to form a request to retrieve the media data of the representation starting at the resync point; send the request to initiate retrieval of the media data of the media presentation starting at the resync point; and present the retrieved media data.

Video encoding based on customized bitrate table
11563951 · 2023-01-24 · ·

An original input content is subjected to multiple constant quality probe encodes for a defined set of resolutions. In one embodiment, probe encodes encode a few parts of the original source video, for example, 30 seconds from 5 different positions. Each probe encode delivers an average bitrate that is required to achieve the configured constant quality. The mean value of the average bitrate is taken per resolution. This results in a list of bitrates that map to a resolution that would achieve the best quality, a custom bitrate table. Based on the custom bitrate table, an optimized bitrate ladder is computed. The process starts with a configurable minimum bitrate and steps up by a bitrate step size that is between a configurable min and max bitrate step size until the bitrate of the highest resolution is reached.

Scalable FOV+ for VR 360 video delivery to remote end users

A distribution device for delivering a selected viewport stream of virtual reality (VR) data to each of a plurality of client devices, comprising a processor configured for receiving a plurality of extended viewport streams of a VR video file each comprising a sequence of extended field of view (EFOV) frames created for a respective one of a plurality of overlapping segments constituting a sphere defined in the VR video file and delivering a selected one of the plurality of extended viewport streams to each of a plurality of client devices by performing the following for each of the client devices in each of a plurality of iterations: (1) receiving a current orientation data of the respective client device; (2) selecting one of the plurality of extended viewport streams according to the current orientation data; and (3) transmitting the selected extended viewport stream to the respective client device.

Positional zero latency

Based on viewing tracking data, a viewer's view direction to a three-dimensional (3D) scene depicted by a first video image is determined. The first video image has been streamed in a video stream to the streaming client device before the first time point and rendered with the streaming client device to the viewer at the first time point. Based on the viewer's view direction, a target view portion is identified in a second video image to be streamed in the video stream to the streaming client device to be rendered at a second time point subsequent to the first time point. The target view portion is encoded into the video stream with a higher target spatiotemporal resolution than that used to encode remaining non-target view portions in the second video image.

Signaling Size and Position of Target Picture-In-Picture Region
20230019913 · 2023-01-19 ·

A method of processing media data. The method includes determining, for a conversion between the media data and a media data file, whether a size and a position of a target picture-in-picture region in a main video are present in the media data file, wherein a supplementary video appears to be overlaid on the target picture-in-picture region when the main video is displayed; and performing the conversion between the media data and the media data file based on the size and the position determined. A corresponding video coding apparatus and non-transitory computer-readable recording medium are also disclosed.

Imaging system, server device, control method for server device, and storage medium
11706534 · 2023-07-18 · ·

An imaging system including an imaging device 501 and a recording server 502 communicatively connected to the imaging device 501, wherein the imaging device 501 includes an imaging unit 503 that generates a video with a plurality of resolution, a dividing unit 504 that performs a division process of dividing the video generated by the imaging unit 503 into one or a plurality of tile areas and generates a tile image, and a transmission unit 506 that transmits the video to the recording server 502, wherein the recording server 502 includes a division control unit 507 that outputs an instruction to change a division method for the division process to the imaging device according to a designation frequency of an area designated on the video transmitted from the imaging device 501.