H04N21/23439

Efficient Delivery of Multi-Camera Interactive Content
20230216908 · 2023-07-06 ·

Techniques are disclosed relating to encoding recorded content for distribution to other computing devices. In various embodiments, a first computing device records content of a physical environment in which the first computing device is located, the content being deliverable to a second computing device configured to present a corresponding environment based on the recorded content and content recorded by one or more additional computing devices. The first computing device determines a pose of the first computing device within the physical environment and encodes the pose in a manifest usable to stream the content recorded by the first computing device to the second computing device. The encoded pose is usable by the second computing device to determine whether to stream the content recorded by the first computing device.

METHODS AND SYSTEMS FOR RESOURCE MANAGEMENT
20230217050 · 2023-07-06 ·

Methods and systems for resource management are described. A computing device may determine a duration of a content break satisfies a timing threshold. Based on the duration of the content break satisfying the timing threshold, the computing device may cause an encoder to cease an encoding operation. Based on the duration of the content break satisfying the timing threshold, the computing device may cause the encoder to send to a media device, secondary content and cause the media device to repeatedly output the secondary content rather than sending repeated requests for the secondary content.

Power aware adaptation for video streaming

Power aware adaptation for a power aware video streaming system may be based on the complexity information conveyed in different ways. A complexity level of a data stream, such as a video data stream, may be selected as a function of a remaining battery power of a wireless transmit/receive unit (WTRU) and on a state set of a plurality of state sets that may be stored and/or managed by the WTRU. These state sets may correspond to, for example, different content sources and/or different complexity estimation algorithms and may be used to select the complexity level of the data stream. The data stream may then be received at the selected complexity level. The complexity level and/or a bitrate of the data stream may be adapted to accommodate, for example, the remaining battery power and/or other circumstances. The adaptation may be customized according to the objectives of use cases.

Cloud-based digital content recorder apparatus and methods

Apparatus and methods for the storage and delivery of content over a network. In one embodiment, the network comprises a managed content distribution network, and the apparatus comprises a “cloud” digital video recorder (cDVR) server which stores, processes and assembles content prior to its delivery in a way that obviates superfluous storage, and gains significant network efficiencies. This approach also facilitates various pre-streaming processing in the video recording (e.g., cDVR) system. One or more content libraries adapted to facilitate cDVR content assembly and access are also disclosed. Secondary content (e.g., promotions, advertisements, etc.) can also be selectively inserted into cDVR content. Various access, business or operational rules and methods implementing the foregoing are also described.

Video streaming

A server for streaming a video to a client involves making the video available from the server to the client upon request in at least a temporal independent version and a temporal dependent version. The server is configured for: i) receiving a request from the client to receive a stream of the video from an arbitrary starting point in time; and ii) retrieving at least the first frame from the temporal independent version; and iii) retrieving frames subsequent to the at least first frame from the temporal dependent version; and iv) sending the at least first frame to the client and send the frames subsequent to the at least first frame to the client.

Methods and apparatus to facilitate using a streaming manifest including a profile indication
11695817 · 2023-07-04 · ·

The present disclosure relates to methods and devices for facilitating using a streaming manifest including a profile signal for content encoded based on a encapsulation format. An example method disclosed herein includes identifying structures associated with first media, wherein the first media is associated with a first streaming format. The example method also includes generating a streaming manifest for the first media, wherein the streaming manifest maps the structures associated with the first media to structures associated with a second streaming format, and including an indication with the streaming manifest, wherein the indication identifies the streaming profile used to map the structures.

Enhanced immersive digital media
11696001 · 2023-07-04 · ·

This disclosure describes systems, methods, and devices related to immersive digital media. A method may include receiving, at a first device, first volumetric data, and second volumetric data including a first volumetric time slice of a first volumetric media stream. The method may include determining that the first volumetric time slice includes a first portion and a second portion, the first portion representing a first object and including an amount of the second volumetric data. The method may include determining that the first volumetric data represents the first object. The method may include generating a second volumetric time slice including the first volumetric data and the second portion of the first volumetric time slice, and generating a second volumetric media stream including the second volumetric time slice. The method may include sending the second volumetric media for presentation at a third device.

Methods for generating video-and audience-specific encoding ladders with audio and video just-in-time transcoding

A method including: populating an encoding ladder with a subset of bitrate-resolution pairs, from a set of bitrate-resolution pairs, based on a distribution of audience bandwidths; receiving a first request for a first playback segment, at a first bitrate-resolution pair in a encoding ladder, in the video from a first device; in response to determining an absence of video segments, at the first bitrate-resolution pair and corresponding to the segment, in a first rendition cache: identifying a first set of mezzanine segments, in the video, corresponding to the first playback segment; assigning the first set of mezzanine segments to a set of workers for transcoding into a first set of video segments according to the first bitrate-resolution pair; storing the first set of video segments in the first rendition cache; and based on the first request, releasing the first set of video segments to the first device.

Signaling the Purpose of Preselection
20230007210 · 2023-01-05 ·

A method of processing media data. The method includes determining, for a conversion between the media data and a media data file, that a preselection element includes an indicator, wherein the indicator indicates that a purpose of the preselection element is for providing a picture-in-picture experience where a supplementary video appears to be overlaid on a target picture-in-picture region in a main video; and performing a conversion between the media data and the media data file based on the indicator. A corresponding video coding apparatus and non-transitory computer-readable recording medium are also disclosed.

Per-Title Encoding Using Spatial and Temporal Resolution Downscaling
20230007298 · 2023-01-05 · ·

Techniques relating to per-title encoding using spatial and temporal resolution downscaling is disclosed. A method for per-title encoding includes receiving a video input comprised of video segments, spatially downscaling the video input, temporally downscaling the video input, encoding the video input to generate an encoded video, then temporally and spatially upscaling the encoded video. Spatially downscaling may include reducing a resolution of the video input, and temporally downscaling may include reducing a framerate of the video input. Objective metrics for the upscaled encoded video show improved quality over conventional methods.