H04N21/440245

REMOTE IMAGE PROCESSING METHOD AND APPARATUS
20230033785 · 2023-02-02 ·

A remote image processing method, applied to a remote server, includes: obtaining a recommended bit rate, where the recommended bit rate matches an environment parameter of a network in which the remote server is located, and the network environment parameter is used to represent a capability of transmitting an amount of data by the network in a unit time; and generating, based on the recommended bit rate, adjustment parameters corresponding to different regions in a to-be-processed image, and processing the corresponding regions by using the adjustment parameters, to obtain a single-frame image used for display, so that an amount of data included in the single-frame image matches the recommended bit rate.

IMAGE DISPLAY SYSTEM, MOVING IMAGE DISTRIBUTION SERVER, IMAGE PROCESSING APPARATUS, AND MOVING IMAGE DISTRIBUTION METHOD
20220353555 · 2022-11-03 · ·

A server performs a part of a forming process necessary for conversion into formats corresponding to display modes of a head mounted display and a flat-plate display connected to an image processing apparatus, and transmits a processing result to the image processing apparatus. At this time, the server switches the part of the process to transmit any one of a pair of a left-eye image and a right-eye image, an image suited for the flat-plate display, and an image constituted by a left-eye image and a right-eye image to each of which distortion for an ocular lens has been given.

CODING SCHEME FOR IMMERSIVE VIDEO WITH ASYMMETRIC DOWN-SAMPLING AND MACHINE LEARNING
20220345756 · 2022-10-27 ·

Methods of encoding and decoding immersive video are provided. In an encoding method, source video data comprising a plurality of source views is encoded into a video bitstream. At least one of the source views is down-sampled prior to encoding. A metadata bitstream associated with the video stream comprises metadata describing a configuration of the down-sampling, to assist a decoder to decode the video bitstream. It is believed that the use of down-sampled views may help to reduce coding artifacts, compared with a patch-based encoding approach. Also provided are an encoder and a decoder for immersive video, and an immersive video bitstream.

SCALING WINDOW IN SUBPICTURE SUB-BITSTREAM EXTRACTION PROCESS
20230080061 · 2023-03-16 ·

Embodiments for video processing, including video coding, video decoding and video transcoding are described. One example method includes performing a conversion between a video comprising one or more video pictures comprising one or more subpictures and a bitstream of the video, wherein the conversion conforms to a rule that specifies that one or more parameters for a scaling window applicable to a subpicture are determined from one or more syntax elements during a subpicture sub-bitstream extraction process.

VIDEO PROCESSING DEVICE, VIDEO PROCESSING METHOD, VIDEO GENERATION DEVICE, VIDEO GENERATION METHOD, AND RECORDING MEDIUM
20230076845 · 2023-03-09 ·

A video processing device includes an acquirer that acquires video data via a predetermined transmission line, the video data including video and metadata that indicates a first frequency band that is a spatial frequency range in which the video is present; an adjuster that makes sharpness gain adjustment to video such that, among a plurality of regions of the video included in the video data acquired by the acquirer, a sharpness gain for a first region that belongs to the first frequency band indicated by the metadata exceeds a sharpness gain for a second region that belongs to a second frequency band that is a range outside the first frequency band; and an output device that outputs video adjusted by the adjuster.

VIDEO COMPRESSION AND STREAMING
20230071585 · 2023-03-09 ·

A method, system and product for compressing a video frame. The method comprising: obtaining a video frame that comprises at least a first area of interest; determining at least the first area of interest based on a portion of an object displayed therein; determining at least a portion of the frame based on at least the first areas of interest; determining at least a first processing channel based on at least the first areas of interest, wherein first processing channel comprises at least a first actions, wherein the first processing action is associated with at least one processing action parameters; and processing at least the first portion by utilizing at least the first processing channels, whereby an alternative video frame can be constructed based on at least a first processed portions of the video frame.

Creative intent scalability via physiological monitoring

Creative intent input describing emotion expectations and narrative information relating to media content is received. Expected physiologically observable states relating to the media content are generated based on the creative intent input. An audiovisual content signal with the media content and media metadata comprising the physiologically observable states is provided to a playback apparatus. The audiovisual content signal causes the playback device to use physiological monitoring signals to determine, with respect to a viewer, assessed physiologically observable states relating to the media content and generate, based on the expected physiologically observable states and the assessed physiologically observable states, modified media content to be rendered to the viewer.

Pattern addressing for session-based DASH operations
11638056 · 2023-04-25 · ·

A method of session-based DASH operations can include receiving a media presentation description (MPD) referencing a session-based description (SBD) and indicating a key name during a media access session. The SBD includes a first repeating pattern element that includes a first sequence of timed key values of the key name. The first repeating pattern element indicates that the first sequence of the timed key values of the key name is relocated along a timeline or an orderline. A first key value of the key name corresponding to a timing or a segment number of a current segment of a sequence of segments can be determined based on the first repeating pattern element in the SBD. A request for the current segment can be transmitted to a media content server. The request includes a pair of the key name and the first key value.

DECODED PICTURE BUFFER MANAGEMENT AND SUBPICTURES IN VIDEO CODING
20230119084 · 2023-04-20 ·

Embodiments for video encoding and video decoding are described. One example method includes performing a conversion between a video and a bitstream of the video, wherein the bitstream includes one or more pictures including one or more subpictures according to a rule, and wherein the rule specifies that, responsive to a condition, a rewriting operation is performed on referenced one or more sequence parameter sets during a subpicture sub-bitstream extraction process by which a target output sub-bitstream is extracted from the bitstream.

VIDEO-BASED POINT CLOUD STREAMS
20220329923 · 2022-10-13 · ·

Systems, methods, and instrumentalities are disclosed that relate to the processing of a media container file associated with 3D video data. The media container file may indicate that certain video-based point cloud compression (V-PCC) component tracks may be played together as a playout group. These V-PCG component tracks may represent respective encoded versions of one or more V-PCC components, and a video decoding device may play the tracks together in response to determining that the tracks belong to the same playout track group. The video decoding device may also determine from the media container file that certain PCC component tracks include tile groups that correspond to different objects in a point cloud or different parts of a same object in the point cloud. The video decoding device may decode these tile groups independently from each other so that a subset of the objects or parts of the point cloud may be accessed without also accessing the rest of the objects or parts.