H04N2013/0085

Method and apparatus for processing video signal
11785257 · 2023-10-10 · ·

A method for processing a video according to the present invention may comprise: generating a plurality of Most Probable Mode (MPM) candidates; determining whether there is an MPM candidate identical to an intra-prediction mode of a current block among the plurality of MPM candidates; obtaining the intra-prediction mode of the current block, based on a result of the determining; and performing an intra-prediction for the current block, based on the intra-prediction mode of the current block.

Virtual reality environment based manipulation of multi-layered multi-view interactive digital media representations

Various embodiments of the present disclosure relate generally to systems and methods for generating multi-view interactive digital media representations in a virtual reality environment. According to particular embodiments, a plurality of images is fused into a first content model and a first context model, both of which include multi-view interactive digital media representations of objects. Next, a virtual reality environment is generated using the first content model and the first context model. The virtual reality environment includes a first layer and a second layer. The user can navigate through and within the virtual reality environment to switch between multiple viewpoints of the content model via corresponding physical movements. The first layer includes the first content model and the second layer includes a second content model and wherein selection of the first layer provides access to the second layer with the second content model.

Method and apparatus of encoding/decoding image data based on tree structure-based block division

Disclosed are methods and apparatuses for image data encoding/decoding. A method of decoding an image includes receiving a bitstream in which the image is encoded; obtaining index information for specifying a block division type of a current block in the image; and determining the block division type of the current block from a candidate group pre-defined in the decoding apparatus. The candidate group includes a plurality of candidate division types, including at least one of a non-division, a first quad-division, a second quad-division, a binary-division or a triple-division. The method also includes dividing the current block into a plurality of sub-blocks; and decoding each of the sub-blocks with reference to syntax information obtained from the bitstream.

METHOD AND APPARATUS OF ENCODING/DECODING IMAGE DATA BASED ON TREE STRUCTURE-BASED BLOCK DIVISION
20230232043 · 2023-07-20 · ·

Disclosed are methods and apparatuses for image data encoding/decoding. A method of decoding an image includes receiving a bitstream in which the image is encoded; obtaining index information for specifying a block division type of a current block in the image; and determining the block division type of the current block from a candidate group pre-defined in the decoding apparatus. The candidate group includes a plurality of candidate division types, including at least one of a non-division, a first quad-division, a second quad-division, a binary-division or a triple-division. The method also includes dividing the current block into a plurality of sub-blocks; and decoding each of the sub-blocks with reference to syntax information obtained from the bitstream.

Passive Wide-Area Three-Dimensional Imaging
20230054721 · 2023-02-23 · ·

Radar, lidar, and other active 3D imaging techniques require large, heavy sensors that consume lots of power. Passive 3D imaging techniques based on feature matching are computationally expensive and limited by the quality of the feature matching. Fortunately, there is a robust, computationally inexpensive way to generate 3D images from full-motion video acquired from a platform that moves relative to the scene. The full-motion video frames are registered to each other and mapped to the scene coordinates using data about the trajectory of the platform with respect to the scene. The time derivative of the registered frames equals the product of the height map of the scene, the projected angular velocity of the platform, and the spatial gradient of the registered frames. This relationship can be solved in (near) real time to produce the height map of the scene from the full-motion video and the trajectory.

Augmented virtuality self view

A processor system processes image data for rendering a virtual environment for a user present in a real environment. The system receives head tracking data indicative of the orientation of the head of the user. An image processor generates image data for rendering a viewport of the virtual environment on a display system based on the head tracking data. A real-view area is defined in the virtual environment, having at least one boundary. The boundary corresponds to predetermined coordinates in the virtual environment. Thereby a corresponding part of the real environment is made visible in the real-view area, the part showing the real environment as perceived from the user head pose. Effectively the virtual environment is augmented by integrating part of the real environment via the real-view area.

Glasses-Free Determination of Absolute Motion

During operation, an electronic device may capture images using multiple image sensors having different fields of view and positions. Then, the electronic device may determine, based at least in part on an apparent size of an anatomical feature in the images (such as an interpupillary distance) and a predefined or predetermined size of the anatomical feature, absolute motion of at least a portion of the individual along a direction between at least the portion of the individual and the electronic device. Moreover, the electronic device may compute based at least in part on an estimated distance along the direction corresponding to the apparent size and the predefined or predetermined size and angular information associated with one or more objects in the images relative to the positions, absolute motion of at least the portion of the individual in a plane that is perpendicular to the direction.

METHOD AND APPARATUS FOR BUFFER MANAGEMENT IN CLOUD BASED VIRTUAL REALITY SERVICES
20220417487 · 2022-12-29 ·

Provided is a method for creating a virtual reality content, storing the virtual reality content in a transmission buffer, and after that, managing the transmission buffer. A server creates the virtual reality content based on user's motion information, stores the virtual reality content in the transmission buffer and is allowed to modify the virtual reality content stored in the transmission buffer based on subsequently received user's motion information, so that the most recent user's motion information can be appropriately reflected in the virtual reality content. It is possible to provide a more immersive virtual reality service.

METHOD AND APPARATUS OF ENCODING/DECODING IMAGE DATA BASED ON TREE STRUCTURE-BASED BLOCK DIVISION
20230370639 · 2023-11-16 · ·

Disclosed are methods and apparatuses for image data encoding/decoding. A method of decoding an image includes receiving a bitstream in which the image is encoded; obtaining index information for specifying a block division type of a current block in the image; and determining the block division type of the current block from a candidate group pre-defined in the decoding apparatus. The candidate group includes a plurality of candidate division types, including at least one of a non-division, a first quad-division, a second quad-division, a binary-division or a triple-division. The method also includes dividing the current block into a plurality of sub-blocks; and decoding each of the sub-blocks with reference to syntax information obtained from the bitstream.

METHOD AND APPARATUS OF ENCODING/DECODING IMAGE DATA BASED ON TREE STRUCTURE-BASED BLOCK DIVISION
20230370640 · 2023-11-16 · ·

Disclosed are methods and apparatuses for image data encoding/decoding. A method of decoding an image includes receiving a bitstream in which the image is encoded; obtaining index information for specifying a block division type of a current block in the image; and determining the block division type of the current block from a candidate group pre-defined in the decoding apparatus. The candidate group includes a plurality of candidate division types, including at least one of a non-division, a first quad-division, a second quad-division, a binary-division or a triple-division. The method also includes dividing the current block into a plurality of sub-blocks; and decoding each of the sub-blocks with reference to syntax information obtained from the bitstream.