Patent classifications
H04N2013/0088
METHOD FOR PROCESSING IMMERSIVE VIDEO AND METHOD FOR PRODUCING IMMERSIVE VIDEO
Disclosed herein is an immersive video processing method. The immersive video processing method includes: determining a priority order of pruning for source videos; extracting patches from the source videos based on the priority order of pruning; generating at least one atlas based on the extracted patches; and encoding metadata. Herein, a first flag indicating whether or not an atlas includes a patch including information on an entire region of a first source video may be encoded into the metadata.
METHOD FOR PROCESSING IMMERSIVE VIDEO AND METHOD FOR PRODUCING IMMERSIVE VIDEO
Disclosed herein is an immersive video processing method. The immersive video processing method includes: determining a priority order of pruning for source videos; extracting patches from the source videos based on the priority order of pruning; generating at least one atlas based on the extracted patches; and encoding metadata. Herein, the metadata may include first threshold information that becomes a criterion for distinguishing between a valid pixel and an invalid pixel in the atlas video.
System and methods for measuring depth using an array of independently controllable cameras
Systems in accordance with embodiments of the invention can perform parallax detection and correction in images captured using array cameras. Due to the different viewpoints of the cameras, parallax results in variations in the position of objects within the captured images of the scene. Methods in accordance with embodiments of the invention provide an accurate account of the pixel disparity due to parallax between the different cameras in the array, so that appropriate scene-dependent geometric shifts can be applied to the pixels of the captured images when performing super-resolution processing. In a number of embodiments, generating depth estimates considers the similarity of pixels in multiple spectral channels. In certain embodiments, generating depth estimates involves generating a confidence map indicating the reliability of depth estimates.
Method and apparatus for encoding/decoding image for virtual view synthesis
Disclosed herein is an image encoding/decoding method and apparatus for virtual view synthesis. The image decoding for virtual view synthesis may include decoding texture information and depth information of at least one or more basic view images and at least one or more additional view images from a bit stream and synthesizing a virtual view on the basis of the texture information and the depth information, wherein the basic view image and the additional view image comprise a non-empty region and an empty region, and wherein the synthesizing of the virtual view comprises determining the non-empty region through a specific value in the depth information and a threshold and synthesizing the virtual view by using the determined non-empty region.
Computer-generated image processing including volumetric scene reconstruction to replace a designated region
An imagery processing system determines pixel color values for pixels of captured imagery from volumetric data, providing alternative pixel color values. A main imagery capture device, such as a camera, captures main imagery such as still images and/or video sequences, of a live action scene. Alternative devices capture imagery of the live action scene, in some spectra and form, and capture information related to pixel color values for multiple depths of a scene, which can be processed to provide reconstruction of an image including replacing a designated region in the image.
METHODS, SYSTEMS, AND MEDIA FOR RENDERING IMMERSIVE VIDEO CONTENT WITH FOVEATED MESHES
Methods, systems, and media for rendering immersive video content with foveated meshes are provided. In some embodiments, the method comprises: receiving a video content item; determining, using a hardware processor, whether the video content item meets at least one criterion; in response to determining that the video content item meets the at least one criterion, generating, using the hardware processor, a foveated mesh in accordance with a foveation ratio parameter on which frames of the video content item are to be projected, wherein the foveated mesh has a non-uniform position map that increases pixel density in a central portion of each frame of the video content item in comparison with peripheral portions of each frame of the video content item; and storing the video content item in a file format that includes the generated foveated mesh, wherein the immersive video content is rendered by applying the video content item as a texture to the generated foveated mesh.
Systems and methods for multiscopic noise reduction and high-dynamic range
The present disclosure relates to methods and systems that may improve and/or modify images captured using multiscopic image capture systems. In an example embodiment, burst image data is captured via a multiscopic image capture system. The burst image data may include at least one image pair. The at least one image pair is aligned based on at least one rectifying homography function. The at least one aligned image pair is warped based on a stereo disparity between the respective images of the image pair. The warped and aligned images are then stacked and a denoising algorithm is applied. Optionally, a high dynamic range algorithm may be applied to at least one output image of the aligned, warped, and denoised images.
Systems and Methods for Indicating a Field of View
A system for determining a person's field of view with respect to captured data (e.g., recorded audiovisual data). A multi-camera capture system (e.g. bodycam, vehicular camera, stereoscopic, 360-degree captures system) records audiovisual data of an event. The field of capture of a capture system or a field of capture of captured data combined from multiple capture systems may be greater than the field of view of a person. Facial features (e.g. eyes, ears, nose, and jawlines) of the person may be detected from the captured data. Facial features may be used to determine the field of view of the person with respect to the captured data. Sensors that detect head orientation may be used to determine the field of view of the person with respect to captured data. The field of view of the person may be shown with respect to the captured data when the captured data is played back.
METHODS AND APPARATUS FOR RECEIVING AND/OR USING REDUCED RESOLUTION IMAGES
Methods and apparatus for using selective resolution reduction on images to be transmitted and/or used by a playback device are described. Prior to transmission one or more images of an environment are captured. Based on image content, motion detection and/or user input a resolution reduction operation is selected and performed. The reduced resolution image is communicated to a playback device along with information indicating a UV map corresponding to the selected resolution allocation that should be used by the playback device for rendering the communicated image. By changing the resolution allocation used and which UV map is used by the playback device different resolution allocations can be made with respect to different portions of the environment while allowing the number of pixels in transmitted images to remain constant. The playback device renders the individual images with the UV map corresponding to the resolution allocation used to generate the individual images.
METHOD FOR PROCESSING IMMERSIVE VIDEO AND METHOD FOR PRODUCING IMMERSIVE VIDEO
Disclosed herein is an immersive video processing method. The immersive video processing method may include classifying a multiplicity of source view videos into base view videos and additional view videos, generating residual data for the additional view videos, packing a patch, which is generated based on the residual data, into an altas video, and generating metadata for the patch.