H04N13/229

Free-viewpoint photorealistic view synthesis from casually captured video

An estimated camera pose may be determined for each of a plurality of single plane images of a designated three-dimensional scene. The sampling density of the single plane images may be below the Nyquist rate. However, the sampling density of the single plane images may be sufficiently high such that the single plane images is sufficiently high such that they may be promoted to multiplane images and used to generate novel viewpoints in a light field reconstruction framework. Scene depth information identifying for each of a respective plurality of pixels in the single plane image a respective depth value may be determined for each single plane image. A respective multiplane image including a respective plurality of depth planes may be determined for each single plane image. Each of the depth planes may include a respective plurality of pixels from the respective single plane image.

Multi-lens based capturing apparatus and method

A multi-lens based capturing apparatus and method are provided. The capturing apparatus includes a lens array including lenses and a sensor including sensing pixels, wherein at least a portion of sensing pixels in the sensor may generate sensing information based on light entering through different lenses in the lens array, and light incident on each sensing pixel, among the portion of the plurality of sensing pixels may correspond to different combinations of viewpoints.

Multi-lens based capturing apparatus and method

A multi-lens based capturing apparatus and method are provided. The capturing apparatus includes a lens array including lenses and a sensor including sensing pixels, wherein at least a portion of sensing pixels in the sensor may generate sensing information based on light entering through different lenses in the lens array, and light incident on each sensing pixel, among the portion of the plurality of sensing pixels may correspond to different combinations of viewpoints.

Capturing light-field images with uneven and/or incomplete angular sampling
10897608 · 2021-01-19 · ·

A light-field camera may generate four-dimensional light-field data indicative of incoming light. The light-field camera may have an aperture configured to receive the incoming light, an image sensor, and a microlens array configured to redirect the incoming light at the image sensor. The image sensor may receive the incoming light and, based on the incoming light, generate the four-dimensional light-field data, which may have first and second spatial dimensions and first and second angular dimensions. The first angular dimension may have a first resolution higher than a second resolution of the second angular dimension.

Capturing light-field images with uneven and/or incomplete angular sampling
10897608 · 2021-01-19 · ·

A light-field camera may generate four-dimensional light-field data indicative of incoming light. The light-field camera may have an aperture configured to receive the incoming light, an image sensor, and a microlens array configured to redirect the incoming light at the image sensor. The image sensor may receive the incoming light and, based on the incoming light, generate the four-dimensional light-field data, which may have first and second spatial dimensions and first and second angular dimensions. The first angular dimension may have a first resolution higher than a second resolution of the second angular dimension.

Free-viewpoint photorealistic view synthesis from casually captured video

A respective target viewpoint may be rendered for each of a plurality of multiplane images of a three-dimensional scene. Each of the multiplane images may be associated with a respective single plane image of the three-dimensional scene captured from a respective viewpoint. Each of the multiplane images may include a respective plurality of depth planes. Each of the depth planes may include a respective plurality of pixels from the respective single plane image. Each of the pixels in the depth plane may be positioned at approximately the same distance from the respective viewpoint. A weighted combination of the target viewpoint renderings may be determined, where the sampling density of the single plane images is sufficiently high that the weighted combination satisfies the inequality in Equation (7). The weighted combination of the target viewpoint renderings may be transmitted as a novel viewpoint image.

SYSTEMS AND METHODS FOR VIRTUAL LIGHT FIELD EXPANSION WITH ELECTRO-OPTICAL TESSELLATION
20210006763 · 2021-01-07 ·

Some implementations of the disclosure are directed to tessellating a light field into a size or depth that is larger or further extended than the pupil size of an imaging system or display system. In some implementations, a display system comprises: a display configured to emit light corresponding to an image; a first optical component positioned in front of the display, the first optical component configured to pass the light to an orthogonal field evolving cavity (OFEC) at a plurality of different angles; the OFEC, wherein the OFEC comprises a plurality of reflectors that are configured to reflect the light passed at the plurality of different angles to tessellate the size of the image to form a tessellated image; and a second optical component optically coupled to the OFEC, the second optical component configured to relay the tessellated image through an exit pupil of the display system.

Methods and apparatuses for encoding and decoding digital light field images

A method for encoding a raw lenselet image includes a receiving phase, wherein at least a portion of a raw lenselet image is received, the image including a plurality of macro-pixels, each macro-pixel having pixels corresponding to a specific view angle for the same point of a scene, and an output phase, wherein a bitstream having at least a portion of an encoded lenselet image is outputted. The method has an image transform phase, wherein the pixels of said raw lenselet image are spatially displaced in a transformed multi-color image having a larger number of columns and rows with respect to the received raw lenselet image, wherein dummy pixels having undefined value are inserted into the raw lenselet image and wherein the displacement is performed so as to put the estimated center location of each macro-pixel onto integer pixel locations. Moreover, the method includes a sub-view generation phase, wherein a sequence of sub-views is generated, said sub-views having pixels of the same angular coordinates extracted from different macro-pixels of the transformed raw lenselet image. Finally, the method has a graph coding phase, wherein a bitstream is generated by encoding a graph representation of at least one of the sub-views of the sequence according to a predefined graph signal processing technique.

Methods and apparatuses for encoding and decoding digital light field images

A method for encoding a raw lenselet image includes a receiving phase, wherein at least a portion of a raw lenselet image is received, the image including a plurality of macro-pixels, each macro-pixel having pixels corresponding to a specific view angle for the same point of a scene, and an output phase, wherein a bitstream having at least a portion of an encoded lenselet image is outputted. The method has an image transform phase, wherein the pixels of said raw lenselet image are spatially displaced in a transformed multi-color image having a larger number of columns and rows with respect to the received raw lenselet image, wherein dummy pixels having undefined value are inserted into the raw lenselet image and wherein the displacement is performed so as to put the estimated center location of each macro-pixel onto integer pixel locations. Moreover, the method includes a sub-view generation phase, wherein a sequence of sub-views is generated, said sub-views having pixels of the same angular coordinates extracted from different macro-pixels of the transformed raw lenselet image. Finally, the method has a graph coding phase, wherein a bitstream is generated by encoding a graph representation of at least one of the sub-views of the sequence according to a predefined graph signal processing technique.

PASSIVE AND SINGLE-VIEWPOINT 3D IMAGING SYSTEM

A method for a passive single-viewpoint 3D imaging system comprises capturing an image from a camera having one or more phase masks. The method further includes using a reconstruction algorithm, for estimation of a 3D or depth image.