Patent classifications
H04N13/10
Representation of media data
A media data preparation device adapted to receive media data, including at least one processor, and at least one non-transitory memory having computer program code stored thereon for execution by the at least one processor, the computer program code including instructions to receive a set of metadata that is based on at least one spatial coordinate, where the set of metadata is associated with the media data, and determine a representation of the media data in a virtual reality space based on the set of metadata
Employing three-dimensional (3D) data predicted from two-dimensional (2D) images using neural networks for 3D modeling applications and other applications
The disclosed subject matter is directed to employing machine learning models configured to predict 3D data from 2D images using deep learning techniques to derive 3D data for the 2D images. In some embodiments, a method is provided that comprises receiving, by a system comprising a processor, a panoramic image, and employing, by the system, a three-dimensional data from two-dimensional data (3D-from-2D) convolutional neural network model to derive three-dimensional data from the panoramic image, wherein the 3D-from-2D convolutional neural network model employs convolutional layers that wrap around the panoramic image as projected on a two-dimensional plane to facilitate deriving the three-dimensional data.
Employing three-dimensional (3D) data predicted from two-dimensional (2D) images using neural networks for 3D modeling applications and other applications
The disclosed subject matter is directed to employing machine learning models configured to predict 3D data from 2D images using deep learning techniques to derive 3D data for the 2D images. In some embodiments, a method is provided that comprises receiving, by a system comprising a processor, a panoramic image, and employing, by the system, a three-dimensional data from two-dimensional data (3D-from-2D) convolutional neural network model to derive three-dimensional data from the panoramic image, wherein the 3D-from-2D convolutional neural network model employs convolutional layers that wrap around the panoramic image as projected on a two-dimensional plane to facilitate deriving the three-dimensional data.
Directed interpolation and data post-processing
An encoding device evaluates a plurality of processing and/or post-processing algorithms and/or methods to be applied to a video stream, and signals a selected method, algorithm, class or category of methods/algorithms either in an encoded bitstream or as side information related to the encoded bitstream. A decoding device or post-processor utilizes the signaled algorithm or selects an algorithm/method based on the signaled method or algorithm. The selection is based, for example, on availability of the algorithm/method at the decoder/post-processor and/or cost of implementation. The video stream may comprise, for example, downsampled multiplexed stereoscopic images and the selected algorithm may include any of upconversion and/or error correction techniques that contribute to a restoration of the downsampled images.
Directed interpolation and data post-processing
An encoding device evaluates a plurality of processing and/or post-processing algorithms and/or methods to be applied to a video stream, and signals a selected method, algorithm, class or category of methods/algorithms either in an encoded bitstream or as side information related to the encoded bitstream. A decoding device or post-processor utilizes the signaled algorithm or selects an algorithm/method based on the signaled method or algorithm. The selection is based, for example, on availability of the algorithm/method at the decoder/post-processor and/or cost of implementation. The video stream may comprise, for example, downsampled multiplexed stereoscopic images and the selected algorithm may include any of upconversion and/or error correction techniques that contribute to a restoration of the downsampled images.
Method and device for overlaying 3D graphics over 3D video
A device and method process graphics to be overlayed over video for three-dimensional display. The video includes a series of video frames updated at a video rate, including main video frames and additional video frames. A first buffer buffers a first part of the overlay information to be overlayed over the main video frames. A second buffer buffers a second part of the overlay information to be overlayed over the additional video frames. For each video frame, the first part of the overlay information or the second part of the overlay information is copied to a frame-accurate area copier. The first part of the overlay information or the second part of the overlay information is output according to whether a current video frame is a main video frame or an additional video frame. The first part of the overlay information and the second part of the overlay information are updated at an overlay rate. The overlay rate is different than the video rate.
Mapping of spherical image data into rectangular faces for transport and decoding across networks
A system captures a first hemispherical image and a second hemispherical image, each hemispherical image including an overlap portion, the overlap portions capturing a same field of view, the two hemispherical images collectively comprising a spherical FOV and separated along a longitudinal plane. The system maps a modified first hemispherical image to a first portion of the 2D projection of a cubic image, the modified first hemispherical image including a non-overlap portion of the first hemispherical image, and maps a modified second hemispherical image to a second portion of the 2D projection of the cubic image, the modified second hemispherical image also including a non-overlap portion. The system maps the overlap portions of the first hemispherical image and the second hemispherical image to the 2D projection of the cubic image, and encodes the 2D projection of the cubic image to generate an encoded image representative of the spherical FOV.
Mapping of spherical image data into rectangular faces for transport and decoding across networks
A system captures a first hemispherical image and a second hemispherical image, each hemispherical image including an overlap portion, the overlap portions capturing a same field of view, the two hemispherical images collectively comprising a spherical FOV and separated along a longitudinal plane. The system maps a modified first hemispherical image to a first portion of the 2D projection of a cubic image, the modified first hemispherical image including a non-overlap portion of the first hemispherical image, and maps a modified second hemispherical image to a second portion of the 2D projection of the cubic image, the modified second hemispherical image also including a non-overlap portion. The system maps the overlap portions of the first hemispherical image and the second hemispherical image to the 2D projection of the cubic image, and encodes the 2D projection of the cubic image to generate an encoded image representative of the spherical FOV.
Array camera, electrical device, and method for operating the same
Disclosed are an array camera, an electrical device, and a method for operating the same. A method for operating an array camera including a plurality of camera modules includes acquiring images through the camera modules, when a first object is located at a first position of a first distance, extracting a first image and a second image acquired by a first camera module and a second camera module selected from among the camera modules, respectively, calculating distance information regarding the first object based on the first image and the second image, and when a second object is located at a second position or a second distance, calculating distance information regarding the second object located at the second position or the second distance based on at least one of a third image acquired by a camera module different from the first camera module and a fourth image acquired by a camera module different from the second camera module. Consequently, user convenience is improved.
Array camera, electrical device, and method for operating the same
Disclosed are an array camera, an electrical device, and a method for operating the same. A method for operating an array camera including a plurality of camera modules includes acquiring images through the camera modules, when a first object is located at a first position of a first distance, extracting a first image and a second image acquired by a first camera module and a second camera module selected from among the camera modules, respectively, calculating distance information regarding the first object based on the first image and the second image, and when a second object is located at a second position or a second distance, calculating distance information regarding the second object located at the second position or the second distance based on at least one of a third image acquired by a camera module different from the first camera module and a fourth image acquired by a camera module different from the second camera module. Consequently, user convenience is improved.