G06T3/0037

CODING SCHEMES FOR VIRTUAL REALITY (VR) SEQUENCES
20230141565 · 2023-05-11 · ·

An improved method for coding video is provided that includes Virtual Reality (VR) sequences that enables more efficient encoding by organizing the VR sequence as a single 2D block structure. In the method, reference picture and subpicture lists are created and extended to account for coding of the VR sequence. To further improve coding efficiency, reference indexing can be provided for the temporal and spatial difference between a current VR picture block and the reference pictures and subpictures for the VR sequence. Further, because the reference subpictures for the VR sequence may not have the proper orientation once the VR sequence subpictures are organized into the VR sequence, reorientation of the reference subpictures is made so that the reference subpicture orientations match the current VR subpicture orientations.

Systems and methods for improving low dose volumetric contrast-enhanced MRI

Methods and systems are provided for improving model robustness and generalizability. The method may comprise: acquiring, using a medical imaging apparatus, a medical image of a subject; reformatting the medical image of the subject in multiple scanning orientations; applying a deep network model to the medical image to improve the quality of the medical image; and outputting an improved quality image of the subject for analysis by a physician.

Three-dimentional plane panorama creation through hough-based line detection
09836871 · 2017-12-05 · ·

A method for creating a plane panorama from point cloud data using Hough transformations is disclosed. The method involves converting the three-dimensional point cloud into a two-dimensional histogram with bins grouping neighboring points, and performing a Hough transformation on the histogram. The resulting transformed data is segmented and the method searches the segments iteratively for a major line, followed by lines that are orthogonal, diagonal, or parallel to the major line, and discards outlying data in each bin as lines are identified. The detected lines are connected to form planes, and the planes are assembled into a hole- and gap-filled panorama. The method may also use an algorithm such as a Random Sample Consensus (RANSAC) algorithm to detect a ground plane.

HANDLING FACE DISCONTINUITIES IN 360-DEGREE VIDEO CODING
20220377385 · 2022-11-24 · ·

Systems, methods, and instrumentalities may be provided for discounting reconstructed samples and/or coding information from spatial neighbors across face discontinuities. Whether a current block is located at a face discontinuity may be determined. The face discontinuity may be a face boundary between two or more adjoining blocks that are not spherical neighbors. The coding availability of a neighboring block of the current block may be determined, e.g., based on whether the neighboring block is on the same side of the face discontinuity as the current block. For example, the neighboring block may be determined to be available for decoding the current block if it is on the same side of the face discontinuity as the current block, and unavailable if it is not on the same side of the face discontinuity. The neighboring block may be a spatial neighboring block or a temporal neighboring block.

Method and data processing system for providing a two-dimensional unfolded image of at least one tubular structure

A computer-implemented method is for providing a two-dimensional unfolded image of at least one tubular structure. In an embodiment, the method includes receiving three-dimensional image data of an examination region including the at least one tubular structure; selecting a set of input points in the three-dimensional image data; determining a projection surface with respect to the three-dimensional image data; calculating a set of surface points of the projection surface; calculating a deformed projection surface by applying a deformation algorithm onto the projection surface; calculating a set of voxel positions with respect to the three-dimensional image data based on the deformed projection surface; and calculating the two-dimensional unfolded image of the at least one tubular structure based on the three-dimensional image data and the set of voxel positions.

Dynamic vector map tiles

The present disclosure relates to systems and processes for providing vector map data for generating a view of a map in a mapping application. In one example process, a request for a vector map sub-tile can be received by a map server. The map server can identify a pre-generated vector map tile corresponding to the requested vector map sub-tile and can generate the requested vector map sub-tile from the identified vector map tile by dividing the vector map tile into two or more vector map sub-tiles. In some examples, dividing the vector map tile into multiple vector map sub-tiles can include identifying features and attributes of the vector map tile that should be included in the requested vector map sub-tile and generating the requested vector map sub-tile to include these features and attributes. The map server can then transmit the requested vector map sub-tile to the requesting electronic device.

3D face identity authentication method and apparatus
11238270 · 2022-02-01 · ·

The present application provides an identity authentication method and an apparatus. The method may include obtaining a sequence of depth images containing a target face and a sequence of original two-dimensional (2D) images containing the target face, and performing identity authentication. The identity authentication may be conducted by: calculating a target face three-dimensional (3D) texture image according to the depth images containing the target face and the original 2D images containing the target face; projecting the target face 3D texture image to a 2D plane to obtain a target face 2D image; extracting feature information from the target face 2D image; comparing the feature information of the target face 2D image with feature information of a reference face 2D image to determine a similarity value; and in response to that the similarity value exceeds a first threshold, determining that the identity authentication succeeds.

TRUNCATED SQUARE PYRAMID GEOMETRY AND FRAME PACKING STRUCTURE FOR REPRESENTING VIRTUAL REALITY VIDEO CONTENT
20170280126 · 2017-09-28 ·

Techniques and systems are described for mapping 360-degree video data to a truncated square pyramid shape. A 360-degree video frame can include 360-degrees' worth of pixel data, and thus be spherical in shape. By mapping the spherical video data to the planes provided by a truncated square pyramid, the total size of the 360-degree video frame can be reduced. The planes of the truncated square pyramid can be oriented such that the base of the truncated square pyramid represents a front view and the top of the truncated square pyramid represents a back view. In this way, the front view can be captured at full resolution, the back view can be captured at reduced resolution, and the left, right, up, and bottom views can be captured at decreasing resolutions. Frame packing structures can also be defined for 360-degree video data that has been mapped to a truncated square pyramid shape.

MEDICAL IMAGE DATA PROCESSING SYSTEM AND METHOD
20170262978 · 2017-09-14 · ·

A medical image data processing system comprises processing circuitry configured to receive a three-dimensional medical imaging data set, process the three-dimensional medical imaging data set to determine a curved plane that has a shape representative of a shape of at least one anatomical structure, wherein the at least one anatomical structure comprises a plurality of sub-structures, and obtain an image based on values of the medical imaging data set at a plurality of sample points of the curved plane.

APPARATUS, METHOD AND STORAGE MEDIUM FOR CORRECTING PAGE IMAGE
20170262163 · 2017-09-14 · ·

When a touch operation is performed with one finger, this touch operation performed with one finger is judged to be a single-point operation performed on one control point on a mesh image constituted by Bezier curves and deformation processing is performed in which the corresponding point is moved in accordance with the movement of the one touching finger. On the other hand, when a touch operation is performed with a plurality of fingers, it is judged to be a multi-point operation performed on all control points on the mesh image constituted by Bezier curves , and deformation processing is performed in which all the control points on the mesh image are moved in accordance with the movements of the plurality of fingers with the linearity of the mesh image being maintained.