G06V20/64

LEARNING-BASED ACTIVE SURFACE MODEL FOR MEDICAL IMAGE SEGMENTATION
20230043026 · 2023-02-09 · ·

A learning-based active surface model for medical image segmentation uses a method including: (a) data generation: obtaining medical images and associated ground truths, and splitting the sample images into a training set and a testing set; (b) raw segmentation: constructing a surface initialization network, parameters of the network trained by images and labels in the training set; (c) surface initialization: segmenting the images by the surface initialization network, and generating the point cloud data as the initial surface from the segmentation; (d) fine segmentation: constructing the surface evolution network, the parameters of the network trained by the initial surface obtained in step (c); (e) surface evolution: deforming the initial surface points along the offsets to obtain the predicted surface, the offsets presenting the prediction of the surface evolution network; (f) surface reconstruction: reconstructing the 3D volumes from the set of predicted surface points set to obtain the final segmentation results.

Systems and Methods for Image Based Perception

Systems and methods for image-based perception. The methods comprise: obtaining, by a computing device, images captured by a plurality of cameras with overlapping fields of view; generating, by the computing device, spatial feature maps indicating locations of features in the images; defining, by the computing device, predicted cuboids at each location of an object in the images based on the spatial feature maps; and assigning, by the computing device, at least two cuboids of said predicted cuboids to a given object when predictions from images captured by separate cameras of the plurality of cameras should be associated with a same detected object.

Arrangement for producing head related transfer function filters
11557055 · 2023-01-17 · ·

When three-dimensional audio is produced by using headphones, particular HRTF-filters are used to modify sound for the left and right channels of the headphone. As the morphology of every ear is different, it is beneficial to have HRTF-filters particularly designed for the user of headphones. Such filters may be produced by deriving ear geometry from a plurality of images taken with an ordinary camera, detecting necessary features from images and fitting said features to a model that has been produced from accurately scanned ears comprising representative values for different sizes and shapes. Taken images are sent to a server (52) that performs the necessary computations and submits the data further or produces the requested filter.

Arrangement for producing head related transfer function filters
11557055 · 2023-01-17 · ·

When three-dimensional audio is produced by using headphones, particular HRTF-filters are used to modify sound for the left and right channels of the headphone. As the morphology of every ear is different, it is beneficial to have HRTF-filters particularly designed for the user of headphones. Such filters may be produced by deriving ear geometry from a plurality of images taken with an ordinary camera, detecting necessary features from images and fitting said features to a model that has been produced from accurately scanned ears comprising representative values for different sizes and shapes. Taken images are sent to a server (52) that performs the necessary computations and submits the data further or produces the requested filter.

Methods and systems for training an object detection algorithm using synthetic images

A method includes: (A) receiving a selection of a 3D model stored in one or more memories, the 3D model corresponding to an object and (B) setting a camera parameter set for a camera for use in detecting a pose of the object in a real scene. The method also includes (C) generating at least one 2D synthetic image based at least on the camera parameter set by rendering the 3D model in a view range for generating training data.

Methods and systems for training an object detection algorithm using synthetic images

A method includes: (A) receiving a selection of a 3D model stored in one or more memories, the 3D model corresponding to an object and (B) setting a camera parameter set for a camera for use in detecting a pose of the object in a real scene. The method also includes (C) generating at least one 2D synthetic image based at least on the camera parameter set by rendering the 3D model in a view range for generating training data.

System and method for ordered representation and feature extraction for point clouds obtained by detection and ranging sensor

A method is described which includes receiving a point cloud having a plurality of data points each representing a 3D location in a 3D space, the point cloud being obtained using a detection and ranging (DAR) sensor. For each data point, associating the data point with a 3D volume containing the 3D location of the data point, the 3D volume being defined using a 3D lattice that partitions the 3D space based on spherical coordinates. For at least one 3D volume, the data points are sorted within the 3D volume based on at least one dimension of the 3D lattice; and the sorted data points are stored as a set of ordered data points. The method also includes performing feature extraction on the set of ordered data points to generate a set of ordered feature vectors and providing the set of ordered feature vectors to perform a machine learning inference task.

System and method for ordered representation and feature extraction for point clouds obtained by detection and ranging sensor

A method is described which includes receiving a point cloud having a plurality of data points each representing a 3D location in a 3D space, the point cloud being obtained using a detection and ranging (DAR) sensor. For each data point, associating the data point with a 3D volume containing the 3D location of the data point, the 3D volume being defined using a 3D lattice that partitions the 3D space based on spherical coordinates. For at least one 3D volume, the data points are sorted within the 3D volume based on at least one dimension of the 3D lattice; and the sorted data points are stored as a set of ordered data points. The method also includes performing feature extraction on the set of ordered data points to generate a set of ordered feature vectors and providing the set of ordered feature vectors to perform a machine learning inference task.

Apparatus and method for x-ray data generation

The apparatus for an X-ray data generation according to an embodiment of the inventive concept includes a processor that receives 3D data to generate output data and a buffer, and the processor includes an extraction unit that extracts raw object data from the 3D data and projects the raw object data onto a 2D plane to generate first object data, an augmentation unit that performs data augmentation on the first object data to generate second object data, a composition unit that synthesizes the second object data and background data to generate composite data, and a post-processing unit that performs post-processing on the composite data to generate the output data, and the buffer stores a plurality of parameters related to generation of the first object data, the second object data, the composite data, and the output data.

Semantic map production system and method

The system includes a metric map creation unit configured to create a metric map using first image data received from a 3D sensor, an image processing unit configured to recognize an object by creating and classifying a point cloud using second image data received from an RGB camera; a probability-based map production unit configured to create an object location map and a spatial semantic map in a probabilistic expression method using a processing result of the image processing unit, a question creation unit configured to extract a portion of high uncertainty about an object class from a produced map on the basis of entropy and ask a user about the portion, and a map update unit configured to receive a response from the user and update a probability distribution for spatial information according to a change in probability distribution for classification of the object.