G06V10/754

Virtualization of Tangible Interface Objects
20210232821 · 2021-07-29 ·

An example system includes a stand configured to position a computing device proximate to a physical activity surface. The system further includes a video capture device, a detector, and an activity application. The video capture device is coupled for communication with the computing device and is adapted to capture a video stream that includes an activity scene of the physical activity surface and one or more interface objects physically interactable with by a user. The detector is executable to detect motion in the activity scene based on the processing and, responsive to detecting the motion, process the video stream to detect one or more interface objects included in the activity scene of the physical activity surface. The activity application is executable to present virtual information on a display of the computing device based on the one or more detected interface objects.

PUPIL POSITIONING METHOD AND APPARATUS, VR/AR APPARATUS AND COMPUTER READABLE MEDIUM

The present disclosure relates to a pupil positioning method. The pupil positioning method may include: obtaining an eye image under illumination of a light source; determining a first internal point in a pupil of the eye image; calculating gradient changes of pixel points along a straight line starting from the first internal point toward outside of the pupil; determining a plurality of edge points at an edge of the pupil based on the gradient changes of the pixel points along the straight line; and performing ellipse fitting on the edge points to obtain a pupil center.

SHIFT INVARIANT LOSS FOR DEEP LEARNING BASED IMAGE SEGMENTATION
20210224999 · 2021-07-22 · ·

Systems and methods of improving alignment in dense prediction neural networks are disclosed. A method includes identifying, at a computing system, an input data set and a label data set with one or more first parts of the input data set corresponding to a label. The computing system processes the input data set using a neural network to generate a predicted label data set that identifies one or more second parts of the input data set predicted to correspond to the label. The computing system determines an alignment result using the predicted label data set and the label data set and a transformation of the one or more first parts, including a shift, rotation, scaling, and/or deformation, based on the alignment result. The computing system computes a loss score using the transformation, label data and the predicted label data set and updates the neural network based on the loss score.

REDUCING SCALE ESTIMATE ERRORS IN SHELF IMAGES

Example image processing methods, apparatus/systems and articles of manufacture are disclosed herein. An example apparatus includes an image recognition application to identify matches between stored patterns and objects detected in a shelf image, where the shelf image has a shelf image scale estimate. The example apparatus further includes a scale corrector to calculate deviation values between sizes of (A) a first set of the objects detected in the shelf image and (B) a first set of the stored patterns matched with the first set of the objects and reduce an error of the shelf image scale estimate by calculating a scale correction value for the shelf image scale estimate based on the deviation values.

Image segmentation of complex structures

Image segmentation of a structure of complex shape proceeds by acquiring by circuitry, a plurality of training images of the structure. A shape model of the structure is formed by fusing the plurality of training images. The shape model is partitioned by circuitry into a predetermined number of portions, an appearance and shape of each portion being determined by a classifier.

Device, system, and method of generating a reduced-size volumetric dataset
11127206 · 2021-09-21 · ·

Device, system, and method of generating a reduced-size volumetric dataset. A method includes receiving a plurality of three-dimensional volumetric datasets that correspond to a particular object; and generating, from that plurality of three-dimensional volumetric datasets, a single uniform mesh dataset that corresponds to that particular object. The size of that single uniform mesh dataset is less than ¼ of the aggregate size of the plurality of three-dimensional volumetric datasets. The resulting uniform mesh is temporally coherent, and can be used for animating that object, as well as for introducing modifications to that object or to clothing or garments worn by that object.

SEMANTIC DEEP FACE MODELS

Techniques are disclosed for training and applying nonlinear face models. In embodiments, a nonlinear face model includes an identity encoder, an expression encoder, and a decoder. The identity encoder takes as input a representation of a facial identity, such as a neutral face mesh minus a reference mesh, and outputs a code associated with the facial identity. The expression encoder takes as input a representation of a target expression, such as a set of blendweight values, and outputs a code associated with the target expression. The codes associated with the facial identity and the facial expression can be concatenated and input into the decoder, which outputs a representation of a face having the facial identity and expression. The representation of the face can include vertex displacements for deforming the reference mesh.

IMAGE ANALYSIS ALGORITHMS USING CONTROL SLIDES
20210150716 · 2021-05-20 ·

Systems and methods for automatically excluding artifacts from an analysis of a biological specimen image are disclosed. An exemplary method includes obtaining an immunohistochemistry (IHC) image and a control image, determining whether the control image includes one or more artifacts, upon a determination that the control image includes one or more artifacts, identifying one or more artifact regions within the IHC image by mapping the one or more artifacts from the control image to the IHC image, and performing image analysis of the IHC image where any identified artifact regions are excluded from the image analysis.

METHOD, APPARATUS, AND PROGRAM FOR DETECTING MARK BY USING IMAGE MATCHING

An apparatus for detecting a mark by using image matching includes a damaged content obtaining unit configured to obtain damaged content generated based on original content including a plurality of cuts; a pre-processed content generating unit configured to generate pre-processed content by merging one or more images included in the damaged content or removing a partial region of one or more cuts included in the damaged content; a matchable content generating unit configured to generate matchable content, in which sizes or locations of one or more cuts included in the pre-processed content are adjusted by comparing one or more cuts included in the original image and the one or more cuts included in the pre-processed content; a matching unit configured to compare the original content with the matchable content, for each cut.

Virtualization of tangible interface objects

An example system includes a stand configured to position a computing device proximate to a physical activity surface. The system further includes a video capture device, a detector, and an activity application. The video capture device is coupled for communication with the computing device and is adapted to capture a video stream that includes an activity scene of the physical activity surface and one or more interface objects physically interactable with by a user. The detector is executable to detect motion in the activity scene based on the processing and, responsive to detecting the motion, process the video stream to detect one or more interface objects included in the activity scene of the physical activity surface. The activity application is executable to present virtual information on a display of the computing device based on the one or more detected interface objects.