G06T7/77

Neural network based position estimation of target object of interest in video frames

Visual target tracking is task of locating a target in consecutive frame of a video. Conventional systems observe target behavior frames of the video. However, dealing with this problem is very challenging when video has illumination variations, occlusion, change in size and view of the object due to relative motion between camera and object. Embodiments of the present disclosure addresses this problem by implementing Neural Network (NN), its features and their corresponding gradients. Present disclosure explicitly guides the NN by feeding target object of interest (ToI) defined by a bounding box in the first frame of the video. With this guidance, NN generates target activation map via convolutional features map and their gradient maps, thus giving tentative location of the ToI to further exploit to locate target object precisely by using correlation filter(s) and peak location estimator, thus repeating process for every frame of video to track ToI accurately.

Neural network based position estimation of target object of interest in video frames

Visual target tracking is task of locating a target in consecutive frame of a video. Conventional systems observe target behavior frames of the video. However, dealing with this problem is very challenging when video has illumination variations, occlusion, change in size and view of the object due to relative motion between camera and object. Embodiments of the present disclosure addresses this problem by implementing Neural Network (NN), its features and their corresponding gradients. Present disclosure explicitly guides the NN by feeding target object of interest (ToI) defined by a bounding box in the first frame of the video. With this guidance, NN generates target activation map via convolutional features map and their gradient maps, thus giving tentative location of the ToI to further exploit to locate target object precisely by using correlation filter(s) and peak location estimator, thus repeating process for every frame of video to track ToI accurately.

SYSTEMS AND METHODS FOR RECONSTRUCTING A SCENE IN THREE DIMENSIONS FROM A TWO-DIMENSIONAL IMAGE

Systems and methods described herein relate to reconstructing a scene in three dimensions from a two-dimensional image. One embodiment processes an image using a detection transformer to detect an object in the scene and to generate a NOCS map of the object and a background depth map; uses MLPs to relate the object to a differentiable database of object priors (PriorDB); recovers, from the NOCS map, a partial 3D object shape; estimates an initial object pose; fits a PriorDB object prior to align in geometry and appearance with the partial 3D shape to produce a complete shape and refines the initial pose estimate; generates an editable and re-renderable 3D scene reconstruction based, at least in part, on the complete shape, the refined pose estimate, and the depth map; and controls the operation of a robot based, at least in part, on the editable and re-renderable 3D scene reconstruction.

Methods, Apparatuses, Systems and Electronic Devices for Processing Data
20220406129 · 2022-12-22 ·

The embodiments of the present disclosure provide a method, apparatus, and system for processing data and an electronic device. The method includes: obtaining image collection data respectively sent by a plurality of collection devices in a target place, wherein the image collection data is obtained by the collection devices by performing image detection on one or more place images, and each of the one or more place images is an image of a region in the target place which corresponds to the collection device; obtaining region statistics data corresponding to a plurality of regions of the target place by statistical processing the image collection data; and visualizing the region statistics data.

Methods, Apparatuses, Systems and Electronic Devices for Processing Data
20220406129 · 2022-12-22 ·

The embodiments of the present disclosure provide a method, apparatus, and system for processing data and an electronic device. The method includes: obtaining image collection data respectively sent by a plurality of collection devices in a target place, wherein the image collection data is obtained by the collection devices by performing image detection on one or more place images, and each of the one or more place images is an image of a region in the target place which corresponds to the collection device; obtaining region statistics data corresponding to a plurality of regions of the target place by statistical processing the image collection data; and visualizing the region statistics data.

BODY AND HAND CORRELATION METHOD AND APPARATUS, DEVICE, AND STORAGE MEDIUM
20220405967 · 2022-12-22 ·

Body and hand correlation method, apparatus, device, storage medium and computer program are provided. Method includes: in an image of which an image content includes a body to be correlated and a hand to be correlated, a first correlation probability between a body detection box of the body and a hand detection box of the hand is determined; a second correlation probability between the body and a wrist key point in a key point is determined based on the key point in the body detection box, the key point including the wrist key point and elbow key point belonging to the same arm; a third correlation probability between the hand detection box and the wrist key point is determined based on the wrist key point and elbow key point; and a correlation degree between the body and the hand is determined based on the first, second, and third correlation probabilities.

BODY AND HAND CORRELATION METHOD AND APPARATUS, DEVICE, AND STORAGE MEDIUM
20220405967 · 2022-12-22 ·

Body and hand correlation method, apparatus, device, storage medium and computer program are provided. Method includes: in an image of which an image content includes a body to be correlated and a hand to be correlated, a first correlation probability between a body detection box of the body and a hand detection box of the hand is determined; a second correlation probability between the body and a wrist key point in a key point is determined based on the key point in the body detection box, the key point including the wrist key point and elbow key point belonging to the same arm; a third correlation probability between the hand detection box and the wrist key point is determined based on the wrist key point and elbow key point; and a correlation degree between the body and the hand is determined based on the first, second, and third correlation probabilities.

METHOD FOR AUTOMATED TOOTH SEGMENTATION OF THREE DIMENSIONAL SCAN DATA USING DEEP LEARNING AND COMPUTER READABLE MEDIUM HAVING PROGRAM FOR PERFORMING THE METHOD
20220398738 · 2022-12-15 · ·

A method of automated tooth segmentation of a three dimensional scan data using a deep learning, includes determining a U-shape of teeth in input scan data and operating a U-shape normalization operation to the input scan data to generate first scan data, operating a teeth and gum normalization operation, in which the first scan data are received and a region of interest (ROI) of the teeth and gum is set based on a landmark formed on the tooth, to generate second scan data, inputting the second scan data to a convolutional neural network to label the teeth and the gum and extracting a boundary between the teeth and the gum using labeled information of the teeth and the gum.

Multi-Image Sensor Module for Quality Assurance
20220394215 · 2022-12-08 ·

Each of a plurality of co-located inspection camera modules captures raw images of objects passing in front of the co-located inspection camera modules which form part of a quality assurance inspection system. The inspection camera modules have either a different image sensor or lens focal properties and generate different feeds of raw images. The co-located inspection camera modules can reside within a single standalone module and be selectively switched amongst to activate the corresponding feed of raw images. The activated feed of raw images is provided to a consuming application or process for quality assurance analysis.

Image processing apparatus, image processing method, and storage medium
11521330 · 2022-12-06 · ·

An image processing apparatus includes a detection unit that executes detection processing for detecting a particular object in an image, a holding unit that holds object information indicating a position and a size of the particular object on the image, a determination unit that determines whether a number of times a particular object is detected in the detection processing on one or more images reaches a predetermined value, a first setting unit that, when the number of times a particular object is detected in the detection processing on the one or more images is determined to reach the value, sets estimation areas on an image based on the object information obtained by the detection processing on the one or more images, and an estimation unit that executes estimation processing for estimating a number of the particular objects in the estimation areas.