Patent classifications
G06V10/85
Method and system for detection-based segmentation-free license plate recognition
A detection-based segmentation-free method and system for license plate recognition. An image of a vehicle is initially captured utilizing an image-capturing unit. A license plate region is located in the image of the vehicle. A set of characters can then be detected in the license plate region and a geometry correction performed based on a location of the set of characters detected in the license plate region. An operation for sweeping an OCR across the license plate region can be performed to infer characters with respect to the set of characters and locations of the characters utilizing a hidden Markov model and leveraging anchored digit/character locations.
BIOINFORMATICS SYSTEMS, APPARATUSES, AND METHODS EXECUTED ON A QUANTUM PROCESSING PLATFORM
A system, method and apparatus for executing a bioinformatics analysis on genetic sequence data includes a quantum computing device formed of a set of hardwired quantum logic circuits interconnected by a plurality of superconducting connections to process information represented as a quantum state that is configured as a set of one or more qubits. The hardwired quantum logic circuits may be arranged as a set of processing engines, each processing engine being formed of a subset of the hardwired quantum logic circuits to perform one or more steps in the bioinformatics analysis on the reads of genomic data. Each subset of the hardwired quantum logic circuits may be formed in a wired configuration to perform the one or more steps in the bioinformatics analysis.
System for video based face recognition using an adaptive dictionary
The method includes a dictionary including a target collection defined by images that are known with a defined level of certainty to include a subject and an imposter collection defined by images of individuals other than the subject. In the method, images of an area are captured over a period of time. In respect of each image: a matching calculation is carried out, based upon a comparison of the image captured with the images in the dictionary to result in a measure of confidence that the subject is in the area; and an inference determination is made to replace one of the target collection images with a further image that is known with the defined level of certainty, the determination being a function of the measure of confidence resultant from the captured image, the measure resultant from one or more previously captured images and the associated capture times.
TREE STRUCTURED CRF WITH UNARY POTENTIAL FUNCTION USING ACTION UNIT FEATURES OF OTHER SEGMENTS AS CONTEXT FEATURE
A method of determining a composite action from a video clip, using a conditional random field (CRF), the method includes determining a plurality of features from the video clip, each of the features having a corresponding temporal segment from the video clip. The method may continue by determining, for each of the temporal segments corresponding to one of the features, an initial estimate of an action unit label from a corresponding unary potential function, the corresponding unary potential function having as ordered input the plurality of features from a current temporal segment and at least one other of the temporal segments. The method may further include determining the composite action by jointly optimising the initial estimate of the action unit labels.
Dynamic hand gesture recognition using depth data
The subject disclosure is directed towards a technology by which dynamic hand gestures are recognized by processing depth data, including in real-time. In an offline stage, a classifier is trained from feature values extracted from frames of depth data that are associated with intended hand gestures. In an online stage, a feature extractor extracts feature values from sensed depth data that corresponds to an unknown hand gesture. These feature values are input to the classifier as a feature vector to receive a recognition result of the unknown hand gesture. The technology may be used in real time, and may be robust to variations in lighting, hand orientation, and the user's gesturing speed and style.
Object Segmentation, Including Sky Segmentation
A digital medium environment includes an image processing application that performs object segmentation on an input image. An improved object segmentation method implemented by the image processing application comprises receiving an input image that includes an object region to be segmented by a segmentation process, processing the input image to provide a first segmentation that defines the object region, and processing the first segmentation to provide a second segmentation that provides pixel-wise label assignments for the object region. In some implementations, the image processing application performs improved sky segmentation on an input image containing a depiction of a sky.
METHOD AND APPARATUS FOR ANALYZING FACIAL IMAGE
A method to analyze a facial image includes: inputting a facial image to a residual network including residual blocks that are sequentially combined and arranged in a direction from an input to an output; processing the facial image using the residual network; and acquiring an analysis map from an output of an N-th residual block among the residual blocks using a residual deconvolution network, wherein the residual network transfers the output of the N-th residual block to the residual deconvolution network, and N is a natural number that is less than a number of all of the residual blocks, and wherein the residual deconvolution network includes residual deconvolution blocks that are sequentially combined, and the residual deconvolution blocks correspond to respective residual blocks from a first residual block among the residual blocks to the N-th residual block.
GESTURE OPERATION METHOD BASED ON DEPTH VALUES AND SYSTEM THEREOF
A gesture operation method based on depth values and the system thereof are revealed. A stereoscopic-image camera module acquires a first stereoscopic image. Then an algorithm is performed to judge if the first stereoscopic image includes a triggering gesture. Then the stereoscopic-image camera module acquires a second stereoscopic image. Another algorithm is performed to judge if the second stereoscopic image includes a command gesture for performing the corresponding operation of the command gesture.
Vision Based Target Tracking Using Tracklets
A non-hierarchical and iteratively updated tracking system includes a first module for creating an initial trajectory model for multiple targets from a set of received image detections. A second module is connected to the first module to provide identification of multiple targets using a target model, and a third module is connected to the second module to solve a joint object function and maximal condition probability for the target module. A tracklet module can update the first module trajectory module, and after convergence, output a trajectory model for multiple targets.
IMAGE STITCHING METHOD, APPARATUS AND DEVICE BASED ON REINFORCEMENT LEARNING AND STORAGE MEDIUM
The present application provides an image stitching method, apparatus and device based on reinforcement learning and a storage medium. The method includes: acquiring initial calibration parameters, collecting a sample image and position information of a motion platform; setting a negative reward function; acquiring a state set and a negative reward value set according to a randomly generated action set, the initial calibration parameters, the position information of the motion platform and the negative reward function to construct a probability kinematics model; constructing a state value function based on an occurrence probability of the state, and acquiring an optimal action by optimizing the state value function; and acquiring optimized calibration parameters through the optimal action and the initial calibration parameters, and carrying out image stitching on corresponding sample images through the optimized calibration parameters. The application solves the technical problem of low image stitching quality in the prior art.