G06V10/765

LOCATION SENSITIVE ENSEMBLE CLASSIFIER
20210216916 · 2021-07-15 ·

Computer-implemented systems and methods for generating and using a location sensitive ensemble classifier for classifying content includes dividing a validation data set into regions. Each region encompasses data points of the validation data set that fall within the region. A regional ensemble classifier is generated for each region based on the data points that fall within the region. A content item is then classified in at least one of a plurality of classes using the regional ensemble classifier for the region to which the content item belongs.

DATA GENERATING METHOD, AND COMPUTING DEVICE AND NON-TRANSITORY MEDIUM IMPLEMENTING SAME
20210209420 · 2021-07-08 ·

A data generating method includes obtaining first sample data, determining a type of the first sample data and a corresponding data expansion method, expanding the first sample data according to the determined data expansion method to generate second sample data, and dividing the first sample data and the second sample data into a training set and a verification set according to a preset rule. A data model is trained according to the training set, and the data model is verified according to the verification set after training.

Lane Detection and Tracking Techniques for Imaging Systems

A method for tracking a lane on a road is presented. The method comprises receiving, by one or more processors from an imaging system, a set of pixels associated with lane markings. The method further includes generating, by the one or more processors, a predicted spline comprising (i) a first spline and (ii) a predicted extension of the first spline in a direction in which the imaging system is moving. The first spline describes a boundary of a lane and is generated based on the set of pixels. The predicted extension of the first spline is generated based at least in part on a curvature of at least a portion of the first spline.

Lane Detection and Tracking Techniques for Imaging Systems

A method for detecting boundaries of lanes on a road is presented. The method comprises receiving, by one or more processors from an imaging system, a set of pixels associated with lane markings. The method further includes partitioning, by the one or more processors, the set of pixels into a plurality of groups. Each of the plurality of groups is associated with one or more control points. The method further includes generating, by the one or more processors, a spline that traverses the control points of the plurality of groups. The spline traversing the control points describes a boundary of a lane.

Utilizing a large-scale object detector to automatically select objects in digital images

The present disclosure relates to an object selection system that automatically detects and selects objects in a digital image utilizing a large-scale object detector. For instance, in response to receiving a request to automatically select a query object with an unknown object class in a digital image, the object selection system can utilize a large-scale object detector to detect potential objects in the image, filter out one or more potential objects, and label the remaining potential objects in the image to detect the query object. In some implementations, the large-scale object detector utilizes a region proposal model, a concept mask model, and an auto tagging model to automatically detect objects in the digital image.

OCCUPANT MONITORING DEVICE AND OCCUPANT MONITORING METHOD
20210027079 · 2021-01-28 · ·

An occupant monitoring device includes processing circuitry to detect an eye of an occupant on a vehicle and to determine an eye opening degree of the eye using an image captured by an image capturing device having an automatic exposure adjusting function for adjusting an exposure time; to determine that the eye is closed when the eye opening degree of the eye is less than a predetermined eye opening degree threshold value; to deactivate the automatic exposure adjusting function when the automatic exposure adjusting function is active and the eye is determined to be closed; to detect brightness in a vicinity of the eye using an image which is captured by the image capturing device after the automatic exposure adjusting function is deactivated; and when the eye is determined to be closed, to determine that the occupant is in a drowsy state when the brightness in the vicinity of the eye is less than a predetermined brightness threshold value, and to determine that the occupant is in an awake state when the brightness is equal to or greater than the predetermined brightness threshold value.

METHOD OF PERFORMING DATA PROCESSING OPERATION
20210012141 · 2021-01-14 ·

A computer-implemented method of performing a convolution between an input data array and a kernel to generate an output data array includes decomposing the kernel into a plurality of sub-kernels each having a respective position relative to the kernel and respective in-plane dimensions less than or equal to a target kernel dimension, and for each of the plurality sub-kernels: determining a respective portion of the input data array on the basis of the respective in-plane dimensions of the sub-kernel and the respective position of the sub-kernel relative to the kernel; retrieving the respective portion of the input data array; and performing a convolution between the retrieved respective portion of the input data array and the sub-kernel to generate a respective intermediate data array. The method further includes summing the generated intermediate data arrays to generate at least a portion of the output data array.

INFORMATION PROCESSING APPARATUS, CONTROL METHOD, AND NON-TRANSITORY STORAGE MEDIUM
20200410361 · 2020-12-31 · ·

An information processing apparatus (2000) acquires input data (10) and generates, by use of a neural network (30), condition data (50) that indicate one or more conditions satisfied by the input data (10). The information processing apparatus (2000) determines prediction data (20) by use of a value determined based on correct answer data (42) associated with example data (40) that satisfy at least a part of conditions indicated by the condition data (50).

METHOD AND SYSTEM FOR CLASSIFYING AN OBJECT IN INPUT DATA USING ARTIFICIAL NEURAL NETWORK MODEL
20200394443 · 2020-12-17 ·

This disclosure relates to method and system for classifying an object in input data using an artificial neural network (ANN) model. The method may include extracting positive features and orthogonal features associated with the object in the input data, performing a partial classification of the object based on the positive features by a first part of the ANN model, and determining an accuracy of the classification of the object based on the orthogonal features by a second part of the ANN model. The positive features are features uniquely contributing to identification of a class for the object, while the orthogonal features are features not contributing to identification of the class but contributing to identification of one or more of remaining classes.

Content-weighted deep residual learning for video in-loop filtering

Systems and methods are provided for improving filtering performance and Bjntegaard-Delta (BD) rate savings for video processing. In addition to computing the artifacts between a given compressed image and a restored clean image after filtering using Deep Residual Learning (DRL) for recovering the residual between input and output, filtering strength of a loop filter may be controlled by the content of the region of the image, such that, in more important areas, such as the face and edges, the filtering strength may be increased while in less important areas, such as textures and backgrounds, the filtering strength may be decreased.