Patent classifications
G06V10/7715
DEVICE FOR PROCESSING IMAGE AND METHOD FOR OPERATING SAME
Provided are a device and operating method thereof for obtaining compression ratio information for recognizing a target object in an image using a deep neural network model, and compressing an image using the compression ratio information and encoding the compressed image. According to an embodiment of the present disclosure, there is provided a device that receives an image via at least one camera or a communication interface, obtains a feature map for detecting a target object in the received image, outputs a compression ratio for correctly recognizing the target object in the image by inputting the image and the feature map to a deep neural network model composed of pre-trained model parameters, and generates a bitstream by compressing the image using the output compression ratio and encoding the compressed image.
ANCHOR FOR LINE RECOGNITION
A method for determining at least one anchor for an anchor-based lane line recognition and/or roadway marking recognition in a digital image representation on the basis of sensor data that are obtained from at least one surroundings sensor of a system. The method includes at least the following steps: a) receiving a digital image representation, b) setting at least one row or one column of possible anchors in at least one area of the digital image representation, the row or column of possible anchors being situated at a distance from at least the upper and lower or left and right edge of the area of the digital image representation.
METHODS AND SYSTEMS FOR IMAGE SELECTION
Various methods and systems are provided for automatically classifying a plurality of image slices using body region bounding boxes identified from a localizer image. In one embodiment, a localizer image may be mapped to a plurality of bounding boxes, corresponding to a plurality of body regions, using a trained machine learning model. Coordinates of the plurality of bounding boxes may be used to determine body region boundaries, such that the body regions are non-intersecting and coherent. The body regions identified in the localizer image may then be correlated to image slice ranges, and image slices within each image slice range may be labeled as belonging to the corresponding body region.
AUTOMATICALLY CLASSIFYING ANIMAL BEHAVIOR
Systems and methods are disclosed to objectively identify sub-second behavioral modules in the three-dimensional (3D) video data that represents the motion of a subject. Defining behavioral modules based upon structure in the 3D video data itself—rather than using a priori definitions for what should constitute a measurable unit of action—identifies a previously-unexplored sub-second regularity that defines a timescale upon which behavior is organized, yields important information about the components and structure of behavior, offers insight into the nature of behavioral change in the subject, and enables objective discovery of subtle alterations in patterned action. The systems and methods of the invention can be applied to drug or gene therapy classification, drug or gene therapy screening, disease study including early detection of the onset of a disease, toxicology research, side-effect study, learning and memory process study, anxiety study, and analysis in consumer behavior.
METHOD AND SYSTEM PERFORMING PATTERN CLUSTERING
A method of clustering patterns of an integrated circuit includes; providing a pattern image and numeric data, as input data corresponding to a first pattern to a first model, wherein the first model is trained by a plurality of sample images and a plurality of sample values, obtaining a content latent variable using the first model, and grouping a plurality of content latent variables corresponding to a plurality of patterns into a plurality of clusters based on a Euclidean distance, wherein the numeric data represents at least one attribute of the first pattern.
COMPUTER-READABLE RECORDING MEDIUM STORING INFORMATION PROCESSING PROGRAM, METHOD OF PROCESSING INFORMATION, AND INFORMATION PROCESSING APPARATUS
A non-transitory computer-readable recording medium stores an information processing program for causing a computer to execute a process including: extracting a first feature from an image; detecting, from the extracted first feature, a plurality of visual entities included in the image; generating a second feature in which the visual entities in at least one combination of the plurality of detected visual entities are combined, in first feature, with each other; generating, based on the first feature and the second feature, a first map that indicates relation of each visual entity; extracting a fourth feature based on the first map and a third feature obtained by converting the first feature; and estimating the relation from the fourth feature.
Annotation device
An annotation device includes an image-capturing device, a robot, a control unit, a designation unit, a coordinate processing unit, and a storage unit. The control unit controls the robot so as to acquire a learning image of a plurality of objects, each having a different positional relationship with the image-capturing devices. Furthermore, the storage unit converts a position of the object in a robot coordinate system into a position of the object in an image coordinate system at the time of image-capturing or a position of the object in a sensor coordinate system, and stores the position thus converted together with the learning image.
Action recognition method and apparatus
An action recognition method and apparatus related to artificial intelligence and include extracting a spatial feature of a to-be-processed picture, determining a virtual optical flow feature of the to-be-processed picture based on the spatial feature and X spatial features and X optical flow features in a preset feature library, where the X spatial features and the X optical flow features include a one-to-one correspondence, determining a first type of confidence of the to-be-processed picture in different action categories based on similarities between the virtual optical flow feature and Y optical flow features, where each of the Y optical flow features in the preset feature library corresponds to one action category, X and Y are both integers greater than 1, and determining an action category of the to-be-processed picture based on the first type of confidence.
SAFETY BELT DETECTION METHOD, APPARATUS, COMPUTER DEVICE AND STORAGE MEDIUM
A safety belt detection method, apparatus, computer device, and computer readable storage medium are disclosed. In the detection method, an image to be detected is obtained. The image to be detected is inputted into a detection network which includes an image classification branch network and an image segmentation branch network. A classification result, which indicates whether a driver is wearing a safety belt and is output from the image classification branch network, is obtained. A segmentation image, which indicates a position information of the safety belt and is output from the image segmentation branch network, is obtained. A detection result of the safety belt, indicating whether the driver wears the safety belt normatively, is obtained based on the classification result and the segmentation image.
FAST AND FUZZY PATTERN GROUPING
Methods and systems for determining information for a specimen are provided. One system includes a computer subsystem configured for removing one or more patterns in a specimen image that do not touch a defect detected in the specimen image thereby generating a modified specimen image. The computer subsystem is also configured for generating one or more hash codes for the modified specimen image. In addition, the computer subsystem is configured for assigning the specimen image to one of multiple groups based on a distance between the one or more hash codes and one or more other hash codes generated for a second modified specimen image generated for a second specimen image.