G06V30/20

INFORMATION PROCESSING APPARATUS AND INFORMATION PROCESSING METHOD
20240104919 · 2024-03-28 ·

Provided is an information processing apparatus capable of improving detection accuracy of a position pointed by a target object. An acquisition unit acquires a distance image indicating a distance to each object present within a predetermined range. Subsequently, a vector calculation unit calculates a vector extending from the target object present within the predetermined range in a direction pointed by the target object on the basis of the acquired distance image. Subsequently, an intersection calculation unit calculates a position of an intersection of a predetermined surface present within the predetermined range and the calculated vector on the basis of the acquired distance image. Subsequently, a processing execution unit executes processing corresponding to the calculated position of the intersection.

SYSTEMS AND METHODS FOR DETECTING USER CREATED CIRCULAR SHAPED INDICATIONS USING MACHINE LEARNING MODELS
20240046676 · 2024-02-08 ·

In some instances, a method is provided. The method comprises: obtaining, by a computing system, a plurality of documents, wherein at least one document, of the plurality of documents, comprises one or more circular shaped user indications, wherein each of the one or more circular shaped user indications indicates a user selection of a design or text within the associated circular shaped user indication; determining, by the computing system, circular shaped identification information for a document, of the plurality of documents based on inputting the document into a trained machine learningartificial intelligence (ML-AI) model, wherein the circular shaped identification information indicates the user selection of the design or the text within the associated circular shaped user indication; and performing, by the computing system, one or more actions based on the circular shaped identification information.

SYSTEMS AND METHODS FOR DETECTING USER CREATED CIRCULAR SHAPED INDICATIONS USING MACHINE LEARNING MODELS
20240046676 · 2024-02-08 ·

In some instances, a method is provided. The method comprises: obtaining, by a computing system, a plurality of documents, wherein at least one document, of the plurality of documents, comprises one or more circular shaped user indications, wherein each of the one or more circular shaped user indications indicates a user selection of a design or text within the associated circular shaped user indication; determining, by the computing system, circular shaped identification information for a document, of the plurality of documents based on inputting the document into a trained machine learningartificial intelligence (ML-AI) model, wherein the circular shaped identification information indicates the user selection of the design or the text within the associated circular shaped user indication; and performing, by the computing system, one or more actions based on the circular shaped identification information.

Imaging processing device and imaging processing system capable of performing collation in captured image having overexposed portion
12039700 · 2024-07-16 · ·

An imaging processing device includes a processor. The processor performs processing of acquiring a captured image which is captured in a state where a light source emits light, extracting an overexposed portion which occurs in the captured image due to the light emission of the light source and which has a predetermined pixel value or more, and setting a region located in an area, which does not overlap with the extracted overexposed portion in the captured image, as a region of a collation target to be collated with a predetermined reference image.

Method and apparatus for transformation of dot text in an image into stroked characters based on dot pitches
10223618 · 2019-03-05 · ·

A method and apparatus for determining orientation and dot pitch of characters in an image. A statistical neighborhood of a set of dots of an image is determined. The statistical neighborhood includes a set of points and each point is associated with a position and a statistical measure indicative of a likelihood that one or more dots that satisfy a shape and a size criteria are located at that position. A Fast Fourier Transform (FFT) is computed across the set of points of the statistical neighborhood; and based on the FFT of the set of points, a first orientation and a first distance between adjacent dots of characters along the first orientation, and a second orientation and a second distance between adjacent dots of the characters along the second orientation are determined.

INFERENCE DEVICE, INFERENCE METHOD, AND RECORDING MEDIUM

An article image data acquirer acquires article image data as a target of optical character recognition (OCR). An inference result data generator generates first inference result data and second inference result data by inputting the article image data as the target of the OCR into a trained model. An inference result data outputter outputs the first inference result data and the second inference result data. An image filter generator generates a first image filter based on the first inference result data and a second image filter based on the second inference result data. An image filter outputter outputs the first image filter and the second image filter.

INFERENCE DEVICE, INFERENCE METHOD, AND RECORDING MEDIUM

An article image data acquirer acquires article image data as a target of optical character recognition (OCR). An inference result data generator generates first inference result data and second inference result data by inputting the article image data as the target of the OCR into a trained model. An inference result data outputter outputs the first inference result data and the second inference result data. An image filter generator generates a first image filter based on the first inference result data and a second image filter based on the second inference result data. An image filter outputter outputs the first image filter and the second image filter.

Information processing apparatus and information processing method

Provided is an information processing apparatus capable of improving detection accuracy of a position pointed by a target object. An acquisition unit acquires a distance image indicating a distance to each object present within a predetermined range. Subsequently, a vector calculation unit calculates a vector extending from the target object present within the predetermined range in a direction pointed by the target object on the basis of the acquired distance image. Subsequently, an intersection calculation unit calculates a position of an intersection of a predetermined surface present within the predetermined range and the calculated vector on the basis of the acquired distance image. Subsequently, a processing execution unit executes processing corresponding to the calculated position of the intersection.

Information processing apparatus and information processing method

Provided is an information processing apparatus capable of improving detection accuracy of a position pointed by a target object. An acquisition unit acquires a distance image indicating a distance to each object present within a predetermined range. Subsequently, a vector calculation unit calculates a vector extending from the target object present within the predetermined range in a direction pointed by the target object on the basis of the acquired distance image. Subsequently, an intersection calculation unit calculates a position of an intersection of a predetermined surface present within the predetermined range and the calculated vector on the basis of the acquired distance image. Subsequently, a processing execution unit executes processing corresponding to the calculated position of the intersection.

JOINT TEXT SPOTTING AND LAYOUT ANALYSIS

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for detecting text instances of arbitrary shapes, sizes, and locations. In one aspect, a method comprises processing an image depicting one or more text instances, generating a respective prediction for each character in a sequence of characters that are predicted to be depicted in the text instance, the respective prediction comprising a respective character class to which the predicted character belongs, the respective character class selected from a set that includes printable character classes and a space character class and a bounding box that contains the character within the image, and grouping the sequence of characters into a plurality of words based on locations of characters that are predicted to belong to the space character class.