Patent classifications
G06V30/2445
AUTOMATIC LANGUAGE IDENTIFICATION IN IMAGE-BASED DOCUMENTS
The present embodiments relate to identifying a native language of text included in an image-based document. A cloud infrastructure node (e.g., one or more interconnected computing devices implementing a cloud infrastructure) can utilize one or more deep learning models to identify a language of an image-based document (e.g., a scanned document) that is formed of pixels. The cloud infrastructure node can detect text lines that are bounded by bounding boxes in the document, determine a primary script classification of the text in the document, and derive a primary language for the document. Various document management tasks can be performed responsive to determining the language, such as performing optical character recognition (OCR) or deriving insights into the text.
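The aggregation step described above (per-line results rolled up into a primary script and then a primary language) can be sketched as a simple majority vote. This is only an illustration under stated assumptions: the abstract does not say how the deep learning models' outputs are combined, and the function name `primary_language` and the `(script, language)` tuple format are hypothetical.

```python
from collections import Counter

def primary_language(line_predictions):
    """Pick a primary script, then a primary language within that script.

    line_predictions: list of (script, language) tuples, one per detected
    text line. The tuple format is an assumption made for this sketch.
    """
    if not line_predictions:
        return None, None
    # Step 1: majority vote over the script labels of all text lines.
    script_counts = Counter(script for script, _ in line_predictions)
    primary_script = script_counts.most_common(1)[0][0]
    # Step 2: majority vote over languages, restricted to the primary script.
    lang_counts = Counter(
        lang for script, lang in line_predictions if script == primary_script
    )
    return primary_script, lang_counts.most_common(1)[0][0]

predictions = [("Latin", "en"), ("Latin", "fr"), ("Latin", "en"), ("Cyrillic", "ru")]
result = primary_language(predictions)
```

Voting on the script first narrows the set of candidate languages before the final decision, which mirrors the order of steps given in the abstract.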
Information processing apparatus, non-transitory computer readable medium, and character recognition system
An information processing apparatus includes a processor configured to acquire a result of character recognition of a character string formed on a medium and read by scanning that is subject to character recognition and replace a character or a symbol in a subject with a reference character string that is referred to by the character or the symbol.
ON DEMAND TESTING AS A SERVICE FOR BASE TEXT DIRECTION VERIFICATION TESTING
Methods and systems for testing base text direction (BTD) include receiving one or more images captured by an end-user system. Each of the one or more images displays respective text test case information. Each of the one or more images is compared to a respective reference image associated with a respective text test case. It is determined whether the end-user system produces BTD errors based on the comparison in accordance with one or more BTD error rules.
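The comparison step can be illustrated with a pixel-level mismatch check. This is a minimal sketch, assuming images are already decoded into equally sized grids of pixel values and that a single hypothetical rule (mismatch ratio above a tolerance) stands in for the BTD error rules, which the abstract does not specify.

```python
def mismatch_ratio(captured, reference):
    """Fraction of pixels that differ between two equally sized pixel grids."""
    same_shape = len(captured) == len(reference) and all(
        len(a) == len(b) for a, b in zip(captured, reference)
    )
    if not same_shape:
        raise ValueError("images must have the same dimensions")
    total = sum(len(row) for row in reference)
    if total == 0:
        raise ValueError("images must not be empty")
    differing = sum(
        1
        for row_c, row_r in zip(captured, reference)
        for c, r in zip(row_c, row_r)
        if c != r
    )
    return differing / total

def has_btd_error(captured, reference, max_mismatch=0.05):
    """Hypothetical rule: flag an error when too many pixels differ."""
    return mismatch_ratio(captured, reference) > max_mismatch

# 1 marks text pixels; the reference shows right-aligned (right-to-left) text.
reference = [[0, 0, 1, 1], [0, 0, 1, 1]]
captured_ok = [[0, 0, 1, 1], [0, 0, 1, 1]]
captured_flipped = [[1, 1, 0, 0], [1, 1, 0, 0]]
```

Here `has_btd_error(captured_flipped, reference)` flags the left-aligned rendering, while the matching capture passes.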
METHOD FOR CONTROLLING OPERATIONS OF AN ELECTRONIC DEVICE THROUGH AMBIENT LIGHT DETECTION, AND ASSOCIATED APPARATUS
A method for controlling operations of an electronic device through ambient light detection and associated apparatus are provided, where the method includes: utilizing an ambient light sensor of the electronic device to detect ambient light for the electronic device, to generate an ambient light detection signal, sampling the ambient light detection signal to convert the ambient light detection signal into a converted signal, and performing pattern detection on the converted signal to detect at least one pattern of the converted signal; and according to a pattern and event database, determining whether the detected pattern of the converted signal matches a predetermined pattern within a plurality of predetermined patterns, to selectively trigger a predetermined operation associated with the predetermined pattern, wherein the pattern and event database stores the plurality of predetermined patterns.
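The sample, convert, detect, and match sequence can be sketched as follows. The threshold, the run-collapsing conversion, the database contents, and the operation names are all assumptions made for illustration; the abstract does not define the conversion or the stored patterns.

```python
def to_binary(samples, threshold):
    """Convert raw ambient light samples into 1 (bright) or 0 (dark)."""
    return [1 if s >= threshold else 0 for s in samples]

def collapse_runs(bits):
    """Collapse consecutive repeats so only the bright/dark transitions remain."""
    pattern = []
    for bit in bits:
        if not pattern or pattern[-1] != bit:
            pattern.append(bit)
    return tuple(pattern)

# Hypothetical pattern and event database: pattern -> predetermined operation.
PATTERN_DB = {
    (1, 0, 1): "mute",
    (1, 0, 1, 0, 1): "snooze",
}

def match_operation(samples, threshold=50, db=PATTERN_DB):
    """Return the operation whose stored pattern matches, or None."""
    pattern = collapse_runs(to_binary(samples, threshold))
    return db.get(pattern)

# A hand briefly covering the sensor once: bright, dark, bright.
operation = match_operation([80, 82, 10, 12, 79])
```

Returning `None` when no stored pattern matches corresponds to the "selectively trigger" wording: an operation runs only on a match.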
Chinese, Japanese, or Korean language detection
Disclosed are systems, computer-readable mediums, and methods for determining that text contains Chinese, Japanese, or Korean characters. One method includes determining a language hypothesis for each text fragment in a plurality of text fragments identified from connected components in a document image. The method further includes selecting a first subset of text fragments from the plurality of text fragments based on ratings for the language hypothesis of each text fragment in the plurality of text fragments. The method further includes verifying, by a processor, the language hypothesis of one or more text fragments in the first subset of text fragments based on optical character recognition of the one or more text fragments. The method further includes determining, by the processor, that Chinese, Japanese, or Korean (CJK) characters are present in the document image based on the verification of the language hypothesis of each of the one or more text fragments.
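The hypothesize, select, and verify flow can be sketched as below. This is an illustration only: the `Fragment` type, the `top_k` cutoff, and the `ocr` callable are hypothetical, and checking Unicode code point ranges is a simple stand-in for whatever verification the patented method performs on the OCR output.

```python
from dataclasses import dataclass
from typing import Callable, List

# Unicode blocks: CJK Unified Ideographs, Hiragana, Katakana, Hangul Syllables.
CJK_RANGES = [(0x4E00, 0x9FFF), (0x3040, 0x309F), (0x30A0, 0x30FF), (0xAC00, 0xD7AF)]

def has_cjk(text: str) -> bool:
    """True if any character falls in one of the CJK ranges above."""
    return any(lo <= ord(ch) <= hi for ch in text for lo, hi in CJK_RANGES)

@dataclass
class Fragment:
    fragment_id: str
    hypothesis: str  # "cjk" or "non-cjk" (labels assumed for this sketch)
    rating: float    # confidence in the hypothesis

def document_has_cjk(
    fragments: List[Fragment], ocr: Callable[[Fragment], str], top_k: int = 3
) -> bool:
    """Verify the best-rated CJK hypotheses by running OCR on them."""
    candidates = sorted(
        (f for f in fragments if f.hypothesis == "cjk"),
        key=lambda f: f.rating,
        reverse=True,
    )[:top_k]
    return any(has_cjk(ocr(f)) for f in candidates)

# Fake OCR results stand in for a real engine.
fake_text = {"a": "東京都", "b": "Hello", "c": "~~~"}
fragments = [
    Fragment("a", "cjk", 0.9),
    Fragment("b", "non-cjk", 0.8),
    Fragment("c", "cjk", 0.4),
]
found = document_has_cjk(fragments, lambda f: fake_text[f.fragment_id])
```

Running OCR only on the top-rated subset keeps the expensive verification step small, which appears to be the point of selecting by rating first.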
System language switching method, readable storage medium, terminal device, and apparatus
The present application relates to a system language switching method, a computer readable storage medium, a terminal device, and an apparatus. The method includes first obtaining a preset image for setting a system language of a target terminal, then extracting text information in the image and determining a target language corresponding to the text information, and finally switching the system language of the target terminal to the target language. With the present application, the user only needs to prepare in advance an image for setting the system language of the target terminal, for example, a piece of paper with Chinese text written on it. The system can then obtain the text information in the image through image acquisition, text information extraction, and similar processes, determine that the text information is Chinese, and finally switch the system language of the target terminal to Chinese.
INPUT APPARATUS, INPUT METHOD, PROGRAM, AND INPUT SYSTEM
An input apparatus includes a handwriting input unit configured to receive a handwritten input using a position of a pen or a user's finger in contact with a display; and a display unit configured to display the handwritten input received by the handwriting input unit on the display as a handwritten object. The input apparatus is configured to, in response to no occurrence of a change in the handwritten object during a first period, display one or more operation commands on the basis of the handwritten object.
Information processing apparatus and non-transitory computer readable medium
An information processing apparatus includes a processor. The processor is configured to identify, from a character string recognition result for a form, a form feature that indicates at least a field in which the form is used or an attribute of a filling-out person filling out the form, accumulate past correction tendencies for character string recognition results for forms having respective identified form features, and obtain a correction tendency for a form having a form feature that is the same as the identified form feature from among the accumulated correction tendencies, and perform control to display a candidate correct expression for the character string recognition result for the form in accordance with the obtained correction tendency.
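One way to picture the accumulate-and-look-up behavior is a per-feature counter of past corrections. This is a sketch under assumptions: the class name `CorrectionHistory`, the string form features, and the "most frequent correction wins" rule are hypothetical and not taken from the abstract.

```python
from collections import Counter, defaultdict

class CorrectionHistory:
    """Accumulates past corrections, grouped by form feature."""

    def __init__(self):
        # form feature -> Counter of (recognized, corrected) pairs
        self._by_feature = defaultdict(Counter)

    def record(self, form_feature, recognized, corrected):
        """Store one past correction made on a form with this feature."""
        self._by_feature[form_feature][(recognized, corrected)] += 1

    def suggest(self, form_feature, recognized):
        """Return the most frequent past correction for this feature, or None."""
        counts = self._by_feature.get(form_feature)
        if not counts:
            return None
        candidates = [
            (n, corrected)
            for (rec, corrected), n in counts.items()
            if rec == recognized
        ]
        if not candidates:
            return None
        return max(candidates)[1]

history = CorrectionHistory()
history.record("medical", "lOmg", "10mg")
history.record("medical", "lOmg", "10mg")
history.record("medical", "lOmg", "l0mg")
suggestion = history.suggest("medical", "lOmg")
```

Keying the history by form feature means a correction that is common on medical forms is not suggested for a form from an unrelated field.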
Handwriting detector, extractor, and language classifier
Disclosed are methods for handwriting recognition. In some aspects, an image representing a page of a sample document is analyzed to identify a region having indications of handwriting. The region is analyzed to determine frequencies of a plurality of geometric features within the region. The frequencies may be compared to profiles or histograms of known language types, to determine if there are similarities between the frequencies in the sample document relative to those of the known language types. In some aspects, machine learning may be used to characterize the document as a particular language type based on the frequencies of the geometric features.
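The comparison of feature frequencies against known language profiles can be sketched with normalized histograms and an L1 distance. The feature names, the profile values, and the choice of L1 distance are assumptions for illustration; the abstract only says that frequencies may be compared to profiles or histograms.

```python
def normalize(counts):
    """Turn raw feature counts into relative frequencies."""
    total = sum(counts.values())
    return {k: v / total for k, v in counts.items()} if total else {}

def l1_distance(p, q):
    """Sum of absolute differences over the union of feature names."""
    return sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in set(p) | set(q))

def closest_language(sample_counts, profiles):
    """Return the profile name whose histogram is nearest to the sample."""
    sample = normalize(sample_counts)
    return min(
        profiles,
        key=lambda name: l1_distance(sample, normalize(profiles[name])),
    )

# Hypothetical profiles built from counts of geometric features.
profiles = {
    "latin-like": {"loop": 5, "vertical": 3, "dot": 2},
    "arabic-like": {"loop": 2, "vertical": 1, "dot": 7},
}
sample = {"loop": 10, "vertical": 7, "dot": 3}
best = closest_language(sample, profiles)
```

Normalizing first makes the comparison independent of how much handwriting the region contains, so a long sample and a short one with the same proportions score alike.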
Mechanism to facilitate image translation
Techniques and structures to facilitate conversion of a workflow process are disclosed. The techniques include receiving an image, identifying one or more objects included in the image, identifying one or more properties associated with each of the one or more objects, generating a matrix of data that includes the identified objects and associated properties, and processing the matrix at a machine learning model to determine whether the image is to be translated based on a determination that one or more objects and associated properties within the image are required to be translated.