G06V30/148

SEMANTIC CLUSTER FORMATION IN DEEP LEARNING INTELLIGENT ASSISTANTS

Enhanced techniques and circuitry are presented herein for providing responses to user questions from among digital documentation sources spanning various documentation formats, versions, and types. One example includes a method comprising receiving a user question directed to subject having a documentation corpus, determining a set of passages of the documentation corpus related to the user question, ranking the set of passages according to relevance to the user question, forming semantic clusters comprising sentences extracted from ranked ones of the set of passages according to sentence similarity, and providing a response to the user question based at least on a selected semantic cluster.

SURGICAL KIT INSPECTION SYSTEMS AND METHODS FOR INSPECTING SURGICAL KITS HAVING PARTS OF DIFFERENT TYPES

Surgical kit inspection systems and methods are provided for inspecting surgical kits having parts of different types. The surgical kit inspection system comprises a vision unit including a first camera unit and a second camera unit to capture images of parts of a first type and a second type in each kit and to capture images of loose parts from each kit that are placed on a light surface. A robot supports the vision unit to move the first and second camera units relative to the parts in each surgical kit. One or more controllers obtain unique inspection instructions for each of the surgical kits to control inspection of each of the surgical kits and control movement of the robot and the vision unit accordingly to provide output indicating inspection results for each of the surgical kits.

Character Restoration Method and Apparatus, Storage Medium, and Electronic Device
20230063967 · 2023-03-02 ·

A character restoration method and apparatus, a storage medium, and an electronic device are provided. The character restoration method includes: a character identifier of a character in a text region is determined, where the character identifier is used for uniquely identifying the character; and encoding is performed at least according to the character identifier, and encoded data is sent to a receiving end, where the encoded data is used for the receiving end to decode the encoded data and restore the character according to the character identifier obtained after decoding, that is, encoding is performed merely according to a small amount of information, and then the information is obtained by decoding, so as to restore the character.

METHOD AND SYSTEM FOR IMAGE TRANSLATION

Provided is a method for augmented reality-based image translation performed by one or more processors, which includes storing a plurality of frames representing a video captured by a camera, extracting a first frame that satisfies a predetermined criterion from the stored plurality of frames, translating a first language sentence (or group of words) included in the first frame into a second language sentence (or group of words), determining a translation region including the second language sentence (or group of words) included in the first frame, and rendering the translation region in a second frame.

AUTOMATIC LANGUAGE IDENTIFICATION IN IMAGE-BASED DOCUMENTS

The present embodiments relate to identifying a native language of text included in an image-based document. A cloud infrastructure node (e.g., one or more interconnected computing devices implementing a cloud infrastructure) can utilize one or more deep learning models to identify a language of an image-based document (e.g., a scanned document) that is formed of pixels. The cloud infrastructure node can detect text lines that are bounded by bounding boxes in the document, determine a primary script classification of the text in the document, and derive a primary language for the document. Various document management tasks can be performed responsive to determining the language, such as perform optical character recognition (OCR) or derive insights into the text.

Audience-based optimization of communication media
11631110 · 2023-04-18 ·

Introduced here are communication optimization platforms configured to improve comprehension, persuasion, or clarity of communications. Initially, a communication optimization platform can acquire input sample(s) that are associated with a source audience. The communication optimization platform can then create a linguistic profile for the source audience by examining the content of the input sample(s). Additionally or alternatively, the communication optimization platform may produce a psychographic profile that specifies various characteristics of the source audience, such as personality, opinions, attitudes, interests, etc. The communication optimization platform can then generate, based on the linguistic profile and/or the psychographic profile, affinity language for communicating with a target audience. By incorporating the affinity language into communications, the communication optimization platform can increase appeal to the target audience.

Electronic product recognition

Methods and systems for identifying one or more products in an electronic image are disclosed. The computer-implemented method to electronically recognize a product in an electronic image captured via an electronic mobile device is disclosed. The method includes receiving, by a server, a video stream from a camera of the electronic mobile device, the video stream including a plurality of frames. The server selects at least one of the plurality of frames from the video stream, the at least one of the plurality of frames from the video stream being selected is a captured image. The server segments a plurality of products in the captured image into a plurality of segments. The server performs an image recognition using each of the plurality of segments to identify the product in each of the plurality of segments. One or more recognized products identified in the image recognition is outputted by the server.

Computing device and method for content authoring of a digital conversational character
11630651 · 2023-04-18 ·

Disclosed herein is a software technology for facilitating an interactive conversational session between a user (e.g., a client, a patient, etc.) and a digital conversational character. For instance, in one aspect, the disclosed process may involve two primary phases: (1) an authoring phase that involves a first user accessing a content authoring tool to create a given type of visual conversation application that facilitates interactions between a second user and a digital conversational character in an interactive conversational session, and (2) a rendering phase that involves the second user accessing the created visual conversation application to interact with the digital conversational character in an interactive conversational session.

IMAGE PROCESSING APPARATUS, NON-TRANSITORY STORAGE MEDIUM, AND IMAGE PROCESSING METHOD
20230063374 · 2023-03-02 ·

When a character string corresponding to a predetermined item is not extracted in a first document image as a processing target by entity extraction processing, the character string corresponding to the predetermined item in the first document image is acquired based on positional information about an area where the character string corresponding to the predetermined item is previously extracted in a second document image having the same format as that of the first document image.

Automated signature extraction and verification

A system for extraction and verification of handwritten signatures from arbitrary documents. The system comprises one or more computing devices configured to: receive a digital image of a document; perform a dilating transformation via convolution matrix on the digital image to obtain a modified image; determine a plurality of regions of connected markings in the digital image; based at least in part on a pixel density or proximity to an anchor substring of each region, determine whether any region contains any handwritten signature; extract first image data of the region containing a handwritten signature from the digital image; retrieve second image data of a confirmed example signature for a purported signer of the handwritten signature; and based on a comparison of the first image data with the second image data, forward a determination of whether the first image data and second image data are similar.