Patent classifications
G06V30/274
Streaming image semantic segmentation method, logical integrated circuit system and electronic device
Disclosed are an image semantic segmentation method, a logical integrated circuit, a system and an electronic device. The logical integrated circuit includes a convolution processing module and a deconvolution processing module. The convolution processing module performs convolution operation processing on an image to generate a piece of feature data of each and every feature image block of the image. The deconvolution processing module is configured to perform deconvolution operation processing on each piece of feature data to obtain an respective image block region; determine an approximation degree of the each piece of feature data and each and every preset semantic category of multiple preset semantic categories, and classify the each piece of feature data into a preset semantic category; and fill each image block region corresponding to the each piece of feature data with a filling color to achieve semantic segmentation of the image.
Geospatial modeling system providing 3D geospatial model update based upon predictively registered image and related methods
A geospatial modeling system may include a memory and a processor cooperating therewith to generate a three-dimensional (3D) geospatial model including geospatial voxels based upon a plurality of geospatial images, obtain a newly collected geospatial image, and determine a reference geospatial image from the 3D geospatial model using Artificial Intelligence (AI) and based upon the newly collected geospatial image. The processor may further align the newly collected geospatial image and the reference geospatial image to generate a predictively registered image, and update the 3D geospatial model based upon the predictively registered image.
Systems and methods for procedure optimization
Procedural optimization is facilitated by receiving user input for creating or modifying a body of text comprising a procedure, detecting one or more procedural steps associated with the procedure using a procedural step detection module, automatically searching within a corpus of references for one or more related procedural steps using a related procedural step extraction module, automatically identifying one or more outcomes within the corpus of references associated with the one or more related procedural steps using an outcome extraction module, automatically determining whether the one or more outcomes comprise detrimental results using an outcome analysis module, and, in response to determining a set of detrimental outcomes from the one or more outcomes that comprise detrimental results, presenting a detriment indicator within the user interface in association with the one or more procedural steps.
Method of and system for explainable knowledge-based visual question answering
A method and a system for generating an augmented scene graph of an image and for training an explainable knowledge based (KB) visual question answering (VQA) machine learning (ML) model are provided. A scene graph encoding spatial and semantic features of objects and relations between objects in the image is obtained. An augmented scene graph is generated by embedding a knowledge graph to enhance the scene graph. An embedded set of questions and associated answers related to the image are obtained. The KB VQA ML model is trained to provide an answer to a given question related to the image based on the augmented scene graph and the embedded set of questions and associated answers. The KB VQA ML model is trained to retrieve a subgraph linking the question and the associated answer as a potential explanation for the answer.
SEMANTIC CLUSTER FORMATION IN DEEP LEARNING INTELLIGENT ASSISTANTS
Enhanced techniques and circuitry are presented herein for providing responses to user questions from among digital documentation sources spanning various documentation formats, versions, and types. One example includes a method comprising receiving a user question directed to subject having a documentation corpus, determining a set of passages of the documentation corpus related to the user question, ranking the set of passages according to relevance to the user question, forming semantic clusters comprising sentences extracted from ranked ones of the set of passages according to sentence similarity, and providing a response to the user question based at least on a selected semantic cluster.
MULTI-CHANNEL HIGH-QUALITY DEPTH ESTIMATION SYSTEM
The present invention discloses a system and a method for providing multi-channel high-quality depth estimation from a monocular camera for providing augmented reality (AR) and virtual reality (VR) features to an image. The invention further includes the method to enhance generalization on deployment-friendly monocular depth inference pipeline with semantic information. Furthermore, a vivid and intact reconstruction is guaranteed by inpainting the missing depth and context within the single image input.
Semantic segmentation for stroke classification in inking application
A data processing system for performing a semantic analysis of digital ink stroke data implements obtaining the digital ink stroke data representing handwritten text, drawings, or both; analyzing the digital ink stroke data to extract path signature feature information from the digital ink stroke data; analyzing the path signature feature information using a convolutional neural network (CNN) trained to perform a pixel-level sematic analysis of the digital ink stroke data and to output a pixel segmentation map with semantic prediction information for each pixel of digital ink stroke data; analyzing the pixel segmentation map to generate stroke-level semantic information using a pixel-to-stroke conversion model; and processing the digital ink stroke data based on the stroke-level semantic information.
Center-biased machine learning techniques to determine saliency in digital images
A location-sensitive saliency prediction neural network generates location-sensitive saliency data for an image. The location-sensitive saliency prediction neural network includes, at least, a filter module, an inception module, and a location-bias module. The filter module extracts visual features at multiple contextual levels, and generates a feature map of the image. The inception module generates a multi-scale semantic structure, based on multiple scales of semantic content depicted in the image. In some cases, the inception block performs parallel analysis of the feature map, such as by parallel multiple layers, to determine the multiple scales of semantic content. The location-bias module generates a location-sensitive saliency map of location-dependent context of the image based on the multi-scale semantic structure and on a bias map. In some cases, the bias map indicates location-specific weights for one or more regions of the image.
Polarity semantics engine analytics platform
Embodiments of the systems and methods disclosed herein provide a prescriptive analytics platform, a polarity analysis engine, and a semantic analysis engine in which a user can identify a target objective and use the system to find out whether the user's objectives are being met, what predictive factors are positively or negatively affecting the targeted objectives, as well as what recommended changes the user can make to better meet the objectives. The systems and methods may include a polarity analysis engine configured to determine the polarity of terms in free-text input in view of the target objective and the predictive factors and use the polarity to generate the recommended changes. The systems and methods may also include a semantic analysis engine to extend the results of the polarity analysis engine for improved determination of predictive factors and improved recommendations.
Systems for Generating Indications of Relationships between Electronic Documents
In implementations of systems for generating indications of relationships between electronic documents, a processing device implements a relationship system to segment text of electronic documents included in a document corpus into segments. The relationship system determines a subset of the electronic documents that includes electronic document pairs having a number of similar segments that is greater than a threshold number. The similar segments are identified using locality sensitive hashing. The electronic document pairs are classified as related documents or unrelated documents using a machine learning model that receives a pair of electronic documents as an input and generates an indication of a classification for the pair of electronic documents as an output. Indications of relationships between particular electronic documents included in the subset are generated based at least partially on the electronic document pairs that are classified as related documents.