G06V10/426

System and method for ontology guided indoor scene understanding for cognitive robotic tasks

Existing cognitive robotic applications follow a practice of building specific applications for specific use cases. However, the knowledge of the world and the semantics are common for a robot for multiple tasks. In this disclosure, to enable usage of knowledge across multiple scenarios, a method and system for ontology guided indoor scene understanding for cognitive robotic tasks is described where in scenes are processed based on techniques filtered based on querying ontology with relevant objects in perceived scene to generate a semantically rich scene graph. Herein, an initially manually created ontology is updated and refined in online fashion using external knowledge-base, human robot interaction and perceived information. This knowledge helps in semantic navigation, aids in speech, and text based human robot interactions.

Activity recognition systems and methods
09886625 · 2018-02-06 · ·

An activity recognition system is disclosed. A plurality of temporal features is generated from a digital representation of an observed activity using a feature detection algorithm. An observed activity graph comprising one or more clusters of temporal features generated from the digital representation is established, wherein each one of the one or more clusters of temporal features defines a node of the observed activity graph. At least one contextually relevant scoring technique is selected from similarity scoring techniques for known activity graphs, the at least one contextually relevant scoring technique being associated with activity ingestion metadata that satisfies device context criteria defined based on device contextual attributes of the digital representation, and a similarity activity score is calculated for the observed activity graph as a function of the at least one contextually relevant scoring technique, the similarity activity score being relative to at least one known activity graph.

Methods and systems of performing video object segmentation

Techniques and systems are described for performing video segmentation using fully connected object proposals. For example, a number of object proposals for a video sequence are generated. A pruning step can be performed to retain high quality proposals that have sufficient discriminative power. A classifier can be used to provide a rough classification and subsampling of the data to reduce the size of the proposal space, while preserving a large pool of candidate proposals. A final labeling of the candidate proposals can then be determined, such as a foreground or background designation for each object proposal, by solving for a posteriori probability of a fully connected conditional random field, over which an energy function can be defined and minimized.

Method, System and Computer Program for Automatically Detecting Traffic Circles on Digital Maps
20180018513 · 2018-01-18 ·

A computer-implemented method for detecting a traffic circle on a digital map, the method comprising: detecting a cycle within a road graphic of the digital map; calculating a similarity of an internal angle of a corner of a polygon, which represents the geometry of the cycle, to an internal angle of a corresponding corner of a reference polygon; calculating a similarity indicator on the basis of the calculated similarity of the internal angle of all the corners of the polygon of the detected cycle; and, if the similarity indicator exceeds a predefined threshold value, defining the detected cycle as a traffic circle on the digital map.

COMPUTER-READABLE RECORDING MEDIUM STORING INFORMATION PROCESSING PROGRAM, INFORMATION PROCESSING METHOD, AND INFORMATION PROCESSING DEVICE
20240428438 · 2024-12-26 · ·

A non-transitory computer-readable recording medium storing an information processing program for causing a computer to execute processing including: acquiring a video image captured by one or more camera devices; specifying, by analyzing the acquired video image, a relation that identifies an interaction between an object and a person included in the video image; determining, based on the specified relation, whether a first object is placed outside an imaging range of the camera device; and issuing an alert, based on a determination result obtained by the determining of whether the first object is placed outside the imaging range.

COMPUTER-READABLE RECORDING MEDIUM STORING INFORMATION PROCESSING PROGRAM, INFORMATION PROCESSING METHOD, AND INFORMATION PROCESSING DEVICE
20240428438 · 2024-12-26 · ·

A non-transitory computer-readable recording medium storing an information processing program for causing a computer to execute processing including: acquiring a video image captured by one or more camera devices; specifying, by analyzing the acquired video image, a relation that identifies an interaction between an object and a person included in the video image; determining, based on the specified relation, whether a first object is placed outside an imaging range of the camera device; and issuing an alert, based on a determination result obtained by the determining of whether the first object is placed outside the imaging range.

Graph convolutional networks with motif-based attention

Various embodiments describe techniques for making inferences from graph-structured data using graph convolutional networks (GCNs). The GCNs use various pre-defined motifs to filter and select adjacent nodes for graph convolution at individual nodes, rather than merely using edge-defined immediate-neighbor adjacency for information integration at each node. In certain embodiments, the graph convolutional networks use attention mechanisms to select a motif from multiple motifs and select a step size for each respective node in a graph, in order to capture information from the most relevant neighborhood of the respective node.

Graph convolutional networks with motif-based attention

Various embodiments describe techniques for making inferences from graph-structured data using graph convolutional networks (GCNs). The GCNs use various pre-defined motifs to filter and select adjacent nodes for graph convolution at individual nodes, rather than merely using edge-defined immediate-neighbor adjacency for information integration at each node. In certain embodiments, the graph convolutional networks use attention mechanisms to select a motif from multiple motifs and select a step size for each respective node in a graph, in order to capture information from the most relevant neighborhood of the respective node.

Computational systems pathology spatial analysis platform for in situ or in vitro multi-parameter cellular and subcellular imaging data

A computational systems pathology spatial analysis platform includes: (i) a spatial heterogeneity quantification component configured for generating a global quantification of spatial heterogeneity among cells of varying phenotypes in multi-parameter cellular and subcellular imaging data; (ii) a microdomain identification component configured for identifying a plurality of microdomains for tissue samples based on the global quantification, each microdomain being associated with a tissue sample; and (iii) a weighted graph component configured for constructing a weighted graph for the multi-parameter cellular and subcellular imaging data, the weighted graph having a plurality of nodes and a plurality of edges each being located between a pair of the nodes, wherein in the weighted graph each node is a particular one of the microdomains and the edge between each pair of microdomains in the weighted graph is indicative of a degree of similarity between the pair of the microdomains.

Computational systems pathology spatial analysis platform for in situ or in vitro multi-parameter cellular and subcellular imaging data

A computational systems pathology spatial analysis platform includes: (i) a spatial heterogeneity quantification component configured for generating a global quantification of spatial heterogeneity among cells of varying phenotypes in multi-parameter cellular and subcellular imaging data; (ii) a microdomain identification component configured for identifying a plurality of microdomains for tissue samples based on the global quantification, each microdomain being associated with a tissue sample; and (iii) a weighted graph component configured for constructing a weighted graph for the multi-parameter cellular and subcellular imaging data, the weighted graph having a plurality of nodes and a plurality of edges each being located between a pair of the nodes, wherein in the weighted graph each node is a particular one of the microdomains and the edge between each pair of microdomains in the weighted graph is indicative of a degree of similarity between the pair of the microdomains.