G06V10/7635

METHOD AND SYSTEM FOR GRAPH NEURAL NETWORK BASED PEDESTRIAN ACTION PREDICTION IN AUTONOMOUS DRIVING SYSTEMS

The present disclosure relates to methods and systems for spatiotemporal graph modelling of road users in observed frames of an environment in which an autonomous vehicle operates (i.e. a traffic scene), clustering of the road users into categories, and providing the spatiotemporal graph to a trained graphical convolutional neural network (GNN) to predict a future pedestrian action. The future pedestrian action can be: one of the pedestrian will cross a road and the pedestrian will not cross the road. The spatiotemporal graph includes a better understanding of the observed frames (i.e. traffic scene).

Intelligent video analysis
11328510 · 2022-05-10 · ·

An apparatus is provided. The apparatus receives a video feed and processes the video feed in real-time as the video feed is received. The apparatus performs object detection and recognition on the video feed to detect and classify objects therein, performs activity recognition to detect and classify activities of at least some of the objects, and outputs classified objects and classified activities in the video feed. The apparatus generates natural language text that describes the video feed, produces a semantic network, and stores the video feed, classified objects and classified activities, natural language text, and semantic network in a knowledge base. The apparatus generates a graphical user interface (GUI) configured to enable queries of the knowledge base, and presentation of selections of the video feed, classified objects and classified activities, natural language text, and semantic network.

PHOTOGRAPH CONTENT CLUSTERING FOR DIGITAL PICTURE FRAME DISPLAY

A method for automated routing of pictures taken on mobile electronic devices to a digital picture frame including a camera integrated with the frame, and a network connection module allowing the frame for direct contact and upload of photos from electronic devices or from photo collections of community members. The integrated camera is used to automatically determine an identity of a frame viewer and can capture gesture-based feedback. The displayed photos are automatically shown and/or changed according to the detected viewers. The photos can be filtered and cropped at the receiver side. Clustering photos by content is used to improve display and to respond to photo viewer desires.

Computer-vision techniques for time-series recognition and analysis
11321954 · 2022-05-03 · ·

Some examples herein describe time-series recognition and analysis techniques with computer vision. In one example, a system can access an image depicting data lines representing time series datasets. The system can execute a clustering process to assign pixels in the image to pixel clusters. The system can generate image masks based on attributes of the pixel clusters, and identify a respective set of line segments defining the respective data line associated with each image mask. The system can determine pixel sets associated with the time series datasets based on the respective set of line segments associated with each image mask, and provide one or more pixel sets as input for a computing operation that processes the pixel sets and returns a processing result. The system may then display the processing result on a display device or perform another task based on the processing result.

Generating Occurrence Contexts for Objects in Digital Content Collections

In implementations of systems for generating occurrence contexts for objects in digital content collections, a computing device implements a context system to receive context request data describing an object that is depicted with additional objects in digital images of a digital content collection. The context system generates relationship embeddings for the object and each of the additional objects using a representation learning model trained to predict relationships for objects. A relationship graph is formed for the object that includes a vertex for each relationship between the object and the additional objects indicated by the relationship embeddings. The context system clusters the vertices of the relationship graph into contextual clusters that each represent an occurrence context of the object in the digital images of the digital content collection. The context system generates, for each contextual cluster, an indication of a respective occurrence context for the object for display in a user interface.

TECHNIQUES FOR DETERMINING TISSUE CHARACTERISTICS USING MULTIPLEXED IMMUNOFLUORESCENCE IMAGING

Techniques for processing multiplexed immunofluorescence (MxIF) images. The techniques include obtaining at least one MxIF image of a same tissue sample, obtaining information indicative of locations of cells in the at least one MxIF image, identifying multiple groups of cells in the at least one MxIF image at least in part by determining feature values for at least some of the cells using the at least one MxIF image and the information indicative of locations of the at least some cells in the at least one MxIF image and grouping the at least some of the cells into the multiple groups using the determined feature values, and determining at least one characteristic of the tissue sample using the multiple cell groups.

Image processing using generative graphical models
11308368 · 2022-04-19 · ·

An image processing technique is presented using a hierarchical image model. The technique may be used as a precursor to subsequent image processing, to fix detected images in a post processing stage or as a segmentation or classification stage. The techniques may also be applied to super resolution. In a first layer of the hierarchical image model, each observed pixel of the image has a representation allocated to one or more input node. A set of the input nodes are assigned to a hidden node of a second layer, and a duplicate set of input nodes of the first layer is assigned to a duplicate of the hidden node in the second layer. In this way, a dense latent tree is formed in which a subtree is duplicated. Variables are assigned to the input nodes, the hidden node and the duplicate nodes and recurringly modified to process the image. Belief propagation messages may be utilised to recursively modify the variables. An image processing system using the method is described. A planning system for an autonomous vehicle comprising the image processing system is described.

Systems and methods for identifying electronic content using video graphs

Systems and methods are provided for identifying and recommending electronic content to consumers. In accordance with an implementation, one or more elements of electronic content are identified based on video graph data. In an exemplary method, information associated with a first element of video content is received, and corresponding video graph data is obtained. One or more second elements of video content that are similar to the first element of video content are identified based on the obtained video graph data. A subset the first and second elements of video content is subsequently identified for delivery to the user.

Rapid point cloud alignment and classification with basis set learning

A system configured to process an input point cloud, which represents an object using unstructured data points, to generate a feature vector that has an ordered structure and a fixed length. The system may process the input point cloud using a basis point set to generate the feature vector. For example, for each basis point in the basis point set, the system may identify a closest data point in the point cloud data and store a distance value or other information associated with the closest data point in the feature vector. The system may process the feature vector using a trained model to generate output data, such as performing point cloud registration to generate mesh data, point cloud classification to generate classification data, and/or the like.

Tracking objects in sequences of digital images

A system for tracking objects in a temporal sequence of digital images is configured to: detect potential objects in the images, the detected potential objects being indicated as nodes, identify pairs of neighboring nodes, such that for each pair the nodes of said pair potentially represent an identical object and their spatial and/or temporal relationship with each other is within a predetermined range, connect each pair of neighboring nodes with a first type edge, identify at least one supplementary pair of distant nodes whose spatial and/or temporal relationship with each other exceeds the predetermined range, connect the pair of distant nodes with a supplementary second type edge, each of the first and second type edges being assigned a cost value, and determine a track of an object in the temporal sequence of digital images based on a set of connected first type edges and at least one second type edge.