G06V2201/10

GEOGRAPHIC MANAGEMENT OF DOCUMENT CONTENT
20230215207 · 2023-07-06 ·

Methods and systems are provided to manage documents and extract information from documents by defining segments in each document, each of which is assigned a location in a coordinate system defined over a collection of documents. Metadata is attached to each segment to describe the contents, position, and semantic meaning of material within the segment. A segmenting-specific query language can be used to query the segments and respond to requests for information contained in the documents.

System and method for providing an interactive visual learning environment for creation, presentation, sharing, organizing and analysis of knowledge on subject matter
11551567 · 2023-01-10 · ·

The embodiments herein disclose a system and a method for providing an online web-based interactive audio-visual platform for note creation, presentation, sharing, organizing, and analysis. The system provides a conceptual and interactive interface to content; analyses a student's notes and instantly determines the accuracy of the conceptual connections made and a student's understanding of a topic. The system enables the student to add and use audio, visual, drawing, text notes, and mathematical equations in addition to those suggested by the note taking solution; to collate notes from various sources in a meaningful manner by grouping concepts using colors, images, and text; and to personalize other maps developed within the same environment while maintaining links back to the original source from which the notes are derived. The system highlights keywords in conjunction with spoken text to complement the advantages of using visual maps to improve learning outcomes.

Visualizing machine learning predictions of human interaction with vehicles
11551030 · 2023-01-10 · ·

A computing device accesses video data displaying one or more traffic entities and generates a plurality of sequences from the video data. For each sequence, the computing device identifies a plurality of stimuli in the sequence and applies a machine learning model to generate an output describing the traffic entity. The computing device generates a data structure for storing, for each sequence, information describing the sequence and linking frame indexes of stimuli from the sequence to outputs of the machine learning model. The computing device stores the data structure in association with the video data. Responsive to receiving a selection of a sequence, the computing device loads video data for the sequence. Responsive to receiving a selection of a traffic entity within the video data, the computing device generates a graphical display element including the machine learning model output for the selected traffic entity.

Integrated event processing and policy enforcement
11550692 · 2023-01-10 · ·

A method may include receiving an event from an event source. The event may correspond to event data. The event source may be a container executing an image. The image may correspond to image metadata including attributes describing the image. The method may further include combining the event data with the image metadata to obtain enriched data, detecting, using the enriched data, a deviation from a policy, and in response to detecting the deviation from the policy, performing an action to enforce the policy.

System and method for controlling content upload on a network
11693928 · 2023-07-04 · ·

A system and method for protecting copyright in content distributed online, in combination with specified business rules. A portion of content presented for upload on a network is analyzed to detect an image associated with a content owner; the image is compared with reference images to identify the content owner; and business rules are applied to control unauthorized uploading of the content. The identifier may be a logo included in the content as a digital graphic, or a non-visual marker. Analysis is advantageously performed on a sample of video frames or a segment of preselected length. If the content is found to be copyrighted, and the attempted upload is unauthorized, uploading may or may not be permitted, and the user may or may not be charged a fee for subsequent access to the content.

Systems and methods for geolocation prediction
11693901 · 2023-07-04 · ·

In one example embodiment, a computer-implemented method for extracting information from imagery includes obtaining data representing a sequence of images, at least one of the sequence of images depicting an object. The method includes inputting the sequence of images into a machine-learned information extraction model that is trained to extract location information from the sequence of images. The method includes obtaining as an output of the information extraction model in response to inputting the sequence of images, data representing a real-world location associated with the object depicted in the sequence of images.

Image processing apparatus that sets metadata of image data, method of controlling same, and storage medium
11694458 · 2023-07-04 · ·

An image processing apparatus that enables easy setting of metadata of image data. The image processing apparatus obtains image data associated with a selected work. A key candidate is identified from t image data based on one or more key types defined according to the selected work. A value candidate corresponding to the identified key candidate is identified based on a value type rule and a value search area rule which are defined for each of the one or more key types, and the identified value candidate is set as the metadata of the image data.

Personalized conversational recommendations by assistant systems

In one embodiment, a method includes receiving a user request from a client system associated with a user, generating a response to the user request which references one or more entities, generating a personalized recommendation based on the user request and the response, wherein the personalized recommendation references one or more of the entities of the response, and sending instructions for presenting the response and the personalized recommendation to the client system.

Video analytics scene classification and automatic camera configuration based automatic selection of camera profile

Example implementations include a method, apparatus and computer-readable medium for configuring profiles for a camera, comprising receiving video from the camera. The implementations further include classifying a first scene of the first video stream. Additionally, the implementations further include determining a first metadata for the first scene. Additionally, the implementations further include selecting a first profile for the camera based on the first metadata, wherein the first profile comprises one or more configuration parameters, wherein values of each of the one or more configuration parameters of the first profile are based on the first metadata. Additionally, the implementations further include configuring the camera with the first profile.

IMAGE FORGERY DETECTION VIA PIXEL-METADATA CONSISTENCY ANALYSIS

Systems and/or techniques for facilitating image forgery detection via pixel-metadata consistency analysis are provided. In various embodiments, a system can receive an electronic image from a client device. In various cases, the system can obtain a pixel vector and/or an image metadata vector that correspond to the electronic image. In various aspects, the system can determine whether the electronic image is authentic or forged, based on analyzing the pixel vector and the image metadata vector via at least one machine learning model.