G06V10/7788

ROAD SIGN CONTENT PREDICTION AND SEARCH IN SMART DATA MANAGEMENT FOR TRAINING MACHINE LEARNING MODEL

Systems and method for machine-learning assisted road sign content prediction and machine learning training is disclosed. A sign detector model processes images or video with road signs. A visual attribute prediction model extracts visual attributes of the sign in the image. The visual attribute prediction model can communicate with a knowledge graph reasoner to validate the visual attribute prediction model by applying various rules to the output of the visual attribute prediction model. A plurality of potential sign candidates are retrieved that match the visual attributes of the image subject to the visual attribute prediction model, and the rules help to reduce the list of potential sign candidates and improve accuracy of the model.

Devices and methods for accurately identifying objects in a vehicle's environment

Vehicle navigation control systems in autonomous driving rely on accurate predictions of objects within the vicinity of the vehicle to appropriately control the vehicle safely through its surrounding environment. Accordingly this disclosure provides methods and devices which implement mechanisms for obtaining contextual variables of the vehicle's environment for use in determining the accuracy of predictions of objects within the vehicle's environment.

Feature-based search

Various embodiments of systems and methods allow a system to identify subsets of items by mixing and matching identified features in one or more other items. A system can identify features of items in an item database. The system can then calculate “fingerprints” of these features which are vectors describing the characteristics of the features. The system can present a collection of items and a user can select an item of the collection. The user can then select positive features to include in a search and/or negative features to include in the search. The system can then do a search of the database for items that contain features similar to those positive features and do not contain features similar to those negative features. The user can select features through a variety of means.

Target model broker
11587323 · 2023-02-21 · ·

A machine accesses a set of image target models, each image target model being associated with model parameters, the model parameters comprising at least an operational domain, an expected input image quality, and an expected orientation. The machine receives an image for processing by one or more image target models from the set, the image including metadata specifying image parameters of the received image. The machine identifies, based on the image parameters in the metadata of the received image and the model parameters of one or more models in the set, a first subset of the set of image target models including image target models that are capable of processing the received image. The machine provides the received image to at least one image target model in the first subset.

TIME-LINE BASED OBJECT TRACKING ANNOTATION

Methods, computer systems, and apparatus, including computer programs encoded on computer storage media, for generating and editing object track labels for objects detected in video data. One of the methods includes obtaining a video segment comprising multiple image frames associated with multiple time points; obtaining object track data specifying a set of object tracks; providing, for presentation to a user, a user interface for modifying the object track data, the user interface displaying object timeline representations of the object tracks; receiving one or more user inputs that indicate one or more modifications to the object timeline representations; updating the object timeline representations displayed in the timeline display area; and updating the object track data according to the updated object timeline representations.

Analyzing, classifying, and restricting user-defined annotations

The present disclosure relates to systems, non-transitory computer-readable media, and methods for customizing a set of restrictions for a hashtag or other user-defined annotation that violates guidelines or rules of an online resource based on automated and manual review. In particular, in one or more embodiments, the disclosed systems trigger manual review of user-defined annotations in a social networking system, determine various metrics based on both manual and automated review of content including a particular user-defined annotation, and generate a customized set of restrictions for the user-defined annotation based on those metrics. More specifically, the system can generate and utilize various manual review metrics and a moderated media metric to generate a custom set of restrictions for a user-defined annotation.

Systems and methods for stream recognition

The present disclosure provides systems and methods for providing augmented reality experiences. Consistent with disclosed embodiments, one or more machine-learning models can be trained to selectively process image data. A pre-processor can be configured to receive image data provided by a user device and trained to automatically determine whether to select and apply a preprocessing technique to the image data. A classifier can be trained to identify whether the image data received from the pre-processor includes a match to one of a plurality of triggers. A selection engine can be trained to select, based on a matched trigger and in response to the identification of the match, a processing engine. The processing engine can be configured to generate an output using the image data, and store the output or provide the output to the user device or a client system.

VIDEO PROCESSING METHOD, VIDEO PROCESSING APPARATUS, AND COMPUTER-READABLE STORAGE MEDIUM
20230099444 · 2023-03-30 ·

This disclosure relates to a video processing method, a video processing apparatus, and a computer-readable storage medium. The video processing method includes: providing a first user with an interactive interface for tagging people in a video; receiving a tagging operation on at least one people in the video, which is inputted by the first user through the interactive interface; and in response to the tagging operation of the first user, displaying a tagging result in-feed outside a video display interface when the video is posted on a social network.

SYSTEM AND METHOD FOR REDUCING SURVEILLANCE DETECTION ERRORS

A method is disclosed. The method includes providing an imaging apparatus, recording image data of an imaging location using the imaging apparatus, displaying the image data to a user via a user device, selecting an image object from the image data based on a selection criteria, and determining whether or not a selection criteria error of the image object is to be checked. The method also includes displaying a bounding shape, which bounds the image object, to the user via the user device when the selection criteria error is to be checked, prompting the user to enter user input indicating whether or not the selection criteria error is present, and storing data of the image object in a cache when the user input indicates that the selection criteria error is present.

Motion-based human video detection

Methods, systems, and apparatus for motion-based human video detection are disclosed. A method includes generating a representation of a difference between two frames of a video; providing, to an object detector, a particular frame of the two frames and the representation of the difference between two frames of the video; receiving an indication that the object detector detected an object in the particular frame; determining that detection of the object in the particular frame was a false positive detection; determining an amount of motion energy where the object was detected in the particular frame; and training the object detector based on penalization of the false positive detection in accordance with the amount of motion energy where the object was detected in the particular frame.