G06V10/768

REPEATABILITY PREDICTIONS OF INTEREST POINTS
20220351518 · 2022-11-03 ·

The present disclosure describes approaches for evaluating interest points for localization uses based on a repeatability of the detection of the interest point in images capturing a scene that includes the interest point. The repeatability of interest points is determined by using a trained repeatability model. The repeatability model is trained by analyzing a time series of images of a scene and determining repeatability functions for each interest point in the scene. The repeatability function is determined by identifying which images in the time series of images allowed for the detection of the interest point by an interest point detection model.

INTELLIGENT GENERATION AND MANAGEMENT OF ESTIMATES FOR APPLICATION OF UPDATES TO A COMPUTING DEVICE

The present disclosure is directed to automated generation and management of update estimates relative to application of an update to a computing device. One or more updated to be applied to a computing device are identified. A trained artificial intelligence (AI) model is applied that is adapted to generate an update estimate predicting an amount of time that is required to apply an update to the computing device. An update estimate is generated based on a contextual analysis that evaluates one or more of: parameters associated with the update; device characteristics of the computing device to be updated; a state of current user activity on the computing device; historical predictions relating to prior update estimates for one or more computing devices (e.g., that comprise the computing device); or a combination thereof. A notification of the update estimate is then automatically generated and caused to be rendered.

N-BEST SOFTMAX SMOOTHING FOR MINIMUM BAYES RISK TRAINING OF ATTENTION BASED SEQUENCE-TO-SEQUENCE MODELS

A method and apparatus are provided that analyzing sequence-to-sequence data, such as sequence-to-sequence speech data or sequence-to-sequence machine translation data for example, by minimum Bayes risk (MBR) training a sequence-to-sequence model and within introduction of applications of softmax smoothing to an N-best generation of the MBR training of the sequence-to-sequence model.

ANOMALY DETECTION IN EVENT-BASED SYSTEMS USING IMAGE PROCESSING
20220351001 · 2022-11-03 · ·

At least one processor may capture a plurality of image snapshots containing information about a monitored system at a plurality of sequential times, each snapshot having the same vertical and horizontal dimensions. The processor may label the plurality of image snapshots as indicative of an event that took place in the monitored system, may receive additional data describing the event, may cluster the labeled plurality of image snapshots and the additional data using at least one machine learning clustering algorithm, and may merge the clustered plurality of image snapshots and the clustered additional data into merged data. The processors may create a model by processing the merged data using at least one neural network, the model being configured to detect future events of a same type as the event in the monitored system. The processor may store the model in a memory in communication with the processor.

VIRTUAL REALITY ENVIRONMENT BASED MANIPULATION OF MULTI-LAYERED MULTI-VIEW INTERACTIVE DIGITAL MEDIA REPRESENTATIONS

Various embodiments of the present disclosure relate generally to systems and methods for generating multi-view interactive digital media representations in a virtual reality environment. According to particular embodiments, a plurality of images is fused into a first content model and a first context model, both of which include multi-view interactive digital media representations of objects. Next, a virtual reality environment is generated using the first content model and the first context model. The virtual reality environment includes a first layer and a second layer. The user can navigate through and within the virtual reality environment to switch between multiple viewpoints of the content model via corresponding physical movements. The first layer includes the first content model and the second layer includes a second content model and wherein selection of the first layer provides access to the second layer with the second content model.

Detection device, detection method, and recording medium for detecting an object in an image

An information processing device is an information processing device including a processor. The processor obtains a detection result of a first detector for detecting a first target in first sensing data; and based on the detection result of the first detector, determines a setting of processing by a second detector for detecting a second target in second sensing data next in an order after the first sensing data, the second target being different from the first target.

SYSTEMS, METHODS, AND APPARATUSES FOR GENERATING PRE-TRAINED MODELS FOR nnU-Net THROUGH THE USE OF IMPROVED TRANSFER LEARNING TECHNIQUES
20230072400 · 2023-03-09 ·

Described herein are means for generating pre-trained models for nnU-Net through the use of improved transfer learning techniques, in which the pre-trained models are then utilized for the processing of medical imaging. According to a particular embodiment, there is a system specially configured for segmenting medical images, in which such a system includes: a memory to store instructions; a processor to execute the instructions stored in the memory; wherein the system is specially configured to: execute instructions via the processor for executing a pre-trained model from Models Genesis within a nnU-Net framework; execute instructions via the processor for learning generic anatomical patterns within the executing Models Genesis through self-supervised learning; execute instructions via the processor for transforming an original image using distortion and cutout-based methods; execute instructions via the processor for learning the reconstruction of the original image from the transformed image using an encoder-decoder architecture of the nnU-Net framework to identify the generic anatomical representation from the transformed image by recovering the original image; and wherein architecture determined by the nnU-Net framework is utilized with Models Genesis and is trained to minimize the L2 distance between the prediction and ground truth. Other related embodiments are disclosed.

Image processing apparatus, control method therefor, and storage medium
11600090 · 2023-03-07 · ·

An image processing apparatus includes a character recognition processing unit configured to execute character recognition processing on the image data, an acquisition unit configured to acquire one or more character string blocks included in the image data, from the image data, a selection unit configured to select a character string block to be used for setting of a file name, from among the one or more character string blocks acquired by the acquisition unit, and a setting unit configured to set the file name of image data by using a character recognition result of the character recognition processing unit for the character string block selected by the selection unit.

Contextual self-checkout based verification
11599864 · 2023-03-07 · ·

A transaction is detected on a terminal. A context for an association of items, bags, shelving, carts, and individuals present during the transaction is maintained from images captured during the transaction. Rules are processed to determine actions to process during the transaction and/or after the transaction based on reported transaction details, the context, and analytics derived from the images for the context. In an embodiment, at least one rule identifies an action that overrides an attendant intervention at the terminal, which would have been issued for a detected weight discrepancy by a scale of the terminal.

Dynamic virtual background selection for video communications
11475615 · 2022-10-18 ·

Methods, systems, and apparatus, including computer programs encoded on computer storage media relate to a method for providing video communications with dynamic virtual backgrounds within a communication platform. Based on the contextual information of a meeting, the system selects a virtual background to be used for the meeting. During, a video meeting, the system generates for display, on one or more client devices, a composite video depicting the imagery of the user overlaid on imagery of the selected virtual background.