G06V40/103

Adaptive interface for screen-based interactions
11561806 · 2023-01-24 ·

Systems and methods for customizing an output based on user data are described herein. An example method for customizing an output based on user data may commence with continuously capturing, by at least one sensor, the user data. The method may continue with analyzing, by at least one computing resource, the user data received from the at least one sensor and determining dependencies between the user data and output data. The method may further include determining, based on predetermined criteria, that an amount of the user data and the dependencies is sufficient to customize the output data. The method may continue with continuously customizing, by an adaptive interface, the output data using at least one machine learning technique based on the analysis of the user data. The customized output data may be intended to elicit a personalized change.

Ergonomic man-machine interface incorporating adaptive pattern recognition based control system

An adaptive interface for a programmable system, for predicting a desired user function, based on user history, as well as machine internal status and context. The apparatus receives an input from the user and other data. A predicted input is presented for confirmation by the user, and the predictive mechanism is updated based on this feedback. Also provided is a pattern recognition system for a multimedia device, wherein a user input is matched to a video stream on a conceptual basis, allowing inexact programming of a multimedia device. The system analyzes a data stream for correspondence with a data pattern for processing and storage. The data stream is subjected to adaptive pattern recognition to extract features of interest to provide a highly compressed representation which may be efficiently processed to determine correspondence. Applications of the interface and system include a VCR, medical device, vehicle control system, audio device, environmental control system, securities trading terminal, and smart house. The system optionally includes an actuator for effecting the environment of operation, allowing closed-loop feedback operation and automated learning.

VOLUMETRIC CAPTURE AND MESH-TRACKING BASED MACHINE LEARNING 4D FACE/BODY DEFORMATION TRAINING
20230230304 · 2023-07-20 ·

Mesh-tracking based dynamic 4D modeling for machine learning deformation training includes: using a volumetric capture system for high-quality 4D scanning, using mesh-tracking to establish temporal correspondences across a 4D scanned human face and full-body mesh sequence, using mesh registration to establish spatial correspondences between a 4D scanned human face and full-body mesh and a 3D CG physical simulator, and training surface deformation as a delta from the physical simulator using machine learning. The deformation for natural animation is able to be predicted and synthesized using the standard MoCAP animation workflow. Machine learning based deformation synthesis and animation using standard MoCAP animation workflow includes using single-view or multi-view 2D videos of MoCAP actors as input, solving 3D model parameters (3D solving) for animation (deformation not included), and given 3D model parameters solved by 3D solving, predicting 4D surface deformation from ML training.

Enhanced animation generation based on motion matching using local bone phases

Systems and methods are provided for enhanced animation generation based on using motion mapping with local bone phases. An example method includes accessing first animation control information generated for a first frame of an electronic game including local bone phases representing phase information associated with contacts of a plurality of rigid bodies of an in-game character with an in-game environment. Executing a local motion matching process for each of the plurality of local bone phases and generating a second pose of the character model based on the plurality of matched local poses for a second frame of the electronic game.

Visual, depth and micro-vibration data extraction using a unified imaging device
11706377 · 2023-07-18 · ·

A unified imaging device used for detecting and classifying objects in a scene including motion and micro-vibrations by receiving a plurality of images of the scene captured by an imaging sensor of the unified imaging device comprising a light source adapted to project on the scene a predefined structured light pattern constructed of a plurality of diffused light elements, classifying object(s) present in the scene by visually analyzing the image(s), extracting depth data of the object(s) by analyzing position of diffused light element(s) reflected from the object(s), identifying micro-vibration(s) of the object(s) by analyzing a change in a speckle pattern of the reflected diffused light element(s) in at least some consecutive images and outputting the classification, the depth data and data of the one or more micro-vibrations which are derived from the analyses of images captured by the imaging sensor and are hence inherently registered in a common coordinate system.

Information processing apparatus, information processing method, and program
11704934 · 2023-07-18 · ·

An information processing apparatus (100) includes an acquisition unit (122) that acquires a first image from which person region feature information regarding a region including other than a face of a retrieval target person is extracted, a second image in which a collation result with the person region feature information indicates a match, and a facial region is detected, and result information indicating a collation result between face information stored in a storage unit and face information extracted from the facial region, and a display processing unit (130) that displays at least two of the first image, the second image, and the result information on an identical screen.

Methods and systems for operating a moving platform to determine data associated with a target person or object
11703863 · 2023-07-18 ·

Methods and systems for operating a moving platform to locate a known target at an area associated with the target are disclosed. In an example method to locate the target at the area, a first moving platform, configured with a first type of sensor, is caused to move to the area. An attempt is made to locate, via the first moving platform and the first type of sensor, the target at the area. Based on the attempt, a second moving platform, configured with a second type of sensor, is caused to move to the area. The target is located via the second moving platform and the second type of sensor.

Depth-based object re-identification

An object re-identifier. For each of a plurality of frames of a video, a quality of the frame is assessed and a confidence that a previously-recognized object is present in the frame is determined. The determined confidence for the frame is weighted based on the assessed quality of the frame such that frames with higher relative quality are weighted more heavily than frames with lower relative quality. An overall confidence that the previously-recognized object is present in the video is assessed based on the weighted determined confidences.

IMAGE PROCESSING DEVICE, AND NON-TRANSITORY COMPUTER-READABLE MEDIUM

A reception interface receives image data corresponding to an image in which a person is captured. In a case where at least one of feature points belonging to a left upper limb group is estimated to be included in a hidden body part and at least one of the feature points belonging to a right upper limb group is estimated to be included in the hidden body part, one of all the feature points belonging to the left upper limb group and all the feature points belonging to the right upper limb group are handled as feature points included in the hidden body part, and the other one of all the first feature points and all the second feature points are handled as feature points included in a non-hidden body part.

Modifying virtual content to invoke a target user state

In one implementation, a method includes: while presenting reference CGR content, obtaining a request from a user to invoke a target state for the user; generating, based on a user model and the reference CGR content, modified CGR content to invoke the target state for the user; presenting the modified CGR content; after presenting the modified CGR content, determining a resultant state of the user; in accordance with a determination that the resultant state of the user corresponds to the target state for the user, updating the user model to indicate that the modified CGR content successfully invoked the target state for the user; and in accordance with a determination that the resultant state of the user does not correspond to the target state for the user, updating the user model to indicate that the modified CGR content did not successfully invoke the target state for the user.