G06V40/174

DEVICE AND METHOD FOR CONTROLLING DOOR LOCK
20230042025 · 2023-02-09

A door lock control device comprises: a door lock interface for communicating with a door lock; an imaging device; a controller for processing images acquired through the imaging device; and a storage medium. The controller determines each of first and second objects in the images as being either authorized or unauthorized depending on whether each of the first and second objects matches authentication data read from the storage medium. When the first object is determined as being authorized and the second object is determined as being unauthorized, the controller controls the door lock through the door lock interface by referring to a distance between the first and second objects determined from the images.
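The decision described in this abstract can be sketched as follows. This is a minimal illustration, not the claimed implementation: the `DetectedObject` type, the `should_unlock` function, the metre-based coordinates, and the rule "keep the door locked while an unauthorized object is closer than a threshold" are all assumptions, since the abstract only says that the distance is referred to.

```python
from dataclasses import dataclass
from math import hypot


@dataclass
class DetectedObject:
    """Hypothetical detection result: estimated position (metres) and match status."""
    x: float
    y: float
    authorized: bool


def should_unlock(first: DetectedObject, second: DetectedObject,
                  min_distance_m: float = 2.0) -> bool:
    """Return True if the door may be unlocked.

    Assumed policy: the first object must be authorized; if the second
    object is unauthorized, it must be at least min_distance_m away.
    """
    if not first.authorized:
        return False
    if second.authorized:
        return True
    # Distance between the two objects, as estimated from the images.
    distance = hypot(first.x - second.x, first.y - second.y)
    return distance >= min_distance_m
```

Under this assumed policy, an authorized person with an unauthorized person 0.5 m behind them would not trigger an unlock, while the same pair 5 m apart would.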

DRIVER ASSISTANCE METHOD AND DRIVER ASSISTANCE APPARATUS
20230042206 · 2023-02-09

A driver assistance method and a driver assistance apparatus are provided, which may be applied to the field of autonomous driving or intelligent driving. The driver assistance method includes: determining that a driver is in an abnormal state; determining that a first operation performed by the driver on a first terminal is an abnormal operation; and performing first processing, where the first processing includes outputting indication information and/or control information, and the indication information or the control information indicates a second operation performed on the first terminal, or the first processing includes controlling the first terminal to perform the second operation.

Systems and Methods for Assisted Translation and Lip Matching for Voice Dubbing
20230039248 · 2023-02-09

Systems and methods for generating candidate translations for use in creating synthetic or human-acted voice dubbings, aiding human translators in generating translations that match the corresponding video, automatically grading how well a candidate translation matches the corresponding video, suggesting modifications to the speed and/or timing of the translated text to improve the grading of a candidate translation, and suggesting modifications to the voice dubbing and/or video to improve the grading of a candidate translation. In that regard, the present technology may be used to fully automate the process of generating lip-matched translations and associated voice dubbings, or as an aid for human-in-the-loop processes that may reduce or eliminate the time and effort required from translators, adapters, voice actors, and/or audio editors to generate voice dubbings.

System, computer-readable non-transitory recording medium, and method for estimating psychological state of user

A system includes: a light source that emits pulsed light that illuminates a user's head portion; a photodetector that detects at least part of the pulsed light returning from the head portion and that outputs one or more signals corresponding to an intensity of the at least part; electrical circuitry; and a memory that stores an emotion model indicating a relationship between the one or more signals and emotions. Based on a change in the one or more signals, the electrical circuitry selects an emotion by referring to the model. The one or more signals include a first signal corresponding to an intensity of a first part of the reflected pulsed light and a second signal corresponding to an intensity of a second part of the reflected pulsed light. The first part includes a part before a falling period starts, and the second part includes at least a part within the falling period.
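The split of the returning pulse into two signals can be illustrated with a short sketch. It assumes the pulse is available as a list of intensity samples and that the index at which the falling period starts is already known; `split_pulse` and `fall_start` are hypothetical names, and summing the samples is only one possible way to turn each part into a signal.

```python
from typing import Sequence, Tuple


def split_pulse(samples: Sequence[float], fall_start: int) -> Tuple[float, float]:
    """Split one detected pulse into two integrated signals.

    first:  samples before the falling period starts
    second: samples from the start of the falling period onward
    fall_start is assumed to be known (e.g. from the emission timing).
    """
    if not 0 <= fall_start <= len(samples):
        raise ValueError("fall_start must lie within the sample range")
    first = sum(samples[:fall_start])
    second = sum(samples[fall_start:])
    return first, second


# Example pulse: rises, plateaus, then falls from index 4 onward.
first_signal, second_signal = split_pulse([0, 2, 4, 4, 3, 1, 0], fall_start=4)
```

How the two signals map to an emotion depends on the stored emotion model, which the abstract does not specify, so that step is left out here.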

Systems and methods for providing real-time surveillance in automobiles
11558584 · 2023-01-17

Techniques for providing real-time vehicle surveillance are disclosed. An in-vehicle surveillance device continuously captures images from the surroundings of a vehicle and the interior of the vehicle and transmits them to a surveillance management system. The images are processed in real time using machine learning modules to determine primary, secondary, and adverse events. Upon determining the events, alerts are generated and sent to a display unit provided on the in-vehicle surveillance device to improve the safety of the passengers. The techniques further allow vehicle-to-vehicle communication and vehicle-to-third-party-device communication upon determining an event.

Personalized videos featuring multiple persons

Provided are systems and methods for personalized videos featuring multiple persons. An example method includes receiving a user selection of a video having at least one frame with metadata that include a first location and a second location and receiving an image of a source face and a further image of a further source face, modifying the image of the source face to generate an image of a modified source face and modifying the further image of the further source face to generate an image of a modified further source face, inserting, in the at least one frame of the video, the image of the modified source face at the first location and the image of the modified further source face at the second location to generate a personalized video, and sending the personalized video via a communication chat.

Artificial intelligence robot and method of controlling the same
11557387 · 2023-01-17

An artificial intelligence (AI) robot includes a body for defining an exterior appearance and containing a medicine to be discharged according to a medication schedule, a support, an image capture unit for capturing an image within a traveling zone to create image information, and a controller for discharging the medicine to a user according to the medication schedule, reading image data of the user to determine whether the user has taken the medicine, and reading image data and biometric data of the user after the medicine is taken to determine whether there is an abnormality in the user. The AI robot identifies a user and discharges a medicine matched with the user, so as to prevent errors. The AI robot detects the user's reaction after the medicine is taken through a sensor and performs deep learning or the like to learn the user's reaction, determine whether an emergency situation has arisen, and respond according to the result of the determination.

System, device, and method for generating and utilizing content-aware metadata
11557121 · 2023-01-17

System, device, and method for generating and utilizing content-aware metadata, particularly for playback of video and other content items. A method includes: receiving a video file, and receiving content-aware metadata about visual objects that are depicted in said video file; and dynamically adjusting or modifying playback of that video file, on a video playback device, based on the content-aware metadata. The modifications include content-aware cropping, summarizing, watermarking, overlaying of other content elements, modifying playback speed, adding user-selectable indicators or areas around or near visual objects to cause a pre-defined action upon user selection, or other adjustments or modifications. Optionally, a modified and content-aware version of the video file is automatically generated or stored. Optionally, the content-aware metadata is stored internally or integrally within the video file, in its header or as a private channel; or is stored in an accompanying file.
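Content-aware cropping, one of the modifications listed above, can be sketched as follows. The example assumes the metadata supplies a bounding box `(x, y, w, h)` in pixels for the object of interest; `crop_window` and its parameters are hypothetical names, and centring the crop on the object and clamping it to the frame is just one plausible strategy, not the method the patent claims.

```python
from typing import Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height) in pixels


def crop_window(frame_w: int, frame_h: int, box: Box,
                crop_w: int, crop_h: int) -> Box:
    """Return a crop rectangle of size crop_w x crop_h centred on the
    object's bounding box and clamped so it stays inside the frame."""
    if crop_w > frame_w or crop_h > frame_h:
        raise ValueError("crop size must not exceed frame size")
    x, y, w, h = box
    centre_x = x + w / 2
    centre_y = y + h / 2
    # Clamp the top-left corner to keep the window within the frame.
    left = min(max(centre_x - crop_w / 2, 0), frame_w - crop_w)
    top = min(max(centre_y - crop_h / 2, 0), frame_h - crop_h)
    return int(left), int(top), crop_w, crop_h


# Portrait crop of a 1920x1080 frame around an object near the right edge.
window = crop_window(1920, 1080, (1700, 400, 200, 200), 608, 1080)
```

A playback device could evaluate such a function per frame (or per metadata keyframe) to follow the object when showing landscape video on a portrait screen.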

Automatic image-based skin diagnostics using deep learning

There is shown and described a deep learning based system and method for skin diagnostics as well as testing metrics that show that such a deep learning based system outperforms human experts on the task of apparent skin diagnostics. Also shown and described is a system and method of monitoring a skin treatment regime using a deep learning based system and method for skin diagnostics.

Employment recruitment method based on face recognition and terminal device using same

An employment recruitment method based on face recognition includes acquiring a candidate's data from a third-party website, analyzing the candidate's data by a semantic analysis method to identify human resources information of the candidate, and analyzing messages and postings in the human resources information of the candidate to determine the candidate's personality. A terminal device acquires a second face image of the candidate by a second camera, analyzes the second face image of the candidate by a computer vision algorithm to determine a micro-expression of the candidate, and provides the candidate's human resources information, the candidate's personality, and the candidate's micro-expression to the recruiter to evaluate the candidate. The terminal device applying the method is also disclosed.