G06V10/806

VIDEO GENERATION METHOD AND SYSTEM FOR HIGH RESOLUTION FACE SWAPPING

A video generation method includes: obtaining a target face image and a source face image; extracting a feature of each of the source face image and the target face image through a face feature encoder, to obtain corresponding source feature codes and target feature codes; generating swapped face feature codes through a face feature exchanger according to the source feature codes and the target feature codes; generating an initial swapped face image through a face generator according to the swapped face feature codes; and fusing the initial swapped face image with the target face image through a face fuser, to obtain a final swapped face image. The face feature encoder performs hierarchical encoding on the face feature to reserve semantic details of a face, and the face feature exchanger performs further processing based on the hierarchical encoding, to obtain hierarchical encoding of a swapped face feature with semantic details.

VIDEO HIGHLIGHT EXTRACTION METHOD AND SYSTEM, AND STORAGE MEDIUM

The present disclosure relates to a video highlight extraction method and system, and a storage medium. The method includes: obtaining a to-be-processed online class video and a teacher-student interaction feature and dividing the to-be-processed online class video into a plurality of target videos; respectively analysis on pictures corresponding to all frames of a target video, to obtain a visual feature set of a student and a visual feature set of a teacher in the pictures corresponding to the frames; determining timeliness of student feedback; performing speech recognition on the speech segment corresponding to the student and the speech segment corresponding to the teacher and extracting a key word, to determine fluency of language of the teacher, fluency of language of the student, and correctness of teaching knowledge; and determining a highlight in the to-be-processed online class video according to priorities of the target videos.

MULTI-SCALE DRIVING ENVIRONMENT PREDICTION WITH HIERARCHICAL SPATIAL TEMPORAL ATTENTION

In accordance with one embodiment of the present disclosure, method includes obtaining multi-level environment data corresponding to a plurality of driving environment levels, encoding the multi-level environment data at each level, extracting features from the multi-level environment data at each encoded level, fusing the extracted features from each encoded level with a spatial-temporal attention framework to generate a fused information embedding, and decoding the fused information embedding to predict driving environment information at one or more driving environment levels.

DATA FUSION AND ANALYSIS ENGINE FOR VEHICLE SENSORS

Systems and methods for data fusion and analysis of vehicle sensor data, including receiving a multiple modality input data stream from a plurality of different types of vehicle sensors, determining latent features by extracting modality-specific features from the input data stream, and aligning a distribution of the latent features of different modalities by feature-level data fusion. Classification probabilities can be determined for the latent features using a fused modality scene classifier. A tree-organized neural network can be trained to determine path probabilities and issue driving pattern judgments, with the tree-organized neural network including a soft tree model and a hard decision leaf. One or more driving pattern judgments can be issued based on a probability of possible driving patterns derived from the modality-specific features.

System and method for sensor fusion system having distributed convolutional neural network
11605228 · 2023-03-14 · ·

An early fusion network is provided that reduces network load and enables easier design of specialized ASIC edge processors through performing a portion of convolutional neural network layers at distributed edge and data-network processors prior to transmitting data to a centralized processor for fully-connected/deconvolutional neural networking processing. Embodiments can provide convolution and downsampling layer processing in association with the digital signal processors associated with edge sensors. Once the raw data is reduced to smaller feature maps through the convolution-downsampling process, this reduced data is transmitted to a central processor for further processing such as regression, classification, and segmentation, along with feature combination of the data from the sensors. In some embodiments, feature combination can be distributed to gateway or switch nodes closer to the edge sensors, thereby further reducing the data transferred to the central node and reducing the amount of computation performed there.

LANDING TRACKING CONTROL METHOD AND SYSTEM BASED ON LIGHTWEIGHT TWIN NETWORK AND UNMANNED AERIAL VEHICLE
20220332415 · 2022-10-20 ·

A landing tracking control method comprises the following contents: a tracking model training stage and an unmanned aerial vehicle real-time tracking stage. The landing tracking control method extracts a network Snet by using a lightweight feature and makes modification, so that an extraction speed of the feature is increased to better meet a real-time requirement. Weight allocation on the importance of channel information is carried out to differentiate effective features more purposefully and utilize the features, so that the tracking precision is improved. In order to improve a training effect of the network, a loss function of an RPN network is optimized, a regression precision of a target frame is measured by using CIOU, and meanwhile, calculation of classified loss function is adjusted according to CIOU, and a relation between a regression network and classification network is enhanced.

METHOD FOR GENERATING PRE-TRAINED MODEL, ELECTRONIC DEVICE AND STORAGE MEDIUM
20220335711 · 2022-10-20 ·

A method for generating a pre-trained model, includes: extracting, by each of candidate models that are selected from a model set, features from samples in a test set, to obtain features output by each of the candidate models; obtaining fusion features by fusing features output by the candidate models; obtaining prediction information by performing a preset target recognition task based on the fusion features; determining combination performance of the candidate models based on difference between the prediction information and standard information of the samples; and generating the pre-trained model based on the candidate models in response to the combination performance satisfying a preset performance index.

ELECTRONIC DEVICE CAPABLE OF IDENTIFYING INELIGIBLE OBJECT
20230076392 · 2023-03-09 ·

An electronic device for face recognition is provided. The electronic device is used to exclude an ineligible object to be identified according to the relative relationship between object distances and image sizes, the image variation with time and/or the feature difference between images captured by different cameras to prevent the possibility of cracking the face recognition by using a photo or a video.

SYSTEMS AND METHODS FOR PRIVACY-ENABLED BIOMETRIC PROCESSING
20230070649 · 2023-03-09 · ·

In one embodiment, a set of feature vectors can be derived from any biometric data, and then using a deep neural network (“DNN”) on those one-way homomorphic encryptions (i.e., each biometrics’ feature vector) can determine matches or execute searches on encrypted data. Each biometrics’ feature vector can then be stored and/or used in conjunction with respective classifications, for use in subsequent comparisons without fear of compromising the original biometric data. In various embodiments, the original biometric data is discarded responsive to generating the encrypted values. In another embodiment, the homomorphic encryption enables computations and comparisons on cypher text without decryption. This improves security over conventional approaches. Searching biometrics in the clear on any system, represents a significant security vulnerability. In various examples described herein, only the one-way encrypted biometric data is available on a given device. Various embodiments restrict execution to occur on encrypted biometrics for any matching or searching.

Track segment cleaning of tracked objects
11625909 · 2023-04-11 · ·

Provided are methods for track segment cleaning of tracked objects using neural networks, which can include detecting a first track segment and a second track segment. The method includes applying a machine learning model trained to determine if the first track segment and second track segment capture real objects and if the first track segment and the second track segment are representative of an identical object exterior to a vehicle. The method further includes combining the first track segment and the second track segment to form a single track segment having a single trajectory in response to the first track segment and the second track segment being determined to be representative of the identical object. Systems and computer program products are also provided.