IPIQ

G06V20/40

APPARATUS OF SELECTING VIDEO CONTENT FOR AUGMENTED REALITY, USER TERMINAL AND METHOD OF PROVIDING VIDEO CONTENT FOR AUGMENTED REALITY

20230051112 · 2023-02-16 ·

Jinwook BAEK

A video content selecting apparatus for augmented reality is provided. The apparatus includes a communication interface; and an operation processor configured to perform: (a) collect a plurality of video contents through the Internet; (b) extract feature information and metadata for each of the plurality of video contents, and generate a hash value corresponding to the feature information by using a predetermined hashing function; (c) manage a database to include at least the hash value and the metadata of each of the plurality of video contents; (d) receive object information corresponding to an object in a real-world environment from a user terminal through the communication interface; (e) search the database based on the object information and select a video content corresponding to the object information from among the plurality of video contents; and (f) transmit the metadata of the selected video content to the user terminal through the communication interface.

ENVIRONMENTALLY AWARE PREDICTION OF HUMAN BEHAVIORS

20230048304 · 2023-02-16 ·

A behavior prediction system predicts human behaviors based on environment-aware information such as camera movement data and geospatial data. The system receives sensor data of a vehicle reflecting a state of the vehicle at a given time and a given location. The system determines a field of concern in images of a video stream and determines one or more portions of images of the video stream that correspond to the field of concern. The system may apply different levels of processing powers to objects in the images based on whether an object is in the field of concern. The system then generates features of objects and identify VRUs from the objects of the video stream. For the identified VRUs, the system inputs a representation of the VRUs and the features into a machine learning model, and outputs from the machine learning model a behavioral risk assessment of the VRUs.

ONE-TOUCH SPATIAL EXPERIENCE WITH FILTERS FOR AR/VR APPLICATIONS

20230049175 · 2023-02-16 ·

A method to assess user condition for wearable devices using electromagnetic sensors is provided. The method includes receiving a signal from an electromagnetic sensor, the signal being indicative of a health condition of a user of a wearable device, selecting a salient attribute from the signal, and determining, based on the salient attribute, the health condition of the user of the wearable device. A non-transitory, computer-readable medium storing instructions which, when executed by a processor, cause a system to perform the above method, and the system, are also provided.

ONE-TOUCH SPATIAL EXPERIENCE WITH FILTERS FOR AR/VR APPLICATIONS

20230049175 · 2023-02-16 ·

FEW-SHOT ACTION RECOGNITION

20230049770 · 2023-02-16 ·

Methods and systems of training a neural network include training a feature extractor and a classifier using a first set of training data that includes one or more base cases. The classifier is trained with few-shot adaptation using a second set of training data, smaller than the first set of training data, while keeping parameters of the feature extractor constant.

SECURITY ECOSYSTEM

20230046880 · 2023-02-16 ·

A system, method, and apparatus for implementing workflows across multiple differing systems and devices are provided herein. During operation a workflow is automatically generated upon the detection of new signage. In particular, a workflow server will detect the presence of new signage in a particular area. The new signage will be analyzed, and an appropriate trigger and action will be determined based on the new signage. The appropriate trigger and action will then be implemented as a newly-created workflow.

METHOD FOR OPTIMIZING PROCESS OF DISPLAYING VIDEO STREAMS WITH SPECIFIED EVENT, APPARATUS EMPLOYING METHOD, AND COMPUTER READABLE STORAGE MEDIUM

20230046816 · 2023-02-16 ·

A method for optimizing a process of displaying video streams with specified event receives video streams. The video streams are sequenced based on a specified arrangement role to from a video stream queue. By analyzing each video streams, whether each video stream includes a specified event is determined. If each video stream without the specified event, the video streams are outputted based on the video stream queue. If the video stream includes the specified event, the video stream with the specified event is adjusted to be priority in the video stream queue, and the video streams are outputted based on the updated video stream queue. The video streams with the specified event can be prioritized processed and focus. A video stream processing apparatus and a computer readable storage medium applying the method are also provided.

Person replacement utilizing deferred neural rendering

11582519 · 2023-02-14 ·

Amazon Technologies, Inc.

Techniques are disclosed for performing video synthesis of audiovisual content. In an example, a computing system may determine first parameters of a face and body of a source person from a first frame in a video shot. The system also determines second parameters of a face and body of a target person. The system determines that the target person is a replacement for the source person in the first frame. The system generates third parameters of the target person based on merging the first parameters with the second parameters. The system then performs deferred neural rendering of the target person based on a neural texture that corresponds to a texture space of the video shot. The system then outputs a second frame that shows the target person as the replacement for the source person.

Search results within segmented communication session content

11580737 · 2023-02-14 ·

Zoom Video Communications, Inc.

Methods and systems provide for search results within segmented communication session content. In one embodiment, the system receives a transcript and video content of a communication session between participants, the transcript including timestamps for a number of utterances associated with speaking participants; processes the video content to extract textual content visible within the frames of the video content; segments frames of the video content into a number of contiguous topic segments; determines a title for each topic segment; assigns a category label for each topic segment; receives a request from a user to search for specified text within the video content; determines one or more titles or category labels for which a prediction of relatedness with the specified text is present; and presents content from at least one topic segment associated with the one or more titles or category labels for which a prediction of relatedness is present.

Methods, systems, and media for adaptive presentation of a video content item based on an area of interest

11580740 · 2023-02-14 ·

Google Llc

Methods, systems, and media for adaptive presentation of a video content item based on an area of interest are provided. In some embodiments, the method comprises: causing a video content item to be presented within a viewport having first dimensions in connection with a web page, wherein the video content item is associated with area of interest information corresponding to one or more frames of the video content item; determining that the first dimensions associated with the viewport have changed in which the viewport is currently associated with second dimensions; determining that a modified video content item should be presented within the viewport having the second dimensions in response to determining that the first dimensions associated with the viewport have changed, wherein the modified video content item includes an area of interest based on the area of interest information associated with the video content item and wherein portions of at least one frame of the modified video content item are removed based on the second dimensions of the viewport; and causing the modified video content item to be presented within the viewport having the second dimensions.

Patent classifications

G06V20/40