Patent classifications
G06V20/41
Augmented reality content recommendation
Methods and systems are described herein for providing streamlined access to media assets of interest to a user. The method includes determining that a supplemental viewing device, through which a user views a field of view, is directed at a first field of view. The method further involves detecting that the supplemental viewing device is now directed at a second field of view, and determining that a media consumption device is within the second field of view. A first media asset of interest to the user that is available for consumption via the media consumption device is identified, and the supplemental viewing device generates a visual indication in the second field of view. The visual indication indicates that the first media asset is available for consumption via the media consumption device, and the visual indication tracks a location of the media consumption device in the second field of view.
Method and system for producing story video
A method and a system for producing a story video are provided. A method for producing a story video, according to one embodiment, can produce a specific story video by determining a theme of a story that is suitable for collected videos and selecting and arranging an appropriate video for each frame of a template associated with the theme.
Action recognition method and apparatus
An action recognition method and apparatus relate to artificial intelligence. The method includes extracting a spatial feature of a to-be-processed picture; determining a virtual optical flow feature of the to-be-processed picture based on the spatial feature and on X spatial features and X optical flow features in a preset feature library, where the X spatial features and the X optical flow features are in one-to-one correspondence; determining a first type of confidence of the to-be-processed picture in different action categories based on similarities between the virtual optical flow feature and Y optical flow features in the preset feature library, where each of the Y optical flow features corresponds to one action category and X and Y are both integers greater than 1; and determining an action category of the to-be-processed picture based on the first type of confidence.
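The flow from spatial feature to action category can be illustrated with a minimal sketch. The abstract does not say how the virtual optical flow feature is computed or how similarity is measured, so the softmax-style weighting, the cosine similarity, and all function and variable names below are assumptions made purely for illustration.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def virtual_flow(spatial, lib_spatial, lib_flow):
    """Blend library flow features, weighted by spatial similarity (assumption)."""
    weights = [math.exp(cosine(spatial, s)) for s in lib_spatial]
    total = sum(weights)
    dim = len(lib_flow[0])
    return [sum(w * f[d] for w, f in zip(weights, lib_flow)) / total
            for d in range(dim)]

def classify(spatial, lib_spatial, lib_flow, labelled_flow):
    """Return the best category and the per-category confidences."""
    vflow = virtual_flow(spatial, lib_spatial, lib_flow)
    confidence = {}
    for label, flow in labelled_flow:
        score = cosine(vflow, flow)
        confidence[label] = max(confidence.get(label, -1.0), score)
    return max(confidence, key=confidence.get), confidence

# Hypothetical toy library: X = 2 paired features, Y = 2 labelled flow features.
lib_spatial = [[1.0, 0.0], [0.0, 1.0]]
lib_flow = [[1.0, 0.0], [0.0, 1.0]]
labelled_flow = [("run", [1.0, 0.0]), ("sit", [0.0, 1.0])]

category, confidence = classify([1.0, 0.1], lib_spatial, lib_flow, labelled_flow)
```

Here the picture's spatial feature is closest to the first library entry, so the blended flow feature leans toward that entry's optical flow and the category associated with it receives the higher confidence.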
Video-informed spatial audio expansion
Assigning spatial information to audio segments is disclosed. A method includes receiving a first audio segment that is non-spatialized and is associated with first video frames; identifying visual objects in the first video frames; identifying auditory events in the first audio segment; identifying a match between a visual object of the visual objects and an auditory event of the auditory events; and assigning a spatial location to the auditory event based on a location of the visual object.
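The matching step can be sketched as follows. The abstract does not state how a visual object and an auditory event are matched, so matching on a shared class label is an assumption here, as are the dictionary layout and the names used.

```python
def assign_locations(visual_objects, auditory_events):
    """Attach a location to each auditory event whose label matches a visual object.

    Matching by identical label is an illustrative assumption; events without
    a matching object keep a location of None.
    """
    location_by_label = {}
    for obj in visual_objects:
        # Keep the first object seen for each label.
        location_by_label.setdefault(obj["label"], obj["location"])
    return [
        {**event, "location": location_by_label.get(event["label"])}
        for event in auditory_events
    ]

# Hypothetical detections: normalized (x, y) positions in the video frame.
visual_objects = [
    {"label": "dog", "location": (0.2, 0.7)},
    {"label": "car", "location": (0.9, 0.5)},
]
auditory_events = [
    {"label": "dog", "start": 1.0},
    {"label": "siren", "start": 2.5},
]

spatialized = assign_locations(visual_objects, auditory_events)
```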
Depth-based object re-identification
An object re-identifier is disclosed. For each of a plurality of frames of a video, a quality of the frame is assessed and a confidence that a previously-recognized object is present in the frame is determined. The determined confidence for the frame is weighted based on the assessed quality of the frame such that frames with higher relative quality are weighted more heavily than frames with lower relative quality. An overall confidence that the previously-recognized object is present in the video is assessed based on the weighted determined confidences.
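One simple way to realize the weighting described above is a quality-weighted mean. This is a sketch under that assumption; the abstract does not specify the aggregation formula, and the function name and sample values are hypothetical.

```python
from typing import Sequence

def overall_confidence(confidences: Sequence[float],
                       qualities: Sequence[float]) -> float:
    """Quality-weighted mean of per-frame confidences (assumed aggregation)."""
    if len(confidences) != len(qualities):
        raise ValueError("need one quality score per frame confidence")
    total_quality = sum(qualities)
    if total_quality <= 0:
        raise ValueError("at least one frame must have positive quality")
    weighted = sum(c * q for c, q in zip(confidences, qualities))
    return weighted / total_quality

# Hypothetical per-frame values: the blurry middle frame barely counts.
frame_confidences = [0.9, 0.2, 0.8]
frame_qualities = [1.0, 0.1, 0.9]

weighted_result = overall_confidence(frame_confidences, frame_qualities)
plain_mean = sum(frame_confidences) / len(frame_confidences)
```

With these values the low-quality frame contributes little, so the weighted result stays close to the confidences of the two sharp frames instead of being pulled down as the plain mean is.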
Road obstacle detection device, road obstacle detection method, and computer-readable storage medium
The road obstacle detection device includes a semantic label estimation unit that estimates a semantic label for each pixel of an image using a classifier trained in advance and generates a semantic label image, an original image estimation unit that reconstructs the original image from the semantic label image, a difference calculating unit that calculates a difference between the original image and the reconstructed image output by the original image estimation unit as a calculation result, and a road obstacle detection unit that detects a road obstacle based on the calculation result.
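The difference and detection stages can be sketched as below, assuming grayscale images stored as nested lists and a fixed per-pixel threshold. The segmentation and reconstruction stages are not modelled: the reconstructed image is simply taken as an input, and the threshold value and names are illustrative assumptions.

```python
def obstacle_mask(original, reconstructed, threshold):
    """Flag pixels where the reconstruction differs strongly from the original.

    A large difference suggests content that the label-based reconstruction
    could not reproduce, which is treated here as a candidate obstacle.
    """
    return [
        [abs(o - r) > threshold for o, r in zip(orig_row, recon_row)]
        for orig_row, recon_row in zip(original, reconstructed)
    ]

# Hypothetical 2x3 grayscale images (0-255); one pixel is poorly reconstructed.
original = [[10, 10, 10],
            [10, 200, 10]]
reconstructed = [[12, 9, 10],
                 [11, 15, 10]]

mask = obstacle_mask(original, reconstructed, threshold=50)
```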
Systems and methods for detecting patterns within video content
A method of reducing false positives and identifying relevant true alerts in a video management system includes analyzing images to look for patterns indicating changes between subsequent images. When a pattern indicating changes between subsequent images is found, the video management system solicits from a user an indication of whether the pattern belongs to one of two or more predefined categories. The patterns indicating changes between subsequent images are saved for subsequent use. Subsequent images received from the video camera are analyzed to look for patterns indicating changes between subsequent images. When a pattern indicating changes between subsequent images is detected by the video management system, the video management system compares the pattern indicating changes between subsequent images to those previously categorized into one of the two or more predefined categories. Based on the comparison, the video management system may provide an alert to the user.
Scratchpad creation method and electronic device
A scratchpad creation method and an electronic device are disclosed. The method includes: receiving a first input performed by a user on a target identifier, where the target identifier is associated with a first video file; and displaying a first scratchpad in response to the first input, where the first scratchpad is a scratchpad created based on content of the first video file, the first scratchpad includes at least one video identifier and at least one progress identifier, the video identifier is used to indicate a video clip in the first video file, and the progress identifier is used to indicate completion progress of an operation corresponding to the video clip.
System and method to determine outcome probability of an event based on videos
A system and method for determining an outcome probability of an event based on videos are disclosed. The method includes receiving the videos of an event, creating a building block model, extracting one of audio content or video content from the videos, analysing the extracted content, generating an analysis result, analysing an engagement between a speaker and a participant of the event, generating a data lake comprising a keyword library, computing the outcome probability of the event, enabling the building block model to learn from the data lake and the computed outcome probability, and representing the at least one outcome probability in a pre-defined format.
Segment action detection
Aspects of the present disclosure involve a system comprising a storage medium storing a program and method for receiving a video comprising a plurality of video segments; selecting a target action sequence that includes a sequence of action phases; receiving features of each of the video segments; computing, based on the received features, for each of the plurality of video segments, a plurality of action phase confidence scores indicating a likelihood that a given video segment includes a given action phase of the sequence of action phases; identifying a set of consecutive video segments of the plurality of video segments that corresponds to the target action sequence, wherein video segments in the set of consecutive video segments are arranged according to the sequence of action phases; and generating a display of the video that includes the set of consecutive video segments and skips other video segments in the video.
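The selection of consecutive segments can be illustrated with a simplified sketch. It assumes each action phase occupies exactly one segment and that the best match maximizes the summed confidence scores. Neither assumption comes from the abstract, and the function name and scores are hypothetical.

```python
def best_window(scores):
    """Find the run of consecutive segments that best follows the phase order.

    scores[i][k] is the confidence that segment i shows phase k. Under the
    one-segment-per-phase assumption, a window starting at segment s scores
    the sum of scores[s + k][k] over all phases k.
    """
    num_phases = len(scores[0])
    best_start, best_total = None, float("-inf")
    for start in range(len(scores) - num_phases + 1):
        total = sum(scores[start + k][k] for k in range(num_phases))
        if total > best_total:
            best_start, best_total = start, total
    if best_start is None:
        return []  # fewer segments than phases
    return list(range(best_start, best_start + num_phases))

# Hypothetical scores for 5 segments and a 3-phase target action sequence.
phase_scores = [
    [0.1, 0.1, 0.1],
    [0.9, 0.1, 0.0],
    [0.2, 0.8, 0.1],
    [0.1, 0.2, 0.7],
    [0.3, 0.3, 0.3],
]

selected_segments = best_window(phase_scores)
```

A display built from `selected_segments` would play only those segments and skip the rest of the video.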