Patent classifications
G06V20/47
SYSTEMS AND METHODS FOR GENERATING IMPROVED CONTENT BASED ON MATCHING MAPPINGS
Systems and methods are disclosed herein for generating content based on matching mappings by implementing deconstruction and reconstruction techniques. The system may retrieve a first content structure that includes a first object with a first mapping that includes a first list of attribute values. The system may then search content structures for a matching content structure having a second object with a second list of attributes and a second mapping including second attribute values corresponding to the second list of attributes. Upon finding a match, the system may generate a new content structure having the first object from the first content structure with the second mapping from the matching content structure. The system may then generate for output a new content segment based on the newly generated content structure.
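A minimal sketch of the match-and-recombine step described above. The abstract does not state the matching criterion, so this sketch assumes two mappings match when they cover the same attribute names; `ContentObject`, `find_matching`, and `recombine` are hypothetical names introduced only for illustration.

```python
from dataclasses import dataclass


@dataclass
class ContentObject:
    """Hypothetical content structure: one object plus its attribute mapping."""
    name: str
    mapping: dict  # attribute name -> list of attribute values


def find_matching(first, candidates):
    """Return the first other object whose mapping covers the same attribute names.

    Assumption: equal attribute-name sets count as a match.
    """
    wanted = set(first.mapping)
    for candidate in candidates:
        if candidate.name != first.name and set(candidate.mapping) == wanted:
            return candidate
    return None


def recombine(first, match):
    """Build a new structure: the first object with the matching object's mapping."""
    return ContentObject(name=first.name, mapping=dict(match.mapping))


boy = ContentObject("boy", {"position": [0, 1, 2], "motion": ["walk"]})
tiger = ContentObject("tiger", {"position": [5, 6, 7], "motion": ["run"]})
car = ContentObject("car", {"speed": [30]})

match = find_matching(boy, [car, tiger])
new_structure = recombine(boy, match)
```

Here `new_structure` keeps the object `boy` but carries the mapping taken from `tiger`, which is the recombination the abstract describes.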
HIERARCHICAL SAMPLING FOR OBJECT IDENTIFICATION
Aspects of the present disclosure include methods, systems, and non-transitory computer readable media that perform the steps of receiving a first plurality of snapshots, generating a first plurality of descriptors each associated with the first plurality of snapshots, grouping the first plurality of snapshots into at least one cluster based on the first plurality of descriptors, selecting a representative snapshot for each of the at least one cluster, generating at least one second descriptor for the representative snapshot for each of the at least one cluster, wherein the at least one second descriptor is more complex than the first plurality of descriptors, and identifying a target by applying the at least one second descriptor to a second plurality of snapshots.
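The steps above can be sketched as follows. This is an illustration under stated assumptions, not the disclosed implementation: snapshots are flat lists of 8-bit pixel values, the cheap first descriptor is the mean intensity, the more complex second descriptor is a normalized histogram, and clustering is a simple one-dimensional threshold pass. All function names and thresholds are hypothetical.

```python
def cheap_descriptor(snapshot):
    """First-level descriptor (assumed): mean pixel intensity."""
    return sum(snapshot) / len(snapshot)


def rich_descriptor(snapshot, bins=4, max_value=256):
    """Second-level descriptor (assumed): normalized intensity histogram."""
    hist = [0] * bins
    for pixel in snapshot:
        hist[min(pixel * bins // max_value, bins - 1)] += 1
    return [count / len(snapshot) for count in hist]


def cluster(snapshots, threshold=20.0):
    """Group snapshot indices whose cheap descriptors lie close together."""
    order = sorted(range(len(snapshots)), key=lambda i: cheap_descriptor(snapshots[i]))
    clusters = []
    for i in order:
        if clusters:
            previous = snapshots[clusters[-1][-1]]
            gap = cheap_descriptor(snapshots[i]) - cheap_descriptor(previous)
            if gap <= threshold:
                clusters[-1].append(i)
                continue
        clusters.append([i])
    return clusters


def representative(snapshots, members):
    """Pick the member whose cheap descriptor is nearest the cluster mean."""
    centre = sum(cheap_descriptor(snapshots[i]) for i in members) / len(members)
    return min(members, key=lambda i: abs(cheap_descriptor(snapshots[i]) - centre))


def identify(target_descriptor, gallery, max_distance=0.5):
    """Return gallery indices whose rich descriptor is within an L1 distance."""
    hits = []
    for j, snapshot in enumerate(gallery):
        candidate = rich_descriptor(snapshot)
        distance = sum(abs(a - b) for a, b in zip(target_descriptor, candidate))
        if distance <= max_distance:
            hits.append(j)
    return hits


first_snapshots = [[10, 12, 11, 9], [14, 13, 15, 12],
                   [200, 210, 205, 199], [198, 202, 207, 201]]
second_snapshots = [[8, 9, 10, 11], [205, 206, 204, 203], [100, 120, 110, 90]]

clusters = cluster(first_snapshots)
rep = representative(first_snapshots, clusters[0])
matches = identify(rich_descriptor(first_snapshots[rep]), second_snapshots)
```

The point of the hierarchy is cost: the expensive descriptor is computed once per cluster representative rather than once per snapshot.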
DETECTION OF DEMARCATING SEGMENTS IN VIDEO
A method of detecting frames in a video that demarcate a pre-determined type of video segment within the video is provided. The method includes identifying visually distinctive candidate marker frames within the video, grouping the candidate marker frames into a plurality of groups based on visual similarity, computing a collective score for each of the groups based on temporal proximity of each of the candidate marker frames within the group to related events occurring within the video, and selecting at least one of the groups based on the collective scores as marker frames that demarcate the pre-determined type of video segment. A video processing electronic device and at least one non-transitory computer readable storage medium having computer program instructions stored thereon for performing the method are also provided.
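The grouping and scoring steps can be illustrated with a short sketch. It assumes each candidate marker frame has already been reduced to a timestamp and a quantized visual signature, so that visually similar frames share a signature, and it assumes the collective score is the fraction of a group's frames that fall within a time window of some related event. Both assumptions, and all names, are illustrative only.

```python
from collections import defaultdict


def group_by_signature(candidates):
    """Group candidate timestamps by their (assumed) visual signature."""
    groups = defaultdict(list)
    for timestamp, signature in candidates:
        groups[signature].append(timestamp)
    return dict(groups)


def collective_score(timestamps, event_times, window=2.0):
    """Fraction of frames in the group lying within `window` seconds of an event."""
    near = sum(
        1 for t in timestamps
        if any(abs(t - event) <= window for event in event_times)
    )
    return near / len(timestamps)


def select_marker_groups(candidates, event_times, min_score=0.8):
    """Keep the groups whose collective score reaches the threshold."""
    groups = group_by_signature(candidates)
    return {
        signature: timestamps
        for signature, timestamps in groups.items()
        if collective_score(timestamps, event_times) >= min_score
    }


candidates = [(10.0, "logo"), (95.0, "logo"), (200.0, "logo"),
              (50.0, "black"), (130.0, "black")]
event_times = [11.0, 94.0, 201.5, 300.0]  # e.g. detected silences (hypothetical)

markers = select_marker_groups(candidates, event_times)
```

Scoring the group as a whole, rather than each frame alone, lets a recurring marker be selected even when individual detections are noisy.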
GENERATING A SUMMARY VIDEO SEQUENCE FROM A SOURCE VIDEO SEQUENCE
A method for generating a summary video sequence from a source video sequence is disclosed. The method comprises: identifying, in the source video sequence, event video sequences, wherein each event video sequence comprises consecutive video frames in which one or more objects of interest are present; extracting, from video frames of one or more event video sequences of the event video sequences, pixels depicting the respective one or more objects of interest; while keeping spatial and temporal relations of the extracted pixels as in the source video sequence, overlaying the extracted pixels of the video frames of the one or more event video sequences onto video frames of a main event video sequence of the event video sequences, thereby generating the summary video sequence. A video processing device configured to generate the summary video sequence is also disclosed.
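A minimal sketch of the overlay step, assuming frames are 2-D lists of pixel values and each event frame comes with a boolean mask marking the pixels of the objects of interest. The abstract does not say how the event sequences are aligned in time; this sketch assumes each event starts at the first frame of the main event sequence and keeps its own frame order. `overlay_events` is a hypothetical name.

```python
def overlay_events(main_event, other_events):
    """Overlay masked object pixels from other events onto the main event.

    main_event: list of frames, each a 2-D list of pixel values.
    other_events: list of events, each a list of (frame, mask) pairs where
    mask[y][x] is True for pixels depicting an object of interest.
    """
    # Copy the main event so the source frames are left untouched.
    summary = [[row[:] for row in frame] for frame in main_event]
    for event in other_events:
        for t, (frame, mask) in enumerate(event):
            if t >= len(summary):
                break  # assumed policy: drop frames beyond the main event
            for y, mask_row in enumerate(mask):
                for x, is_object in enumerate(mask_row):
                    if is_object:
                        # Same (x, y) as in the source keeps spatial relations.
                        summary[t][y][x] = frame[y][x]
    return summary


main = [[[0, 0], [0, 0]], [[0, 0], [0, 0]]]
event = [
    ([[7, 7], [7, 7]], [[True, False], [False, False]]),
    ([[8, 8], [8, 8]], [[False, False], [False, True]]),
]
summary = overlay_events(main, [event])
```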
INFORMATION PROCESSING METHOD, IMAGE PROCESSING APPARATUS, AND PROGRAM
[Object] To propose an information processing method, an image processing apparatus, and a program, which are able to set a section actually adopted in a summary image for each section extracted as a candidate for adoption in the summary image. [Solution] An information processing method including: analyzing content of an input image; and setting a position of an adoption section that is adopted from the image on the basis of information on a section of music and scene information of the analyzed image.
GENERATING VIDEO SUMMARY
A computer-implemented method includes receiving a viewer request for playing a video summary of a video, wherein the viewer request includes a length of the video summary, generating the video summary of the viewer-requested length comprising a set of frames selected from the video based on audience reviews of the video, and playing a video stream of the video summary.
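One way to picture the selection step is a greedy fill of the requested length. The sketch assumes the video has already been split into segments and that each segment carries a score derived from audience reviews; how that score is computed is not given in the abstract, so the scores below are placeholders and `build_summary` is a hypothetical name.

```python
def build_summary(segments, requested_seconds):
    """Pick the best-reviewed segments that fit the viewer-requested length.

    segments: list of (start_seconds, end_seconds, review_score).
    Returns the chosen (start, end) pairs in playback order.
    """
    chosen = []
    used = 0.0
    # Consider the highest-scoring segments first.
    for start, end, _score in sorted(segments, key=lambda s: s[2], reverse=True):
        duration = end - start
        if used + duration <= requested_seconds:
            chosen.append((start, end))
            used += duration
    return sorted(chosen)  # restore chronological order for playback


segments = [(0, 30, 0.2), (30, 50, 0.9), (50, 90, 0.5), (90, 100, 0.7)]
summary = build_summary(segments, requested_seconds=35)
```

A greedy fill is not guaranteed to use the length budget optimally; it is shown here only because it is short and easy to follow.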
METHOD AND APPARATUS FOR SIMULTANEOUS VIDEO RETRIEVAL AND ALIGNMENT
Disclosed herein are a method and an apparatus for simultaneous video retrieval and alignment. According to an embodiment of the present disclosure, there is provided a method for retrieving a video. The method comprises: detecting a section of interest in a query video that is a retrieval request video; producing one or more frame-level descriptors and a video-level descriptor for the query video by using key frames within the detected section of interest; and retrieving a reference video corresponding to the query video based on the frame-level descriptors and the video-level descriptor for the query video and one or more frame-level descriptors and a video-level descriptor for each of the reference videos stored in a database.
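A rough sketch of how the two descriptor levels could work together. It assumes the video-level descriptor is the mean of the frame-level descriptors, that retrieval uses cosine similarity on the video-level descriptors, and that alignment is the temporal offset maximizing the mean frame-level similarity. The abstract specifies none of these choices, and this sketch runs retrieval and alignment one after the other rather than jointly.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def video_descriptor(frame_descriptors):
    """Video-level descriptor (assumed): mean of the frame-level descriptors."""
    count = len(frame_descriptors)
    return [sum(column) / count for column in zip(*frame_descriptors)]


def best_offset(query_frames, reference_frames):
    """Offset into the reference that best matches the query frame by frame."""
    best_score, best_start = -1.0, 0
    for start in range(len(reference_frames) - len(query_frames) + 1):
        score = sum(
            cosine(q, reference_frames[start + i])
            for i, q in enumerate(query_frames)
        ) / len(query_frames)
        if score > best_score:
            best_score, best_start = score, start
    return best_start


def retrieve_and_align(query_frames, database):
    """Return (reference name, offset) for the closest reference video."""
    query_video = video_descriptor(query_frames)
    name = max(
        database,
        key=lambda key: cosine(query_video, video_descriptor(database[key])),
    )
    return name, best_offset(query_frames, database[name])


database = {
    "a": [[1, 0], [0, 1], [0, 1]],
    "b": [[1, 0], [1, 0], [1, 1]],
}
query = [[0, 1], [0, 1]]  # key-frame descriptors from the section of interest
result = retrieve_and_align(query, database)
```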
Event detection apparatus and event detection method
An event detection apparatus includes an input unit configured to input a plurality of time-sequential images, a first extraction unit configured to extract sets of first image samples according to respective different sample scales from a first time range of the plurality of time-sequential images based on a first scale parameter, a second extraction unit configured to extract sets of second image samples according to respective different sample scales from a second time range of the plurality of time-sequential images based on a second scale parameter, a dissimilarity calculation unit configured to calculate a dissimilarity between the first and second image samples based on the sets of the first and second image samples, and a detection unit configured to detect an event from the plurality of time-sequential images based on the dissimilarity.
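The apparatus above can be approximated in a few lines. This sketch makes several assumptions not found in the abstract: each image is reduced to one scalar feature, the first time range is the frames before a candidate time and the second is the frames after it, both ranges use the same set of sample scales, and dissimilarity is the mean absolute difference of sample means across scales.

```python
def multi_scale_samples(features, t, direction, scales):
    """Return one sample set per scale, taken before (-1) or after (+1) time t."""
    sample_sets = []
    for scale in scales:
        if direction < 0:
            sample_sets.append(features[max(0, t - scale):t])
        else:
            sample_sets.append(features[t:t + scale])
    return sample_sets


def dissimilarity(first_sets, second_sets):
    """Mean absolute difference of sample means, averaged over scales (assumed)."""
    differences = []
    for first, second in zip(first_sets, second_sets):
        if first and second:
            mean_first = sum(first) / len(first)
            mean_second = sum(second) / len(second)
            differences.append(abs(mean_first - mean_second))
    return sum(differences) / len(differences) if differences else 0.0


def detect_events(features, scales=(2, 4), threshold=3.0):
    """Flag time indices where past and future samples differ strongly."""
    events = []
    for t in range(1, len(features)):
        past = multi_scale_samples(features, t, -1, scales)
        future = multi_scale_samples(features, t, +1, scales)
        if dissimilarity(past, future) > threshold:
            events.append(t)
    return events


features = [1, 1, 1, 1, 1, 9, 9, 9, 9, 9]  # abrupt change at index 5
events = detect_events(features)
```

With these toy values the change at index 5 is flagged together with its immediate neighbours, because the larger scale already straddles the change one frame earlier and later.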
METHOD, SYSTEM AND APPARATUS FOR SELECTING A VIDEO FRAME
A method of selecting at least one video frame of a video sequence. A plurality of faces is detected in at least one video frame of the video sequence. An orientation of the detected faces is tracked over a series of subsequent video frames to determine whether a first detected face is turning towards a second detected face. The method then determines, using the tracked orientation of the detected faces, a portion of the video sequence in which the first and second detected faces are oriented towards each other for at least a predetermined number of frames defining a gaze fixation of the detected faces. At least one video frame is selected from the determined portion of the video sequence, the selected video frame capturing the gaze fixation of the detected faces.
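A small sketch of the gaze-fixation test, assuming face tracking already yields, per frame, a horizontal position and a yaw angle for each face. The sign convention (positive yaw means turned toward the right of the image), the minimum yaw, and the choice of the middle frame are all assumptions made for illustration.

```python
def looks_towards(x_self, yaw_self, x_other, min_yaw=30.0):
    """True if a face is turned far enough in the direction of the other face."""
    return abs(yaw_self) >= min_yaw and (x_other - x_self) * yaw_self > 0


def gaze_fixation_portions(track_a, track_b, min_frames=3):
    """Find frame ranges where both faces look at each other long enough.

    track_a, track_b: per-frame (x_position, yaw_degrees) for each face.
    Returns inclusive (first_frame, last_frame) index pairs.
    """
    portions = []
    run_start = None
    frame_count = min(len(track_a), len(track_b))
    for i in range(frame_count):
        (xa, yaw_a), (xb, yaw_b) = track_a[i], track_b[i]
        mutual = looks_towards(xa, yaw_a, xb) and looks_towards(xb, yaw_b, xa)
        if mutual and run_start is None:
            run_start = i
        elif not mutual and run_start is not None:
            if i - run_start >= min_frames:
                portions.append((run_start, i - 1))
            run_start = None
    # Close a run that lasts until the final frame.
    if run_start is not None and frame_count - run_start >= min_frames:
        portions.append((run_start, frame_count - 1))
    return portions


def select_frame(portion):
    """Pick the middle frame of a gaze-fixation portion (assumed policy)."""
    first, last = portion
    return (first + last) // 2


track_a = [(100, 0), (100, 20), (100, 45), (100, 50), (100, 60), (100, 10)]
track_b = [(300, -40)] * 6

portions = gaze_fixation_portions(track_a, track_b)
selected = [select_frame(p) for p in portions]
```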
METHODS AND DEVICE FOR VIDEO DATA ANALYSIS
Methods and apparatuses are provided for movie and television series video data analysis. The method includes: gathering and reading, by a processor, a plurality of input movies; removing a video border of each input movie; splitting the input movie into short clips, based on accuracy and efficiency requirements of different analyzing models; assessing attributes of each input movie by analyzing, with the different analyzing models, the input movie, the short clips cut from the input movie, and the frame images extracted from the input movie; and summarizing the plurality of input movies based on matching and integrating the attributes assessed for each input movie.