Patent classifications
G11B27/28
Segment action detection
Aspects of the present disclosure involve a system comprising a storage medium storing a program and method for receiving a video comprising a plurality of video segments; selecting a target action sequence that includes a sequence of action phases; receiving features of each of the video segments; computing, based on the received features, for each of the plurality of video segments, a plurality of action phase confidence scores indicating a likelihood that a given video segment includes a given action phase of the sequence of action phases; identifying a set of consecutive video segments of the plurality of video segments that corresponds to the target action sequence, wherein video segments in the set of consecutive video segments are arranged according to the sequence of action phases; and generating a display of the video that includes the set of consecutive video segments and skips other video segments in the video.
RECOMMENDATION OF AUDIO BASED ON VIDEO ANALYSIS USING MACHINE LEARNING
An electronic device and method for recommendation of audio based on video analysis is provided. The electronic device receives one or more frames of a first scene of a plurality of scenes of a video. The first scene includes a set of objects. The electronic device applies a trained neural network model on the received one or more frames to detect the set of objects. The electronic device determines an impact score of each object of the detected set of objects of the first scene based on the application of the trained neural network model on the set of objects. The electronic device further selects at least one first object from the set of objects based on the impact score of each object, and recommends one or more first audio tracks as a sound effect for the first scene based on the selected at least one first object.
Simulate live video presentation in a recorded video
An embodiment for simulating a live video presentation in a recorded video is provided. The embodiment may include receiving a previously recorded online meeting. The embodiment may also include transcribing and indexing the transcription of the previously recorded online meeting. The embodiment may further include receiving audio content from a user. The embodiment may also include searching for a response to the audio content in the transcription. The embodiment may further include in response to determining the response is found in the transcription, generating a solution for the audio content from the transcription. The embodiment may also include integrating the generated solution into the previously recorded online meeting. The embodiment may further include updating one or more video frames of the previously recorded online meeting based on the generated solution.
AUTOMATIC MODULATION OF DISPLAY TIMING BASED ON BEAT
The disclosed technology provides solutions for enhancing a user's experience of music video playback. Beat temporal locations are identified in the soundtrack of a multimedia content item, and surround the beat temporal locations, the playback speed of video frames is adjusted. An audio event trigger is determined using a beat decomposition process that generates event vectors including time-index information, wherein the event vectors indicate temporal locations of the audio event trigger. Playback speed can be changed by advancing the timing of displayed frames before the occurrence of a rhythm event, and by delaying it after. Shader parameter courses can be changed by being accelerated before the occurrence of a rhythm event and by being decelerated after.
Modification of objects in film
A computer-implemented method of processing video data comprising a first sequences of image frames containing a first instance of an object. The method includes isolating said first instance of the object within the first sequence of image frames, determining, using the isolated first instance of the object, first parameter values for a synthetic model of the object, modifying the first parameter values for the synthetic model of the object, rendering a modified first instance of the object using a trained machine learning model and the modified first parameter values for the synthetic model of the object, and replacing at least part of the first instance of the object within the first sequence of image frames with a corresponding at least part of the modified first instance of the object.
OPTIMIZING CONTINUOUS MEDIA COLLECTION
Described herein are techniques that may be used to identify a portion of media data to be prioritized. Such techniques may comprise receiving, from a media collection device, media information that includes a first media data and at least one of trigger data or sensor data, determining, based on one or more of the trigger data or the sensor data, that a portion of the first media data is to be prioritized, identifying, based on one or more of the trigger data or the sensor data, a beginning and end time to be associated with a second media data that includes the portion of the first media data, and generating the second media data from the received first media data based on the beginning and ending time, the second media data including less than the entirety of the first media data.
Video synthesis method terminal and computer storage medium
The disclosure provides a video synthesis method, a terminal and a storage medium. The method includes acquiring at least one video clip. The method includes acquiring a target audio suitable to video content based on the video content and the number of the at least one video clip. T number of the audio change points of the target audio is greater than or equal to the number of at least one video clip minus one, and the audio change points comprise time points at which change in audio feature satisfies a preset condition; and obtaining a video file by synthesizing the at least one video clip and the target audio based on the audio change points included in the target audio.
Video synthesis method terminal and computer storage medium
The disclosure provides a video synthesis method, a terminal and a storage medium. The method includes acquiring at least one video clip. The method includes acquiring a target audio suitable to video content based on the video content and the number of the at least one video clip. T number of the audio change points of the target audio is greater than or equal to the number of at least one video clip minus one, and the audio change points comprise time points at which change in audio feature satisfies a preset condition; and obtaining a video file by synthesizing the at least one video clip and the target audio based on the audio change points included in the target audio.
MANAGEMENT OF VIDEO PLAYBACK SPEED BASED ON OBJECTS OF INTEREST IN THE VIDEO DATA
Systems, methods, and software described herein manage the playback speed of video data based on processing objects in the video data. In one example, a video processing service obtains video data from a video source and identifies objects of interest in the video data. The video processing service further determines complexity in frames of the video data related to the objects of interest and updates playback speeds for segments of the video data based on the complexity of the frames.
Automatic trailer detection in multimedia content
The disclosed computer-implemented method may include accessing media segments that correspond to respective media items. At least one of the media segments may be divided into discrete video shots. The method may also include matching the discrete video shots in the media segments to corresponding video shots in the corresponding media items according to various matching factors. The method may further include generating a relative similarity score between the matched video shots in the media segments and the corresponding video shots in the media items, and training a machine learning model to automatically identify video shots in the media items according to the generated relative similarity score between matched video shots. Various other methods, systems, and computer-readable media are also disclosed.