G11B27/005

Automatic generation of video playback effects

In various examples, recordings of gameplay sessions are enhanced by the application of special effects to relatively high(er) and/or low(er) interest durations of the gameplay sessions. Durations of relatively high(er) or low(er) predicted interest in a gameplay session are identified, for instance, based upon level of activity engaged in by a gamer during a particular gameplay session duration. Once identified, different variations of video characteristic(s) are applied to at least a portion of the identified durations for implementation during playback. The recordings may be generated and/or played back in real-time with a live gameplay session, or after completion of the gameplay session. Further, video data of the recordings themselves may be modified to include the special effects and/or indications of the durations and/or variations may be included in metadata and used for playback.
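
The identification step can be illustrated with a minimal sketch. It assumes the gamer's activity has already been reduced to one numeric level per sample (for example, inputs per second); the abstract does not specify this. The names `high_interest_durations`, `to_metadata`, `threshold`, and `playback_speed` are hypothetical. The sketch only shows one way durations and an effect variation could be recorded as playback metadata.

```python
from typing import Dict, List, Tuple


def high_interest_durations(
    activity: List[float], threshold: float, min_len: int = 1
) -> List[Tuple[int, int]]:
    """Return [start, end) sample ranges where activity stays at or above threshold."""
    spans: List[Tuple[int, int]] = []
    start = None
    for i, level in enumerate(activity):
        if level >= threshold and start is None:
            start = i  # a candidate duration begins
        elif level < threshold and start is not None:
            if i - start >= min_len:
                spans.append((start, i))
            start = None  # the candidate duration ends
    # Close a duration that runs to the end of the session.
    if start is not None and len(activity) - start >= min_len:
        spans.append((start, len(activity)))
    return spans


def to_metadata(
    spans: List[Tuple[int, int]], sample_rate_hz: float, playback_speed: float
) -> List[Dict[str, float]]:
    """Describe each duration in seconds with an assumed speed effect."""
    return [
        {
            "start_s": s / sample_rate_hz,
            "end_s": e / sample_rate_hz,
            "playback_speed": playback_speed,  # assumed effect: slow motion if < 1
        }
        for s, e in spans
    ]
```

Storing the result as metadata, rather than re-encoding the video, corresponds to the second option the abstract mentions: the player applies the effect at playback time.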

Altering audio to improve automatic speech recognition

Techniques are described for altering audio being output by a voice-controlled device, or another device, to enable more accurate automatic speech recognition (ASR) by the voice-controlled device. For instance, a voice-controlled device may output audio within an environment using a speaker of the device. While outputting the audio, a microphone of the device may capture sound within the environment and may generate an audio signal based on the captured sound. The device may then analyze the audio signal to identify speech of a user within the signal, with the speech indicating that the user is going to provide a subsequent command to the device. Thereafter, the device may alter the output of the audio (e.g., attenuate the audio, pause the audio, switch from stereo to mono, etc.) to facilitate speech recognition of the user's subsequent command.

Systems and methods for generating time lapse videos
11488631 · 2022-11-01

Video information may define spherical video content having a duration. Spherical video content may define visual content viewable from a point of view as a function of progress through the spherical video content. Path information may define a path selection for the spherical video content. Path selection may include movement of a viewing window within the spherical video content. The viewing window may define extents of the visual content viewable from the point of view as the function of progress through the spherical video content. Time lapse parameter information may define at least two of a time portion of the duration, an image sampling rate, and a time lapse speed effect. A time lapse video may be generated based on the video information, the path information, and the time lapse parameter information.
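
The relationship between the time lapse parameters can be sketched as follows. This is an illustration under assumptions, not the patented method: it assumes a fixed output frame rate (`output_fps`, not mentioned in the abstract), under which the speed effect follows from the image sampling rate. The function name `time_lapse_plan` and its parameters are hypothetical.

```python
from typing import List, Tuple


def time_lapse_plan(
    portion_start_s: float,
    portion_end_s: float,
    sampling_rate_hz: float,
    output_fps: float = 30.0,
) -> Tuple[List[float], float]:
    """Return the source timestamps to sample and the resulting speed-up factor.

    One image is taken every 1 / sampling_rate_hz seconds within the time
    portion. Played back at output_fps (an assumed constant), the perceived
    speed-up is output_fps / sampling_rate_hz.
    """
    span = portion_end_s - portion_start_s
    n_frames = int(span * sampling_rate_hz)
    sample_times = [portion_start_s + i / sampling_rate_hz for i in range(n_frames)]
    speed = output_fps / sampling_rate_hz
    return sample_times, speed
```

For example, sampling a 10-second portion at 2 images per second yields 20 images; at an assumed 30 fps output, this plays in two thirds of a second, a 15x speed-up.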

Tap to advance by subtitles
11488632 · 2022-11-01

Systems and methods are provided for advancing through video content. The systems and methods include generating playback of a video associated with subtitles comprising a sequence of subtitle segments; receiving a user request to advance playback of the video from a first video play position corresponding to a first subtitle segment of the sequence of subtitle segments; identifying a second subtitle segment of the sequence of subtitle segments that is adjacent to the first subtitle segment in the sequence of subtitle segments; and in response to receiving the user request to advance playback of the video, advancing playback of the video from the first video play position to a second video play position based on a subtitle segment play position associated with the second subtitle segment.
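
The advance step can be sketched as a lookup in a sorted list of subtitle start times. This sketch assumes each subtitle segment is represented by its start position in seconds and that "adjacent" means the next segment in the sequence; the function name `advance_by_subtitle` is hypothetical.

```python
from bisect import bisect_right
from typing import List, Optional


def advance_by_subtitle(segment_starts: List[float], position: float) -> Optional[float]:
    """Return the play position of the subtitle segment after the current one.

    segment_starts must be sorted in ascending order. Returns None when the
    current position is already in (or after) the last segment.
    """
    # Index of the first segment that starts strictly after the current position.
    idx = bisect_right(segment_starts, position)
    if idx >= len(segment_starts):
        return None
    return segment_starts[idx]
```

Using `bisect_right` means a tap made exactly at the start of a segment still moves to the following segment rather than staying in place.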

METHOD AND APPARATUS FOR DETERMINING MULTIMEDIA EDITING INFORMATION, DEVICE AND STORAGE MEDIUM
20230085665 · 2023-03-23

A method for determining multimedia editing information includes: obtaining a first playback time of a first target frame in first multimedia data to be edited, a second playback time of the first target frame in second multimedia data produced by the editing, and a playback speed multiplier set for the first multimedia data; determining, based on the first playback time and the second playback time, a time deviation that arises while editing the first multimedia data at the playback speed multiplier; and determining, based on the time deviation, the first playback time, the second playback time and the playback speed multiplier, for each playback time of the second multimedia data after the second playback time, the corresponding playback time of the first multimedia data.
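
One plausible reading of this mapping is a linear relation between edited time and source time. The abstract does not give the formula, so everything in the sketch below is an assumption: the time deviation is taken to be the constant offset that makes the target frame line up, and the names `time_deviation` and `source_time` are hypothetical.

```python
def time_deviation(first_time_s: float, second_time_s: float, speed: float) -> float:
    """Offset (in source seconds) between speed-scaled edited time and source time.

    Assumed model: source_time = speed * edited_time + deviation, anchored at
    the target frame, which sits at first_time_s in the source and at
    second_time_s in the edited output.
    """
    return first_time_s - speed * second_time_s


def source_time(
    edited_time_s: float, first_time_s: float, second_time_s: float, speed: float
) -> float:
    """Map an edited playback time at or after second_time_s back to the source."""
    if edited_time_s < second_time_s:
        raise ValueError("mapping is only defined from the target frame onward")
    return speed * edited_time_s + time_deviation(first_time_s, second_time_s, speed)
```

For example, if the target frame sits at 12 s in the source and 5 s in the edited output at 2x speed, the assumed deviation is 2 s, and edited time 8 s maps back to source time 18 s.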

System and method for controlling the display of an image stream

Embodiments of the invention relate to a system and method for controlling the display of an image stream. The system may include a memory to store the image stream; the image stream may comprise a plurality of image frames. The system may further include a processor configured to execute the method. The image stream may be displayed to a user in an image stream display area of a screen, and a frame rate control interface may be generated in a speed control area of the screen, such that the image stream display area is horizontally adjacent to the speed control area. An indication of a desired frame rate for displaying the image frames of the image stream may be received from the user, where the frame rate is selected according to the location of a pointing indicator in the speed control area.
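
Selecting a frame rate from the pointer location can be sketched as a clamped linear mapping. The abstract does not state the mapping, so the following assumes a vertical control in which the top of the speed control area corresponds to the fastest rate; the function name and the `min_fps` and `max_fps` defaults are hypothetical.

```python
def frame_rate_from_pointer(
    pointer_y: float,
    area_top: float,
    area_height: float,
    min_fps: float = 1.0,
    max_fps: float = 30.0,
) -> float:
    """Map a pointer's vertical position in the speed control area to a frame rate.

    Assumed convention: top edge -> max_fps, bottom edge -> min_fps, linear in
    between. Positions outside the area are clamped to the nearest edge.
    """
    fraction = (pointer_y - area_top) / area_height
    fraction = min(1.0, max(0.0, fraction))
    return max_fps - fraction * (max_fps - min_fps)
```

Clamping keeps the selected rate within bounds when the pointer drifts slightly outside the control area during a drag.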

PROGRAMMATICALLY CONTROLLING MEDIA CONTENT NAVIGATION BASED ON CORRESPONDING TEXTUAL CONTENT

A method, computer system, and computer program product for content navigation within a media player are provided. The present invention may include displaying, by a computing device, a media content and a corresponding textual content. The present invention may include receiving, from a user, input regarding the textual content. The present invention may include modifying a playback of the media content based upon the input regarding the textual content to generate a modified media content. The present invention may include playing the modified media content.

SINGLE CLIP SEGMENTATION OF MEDIA
20220343952 · 2022-10-27

An example system may provide a graphical user interface displaying a timeline graphical element, where the timeline graphical element represents a timeline of a media clip, and a graphical current point indicator indicating a selected point on the timeline. The system may provide a marking graphical element that is selectable to record a start point or an end point of a segment of the media clip. The system may receive a first user input scrubbing the timeline graphical element horizontally, and it may receive a second user input indicating a user interaction with the marking graphical element. The system may determine a start point and an end point of each of a plurality of segments of the timeline graphical element corresponding to the media clip based on the second user input and update the graphical user interface based on the determination.

Management of video playback speed based on objects of interest in the video data
11611719 · 2023-03-21

Systems, methods, and software described herein manage the playback speed of video data based on processing objects in the video data. In one example, a video processing service obtains video data from a video source and identifies objects of interest in the video data. The video processing service further determines complexity in frames of the video data related to the objects of interest and updates playback speeds for segments of the video data based on the complexity of the frames.
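
The speed update can be sketched as a per-segment mapping from frame complexity to playback speed. The abstract does not say how complexity is measured or which direction the mapping runs; this sketch assumes a per-frame complexity score in the range 0 to 1 and assumes that more complex segments play more slowly. The function name and the `slow` and `fast` defaults are hypothetical.

```python
from typing import List


def segment_speeds(
    complexities: List[float], segment_len: int, slow: float = 1.0, fast: float = 4.0
) -> List[float]:
    """Compute one playback speed per fixed-length segment of frames.

    Each segment's speed is interpolated between `fast` (mean complexity 0)
    and `slow` (mean complexity 1). The last segment may be shorter.
    """
    speeds: List[float] = []
    for i in range(0, len(complexities), segment_len):
        segment = complexities[i : i + segment_len]
        mean = sum(segment) / len(segment)
        speeds.append(fast - mean * (fast - slow))
    return speeds
```

Under these assumptions, footage with no objects of interest is skimmed at 4x while busy segments play at normal speed.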

Retiming digital videos utilizing machine learning and temporally varying speeds

This disclosure describes one or more implementations of systems, non-transitory computer-readable media, and methods that generate a temporally remapped video that satisfies a desired target duration while preserving natural video dynamics. In certain instances, the disclosed systems utilize a playback speed prediction machine-learning model that recognizes and localizes temporally varying changes in video playback speed to re-time a digital video with varying frame-change speeds. For instance, to re-time the digital video, the disclosed systems utilize the playback speed prediction machine-learning model to infer the slowness of individual video frames. Subsequently, in certain embodiments, the disclosed systems determine, from frames of a digital video, a temporal frame sub-sampling that is consistent with the slowness predictions and fits within a target video duration. In certain implementations, the disclosed systems utilize the temporal frame sub-sampling to generate a speed-varying digital video that preserves natural video dynamics while fitting the target video duration.
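
The sub-sampling step can be illustrated with a simple heuristic that stands in for the disclosed optimization. It assumes the per-frame slowness predictions are already available as non-negative numbers (the machine-learning model is not reproduced here) and assumes that frames predicted to be slower can be skipped more aggressively. The function name `subsample_frames` and the weighting `1 / (1 + slowness)` are hypothetical choices.

```python
from bisect import bisect_left
from itertools import accumulate
from typing import List


def subsample_frames(slowness: List[float], target_frames: int) -> List[int]:
    """Pick at most target_frames frame indices, sampling slow regions sparsely.

    Each frame gets weight 1 / (1 + slowness). Frames are then chosen at
    evenly spaced points of the cumulative weight, so low-weight (slow)
    stretches contribute fewer output frames.
    """
    weights = [1.0 / (1.0 + s) for s in slowness]
    cumulative = list(accumulate(weights))
    total = cumulative[-1]
    picks: List[int] = []
    for k in range(target_frames):
        target = (k + 0.5) * total / target_frames
        idx = bisect_left(cumulative, target)
        # Skip repeats so the output indices are strictly increasing.
        if not picks or idx > picks[-1]:
            picks.append(idx)
    return picks
```

With four normal frames followed by eight frames of slowness 3 and a target of six frames, the sketch keeps all four normal frames and only two of the slow ones. Because repeats are skipped, the result can be shorter than the target when the target approaches the source length.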