G11B27/322

Media generating and editing system

A media generating and editing system that generates audio playback in alignment with text that has been automatically transcribed from the audio. A transcript data file that includes a plurality of text words transcribed from audio words included in the audio data is stored. Timing data is paired with the text words indicating locations in the audio data of the corresponding audio words from which the text words are transcribed. The audio data is provided for playback at a user device. The text words are displayed on a display screen at a user device and a visual marker is displayed on the display screen to indicate the text words on the display screen in time alignment with the audio playback of the corresponding audio words at the user device. The text words in the transcript data file are amended in response to inputs from the user device.

Integration of audio into a multi-view interactive digital media representation

Various embodiments of the present invention relate generally to systems and methods for integrating audio into a multi-view interactive digital media representation. According to particular embodiments, one process includes retrieving a multi-view interactive digital media representation that includes numerous images fused together into content and context models. The process next includes retrieving and processing audio data to be integrated into the multi-view interactive digital media representation. A first segment of audio data may be associated with a first position in the multi-view interactive digital media representation. In other examples, a first segment of audio data may be associated with a visual position or the location of a camera in the multi-view interactive digital media representation. The audio data may be played in coordination with the multi-view interactive digital media representation based on a user's navigation through the multi-view interactive digital media representation, where the first segment is played when the first position or first visual position is reached.

Indicating tracks as erased without deleting data for the tracks

Provided are a computer program product, system, and method for indicating tracks as erased without deleting data for the tracks. In response to receiving erase commands to erase tracks in the storage, indicating the tracks as erased without performing an erase operation on the tracks subject to the erase command. Data in the storage for the tracks indicated as erased remains in the storage while requests are directed to the tracks indicated as erased. A command is received indicating an operation with respect to a target track. The operation to proceed is permitted with respect to the target track in response to determining that the target track is not indicated as erased. An alternate operation is performed providing a result different from the operation indicated in the command in response to determining that the target track is indicated as erased.

Systems and Methods for Performing Adaptive Bitrate Streaming

Systems and methods for performing trick play functionality using trick play streams during adaptive bitrate streaming in accordance with embodiments of the invention are disclosed. One embodiment includes requesting a video container index from a video container file containing a video stream from a plurality of alternative streams of video; requesting at least one portion of the video stream using at least one entry from the video container index; decoding the at least one portion of the video stream; receiving at least one user instruction to perform a visual search of the media; requesting a trick play container index from a trick play container file containing a trick play stream; requesting at least one frame of video from the at least one trick play stream; and decoding and displaying the at least one frame of video from the trick play stream.

METHOD, APPARATUS AND SYSTEM FOR FACILITATING NAVIGATION IN AN EXTENDED SCENE
20230298275 · 2023-09-21 ·

A method, apparatus and system for facilitating navigation toward a region of interest in an extended scene of video content include determining a timeline including information regarding at least one region of interest in the video content and displaying, in a portion of the video content currently being displayed, a visual indicator indicating a direction in which to move in the video content to cause the display of the at least one region of interest. In one embodiment of the present principles a timeline is attached to the content and carries information evolving over time about the region(s) of interest. A renderer processes the timeline and provides navigation information to a user using available means such as a graphical representation or haptic information, or a combination of several means.

MULTIFUNCTION MULTIMEDIA DEVICE
20230283837 · 2023-09-07 ·

A method for interpreting messages, user-defined alert conditions, voice commands and performing an action in response is described. A method for annotating media content is described. A method for presenting additional content associated with media content identified based on a fingerprint is described. A method for identifying that an advertisement portion of media content is being played based on a fingerprint derived from the media content is described. A method of one media device recording particular media content automatically in response to another media device recording the particular media content is described. A method of concurrently playing media content on multiple devices is described. A method of publishing information associated with recording of media content is described. A method of deriving fingerprints by media devices that meet an idleness criteria is described. A method of loading, modifying, and displaying a high definition frame from a frame buffer is described. A method of recording or playing media content identified based on fingerprints is described.

SYSTEM AND METHOD FOR AUTOMATICALLY MANAGING MEDIA CONTENT
20220398275 · 2022-12-15 ·

A method, computer program product and computing device for receiving a request to load at least one new media content item on a personal media device. The size of the at least one new media content item is compared with the amount of storage space remaining on the personal media device to determine if the personal media device has sufficient available storage space. If the personal media device does not have sufficient available storage space, a relative weight associated with at least one old media content item stored on the personal media device is ascertained, the relative weight corresponding to a likelihood that the at least one old media content item will be rendered on the personal media device.

Method and system for segmenting video without tampering video data
11540022 · 2022-12-27 ·

Techniques segmenting a video using tags without modifying video data thereof are disclosed. According to one aspect of the present invention, each tag is created to define a portion of the video, wherein the tags can be modified, edited, looped, reordered or restored to a create an impression other than that if the video was played back sequentially. The tags are so structured in a table included in a tagging file that can be shared or published electronically or modified or updated by others. Further the table may be modified to include one or more conditional or commercial tags.

SERVER SIDE CROSSFADING FOR PROGRESSIVE DOWNLOAD MEDIA
20230011998 · 2023-01-12 ·

In exemplary embodiments of the present invention systems and methods are provided to implement and facilitate cross-fading, interstitials and other effects/processing of two or more media elements in a personalized media delivery service so that each client or user has a consistent high quality experience. The effects or crossfade processing can occur on the broadcast, publisher or server-side, but can still be personalized to a specific user, thus still allowing a personalized experience for each individual user, in a manner where the processing burden is minimized on the downstream side or client device. This approach enables a consistent user experience, independent of client device capabilities, both static and dynamic. The cross-fade can be implemented after decoding the relevant chunks of each component clip, processing, recoding and rechunking, or, in a preferred embodiment, the cross-fade or other effect can be implemented on the relevant chunks to the effect in the compressed domain, thus obviating any loss of quality by re-encoding. A large scale personalized content delivery service can be implemented by limiting the processing to essentially the first and last chunks of any file, since there is no need to processing the full clip. In exemplary embodiments of the present invention this type of processing can easily be accommodated in cloud computing technology, where the first and last files may be conveniently extracted and processed within the cloud to meet the required load. Processing may also be done locally, for example, by the broadcaster, with sufficient processing power to manage peak load.

Systems and methods for using video metadata to associate advertisements therewith

A system for using metadata from a video signal to associate advertisements therewith, comprising (i) a segmentation system to divide the video signal into video clips, (ii) a digitizing system for digitizing the video clips, (iii) a feature extraction system for extracting audio and video features from each video clip, associating each audio feature with respective video clips, associating each video feature with respective video clips, and saving the audio and video features into an associated metadata file, (iv) a web interface to the feature extraction system for receiving the video clips, and (v) a database, wherein video signals and associated metadata files are stored and indexed, wherein the associated metadata file is provided when a video player requests the corresponding video signal, enabling selection of a relevant advertisement for presentment in conjunction with respective video clips based on the associated audio and video features of the respective video clip.