Patent classifications
H04N5/9305
System and method for movie karaoke
While watching a movie, a user speaks lines of dialogue. The system records the speech, compares it with the dialogue in the movie, and reports a score to the user. The system can share scores through an online service to create a community experience. In particular, the systems and methods disclosed implement a technique for matching user input to media content. A computer system receives audio input (speech) from a user and compares the received speech to dialogue in a movie or television program. For example, the computer system may convert the received speech to text and compare the converted text against dialogue text taken from closed captioning or subtitle data. Alternatively, waveform data may be compared. The computer system generates a score for the speech based on how closely the speech matches the dialogue, and reports the score to the user through a user interface.
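The text-comparison path described above can be sketched as follows. This is a minimal illustration, not the patented method: it assumes speech recognition has already produced a transcript string, and the function names `normalize` and `dialogue_score`, the word-level matching, and the 0-100 scale are all assumptions made for the example.

```python
import re
from difflib import SequenceMatcher


def normalize(text: str) -> list[str]:
    """Lowercase the text and split it into words, dropping punctuation."""
    return re.findall(r"[a-z0-9']+", text.lower())


def dialogue_score(spoken_text: str, caption_text: str) -> int:
    """Return a 0-100 score for how closely the spoken words match the caption.

    Hypothetical scoring rule: the word-level similarity ratio from
    difflib.SequenceMatcher, scaled to a percentage.
    """
    spoken = normalize(spoken_text)
    caption = normalize(caption_text)
    if not spoken and not caption:
        return 100  # nothing was expected and nothing was said
    ratio = SequenceMatcher(None, spoken, caption).ratio()
    return round(ratio * 100)


if __name__ == "__main__":
    caption = "May the Force be with you."
    print(dialogue_score("may the force be with you", caption))
    print(dialogue_score("may the horse be with you", caption))
```

Comparing word lists rather than raw characters keeps the score insensitive to punctuation and capitalization, which caption data and recognizer output rarely agree on.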
Tracking interactivity with a prerecorded media file
In a method for tracking interactivity with a prerecorded video file superimposed into a video, presentation instructions for displaying a prerecorded video file are displayed on a display device of a mobile electronic device, the presentation instructions including display conditions for displaying the prerecorded video file. A video of a scene is displayed on the display device of the mobile electronic device. Responsive to detecting at least one display condition of the display conditions, the prerecorded video file is displayed on the display device of the mobile electronic device, such that the video is partially obscured by the prerecorded video file. Responsive to displaying the prerecorded video file, a display instance for the prerecorded video file is logged.
Overlaying multi-source media in VRAM
Methods, apparatuses, and computer program products for overlaying multi-source media in VRAM are described. The primary media source is rendered in VRAM by an application program, and then the secondary media source(s) are rendered and blended with the primary source in VRAM at the primary source's own location, so no extra buffer is needed. This improves system performance and reduces power consumption by reducing system bus, system memory, and CPU usage.
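The idea of blending into the primary source's own storage, rather than into a separate buffer, can be illustrated with a small sketch. Everything here is an assumption for illustration: a `bytearray` stands in for the VRAM region, a single constant `alpha` stands in for whatever blend rule the hardware applies, and `blend_in_place` is a hypothetical name.

```python
def blend_in_place(primary: bytearray, overlay: bytes, alpha: float) -> None:
    """Alpha-blend overlay into primary, writing the result back into primary.

    No third buffer is allocated: each output byte overwrites the primary
    byte it was computed from.
    """
    if len(primary) != len(overlay):
        raise ValueError("primary and overlay must have the same length")
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be between 0 and 1")
    for i, (p, o) in enumerate(zip(primary, overlay)):
        primary[i] = round(alpha * o + (1.0 - alpha) * p)


if __name__ == "__main__":
    frame = bytearray([0, 100, 200])   # stand-in for the rendered primary source
    subtitle = bytes([255, 100, 0])    # stand-in for a secondary source
    blend_in_place(frame, subtitle, alpha=0.25)
    print(list(frame))
```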
Apparatus for video output and associated methods
An apparatus comprising a processor and memory including computer program code, the memory and computer program code configured to, with the processor, enable the apparatus at least to: use received current-field-of-view indication data together with future-event-direction data, in respect of recorded panoramic video output provided by panoramic video content data, to provide a sensory cue for a viewer of the recorded panoramic video output to indicate the direction of a future event in the recorded panoramic video output which is outside a current field of view, wherein the recorded panoramic video output is configured to provide video content to the viewer which extends outside the field of view of the viewer in at least one direction, and the future-event-direction data is supplemental to the panoramic video content data which provides the video content itself.
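One way to picture how field-of-view data and event-direction data might combine into a cue is the sketch below. It is an assumption-laden illustration, not the claimed apparatus: directions are reduced to a single yaw angle in degrees that increases to the viewer's right, and the names `signed_angle_deg` and `direction_cue` are hypothetical.

```python
from typing import Optional


def signed_angle_deg(from_deg: float, to_deg: float) -> float:
    """Smallest signed rotation from from_deg to to_deg, in (-180, 180]."""
    diff = (to_deg - from_deg) % 360.0
    return diff - 360.0 if diff > 180.0 else diff


def direction_cue(view_yaw_deg: float, fov_deg: float,
                  event_yaw_deg: float) -> Optional[str]:
    """Return None if the event is inside the current field of view,
    otherwise the side ("left" or "right") the viewer should turn toward."""
    offset = signed_angle_deg(view_yaw_deg, event_yaw_deg)
    if abs(offset) <= fov_deg / 2.0:
        return None  # event is already visible, so no cue is needed
    return "right" if offset > 0 else "left"


if __name__ == "__main__":
    print(direction_cue(view_yaw_deg=0, fov_deg=90, event_yaw_deg=120))
    print(direction_cue(view_yaw_deg=0, fov_deg=90, event_yaw_deg=270))
```

The returned side could then drive any kind of sensory cue, such as an arrow overlay or audio panned toward that side.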
System and method for presenting virtual reality content to a user
This disclosure describes a system configured to present primary and secondary, tertiary, etc., virtual reality content to a user. Primary virtual reality content may be displayed to a user, and, responsive to the user turning his view away from the primary virtual reality content, a sensory cue is provided to the user that indicates to the user that his view is no longer directed toward the primary virtual reality content, and secondary, tertiary, etc., virtual reality content may be displayed to the user. Primary virtual reality content may resume when the user returns his view to the primary virtual reality content. Primary virtual reality content may be adjusted based on a user's interaction with the secondary, tertiary, etc., virtual reality content. Secondary, tertiary, etc., virtual reality content may be adjusted based on a user's progression through the primary virtual reality content, or interaction with the primary virtual reality content.
Transcript paragraph segmentation and visualization of transcript paragraphs
Embodiments of the present invention provide systems, methods, and computer storage media for segmenting a transcript into paragraphs. In an example embodiment, a transcript is segmented to start a new paragraph whenever there is a change in speaker and/or a long pause in speech. If any remaining paragraphs are longer than a designated length or duration (e.g., 50 or 100 words), each of those paragraphs is segmented using dynamic programming to minimize a cost function that penalizes candidate paragraphs based on divergence from a target paragraph length and/or that rewards candidate paragraphs that group semantically similar sentences. As such, the transcript is visualized, segmented at the identified paragraphs.
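The dynamic-programming step can be sketched as follows. This is a simplified illustration under stated assumptions: the cost of a candidate paragraph is only the squared difference between its word count and a target length, the reward for semantic similarity mentioned above is omitted, and `segment_paragraphs` is a hypothetical name.

```python
def segment_paragraphs(sentence_lengths: list[int],
                       target_words: int) -> list[tuple[int, int]]:
    """Split consecutive sentences into paragraphs of roughly target_words.

    Returns (start, end) sentence-index pairs, end exclusive, minimizing the
    summed squared deviation of each paragraph's word count from the target.
    """
    n = len(sentence_lengths)
    # prefix[i] is the total word count of the first i sentences.
    prefix = [0]
    for length in sentence_lengths:
        prefix.append(prefix[-1] + length)

    best = [0.0] + [float("inf")] * n   # best[i]: min cost for sentences [0, i)
    back = [0] * (n + 1)                # back[i]: start of the last paragraph
    for end in range(1, n + 1):
        for start in range(end):
            words = prefix[end] - prefix[start]
            cost = best[start] + (words - target_words) ** 2
            if cost < best[end]:
                best[end] = cost
                back[end] = start

    # Walk the back-pointers from the end to recover paragraph boundaries.
    bounds = []
    end = n
    while end > 0:
        start = back[end]
        bounds.append((start, end))
        end = start
    bounds.reverse()
    return bounds


if __name__ == "__main__":
    print(segment_paragraphs([10, 10, 10, 10], target_words=20))
```

A similarity reward could be added by subtracting a per-paragraph coherence term from `cost` inside the inner loop, leaving the rest of the recurrence unchanged.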
Recording apparatus, reproduction apparatus and file management method
A recording apparatus, a reproduction apparatus and a file management method are disclosed wherein, even if one of files recorded on a recording medium cannot be reproduced regularly, another file selected by the user can be reproduced normally. A file having a hierarchical structure formed from video data and audio data both in the form of compressed data together with information necessary for processing of the video data and audio data is produced and recorded on a predetermined recording medium. Upon production of the file, information regarding decoding of the video data and audio data is disposed collectively on the top side of the file.
Systems, Methods, and Devices for Synchronization of Vehicle Data with Recorded Audio
A method for post-processing to synchronize audio data with vehicle data includes generating artificial sound data based on time-series vehicle data. The method includes determining an offset that maximizes cross-correlation between the artificial sound data and recorded audio data. The method also includes shifting one or more of the time-series data and the recorded audio data relative to each other in time based on the offset. The shift may be used to generate or render a synchronized set of time-series data and recorded audio data.
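The offset search can be illustrated with a brute-force cross-correlation over a bounded range of lags. This is a minimal sketch, assuming both signals are already sampled at the same rate and that the artificial sound has been generated elsewhere; the name `best_offset` and the `max_lag` bound are assumptions for the example.

```python
def best_offset(artificial: list[float], recorded: list[float],
                max_lag: int) -> int:
    """Return the lag in [-max_lag, max_lag] that maximizes
    sum(artificial[i] * recorded[i + lag]).

    A positive result means the recorded audio trails the artificial
    sound by that many samples.
    """
    def correlation(lag: int) -> float:
        total = 0.0
        for i, a in enumerate(artificial):
            j = i + lag
            if 0 <= j < len(recorded):
                total += a * recorded[j]
        return total

    return max(range(-max_lag, max_lag + 1), key=correlation)


if __name__ == "__main__":
    artificial = [0, 0, 1, 2, 1, 0, 0, 0]
    recorded = [0, 0, 0, 0, 1, 2, 1, 0]   # same pulse, two samples later
    lag = best_offset(artificial, recorded, max_lag=4)
    print(lag)
    print(recorded[lag:])                 # recorded audio shifted into alignment
```

For long recordings an FFT-based correlation would be far faster; the loop form is used here only because it makes the definition of the offset explicit.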
Systems and methods for intelligently synchronizing events in visual content with musical features in audio content
Systems for synchronizing events or transitions in visual content with musical features in audio content are configured to obtain audio and visual content; determine a minimum, maximum, and/or a target display duration for items of visual content; determine a first playback-time in the first audio content to associate with a start-of-display time for the first visual content; identify a first timeframe in the first audio content corresponding to a range of acceptable end-of-display times for the first visual content; identify musical features within the first timeframe; identify a candidate musical feature among the identified musical features in accordance with a hierarchy; and/or define a first candidate end-of-display time that aligns with the playback time of the candidate musical feature. A set of candidate end-of-display times is defined for multiple visual content items in a single multimedia project, the set identified by seeking a solution that increases rank among the hierarchy.
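Selecting a candidate end-of-display time for a single visual item might look like the sketch below. The feature kinds and their ranking in `FEATURE_RANK` are hypothetical, since the abstract does not name the hierarchy, and breaking ties by closeness to the target duration is likewise an assumption. The multi-item search over a whole project is not shown.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical hierarchy: a lower number means a higher-ranked musical feature.
FEATURE_RANK = {"part": 0, "phrase": 1, "bar": 2, "beat": 3}


@dataclass(frozen=True)
class MusicalFeature:
    kind: str       # one of the hypothetical kinds above
    time_s: float   # playback time of the feature, in seconds


def candidate_end_time(features: list[MusicalFeature], start_s: float,
                       min_dur_s: float, max_dur_s: float,
                       target_dur_s: float) -> Optional[float]:
    """Pick an end-of-display time aligned with the highest-ranked feature
    inside the acceptable window, preferring times near the target duration."""
    earliest = start_s + min_dur_s
    latest = start_s + max_dur_s
    in_window = [f for f in features if earliest <= f.time_s <= latest]
    if not in_window:
        return None  # no musical feature falls in the acceptable timeframe
    target = start_s + target_dur_s
    best = min(
        in_window,
        key=lambda f: (FEATURE_RANK.get(f.kind, len(FEATURE_RANK)),
                       abs(f.time_s - target)),
    )
    return best.time_s


if __name__ == "__main__":
    features = [
        MusicalFeature("beat", 3.0),
        MusicalFeature("bar", 4.0),
        MusicalFeature("beat", 5.0),
        MusicalFeature("phrase", 8.0),
    ]
    print(candidate_end_time(features, start_s=0.0, min_dur_s=2.0,
                             max_dur_s=6.0, target_dur_s=5.0))
```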