Patent classifications
H04N21/8106
Audio and video processing method and apparatus, terminal and storage medium
An audio and video processing method, includes: displaying a video creation interface of a target audio, where the video creation interface includes n audio clips of the target audio and video recording entries corresponding to the n audio clips respectively, n≥2; receiving a trigger signal acting on a target video recording entry on the video creation interface, where the target video recording entry is a video recording entry corresponding to a target audio clip; acquiring a target video corresponding to the target audio clip based on the trigger signal, where the target video is a video clip of which the video duration is less than a duration threshold; and sending a video creation request carrying the target video to a server, where the video creation request is used to instruct to play picture information of the target video when the target audio is played.
SYSTEM AND METHOD FOR PROVIDING ADVANCED CONTENT INTERACTIVITY FEATURES
Systems and methods for interactively engaging consumers of a media asset are disclosed. The methods allow selection and personalization of a media asset character's name, voice, or dialogue while the media asset is being consumed. The personalization may be propagated through the entire media asset or additionally to other episodes, sequels, and related media assets by identifying and replacing associated metatags. The system determines whether the media asset is being consumed as a group watch where its members are consuming the media asset from different IP addresses or being consumed by viewers in the same room to determine the type of changes allowed. The methods also present queries to engage the viewer, such as by the character asking them a question, and provide supplemental videos to aid in responding to the queries. The responses to queries may also determine the path a story takes in the media asset.
Information processing apparatus and non-transitory computer readable medium
An information processing apparatus includes a processor configured to acquire video data that enables playback of a video in which audio, an image, and a caption are chronologically synchronized, receive a section of a playback time of the video, the section being to be removed, and remove a partial caption that corresponds to the audio in the received section and that is at least a portion of the caption from the image in the received section.
METHODS AND SYSTEMS FOR SUPPLEMENTING MEDIA ASSETS DURING FAST-ACCESS PLAYBACK OPERATIONS
Methods and systems are disclosed herein for a media guidance application that enhances the viewer experience by providing supplemental content related to a media asset during a fast-access playback operation. For example, in response to a user input during a fast-forward or rewind operation, the media guidance application may generate for display supplemental content related to the progression point of the media asset at which the user input was received while the fast-forward or rewind operation continues.
METHOD, SYSTEM, AND APPARATUS FOR MULTIMEDIA CONTENT DELIVERY TO CABLE TV AND SATELLITE OPERATORS
Systems, methods, and computer-readable media for delivering multimedia content from the cloud to cable operators are disclosed. A device located at the cable headend or implemented in the cloud can receive a request for at least one media stream for playback on a broadcast media channel. Content corresponding to a plurality of multimedia files in the media stream can be obtained from the internet or a cloud based service. The content can be used to generate the multimedia files in a format that is compatible with the cable operator. The multimedia files can be used to assemble the at least one media stream which can be provided to the cable operator for broadcast on the broadcast media channel.
METHODS AND APPARATUS TO DETERMINE HEADPHONE ADJUSTMENT FOR PORTABLE PEOPLE METER LISTENING TO ENCODED AUDIO STREAMS
Methods, apparatus, systems and articles of manufacture to determine headphone adjustment for portable people meter listening to encoded audio streams are disclosed. An example system disclosed herein includes meter data analyzer circuitry to determine an audience estimate for streaming media based on media data measured by a media meter, the media meter to measure the streaming media based on an identification of the streaming media in ambient audio collected by a microphone of the media meter, and data analyzer circuitry to calculate at least one headphone adjustment factor based on a first determined proportion of an audience that listens to streaming media via headphones and a second determined proportion of the audience that listens to streaming media without headphones.
Stereo Playback Configuration and Control
An example method includes, based on an adjustment to a first displayed volume control, instructing the first playback device to adjust playback volume level; based on an adjustment to a second displayed volume control, instructing the second playback device to adjust playback volume level; after sending the commands, instructing the first and/or second playback device to process an audio stream into a first and/or second channel and to reproduce a respective one of the first and second channel, wherein the grouped first and second playback devices provide multi-channel sound; and based on an adjustment to a third displayed volume control, instructing the first and/or second playback device to adjust a group volume level for both the first and second playback devices.
METHOD TO ALIGN AN IMMERSIVE VIDEO AND AN IMMERSIVE SOUND FIELD
A system comprising a video source, one or more audio sources and a computing device. The video source may be configured to generate a plurality of video streams that capture a view of an environment. The one or more audio sources may be configured to capture audio data of the environment. The computing device may comprise one or more processors configured to (i) perform a stitching operation on the plurality of video streams to generate a video signal representative of an immersive field of view of the environment, (ii) generate a sound field based on the audio data, (iii) identify an orientation for the sound field with respect to the video signal, and (iv) determine a rotation of the sound field based on the orientation. The rotation of the sound field aligns the sound field to the video signal.
Audio file processing to reduce latencies in play start times for cloud served audio files
Methods, systems, and computer programs are presented for managing audio files of a user to reduce latencies in play start times on local devices. The audio files are stored on cloud storage managed by a server. One method includes processing a plurality of audio files associated with a user, where the processing is configured to create audio snippet files from each of the plurality of audio files. The audio snippet files representing a beginning part of each of the plurality of audio files. The method also includes transmitting the audio snippet files to a client device and detecting a request from the client to begin playing a first audio file from the plurality of audio files of the user. The first audio file being stored on the cloud storage managed by the server.
Systems and methods for automatically enabling subtitles based on detecting an accent
Systems and methods are described for automatically enabling subtitles based on a user profile when a language is spoken with an accent a user has difficulty understanding. For example, a media guidance application may detect a first plurality of user interactions of the user while the given language is being spoken with the accent. Based on the first plurality of interactions, the media guidance application may calculate a first value associated with a user specific level of difficulty indicating how difficult it is for the user to understand the language when spoken with the accent. If the first plurality of user interactions are not being performed again, the media guidance application may update the user specific difficulty with a second value that is lower than the first value. The media guidance application may automatically generate for display subtitles for a media asset based on the user specific level of difficulty.