Patent classifications
H04N21/8547
WATERMARK-BASED METADATA ACQUISITION AND PROCESSING
The disclosed embodiments relate to media devices implementing new television standards, such as ATSC 3.0, which includes audio/video essence and metadata/signaling. The disclosed embodiments include techniques for confirming an association between two media devices by augmenting media content with a watermark using an upstream device, wherein the watermark conveys an identifier. The media content is sent from the upstream device to a downstream device and the watermark is detected using the downstream device. The identifier is transmitted to a server upon the detection of the watermark using the downstream device. The server identifies an association between the upstream and downstream devices upon receipt of the identifier from the downstream device.
Server for providing a graphical user interface to a client and a client
The invention relates to a server for providing a graphical user interface to a client over a communication network. The graphical user interface comprises a graphical user interface element, the graphical user interface element being formed by an element shape and an element text, the element shape being represented by element shape data, the element text being represented by element text data. The server comprises an encoder configured to encode the element shape data into video data, a detector configured to detect a change associated with the graphical user interface element within the graphical user interface, and a communication interface configured to separately transmit the video data and the element text data over the communication network, the element text data being transmitted upon detection of the change associated with the graphical user interface element for providing the graphical user interface to the client.
Server for providing a graphical user interface to a client and a client
The invention relates to a server for providing a graphical user interface to a client over a communication network. The graphical user interface comprises a graphical user interface element, the graphical user interface element being formed by an element shape and an element text, the element shape being represented by element shape data, the element text being represented by element text data. The server comprises an encoder configured to encode the element shape data into video data, a detector configured to detect a change associated with the graphical user interface element within the graphical user interface, and a communication interface configured to separately transmit the video data and the element text data over the communication network, the element text data being transmitted upon detection of the change associated with the graphical user interface element for providing the graphical user interface to the client.
Real-time incorporation of user-generated content into third-party streams
Systems and methods for real-time incorporation of user-produced content into a broadcast media stream are provided. A media title may be streamed to a producer computing device over a communication network. The producer computing device is associated with a channel for distributing the user-produced content in conjunction with the media title. Produced content may be captured from the producer computing device as the streamed media title is played on the producer computing device. Such captured produced content may be designated for the channel. The media title and the produced content may then be broadcast in real-time over the communication network to one or more subscriber devices subscribed to the channel. The media title and the produced content may be synchronized within the broadcast to reflect when the produced content was captured in relation to the media title as the media title was played on the producer computing device.
Method and system for transmitting and reproducing video of dynamic bitrate with a plurality of channels
Disclosed is a method and system for transmitting and reproducing a video of a dynamic bitrate using a plurality of channels. A video transmission method may transferring frames of a video to N encoders using an interleaving scheme, N denoting a first natural number greater than or equal to 2, generating N video streams by encoding the frames using the N encoders, each of the N video streams corresponding to one of the N encoders, and transmitting each of the N video streams as an independent stream.
Method and system for precise presentation of audiovisual content with temporary closed captions
A method, set-top box, and non-transitory computer readable medium are disclosed for presentation of audiovisual content with closed captions. The method includes receiving, via an input device interfaced with the electronic device, an instruction requesting a replay of previously viewed video content with closed captioning; sending, to the display device interfaced with the electronic device, one or more thumbnail images of the previously viewed video content to be displayed on the display device; receiving, via the input device interfaced with the electronic device, one of the one or more thumbnail images of the previously viewed video content being selected for replay of the previously viewed video content; and sending, to the display device interfaced with the electronic device, closed captioning with the previously viewed video content starting at a video frame corresponding to the one of the one or more thumbnails of the previously viewed video content selected for replay.
Method and system for precise presentation of audiovisual content with temporary closed captions
A method, set-top box, and non-transitory computer readable medium are disclosed for presentation of audiovisual content with closed captions. The method includes receiving, via an input device interfaced with the electronic device, an instruction requesting a replay of previously viewed video content with closed captioning; sending, to the display device interfaced with the electronic device, one or more thumbnail images of the previously viewed video content to be displayed on the display device; receiving, via the input device interfaced with the electronic device, one of the one or more thumbnail images of the previously viewed video content being selected for replay of the previously viewed video content; and sending, to the display device interfaced with the electronic device, closed captioning with the previously viewed video content starting at a video frame corresponding to the one of the one or more thumbnails of the previously viewed video content selected for replay.
System and method to support synchronization, closed captioning and highlight within a text document or a media file
The present invention relates to a system and method for synchronizing and highlighting a target text and audio associated with a reference document. The system and method may comprise one or more of an input unit, an extracting unit, a mapping unit, a processing unit, and an image resizing unit. The system and method may synchronize the target text and audio in order to provide a user with a Read Along. The invention further synchronizes and highlights closed captions and audio that helps people with hearing impairment to comprehend better while watching a movie or listening to songs.
Language agnostic missing subtitle detection
Some implementations include methods for detecting missing subtitles associated with a media presentation and may include receiving an audio component and a subtitle component associated with a media presentation, the audio component including an audio sequence, the audio sequence divided into a plurality of audio segments; evaluating the plurality of audio segments using a combination of a recurrent neural network and a convolutional neural network to identify refined speech segments associated with the audio sequence, the recurrent neural network trained based on a plurality of languages, the convolutional neural network trained based on a plurality of categories of sound; determining timestamps associated with the identified refined speech segments; and determining missing subtitles based on the timestamps associated with the identified refined speech segments and timestamps associated with subtitles included in the subtitle component.
LIVE COMMENTING PROCESSING METHOD AND SYSTEM
The present disclosure describes techniques of processing bullet comments. The techniques comprise acquiring a page of playing a video to output a video stream; acquiring multiple pieces of bullet comment data associated with the video stream; traversing the multiple pieces of bullet comment data and determining whether the multiple pieces of bullet comment data comprise at least one piece of bullet comment data in an expired state; and deleting the at least one piece of bullet comment data in the expired state from the plurality of pieces of bullet comment data in response to determining that the plurality of pieces of bullet comment data comprise the at least one piece of bullet comment data in the expired state.