H04N21/4888

Language-agnostic subtitle drift detection and localization

Devices, systems, and methods are provided for language-agnostic subtitle drift detection and localization. A method may include extracting audio from video, dividing the audio into overlapping blocks, and determining the probabilities of overlapping portions of the blocks, the probabilities indicating a presence of voice data represented by the audio in the blocks. The method may generate machine blocks using overlapping portions of blocks where voice data is present, and may map the machine blocks to corresponding blocks indicating that subtitles are available for the video. For mapped blocks, the method may include determining features such as when subtitles are available without voice audio, when voice audio is available without subtitles, and when voice audio and subtitles both are available. Using the features, the method may include determining the probability that the video includes subtitle drift, and the method may include analyzing the video to localize where the subtitle drift occurs.

METHOD AND APPARATUS FOR RETRIEVING TELEPLAY CONTENT

A method and an apparatus for retrieving teleplay content is disclosed. The method includes: generating basic summary information corresponding to each teleplay based on basic information of entities of each teleplay; generating episode summary information corresponding to each episode of each teleplay based on episode data of each episode of each teleplay; establishing a teleplay graph database based on the basic summary information corresponding to each teleplay and the episode summary information corresponding to each episode; and feeding a playing portal for a target episode of a target teleplay corresponding to teleplay search information back to a user based on the teleplay graph database.

Reprogramming of a programmable device of a specific version

.[.A unified system of programming communication. The system encompasses the prior art (television, radio, broadcast hardcopy, computer communications, etc.) and new user specific mass media. Within the unified system, parallel processing computer systems, each having an input (e.g., 77) controlling a plurality of computers (e.g., 205), generate and output user information at receiver stations. Under broadcast control, local computers (73, 205), combine user information selectively into prior art communications to exhibit personalized mass media programming at video monitors (202), speakers (263), printers (221), etc. At intermediate transmission stations (e.g., cable television stations), signals in network broadcasts and from local inputs (74, 77, 97, 98) cause control processors (71) and computers (73) to selectively automate connection and operation of receivers (53), recorder/players (76), computers (73), generators (82), strippers (81), etc. At receiver stations, signals in received transmissions and from local inputs (225, 218, 22) cause control processors (200) and computers (205) to automate connection and operation of converters (201), tuners (215), decryptors (224), recorder/players (217), computers (205), furnaces (206), etc. Processors (71, 200) meter and monitor availability and usage of programming..]. .Iadd.A method and apparatus to reprogram a receiver station, where the receiver station includes a receiver and a programmable device of a specific version having a memory. The receiver station receives an electronic digital information transmission including operating system instructions and a digital control signal that designates a designated hardware version of a programmable device. The received operating system instructions are communicated to the memory if a match occurs between the designated hardware version included in the transmission and the specific version of the programmable device resident at the receiver station..Iaddend.

Display apparatus and control method thereof

A display apparatus includes a receiver, an image processor, a display and a controller. The receiver receives an image of content in the form image segments. The image processor processes the image of content received via the receiver. The display displays the processed image of content. The controller controls the image processor to display an image corresponding to one viewpoint of the image of content, and display information about a display quality of at least one image segment based on reception states of the image segments. With this, the display apparatus may provide the information about display quality for the at least one segment of the image of content, thereby allowing a user to watch the image of content while smoothly moving a viewpoint.

Identifying and tracking words in a video recording of captioning session

Disclosed are a method, a system, and a non-transitory computer readable medium for identifying captions in captioned video. A method includes receiving audio and video content from a caption device where the video content includes captioned text, extracting frames of video from the received video content where the frames of video include captioned text, recognizing text from the captioned text in the extracted frames of video, and generating a descriptive textual file including timing information for the recognized text and timing information for the captioned text.

Augmentation of audio/video content with enhanced interactive content
10863245 · 2020-12-08 · ·

Provided are systems and methods for augmenting audio/video content with enhanced interactive content. Events are detected in the audio/video content and contextual information is determined corresponding to the events using enhanced metadata and content-specific data. Indicators are displayed to indicate the occurrence of an event and the information about the event is provided.

IDENTIFYING AND TRACKING WORDS IN A VIDEO RECORDING OF CAPTIONING SESSION

Disclosed are a method, a system, and a non-transitory computer readable medium for identifying captions in captioned video. A method includes receiving audio and video content from a caption device where the video content includes captioned text, extracting frames of video from the received video content where the frames of video include captioned text, recognizing text from the captioned text in the extracted frames of video, and generating a descriptive textual file including timing information for the recognized text and timing information for the captioned text.

Enhanced timed text in video streaming

Timed text that is provided in a television broadcast or media stream can be enhanced to provide an improved user experience. A scrollable text window can be provided in a media player application, for example, that can allow the user to quickly catchup from a missed moment. The timed text may be enhanced to allow links to dictionaries, encyclopedias, online sources, thesauruses, translating services, and/or the like. Further implementations could use automated tools to automatically generate program summaries for watched or unwatched content.

Systems and methods for aligning text and multimedia content

The present disclosure is generally directed to a tangible, non-transitory machine-readable medium that includes machine-readable instructions that, when executed by processing circuitry, cause the processing circuitry to receive multimedia content that includes a plurality of multimedia content portions of the multimedia content. The instructions, when executed by the processing circuitry, also cause the processing circuitry to receive text data corresponding to words spoken in the multimedia content. The text data includes a plurality of text data subdivisions of the text data. Moreover, the instructions, when executed by the processing circuitry, cause the processing circuitry to align the multimedia content and the text data by determining, for each of the plurality of multimedia content portions, a corresponding subdivision of the plurality of text data subdivisions. Furthermore, the instructions, when executed by the processing circuitry, cause the processing circuitry to cause display of the multimedia content aligned to the text data.

SYSTEMS AND METHODS FOR ALIGNING TEXT AND MULTIMEDIA CONTENT
20200213478 · 2020-07-02 ·

The present disclosure is generally directed to a tangible, non-transitory machine-readable medium that includes machine-readable instructions that, when executed by processing circuitry, cause the processing circuitry to receive multimedia content that includes a plurality of multimedia content portions of the multimedia content. The instructions, when executed by the processing circuitry, also cause the processing circuitry to receive text data corresponding to words spoken in the multimedia content. The text data includes a plurality of text data subdivisions of the text data. Moreover, the instructions, when executed by the processing circuitry, cause the processing circuitry to align the multimedia content and the text data by determining, for each of the plurality of multimedia content portions, a corresponding subdivision of the plurality of text data subdivisions. Furthermore, the instructions, when executed by the processing circuitry, cause the processing circuitry to cause display of the multimedia content aligned to the text data.