Patent classifications
H04N21/440236
SYSTEMS AND METHODS FOR REAL-TIME ADAPTIVE BITRATE TRANSCODING AND TRANSMISSION OF TRANSCODED MEDIA
Methods and systems are provided for streaming a media asset with an adaptive bitrate transcoder. A server receives, from a client device, a first request for a first portion of the plurality of portions to be transcoded at a first bitrate. The server then starts to transcode the plurality of portions at the requested first bitrate to generate a plurality of corresponding transcoded portions. The server updates a header of a transcoded portion to include: 1) a transcode latency value; and 2) a count value indicating a number of available pre-transcoded portions of the media asset at the time the first request was received. The server then transmits the transcoded portion to the client. The client device then determines a second bitrate based on the transcode latency value included in the header of the transcoded portion corresponding to the first portion.
System and method for presenting a video via transcode
A method for handling video includes extracting video content and context information from a video file. The context information is associated with the video content. The method further includes transmitting the video content via a first communication path and transmitting the context information via a second communication path separate from the first communication path.
Translating between spoken languages with emotion in audio and video media streams
Systems and methods are described herein for generating alternate audio for a media stream. The media system receives media that is requested by the user. The media comprises a video and audio. The audio includes words spoken in a first language. The media system stores the received media in a buffer as it is received. The media system separates the audio from the buffered media and determines an emotional state expressed by spoken words of the first language. The media system translates the words spoken in the first language into words spoken in a second language. Using the translated words of the second language, the media system synthesizes speech having the emotional state previously determined. The media system then retrieves the video of the received media from the buffer and synchronizes the synthesized speech with the video to generate the media content in a second language.
Method and system of displaying subtitles, computing device, and readable storage medium
The present invention discloses techniques for generating and presenting subtitles. The disclosed techniques comprise extracting target audio information from a video; converting the target audio information to first text information, wherein the target audio information and the first text information are in a first language; translating the first text information to at least one second text information, wherein the at least one second text information is in at least one second language; generating a first subtitle based on the first text information; generating at least one second subtitle based on the at least one second text information; obtaining a first target subtitle and at least one second target subtitle by implementing a sensitive word processing to the first subtitle and the at least one second subtitle, respectively; and presenting at least one of the first target subtitle or the at least one second target subtitle in response to user input.
Generation of audience appropriate content
Multimedia content to be played on a multimedia player device can be received. Whether the multimedia content contains audience-inappropriate content can be determined. Replacement content corresponding to the audience-inappropriate content can be generated. The generated replacement content can be caused to play on the multimedia player device in lieu of the audience-inappropriate content.
METHOD FOR PROCESSING VIDEO, DEVICE AND STORAGE MEDIUM
The present disclosure provides examples of a method and apparatus for processing a video, a device and a storage medium. The method may include: acquiring a target video and a target comment of the target video; recognizing a picture in the target video to obtain text information of the picture; determining a target comment matching a content of the text information; and inserting, in response to displaying the picture in the target video, the target comment matching the content in a form of a bullet screen.
System and method for context aware detection of objectionable speech in video
Embodiments provide a system and method for filtering speech in a video. Speech in video may contain objectionable or profane words that need to be filtered. To ascertain whether a word or phrase is objectionable, the contextual information from surrounding words and the contextual information from detected objects and scenes in the video are used. Unwanted words may be filtered or collected and presented to the user.
Display device and operating method thereof
Provided are a display device for more accurately providing a function intended by a user upon reception of the voice command and an operating method thereof. The display device comprises a wireless communication unit configured to communicate with at least one external server, a storage unit, a voice recognition unit configured to receive a voice command, a control unit configured to acquire a function corresponding to the voice command, a determination module configured to determine a provider providing the function corresponding to the voice command and an output unit configured to receive data related to the function from the at least one external server or the storage unit according to the determined provider and output the function corresponding to the voice command based on the received data.
Media streaming
There is disclosed a system for providing streaming services, comprising: a plurality of capture devices, each for capturing data and providing a captured data stream; and a server, for receiving the plurality of captured data streams; wherein each capture device is configured to generate metadata for the captured data, and transmit said metadata to the server.
SYSTEMS AND METHODS FOR GENERATING CONTENT FOR A SCREENPLAY
Systems and methods are disclosed herein for generating content based on format-specific screenplay parsing techniques. The techniques generate and present content by generating new dynamic content structures to generate content segments for output on electronic devices. In one disclosed technique, a first instance of a first character name is identified from the screenplay document. A first set of character data following the first instance of the first character name from the screenplay document and preceding an instance of a second character name from the screenplay document is then identified. Upon identification of the first set of character data, a content structure including an object is generated. The object includes attribute table entries based on the first set of character data. A content segment is generated for output based on the content structure (e.g., a 3D animation of the first character interacting within a scene).