Patent classifications
H04N21/2335
DYNAMIC DECODER CONFIGURATION FOR LIVE TRANSCODING
A method and system for managing transcoding of data in a stream that includes identifying an input source change for the stream with a new input source type, and adding a decoder for the new input source type, the decoder configured to output for a respective encoder in a transcoder pipeline.
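The decoder-switching idea in this abstract can be sketched as a small registry keyed by input source type. This is only an illustration under stated assumptions: the class names `Decoder` and `TranscoderPipeline`, the method `on_input`, and the format labels are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class Decoder:
    source_type: str     # illustrative label such as "h264" or "hevc"
    output_format: str   # raw format the downstream encoder expects


@dataclass
class TranscoderPipeline:
    encoder_input_format: str
    decoders: Dict[str, Decoder] = field(default_factory=dict)
    active_type: Optional[str] = None

    def on_input(self, source_type: str) -> Decoder:
        """Route input to a decoder, adding one when the source type is new."""
        if source_type not in self.decoders:
            # Input source change with a new type: add a decoder whose
            # output is configured for the pipeline's encoder.
            self.decoders[source_type] = Decoder(
                source_type, self.encoder_input_format
            )
        self.active_type = source_type
        return self.decoders[source_type]


pipeline = TranscoderPipeline(encoder_input_format="yuv420p")
first = pipeline.on_input("h264")
second = pipeline.on_input("hevc")   # new source type, decoder is added
again = pipeline.on_input("h264")    # known type, existing decoder is reused
```

Keeping earlier decoders in the registry is a design choice of this sketch; the abstract only states that a decoder is added for the new input source type.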
Method and apparatus for generating caption
A method and apparatus for generating a caption are provided. The method of generating a caption according to one embodiment comprises: generating caption text which corresponds to a voice of a speaker included in broadcast data; generating reference voice information using a part of the voice of the speaker included in the broadcast data; and generating caption style information for the caption text based on the voice of the speaker and the reference voice information.
TEXT-DRIVEN EDITOR FOR AUDIO AND VIDEO ASSEMBLY
The disclosed technology is a system and computer-implemented method for assembling and editing a video program from spoken words or soundbites. The disclosed technology imports source audio/video clips in any of multiple formats. Spoken audio is transcribed into searchable text. The text transcript is synchronized to the video track by timecode markers. Each spoken word corresponds to a timecode marker, which in turn corresponds to a video frame or frames. Using word processing operations and text editing functions, a user selects video segments by selecting corresponding transcribed text segments. By selecting text and arranging that text, a corresponding video program is assembled. The selected video segments are assembled on a timeline display in any order chosen by the user. The sequence of video segments may be reordered and edited, as desired, to produce a finished video program for export.
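The mapping from selected text to video segments can be illustrated with a short sketch. It assumes a transcript stored as a list of words with start and end times in seconds; the names `Word`, `select_segment`, and `assemble`, and the sample transcript, are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Word:
    text: str
    start: float  # timecode of the word's first frame, in seconds
    end: float    # timecode just after the word's last frame, in seconds


def select_segment(transcript: List[Word], first: int, last: int) -> Tuple[float, float]:
    """Return the (start, end) timecodes spanned by words first..last inclusive."""
    if not (0 <= first <= last < len(transcript)):
        raise IndexError("word range out of bounds")
    return (transcript[first].start, transcript[last].end)


def assemble(transcript: List[Word], selections: List[Tuple[int, int]]) -> List[Tuple[float, float]]:
    """Build an edit list of video segments in the order the user arranged the text."""
    return [select_segment(transcript, a, b) for a, b in selections]


transcript = [
    Word("we", 0.0, 0.2), Word("ship", 0.2, 0.6), Word("today", 0.6, 1.1),
    Word("thanks", 1.5, 2.0), Word("everyone", 2.0, 2.7),
]
# The user moves "thanks everyone" ahead of "we ship today".
timeline = assemble(transcript, [(3, 4), (0, 2)])
```

Reordering the selections reorders the resulting segments, which mirrors how arranging text arranges the video program.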
Fragment-aligned audio coding
Audio/video synchronization, or alignment of audio to some other external clock, is made more effective or easier by treating the fragment grid and the frame grid as independent values while nevertheless aligning, for each fragment, the frame grid to the respective fragment's beginning. The loss in compression effectiveness may be kept low by appropriately selecting the fragment size. On the other hand, the alignment of the frame grid with respect to the fragments' beginnings allows for an easy and fragment-synchronized way of handling the fragments in connection with, for example, parallel audio video streaming, bitrate adaptive streaming or the like.
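The trade-off between fragment size and compression loss can be made concrete with a little arithmetic. The sketch below assumes that the frame grid restarts at each fragment's beginning and that the last frame of a fragment extends past the fragment's end, so the excess samples count as overhead. That cost model, the function names, and the sample values are assumptions for illustration, not details taken from the abstract.

```python
def frames_per_fragment(fragment_samples: int, frame_samples: int) -> int:
    """Number of whole frames needed to cover one fragment (ceiling division)."""
    return -(-fragment_samples // frame_samples)


def padding_overhead(fragment_samples: int, frame_samples: int) -> float:
    """Fraction of coded samples that fall beyond the fragment's end."""
    coded = frames_per_fragment(fragment_samples, frame_samples) * frame_samples
    return (coded - fragment_samples) / fragment_samples


# 48 kHz audio with 1024-sample frames (illustrative values).
short_fragment = 24000   # 0.5 s
long_fragment = 96000    # 2.0 s
overhead_short = padding_overhead(short_fragment, 1024)  # 576 / 24000
overhead_long = padding_overhead(long_fragment, 1024)    # 256 / 96000
```

Under these assumptions the relative overhead is 2.4% for the half-second fragment and below 0.3% for the two-second fragment, which is one way to read the statement that the loss stays low when the fragment size is chosen appropriately.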
METHOD AND DEVICE FOR QUICK START OF LIVE VIDEO STREAMING
The present disclosure provides a method for quick start of live video streaming. The method includes: recognizing a type of a current player and invoking a quick start policy compatible with the type of the current player; during the execution of the quick start policy, retrieving a current system time and a quick start configuration time corresponding to the current player; based on the current system time and the quick start configuration time, determining a quick start index position in a target live video stream, and based on the quick start index position, determining a key frame index position in the target live video stream; and sending video data in the target live video stream starting from the key frame index position to the current player and discarding audio data in the target live video stream between the key frame index position and the quick start index position.
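The index arithmetic described in this abstract can be sketched as follows. The sketch assumes a buffered stream held as a list of timestamped packets and takes the quick start position to be the first packet at or after `now - config_seconds`. The `Packet` type, the function name `quick_start`, and that interpretation of the configuration time are assumptions, not details from the disclosure.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Packet:
    ts: float          # presentation time in seconds
    kind: str          # "video" or "audio"
    key: bool = False  # True for video key frames


def quick_start(packets: List[Packet], now: float, config_seconds: float) -> List[Packet]:
    """Select the packets to send to a newly connected player."""
    if not packets:
        return []
    target = now - config_seconds
    # Quick start index: first packet at or after the target time.
    qs = next((i for i, p in enumerate(packets) if p.ts >= target), len(packets) - 1)
    # Key frame index: nearest video key frame at or before the quick start index.
    kf = next(
        (i for i in range(qs, -1, -1) if packets[i].kind == "video" and packets[i].key),
        None,
    )
    if kf is None:
        return []
    # Send video from the key frame onward; drop audio between the two indices.
    return [
        p for i, p in enumerate(packets)
        if i >= kf and (p.kind == "video" or i >= qs)
    ]


stream = [
    Packet(0.0, "video", key=True), Packet(0.0, "audio"),
    Packet(1.0, "video"), Packet(1.0, "audio"),
    Packet(2.0, "video", key=True), Packet(2.0, "audio"),
    Packet(3.0, "video"), Packet(3.0, "audio"),
    Packet(4.0, "video"), Packet(4.0, "audio"),
]
sent = quick_start(stream, now=4.5, config_seconds=1.5)
```

In this example the quick start index is the video packet at 3.0 s, the key frame index is the key frame at 2.0 s, and the audio packet at 2.0 s is discarded.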
LOUDNESS ADJUSTMENT METHOD AND APPARATUS, AND ELECTRONIC DEVICE AND STORAGE MEDIUM
The present application relates to a loudness adjustment method and apparatus, and an electronic device and a storage medium, mainly relating to the technical field of multimedia. The method comprises: converting a sound signal of a multimedia resource into a first frequency domain signal; acquiring a second frequency domain signal based on the first frequency domain signal and frequency response information of a current electronic device, wherein the second frequency domain signal is used for reflecting the loudness of the first frequency domain signal when it is played on the current electronic device; and adjusting the loudness of the sound signal based on the second frequency domain signal and a target loudness, so as to obtain a target sound signal of the multimedia resource.
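The three steps of the method can be sketched with the standard library alone. This is a simplified stand-in, not the application's loudness model: the level estimate is a plain mean-square level in dB computed from the device-weighted spectrum, the frequency response is assumed to be a list of per-bin linear gains, and all function names are hypothetical.

```python
import cmath
import math
from typing import List


def dft(x: List[float]) -> List[complex]:
    """Naive discrete Fourier transform, adequate for short illustrative signals."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def device_level_db(signal: List[float], response: List[float]) -> float:
    """Mean-square level in dB of the signal as shaped by the device response."""
    first = dft(signal)                                # first frequency domain signal
    second = [s * r for s, r in zip(first, response)]  # second frequency domain signal
    n = len(signal)
    power = sum(abs(v) ** 2 for v in second) / (n * n)  # Parseval's theorem
    return 10 * math.log10(power) if power > 0 else float("-inf")


def adjust(signal: List[float], response: List[float], target_db: float) -> List[float]:
    """Scale the signal so that its device-weighted level reaches target_db."""
    level = device_level_db(signal, response)
    if level == float("-inf"):
        return list(signal)  # silence: nothing to scale
    gain = 10 ** ((target_db - level) / 20)
    return [gain * s for s in signal]


tone = [math.sin(2 * math.pi * 4 * t / 64) for t in range(64)]
response = [0.5] * 64            # assumed device attenuates every bin by half
adjusted = adjust(tone, response, target_db=-20.0)
```

Because the gain is applied in the time domain and the device weighting is linear, measuring `adjusted` with the same response gives the target level.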
ADAPTIVE STREAMING WITH DEMUXED AUDIO AND VIDEO TRACKS
Aspects of the subject disclosure may include, for example, accessing adaptive streaming information for a multimedia content segment, determining available access bandwidths for a plurality of video tracks and a plurality of audio tracks identified by the adaptive streaming information, selecting one of a plurality of candidate track pairings identified by the adaptive streaming information based on the available access bandwidths, obtaining video content of the multimedia content segment via a video track of the selected track pairing, and obtaining audio content of the multimedia content segment via an audio track of the selected track pairing. Other embodiments are disclosed.
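One possible selection rule for the track pairings is sketched below. The abstract does not state how the pairing is chosen, so the rule used here (highest combined bitrate that fits the available bandwidth, falling back to the cheapest pairing) is an assumption, as are the function name, track identifiers, and bitrates.

```python
from typing import Dict, List, Tuple

Pairing = Tuple[str, str]  # (video track id, audio track id)


def select_pairing(
    pairings: List[Pairing],
    video_kbps: Dict[str, int],
    audio_kbps: Dict[str, int],
    available_kbps: int,
) -> Pairing:
    """Pick the candidate pairing with the highest total bitrate that fits."""
    def total(pair: Pairing) -> int:
        return video_kbps[pair[0]] + audio_kbps[pair[1]]

    feasible = [p for p in pairings if total(p) <= available_kbps]
    if not feasible:
        # Nothing fits: fall back to the cheapest pairing rather than stalling.
        return min(pairings, key=total)
    return max(feasible, key=total)


video = {"v_low": 800, "v_mid": 2500, "v_high": 6000}
audio = {"a_low": 64, "a_high": 256}
candidates = [
    ("v_low", "a_low"), ("v_mid", "a_low"),
    ("v_mid", "a_high"), ("v_high", "a_high"),
]
choice = select_pairing(candidates, video, audio, available_kbps=3000)
```

Restricting the choice to listed pairings, rather than picking audio and video independently, is the point of the candidate list: it rules out combinations such as high-quality video with low-quality audio.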
TRANSCODING METHOD AND DEVICE FOR AUDIO/VIDEO STREAM
The present invention provides a transcoding method and a device for audio/video streams. The method includes: acquiring an audio/video source stream from a source server, and when transcoding the audio/video source stream, designating a preset encapsulation format as an encapsulation format of the audio/video source stream; analyzing header data of the audio/video source stream to obtain configuration information of the audio/video source stream, and respectively initializing a video decoder and an audio decoder according to the configuration information; and decoding the audio/video source stream using the initialized video decoder and audio decoder, re-encoding decoded audio/video data as a target audio/video stream, and pushing the target audio/video stream to a live broadcast server. The transcoding method and device for audio/video streams provided by the present disclosure may improve the transcoding speed.
Control signal transmission circuit and control signal receiving circuit for audio/video interface
A control signal transmission circuit and a control signal receiving circuit for an audio/video interface are provided. The control signal transmission circuit includes an audio/video interface encoder, a signal packaging circuit and a data allocator. The audio/video interface encoder is configured to receive an audio packet and supports a user-defined packet format. The signal packaging circuit is configured to receive a first control signal and package the first control signal into a control data packet according to the user-defined packet format. The data allocator is configured to receive video data and a second control signal and to mix the second control signal and the video data to generate a mixed data packet. The audio/video interface encoder packages the control data packet, the mixed data packet and the audio packet according to an audio/video transmission protocol to generate audio/video and control data.
ARTIFICIAL INTELLIGENCE MODEL FOR PREDICTING PLAYBACK OF MEDIA DATA
A system is provided to predict requested playbacks of media files by users from a media storage system. The system includes a processor and a computer readable medium operably coupled thereto, to perform predictive playback operations which include accessing an AI model and a media file comprising metadata associated with generating the media file, generating a predictive score for a playback of the media file based on the AI model and the metadata, comparing the predictive score to a threshold required to transcode the media file into a playback format prior to the playback, predicting the playback based on the comparing, determining a predicted playback time of the media file based on the metadata for the media file, and transcoding the media file into the playback format prior to the predicted playback time.
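The predictive playback operations can be sketched end to end with a toy scoring function. Everything specific here is an assumption: a logistic function with made-up weights stands in for the AI model, and the metadata keys (`flagged`, `duration_min`, `typical_delay_hours`) and function names are hypothetical.

```python
import math
from datetime import datetime, timedelta
from typing import Dict, Optional


def predictive_score(metadata: Dict[str, float], weights: Dict[str, float], bias: float) -> float:
    """Toy stand-in for the AI model: a logistic score in (0, 1)."""
    z = bias + sum(w * metadata.get(name, 0.0) for name, w in weights.items())
    return 1.0 / (1.0 + math.exp(-z))


def plan_transcode(
    metadata: Dict[str, float],
    weights: Dict[str, float],
    bias: float,
    threshold: float,
    created_at: datetime,
    lead_time: timedelta,
) -> Optional[datetime]:
    """Return when to start transcoding, or None if no playback is predicted."""
    if predictive_score(metadata, weights, bias) < threshold:
        return None
    # Predicted playback time derived from metadata (hypothetical field).
    predicted = created_at + timedelta(hours=metadata["typical_delay_hours"])
    # Start early enough that the playback format is ready beforehand.
    return predicted - lead_time


weights = {"flagged": 2.0, "duration_min": -0.1}
recording = {"flagged": 1.0, "duration_min": 5.0, "typical_delay_hours": 24.0}
start = plan_transcode(
    recording, weights, bias=-0.5, threshold=0.7,
    created_at=datetime(2024, 1, 1, 9, 0), lead_time=timedelta(hours=1),
)
```

With these illustrative numbers the score is about 0.73, which clears the 0.7 threshold, so the transcode is scheduled one hour before the predicted playback time.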