Patent classifications
H04N21/4856
Multimedia Distribution System for Multimedia Files with Packed Frames
A multimedia file and methods of generating, distributing and using the multimedia file are described. Multimedia files in accordance with embodiments of the present invention can contain multiple video tracks, multiple audio tracks, multiple subtitle tracks, data that can be used to generate a menu interface to access the contents of the file and ‘meta data’ concerning the contents of the file. Multimedia files in accordance with several embodiments of the present invention also include references to video tracks, audio tracks, subtitle tracks and ‘meta data’ external to the file. One embodiment of a multimedia file in accordance with the present invention includes a series of encoded video frames and encoded menu information.
Transmission of audio streams
A system and method of transmitting respective audio streams to a plurality of end points, such as headphones, earphones, headsets, speakers, etc. is disclosed. Different audio streams are transmitted to each of the plurality of end points. The end points may be arranged to audibly output received audio streams, and so each end point may audibly output a respective different audio stream, i.e. the respective audio streams may be mutually different from each other.
Automated Generation of Banner Images
Example systems and methods for automated generation of banner images are disclosed. A program identifier associated with a particular media program may be received by a system, and used for accessing a set of iconic digital images and corresponding metadata associated with the particular media program. The system may select a particular iconic digital image for placing a banner of text associated with the particular media program, by applying an analytical model of banner-placement criteria to the iconic digital images. The system may apply another analytical model for banner generation to the particular iconic image to determine (i) dimensions and placement of a bounding box for containing the text, (ii) segmentation of the text for display within the bounding box, and (iii) selection of font, text size, and font color for display of the text. The system may store the particular iconic digital image and banner metadata specifying the banner.
INTERACTIVE SMART MEDIA DEVICE
An apparatus, method, and computer-readable recording medium act upon a voice input and replay a portion of streaming media data. The smart media device has a controller, a display, a buffer for maintaining a portion of streaming media data, a speaker, a microphone, a non-transitory memory storing a program, and a communication interface configured to establish communication connections with a remote server. The smart media device receives streaming media data for output to the display and speaker from the remote server, stores a portion of the streaming media data within the buffer, receives a voice command via the input microphone while media content is output to the display and speaker, generates a data representation of dialog contained within the buffer, and outputs the data representation of dialog to the display.
ELECTRONIC DEVICE, METHOD, MEDIUM, AND PROGRAM FOR SIMULTANEOUS INTERPRETATION
The present disclosure relates to an electronic device, a method, a medium, and a program for simultaneous interpretation. An electronic device includes: a memory storing instruction; and a processor configured to execute the instruction to cause the electronic device to: present a prompt on a user interface indicating whether simultaneous interpretation is required when the language of a video or program is not the official language of the geographic location; present target language options to the user in response to selection of simultaneous interpretation, wherein the target language options include the official language; receive the original audio of the video or program; extract the audio segments of the original audio in real time and translate them into a target language in response to the selection of the target language; and output the audio segments in the target language.
Subtitle rendering based on the reading pace
Systems and methods for summarizing captions, configuring playback speed, and rewriting the caption file for a media asset are disclosed. The system determines whether to display the original captions or a summarized version of the captions, which are based on user's language proficiency level, reading pace, and historical data, and can be generated either on-demand or automatically when rewinds and pauses are detected. The caption file which includes the original captions can be rewritten. The system determines whether to stream a caption or a rewritten file to a media device based on user or system selections. In the absence of a caption file, or when the caption file cannot be summarized, the playback speed of the media asset is slowed down to provide additional reading time to the user.
Subtitle data editing method and subtitle data editing program for contents such as moving images
The present invention provides a subtitle data editing method that simplifies the work of inputting subtitles to be displayed on contents such as moving images and facilitates quick and efficient editing work. By accepting a predetermined line feed operation twice consecutively in a state wherein a cursor is present in a subtitle input field in which the subtitle content is input, a subtitle input field is separated to be displayed on screen separations before and after the cursor. By accepting a predetermined line feed operation in a state where the cursor exists at the beginning of the second and subsequent lines, the line and the line directly above are separated and the subtitle input field is divided and displayed on screen, when the subtitle input field in which the subtitle content is input is a plurality of lines.
Audio video translation into multiple languages for respective listeners
An audio source such as a display device configured to present AV content can present the video and send the audio in different languages to the respective devices of different listeners. For example, a device/TV/source can send audio in different languages to connected headphones/smartglasses with speakers/devices/sink. Furthermore, machine learning may be employed both to recognize listeners and correlate them to likely languages and to mimic voices in the played-back audio. Or, the source AV display device may send language in only the selected language of the display device to each listener device, with each receiving listener device converting the audio to the preferred language of the respective listener on the fly.
Multimedia distribution system for multimedia files with packed frames
A multimedia file and methods of generating, distributing and using the multimedia file are described. Multimedia files in accordance with embodiments of the present invention can contain multiple video tracks, multiple audio tracks, multiple subtitle tracks, data that can be used to generate a menu interface to access the contents of the file and ‘meta data’ concerning the contents of the file. Multimedia files in accordance with several embodiments of the present invention also include references to video tracks, audio tracks, subtitle tracks and ‘meta data’ external to the file. One embodiment of a multimedia file in accordance with the present invention includes a series of encoded video frames and encoded menu information.
Systems and methods for controlling closed captioning
A system for controlling turning on and off of closed captioning receives information regarding a program content stream and automatically determines whether to turn on or off closed captioning based on thresholds being crossed regarding an estimated current loudness level of ambient noise and an estimated current loudness level of the audio of the program content stream. The estimated current loudness level of audio of the program content stream is, or is based on, one or more indications of current volume level in an audio signal representing the audio of the program content stream and current audio settings of a device outputting the audio of the program content stream. The system may estimate the loudness level of the ambient noise by use of a loudness meter that causes the ambient noise to be sampled with a microphone and a decibel level of the sampled ambient noise to be determined.