Patent classifications
H04N21/4394
Scene aware searching
Novel tools and techniques are provided for scene aware searching. A system may include a media player configured to play a video stream, a database, and a server configured to host an artificial intelligence (AI) engine. The server may further include a processor and a non-transitory computer readable medium comprising a set of instructions that, when executable by the processor to receive, from the media device, a search query from a user. The AI engine may further be configured to obtain the video stream associated with the search query, identify one or more objects in the video stream, derive contextual data associated with the one or more objects, identify one or more matches based on the contextual data, and determine a result of the search query.
Electronic apparatus and control method thereof
An electronic apparatus includes a communication interface, a display, and a processor configured to, based on a content being received from a source apparatus via the communication interface, obtain feature information of the received content, and transmit the obtained feature information to an external server, receive, from the external server, identification information of the content, the identification information being obtained based on the transmitted feature information, based on a predetermined mode of the display being turned on according to a signal that is received from the source apparatus via the communication interface, obtain information regarding a first time point when the predetermined mode is turned on, and based on the predetermined mode of the display being turned off, obtain information regarding a second time point when the predetermined mode is turned off after the first time point.
Optimization tools and techniques for audio and audiovisual content
A system is provided for modifying media content, including remastering audio or video components of the content. The system may include an application programming interface programmed for communicating selected mastering profiles to a user interface; applying the mastering profiles to uploaded original media content files; separating the uploaded media content files into their audio and video component portions; applying the selected mastering profiles to at least the audio component portion to generate a remastered audio component portion; and, combining the video component portions with the remastered audio component portions to generate modified media content files. The system may also include a user interface programmed for selecting original media content files for electronic upload from an access device; displaying multiple available mastering profiles in connection with the selected media content files; and receiving and previewing the modified media content files.
System and method for identifying potential commercial breaks in a video data stream by detecting absence of identified persons associated with program type content in the video data stream
Ad break boundaries, which identify start and end points of a commercial break, are identified by detecting the absence of voices and/or facial images of one or more primary cast members of a currently scheduled program.
System for the Reproduction of a Multimedia Content Using an Alternative Network if Poor Quality in First Network
A method, an apparatus and a system for the reproduction of a specific multimedia content are provided. The method includes the step of acquiring one or more samples of a specific multimedia content reproduced by a mobile multimedia radio/television apparatus, where the specific multimedia content is included in a first signal emitted by a broadcast-type radio/television network, including the further step of determining an identifier of the specific multimedia content and the value of a quality parameter of the first signal. According to the value of the quality parameter of the first signal, further steps include receiving on a multimedia telephone apparatus the specific multimedia content through a second signal emitted by a cellular telephone network, sending the specific multimedia content to one or more multimedia reproduction units included in the mobile multimedia radio/television apparatus.
DYNAMICALLY ENHANCING A VIDEO BY AUTOMATICALLY GENERATING AND ADDING AN OVERLAY WINDOW
A computer-implemented method for enhancing a video is provided. The method may include generating an annotation matrix comprising extracted video content associated with a video. The method may further include generating a viewer feedback matrix comprising extracted and aggregated viewer feedback, wherein the aggregated viewer feedback comprises a plurality of comments and viewer actions associated with the video, and wherein the plurality of comments appears as text that is located separate from a main window for playing the video. The method may further include generating an overlay matrix by merging the viewer feedback matrix and the annotation matrix. The method may further include generating at least one overlay window for overlaying in the main window of the video at the particular point in time during a playing of the video, wherein the at least one overlay window includes textual information generated from the aggregated viewer feedback.
METHODS AND USER INTERFACES FOR AUDITORY FEATURES
The present disclosure generally relates to providing auditory features. An example method includes, at a computer system in communication with one or more input devices: while playing an audio media item of a first type, receiving, via the one or more input devices, a request to play an audio media item of a second type; in accordance with a determination that a set of concurrent audio criteria is satisfied, concurrently playing: the audio media item of the first type; and the audio media item of the second type; and in accordance with a determination that the set of concurrent audio criteria is not satisfied: ceasing to play the audio media item of the first type; and playing the audio media item of the second type.
Assistance during audio and video calls
Implementations relate to providing information items for display during a communication session. In some implementations, a computer-implemented method includes receiving, during a communication session between a first computing device and a second computing device, first media content from the communication session. The method further includes determining a first information item for display in the communication session based at least in part on the first media content. The method further includes sending a first command to at least one of the first computing device and the second computing device to display the first information item.
Systems and methods for displaying a context image for a multimedia asset
Systems and methods for displaying a context image for a multimedia asset are disclosed. In one embodiment, a system includes a programmable processor, and a display device. In some embodiments, the programmable processor is configured to identify a first multimedia asset being broadcast in a region, determine and retrieve a first context image associated with the first multimedia asset, and direct the display device to display the first context image during the broadcast of the first multimedia asset.
Signaling loudness adjustment for an audio scene
Aspects of the disclosure include methods, apparatuses, and non-transitory computer-readable storage mediums for loudness adjustment for an audio scene associated with an MPEG-I immersive audio stream. One apparatus includes processing circuitry that receives a first syntax element indicating a number of sound signals included in the audio scene. The processing circuitry determines whether one or more speech signals are included in the sound signals indicated by the first syntax element. The processing circuitry determines a reference speech signal from the one or more speech signals based on the one or more speech signals being included in the sound signals. The processing circuitry adjusts a loudness level of the reference speech signal of the audio scene based on an anchor speech signal. The processing circuitry adjusts loudness levels of the sound signals based on the adjusted loudness level of the reference speech signal.