H04N21/4666

Video playback device and control method thereof

Provided are an artificial intelligence (AI) system that mimics cognitive functions, such as cognition and judgment, of the human brain using a machine learning algorithm such as deep learning and applications thereof. More particularly, provided is a device including a memory storing at least one program and a first video, a display, and at least one processor configured to display the first video on at least one portion of the display by executing the at least one program, wherein the at least one program includes instructions for: comparing an aspect ratio of the first video with an aspect ratio of an area in which the first video is to be displayed, generating a second video corresponding to the aspect ratio of the area by using the first video when the aspect ratio of the first video is different from the aspect ratio of the area, and displaying the second video in the area, wherein the generating of the second video is performed by inputting at least one frame of the first video to an AI neural network.
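The comparison step in the abstract above can be illustrated with a minimal Python sketch. It covers only the aspect-ratio check and the size a second video would need. The neural network that generates the new frames is not modelled, and the function names and the choice to keep the original height are assumptions made for illustration, not details from the patent.

```python
from fractions import Fraction


def aspect_ratio(width: int, height: int) -> Fraction:
    """Return the exact width:height ratio as a reduced fraction."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    return Fraction(width, height)


def needs_second_video(video_size: tuple[int, int], area_size: tuple[int, int]) -> bool:
    """True when the video's aspect ratio differs from the display area's."""
    return aspect_ratio(*video_size) != aspect_ratio(*area_size)


def target_size(video_size: tuple[int, int], area_size: tuple[int, int]) -> tuple[int, int]:
    """Size of a second video that keeps the source height (an assumption)
    and matches the display area's aspect ratio."""
    _, video_height = video_size
    ratio = aspect_ratio(*area_size)
    return (round(video_height * ratio), video_height)


if __name__ == "__main__":
    video, area = (1920, 1080), (2160, 1080)  # 16:9 video, 18:9 area
    if needs_second_video(video, area):
        print("generate second video at", target_size(video, area))
```

Using exact fractions instead of floating-point division avoids treating 1920x1080 and 3840x2160 as different ratios because of rounding.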

Message push method and apparatus, device, and storage medium

A message push method includes playing a first video in a play interface of a terminal according to first video data transmitted by a server. The method further includes displaying, by processing circuitry of the terminal, a scenario interaction interface at a preset playback time of the first video, the scenario interaction interface being set according to display content of an electronic device shown in the first video. Next, the method includes obtaining interaction information input based on the displayed scenario interaction interface, and obtaining a target message related to the obtained interaction information. Finally, the method includes outputting the obtained target message by the processing circuitry of the terminal.

METHOD AND APPARATUS FOR TRANSMITTING ADAPTIVE VIDEO IN REAL TIME USING CONTENT-AWARE NEURAL NETWORK

A method and apparatus for transmitting adaptive video in real time using a content-aware neural network are disclosed. At least one embodiment provides a method performed by a server for transmitting an adaptive video in real time by using content-aware deep neural networks (DNNs), including downloading a video, encoding a downloaded video for each of at least one resolution, dividing an encoded video into video chunks of a predetermined size, training the content-aware DNNs by using encoded video, generating a configuration or manifest file containing information on trained content-aware DNNs and information on the encoded video, and transmitting the configuration file upon a request of a client.
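The server-side steps listed above (chunking the encoded video and describing the encodings and trained DNNs in a manifest) might be sketched as follows. This is an illustrative sketch only: the manifest field names, the time-based chunking, and the example URL are hypothetical, and encoding and DNN training are not shown.

```python
import json


def chunk_ranges(duration_s: int, chunk_s: int) -> list[tuple[int, int]]:
    """Split [0, duration_s) into chunks of chunk_s seconds; the last may be shorter."""
    if chunk_s <= 0:
        raise ValueError("chunk_s must be positive")
    return [
        (start, min(start + chunk_s, duration_s))
        for start in range(0, duration_s, chunk_s)
    ]


def build_manifest(video_id: str, duration_s: int, chunk_s: int,
                   heights: list[int], dnn_urls: dict[str, str]) -> dict:
    """Assemble a manifest describing chunks, encodings, and trained DNNs.

    All keys are hypothetical; the source does not specify a manifest schema.
    """
    return {
        "video_id": video_id,
        "chunks": [{"start": s, "end": e} for s, e in chunk_ranges(duration_s, chunk_s)],
        "encodings": [{"height": h} for h in heights],
        "dnns": [{"name": name, "url": url} for name, url in dnn_urls.items()],
    }


if __name__ == "__main__":
    manifest = build_manifest(
        "demo", 10, 4, [240, 480, 1080],
        {"low": "https://example.com/dnn/low.bin"},  # placeholder URL
    )
    print(json.dumps(manifest, indent=2))
```

A client requesting the manifest would then know which chunks, resolutions, and DNN models are available before it starts downloading.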

Measuring video-content viewing

A computer-implemented method uses video viewing activity data as input to an aggregation engine built on the Hadoop MapReduce framework, which calculates second-by-second video viewing activity aggregated to the analyst's choice of (a) geographic area, (b) video server, (c) video content (channel call sign, video program, etc.), or (d) viewer demographic, or any combination of these fields, for each second of the day represented in the video viewing activity data. The engine also calculates overall viewing for use as a denominator in calculations. The source data may be extracted from a database defined according to the Cable Television Laboratories, Inc. Media Measurement Data Model defined in “Audience Data Measurement Specification” as “OpenCable™ Specifications, Audience Measurement, Audience Measurement Data Specification” document OC-SP-AMD-101-130502 or any similar format. These metrics provide detailed data needed to calculate information on customer viewing behavior that can drive business decisions for service providers, advertisers, and content producers.
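The second-by-second aggregation described above can be sketched in plain Python. This is not the Hadoop MapReduce implementation: it is a single-process illustration of the same map (emit one count per viewed second) and reduce (sum per key) idea. The record layout, field names, and end-exclusive intervals are assumptions.

```python
from collections import Counter


def aggregate(records: list[dict], key_fields: tuple[str, ...]) -> tuple[Counter, Counter]:
    """Count viewers per (chosen key, second) and overall viewers per second.

    Each record is assumed to have integer "start" and "end" seconds of the
    day, with "end" exclusive, plus the fields named in key_fields.
    """
    per_key = Counter()   # (key tuple, second) -> viewing count
    overall = Counter()   # second -> total viewing count (the denominator)
    for rec in records:
        key = tuple(rec[field] for field in key_fields)
        for second in range(rec["start"], rec["end"]):
            per_key[(key, second)] += 1
            overall[second] += 1
    return per_key, overall


if __name__ == "__main__":
    records = [
        {"geo": "A", "channel": "X", "start": 0, "end": 3},
        {"geo": "A", "channel": "Y", "start": 1, "end": 2},
        {"geo": "B", "channel": "X", "start": 2, "end": 4},
    ]
    per_key, overall = aggregate(records, ("channel",))
    share = per_key[(("X",), 1)] / overall[1]
    print(f"channel X share at second 1: {share:.2f}")
```

Passing a different `key_fields` tuple, for example `("geo", "channel")`, changes the aggregation level without changing the counting logic.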

Visible indicator for importance of audio

In one aspect, a device includes a processor, a video display accessible to the processor, and storage accessible to the processor and comprising instructions executable by the processor to present an indication on the video display of an importance of audio to video relative to effective user consumption.

Method and terminal for providing content

According to the present disclosure, an artificial intelligence (AI) system and a method of providing content according to an application of the AI system are provided. The method includes: obtaining one or more images included in the content; generating additional content for guiding user information, the additional content corresponding to the one or more images, based on feature information extracted from the one or more images; when receiving a request for reproducing the content, synchronizing the generated additional content with the one or more images; and reproducing the content and the additional content, according to a result of the synchronizing.

Supplementing Entertainment Content with Ambient Lighting
20210400227 · 2021-12-23

According to one implementation, a system for supplementing entertainment content with ambient lighting includes a computing platform having a hardware processor and a memory storing a software code. The hardware processor is configured to execute the software code to receive an entertainment content, detect one or more attributes of the entertainment content that correspond to an artistic intent of a producer of the entertainment content, and interpret the artistic intent of the producer of the entertainment content using the detected one or more attributes. The hardware processor is further configured to execute the software code to compose an ambient lighting routine as a supplement to the entertainment content based on the interpreted artistic intent.

Language agnostic automated voice activity detection

Systems, methods, and computer-readable media are disclosed for language agnostic automated voice activity detection. Example methods may include determining an audio file associated with video content and generating a plurality of audio segments using the audio file, the plurality of audio segments including a first segment and a second segment, where the first segment and the second segment are consecutive segments. Example methods may include determining, using a Gated Recurrent Unit neural network, that the first segment includes first voice activity, determining, using the Gated Recurrent Unit neural network, that the second segment includes second voice activity, and determining that voice activity is present between a first timestamp associated with the first segment and a second timestamp associated with the second segment.
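The last step above, inferring a continuous voice interval from consecutive voiced segments, can be sketched as follows. The Gated Recurrent Unit classifier is not implemented here; its per-segment decision is assumed to arrive as a boolean. The tuple layout and millisecond timestamps are illustrative assumptions.

```python
def merge_voiced(segments: list[tuple[int, int, bool]]) -> list[tuple[int, int]]:
    """Merge consecutive voiced segments into (start_ms, end_ms) intervals.

    segments: ordered (start_ms, end_ms, has_voice) tuples, where has_voice
    stands in for the classifier's output on that segment.
    """
    intervals: list[tuple[int, int]] = []
    for start_ms, end_ms, has_voice in segments:
        if not has_voice:
            continue
        if intervals and intervals[-1][1] == start_ms:
            # This segment starts where the previous voiced one ended: extend it.
            intervals[-1] = (intervals[-1][0], end_ms)
        else:
            intervals.append((start_ms, end_ms))
    return intervals


if __name__ == "__main__":
    segments = [
        (0, 1000, True),
        (1000, 2000, True),
        (2000, 3000, False),
        (3000, 4000, True),
    ]
    print(merge_voiced(segments))
```

Integer milliseconds are used so that the adjacency check between segments does not depend on floating-point equality.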

Providing customized entertainment experience using human presence detection

Disclosed herein are system, method, and computer program product embodiments for the detection of human presence in front of a plurality of sensors such as those of speakers and a device with a processor, such as a television. Data gathered from the plurality of sensors may be analyzed by the processor to determine if one or more humans are present proximate to the device. Based on the determined presence or absence of one or more humans, further actions including, inter alia, customizing a home theatre experience for the one or more humans, making content recommendations, or activating parental controls can be taken by the device.

METHODS AND APPARATUS FOR MONITORING AN AUDIENCE OF MEDIA BASED ON THERMAL IMAGING

Methods, apparatus, systems, and articles of manufacture are disclosed. An example apparatus includes a thermal image detector to determine a heat blob count based on a frame of thermal image data, the frame of thermal image data captured in a media environment, a comparator to compare the heat blob count to a prompted people count, the prompted people count based on one or more responses to a prompting message, and when the heat blob count and the prompted people count match, cause a timer that is to trigger generation of the prompting message to be reset.
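The comparator and timer behaviour described above might look like the sketch below. Heat blob detection from thermal frames is out of scope, so the blob count is passed in directly. The class name, the countdown design, and the interval value are hypothetical.

```python
class PromptTimer:
    """Countdown that triggers a prompting message when it reaches zero."""

    def __init__(self, interval_s: int) -> None:
        self.interval_s = interval_s
        self.remaining_s = interval_s

    def tick(self, elapsed_s: int) -> bool:
        """Advance the timer; return True when a prompt should be generated."""
        self.remaining_s = max(0, self.remaining_s - elapsed_s)
        return self.remaining_s == 0

    def reset(self) -> None:
        """Restart the countdown from the full interval."""
        self.remaining_s = self.interval_s


def compare_counts(heat_blob_count: int, prompted_people_count: int,
                   timer: PromptTimer) -> bool:
    """Reset the prompt timer when the two counts match; report whether they did."""
    if heat_blob_count == prompted_people_count:
        timer.reset()
        return True
    return False


if __name__ == "__main__":
    timer = PromptTimer(interval_s=600)  # illustrative interval
    timer.tick(500)
    matched = compare_counts(heat_blob_count=3, prompted_people_count=3, timer=timer)
    print("matched:", matched, "remaining:", timer.remaining_s)
```

When the counts match, the audience is assumed unchanged and the next prompt is pushed back; when they differ, the timer keeps running toward the next prompt.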