Patent classifications
H04N21/44
Panorama video editing method, apparatus, device and storage medium
The present invention relates to the technical field of panoramic video and provides a panoramic video editing method, apparatus, and device, and a storage medium. The method comprises: acquiring a panoramic video captured by a panoramic camera, and recording the advancing-direction viewing angle of the panoramic camera during moving capture; extracting frames from the acquired panoramic video to obtain corresponding panoramic video frames, performing salient target detection on the panoramic video frames, tracking each detected salient target with a preset target tracking algorithm, and acquiring the viewing angle at which the tracked salient target is located; and editing the panoramic video according to the advancing-direction viewing angle and the viewing angle at which the salient target is located, so as to generate a target video corresponding to the panoramic video. The panoramic video is thereby edited automatically, while transitions remain smooth and the content of the target video remains relevant and interesting.
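The combination of the two viewing angles can be illustrated with a minimal sketch. It is not the patented method: it assumes, purely for illustration, that each frame has a yaw angle in degrees for the advancing direction and an optional yaw for the tracked target, that the target angle is preferred whenever one is available, and that smooth transitions are approximated by limiting how far the view may turn per frame. The names `wrap_deg`, `plan_view_angles`, and `max_step` are hypothetical.

```python
from typing import List, Optional


def wrap_deg(angle: float) -> float:
    """Wrap an angle in degrees into the range [-180, 180)."""
    return (angle + 180.0) % 360.0 - 180.0


def plan_view_angles(
    advance: List[float],
    targets: List[Optional[float]],
    max_step: float = 5.0,
) -> List[float]:
    """Pick one viewing angle per frame.

    Assumption (not from the abstract): follow the tracked target when it
    has an angle for the frame, otherwise follow the advancing direction,
    and never turn more than max_step degrees between consecutive frames.
    """
    if not advance:
        return []
    view: List[float] = []
    current = advance[0]
    for forward, target in zip(advance, targets):
        goal = target if target is not None else forward
        # Shortest signed turn from the current view to the goal.
        delta = wrap_deg(goal - current)
        delta = max(-max_step, min(max_step, delta))
        current = wrap_deg(current + delta)
        view.append(current)
    return view
```

With three frames whose advancing direction is 0 degrees and a target appearing at 20 degrees from the second frame on, the planned view turns gradually toward the target rather than jumping to it.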
VIEWING TERMINAL, VIEWING METHOD, VIEWING SYSTEM, AND PROGRAM
A student terminal is for viewing a class given in a virtual space that is immersive. The student terminal includes: a VR function section configured to display the virtual space according to virtual space information; and an input section for receiving a video capturing a desk of a student who views the class. The VR function section extracts, from the video, an area including a top plate of the desk corresponding to a desk object in the virtual space, and performs image composition for fitting a video capturing the area onto a top plate of the desk object.
STREAM REPAIR MEMORY MANAGEMENT
Techniques are described for expanding and/or improving the Advanced Television Systems Committee (ATSC) 3.0 television protocol in robustly delivering the next generation broadcast television services. Multiple memory buffers are used to manage broadcast packet repair and presentation or storage.
Selection of a prerecorded media file for superimposing into a video
In a method for selecting a prerecorded media file for superimposing into a video, a video of a scene is displayed on a display device of a mobile electronic device. A location of the scene is determined. A prerecorded video file is selected based at least in part on the location. The prerecorded video file is superimposed over the video, such that the video is partially obscured by the prerecorded video file. The prerecorded video file is played while displaying the video, such that the prerecorded video file and a non-obscured portion of the video are rendered simultaneously.
System and method for operating a transmission network
Various embodiments are described herein for systems and methods that can be used to operate a media transmission network. In at least one embodiment, the media transmission network comprises a plurality of media processing devices configured to receive and process media streams based on control data. The media transmission network also comprises a controller coupled to the plurality of media processing devices and configured to generate a control signal for some or all of the media processing devices in the network. The controller is configured to determine the timing at which to transmit the control signal to a respective media processing device in order for the instructions in the control signal to be executed at the same time as the media data is received. The controller determines the transmission timing of each control signal by determining the latencies and delays of the network and the devices, such as, for example, network latency, processing delay, and/or control delay.
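The timing idea can be sketched as simple arithmetic. The abstract does not say how the individual delays are combined, so the following assumes, for illustration only, that the controller subtracts the sum of the measured delays from the time at which the media data is expected at the device. The names `DeviceDelays` and `control_send_time` are hypothetical, and all times are integer milliseconds.

```python
from dataclasses import dataclass


@dataclass
class DeviceDelays:
    """Measured delays for one media processing device, in milliseconds."""
    network_latency: int   # controller-to-device transit time
    processing_delay: int  # time the device needs to act on an instruction
    control_delay: int     # time to parse and queue the control signal


def control_send_time(media_arrival_ms: int, delays: DeviceDelays) -> int:
    """Return when the controller should transmit the control signal.

    Assumption (not from the abstract): the delays are additive, so the
    signal is sent early by their sum in order for its instructions to
    take effect when the media data arrives.
    """
    total = (
        delays.network_latency
        + delays.processing_delay
        + delays.control_delay
    )
    return media_arrival_ms - total
```

For media expected at t = 1000 ms and delays of 20, 5, and 3 ms, this sketch would send the control signal at t = 972 ms.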
Systems and methods for automatic mixing of media
A first device includes one or more processors and memory storing one or more programs configured to be executed by the one or more processors. The one or more programs include instructions for receiving, from a second device, audio mix information for a first audio item and receiving, from the second device, an indication that the first audio item is to be mixed with a second audio item distinct from the first audio item. In response to the indication, the one or more programs include instructions for transmitting to the second device an audio stream including the first audio item and the second audio item mixed in accordance with the audio mix information.
Systems and methods to enhance interactive engagement with shared content by a contextual virtual agent
Systems and methods are described to enhance interactive engagement during simultaneous delivery of serial or digital content (e.g., audio, video) to a plurality of users. A machine-based awareness of the context of the content and/or one or more user reactions to the presentation of the content may be used as a basis to interrupt content delivery in order to intersperse a snippet that includes a virtual agent with an awareness of the context(s) of the content and/or the one or more user reactions. This “contextual virtual agent” (CVA) enacts actions and/or dialog based on the one or more machine-classified contexts coupled with identified interests and/or aspirations of individuals within the group of users. The CVA may also base its activities on a machine-based awareness of “future” content that has not yet been delivered to the group, but classified by natural language and/or computer vision processing. Interrupting the delivery of content substantially simultaneously to a group of users and initiating dialog regarding content by a CVA enhances opportunities for users to engage with each other about their shared interactive experience.
LIVE STREAMING VIDEO INTERACTION METHOD AND APPARATUS, AND COMPUTER DEVICE
The present application discloses techniques for interacting with live videos. The techniques comprise obtaining a streaming video of a live streamer and images of a user captured in real time by a user terminal, and displaying the streaming video and the images of the user in the same video play box; obtaining and recognizing a first gesture of the user in the images of the user, and comparing the first gesture with a second gesture included in a preset table, wherein the preset table comprises information indicating correspondences between gestures and special effects; obtaining a first special effect corresponding to the second gesture by querying the preset table when the first gesture matches the second gesture; and displaying the first special effect in the video play box.
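The preset table lookup can be sketched as a plain mapping. Gesture recognition itself is out of scope here: the sketch assumes it has already produced a gesture label as a string. The table contents and the names `PRESET_TABLE` and `effect_for_gesture` are hypothetical and chosen only for illustration.

```python
from typing import Dict, Optional

# Hypothetical preset table: gesture label -> special effect name.
PRESET_TABLE: Dict[str, str] = {
    "heart": "floating_hearts",
    "thumbs_up": "applause",
    "wave": "confetti",
}


def effect_for_gesture(
    recognized_gesture: str,
    table: Dict[str, str] = PRESET_TABLE,
) -> Optional[str]:
    """Return the effect paired with the recognized gesture, if any.

    A recognized gesture "matches" when its label is a key of the table;
    an unmatched gesture yields None, so no effect is displayed.
    """
    return table.get(recognized_gesture)
```

A caller would display the returned effect in the video play box and do nothing when the result is `None`.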
Methods and apparatus to reduce false positive signature matches due to similar media segments in different reference media assets
Methods, apparatus, systems and articles of manufacture to reduce false positive signature matches due to similar media segments in different reference media assets are disclosed. Example apparatus disclosed herein include a signature matcher to compare monitored media signatures with a library of reference media signatures, the monitored media signatures associated with monitored media, the library of reference media signatures including sequences of reference signatures associated with respective reference media assets. Disclosed example apparatus also include a match information identifier to identify a number of different matched reference media assets associated with ones of the sequences of reference media signatures that match a sequence of matched monitored media signatures. Disclosed example apparatus further include a false positive identifier to, in response to the number of different matched reference media assets satisfying a threshold number, eliminate one or more of the matched reference media assets from being credited to the monitored media.
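The thresholding step can be sketched as follows. The abstract leaves open which matched assets are eliminated and what "satisfying a threshold" means, so this sketch assumes, for illustration only, that the threshold is satisfied when the count of distinct matched assets is greater than or equal to it, and that all matched assets are then dropped from crediting. The name `creditable_assets` is hypothetical.

```python
from typing import Iterable, Set


def creditable_assets(matched_asset_ids: Iterable[str], threshold: int) -> Set[str]:
    """Return the reference assets that may be credited for one matched
    sequence of monitored signatures.

    Assumption (not from the abstract): when the same monitored sequence
    matches at least `threshold` different reference assets, the segment
    is treated as too common to identify any asset, and all are eliminated.
    """
    distinct = set(matched_asset_ids)
    if len(distinct) >= threshold:
        return set()
    return distinct
```

For example, a shared theme-song segment that matches many different episodes would exceed the threshold and be credited to none of them, while a segment matching a single asset would still be credited.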