Patent classifications
H04N21/234336
POLICY BASED TRANSCODING
Methods and systems are disclosed for providing video content in response to requests in a content delivery system with more speed and efficiency. In some aspects, network monitoring devices may gather content specific and network performance metrics, from user devices and content delivery components, to provide input to a computing device for deciding whether to store or delete different versions of the same or different items of content. The decision may be based on a policy which may include a weighted score based on a combination of usage and network efficiency scores. In other aspects, methods and systems are provided to initially provide to a user device a stored version of a content item, and then switch, as needed, to a different version of the content item using on-demand transcoding.
Cross-media storage coordination
Methods and a media system and storage system for cross-media storage coordination include but are not limited to storing a first data version of specified content based on a particular media format: storing at least a second data version of related content based on a different media format: providing a cross-reference between the first data version and the at least second data version to enable coordinated management by a designated user and/or an approved device for search and possible retrieval of the first data version and/or the at least second data version: and implementing communication access by one or more parties and/or the designated user via a communication type that is correlated with the first data version and/or the at least second data version.
METHOD AND SYSTEM OF PRESENTING MOVING IMAGES OR VIDEOS CORRESPONDING TO STILL IMAGES
The present application discloses a method of presenting moving images or videos corresponding to still images. The method includes: storing a still image and a moving image or video corresponding to the still image into a cloud storage; extracting feature points of the still image stored in the cloud storage, and storing the feature points in the cloud storage in a manner which associates the feature points with the still image; when a device obtains a first still image through scanning, extracting feature points from the first still image, comparing and judging whether the extracted feature points match feature points of each still image stored in the cloud storage to determine a second still image whose feature points match the feature points of the first still image; rendering a moving image or video corresponding to the second still image stored in the cloud storage at the position of the first still image. The present application can facilitate presenting a moving image corresponding to a still image, and increase the information and entertainment provided by a still image.
Image processing apparatus and method
There is provided an image processing apparatus and method allowing suppression of a decrease in coding efficiency. Coded data obtained by coding a captured image captured by a moving body with an image capturing section is transcoded on the basis of positional information indicating a position where the captured image has been generated. For example, the positional information includes at least one of GPS information indicating the position of the moving body or IMU information indicating movement of the moving body, and captured images are coded in frame images of a moving image on the basis of the information. For example, the present disclosure can be applied to an image processing apparatus, an image coding apparatus, a communication apparatus, an information processing apparatus, an image capturing apparatus, or the like.
Machine learning for recognizing and interpreting embedded information card content
Metadata for highlights of a video stream is extracted from card images embedded in the video stream. The highlights may be segments of a video stream, such as a broadcast of a sporting event, that are of particular interest to one or more users. Card images embedded in video frames of the video stream are identified and processed to extract text. The text characters may be recognized by applying a machine-learned model trained with a set of characters extracted from card images embedded in sports television programming contents. The training set of character vectors may be pre-processed to maximize metric distance between the training set members. The text may be interpreted to obtain the metadata. The metadata may be stored in association with the portion of the video stream. The metadata may provide information regarding the highlights, and may be presented concurrently with playback of the highlights.
Video feature extraction and video content understanding method, apparatus, storage medium and server
Provided are a video processing method and apparatus, a video retrieval method and apparatus, a medium, and a server. The video processing method includes: performing encoding and decoding on an original video by using the encoder and the decoder, to obtain a video feature of the original video and a hidden state of the original video at a decoding stage; reconstructing a video feature of a target video by using the reconstructor according to the hidden state of the original video at the decoding stage; obtaining a difference between the video feature of the target video and the video feature of the original video; and adjusting a processing parameter of at least one of the decoder and the reconstructor to reduce the difference between the video feature of the target video and the video feature of the original video.
Method and apparatus for commenting video
Embodiments of the present disclosure disclose a method and apparatus for commenting a video, and relate to the field of cloud computing. The method may include: acquiring content information of a to-be-processed video frame; constructing text description information based on the content information, the text description information being used to describe a content of the to-be-processed video frame; importing the text description information into a pre-trained text conversion model to obtain commentary text information corresponding to the text description information, the text conversion model being used to convert the text description information into the commentary text information; and converting the commentary text information into audio information.
REFERENCE OF NEURAL NETWORK MODEL FOR ADAPTATION OF 2D VIDEO FOR STREAMING TO HETEROGENEOUS CLIENT END-POINTS
A method, computer program, and computer system is provided for streaming immersive media. The method includes ingesting content in a two-dimensional format, the 2D format referencing at least one neural network; converting the ingested content to a three-dimensional format based on the referenced at least one neural network; and streaming the converted content to a client end-point.
Video production systems and methods
Live video streams are produced using a network server system. The network server system initially receives, via a network, one or more captured video streams that are live streams captured from a remotely-located camera, phone or the like. The captured streams are forwarded to a control device via the network to thereby permit a user of the control device to select one of the captured video streams for output to the video production stream. In response to a command received from the control device that indicates the selected capture stream, the selected capture video streams is encoded for output as the video production stream.
Generation of audience appropriate content
Multimedia content to be played on a multimedia player device can be received. Whether the multimedia content contains audience-inappropriate content can be determined. Replacement content corresponding to the audience-inappropriate content can be generated. The generated replacement content can be caused to play on the multimedia player device in lieu of the audience-inappropriate content.