Patent classifications
H04N19/40
Automatic data format detection
Systems, apparatuses, and methods for implementing automatic data format detection techniques are disclosed. A graphics engine receives data of indeterminate format and the graphics engine predicts an organization of the data. As part of the prediction, the graphics engine predicts the pixel depth (i.e., bytes per pixel (BPP)) and format separately. The graphics engine folds the data along pixel and channel boundaries to help in determining the pixel depth and format. The graphics engine scores modes against each other to generate different predictions for different formats. Then, the graphics engine generates scores for the predictions to determine which mode has a highest correlation with the input data. Next, the graphics engine chooses the format which attains the best score among the scores that were generated for the different modes. Then, the graphics engine compresses the unknown data using the chosen format with the best score.
METHOD, SYSTEM AND SERVER FOR LIVE STREAMING AUDIO-VIDEO FILE
A method for live streaming an audio-video file is disclosed, in which an original audio-video file is obtained; an audio frame and a video frame are read from the original audio-video file; the video frame is transcoded into video frames with different code rates; the video frames with different code rates are synthesized respectively with the audio frame into audio-video files with different code rates; the audio frames and the video frames are extracted from the audio-video files with different code rates respectively to form respective video streams; and different video streams are pushed.
METHOD, SYSTEM AND SERVER FOR LIVE STREAMING AUDIO-VIDEO FILE
A method for live streaming an audio-video file is disclosed, in which an original audio-video file is obtained; an audio frame and a video frame are read from the original audio-video file; the video frame is transcoded into video frames with different code rates; the video frames with different code rates are synthesized respectively with the audio frame into audio-video files with different code rates; the audio frames and the video frames are extracted from the audio-video files with different code rates respectively to form respective video streams; and different video streams are pushed.
CONTEXT DETERMINATION FOR MATRIX-BASED INTRA PREDICTION
Devices, systems and methods for digital video coding, which includes matrix-based intra prediction methods for video coding, are described. In a representative aspect, a method for video processing includes performing a first determination whether a luma video block of a video is coded using a matrix based intra prediction (MIP) mode; performing a second determination that the luma video block is applicable for determining a chroma intra mode for a current chroma video block of the video; performing, based on the first determination and the second determination, a third determination about the chroma intra mode to be used for the current chroma video block; and performing, based on the third determination, a conversion between the current chroma video block and a bitstream representation of the current chroma video block.
CONTEXT DETERMINATION FOR MATRIX-BASED INTRA PREDICTION
Devices, systems and methods for digital video coding, which includes matrix-based intra prediction methods for video coding, are described. In a representative aspect, a method for video processing includes performing a first determination whether a luma video block of a video is coded using a matrix based intra prediction (MIP) mode; performing a second determination that the luma video block is applicable for determining a chroma intra mode for a current chroma video block of the video; performing, based on the first determination and the second determination, a third determination about the chroma intra mode to be used for the current chroma video block; and performing, based on the third determination, a conversion between the current chroma video block and a bitstream representation of the current chroma video block.
METHODS AND APPARATUS FOR MAXIMIZING CODEC BANDWIDTH IN VIDEO APPLICATIONS
Methods and apparatus for processing of video content to optimize codec bandwidth. In one embodiment, the method includes capturing panoramic imaging content (e.g., a 360° panorama), mapping the panoramic imaging content into an equi-angular cubemap (EAC) format, and splitting the EAC format into segments for transmission to maximize codec bandwidth. In one exemplary embodiment, the EAC segments are transmitted at a different frame rate than the subsequent display rate of the panoramic imaging content. For example, the mapping and frame rate may be chosen to enable the rendering of 8K, 360° content at 24 fps, using commodity encoder hardware and software that nominally supports 4K content at 60 fps.
METHODS AND APPARATUS FOR MAXIMIZING CODEC BANDWIDTH IN VIDEO APPLICATIONS
Methods and apparatus for processing of video content to optimize codec bandwidth. In one embodiment, the method includes capturing panoramic imaging content (e.g., a 360° panorama), mapping the panoramic imaging content into an equi-angular cubemap (EAC) format, and splitting the EAC format into segments for transmission to maximize codec bandwidth. In one exemplary embodiment, the EAC segments are transmitted at a different frame rate than the subsequent display rate of the panoramic imaging content. For example, the mapping and frame rate may be chosen to enable the rendering of 8K, 360° content at 24 fps, using commodity encoder hardware and software that nominally supports 4K content at 60 fps.
EVENT/OBJECT-OF-INTEREST CENTRIC TIMELAPSE VIDEO GENERATION ON CAMERA DEVICE WITH THE ASSISTANCE OF NEURAL NETWORK INPUT
An apparatus including an interface and a processor. The interface may be configured to receive pixel data generated by a capture device. The processor may be configured to generate video frames in response to the pixel data, perform computer vision operations on the video frames to detect objects, perform a classification of the objects detected based on characteristics of the objects, determine whether the classification of the objects corresponds to a user-defined event and generate encoded video frames from the video frames. The encoded video frames may be communicated to a cloud storage service. The encoded video frames may comprise a first sample of the video frames selected at a first rate when the user-defined event is not detected and a second sample of the video frames selected at a second rate while the user-defined event is detected. The second rate may be greater than the first rate.
EVENT/OBJECT-OF-INTEREST CENTRIC TIMELAPSE VIDEO GENERATION ON CAMERA DEVICE WITH THE ASSISTANCE OF NEURAL NETWORK INPUT
An apparatus including an interface and a processor. The interface may be configured to receive pixel data generated by a capture device. The processor may be configured to generate video frames in response to the pixel data, perform computer vision operations on the video frames to detect objects, perform a classification of the objects detected based on characteristics of the objects, determine whether the classification of the objects corresponds to a user-defined event and generate encoded video frames from the video frames. The encoded video frames may be communicated to a cloud storage service. The encoded video frames may comprise a first sample of the video frames selected at a first rate when the user-defined event is not detected and a second sample of the video frames selected at a second rate while the user-defined event is detected. The second rate may be greater than the first rate.
COLOR CONVERSION BETWEEN COLOR SPACES USING REDUCED DIMENSION EMBEDDINGS
Exemplary embodiments may provide an approach to converting multidimensional color data for an image encoded in a first color space into an intermediate form that is a single dimensional value. The exemplary embodiments may then decode the intermediate form value to produce an encoding of the color data that is encoded in a second color space that differs from the first color space. In this manner, the data for the image may be efficiently converted from an encoding in the first color space into an encoding in the second color space.