Patent classifications
H04N19/114
Picture/video coding supporting varying resolution and/or efficiently handling region-wise packing
Video/picture coding of improved coding efficiency with supporting varying resolution and/or efficiently handling region-wise packing.
VIDEO TRANSMISSION METHOD AND DEVICE
This application disclosed video transmission method and devices. An example method includes obtaining a first video and a second video, where the first video and the second video have same content, and image quality of the first video is lower than image quality of the second video. M first video frames and identifier information of N target frames based on the first video are obtained. Related frames corresponding to the N target frames are obtained from the second video based on the identifier information of the N target frames, where the target frames and the related frames have same identifier information but different image quality. The M first video frames and the N related frames are recorded to obtain a third video, where the third video is transmitted to a receiving device, and a data volume of the third video is less than a data volume of the second video.
VIDEO TRANSMISSION METHOD AND DEVICE
This application disclosed video transmission method and devices. An example method includes obtaining a first video and a second video, where the first video and the second video have same content, and image quality of the first video is lower than image quality of the second video. M first video frames and identifier information of N target frames based on the first video are obtained. Related frames corresponding to the N target frames are obtained from the second video based on the identifier information of the N target frames, where the target frames and the related frames have same identifier information but different image quality. The M first video frames and the N related frames are recorded to obtain a third video, where the third video is transmitted to a receiving device, and a data volume of the third video is less than a data volume of the second video.
Data-driven event detection for compressed video
A system can obtain a labelled data set, including historic video data and labelled events. The system can divide the labelled data set into historic training/testing data sets. The system can determine, using the historic training data set, a plurality of different parameter configurations to be used by a video encoder to encode a video that includes a plurality of video frames. Each parameter configuration can include a group of pictures (“GOP”) size and a scenecut threshold. The system can calculate an accuracy of event detection (“ACC”) and a filtering rate (“FR”) for each parameter configuration. The system can calculate, for each parameter configuration of the plurality of different parameter configurations, a harmonic mean between the ACC and the FR. The system can then select a best parameter configuration of the plurality of different parameter configurations based upon the parameter configuration that has the highest harmonic mean.
Data-driven event detection for compressed video
A system can obtain a labelled data set, including historic video data and labelled events. The system can divide the labelled data set into historic training/testing data sets. The system can determine, using the historic training data set, a plurality of different parameter configurations to be used by a video encoder to encode a video that includes a plurality of video frames. Each parameter configuration can include a group of pictures (“GOP”) size and a scenecut threshold. The system can calculate an accuracy of event detection (“ACC”) and a filtering rate (“FR”) for each parameter configuration. The system can calculate, for each parameter configuration of the plurality of different parameter configurations, a harmonic mean between the ACC and the FR. The system can then select a best parameter configuration of the plurality of different parameter configurations based upon the parameter configuration that has the highest harmonic mean.
Method and apparatus for coding video, device and medium
A method and apparatus for coding a video, device and medium are provided. An implementation of the method include: determining a first video frame structure and a second video frame structure based on a pre-set threshold for a B-frame number; determining a target video frame structure based on the first video frame structure, the second video frame structure, and a pre-set condition; and coding video frames in a to-be-coded video frame sequence according to the target video frame structure.
Method and apparatus for coding video, device and medium
A method and apparatus for coding a video, device and medium are provided. An implementation of the method include: determining a first video frame structure and a second video frame structure based on a pre-set threshold for a B-frame number; determining a target video frame structure based on the first video frame structure, the second video frame structure, and a pre-set condition; and coding video frames in a to-be-coded video frame sequence according to the target video frame structure.
Processing Media By Adaptive Group of Pictures Structuring
A spatial complexity and a temporal complexity associated with one or more frames of media content may be determined. Based on the spatial complexity and the temporal complexity of the media content, a Group of Picture (GOP) size for the one or more frames of the media content may be determined. The GOP size may be inversely proportional to the spatial complexity and the temporal complexity of the one or more frames of media content. Certain frames of the media content may be arranged in a different GOP size as compared to one or more other frames of the media content. By varying the GOP size of the plurality of frames of the media content, the bitrate required to transmit the media content may be decreased without decreasing or substantially decreasing the overall quality of the media content.
Processing Media By Adaptive Group of Pictures Structuring
A spatial complexity and a temporal complexity associated with one or more frames of media content may be determined. Based on the spatial complexity and the temporal complexity of the media content, a Group of Picture (GOP) size for the one or more frames of the media content may be determined. The GOP size may be inversely proportional to the spatial complexity and the temporal complexity of the one or more frames of media content. Certain frames of the media content may be arranged in a different GOP size as compared to one or more other frames of the media content. By varying the GOP size of the plurality of frames of the media content, the bitrate required to transmit the media content may be decreased without decreasing or substantially decreasing the overall quality of the media content.
Processing media by adaptive group of pictures structuring
A spatial complexity and a temporal complexity associated with one or more frames of media content may be determined. Based on the spatial complexity and the temporal complexity of the media content, a Group of Picture (GOP) size for the one or more frames of the media content may be determined. The GOP size may be inversely proportional to the spatial complexity and the temporal complexity of the one or more frames of media content. Certain frames of the media content may be arranged in a different GOP size as compared to one or more other frames of the media content. By varying the GOP size of the plurality of frames of the media content, the bitrate required to transmit the media content may be decreased without decreasing or substantially decreasing the overall quality of the media content.