H04N19/40

Policy based transcoding

Methods and systems are disclosed for providing video content in response to requests in a content delivery system with more speed and efficiency. In some aspects, network monitoring devices may gather content specific and network performance metrics, from user devices and content delivery components, to provide input to a computing device for deciding whether to store or delete different versions of the same or different items of content. The decision may be based on a policy which may include a weighted score based on a combination of usage and network efficiency scores. In other aspects, methods and systems are provided to initially provide to a user device a stored version of a content item, and then switch, as needed, to a different version of the content item using on-demand transcoding.

Re-encoding predicted picture frames in live video stream applications
11700419 · 2023-07-11 · ·

In various examples, a media stream may be received by a re-encode system that may leverage a recode engine to convert (e.g., at an interval, based on a request, etc.) an inter-frame associated with the media stream to an intra-frame. The intra-frame may be converted from the inter-frame using parameters or other information associated with and received with the media stream. The converted intra-frame may be merged into an updated segment of the media stream in place of the original inter-frame to enable storage of the updated segment—or a portion thereof—for later use.

Video streaming

A server for streaming a video to a client involves making the video available from the server to the client upon request in at least a temporal independent version and a temporal dependent version. The server is configured for: i) receiving a request from the client to receive a stream of the video from an arbitrary starting point in time; and ii) retrieving at least the first frame from the temporal independent version; and iii) retrieving frames subsequent to the at least first frame from the temporal dependent version; and iv) sending the at least first frame to the client and send the frames subsequent to the at least first frame to the client.

PERCEPTUAL LUMINANCE NONLINEARITY-BASED IMAGE DATA EXCHANGE ACROSS DIFFERENT DISPLAY CAPABILITIES

A handheld imaging device has a data receiver that is configured to receive reference encoded image data. The data includes reference code values, which are encoded by an external coding system. The reference code values represent reference gray levels, which are being selected using a reference grayscale display function that is based on perceptual non-linearity of human vision adapted at different light levels to spatial frequencies. The imaging device also has a data converter that is configured to access a code mapping between the reference code values and device-specific code values of the imaging device. The device-specific code values are configured to produce gray levels that are specific to the imaging device. Based on the code mapping, the data converter is configured to transcode the reference encoded image data into device-specific image data, which is encoded with the device-specific code values.

PERCEPTUAL LUMINANCE NONLINEARITY-BASED IMAGE DATA EXCHANGE ACROSS DIFFERENT DISPLAY CAPABILITIES

A handheld imaging device has a data receiver that is configured to receive reference encoded image data. The data includes reference code values, which are encoded by an external coding system. The reference code values represent reference gray levels, which are being selected using a reference grayscale display function that is based on perceptual non-linearity of human vision adapted at different light levels to spatial frequencies. The imaging device also has a data converter that is configured to access a code mapping between the reference code values and device-specific code values of the imaging device. The device-specific code values are configured to produce gray levels that are specific to the imaging device. Based on the code mapping, the data converter is configured to transcode the reference encoded image data into device-specific image data, which is encoded with the device-specific code values.

Systems and Techniques for Retraining Models for Video Quality Assessment and for Transcoding Using the Retrained Models

A trained model is retrained for video quality assessment and used to identify sets of adaptive compression parameters for transcoding user generated video content. Using transfer learning, the model, which is initially trained for image object detection, is retrained for technical content assessment and then again retrained for video quality assessment. The model is then deployed into a transcoding pipeline and used for transcoding an input video stream of user generated content. The transcoding pipeline may be structured in one of several ways. In one example, a secondary pathway for video content analysis using the model is introduced into the pipeline, which does not interfere with the ultimate output of the transcoding should there be a network or other issue. In another example, the model is introduced as a library within the existing pipeline, which would maintain a single pathway, but ultimately is not expected to introduce significant latency.

SYSTEM FOR PROVIDING CUSTOMIZED VIDEO PRODUCING SERVICE USING CLOUD-BASED VOICE COMBINING
20220415362 · 2022-12-29 ·

A system for providing a customized video producing service using a cloud based voice combination of the present invention comprises a customized video production service providing server including: a user terminal that is input and uploads utterance of a user by voice data, selects any one category among at least one type of category to select content including an image or a video, selects a subtitle or background music, and plays a customized video including the content, the uploaded voice data, and the subtitle or background music; a database unit classifying and storing text, image, video, and background music by the at least one type of category; an upload unit receiving the voice data corresponding to the utterance of the user uploaded from the user terminal; a conversion unit that converts the uploaded voice data into text data using STT (Speech to Text) and stores the converted text data; a provision unit that provides an image or video previously mapped and stored in the selected category to the user terminal when any one category among the at least one type of category is selected from the user terminal; a creation unit that creates the customized video including the content, the uploaded voice, and the subtitles or background music when receiving subtitle data or selection of background music from the user terminal by the user terminal's selection of the subtitle or background music.

SYSTEM FOR PROVIDING CUSTOMIZED VIDEO PRODUCING SERVICE USING CLOUD-BASED VOICE COMBINING
20220415362 · 2022-12-29 ·

A system for providing a customized video producing service using a cloud based voice combination of the present invention comprises a customized video production service providing server including: a user terminal that is input and uploads utterance of a user by voice data, selects any one category among at least one type of category to select content including an image or a video, selects a subtitle or background music, and plays a customized video including the content, the uploaded voice data, and the subtitle or background music; a database unit classifying and storing text, image, video, and background music by the at least one type of category; an upload unit receiving the voice data corresponding to the utterance of the user uploaded from the user terminal; a conversion unit that converts the uploaded voice data into text data using STT (Speech to Text) and stores the converted text data; a provision unit that provides an image or video previously mapped and stored in the selected category to the user terminal when any one category among the at least one type of category is selected from the user terminal; a creation unit that creates the customized video including the content, the uploaded voice, and the subtitles or background music when receiving subtitle data or selection of background music from the user terminal by the user terminal's selection of the subtitle or background music.

DEVICE WITH DYNAMIC TRANSCODE THROTTLING
20220417528 · 2022-12-29 ·

Dynamic transcode throttling methods and devices for processing resource management and thermal mitigation in electronic devices such as eyewear devices. An electronic device monitors its temperature and, responsive to the temperature, configures a transcoding service to operate at different rates. A frame delay module is configured to add a delay between read frames prior to the transcoding service. This enables the electronic device to consume less power when temperatures are too high in order to provide thermal mitigation and can be performed without powering down the electronic device.

DEVICE WITH DYNAMIC TRANSCODE THROTTLING
20220417528 · 2022-12-29 ·

Dynamic transcode throttling methods and devices for processing resource management and thermal mitigation in electronic devices such as eyewear devices. An electronic device monitors its temperature and, responsive to the temperature, configures a transcoding service to operate at different rates. A frame delay module is configured to add a delay between read frames prior to the transcoding service. This enables the electronic device to consume less power when temperatures are too high in order to provide thermal mitigation and can be performed without powering down the electronic device.