H04N19/115

Systems and Methods for Encoding and Streaming Video Encoded Using a Plurality of Maximum Bitrate Levels
20230224519 · 2023-07-13 · ·

Systems and methods for streaming and playing back video having a variety of resolutions, frame rates, and/or sample aspect ratios, where the video streams are encoded at one of a number of maximum bit rate levels, in accordance with embodiments of the invention are disclosed. One embodiment includes a processor, and storage containing data relating combinations of resolution and frame rates to maximum bitrates, where a plurality of resolution and frame rates that are related to the same maximum bitrate form a maximum bitrate level. In addition, an encoding application configures the processor to encode a video stream as a plurality of video streams having different resolutions and frame rates, where the target maximum bitrate used during the encoding is selected based upon the maximum bitrate levels of the resolution and frame rate combinations indicated within the data relating combinations of resolution and frame rates to maximum bitrates.

Image processing method
11557185 · 2023-01-17 · ·

An image processing method is provided. The method includes acquiring a video. The method includes using an object detection engine to detect a person in the video. The object detection engine is integrated with an image signal processing pipeline. The method includes transmitting the video over a network. The method includes determining that the detected person has moved less than a pre-set distance. The method includes, responsive to the determining, pausing transmission of the video. An embedded image processor including an object detection engine is also provided.

Image processing method
11557185 · 2023-01-17 · ·

An image processing method is provided. The method includes acquiring a video. The method includes using an object detection engine to detect a person in the video. The object detection engine is integrated with an image signal processing pipeline. The method includes transmitting the video over a network. The method includes determining that the detected person has moved less than a pre-set distance. The method includes, responsive to the determining, pausing transmission of the video. An embedded image processor including an object detection engine is also provided.

Split Rendering To Improve Tolerance To Delay Variation In Extended Reality Applications With Remote Rendering
20230217034 · 2023-07-06 ·

An improved split rendering process of the present disclosure mitigates the impact of delay variation in XR application with remote rendering applications. A visual scene is split rendered to generate graphic layers from 3D objects in the visual scene. The server node groups and sorts the graphic layers based on QoE importance to create graphic layer groups, encodes each graphic layer group into a composite video frame and appends metadata to the composite video frame. The encoded video frame is then transmitted in sorted order based on quality rank to a client device (e.g., an HMD worn by a user) where the video frame is decoded and displayed. The client device further sends feedback to the server indicating the graphic layer groups that were timely received.

Split Rendering To Improve Tolerance To Delay Variation In Extended Reality Applications With Remote Rendering
20230217034 · 2023-07-06 ·

An improved split rendering process of the present disclosure mitigates the impact of delay variation in XR application with remote rendering applications. A visual scene is split rendered to generate graphic layers from 3D objects in the visual scene. The server node groups and sorts the graphic layers based on QoE importance to create graphic layer groups, encodes each graphic layer group into a composite video frame and appends metadata to the composite video frame. The encoded video frame is then transmitted in sorted order based on quality rank to a client device (e.g., an HMD worn by a user) where the video frame is decoded and displayed. The client device further sends feedback to the server indicating the graphic layer groups that were timely received.

Rate Controlled Image and Texture Data Compression
20230215054 · 2023-07-06 ·

A computing system may access first alpha values associated with first pixels in a first pixel region of an image and determine a bit budget for encoding the first alpha values. The computing system may then select a first alpha-encoding mode for the first alpha values to reflect a determination that the first alpha values are all fully transparent or all fully opaque, and encode the first alpha values by storing the selected first alpha-encoding mode as part of a metadata without using the bit budget to encode the first alpha values individually. The computing system may then update a record of unallocated bits available for allocation based on the bit budget unused in the encoding of the first alpha values, and allocate, based on the record of unallocated bits, bits to encode a set of alpha values different from the first alpha values.

Rate Controlled Image and Texture Data Compression
20230215054 · 2023-07-06 ·

A computing system may access first alpha values associated with first pixels in a first pixel region of an image and determine a bit budget for encoding the first alpha values. The computing system may then select a first alpha-encoding mode for the first alpha values to reflect a determination that the first alpha values are all fully transparent or all fully opaque, and encode the first alpha values by storing the selected first alpha-encoding mode as part of a metadata without using the bit budget to encode the first alpha values individually. The computing system may then update a record of unallocated bits available for allocation based on the bit budget unused in the encoding of the first alpha values, and allocate, based on the record of unallocated bits, bits to encode a set of alpha values different from the first alpha values.

Electronic device content provisioning adjustments based on wireless communication channel bandwidth condition
11695977 · 2023-07-04 · ·

An electronic device includes a wireless transceiver configured to receive content primitives via a wireless communication channel. The electronic device also includes control circuitry control circuitry coupled to the wireless transceiver, and configured to perform content provisioning operations based on the received content primitives, wherein the content provisioning operations comprise generating content image data and transmitting the content image data to the wireless communication channel using the wireless transceiver. In response to a bandwidth condition of the wireless communication channel being less than a threshold, the control circuitry is configured to perform adjusted content provisioning operations that decrease an amount of content image data conveyed by the wireless transceiver to the wireless communication channel.

Real-time deployment of machine learning systems

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for real-time deployment of machine learning networks. One of the operations is performed by the system receiving video data from a video image capturing device. The received video data is converted into multiple video frames. These video frames are encoded into a particular color space format. The system renders a first display output depicting imagery from the multiple encoded video frames. The system performs an inference on the video frames using a machine learning network in order to determine the occurrence of one or more objects in the video frames. The system renders a second display output depicting graphical information corresponding to the determined one or more objects from the multiple encoded video frames. The system then generates a composite display output including the imagery of the first display output overlaid with the graphical information of the second display output.

Real-time deployment of machine learning systems

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for real-time deployment of machine learning networks. One of the operations is performed by the system receiving video data from a video image capturing device. The received video data is converted into multiple video frames. These video frames are encoded into a particular color space format. The system renders a first display output depicting imagery from the multiple encoded video frames. The system performs an inference on the video frames using a machine learning network in order to determine the occurrence of one or more objects in the video frames. The system renders a second display output depicting graphical information corresponding to the determined one or more objects from the multiple encoded video frames. The system then generates a composite display output including the imagery of the first display output overlaid with the graphical information of the second display output.