Patent classifications
H04N21/4728
AUTOMATIC ANNOTATION FOR VEHICLE DAMAGE
Aspects described herein may allow an automated generation of an interactive multimedia content with annotations showing vehicle damage. In one method, a server may receive vehicle-specific identifying information of a vehicle. Image sensors may capture multimedia content showing aspects associated with exterior regions of the vehicle, and may send the multimedia content to the server. For each of the exterior regions of the vehicle, the server may determine, using a trained classification model, instances of damage. Furthermore, the server may generate an interactive multimedia content that shows images with annotations indicating instances of damage. The interactive multimedia content may be displayed via a user interface.
Overlay processing method in 360 video system, and device thereof
A 360 image data processing method performed by a 360 video receiving device, according to the present invention, comprises the steps of: receiving 360 image data; acquiring information and metadata on an encoded picture from the 360 image data; decoding the picture on the basis of the information on the encoded picture; and rendering the decoded picture and an overlay on the basis of the metadata, wherein the metadata includes overlay-related metadata, the overlay is rendered on the basis of the overlay-related metadata, and the overlay-related metadata includes information on a region of the overlay.
Omnidirectional video streaming
A method and system for streaming video is disclosed. The method and system can include sharing and playing an omnidirectional multi-2D-panel video stream wherein field of view data is obtained from a viewer, and the omnidirectional multi-2D-panel video stream is updated when the field of view data indicates a change in a field of view of the viewer.
REGION OF INTEREST BASED IMAGE DATA ENHANCEMENT IN A TELECONFERENCE
A system for updating/magnifying regions within one or more data frames containing a region of interest while regions outside the region of interest are not updated or are updated less frequently. Edge devices are thereby able to surgically apply super-resolution algorithms to relevant regions of image frames, thereby leveraging their overall effectiveness.
REGION OF INTEREST BASED IMAGE DATA ENHANCEMENT IN A TELECONFERENCE
A system for updating/magnifying regions within one or more data frames containing a region of interest while regions outside the region of interest are not updated or are updated less frequently. Edge devices are thereby able to surgically apply super-resolution algorithms to relevant regions of image frames, thereby leveraging their overall effectiveness.
VIDEO CLIENT OPTIMIZATION DURING PAUSE
A system and method for providing quality control in immersive video during pausing of a video streaming session. In one embodiment, a paused video frame may comprise a plurality of mixed quality video tiles depending on user gaze vector information. Under pause control, the video quality of all tiles of the paused video frame is equalized such that it is of same value for all the video tiles, which may be the video quality of the tiles presented in a viewport of the client device. The paused video frame having the same quality tiles throughout is used as a replacement video frame, which is presented to the client device player for decoding and displaying instead of the mixed quality video frame while the streaming session is paused.
SYSTEM, METHOD AND COMPUTER-READABLE MEDIUM FOR VIDEO PROCESSING
The present disclosure relates to a system, a method and a computer-readable medium for video processing. The method includes displaying a live video of a first user in a first region on a user terminal and displaying a video of a second user in a second region on the user terminal. A portion of the live video of the first user extends to the second region on the user terminal. The present disclosure can improve interaction during a conference call or a group call.
SYSTEM, METHOD AND COMPUTER-READABLE MEDIUM FOR VIDEO PROCESSING
The present disclosure relates to a system, a method and a computer-readable medium for video processing. The method includes displaying a live video of a first user in a first region on a user terminal and displaying a video of a second user in a second region on the user terminal. A portion of the live video of the first user extends to the second region on the user terminal. The present disclosure can improve interaction during a conference call or a group call.
Method and apparatus for efficient delivery and usage of audio messages for high quality of experience
A method and a system for virtual reality, augmented reality, mixed reality, or 360-degree Video environment is disclosed. The system receives Video Streams associated to audio and video scenes to be reproduced and Audio Streams associated to audio and video scenes to be reproduced. There are provided a Video decoder which decodes signal from the Video Stream for the representation of the audio and video scene; an Audio decoder which decodes signal from the Audio Stream for the representation of the audio and video scene to the user; and a region of interest processor deciding, based e.g. on the user's viewport, head orientation, movement data, or metadata, whether an Audio information message is to be reproduced. At the decision, the reproduction of the Audio information message is caused.
Method and apparatus for efficient delivery and usage of audio messages for high quality of experience
A method and a system for virtual reality, augmented reality, mixed reality, or 360-degree Video environment is disclosed. The system receives Video Streams associated to audio and video scenes to be reproduced and Audio Streams associated to audio and video scenes to be reproduced. There are provided a Video decoder which decodes signal from the Video Stream for the representation of the audio and video scene; an Audio decoder which decodes signal from the Audio Stream for the representation of the audio and video scene to the user; and a region of interest processor deciding, based e.g. on the user's viewport, head orientation, movement data, or metadata, whether an Audio information message is to be reproduced. At the decision, the reproduction of the Audio information message is caused.