Patent classifications
G06T3/16
METHODS AND APPARATUS FOR SPATIAL GROUPING AND COORDINATE SIGNALING FOR IMMERSIVE MEDIA DATA TRACKS
The techniques described herein relate to methods, apparatus, and computer readable media configured to encode and/or decode video data. Immersive media data includes a first patch track comprising first encoded immersive media data that corresponds to a first spatial portion of immersive media content, a second patch track comprising second encoded immersive media data that corresponds to a second spatial portion of the immersive media content that is different than the first spatial portion, an elementary data track comprising first immersive media elementary data, wherein the first patch track and/or the second patch track reference the elementary data track, and grouping data that specifies a spatial relationship between the first patch track and the second patch track in the immersive media content. An encoding and/or decoding operation is performed based on the first patch track, the second patch track, the elementary data track and the grouping data to generate decoded immersive media data.
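As an illustration of how grouping data could tie patch tracks to spatial portions and to a shared elementary data track, the following is a minimal sketch. All names in it (`Region`, `PatchTrack`, `SpatialGroup`, `tracks_for_region`) and the use of axis-aligned 3D boxes are assumptions made for illustration; the abstract does not specify the data layout or any file-format syntax.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """Hypothetical axis-aligned 3D box: origin plus extent along each axis."""
    x: float
    y: float
    z: float
    dx: float
    dy: float
    dz: float

    def overlaps(self, other):
        # Two boxes overlap when their intervals overlap on every axis.
        return (self.x < other.x + other.dx and other.x < self.x + self.dx and
                self.y < other.y + other.dy and other.y < self.y + self.dy and
                self.z < other.z + other.dz and other.z < self.z + self.dz)


@dataclass
class PatchTrack:
    """Hypothetical patch track: one spatial portion plus a reference
    to the elementary data track it depends on."""
    track_id: int
    region: Region
    elementary_track_id: int


@dataclass
class SpatialGroup:
    """Hypothetical grouping data: the patch tracks that share one
    coordinate system within the immersive media content."""
    group_id: int
    tracks: list


def tracks_for_region(group, query):
    """Return the patch tracks overlapping `query` and the ids of the
    elementary data tracks those patch tracks reference."""
    patches = [t for t in group.tracks if t.region.overlaps(query)]
    elementary = sorted({t.elementary_track_id for t in patches})
    return patches, elementary
```

A decoder following this sketch would fetch only the patch tracks returned for the region being viewed, together with the elementary data track they reference.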
PRODUCING 360 DEGREE IMAGE CONTENT ON RECTANGULAR PROJECTION IN ELECTRONIC DEVICE USING PADDING INFORMATION
Embodiments herein disclose a method for producing 360-degree image content on a rectangular projection in an electronic device. The method includes obtaining 360-degree image content represented by packing one or more projection segments arranged in a rectangular projection. The method includes detecting whether at least one discontinuous boundary is present in the 360-degree image content. The at least one discontinuous boundary is detected using the packing of the one or more projection segments. The method includes applying padding information to the at least one discontinuous boundary. The method includes producing another 360-degree image content on the rectangular projection in the electronic device based on the padding information.
RECTILINEAR VIEWPORT EXTRACTION FROM A REGION OF A WIDE FIELD OF VIEW USING MESSAGING IN VIDEO TRANSMISSION
Rectilinear viewport extraction from a region of a wide field of view is described using messaging in the video transmission field. Some embodiments pertain to a method that includes receiving a wide field of view video, receiving position data from a remote coupled user device, selecting a region of interest in the video based on the received position data, extracting the region of interest from the video, generating region metadata describing the extracted region of interest, encoding the extracted region of interest of the video, and transmitting the encoded video with the region metadata to the user device.
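The core of rectilinear viewport extraction is a per-pixel mapping from the viewport to the wide field of view source. The sketch below assumes the source is an equirectangular frame and that the received position data has been reduced to a yaw and a pitch angle; neither assumption comes from the abstract, and the function name and parameters are hypothetical.

```python
import math


def viewport_pixel_to_equirect(u, v, vp_w, vp_h, fov_deg,
                               yaw_deg, pitch_deg, src_w, src_h):
    """Map viewport pixel (u, v) to fractional pixel coordinates in an
    equirectangular source frame of size src_w x src_h (assumed layout)."""
    # Focal length in pixels for the requested horizontal field of view.
    f = (vp_w / 2.0) / math.tan(math.radians(fov_deg) / 2.0)

    # Ray through the pixel in camera space: x right, y down, z forward.
    x = u - (vp_w - 1) / 2.0
    y = v - (vp_h - 1) / 2.0
    z = f

    # Pitch about the x axis (positive pitch looks up).
    p = math.radians(pitch_deg)
    y1 = y * math.cos(p) - z * math.sin(p)
    z1 = y * math.sin(p) + z * math.cos(p)

    # Yaw about the vertical axis (positive yaw looks right).
    w = math.radians(yaw_deg)
    x2 = x * math.cos(w) + z1 * math.sin(w)
    z2 = -x * math.sin(w) + z1 * math.cos(w)

    # Direction to longitude/latitude, then to source pixel coordinates.
    lon = math.atan2(x2, z2)
    lat = math.atan2(-y1, math.hypot(x2, z2))
    src_x = ((lon / (2.0 * math.pi) + 0.5) * src_w) % src_w
    src_y = (0.5 - lat / math.pi) * src_h
    return src_x, src_y
```

Sampling the source at the returned coordinates for every viewport pixel yields the extracted region; the yaw, pitch, and field of view used here are the kind of values that region metadata could carry alongside the encoded video.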
VIDEO DATA PROCESSING METHOD AND APPARATUS
Example video data processing methods and apparatus are disclosed. One example method includes obtaining viewport information by a server. The server obtains spatial object information based on the viewport information, where the spatial object information is used to describe a specified spatial object in panoramic space. The server obtains a first bitstream, where the first bitstream is obtained by encoding image data in the specified spatial object. The server obtains a second bitstream, where the second bitstream is obtained by encoding image data in the panoramic space. The server transmits the first bitstream and the second bitstream to a client.
Generation device, identification information generation method, reproduction device, and image generation method
There is provided a generation device, an identification information generation method, a reproduction device, and an image generation method capable of easily acquiring a region with a margin and a region with no margin. An identification information generation unit generates margin identification information for identifying that a celestial sphere image includes a region with a margin. A client that uses a margin can easily acquire a region including the margin and a client that does not use a margin can easily acquire a region including no margin. The present technology can be applied to a case in which a celestial sphere image is transmitted from a server and is received and reproduced on a client side.
Spherical rotation for encoding wide view video
Spherical rotation is described for encoding a video that has a wide field of view, such as a spherical or hemispherical video. One example relates to receiving encoded video including rotation orientation metadata, decoding the video, extracting the rotation orientation metadata, rotating the decoded video based on the rotation orientation metadata, generating a view of the rotated decoded video, and buffering the generated view for display.
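A spherical rotation of this kind can be sketched as a rotation matrix applied to unit direction vectors. The yaw/pitch/roll parameterization, the axis convention (x forward, y left, z up), and the function names below are assumptions for illustration; the abstract does not say how the rotation orientation metadata is encoded, nor whether the decoder applies the signaled rotation or its inverse.

```python
import math


def rotation_matrix(yaw_deg, pitch_deg, roll_deg):
    """Rotation R = Rz(yaw) * Ry(pitch) * Rx(roll) as a 3x3 nested list."""
    y, p, r = (math.radians(a) for a in (yaw_deg, pitch_deg, roll_deg))
    cy, sy = math.cos(y), math.sin(y)
    cp, sp = math.cos(p), math.sin(p)
    cr, sr = math.cos(r), math.sin(r)
    return [
        [cy * cp, cy * sp * sr - sy * cr, cy * sp * cr + sy * sr],
        [sy * cp, sy * sp * sr + cy * cr, sy * sp * cr - cy * sr],
        [-sp, cp * sr, cp * cr],
    ]


def transpose(m):
    """Transpose of a 3x3 matrix; for a rotation this is its inverse."""
    return [[m[j][i] for j in range(3)] for i in range(3)]


def rotate_lon_lat(lon_deg, lat_deg, m):
    """Rotate a point on the sphere, given and returned in degrees."""
    lon, lat = math.radians(lon_deg), math.radians(lat_deg)
    v = (math.cos(lat) * math.cos(lon),
         math.cos(lat) * math.sin(lon),
         math.sin(lat))
    x, y, z = (sum(m[i][j] * v[j] for j in range(3)) for i in range(3))
    # Clamp z to guard asin against rounding just outside [-1, 1].
    lat2 = math.asin(max(-1.0, min(1.0, z)))
    lon2 = math.atan2(y, x)
    return math.degrees(lon2), math.degrees(lat2)
```

Applying `rotate_lon_lat` with a matrix and then with its transpose returns the original point, which is the property a decoder relies on when it undoes a rotation applied before encoding.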
Panoramic video compression method and device
Embodiments of the present application disclose a compression method and device for a panoramic video. The method comprises: for a target panoramic video, generating each frame picture of the target panoramic video; compressing each frame picture; dividing each compressed frame picture; mosaicking the pictures obtained through the dividing; and generating a new panoramic video from all the pictures obtained through the mosaicking. With the embodiments of the present application, waste of network bandwidth can be reduced.
Information processing apparatus, video synopsis method, and storage medium for video synopsis
An information processing apparatus includes a specification unit, a determination unit, and a generation unit. The specification unit specifies a relationship between a plurality of objects detected in a video. The determination unit determines arrangement of motion trajectories of the plurality of objects, the relationship between which has been specified by the specification unit, while maintaining a relative appearance order in the video and avoiding an overlap of the motion trajectories in a temporal direction. The generation unit generates a summary video of the video based on the arrangement determined by the determination unit.
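One simple way to picture the arrangement step is to treat each set of related objects as a unit that keeps its internal timing, and to place the units back to back in order of first appearance. This is a deliberately simplified sketch, not the method of the abstract: the tuple layout, the `arrange` function, and the strictly sequential placement are all assumptions, and a real synopsis would typically also let spatially separate trajectories share the same time span.

```python
from collections import defaultdict


def arrange(trajectories):
    """Assign new start times to motion trajectories.

    trajectories: list of (obj_id, group_id, start, end) tuples, where
    objects with the same group_id have a specified relationship.
    Returns {obj_id: new_start}.
    """
    groups = defaultdict(list)
    for obj_id, group_id, start, end in trajectories:
        groups[group_id].append((obj_id, start, end))

    # Keep the relative appearance order: earliest-starting group first.
    ordered = sorted(groups.values(),
                     key=lambda members: min(s for _, s, _ in members))

    new_start = {}
    cursor = 0  # next free time slot in the summary video
    for members in ordered:
        g_start = min(s for _, s, _ in members)
        g_end = max(e for _, _, e in members)
        for obj_id, s, _ in members:
            # Related objects keep their offsets relative to each other.
            new_start[obj_id] = cursor + (s - g_start)
        # The next group begins when this one ends, so groups do not
        # overlap in the temporal direction.
        cursor += g_end - g_start
    return new_start
```

With this placement, idle time between groups is removed while objects inside a group still appear in the same order and with the same spacing as in the source video.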
Three-dimensional object modeling method, image processing method, image processing device
The present disclosure provides a three-dimensional object modeling method, an image processing method, and an image processing device. The three-dimensional object modeling method includes: a plane contour extraction step, in which for at least one panoramic image shot for each three-dimensional object, a plane contour of the at least one panoramic image in a three-dimensional space is extracted; a scale normalization step, in which a scale of the plane contour of the at least one panoramic image is normalized based on a camera position, to obtain normalized plane contours of respective panoramic images in the three-dimensional space; and a multi-object splicing step, in which based on the camera position, rotation and translation operations are performed on three-dimensional point coordinates of the normalized plane contours, so as to unify the three-dimensional point coordinates in the same coordinate system, to splice plane contours of respective three-dimensional objects into a multi-object plane contour.
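The scale normalization and splicing steps can be sketched for 2D floor-plan contours as a uniform scaling followed by a rigid transform into a shared coordinate system. The use of camera height as the scale reference, the 2D simplification, and the function names are assumptions for illustration; the abstract states only that normalization and the rotation and translation operations are based on the camera position.

```python
import math


def normalize_scale(contour, camera_height, reference_height=1.0):
    """Scale a contour so that all contours share the same camera height.

    contour: list of (x, y) points in that camera's local coordinates.
    """
    s = reference_height / camera_height
    return [(x * s, y * s) for x, y in contour]


def to_common_frame(contour, angle_deg, tx, ty):
    """Rotate a contour by angle_deg, then translate it by (tx, ty),
    bringing its points into the common coordinate system."""
    a = math.radians(angle_deg)
    c, s = math.cos(a), math.sin(a)
    return [(c * x - s * y + tx, s * x + c * y + ty) for x, y in contour]


def splice(contours_with_poses):
    """Combine several contours into one multi-object contour list.

    contours_with_poses: list of (contour, camera_height, angle_deg, tx, ty).
    """
    return [
        to_common_frame(normalize_scale(contour, h), angle, tx, ty)
        for contour, h, angle, tx, ty in contours_with_poses
    ]
```

Each entry passed to `splice` pairs one contour with the pose of the camera that captured it, so the output contours can be drawn together in a single plan.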