Apparatus and method for decoding a panoramic video
11202099 · 2021-12-14
Assignee
Inventors
- Maxim Borisovitch SYCHEV (Moscow, RU)
- Alexis DECURNINGE (Boulogne Billancourt, FR)
- Dimitrios Tsilimantos (Boulogne Billancourt, FR)
- Alexander Alexandrovich KARABUTOV (Moscow, RU)
- Stefan Valentin (Boulogne Billancourt, FR)
- Sergey Yurievich Ikonin (Moscow, RU)
Cpc classification
H04N13/161
ELECTRICITY
H04N19/573
ELECTRICITY
H04N19/105
ELECTRICITY
H04N19/139
ELECTRICITY
H04N19/597
ELECTRICITY
International classification
H04N19/597
ELECTRICITY
H04N19/105
ELECTRICITY
H04N19/139
ELECTRICITY
H04N13/161
ELECTRICITY
Abstract
A system and method for improving quality of encoding and decoding a panoramic video. The panoramic video comprises a sequence of encoded picture frames. In panoramic video picture frames are mapped on a spherical viewing area. In the method the projection center of the encoded frame is transformed before encoding. In the decoding phase the projection center is transformed back accordingly after decoding the frame.
Claims
1. A picture decoding apparatus comprising processing circuitry, wherein the processing circuitry is configured to: receive an encoded panoramic picture; decode the received encoded panoramic picture; compute a geometrical transformation in accordance with a replacement vector, wherein the replacement vector is included with the encoded panoramic picture and the replacement vector determines a difference between a projection center of the received encoded panoramic picture and a projection center of a panoramic output picture; and construct the panoramic output picture, wherein the processing circuitry is further configured to transform the decoded panoramic picture in accordance with the computed geometrical transformation.
2. The picture decoding apparatus of claim 1, wherein the geometrical transformation is a backward transformation from a transformed projection center to an original projection center.
3. The picture decoding apparatus of claim 1, wherein the processing circuitry is further configured to use the projection center of the received encoded panoramic picture or the projection center of the panoramic output picture as a focal point.
4. The picture decoding apparatus of claim 1, wherein the processing circuitry is further configured to use a midway point between the projection center of the received encoded panoramic picture and the projection center of the panoramic output picture as a focal point.
5. A picture encoding apparatus comprising processing circuitry, wherein the processing circuitry is configured to: receive a panoramic picture; construct a transformed panoramic picture by computing a geometrical transformation of the received panoramic picture; and encode the transformed panoramic picture, wherein a replacement vector for a projection center is stored with the encoded transformed picture, and the replacement vector comprises a parameter set of the geometrical transformation between a projection center of the received panoramic picture and a projection center of the constructed transformed panoramic picture.
6. The picture encoding apparatus according to claim 5, wherein the geometrical transformation is a forward transformation from an original to a replaced projection center.
7. The picture encoding apparatus of claim 5, wherein the processing circuitry is further configured to use the projection center of the received panoramic picture or the projection center of the constructed panoramic picture as a focal point.
8. The picture encoding apparatus of claim 5, wherein the processing circuitry is further configured to use a midway point between the projection center of the received panoramic picture and the projection center of the constructed panoramic picture as a focal point.
9. A picture decoding method comprising: receiving an encoded panoramic picture; decoding the received encoded panoramic picture; computing a geometrical transformation in accordance with a replacement vector, wherein the replacement vector is included with the encoded panoramic picture and the replacement vector determines a difference between a projection center of the received encoded panoramic picture and a projection center of a panoramic output picture; constructing the panoramic output picture; and transforming the decoded panoramic picture in accordance with the computed geometrical transformation.
10. The picture decoding method of claim 9, wherein the geometrical transformation is a backward transformation from a transformed projection center to an original projection center.
11. The picture decoding method of claim 9, wherein the method further comprises using the projection center of the received encoded panoramic picture or the projection center of the panoramic output picture as a focal point.
12. The picture decoding method of claim 9, wherein the processing circuitry is further configured to use a midway point between the projection center of the received encoded panoramic picture and the projection center of the panoramic output picture as a focal point.
13. A picture encoding method comprising: receiving a panoramic picture; constructing a transformed panoramic picture by computing a geometrical transformation of the received panoramic picture; and encoding the transformed panoramic picture, wherein a replacement vector for a projection center is stored with the encoded transformed panoramic picture, and the replacement vector comprises a parameter set of the geometrical transformation between a projection center of the received panoramic picture and a projection center of the transformed panoramic picture.
14. The picture encoding method according to claim 13, wherein the geometrical transformation is a forward transformation from an original to a replaced projection center.
15. The picture encoding method of claim 13, wherein the method further comprises using the projection center of the received panoramic picture or the projection center of the constructed panoramic picture as a focal point.
16. The picture encoding method of claim 13, wherein the method further comprises using a midway point between the projection center of the received panoramic picture and the projection center of the transformed panoramic picture as a focal point.
Description
DESCRIPTION OF THE DRAWINGS
(1) The present description will be better understood from the following detailed description read in light of the accompanying drawings, wherein:
(2)
(3)
(4)
(5)
(6)
DETAILED DESCRIPTION
(7) The detailed description, provided below in connection with the appended drawings, is intended as a description of the embodiments and is not intended to represent the only forms in which the embodiment may be constructed or utilized. However, the same or equivalent functions and structures may be accomplished by different embodiments. In the following description decoding is discussed, however, a corresponding encoder includes a similar decoder part.
(8) In the following description a video coding apparatus, a method and a corresponding computer program are disclosed. The description relates to encoding and decoding of a panoramic video. The panoramic video may be, for example of 180 or 360 degrees, and stored into a video stream or other similar encoded sequence of pictures in the form of picture frames of any projection. The decoded picture frames are then mapped to a surface of a sphere forming the panoramic view. Thus, each of the mapped picture frames has a projection center, which can also be determined when the picture frame is stored as a two-dimensional picture frame in the video stream. In the following description the term “projection center” means the center of a sphere on which the panoramic video is projected.
(9)
(10) The encoder 100 receives an input from a preprocessor that is not shown in
(11) At the encoding side the picture encoding is performed in conventional manner using the shown encoder and the respective components. However, the encoded picture data (encoded image data) 171 that is provided at the output 172 of the decoder is different also in that sense that the encoded picture data comprises the replacement vector, that may be also encoded, for example, in the entropy encoding block 170, or directly before or after the entropy encoding. As the introduction of the replacement vector does not directly relate to the encoding of the picture data of the picture block 101 it can be done in any suitable location of the encoder. Thus, because of the preprocessing, the encoded picture data 171 differs from the conventional encoding in that the projection center of the encoded picture is different and the encoding result is different. This provides better capability of improving quality as more details can be preserved. A further difference is the introduction of the replacement vector or other set of parameters that can be used for corresponding geometrical transformation.
(12)
(13) In the decoder 200 the encoded picture data 171 comprises picture data of a picture and a replacement vector as explained with regard
(14) The expression replacement vector is to be understood as a set of parameters that represents the geometrical difference between the projection centers of the received panoramic picture and the constructed second panoramic picture.
(15) In
(16) The constructed second transformed panoramic picture is then encoded into a video stream, step 303. The encoding step includes encoding of the replacement vector. The replacement vector is associated into the video stream so that when the video stream is later being decoded the first panoramic picture that represents the original picture can be constructed using backward transformation in accordance with the replacement vector. As can be seen in Figure the encoded data 304 includes both the second transformed panoramic picture and the replacement vector.
(17) In
(18) At the post processing phase 403 a backward transformation is applied to the second panoramic picture 402 in accordance with the replacement vector. As an outcome a third panoramic picture 404 is achieved. The third panoramic picture 404 is a reconstructed copy of the original panoramic picture, which is in this example the first panoramic picture 300 of
(19) Even if the result is a reconstructed copy of the first panoramic picture it is understood that it may not be necessarily identical. Deviations may be caused by, for example lossy compression algorithms, or other algorithms that irreversibly change the picture during pre-processing, encoding, decoding or post-processing.
(20) In
(21) A three-dimensional sphere centered at Si represents the initial panoramic picture or spherical picture. In the following 52 denotes the shifted center of the transformed spherical picture and R the scaling factor of the radius, i.e. the shifted sphere has a radius R, assuming a radius equal to 1 for the initial sphere. In the following, f represents the focal point of both pictures, so that any considered ray of light goes through f Without loss of generality, the f can be considered to represent the origin of the Cartesian coordinate system, i.e.
f=(0,0,0).sup.T
(22) In the following a=(ax, ay, az).sup.T denotes the local coordinates of a pixel at the surface of the initial sphere that is centered at coordinate Si. We denote by a′=(ax′, ay′, az′).sup.T the local coordinates of the transformed pixel at the surface of the normalized version of the shifted sphere that is centered at coordinate s.sub.2.
(23) Then, s.sub.2+Ra′ results in the intersection of the ray of light going through s.sub.1+a and focal point f Therefore, after some calculus, point a′ satisfies the following equation
(24)
(25) One can recover the initial picture from the transformed spherical picture through the following backward mapping:
(26)
(27) Note that, if f=s.sub.2, the forward transformation simplifies into:
(28)
(29) Furthermore, if
(30)
and R=1, the transformations are symmetrical, that is to say
T.sup.−1(a)=−T(−a).
(31) It is also possible to use the backward transformation when f=s.sub.1, i.e. the projection center of the current frame is used as focal point. This may be convenient in some applications.
(32) The video coding apparatus and method have been described in conjunction with various embodiments herein. However, other variations to the disclosed embodiments can be understood and effected by those skilled in the art in practicing the claimed invention, from a study of the drawings, the disclosure, and the appended claims. In the claims, the word “comprising” does not exclude other elements or steps, and the indefinite article “a” or “an” does not exclude a plurality. A single processor or other unit may fulfill the functions of several items recited in the claims. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measured cannot be used to advantage. A computer program may be stored/distributed on a suitable medium, such as an optical storage medium or a solid-state medium supplied together with or as part of other hardware, but may also be distributed in other forms, such as via the Internet or other wired or wireless telecommunication systems.