Device and method for broadcasting virtual reality images input from plurality of cameras in real time
11503268 · 2022-11-15
Assignee
Inventors
Cpc classification
H04N13/383
ELECTRICITY
H04N21/21805
ELECTRICITY
H04N13/117
ELECTRICITY
G06F3/011
PHYSICS
H04N21/4728
ELECTRICITY
H04N13/243
ELECTRICITY
International classification
G02B27/00
PHYSICS
Abstract
A real-time broadcasting device includes a plurality of image capturing units arranged to capture images at the same time in different directions from the same point; an image selection unit for comparing the angle of view covered by each of the plurality of image capturing units and the gaze direction of a user terminal, and selecting an image of the image capturing unit, among the plurality of image capturing units, capturing images corresponding to the gaze direction of the user terminal; an image transmission unit for transmitting virtual reality images captured by the image capturing unit selected by the image selection unit to the user terminal in real time; and a gaze direction reception unit for receiving feedback about which direction the gaze direction of the user terminal viewing the virtual reality images is, and transmitting same to the image selection unit.
Claims
1. A real-time broadcasting device for virtual reality (VR) images, wherein the device is for real-time broadcasting of VR images input from a plurality of cameras, comprising: a plurality of image shooters disposed to shoot images in different directions from the same point at the same time; an image selector configured to compare a field of view covered by each of the plurality of shooters and a gaze direction of an user terminal to select an image of a shooter shooting an image corresponding to the gaze direction of the user terminal among the plurality of shooters; an image transmitter configured to transmit a VR image shot by the shooter selected by the image selector to the user terminal in real time; and a gaze direction receiver configured to receive feedback on which direction the gaze direction of the user terminal for viewing a VR image is, and transmit the gaze direction to the image selector, wherein the real-time broadcasting device is configured to implement real-time broadcasting of VR content by transmitting, to the user terminal, only the image of the shooter in which the gaze direction of the user terminal is included, without stitching a plurality of VR images shot from the plurality of shooters, in order to secure real-time performance.
2. A real-time broadcasting method for broadcasting in real time using the real-time broadcasting device for VR images of claim 1, wherein the method is for real-time broadcasting of VR images input from a plurality of cameras, comprising: (a) determining a shooting field of view shot by a camera unit included in a plurality of shooters; (b) transmitting an image which is being shot by a shooter matching a gaze direction of an user terminal among the plurality of shooter to the user terminal in real time; (c) receiving the gaze direction of the user terminal and a play field of view unique to the user terminal which is able to be played by the user terminal; (d) determining whether or not one end of the play field of view and a shooting field of view of a shooter currently broadcasting in real time satisfy a predetermined switching criterion as the gaze direction of the user terminal changes; and (e) blocking an image of the shooter currently being broadcast in real time and switching the shooter such that an image of an adjacent shooter is transmitted if the switching criterion is satisfied, wherein the real-time broadcasting method implements real-time broadcasting of VR content by transmitting, to the user terminal, only the image of the shooter in which the gaze direction of the user terminal is included, without stitching a plurality of VR images shot from the plurality of shooters, in order to secure real-time performance.
Description
BRIEF DESCRIPTION OF DRAWINGS
(1)
(2)
(3)
(4)
(5) The accompanying drawings are exemplified as a reference for understanding of the technical idea of the present disclosure, and the scope of the present disclosure is not limited thereby.
DETAILED DESCRIPTION
(6) When those skilled in the art determines that a known function related to the present disclosure is self-evident and may unnecessarily obscure the gist of the present disclosure, detailed descriptions thereof are omitted.
(7)
(8) The real-time broadcasting device according to the present disclosure may implement real-time broadcasting of VR content by transmitting, to an user terminal, only an image of an shooter including a gaze direction of the user terminal, without stitching a plurality of VR images shot from the plurality of shooters. Hereinafter, example embodiments will be described in detail.
(9) As can be seen from
(10) The plurality of shooters 110 include N camera units shooting different images at the same time. In a preferred example embodiment, the plurality of shooters 110 are disposed to shoot images at the same time in different directions from the same point.
(11) The plurality of shooters 110 may be disposed to shoot a temporary field of view not be overlapped each other as shown in
(12) As shown in
(13) For example, if a 360VR image is to be transmitted using three camera units, the temporary field of view of each camera unit will be 120 degrees. However, when both ends are further set to have a guard field of view of 15 degrees, the shooting field of view covered by one camera unit will be 150 degrees.
(14) Further, the camera unit included in the plurality of shooters may include a pair of cameras for implementing 3D. The pair of cameras may be a camera for a right eye image and a camera for a left eye image, respectively.
(15) The image selector 120 compares the shooting field of view covered by each of the plurality of shooters 110 and the gaze direction of a user terminal, and select an image of the shooter, among the plurality of shooters, which shoots images corresponding to the gaze direction of the user terminal.
(16) There may be various example embodiments of a method for the image selector 120 to select the shooter to be transmitted in real time.
(17) The user terminal has a unique field of view that a display device of the user terminal may cover, and it is defined as a play field of view. In addition, a specific direction that the user terminal looks at is defined as a gaze direction.
(18) The image selector 120 may use the play field of view and the gaze direction received from the gaze direction receiver 140.
(19) In a preferred example embodiment, the image selector 120 determines whether one end of the play field of view enters the guard field of view included in the shooting field of view of the shooter currently broadcasting in real time as the gaze direction of the user terminal changes. When it is determined that one end of the play field of view enters the guard field of view, the image selector 120 starts a procedure of determining whether one end of the play field of view satisfies a predetermined switching criterion. When it is determined that one end of the play field of view satisfies the predetermined switching criterion, the image selector 120 blocks the image of the shooter currently broadcasting in real time and switches the shooters so the image of the adjacent shooter is transmitted. Various example embodiments may exist for the predetermined switching criterion. In a preferred example embodiment, the switching criterion may be set as 1) the moment when one end of the play field of view entered the guard field of view, 2) the moment when one end of the play field of view reached a point corresponding to ½ of the guard field of view, but is not limited thereto.
(20) The image transmitter 130 transmits a VR image shot by the shooter selected by the image selector to the user terminal in real time.
(21) The gaze direction receiver 140 receives feedback about which direction the gaze direction of the user terminal viewing the VR image is, and transfers it to the image selector. The gaze direction receiver 140 may receive a play field of view unique to the user terminal corresponding to a unique range which may be played on a display device of the user terminal, and the user's gaze direction.
(22)
(23) As can be seen from
(24) In a preferred example embodiment, assuming that there are N camera units to take pictures in different directions at the same time in different directions from the same point, a value obtained by dividing the total field of view covered by the N camera units by N may be determined as a temporary field of view, It may be determined as a value obtained by adding a predetermined guard field of view to both sides of the temporary field of view.
(25) When the shooting field of view is determined in this way, the field of view shot by one camera unit is overlapped the field of view covered by the other camera unit adjacent to both sides by a predetermined field of view.
(26) Next, the image being shot by the shooter matching the gaze direction of the user terminal among the plurality of shooters is transmitted to the user terminal in real time (S1200).
(27) At this time, the real-time broadcasting method of the present disclosure implements real-time broadcasting of VR content by transmitting, to the user terminal, only the image of the shooter including the gaze direction of the user terminal, without stitching a plurality of VR images shot from the plurality of shooters.
(28) Next, the gaze direction of the user terminal and the play field of view unique to the user terminal which may be played by the user terminal are received from the user terminal (S1300).
(29) Next, it is determined whether or not one end of the play field of view and the shooting field of view of the shooter currently broadcasting in real time satisfy a predetermined switching criterion as the gaze direction of the user terminal changes (S1400).
(30) In a preferred example embodiment, whether the switching criterion is satisfied may be determined in the following way.
(31) First, it is determined whether or not one end of the play field of view enters the guard field of view included in the shooting field of view of the shooter currently broadcasting in real time as the gaze direction of the user terminal changes.
(32) When the one end of the play field of view enters the guard field of view, then a procedure of determining whether or not the one end of the play field of view satisfies the predetermined switching criterion starts.
(33) On the contrary, the determination of whether or not the switching criterion is satisfied is stopped when the one end of the play field of view deviates from the guard field of view to reduce load.
(34) When it is determined that the switching criterion is satisfied, the image of the shooter currently broadcasting in real time is blocked and the shooter is switched such that the image of the adjacent shooter is transmitted (S1500).
(35) [How to Synchronize Frames of Image Played Before and After Switching]
(36) If the image of the shooter being broadcast in real time is replaced, interruption of the image played in the user terminal may occur. In order to prevent this, in a preferred example embodiment of the present disclosure, it is possible to implement frame synchronization between the images before and after the switching by allowing images to be switched only in a key frame of the image of the shooter when the switching operation of the shooter starts.
(37) A more detailed description of this is as follows.
(38) A group of picture (GOP) means a set including at least one I-frame in MPEG image processing. Frames include I (Intra-coded), P (Predictive-coded), and B (Bidirectional-coded) frames. I-frames are key frames, and are encoded without reference to other frames. Accordingly, the I frame may be independently decoded without reference to other frames.
(39) The P frame is coded with reference to previous I frame and P frame. The B frame is coded using both previous and subsequent I frames and P frames. Accordingly, it takes a considerable amount of time to decode the B frame.
(40) Accordingly, when the frame of the image played after switching is frame B, the time required for decoding it may greatly increase, which may lead to synchronization failure.
(41) Accordingly, the present disclosure maintains the time required for decoding within a predictable range by switching between images only in the key frame.
(42) For this, a preferred example embodiment of the present disclosure shortens a key frame distance (KFD) of an image by using a predetermined method. In addition, when the played image is switched as the user's gaze changes, the image is switched only in the key frame among the frames forming the image, thereby implementing frame synchronization between the images before and after switching.
(43) In an example embodiment, the KFD may be shortened by a method such that the B frame is not included in the image set. This is because the time required for decoding the B frame is considerable.
(44) In addition, it is necessary to secure a predetermined delay time in order to perform the switching only at the key frame rather than an immediate switching. In other words, the switching should be delayed by a predetermined delay time, and an issue of the screen being played being cut off should not occur during the delay time. For this, in the present disclosure, such a delay time may be secured by setting the predetermined guard field of view and setting the switching criterion comfortably as described above.
(45) [Real-Time Broadcasting of Wide Area Image, Patch Image, Etc.]
(46) In a preferred example embodiment, the VR images transmitted in real time in the present disclosure is a concept including a wide area image, a patch image, and a division image.
(47)
(48) The concept of the wide area image and the patch image of the present disclosure will be described with reference to
(49) A patch image V.sub.1-3 is an image representing a part of the VR area 10 of 360 degrees. The patch image V.sub.1-3 may have different areas to cover, different spaces, and different image quality. For example, a first patch image V.sub.1 may be a high-quality image that covers upper and lower partial areas of the front part. If a VR content is a musical, the first patch image V.sub.1 may be an area covering a stage of the musical. A second patch image V.sub.2 may be an image that covers an upper partial area of the rear part, and a third patch image V.sub.3 may be an image that covers a lower partial area of the rear part.
(50) The patch image V.sub.1-3 is played by being overlapped or patched on the wide area image V.sub.0. Therefore, even if any one of the played patch images V.sub.1-3 is off as needed, no space is generated in the image because the wide area image V.sub.0 is played behind.
(51) The patch image V.sub.1-3 is played in synchronization with the wide area image V.sub.0. This is because the patch image V.sub.1-3 may cause dizziness to the user if synchronization of the wide area image V.sub.0 is not implemented.
(52) Asynchronous content V.sub.4 means content that is randomly inserted by the intention of a creator regardless of the plurality of VR images. The asynchronous content V.sub.4 may be a moving picture or a specific event action. In terms of content, it may be an advertisement, or it may be an event related to the VR image.
(53) The asynchronous content V.sub.4 is not synchronized with the plurality of VR images. In other words, regardless of the sync between the patch image V.sub.1-3 and the wide area image V.sub.0, it is played or operated according to separate trigger information. In a preferred example embodiment, the trigger information includes information on whether the user's gaze looks at the position of the asynchronous content V.sub.4.
(54) The concept of a division image V.sub.1-N of the present disclosure will be described with reference to
(55) In an example embodiment, the division image V.sub.1-N may be an image divided into N areas not overlapped each other. The plurality of division images V.sub.1-N may have different sizes and different image quality.
(56) Since the images overlapped each other by the predetermined guard field of view have been described with reference to
(57) It is apparent to those skilled in the art that the present disclosure may be embodied in other specific forms without departing from the features of the present disclosure. Accordingly, the above detailed description should not be construed as restrictive in all respects but as illustrative. The scope of the present disclosure should be determined by a reasonable construction of the appended claims, and all modifications within the equivalent scope of the present disclosure are included in the scope of the present disclosure.