Remote behavior navigation system and processing method thereof
09538131 ยท 2017-01-03
Assignee
Inventors
Cpc classification
H04N7/147
ELECTRICITY
Y02P90/02
GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
G05B2219/31046
PHYSICS
G06F3/048
PHYSICS
H04N7/142
ELECTRICITY
International classification
G06T19/00
PHYSICS
Abstract
There are provided a remote behavior navigation system and a method thereof, which allow an instructor at a remote location to perform guidance to a worker while watching a work video picture of the worker. Corresponding to a video picture from a worker-side camera at time t1, an instructor-side computer extracts a work instruction video picture from a video picture of an instructor-side camera at time t2. The instructor-side computer superimposes the work instruction video picture at the time t2 on the video picture from the worker-side camera at the time t1, and displays the superimposed video picture on an instructor-side monitor. A worker-side computer provides the work instruction video picture at the time t2 with a time difference correction, then superimposes the corrected work instruction video picture on a video picture from the worker-side camera at time t3, and displays the superimposed video picture on a worker-side monitor.
Claims
1. A remote behavior navigation system configured to allow an instructor at a remote location to provide a work instruction to a worker while watching a work video picture of the worker, the system comprising: a first camera and a first monitor provided to the worker; a worker-side computer configured to control the first camera and the first monitor; a second camera and a second monitor provided to the instructor; and an instructor-side computer configured to control the second camera and the second monitor, the instructor-side computer being connected to the worker-side computer via a bidirectional communication network, the instructor-side computer extracts a work instruction video picture from a video picture of the second camera at time t2 corresponding to a video picture from the first camera at time t1, (1) the instructor-side computer superimposes the work instruction video picture at the time t2 on the video picture from the first camera at the time t1, and displays the superimposed video picture on the second monitor, and (2) the worker-side computer provides the work instruction video picture at the time t2 with a time difference correction being a function of =|t3t1|, then superimposes the corrected work instruction video picture on a video picture from the first camera at time t3, and displays the superimposed video picture on the first monitor.
2. The remote behavior navigation system according to claim 1, wherein the time difference correction is homographic transformation based on a comparison between the video pictures from the first camera at the time t1 and the time t3.
3. The remote behavior navigation system according to claim 2, wherein the first camera is a monocular device to cover one of the eyes of the worker, and the second camera is a binocular device to cover both of the eyes of the instructor.
4. The remote behavior navigation system according to claim 2, wherein each of the first camera and the second camera is a binocular device to cover both of the eyes of the worker or the instructor.
5. A processing method of remote behavior navigation allowing an instructor at a remote location to provide a work instruction to a worker while watching a work video picture of the worker, the method applicable to a system including a first camera and a first monitor provided to the worker, a worker-side computer configured to control the first camera and the first monitor, a second camera and a second monitor provided to the instructor, and an instructor-side computer configured to control the second camera and the second monitor, the instructor-side computer being connected to the worker-side computer via a bidirectional communication network, the method comprising: causing the instructor-side computer to extract a work instruction video picture from a video picture of the second camera at time t2 corresponding to a video picture from the first camera at time t1; (1) causing the instructor-side computer to superimpose the work instruction video picture at the time t2 on the video picture from the first camera at the time t1, and to display the superimposed video picture on the second monitor; and (2) causing the worker-side computer to provide the work instruction video picture at the time t2 with a time difference correction being a function of =|t3t1|, then to superimpose the corrected work instruction video picture on a video picture from the first camera at time t3, and to display the superimposed video picture on the first monitor.
6. The processing method of remote behavior navigation according to claim 5, wherein the time difference correction is homographic transformation based on a comparison between the video pictures from the first camera at the time t1 and the time t3.
7. The processing method of remote behavior navigation according to claim 6, wherein the first camera is a monocular device to cover one of the eyes of the worker, and the second camera is a binocular device to cover both of the eyes of the instructor.
8. The processing method of remote behavior navigation according to claim 6, wherein each of the first camera and the second camera is a binocular device to cover both of the eyes of the worker or the instructor.
Description
BRIEF DESCRIPTION OF DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
DESCRIPTION OF EMBODIMENTS
(7) An embodiment of a remote behavior navigation system and a processing method thereof according to the present invention will be described below by using
(8) As shown in
(9) Here, referring also to
(10) In particular, it is preferable to present the instruction video picture of the contents to be worked on from the instructor P2 to the worker P1, in such a way as to superimpose the instruction video picture on the work video picture showing a work environment which is shot with the camera 14. Thus, it is possible to cause the worker P1 to visually grasp the contents to be worked on while feeling presence.
(11) Meanwhile, the worker P1 has to stand by for a period of data transmission on the line 2 from time t=t1 of transmission of the work situation to time t=t3 of reception of the instruction from the instructor P2. If such a time delay =|t3t1| is large relative to a change in the work situation of the worker P1, the work situation subject to the instruction may have changed and it would no longer be possible to actually accomplish the action guidance by the instructor P2.
(12) Accordingly, an attempt to transmit the instruction video picture superimposed on the work video picture in the course of the aforementioned transmission (S102) of the instruction video picture from the instructor-side PC 21 to the worker-side PC 11 involves a large data volume and requires more time for data transfer. In other words, the time delay is increased. Given the circumstances, this embodiment takes into consideration transmission of only the instruction video picture at time t=t2, thus causing the worker-side PC 11 to superimpose the instruction video picture on the work video picture. In this case, assuming that line communication speeds are the same, time 2=|t3t2| required for the transmission of the instruction data to the worker-side PC 11 (S102) can be made smaller than time 1=|t2t1| required for the transmission of the work situation data to the instructor-side PC 21 (S101), and the time delay is not increased very much.
(13) When the time delay is small, the worker P1 can perform the work while putting on a monocular HMD (head display) provided with the monitor 15. In other words, the worker can visually check the actual work situation with one eye and the contents of the instructed work displayed on the HMD with the other eye. According to this configuration, even in case of a sudden change in the work situation, it is possible to deal with the change promptly since the actual work situation is visually checked. Here, even in the case of the monocular HMD, it is still desirable to further reduce the time delay so as to remove discomfort in the visual check by the worker P1.
(14) In this regard, it is possible to reduce the time delay to nearly zero by superimposing the instruction video picture on the work video picture shot with the camera 14 at the point of time t=t3 of reception of the instruction video picture. In this case, the video picture displayed on the monitor 15 substantially coincides with the actual work situation. Accordingly, it is possible to eliminate the discomfort in the visual check by the worker P1 even by using the monocular HMD. Moreover, it is also possible to employ a binocular HMD so as to improve the feeling of presence.
(15) Furthermore, details will be described in accordance with
(16) First, the worker P1 at the work site shoots a video picture of the work situation in front of the worker P1 with the camera 14 at the time t=t1, then compresses video picture (image) data of this video picture C1 (see
(17) Next, the instructor P2 chooses the operation tool A2 to be operated among the operation tools A1, A2, and so on while watching the monitor 25 of the binocular HMD displaying the video picture C1, i.e., from the same viewpoint as that of the worker P1 at the work site, and then reaches out a hand to attempt the operation. At this time t=t2, the camera 24 of the instructor P2 shoots a wrist image 6 of the instructor P2, who is attempting the operation as well as background 5 at that site together as a video picture C2 (see
(18) The video picture C2 is subjected to image processing by the PC 21. Specifically, the background 5 is erased from the video picture C2 to obtain a video picture C2 that extracts the wrist image 6 of the instructor P2 (S22, see
(19) Concurrently, the video picture C2 is superimposed on the video picture C1 (S24), and is displayed on the monitor 25 of the binocular HMD of the instructor P2 (S25). Thus, the instructor P2 can perform an instruction action while being provided with the feeling of presence as if the instructor P2 is present at the location of the worker P1.
(20) Meanwhile, the worker P1 side having received the video picture C2 corrects the video picture by calculating time elapsed from the shooting and transmission (S11), i.e., the time delay =|t3t1| from the transmission of the video picture C1 (S13). A video picture obtained by correcting the video picture C2 by means of a function of the time delay is superimposed on a video picture C3 with the camera 14 at t=t3 (S14).
(21) Here, as shown in
(22) In short, it is possible to achieve the superimposition on the video picture C3 of the camera 14 at the time t=t3 by subjecting a CG image of the wrist image 6 transmitted by the instructor P2 to the plane projective transformation based on the estimated homographic matrix f (see
(23) The superimposed video picture is displayed on the monitor 15 (S15), so that the worker P1 can work in accordance with the movement of the wrist image 6 of the instructor P2 (S16).
(24) The remote behavior navigation system and the processing method thereof, configured to extract the CG image of the wrist image 6 transmitted by the instructor P2, then to perform the plane projective transformation based on the estimated homographic matrix, and to present this outcome to the worker P1 have been described above. In the system and the method, a work instruction video picture is extracted and transmitted to the worker P1. Accordingly, it is possible to reduce a volume of data transmission and to reduce the time delay in the data transmission as compared to the case of superimposing the instruction video picture on the work video picture and then transmitting the superimposed video picture. Moreover, by using a time difference correction which is the function of the time delay =|t3t1|, it is possible to form the video picture to be visually checked by the worker P1 into the video picture that almost corresponds to the time t=t3 at that point. In other words, it is possible to suppress hindrance to workability due to the time delay, so that the worker can perform accurate work while efficiently receiving the instruction by the instructor P2.
(25) Here, it is also possible to use a terminal in which the camera 14 (24), the monitor 15 (25), and the voice device (the microphone and the speaker) 16 (26) are integrated with each PC 11 (21). In this case, the line 2 can partially include a wireless line. Meanwhile, although the transmission of the video pictures has been mainly described above, transmission of voices together with the video pictures can be performed as well. A line capacity necessary for the transmission of the voices is significantly smaller than that for the video pictures. Accordingly, it is possible to consider a transmission method in conformity with the video picture by using publicly known methods as appropriate.
(26) As described above, it is possible to provide the worker P1 at the work site with the appropriate work instruction for the worker P1 by displaying the transformed and corrected CG image of the hand of the instructor P2 matching the most recent situation at the site while demonstrating the most recent image of the site to the worker P1 at the work site. The transformed and corrected display can be achieved by correcting the instruction information in such a way as to correspond to the on-site image which has changed from the time of transmission of the on-site image from the worker P1 to the instructor P2 to the time of reception of the instruction information from the instructor P2. In other words, this is the change in the on-site image shot with the camera 14 of the worker P1. Accordingly, the change corresponds to the movement of the camera 14, so that the transformation method such as the homographic transformation is typically applicable.
(27) Here, in order to provide a work instruction regarding a moving object, the moving object may be rendered translucent for the purpose of clarification, and then be displayed in such a manner as to be superimposed on the on-site image as with the above-described method, for example. Meanwhile, methods of detecting the moving object include a method of constantly performing detection processing of the moving object from the on-site image shot with the camera 14 and recording detection results together with detection times, a method of performing the detection processing of the moving object by using a different portion obtained by comparing the on-site images shot with the camera 14 at different times, and so forth.
(28) While the embodiment according to the present invention and the modified examples based on the embodiment have been described above, it is to be understood that the present invention is not limited only to the foregoing. A person skilled in the art will be able to arrive at various alternative embodiments and modified examples without departing from the gist of the present invention or from the appended claims.
REFERENCE SIGNS LIST
(29) 1 remote behavior navigation system 2 communication line 11, 21 PC 14, 24 camera 15, 25 monitor