Intensity-modulated light pattern for active stereo
10928189 · 2021-02-23
Assignee
Inventors
Cpc classification
G06F3/0659
PHYSICS
G06F12/00
PHYSICS
B29C64/386
PERFORMING OPERATIONS; TRANSPORTING
B29C64/00
PERFORMING OPERATIONS; TRANSPORTING
A63F13/213
HUMAN NECESSITIES
G02B27/4233
PHYSICS
G06F11/3024
PHYSICS
G01B11/2545
PHYSICS
H04N13/239
ELECTRICITY
G01B11/2513
PHYSICS
G01B11/25
PHYSICS
H04N13/25
ELECTRICITY
H04N23/11
ELECTRICITY
G02B27/4205
PHYSICS
H04N13/254
ELECTRICITY
H04N2013/0081
ELECTRICITY
H04N13/271
ELECTRICITY
International classification
G01B11/25
PHYSICS
G06F9/30
PHYSICS
G06F12/00
PHYSICS
H04N13/25
ELECTRICITY
H04N13/254
ELECTRICITY
H04N13/271
ELECTRICITY
H04N17/00
ELECTRICITY
H04N13/239
ELECTRICITY
B29C64/386
PERFORMING OPERATIONS; TRANSPORTING
B29C64/00
PERFORMING OPERATIONS; TRANSPORTING
G02B27/42
PHYSICS
Abstract
The subject disclosure is directed towards projecting light in a pattern in which the pattern contains components (e.g., spots) having different intensities. The pattern may be based upon a grid of initial points associated with first intensities and points between the initial points with second intensities, and so on. The pattern may be rotated relative to cameras that capture the pattern, with captured images used active depth sensing based upon stereo matching of dots in stereo images.
Claims
1. A system comprising: a projector that projects a light pattern towards a scene, the projector including: two or more stereo cameras comprising a first stereo camera and a second stereo camera; a multimode laser; and a diffractive optical component configured to: diffract the light pattern from the multimode laser into a dot pattern comprising a first set of points of a higher intensity range and a second set of points of a medium intensity range that is less than the higher intensity range, wherein the second set of points are projected within polygons of the first set of points, wherein the first set of points comprise points of different intensities within the higher intensity range, rearrange one or more points of the second set of points into modified positions, and sense the first set of points by one or more stereo cameras.
2. The system of claim 1, wherein the two or more stereo cameras are configured to sense the first set of points and the second sets of points as left and right stereo camera images.
3. The system of claim 2, the system further comprising: an image processing component configured to process the stereo camera images into depth maps.
4. The system of claim 2, wherein the projector is incorporated into a single device with the two or more stereo cameras.
5. The system of claim 4, wherein two or more stereo cameras are positioned on opposite sides of the projector.
6. The system of claim 1, wherein the first set of points, and the second set of points are interleaved via subdivision.
7. The system of claim 1, wherein at least one of the points of the first set of points or the second set of points is randomly or pseudo-randomly assigned an intensity value within the higher intensity range.
8. The system of claim 7, wherein assigning intensity values to the points of the first set of points and the second set of points involves applying one or more weighted random techniques to bias assigned intensity values away from values of the medium intensity range.
9. The system of claim 1, wherein the projected light pattern comprises infrared light for sensing by infrared stereo cameras or visible light for sensing by color stereo cameras.
10. A method of projecting, through a presentation device, a light pattern toward a scene for use in depth sensing, the method comprising: generating a first set of points oriented in a grid as a collection of polygons; assigning different intensities to the first set of points within a first intensity range; generating a second set of points oriented within the polygons of the first set of points; assigning different intensities to the second set of points within a second intensity range that is lower than the first intensity range; projecting the first set of points and the second set of points as the light pattern toward the scenes; rearranging one or more points of the second set of points into modified positions, and sensing the first set of points by one or more stereo cameras.
11. The method of claim 10 further comprising: wherein the first set of points are sensed by a right stereo camera and the second set of points are sensed by a left stereo camera.
12. The method of claim 11 wherein the light pattern is outputted by the diffractive component in two instances at different angles rotated toward at least one of the one or more stereo cameras.
13. The method of claim 11, further comprising: processing, by an image processing component, the light pattern into a depth map.
14. The method of claim 13, further comprising extracting matching features from the light pattern.
15. The method of claim 10, further comprising: generating a third set of points oriented within the polygons of the first set of points; assigning different intensities to the third set of points within a third intensity range that differs from the first intensity range and the second intensity range; and projecting the third set of points as part of the light pattern.
16. The method of claim 10, further comprising randomly or pseudo-randomly selecting an intensity value within the first intensity range for at least some of the points in the first set of points.
17. The method of claim 10, wherein the first intensity range comprises a minimum intensity value and a maximum intensity value, the minimum intensity value being numerically smaller than the maximum intensity value.
18. The method of claim 10 wherein said projecting the first set of points and the second set of points as the light pattern includes pulsing at least one of the first set of points or the second set of points.
19. The method of claim 10, wherein the first set of points are sensed by a right stereo camera and the second set of points are sensed by a left stereo camera.
20. A method comprising: generating a first set of points oriented in a grid as a collection of polygons; assigning different intensities to the first set of points within a first intensity range; generating a second set of points oriented within the polygons of the first set of points; assigning different intensities to the second set of points within a second intensity range that is higher than the first intensity range; projecting the first set of points and the second set of points as the light pattern toward the scenes; rearranging one or more points of the second set of points into modified positions, and sensing the first set of points by one or more stereo cameras.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The present invention is illustrated by way of example and not limited in the accompanying figures in which like reference numerals indicate similar elements and in which:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
DETAILED DESCRIPTION
(11) Various aspects of the technology described herein are generally directed towards having a light pattern projected into a scene, in which the light pattern is configured to provide for enhanced pattern matching, including at different depths to illuminated objects. In one aspect, a light pattern includes intermixed points of light (e.g., spots such as dots) of different intensities. The technology also leverages the depth-dependent appearance of the pattern by having the pattern include points that are semi-randomly distributed.
(12) As will be understood, the peak intensities of neighboring points are different. This results in local changes in intensity independent of the scene depth, to allow stereo matching to function properly.
(13) It should be understood that any of the examples herein are non-limiting. For example, the projected light pattern may use spots, generally exemplified herein as dots, but the dots may be of any shape. As another, the dots are exemplified as arranged according to a triangular grid, however this is only one example, and other arrangements (e.g., a hexagonal grid) may be implemented. Rotation angles of the patterns (described below), different ranges or values of intensity peaks (e.g., for large, medium and small intensities) from those described herein may be used, and so on. As such, the present invention is not limited to any particular embodiments, aspects, concepts, structures, functionalities or examples described herein. Rather, any of the embodiments, aspects, concepts, structures, functionalities or examples described herein are non-limiting, and the present invention may be used various ways that provide benefits and advantages in active depth sensing and image processing in general.
(14)
(15) In
(16)
(17) Note that the placement of the projector 106 may be outside the cameras (e.g.,
(18) In one implementation the example image capturing system or subsystem 104 includes a controller 108 that via a camera interface 110 controls the operation of the cameras 102 and 103. The exemplified controller via a projector interface 112 also controls the operation of the projector 106. For example, the cameras 102 and 103 are synchronized (genlocked) to capture stereo images at the same time, such as by a controller signal (or different signals for each camera). The projector 106 may be turned on or off, pulsed, and otherwise have one or more parameters controllably varied, for example.
(19) The images 116 captured by the cameras 102 and 103 are provided to an image processing system or subsystem 118. In some implementations, the image processing system 118 and image capturing system or subsystem 104, or parts thereof, may be combined into a single device. For example a home entertainment device may include all of the components shown in
(20) The image processing system or subsystem 118 includes a processor 120 and a memory 122 containing one or more image processing algorithms 124. One or more depth maps 126 may be obtained via the algorithms 124 such as by extracting matching features (such as dots and/or lines). For example, as is known, such as described in U.S. published patent application no. 20130100256, hereby incorporated by reference, different dots or other projected elements have different features when captured, including intensity (brightness), depending on the distance from the projector to the reflective surfaces and/or the distance from the camera to the reflective surfaces. As is also known, the dots in different images taken at the same time (e.g., with genlocked stereo cameras) may be correlated with one another, such as by matching small (e.g., RGB) patches between RGB color images of the same scene captured at the same instant. Thus, with captured images, known algorithms can determine individual depth-related features (depth maps) by matching projected light components (e.g., dots) in each image, using disparities of certain features between matched dots to determine depths. This is one way in which a depth map may be obtained via stereo image processing. Also shown in
(21)
(22)
(23) Step 506 represents adding points between the previously generated points, e.g., as smaller sets of triangles (a subdivision) such as shown in
(24) Step 510 evaluates whether subdivision has been completed to the lowest desired level, which is configurable. Thus, by returning to step 506, another subdivision of points may be optionally added, (such as exemplified in
(25) Note that once the intensity-modulated pattern is designed, such as via the example steps of
(26)
(27) As represented n
(28) As represented in
(29) Rotation and intensity distribution is generally shown in the partial image representation 990 of
(30) As can be seen, there is provided a light pattern modulated with different intensities. The pattern may be based upon a grid, and projected such that the cameras that capture the light pattern are not aligned with the grid on which the pattern was based. The intensity-modulated pattern provides for more robust stereo matching/depth sensing.
(31) Example Operating Environment
(32) It can be readily appreciated that the above-described implementation and its alternatives may be implemented on any suitable computing device, including a gaming system, personal computer, tablet, DVR, set-top box, smartphone and/or the like. Combinations of such devices are also feasible when multiple such devices are linked together. For purposes of description, a gaming (including media) system is described as one exemplary operating environment hereinafter.
(33)
(34) The CPU 1002, the memory controller 1003, and various memory devices are interconnected via one or more buses (not shown). The details of the bus that is used in this implementation are not particularly relevant to understanding the subject matter of interest being discussed herein. However, it will be understood that such a bus may include one or more of serial and parallel buses, a memory bus, a peripheral bus, and a processor or local bus, using any of a variety of bus architectures. By way of example, such architectures can include an Industry Standard Architecture (ISA) bus, a Micro Channel Architecture (MCA) bus, an Enhanced ISA (EISA) bus, a Video Electronics Standards Association (VESA) local bus, and a Peripheral Component Interconnects (PCI) bus also known as a Mezzanine bus.
(35) In one implementation, the CPU 1002, the memory controller 1003, the ROM 1004, and the RAM 1006 are integrated onto a common module 1014. In this implementation, the ROM 1004 is configured as a flash ROM that is connected to the memory controller 1003 via a Peripheral Component Interconnect (PCI) bus or the like and a ROM bus or the like (neither of which are shown). The RAM 1006 may be configured as multiple Double Data Rate Synchronous Dynamic RAM (DDR SDRAM) modules that are independently controlled by the memory controller 1003 via separate buses (not shown). The hard disk drive 1008 and the portable media drive 1009 are shown connected to the memory controller 1003 via the PCI bus and an AT Attachment (ATA) bus 1016. However, in other implementations, dedicated data bus structures of different types can also be applied in the alternative.
(36) A three-dimensional graphics processing unit 1020 and a video encoder 1022 form a video processing pipeline for high speed and high resolution (e.g., High Definition) graphics processing. Data are carried from the graphics processing unit 1020 to the video encoder 1022 via a digital video bus (not shown). An audio processing unit 1024 and an audio codec (coder/decoder) 1026 form a corresponding audio processing pipeline for multi-channel audio processing of various digital audio formats. Audio data are carried between the audio processing unit 1024 and the audio codec 1026 via a communication link (not shown). The video and audio processing pipelines output data to an A/V (audio/video) port 1028 for transmission to a television or other display/speakers. In the illustrated implementation, the video and audio processing components 1020, 1022, 1024, 1026 and 1028 are mounted on the module 1014.
(37)
(38) In the example implementation depicted in
(39)
(40) An application 1060 comprising machine instructions is typically stored on the hard disk drive 1008. When the console 1001 is powered on, various portions of the application 1060 are loaded into the RAM 1006, and/or the caches 1010 and 1012, for execution on the CPU 1002. In general, the application 1060 can include one or more program modules for performing various display functions, such as controlling dialog screens for presentation on a display (e.g., high definition monitor), controlling transactions based on user inputs and controlling data transmission and reception between the console 1001 and externally connected devices.
(41) The gaming system 1000 may be operated as a standalone system by connecting the system to high definition monitor, a television, a video projector, or other display device. In this standalone mode, the gaming system 1000 enables one or more players to play games, or enjoy digital media, e.g., by watching movies, or listening to music. However, with the integration of broadband connectivity made available through the network interface 1032, gaming system 1000 may further be operated as a participating component in a larger network gaming community or system.
CONCLUSION
(42) While the invention is susceptible to various modifications and alternative constructions, certain illustrated embodiments thereof are shown in the drawings and have been described above in detail. It should be understood, however, that there is no intention to limit the invention to the specific forms disclosed, but on the contrary, the intention is to cover all modifications, alternative constructions, and equivalents falling within the spirit and scope of the invention.