Method for capturing an object in an environmental region of a motor vehicle with prediction of the movement of the object, camera system as well as motor vehicle
11170232 · 2021-11-09
Assignee
Inventors
- Ciaran Hughes (Tuam, IE)
- Duong-Van Nguyen (Tuam, IE)
- Jonathan Horgan (Tuam, IE)
- Jan Thomanek (Bietigheim-Bissingen, DE)
Cpc classification
B60R11/04
PERFORMING OPERATIONS; TRANSPORTING
B60W30/0956
PERFORMING OPERATIONS; TRANSPORTING
B60W2554/00
PERFORMING OPERATIONS; TRANSPORTING
G06V20/58
PHYSICS
International classification
B60R11/04
PERFORMING OPERATIONS; TRANSPORTING
Abstract
The invention relates to a method for capturing an object (9) in an environmental region (8) of a motor vehicle (1) based on a sequence of images (10, 11) of the environmental region (8), which are provided by means of a camera (4) of the motor vehicle (1), including the steps of: recognizing a first object feature (24) in a first image (10) of the sequence, wherein the first object feature (24) describes at least a part of the object (9) in the environmental region (8), estimating a position of the object (9) in the environmental region (8) based on a predetermined movement model, which describes a movement of the object (9) in the environmental region (8), determining a prediction feature (26) in a second image (11) following the first image (10) in the sequence based on the first object feature (24) and based on the estimated position, determining a second object feature (25) in the second image (11), associating the second object feature (25) with the prediction feature (26) in the second image (11) if a predetermined association criterion is satisfied, and confirming the second object feature (25) as originating from the object (9) if the second object feature (25) is associated with the prediction feature (26).
Claims
1. A method for capturing an object in an environmental region of a motor vehicle based on a sequence of images of the environmental region, which are provided by means of a camera of the motor vehicle, comprising: recognizing a first object feature in a first image of the sequence, wherein the first object feature describes at least a part of the object in the environmental region; estimating a position of the object in the environmental region based on a predetermined movement model, which describes a movement of the object in the environmental region; determining a prediction feature in a second image following the first image in the sequence based on the first object feature and based on the estimated position; determining a second object feature in the second image; associating the second object feature with the prediction feature in the second image if a predetermined association criterion is satisfied; confirming the second object feature as originating from the object if the second object feature is associated with the prediction feature, wherein an association probability between the second object feature and the prediction feature is determined and the predetermined association criterion is deemed as satisfied if the association probability exceeds a predetermined value; and determining an object position in the environmental region based on the second object feature, determining a prediction position in the environmental region based on the prediction feature, determining a spatial similarity between the object position and the prediction position, and determining a current position of the object in the environmental region based on the association probability and the spatial similarity, wherein when the association of the second object feature with the prediction feature is omitted, an association probability between a last confirmed object feature and the second object feature is determined, wherein the last confirmed object feature describes the object feature which was last confirmed as originating from the object, and wherein the prediction feature is determined starting from a position in the environmental region, which is associated with the last confirmed object feature, when the association probability between the last confirmed object feature and the second object feature is greater than the association probability between the second object feature and the prediction feature.
2. The method according to claim 1, wherein the object is recognized as moving relative to the motor vehicle if the second object feature is confirmed as originating from the object.
3. The method according to claim 1, wherein the association probability is determined based on an overlap between the second object feature and the prediction feature in the second image and/or based on dimensions of the second object feature and the prediction feature in the second image and/or based on a distance between the centers of gravity of the second object feature and the prediction feature in the second image and/or based on a distance between the object and a prediction object associated with the prediction feature in the environmental region.
4. The method according to claim 1, wherein when at least two second object features are associated with the prediction feature the current position of the object is determined based on the second object feature, the object position of which has the greater spatial similarity to the prediction position of the prediction feature.
5. The method according to claim 1, wherein when a further object feature is recognized in one of the images, it is examined if the further object feature originates from an object entered the environmental region, wherein the examination is performed based on an entry probability, which depends on a position of the further object feature in the image.
6. The method according to claim 1, wherein the second object feature is determined as a polygon, wherein the polygon has a left base point, a central base point, a right base point and/or a tip point and wherein the polygon describes a width and/or a height of the object.
7. The method according to claim 6, wherein a plurality of regions of interest is determined in the second image, the regions of interest are grouped and the respective polygon is determined based on the grouped regions of interest.
8. The method according to claim 7, wherein the second image is divided into a plurality of image cells, object cells describing a moved object are selected from the image cells based on optical flow, and the object cells are associated with one of the regions of interest.
9. The method according to claim 7, wherein a roadway is recognized in the second image by segmentation and the regions of interest are determined based on the recognized roadway.
10. A camera system for a motor vehicle including at least one camera and an electronic image processing device, wherein the camera system is adapted to perform a method according to claim 1.
11. A motor vehicle with a camera system according to claim 10.
Description
(1) There show:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
(19)
(20)
(21) In the figures, identical and functionally identical elements are provided with the same reference characters.
(22)
(23) With the aid of the camera system 2, objects 9 in an environmental region 8 of the motor vehicle 1 can be captured. Hereto, a sequence of images 10, 11 is provided by each of the cameras 4. This sequence of images 10, 11 is then transmitted to an electronic image processing device 3 of the camera system 2. The objects 9 in the environmental region 8 can then be recognized in the images 10, 11 with the aid of the electronic image processing device 3.
(24) In particular, moved objects 9 in the environmental region 8 are to be recognized with the aid of the camera system 2. Hereto, a method of three-dimensional image processing is used. As explained in more detail below, first, regions of interest 16 are determined in the images 10, 11, which describe a moved object 9. Subsequently, object features 24, 25 are determined in the images 10, 11 based on the regions of interest 16, which describe the object 9 in more detail. Therein, it is further provided that the movement of the object 9 is tracked.
(25)
(26)
(27) The image 10, 11 shows the object 9, which is located in the environmental region 8. The object 9 is a moving object in the form of a pedestrian. This object 9 is now to be recognized in the image 10, 11. Hereto, an optical flow or a flow vector is determined in each of the image cells 12, which describes the movement of an object 9. If a flow vector has been determined with a sufficient confidence value, that image cell 12 is recognized as the object cell 12′ and identified in the weighting matrix or a value associated with the object cell 12′ in the weighting matrix is varied. Therein, the threshold value for a sufficient confidence value depends on the respective region 15 in the image 10, 11. Hereto,
(28)
W⊕H=U.sub.p∈WH.sub.p=U.sub.q∈HW.sub.q.
(29) Therein, H.sub.p describes the structuring element H, which has been shifted by p. W.sub.q describes the weighting matrix W, which has been shifted by q. Herein, q and p describe the directions. Therein, the structuring element H is a 3×3 matrix. The result of the dilation is represented on the right side of
(30)
(31) Based on the object cells 12′, which have been recognized as originating from a moved object 9, regions of interest 16 are now to be determined. This is explained in connection with
w.sub.ROI=II(x+w,y+h)−II(x+w,y)−II(x,y+h)+II(x,y).
(32) Herein, x and y describe the position of the lower left edge of the region of interest, w and h describe the width and the height of the region of interest 16. If the weighted sum w.sub.ROI is greater than a threshold value, the region of interest 16 is marked as a hypothesis. If the region of interest 16 is marked as a hypothesis, the search for further regions of interest 16 in the current column is aborted and continued in the next column. As indicated in
(33) For each of the columns, it is examined if a rectangle 18 can be formed from the sliding window 17, which includes the object cells 12′. Therein, it is further provided that the regions of interest 16 are corrected. Hereto,
(34) Further,
(35) Furthermore, it is provided that a roadway 19 is recognized in the image 10, 11. Hereto, a segmentation method is used. The roadway 19 can be recognized in the image 10, 11 with the aid of the segmentation method. Moreover, a boundary line 20 between the roadway 19 and the object 9 can be determined. Based on this boundary line 20, the rectangles 18 describing the regions of interest 16 can then be adapted. In this example, the rectangles 18 are downwards corrected. Presently, this is illustrated by the arrows 21.
(36) Furthermore, it is provided that the respective regions of interest 16 are grouped. This is explained in connection with
(37)
(38) The association of already existing and tracked objects 9 and newly captured objects is performed both within the images 10, 11 and in the real world. Therein, the steps S6 to S8 are performed within the sequence of images 10, 11. This is illustrated in
(39) The determination of the object feature 24, 25 according to the step S6 is illustrated in
(40)
(41) Further, an area A of the polygon 28 can be determined according to the following formula:
(42)
(43) Therein, (x.sub.i, y.sub.i), (x.sub.i+1, y.sub.i+1) are coordinates of two adjacent points of the polygon 28. N is the number of points of the polygon 28.
(44)
(45)
(46) In a second image 11, which follows the first image 10 in time, the prediction feature 26 is determined based on the first object feature 24. Presently, a picture 9′ of the object 9 or of the pedestrian is shown in the second image 11. The first object feature 24 determined in the first image 10 is presently shown dashed in the second image 11. For determining the prediction feature 26, a linear movement model is used, which describes the speed v of the object 9. Thus, it can be determined, in which position P2 the object 9 is at a point of time t1+Δt.
(47) For describing the movement of the object 9, a Kalman filter is used. Herein, it is assumed that the object 9 moves with a constant speed v. Hereto, a state vector {circumflex over (x)}.sub.k−1|k−1 and a corresponding state matrix P.sub.k−1|k−1 can be defined:
{circumflex over (x)}.sub.k−1|k−1=A.Math.{circumflex over (x)}.sub.k−1|k−1
P.sub.k|k−1=A.Math.P.sub.k−1|k−1.Math.A.sup.T+Q.
(48) Herein, A describes the system matrix. {circumflex over (x)}.sub.k−1|k−1 describes the state vector for the preceding point of time or for the first image 10. P.sub.k−1|k−1 describes the state matrix for the preceding point of time or for the first image 10. Q is a noise matrix, which describes the error of the movement model and the differences between the movement model and the movement of the object 9 in the real world.
(49) In the second image 11, which follows the first image 10 in time, a second object feature 25 can be determined based on the regions of interest 16. Now, it is to be examined if the second object feature 25 can be associated with the prediction feature 26. Hereto,
(50) All of these criteria shown in
p.sub.j=Σw.sub.mq.sub.m.
(51) If the association probability p.sub.j exceeds a predetermined threshold value, the second object feature 25 can be associated with the prediction feature 26. That is, it is confirmed that the second object feature 25 describes the object 9 in the environmental region 8.
(52) In real scenes or traffic situations, it is usually the case that a moved object 9, in particular a pedestrian, changes its direction of movement or its speed. Since the object features 24, 25 have been determined based on the optical flow, it can be the case that an object feature 24, 25 cannot be determined if the object 9 or the pedestrian currently stands still. Further, it can be the case that the moved object 9 changes its direction of movement.
(53) This is illustrated in connection with
(54) Moreover, it is provided that a spatial similarity between a prediction position P2 describing the position of the object 9 based on the movement model is determined. This is illustrated in
{circumflex over (x)}.sub.k|k.sup.j={circumflex over (x)}.sub.k|k−1+K(z.sub.k.sup.j−{circumflex over (z)}.sub.k)
P.sub.k|k=P.sub.k|k−1−KHP.sub.k|k−1.
(55) Therein, z.sub.k describes the data vector of the measurement or of the second object feature 25. {circumflex over (z)}.sub.k describes the expected data vector. K describes the Kalman gain, which can be determined according to the following formula:
(56)
(57) Herein, H describes a measurement matrix for generating the object features 24, 25 based on the movement model, and R describes a noise matrix, which describes the variation of the polygon 28 in the image 10, 11. The system model can then be determined according to the following formula, wherein w describes a weighting factor:
(58)
(59)
(60) In the images 10, 11, further object features can be recognized. Therein, it is examined if it is a new object or an object 9, which has entered the environmental region 8. Hereto, an entry probability is taken into account. Hereto,
(61) If a new object or a new object feature has been recognized in the images 10, 11, this can be correspondingly tracked. In the same manner, it can be determined if an object 9 has exited the environmental region 8 and thus can no longer be captured in the images 10, 11. Here too, an exit probability can be defined analogously to the entry probability.
(62) Overall, thus, moved objects 9 in an environmental region 8 of the motor vehicle 1 can be reliably recognized and tracked.