Method and a Device for Assigning a Bounding Box to an Object
20220309760 · 2022-09-29
Inventors
Cpc classification
G06V10/457
PHYSICS
G01S7/4802
PHYSICS
G06V20/56
PHYSICS
G06V10/762
PHYSICS
G06V10/25
PHYSICS
International classification
G06V10/25
PHYSICS
G06V10/44
PHYSICS
G06V10/762
PHYSICS
Abstract
A method is provided for assigning a bounding box to an object in an environment of a vehicle. Data related to objects located in the environment of the vehicle are acquired via a sensor. Based on the data, a respective spatial location and a respective size of a plurality of preliminary bounding boxes are determined such that each preliminary bounding box covers one of the objects at least partly. A respective velocity of each preliminary bounding box is estimated based on the data. A subset of the plurality of preliminary bounding boxes being related to a respective one of the objects is selected, where the subset is selected based on the respective velocity of each of the preliminary bounding boxes. A final bounding box is assigned to the respective one of the objects by merging the preliminary bounding boxes of the corresponding subset.
Claims
1. A computer implemented method comprising: acquiring, via a sensor, sensor data related to objects located in an environment of a vehicle; determining, via a processing unit and based on the sensor data, a respective spatial location and a respective size of a plurality of preliminary bounding boxes such that each preliminary bounding box covers one of the objects at least partly; estimating a respective velocity of each preliminary bounding box based on the sensor data; selecting a subset of the plurality of preliminary bounding boxes that are related to a respective one of the objects, the subset selected based on the respective velocity of each of the preliminary bounding boxes; and assigning a final bounding box to the respective one of the objects by merging the preliminary bounding boxes included in the subset.
2. The method according to claim 1, wherein: a direction of the velocity is determined for each preliminary bounding box; a respective size-modified box is generated for each preliminary bounding box by: shrinking the preliminary bounding box in the direction of the velocity and in a lateral direction perpendicularly to the direction of the velocity; and extending the preliminary bounding box in the direction of the velocity based on an absolute value of the velocity estimated for the preliminary bounding box; and preliminary bounding boxes having overlapping size-modified boxes are selected for the respective subset.
3. The method according to claim 2, wherein the shrinking of the preliminary bounding box includes reducing a length and a width of the respective preliminary bounding box by a same factor.
4. The method according to claim 2, wherein the extending of the preliminary bounding box includes multiplying a length of the respective preliminary bounding box in the direction of the velocity by the absolute value of the velocity and by an extend factor.
5. The method according to claim 2, wherein selecting the subset of the plurality of preliminary bounding boxes includes applying a maximal connected subgraph algorithm to all preliminary bounding boxes of the plurality of preliminary bounding boxes.
6. The method according to claim 2, wherein two of the size-modified boxes are regarded as overlapping if their intersection over union is greater than a predefined threshold.
7. The method according to claim 1, wherein: a centroid is estimated for the preliminary bounding boxes of a respective subset; farthest vertices having a greatest distance to the centroid are determined for the preliminary bounding boxes of the subset; and the final bounding box is determined based on the farthest vertices.
8. The method according to claim 1, wherein a velocity of the final bounding box is determined by averaging estimated velocities over a corresponding subset of preliminary bounding boxes.
9. The method according to claim 1, further comprising: determining an attribute for each of the preliminary bounding boxes; and based on the attribute, determining whether a selected preliminary bounding box is excluded from the corresponding subset.
10. The method according to claim 9, wherein: a probability distribution is assigned to values of the attribute of the preliminary bounding boxes belonging to a respective subset; and the selected preliminary bounding box is excluded from the corresponding subset if a probability assigned to the value of the attribute of the selected preliminary bounding box is smaller than a predetermined threshold.
11. The method according to claim 1, wherein a machine learning algorithm is applied to the sensor data for determining the spatial location and the size of each preliminary bounding box.
12. A device comprising: a sensor configured to acquire sensor data related to objects located in an environment of a vehicle; and a processing unit configured to: determine, based on the sensor data, a respective spatial location and a respective size of a plurality of preliminary bounding boxes such that each preliminary bounding box covers one of the objects at least partly; estimate a respective velocity of each preliminary bounding box based on the sensor data; select a subset of the plurality of preliminary bounding boxes being related to a respective one of the objects, the subset selected based on the respective velocity of each of the preliminary bounding boxes; and assign a final bounding box to the respective one of the objects by merging the preliminary bounding boxes included in the subset.
13. The device according to claim 12, wherein the sensor includes at least one of a radar sensor or a Lidar sensor.
14. The device according to claim 12, wherein: a direction of the velocity is determined for each preliminary bounding box; a respective size-modified box is generated for each preliminary bounding box by: shrinking the preliminary bounding box in the direction of the velocity and in a lateral direction perpendicularly to the direction of the velocity; and extending the preliminary bounding box in the direction of the velocity based on an absolute value of the velocity estimated for the preliminary bounding box; and preliminary bounding boxes having overlapping size-modified boxes are selected for the respective subset.
15. The device according to claim 14, wherein the processing unit is configured to shrink preliminary bounding box by reducing a length and a width of the respective preliminary bounding box by a same factor.
16. The device according to claim 14, wherein the processing unit is configured to extend the preliminary bounding box by multiplying a length of the respective preliminary bounding box in the direction of the velocity by the absolute value of the velocity and by an extend factor.
17. The device according to claim 14, wherein the processing unit is configured to select the subset of the plurality of preliminary bounding boxes includes applying a maximal connected subgraph algorithm to all preliminary bounding boxes of the plurality of preliminary bounding boxes.
18. The device according to claim 14, wherein two of the size-modified boxes are regarded as overlapping if their intersection over union is greater than a predefined threshold.
19. The device according to claim 12, wherein the processing unit is configured to: estimate a centroid for the preliminary bounding boxes of a respective subset; determine farthest vertices having a greatest distance to the centroid for the preliminary bounding boxes of the subset; and determine the final bounding box based on the farthest vertices.
20. Non-transitory computer readable medium comprising instructions that, when executed by a processor, cause the processor to perform operations including: determining, based on sensor data acquired by a sensor and related to objects located in an environment of a vehicle, a respective spatial location and a respective size of a plurality of preliminary bounding boxes such that each preliminary bounding box covers one of the objects at least partly; estimating a respective velocity of each preliminary bounding box based on the sensor data; selecting a subset of the plurality of preliminary bounding boxes that are related to a respective one of the objects, the subset selected based on the respective velocity of each of the preliminary bounding boxes; and assigning a final bounding box to the respective one of the objects by merging the preliminary bounding boxes included the subset.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0040] Exemplary embodiments and functions of the present disclosure are described herein in conjunction with the following drawings, showing schematically:
[0041]
[0042]
[0043]
[0044]
[0045]
[0046]
[0047]
DETAILED DESCRIPTION
[0048]
[0049] The sensor 13 is a radar sensor and/or a Lidar sensor and is configured for monitoring the environment of the vehicle 10. That is, the sensor 13 is configured to provide data related to objects 17, which are located in the environment of the vehicle 10. The data provided by the sensor 13 are transferred to the processing unit 15 that is configured to generate a respective bounding box 19 related to each object 17.
[0050] It is noted that the objects 17 are depicted in
[0051]
[0052] Usually, a conventional perception system is based on a machine learning algorithm trained by a ground truth comprising a majority of small vehicles, e.g. cars since the number of cars is usually much higher in a typical environment of the vehicle 10 than the number of trucks and busses. As a consequence, the conventional perception system predicts a plurality of small bounding boxes 25, each of which covers the ground truth for the bendy bus only partly. As can be seen in
[0053] Although each single predicted bounding box 25 has a bad quality when compared with the ground truth 23, the entire set of predicted bounding boxes 25 is able to cover the ground truth 23 much better. This is a motivation for one of the principles according to the present disclosure, e.g. merging a certain number of small bounding boxes 25 in order to generate one big bounding box.
[0054] This is illustrated in
[0055] Based on the group of preliminary bounding boxes 27, a final bounding box 29 is generated, which is shown on the respective right side in
[0056] In the maximal connected subgraph algorithm, one important step is the computation of a so-called adjacency matrix, which represents the similarity between every pair of preliminary bounding boxes 27. The intersection over union (IoU), which is known in the art, is used as a measure for the similarity of two preliminary bounding boxes 27. Usually, a threshold is defined for the intersection over union in order to decide whether two objects, e.g. two preliminary bounding boxes 27, belong together or not. For the present case, e.g. for a plurality of preliminary bounding boxes 27, it is not easy to define a general threshold, as will be explained in detail below. One reason is that one big object has to be distinguished from two or more smaller objects, which are located closely to each other, e.g. having a small gap between them only such that the preliminary bounding boxes 27 overlap for both objects. After defining the clusters or subsets of the preliminary bounding boxes 27, which belong to a certain object 17 in the environment of the vehicle 10 (see
[0057] Challenging scenarios for generating a proper final bounding box 29 are depicted in
[0058] In the example of
[0059] In order to overcome such concerns, the device 11 and a method according to the disclosure additionally consider the respective velocity of assumed objects that is reflected in the respective velocity of the preliminary bounding boxes 27. Generally, the velocity of the object 17 (see
[0060] Along the moving direction of the objects 17, e.g. along the direction of their velocity vector, it is quite unlikely for two fast moving vehicle that they are close to each other. In other words, it is more likely that preliminary bounding boxes 27, which are close to each other in the moving direction of an object, belong to one big object even if the preliminary bounding boxes do not overlap (as shown in
[0061] In order to consider the velocity of objects when generating bounding boxes, a shrink step and an extend step are applied to the preliminary bounding boxes 27. The shrink step is performed in moving direction and in lateral direction, whereas the extend step is performed along the moving direction only.
[0062] Before performing the shrink step and the extend step, a respective velocity vector 31 is estimated for each preliminary bounding box 27 based on the data from the sensor 13 (see
Shrink step: L=L*shrink factor,W=W*shrink factor;
Extend step: L=L*velocity*extend factor
[0063] L and W denote the length and the width of the respective preliminary bounding box 27, e.g. in longitudinal or moving direction and in lateral direction, respectively. The shrink factor is a number between 0 and 1. In the formula of the extend step, the term “velocity” denotes the absolute value of the respective velocity vector 31. The product of the velocity and the extend factor is a number greater than 1.
[0064] The shrink factor and the extend factor may be suitably defined during a training procedure for the entire device 11. That is, both factors are determined empirically by applying the device 11 and the method to realistic scenarios for which the device 11 and the method are intended to be used.
[0065] For the examples as shown in
[0066] In the examples as shown in
[0067] In contrast, for the example as shown in
[0068] It is noted that the size-modified boxes 33 in the middle subgraphs of
[0069] The scheme for determining proper final bounding boxes 29 as described so far might be deteriorated by outliers of the prediction of the preliminary bounding boxes 27. in
[0070] Therefore, an additional outlier removal procedure is applied to the preliminary bounding boxes 27, which are the result of the original prediction. For the outlier removal, the respective distribution is modeled for the attributes of the preliminary bounding boxes, where these attributes include the size, the orientation and/or the velocity of the preliminary bounding boxes 27. That is, based on values for these attributes, a probability distribution is modeled. Thereafter, a probability value can be assigned to each attribute of each preliminary bounding box 27. If the respective probability value of at least one of the attributes of a certain preliminary bounding box 27 is lower than a predetermined threshold, the corresponding preliminary bounding box 27 is considered as an outlier box 37. The outlier boxes 37 are excluded from any subset selection and are therefore not considered for determining the final bounding boxes 29.
[0071] When all clusters or subsets of preliminary bounding boxes 27 are identified, the preliminary bounding boxes 27 belonging to the respective cluster or subset are merged as is schematically illustrated in
[0072] In order to merge a subset of preliminary bounding boxes 27, so-called farthest vertices 39 are identified for each subset. The farthest vertices 39 have the largest distances, e.g. a larger distance than the further vertices 21, with respect to a centroid of the entire subset of preliminary bounding boxes 27. In the examples of
[0073] In addition, further attributes of the final bounding box 29 can be determined as an average over all preliminary bounding boxes 27 belonging to the subset that has been used for generating the final bounding box 29. For example, the velocity vector of the final bounding box 29 is determined as an average of all velocity vectors 31 determined for the preliminary bounding boxes 27, which have been used for generating the final bounding box 29.