ANCHOR FOR LINE RECOGNITION
20230230394 ยท 2023-07-20
Inventors
- Azhar Sultan (Tunbridge Wells, GB)
- Joel Janai (Leonberg, DE)
- Tamas Kapelner (Hildesheim, DE)
- Thomas Wenzel (Hamburg, DE)
Cpc classification
G06V10/44
PHYSICS
G06V10/774
PHYSICS
G06V10/7715
PHYSICS
International classification
G06V20/56
PHYSICS
G06V10/44
PHYSICS
G06V10/77
PHYSICS
Abstract
A method for determining at least one anchor for an anchor-based lane line recognition and/or roadway marking recognition in a digital image representation on the basis of sensor data that are obtained from at least one surroundings sensor of a system. The method includes at least the following steps: a) receiving a digital image representation, b) setting at least one row or one column of possible anchors in at least one area of the digital image representation, the row or column of possible anchors being situated at a distance from at least the upper and lower or left and right edge of the area of the digital image representation.
Claims
1. A method for determining at least one anchor for an anchor-based lane line recognition and/or roadway marking recognition in a digital image representation based on sensor data that are obtained from at least one surroundings sensor of a system, the method comprising at least the following steps: a) receiving the digital image representation; and b) setting at least one row or column of possible anchors in at least one area of the digital image representation, the row or column of possible anchors being situated at a distance from at least an upper and lower or left and right edge of the area of the digital image representation.
2. The method as recited in claim 1, wherein the digital image representation includes a feature map or is provided in the form of a feature map.
3. The method as recited in claim 1, wherein the row or column of possible anchors is situated in a central area or at a central point between the upper and lower edge or left and right edge of the area of the digital image representation.
4. The method as recited in claim 1, wherein for at least one possible anchor, the features are combined along a line or along a roadway marking.
5. The method as recited in claim 1, wherein for at least one possible anchor, a column-wise and/or row-wise combining of features is carried out.
6. The method as recited in claim 1, wherein a first line of possible anchors is set in an upper area of the digital image and a second row of possible anchors is set in a lower area of the digital image representation.
7. The method as recited in claim 1, wherein a deep learning algorithm is used for carrying out at least one part of the method.
8. A non-transitory machine-readable memory medium on which is stored a computer program for determining at least one anchor for an anchor-based lane line recognition and/or roadway marking recognition in a digital image representation based on sensor data that are obtained from at least one surroundings sensor of a system, the computer program, when executed by a computer, causing the computer to perform the following steps: a) receiving the digital image representation; and b) setting at least one row or column of possible anchors in at least one area of the digital image representation, the row or column of possible anchors being situated at a distance from at least an upper and lower or left and right edge of the area of the digital image representation.
9. An object recognition system for a vehicle, the system being configured to determine at least one anchor for an anchor-based lane line recognition and/or roadway marking recognition in a digital image representation based on sensor data that are obtained from at least one surroundings sensor of a system, the system being configured to: a) receive the digital image representation; and b) set at least one row or column of possible anchors in at least one area of the digital image representation, the row or column of possible anchors being situated at a distance from at least an upper and lower or left and right edge of the area of the digital image representation.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0027]
[0028]
[0029]
[0030]
[0031]
DETAILED DESCRIPTION OF EXAMPLE EMBODIMENTS
[0032]
[0033] In block 110, a digital image representation 2 is received according to step a), in particular, including a plurality of features 6, which represent the image content. In block 120, at least one row 7 or column 8 of possible anchors 1 is set according to step b) in at least one area 9 of digital representation 2, row 7 or column 8 of possible anchors 1 being situated at a distance from at least the upper and lower or left and right edge 10 of area 9 of digital image representation 2. In block 130, at least one anchor 1 may be determined from the possible anchors 1 according to an optional step c), in particular, based on pieces of information obtained from features 6 of digital image representation 2.
[0034] The present invention advantageously provides better strategies for positioning anchors for the line recognition or for line recognition networks. Instead of the fixed positioning at the edges of the image, it is advantageous to place anchors at a more central point of the feature maps. An advantageous position of the anchors may be determined on the basis of the application and/or the data distribution. One advantageous, efficient and/or effective position for placing anchors for the traffic lane application is the center line as represented, for example, in
[0035] At least one anchor or every anchor 1 may have the form of at least one line or a line-shaped anchor. At least one anchor or every anchor 1 may have a start position (anchor point) 13 and at least one orientation 14. The anchor or every anchor 1 may include a plurality of lines 14 having different orientations. Lines 14 may start at, or extend through, the same start position or the same anchor point 13. The row or column of possible anchors 1 may be formed by a plurality of adjoining boxes 15, each of which includes at least one possible anchor 1/start point 13. At least one anchor or every anchor 1 may have the form of an origin of a coordinate system. The coordinate system may be adapted or adaptable to the intended use of the traffic lane detection and/or roadway marking detection.
[0036]
[0037] It is apparent that boxes 15 are situated relative to a left-hand column 8 and to a right-hand column 8 and to a lower row 7 of possible anchors 1. Furthermore, recognized features 6 such as, for example, lane lines 11 and roadway markings 12, are represented in the illustration according to
[0038]
[0039] The advantageously provided central (more central) position of anchors 1 may have multiple advantages: [0040] The effective receptive box of extracted features 6 may be significantly larger than the effective receptive box at edge 10 (in the case of center row 7, it may double in size), since a smaller portion is situated outside the feature map. This may ensure a better utilization of extracted features 6 from the backbone. [0041] Resulting anchors 1 may have display capabilities similar to the original strategy, since anchors 1 also cover the edge areas as is apparent, for example, in
[0044] Thus,
[0045] The method may operate using data such as, for example, digital images, which may be obtained by receiving sensor signals, for example, from video-, radar-, LIDAR-, ultrasonic-, motion-, infrared images or -sensors.
[0046] The method includes, in particular, the recognition of the presence of objects in the sensor data, in particular, of traffic lanes 11 and/or of other types of line-based road markings 12.
[0047] The method operates using images; thus an image recording may be used, which serves as input for the method.
[0048] For example, digital image representation 2 may include a feature map or may be provided in the form of one.
[0049] Furthermore, it may be advantageous if, in the method, features 6 are combined (pooled) for at least one anchor 1 along a line 11, 14 or roadway marking 12. This may be advantageously applied, for example, to center box 15c and to center roadway marking 12 in
[0050]
[0051] In this context,
[0052] The represents one example of, and optionally of how, a column-wise and/or row-wise combining (pooling) of features 6 may be carried out for at least one anchor 1.
[0053] Previously used anchors 1 are located at the left, right and lower edge of the image (see
[0054] The selection of a more central position for anchors 1, however, has multiple advantages, as mentioned above. For the traffic lane application, in particular, center row 7 provides an advantageously efficient and/or effective position for anchors 1. It may also represent roadway markings 11, 12 that cross left or right image edge 10 and is simultaneously more efficient since fewer anchors 1 may be sufficient.
[0055] One particular advantage is the location of anchors 1. For example, a backbone may be used in order to extract features 6 from an input image. Anchors 1 may be generated or placed, in particular, in center row 7 on final feature map 2 of the backbone.
[0056] Various strategies may be used to detect the pieces of information from the feature map: [0057] cut: it is advantageous that only features 6 of this row 7 are used and the remainder of feature map 2 is ignored. [0058] Combination/pooling along anchors 1: features 6 are advantageously combined along line 11, 14 for each anchor point 13. This may preferably include merging or combining the pieces of information from feature map 6 along anchor line(s) 14. [0059] Column-wise/row-wise combining/pooling: a column-wise or row-wise combining or pooling may be advantageously applied. In the case of anchors 1 in center row 7, a column-wise pooling would collect features 6 from each column defined by anchor position 13 as represented, for example, in
[0060] In addition to an advantageous positioning of anchors 1, it is possible to use multiple anchor positions in order to advantageously specialize in various tasks. A typical problem in the case of traffic lane recognition is the exact recognition of traffic lanes 11, 12 also at a great distance. This could be solved by two anchor rows, one of which is specialized in the lower portion of the image and one in the upper portion of the image.
[0061] This represents one example of, and optionally of how, a first row 7 of possible anchors 1 may be set in an upper area and a second row 7 of possible anchors may be set in a lower area of digital image representation 2.
[0062] One further problem connected with the traffic lane recognition is the recognition of roadsides in the image. While roadsides also appear in the vicinity of roadway markings, there is generally a strong difference in appearance. This may be solved by the creation of anchors 1 in the same center row, one being specialized in roadway markings and the other in roadsides.
[0063] The method may include at least one of the following conditions: [0064] no line-shaped anchors 1 are generated at image edge 10. [0065] a column-wise or row-wise pooling of features 6 is used with line-shaped anchors 1. [0066] line-shaped anchors 1 that are specialized in various tasks are used in parallel.
[0067]
[0068] A deep learning algorithm may be used, for example, for carrying out at least one part of the method, the deep learning algorithm being implemented preferably using at least one artificial neural network, which may be implemented, for example, in object recognition system 4.
[0069] In one specific embodiment, generalized anchors are provided for the traffic lane recognition using deep learning.
[0070] In principle, the method may be used for calculating a control signal for the control of a physical system 4 such as, for example, of a computer-controlled machine, of a robot, of a vehicle 5, of a household appliance, of a power tool, of a manufacturing machine, of a personal assistant or of an access control system.