Plausibility check of the output of neural classifier networks based on additional information about features
11615274 · 2023-03-28
CPC classification
G06F18/217 (PHYSICS)
G06V30/194 (PHYSICS)
Abstract
A method for a plausibility check of the output of an artificial neural network (ANN) utilized as a classifier. The method includes: a plurality of images for which the ANN has ascertained an association with one or multiple classes of a predefined classification, and the association that is ascertained in each case by the ANN, are provided; for each image at least one feature parameter is determined which characterizes the type, the degree of specificity, and/or the position of at least one feature contained in the image; for each combination of an image and an association, a spatially resolved relevance assessment of the image is ascertained by applying a relevance assessment function; a setpoint relevance assessment is ascertained for each combination, using the feature parameter; a quality criterion for the relevance assessment function is ascertained based on the agreement between the relevance assessments and the setpoint relevance assessments.
Claims
1. A method for a plausibility check of an output of an artificial neural network (ANN) that is used as a classifier, the method comprising the following steps: providing a plurality of images for which the ANN has ascertained an association with one or multiple classes of a predefined classification, and the association that is ascertained for each of the images by the ANN; determining, for each of the images, at least one feature parameter which characterizes a type, and/or a degree of specificity, and/or a position, of at least one feature contained in the image; for each combination of an image of the images and an association of the associations, ascertaining a spatially resolved relevance assessment of the image by applying a relevance assessment function, the relevance assessment indicating which portions of the image, and to what extent, have contributed to the association; ascertaining a setpoint relevance assessment for each combination of an image of the images and an association of the associations, using the feature parameter; and ascertaining a quality criterion for the relevance assessment function based on an agreement between the relevance assessments and the setpoint relevance assessments, wherein based on a first combination of a first image of the images and a first association of the associations, a spatially resolved relevance assessment of the first combination, and a first feature parameter of the first image, a second setpoint relevance assessment for a second combination of a second image and a second association, and for a second feature parameter of the second image, is ascertained by updating the spatially resolved relevance assessment based on a difference between the first feature parameter and a second feature parameter.
2. The method as recited in claim 1, wherein the second image arises from the first image by a transformation and/or processing which leaves the feature parameter unchanged or changes it to a new feature parameter.
3. The method as recited in claim 1, wherein at least one image is synthetically generated by specifying a particular feature parameter.
4. The method as recited in claim 1, wherein at least one feature parameter of the feature parameters is evaluated from measured data that have been detected using at least one physical sensor that is different from a sensor that is used for recording the respective image.
5. The method as recited in claim 1, wherein the quality criterion is ascertained for a selection of multiple candidate relevance assessment functions, and a particular candidate relevance assessment function of the multiple candidate relevance assessment functions that has a best value of the quality criterion is selected as the relevance assessment function.
6. The method as recited in claim 5, wherein a relevance assessment function or candidate relevance assessment function whose quality criterion is poorer than the quality criterion ascertained for a comparative relevance assessment function is discarded as implausible.
7. The method as recited in claim 1, wherein the quality criterion is additionally ascertained for: an identical depiction of the image, and/or an area filled with random values, and/or an area filled with a constant value, and/or a semantic segmentation of the image, and/or an edge detection from the image as the spatially resolved comparative relevance assessment.
8. The method as recited in claim 1, wherein a parameterized approach with free parameters is established for the relevance assessment function, and parameters of the approach are optimized with an objective that the quality criterion of the relevance assessment function assumes an extreme value.
9. The method as recited in claim 1, wherein a plausibility of the output of the ANN is evaluated based on the relevance assessment function and/or based on the quality criterion of the relevance assessment function and/or based on a relevance assessment that is ascertained with the relevance assessment function.
10. The method as recited in claim 9, wherein the plausibility is output to a user of the ANN via a display.
11. The method as recited in claim 1, wherein the ANN is a convolutional ANN, and the relevance assessment function involves a weighted sum of activation maps of multiple convolution kernels which, in a layer of the ANN, are applied to the image or to a processing product of the image.
12. A method for a plausibility check of an output of an artificial neural network (ANN) that is used as a classifier, the method comprising the following steps: providing at least one image for which the ANN has ascertained an association with one or multiple classes of a predefined classification, and the association that is ascertained by the ANN; for a combination of the image and the association, ascertaining a spatially resolved relevance assessment of the image by applying a relevance assessment function, the relevance assessment indicating which portions of the image, and to what extent, have contributed to the association; ascertaining a correlation between: (i) the relevance assessment, and (ii) a semantic segmentation of the image and/or an edge detection from the image; and evaluating the correlation as a measure for the plausibility of the output of the ANN, wherein the relevance assessment function is selected and/or formed during the course of a method including: providing a plurality of images for which the ANN has ascertained an association with one or multiple classes of a predefined classification, and the association that is ascertained for each of the images by the ANN; determining, for each of the images, at least one feature parameter which characterizes a type, and/or a degree of specificity, and/or a position, of at least one feature contained in the image of the images; for each combination of an image of the images and an association of the associations, ascertaining a second spatially resolved relevance assessment of the image of the images by applying a second relevance assessment function, the relevance assessment indicating which portions of the image of the images, and to what extent, have contributed to the association; ascertaining a setpoint relevance assessment for each combination of an image of the images and an association of the associations, using the feature parameter; and ascertaining a quality criterion for the second relevance assessment function based on an agreement between the second relevance assessments and the setpoint relevance assessments; wherein the quality criterion is ascertained for a selection of multiple candidate relevance assessment functions, and a particular candidate relevance assessment function of the multiple candidate relevance assessment functions that has a best value of the quality criterion is selected as the relevance assessment function.
13. The method as recited in claim 12, wherein in response to the correlation falling below a predefined threshold value, a technical system that acts at least semi-automatedly based on the association ascertained by the ANN is activated in such a way that disadvantageous consequences of an incorrect association are reduced.
14. The method as recited in claim 12, wherein a first image is selected that has arisen by observing surroundings of a vehicle, and in response to the correlation falling below a predefined threshold value, the vehicle is controlled in such a way that: at least one additional physical sensor for observing the surroundings of the vehicle is activated; and/or a travel velocity of an at least semi-automatedly driving vehicle is reduced; and/or a driving assistance system and/or a system for the at least semi-automated driving of the vehicle is completely or partially deactivated; and/or an at least semi-automatedly driving vehicle is brought to a standstill on a preplanned emergency stop trajectory.
15. A non-transitory machine-readable data medium on which is stored a computer program for a plausibility check of an output of an artificial neural network (ANN) that is used as a classifier, the computer program, when executed by a computer, causing the computer to perform the following steps: providing a plurality of images for which the ANN has ascertained an association with one or multiple classes of a predefined classification, and the association that is ascertained for each of the images by the ANN; determining, for each of the images, at least one feature parameter which characterizes a type, and/or a degree of specificity, and/or a position, of at least one feature contained in the image; for each combination of an image of the images and an association of the associations, ascertaining a spatially resolved relevance assessment of the image by applying a relevance assessment function, the relevance assessment indicating which portions of the image, and to what extent, have contributed to the association; ascertaining a setpoint relevance assessment for each combination of an image of the images and an association of the associations, using the feature parameter; and ascertaining a quality criterion for the relevance assessment function based on an agreement between the relevance assessments and the setpoint relevance assessments, wherein based on a first combination of a first image of the images and a first association of the associations, a spatially resolved relevance assessment of the first combination, and a first feature parameter of the first image, a second setpoint relevance assessment for a second combination of a second image and a second association, and for a second feature parameter of the second image, is ascertained by updating the spatially resolved relevance assessment based on a difference between the first feature parameter and a second feature parameter.
16. A computer configured to plausibility check an output of an artificial neural network (ANN) that is used as a classifier, the computer configured to: provide a plurality of images for which the ANN has ascertained an association with one or multiple classes of a predefined classification, and the association that is ascertained for each of the images by the ANN; determine, for each of the images, at least one feature parameter which characterizes a type, and/or a degree of specificity, and/or a position, of at least one feature contained in the image; for each combination of an image of the images and an association of the associations, ascertain a spatially resolved relevance assessment of the image by applying a relevance assessment function, the relevance assessment indicating which portions of the image, and to what extent, have contributed to the association; ascertain a setpoint relevance assessment for each combination of an image of the images and an association of the associations, using the feature parameter; and ascertain a quality criterion for the relevance assessment function based on an agreement between the relevance assessments and the setpoint relevance assessments, wherein based on a first combination of a first image of the images and a first association of the associations, a spatially resolved relevance assessment of the first combination, and a first feature parameter of the first image, a second setpoint relevance assessment for a second combination of a second image and a second association, and for a second feature parameter of the second image, is ascertained by updating the spatially resolved relevance assessment based on a difference between the first feature parameter and a second feature parameter.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
DETAILED DESCRIPTION OF EXAMPLE EMBODIMENTS
(7) Images 2, as well as associations 3 of these images 2 with classes 3a through 3c of a predefined classification, are provided in step 110.
(8) For each image, at least one feature parameter 8 that characterizes the type, the degree of specificity, and/or the position of at least one feature contained in image 2 is determined in step 120.
(9) This may take place according to block 121, for example, in that at least one image 2, 2′ is synthetically generated by specifying particular feature parameter 8, 8′.
(10) Alternatively or in combination therewith, according to block 122, for example, at least one feature parameter 8, 8′ may be evaluated from measured data 8a. These measured data 8a have been detected using at least one physical sensor that is different from the sensor used for recording image 2, 2′.
(11) Combinations of an image 2 and an association 3 in each case are processed in step 130 to form a relevance assessment 2a of image 2 with the aid of a relevance assessment function 4. This relevance assessment 2a indicates which portions of image 2, and to what extent, have contributed to association 3.
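For the convolutional case (cf. claim 11), relevance assessment function 4 may involve a weighted sum of activation maps of multiple convolution kernels. The following NumPy sketch illustrates one possible realization under simplifying assumptions; the function name `relevance_map`, the clipping to non-negative values, and the nearest-neighbor upsampling are illustrative choices, not prescribed by the method:

```python
import numpy as np

def relevance_map(activation_maps, weights, image_shape):
    """Sketch of a spatially resolved relevance assessment formed as a
    weighted sum of activation maps of multiple convolution kernels.

    activation_maps: array of shape (K, h, w), one map per kernel
    weights: array of shape (K,), one weight per kernel
    image_shape: (H, W) of the input image; H, W assumed divisible by h, w
    """
    # weighted sum over the kernel axis -> one (h, w) map
    combined = np.tensordot(weights, activation_maps, axes=1)
    # keep only positively contributing portions (illustrative choice)
    combined = np.maximum(combined, 0.0)
    if combined.max() > 0:
        combined = combined / combined.max()  # normalize to [0, 1]
    # upsample to image resolution by block repetition (nearest neighbor)
    _, h, w = activation_maps.shape
    H, W = image_shape
    return np.kron(combined, np.ones((H // h, W // w)))
```

The resulting map assigns each image portion a value indicating how strongly it contributed to the association, matching the role of relevance assessment 2a described above.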
(12) For each combination of an image 2 and an association 3, a setpoint relevance assessment 2a* is ascertained in step 140, using feature parameter 8. This setpoint relevance assessment 2a* represents the spatially resolved relevance assessment 2a that a relevance assessment function 4 suitable for the specific application should supply for the specific image 2 and the specific association 3.
(13) For example, according to block 141, based on a first combination of a first image 2 and a first association 3, a spatially resolved relevance assessment 2a of this combination, and a first feature parameter 8 of first image 2, setpoint relevance assessment 2a* is ascertained for a second combination of a second image 2′ and a second association 3′, and for a second feature parameter 8′ of second image 2′, by updating spatially resolved relevance assessment 2a based on the difference between first feature parameter 8 and second feature parameter 8′. It may thus be tested whether the examined relevance assessment function 4 is invariant or equivariant under certain changes of feature parameter 8, 8′.
(15) For this purpose, for example according to block 141a, second image 2′ may arise from first image 2 by a transformation and/or processing which leave(s) feature parameter 8 unchanged or change(s) it to a new feature parameter 8′ in a conventional manner. The synthetic generation of images discussed above may also be used for the update.
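When the difference between feature parameters 8 and 8′ is a pure translation of a feature, the update described in block 141 can be sketched by shifting the first relevance assessment accordingly. The sketch below assumes positional feature parameters given as (row, column) coordinates; this representation and the function name are illustrative assumptions:

```python
import numpy as np

def updated_setpoint(relevance_a, pos_a, pos_b):
    """Derive a setpoint relevance assessment 2a* for a second image from
    the relevance assessment 2a of a first image by applying the positional
    difference between the two feature parameters (here: a translation).

    relevance_a: (H, W) relevance map of the first image
    pos_a, pos_b: (row, col) positions of the feature in the two images
    """
    drow, dcol = pos_b[0] - pos_a[0], pos_b[1] - pos_a[1]
    setpoint = np.zeros_like(relevance_a)
    H, W = relevance_a.shape
    for r in range(H):
        for c in range(W):
            rr, cc = r + drow, c + dcol
            if 0 <= rr < H and 0 <= cc < W:
                # relevance mass moves along with the feature;
                # portions shifted out of the image are discarded
                setpoint[rr, cc] = relevance_a[r, c]
    return setpoint
```

A relevance assessment function 4 that is equivariant under the movement should then produce, for the second image, a map close to this shifted setpoint.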
(16) A quality criterion 4a for relevance assessment function 4 is ascertained in step 150, based on the agreement between relevance assessments 2a and setpoint relevance assessments 2a*.
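The "agreement" underlying quality criterion 4a can be made concrete in many ways; one simple illustrative choice, not prescribed by the method, is the average cosine similarity between each relevance assessment 2a and its setpoint 2a*:

```python
import numpy as np

def quality_criterion(relevances, setpoints, eps=1e-9):
    """Sketch of quality criterion 4a: mean cosine similarity between the
    relevance assessments and the setpoint relevance assessments, taken
    over all image/association combinations. Higher means better agreement."""
    scores = []
    for rel, tgt in zip(relevances, setpoints):
        a, b = rel.ravel(), tgt.ravel()
        scores.append(float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + eps)))
    return sum(scores) / len(scores)
```

Any other similarity measure over spatially resolved maps (e.g., a structural similarity index) could serve the same role.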
(17) Box 150 contains various exemplary embodiments indicating how quality criterion 4a may at the same time be used to find a relevance assessment function 4 that is particularly well suited to the application at hand.
(18) According to block 151, quality criterion 4a may be ascertained for a selection of multiple candidate relevance assessment functions 4*. According to block 152, a candidate relevance assessment function 4* having the best value of the quality criterion may then be selected as relevance assessment function 4.
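The selection described in blocks 151 and 152 can be sketched as follows; the call signature of the candidate functions (image and association in, relevance map out) and the agreement measure are illustrative assumptions:

```python
import numpy as np

def select_candidate(candidates, images, associations, setpoints):
    """Blocks 151/152 sketch: evaluate quality criterion 4a for every
    candidate relevance assessment function 4* and keep the best one.

    candidates: list of callables f(image, association) -> relevance map
    """
    def quality(f):
        vals = []
        for img, assoc, tgt in zip(images, associations, setpoints):
            rel = f(img, assoc).ravel()
            t = tgt.ravel()
            vals.append(rel @ t / (np.linalg.norm(rel) * np.linalg.norm(t) + 1e-9))
        return sum(vals) / len(vals)

    scores = [quality(f) for f in candidates]
    best = int(np.argmax(scores))
    return candidates[best], scores[best]
```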
(19) According to block 153, quality criterion 4a may additionally be ascertained for the identical depiction of image 2, and/or an area filled with random values, and/or an area filled with a constant value, and/or a semantic segmentation 2b of image 2, and/or an edge detection 2c from image 2, as spatially resolved comparative relevance assessment 2a**.
(21) According to block 154, a relevance assessment function 4 or candidate relevance assessment function 4* whose quality criterion 4a is poorer than quality criterion 4a ascertained for comparative relevance assessment 2a** may then be discarded as implausible.
(22) According to block 155, a parameterized approach 4′ with free parameters may be established for relevance assessment function 4. The parameters of this approach 4′ may then be optimized according to block 156, with the objective that quality criterion 4a of relevance assessment function 4 assumes an extreme value.
(23) Relevance assessment function 4 and/or quality criterion 4a of this relevance assessment function and/or a relevance assessment 2a that is ascertained with this relevance assessment function 4 may be utilized in step 160 to evaluate a plausibility 6 of the output of ANN 1. However, this is optional. Method 100 may also be applied, for example, for the sole purpose of finding an optimal relevance assessment function 4.
(25) Analogously to step 110 of method 100, at least one image 2 for which ANN 1 has ascertained an association 3 with one or multiple classes 3a through 3c of a predefined classification, and also association 3 that is ascertained by ANN 1, are provided in step 210 of method 200.
(26) Analogously to step 120 of method 100, for the combination of image 2 and association 3, a spatially resolved relevance assessment 2a of image 2 is ascertained in step 220 of method 200 by applying relevance assessment function 4. This relevance assessment 2a indicates which portions of image 2, and to what extent, have contributed to association 3.
(27) A correlation 7 between relevance assessment 2a on the one hand, and a semantic segmentation 2b of the image 2 and/or an edge detection 2c from image 2 on the other hand, is ascertained in step 230. This correlation 7 is evaluated in step 240 as a measure for plausibility 6 of the output of ANN 1.
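One concrete way to compute correlation 7 in step 230 is the Pearson correlation between the flattened relevance map and the flattened reference map; Pearson correlation is an illustrative choice here, not the only admissible measure:

```python
import numpy as np

def plausibility_correlation(relevance, reference):
    """Step 230 sketch: Pearson correlation 7 between the spatially
    resolved relevance assessment 2a and a reference map (semantic
    segmentation 2b or edge detection 2c), evaluated in step 240 as a
    measure of plausibility 6 of the output of ANN 1."""
    a = relevance.ravel().astype(float)
    b = reference.ravel().astype(float)
    a -= a.mean()
    b -= b.mean()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0
```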
(28) This plausibility 6, or correlation 7 directly, may then be compared to a threshold value 7a in step 250; if it falls below the threshold, a system that acts at least semi-automatedly may be activated in such a way that disadvantageous consequences of an incorrect association are reduced.
(29) Various examples are provided within box 250 that indicate how this activation may be designed specifically for vehicles.
(30) According to block 251, at least one additional physical sensor for observing the surroundings of the vehicle may be activated.
(31) According to block 252, the travel velocity of a vehicle that is driving at least semi-automatedly may be reduced. For example, on an expressway the vehicle may, as a precaution, be directed into the slower traffic in the right lane.
(32) According to block 253, a driving assistance system and/or a system for the at least semi-automated driving of the vehicle may be completely or partially deactivated.
(33) According to block 254, an at least semi-automatedly driving vehicle may be brought to a standstill on a preplanned emergency stop trajectory. Such an emergency stop trajectory is customarily kept available in each system as the standard fallback for at least semi-automated driving in the event of a system failure.
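The escalation across blocks 251 through 254 can be sketched as a simple threshold cascade. The tiering of the actions below is an illustrative assumption; the description lists the possible actions but prescribes no fixed ordering or sub-thresholds:

```python
def select_response(correlation, threshold=0.5):
    """Step 250 sketch: if correlation 7 falls below threshold value 7a,
    choose an increasingly conservative vehicle response (blocks 251-254).
    The sub-thresholds are illustrative, not part of the described method."""
    if correlation >= threshold:
        return "no_action"
    if correlation >= 0.75 * threshold:
        return "activate_additional_sensor"    # block 251
    if correlation >= 0.5 * threshold:
        return "reduce_velocity"               # block 252
    if correlation >= 0.25 * threshold:
        return "deactivate_assistance_system"  # block 253
    return "emergency_stop_trajectory"         # block 254
```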
(35) A first image 2 shows a man 25 who functions as feature parameter 8. If this man 25 is recognized and classified by ANN 1, the area 25* marked with a higher point density is decisive for this purpose in spatially resolved relevance assessment 2a.
(36) A second image 2′ shows a woman 26 as an altered feature parameter 8′. If this woman 26 is recognized and classified by ANN 1, the area 26* marked with a higher point density is decisive for this purpose in spatially resolved relevance assessment 2a′.
(37) This area 26* is much more similar to area 25* than woman 26 is to man 25. The differences are due essentially to differences in the spatial extent.
(38) Relevance assessment function 4 with which spatially resolved relevance assessments 2a, 2a′ have been created is thus essentially invariant with respect to an exchange of man 25 with woman 26.
(40) A first image 2 shows a road 23 and a vehicle 24 that is moving toward the observer. Road 23 and vehicle 24 together form feature parameter 8 of image 2. In spatially resolved relevance assessment 2a for this image 2, the edges of road 23 are represented by areas 23* having higher relevance. Vehicle 24 is represented in this relevance assessment 2a via a further area 24* having higher relevance.
(41) A second image 2′ is a snapshot of same road 23 and of same vehicle 24 taken at a slightly later time. In contrast to first image 2, vehicle 24 has moved further toward the observer. In this regard, feature parameter 8′ is changed compared to feature parameter 8 of first image 2. This change is also reflected in relevance assessment 2a′ for second image 2′. The edges of road 23 are still represented by same areas 23* as in relevance assessment 2a. However, area 24* representing vehicle 24 has become larger and, within spatially resolved relevance assessment 2a′, has shifted relative to original relevance assessment 2a in a manner analogous to the movement of the vehicle between the two images 2 and 2′.
(42) Relevance assessment function 4, with which spatially resolved relevance assessments 2a, 2a′ have been created, is thus equivariant with respect to a movement of objects in image 2, 2′.
(44) If ANN 1 is successfully “deceived” in this way, for example by an adhesive sticker 22 affixed to stop sign 21, this implies that the area with adhesive sticker 22 has a particularly strong influence on association 3 that is ascertained by ANN 1. This means that this area has a particularly high weight in spatially resolved relevance assessment 2a compared to the remainder of image 2. This is illustrated in the corresponding figure.
(45) In contrast, edge detection 2c from image 2 places particular emphasis on the features of stop sign 21, while adhesive sticker 22 is only barely recognizable, if at all. Adhesive stickers 22 that are affixed with malicious intent to signs are in fact intended to be as visually inconspicuous as possible so that no one discovers and removes them.
(46) Thus, the very features of stop sign 21 that are particularly prominent in edge detection 2c have virtually no relevance in spatially resolved relevance assessment 2a. Likewise, adhesive sticker 22, which is so important for relevance assessment 2a, has virtually no relevance in edge detection 2c. Correlation 7 between relevance assessment 2a and edge detection 2c is thus poor, which may be recognized using method 200.