MOBILE ROBOT CAPABLE OF LEARNING RISK-AWARE TRAVERSABILITY VIA ACCUMULATED NAVIGATION EXPERIENCE
20250284288 ยท 2025-09-11
Assignee
Inventors
Cpc classification
G05D1/617
PHYSICS
International classification
Abstract
Proposed is a mobile robot capable of learning risk-aware traversability via accumulated navigation experience. The mobile robot includes a point cloud sensor which acquires point cloud data, an elevation map generation part which generates a grid-based elevation map by using the point cloud data, an attribute data generation part which calculates a plurality of types of terrain attribute values for each of grid cells from the elevation map and generates attribute data sets having the plurality of types of terrain attribute values for each of the grid cells, a training data generation part which generates a plurality of traversability training data sets and a plurality of non-traversability training data sets by labeling whether each of the attribute data sets is in traversable condition, a risk calculation part which calculates a traversal risk, and a risk-aware self-training part which trains a traversability evaluation model for traversability evaluation through self-training.
Claims
1. A mobile robot capable of learning risk-aware traversability via accumulated navigation experience, the mobile robot comprising: a point cloud sensor which acquires point cloud data; an elevation map generation part which generates a grid-based elevation map by using the point cloud data; an attribute data generation part which calculates a plurality of types of terrain attribute values for each of grid cells from the elevation map and generates attribute data sets having the plurality of types of terrain attribute values for each of the grid cells; a training data generation part which generates a plurality of traversability training data sets and a plurality of non-traversability training data sets by labeling whether or not each of the attribute data sets is in traversable condition; a risk calculation part which calculates a traversal risk for each of the traversability training data sets; and a risk-aware self-training part which trains a traversability evaluation model for traversability evaluation through self-training thereof which uses the plurality of traversability training data sets and the plurality of non-traversability training data sets, wherein the traversal risk is reflected in the self-training.
2. The mobile robot of claim 1, wherein the training data generation part generates the traversability training data sets by labeling, as traversable, the attribute data sets for grid cells on a trajectory traversed by the mobile robot; and generates the non-traversability training data sets by labeling, as non-traversable, attribute data sets outside an attribute threshold set for at least one type of the plurality of types of terrain attribute values among the attribute data sets for grid cells other than the trajectory traversed by the mobile robot.
3. The mobile robot of claim 1, wherein the risk calculation part calculates an intrinsic risk as a weighted average of the plurality of types of terrain attribute values which constitute the traversability training data sets; and calculates a cumulative risk as a relative risk of the corresponding traversability training data set compared to the entire traversability training data sets on the basis of the intrinsic risk calculated for each of the traversability training data sets.
4. The mobile robot of claim 3, wherein the intrinsic risk is calculated by mathematical expression
.sub.int(x.sub.i)=w.sup.Tx.sub.i (x.sub.i refers to the attribute data set, i refers to the index of the grid cell,
.sub.int(x.sub.i) refers to the intrinsic risk of the attribute data set x.sub.i, and w refers to a vector consisting of a weight reflected in each of the terrain attribute values).
5. The mobile robot of claim 4, wherein the cumulative risk is calculated by mathematical expression .sub.cum(x.sub.i) refers to the cumulative risk of the attribute data set x.sub.i, n refers to the number of the traversability training data sets,
refers to an evaluation function that evaluates to 1 when condition in the parenthesis is met and 0 when the condition in the parenthesis is not met, r.sub.i refers to the intrinsic risk of the attribute data set x.sub.i, and r.sub.k refers to the intrinsic risk of the attribute data set with a grid-cell index k).
6. The mobile robot of claim 5, wherein the risk calculation part calculates an importance weight on the basis of the intrinsic risk and the cumulative risk for each of the traversability training data sets; and the risk-aware self-training part reflects the importance weight and the cumulative risk in the self-training of the traversability evaluation model.
7. The mobile robot of claim 6, wherein the importance weight is calculated by a product of the intrinsic risk and the cumulative risk.
8. The mobile robot of claim 6, wherein the risk-aware self-training part reflects the importance weight and the cumulative risk in a loss function of the traversability evaluation model.
9. The mobile robot of claim 8, wherein the loss function comprises: a weighted loss function in which the importance weight is reflected; and a regularization loss function in which the cumulative risk is reflected.
10. The mobile robot of claim 9, wherein the weighted loss function is defined as mathematical expression
11. The mobile robot of claim 1, wherein the terrain attribute values comprise at least two values of a step height value, a slope value, a roughness value, a curvature value, and an elevation variance value.
12. The mobile robot of claim 1, further comprising: a noisy training data removal part which removes, as label noise, a traversability training data set or a non-traversability training data set whose labeling value has changed on the basis of change over time in the labeling value for each of the traversability training data sets and each of the non-traversability training data sets.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0035] The above and other objectives, features, and other advantages of the present disclosure will be more clearly understood from the following detailed description when taken in conjunction with the accompanying drawings, in which:
[0036]
[0037]
[0038]
[0039]
[0040]
DETAILED DESCRIPTION OF THE INVENTION
[0041] Advantages and characteristics of the present disclosure, and methods for achieving them will become clear with reference to embodiments described later in detail in conjunction with the accompanying drawings. However, the present disclosure is not limited to the embodiments disclosed below and may be implemented in a variety of different forms. However, these embodiments are provided to make the disclosure of the present invention complete and to completely inform those skilled in the art of the scope of the invention to which the present disclosure belongs, and the present disclosure is only defined by the scope of the claims.
[0042] Terms used herein are intended to describe embodiments and are not intended to limit the present disclosure. In this specification, singular forms also include plural forms, unless specifically stated otherwise in the context. As used in the specification, comprises and/or comprising does not exclude the presence or addition of one or more other elements in addition to mentioned elements. Like reference numerals refer to like elements throughout the specification, and and/or includes each and every combination of one or more of the referenced elements. Although first, second, etc. are used to describe various components, these components are of course not limited by these terms. These terms are merely used to distinguish one component from another. Accordingly, a first component mentioned below may also be a second component within the technical idea of the present disclosure.
[0043] Unless otherwise defined, all terms (including technical and scientific terms) used herein may be used in the same manner as would be commonly understood by those skilled in the art to which the present disclosure belongs. In addition, terms defined in commonly used dictionaries are not to be idealized or over-interpreted unless explicitly and specifically defined.
[0044]
[0045] In the embodiment of the present disclosure, the self-learning process illustrated in
[0046] Referring to
[0047] The point cloud sensor 110 according to the embodiment of the present disclosure senses the driving environment of the mobile robot 100 while the mobile robot 100 is driving so as to acquire the point cloud data. In the embodiment of the present disclosure, as an example, the point cloud sensor 110 is a 3D LiDAR sensor to obtain 3D information on each point cloud. Of course, the point cloud sensor 110 may be provided in the form of a stereo camera.
[0048] For example, as illustrated in
[0049] In an embodiment, the robot control part 180 may traverse the mobile robot 100 according to the manipulation of a user by controlling the robot drive part 170. In the embodiment of the present disclosure, the robot drive part 170 is exemplified to be configured as a two-wheel differential drive type. However, this is merely one embodiment, and various other drive methods may be applied.
[0050] The elevation map generation part 120 according to the embodiment of the present disclosure may generate a grid-based elevation map by using the point cloud data acquired by the point cloud sensor 110. In an embodiment, the elevation map generated by the elevation map generation part 120 may include height information such as height, variance, maximum height, and minimum height at each grid cell.
[0051] The attribute data generation part 130 according to the embodiment of the present disclosure may calculate a plurality of types of terrain attribute values for each grid cell from the elevation map generated by the elevation map generation part. In the embodiment of the present disclosure, the terrain attribute values include at least two values of a step height value, a slope value, a roughness value, a curvature value, and an elevation variance value. In an embodiment, the five types of terrain attribute values exemplified above are calculated.
[0052] In an embodiment, the step height value may be derived by maximum height difference between adjacent cells. In addition, the slope value may be calculated by an angle between the normal vector and gravity vector of a road surface. In addition, the roughness value may be determined by the smallest eigenvalue of an elevation covariance, and the curvature value may be calculated from the eigenvalues of the covariance matrix of adjacent points.
[0053] In addition, the elevation variance value is calculated through the variance of the elevation values of the point cloud data within a corresponding grid cell, and reflects the uncertainty of the estimation of elevation, i.e., height, calculated for the corresponding grid cell.
[0054] When the terrain attribute values are calculated as described above, the attribute data generation part 130 can generate an attribute data set composed of the terrain attribute values for each grid cell in S12. In an embodiment, the attribute data set may be represented as x.sub.i={h.sub.i, s.sub.i, r.sub.i, c.sub.i, .sub.i.sup.2}, wherein h, s, r, c and 2 are the step height value, the slope value, the roughness value, the curvature value, and the elevation variance value, respectively, and i is an index of the grid cell.
[0055] The training data generation part 140 according to the embodiment of the present disclosure may generate a training data set by labeling whether or not each of attribute data sets is in traversable condition in S13. In one embodiment, the training data generation unit 140 labels the attribute data sets as traversable to generate traversability training data sets when the attribute data sets are determined to be in traversable condition, and labels the attribute data sets as non-traversable to generate non-traversability training data sets when the attribute data sets are determined to be in non-traversable condition.
[0056] Through the above process, until manual driving ends in S15, the plurality of traversability training data sets and the plurality of non-traversability training data sets may be generated.
[0057] In an embodiment, the training data generation part 140 may generate the traversability training data sets by labeling, as traversable, the attribute data sets for grid cells on a trajectory traversed by the mobile robot 100.
[0058] In addition, the training data generation part 140 may generate the non-traversability training data sets by labeling, as non-traversable, the attribute data sets outside an attribute threshold set for at least one type of the plurality of types of terrain attribute values among the attribute data sets for grid cells other than the traversing trajectory of the mobile robot 100.
[0059] In an embodiment, the training data generation part 140 may generate the non-traversability training data set by labeling the corresponding attribute data set as non-traversable when the step height value of the corresponding attribute data set is greater than the attribute threshold set for the step height value among the terrain attribute values.
[0060] Here, the terrain attribute valuewhose attribute threshold is determined according to the specification of the mobile robot 100is set to an allowable threshold. It is preferable that a person can intuitively understand the meaning of the value, and a representative example of this may be the step height value.
[0061] As another example, the slope value may also have a set attribute threshold, and in a case in which each of the step height value and the slope value has a set attribute threshold, when one of the values exceeds the attribute threshold, a corresponding attribute data set may be labeled as non-traversable.
[0062]
[0063] Additionally, grid cells that are not labeled as traversable or non-traversable areas are assigned pseudo-labels and used as auxiliary training data sets during the self-learning process.
[0064] Meanwhile, the mobile robot 100 according to the embodiment of the present disclosure may include a noisy training data removal part 141. The noisy training data removal part 141 according to the embodiment of the present disclosure may remove, as label noise, a traversability training data set or a non-traversability training data set whose labeling value has changed on the basis of change over time in the labeling value, i.e., change over time of traversability or non-traversability, for each of the traversability training data sets and each of the non-traversability training data sets in S14.
[0065] For example, the non-traversability training data sets, which are determined by the attribute threshold, are generated without the verification of a user, so the reliability of their values is not high.
[0066] In the actual traversing environment of the mobile robot 100, errors may occur in the terrain attribute values that constitute the attribute data sets due to various sources of error, such as noisy depth measurement, occlusion, and drift in pose estimation.
[0067] A common way to identify noisy labels may include considering statistical approaches to detect and remove an outlier. However, it is difficult to set appropriate parameters for detecting the outlier in heterogeneous terrains in which the distribution of terrain attributes varies from place to place. There is the risk that appropriate data samples with large standard deviations are excessively removed.
[0068] Accordingly, in the embodiment of the present disclosure, temporal changes in the label .sub.i of a grid-cell index i are detected, and training data sets for grid cells whose label values change over time are treated as noises and removed.
[0069] For example, before the mobile robot 100 traverses, certain grid cells in the front traversing environment are labeled as non-traversable by an attribute threshold, but when the mobile robot 100 passes through the corresponding grid cells during a manual traversing process by a user, the corresponding grid cells are labeled as traversable.
[0070] Therefore, since there is change in a labeling value as the corresponding cells are initially labeled as non-traversable and then later labeled as traversable, the corresponding grid cells are removed from the training data sets by the noisy training data removal part 141.
[0071] Through the above process, until the traversing of the mobile robot 100 by a user ends, a plurality of training data sets for the traversing trajectory of the mobile robot 100 and surrounding environment thereof, that is, the plurality of traversability training data sets and the plurality of non-traversability training data sets can be acquired.
[0072] Meanwhile, the risk calculation part 150 according to the embodiment of the present disclosure may calculate a traversal risk for each of the traversability training data sets in S16. In an embodiment, the risk calculation part 150 may calculate an intrinsic risk and a cumulative risk as the traversal risk.
[0073] In general, the experience of the mobile robot 100 in hazardous terrain during its actual traversing environment is a key to strong generalization for future traversing. However, it is not realistically feasible to perform actual traversing on hazardous terrain during training due to the risk of damage to the mobile robot 100 in very challenging terrain.
[0074]
[0075] As illustrated in
[0076]
[0077] Accordingly, in the embodiment of the present disclosure, the risk calculation part 150 calculates the intrinsic risk and the cumulative risk for the traversability training data sets as described earlier, and reflects the traversal risk in the traversability training data set.
[0078] Here, the intrinsic risk reflects the terrain attribute values which constitute the traversability training data sets, that is, inherent attributes of terrain. In addition, the cumulative risk reflects the increasing experience of the mobile robot 100 and adaptability to different terrains over time, and reflects the relative risk of each traversability training data set compared to the entire traversability training data sets.
[0079] In the embodiment of the present disclosure, the risk calculation part 150 calculates the intrinsic risk as the weighted average of the plurality of types of terrain attribute values which constitute the traversability training data sets.
[0080] In an embodiment, the risk calculation part 150 calculates the intrinsic risk by using [Mathematical expression 1].
[0081] Here, x.sub.i refers to the attribute data sets, i.e., the traversability training data set, labeled as traversable, as described above. In addition, .sub.int(x.sub.i) refers to the intrinsic risk of the attribute data sets x.sub.i, and w refers to a vector consisting of a weight reflected in each of the terrain attribute values. Here, the sum of the weights, which are vector elements constituting w, is set to 1, so that the weights may be determined according to the importance of each of the terrain attribute value.
[0082] For example, in the case of the mobile robot 100 traversing on a road within a university campus, wherein the mobile robot 100 moves by using wheels, the height of steps, such as curbs, is an important factor in determining whether the mobile robot 100 can traverse, so the weight for the height of steps may be set high.
[0083] Meanwhile, the risk calculation part 150 may calculate the cumulative risk by using the intrinsic risk calculated for the traversability training data sets. In an embodiment, the risk calculation part 150 may calculate the cumulative risk through [Mathematical expression 2].
[0084] As in the calculation of the intrinsic risk, x.sub.i refers to the attribute data sets generated from the traversability training data sets, and n refers to the total number of the traversability training data sets. In addition, refers to an evaluation function that evaluates to 1 when condition in the parenthesis is met and 0 when the condition in the parenthesis is not met, r.sub.i refers to the intrinsic risk of the attribute data sets x.sub.i, and r.sub.k refers to the intrinsic risk of the attribute data sets with the grid-cell index k.
[0085] [Mathematical expression 2] utilizes the empirical cumulative distribution function (eCDF), which reflects the possibility of encountering various terrains on the basis of past experience so as to effectively capture the exposure of the mobile robot 100 to various terrain risks over time.
[0086]
[0087] When the mobile robot 100 was manually driven in two different environments, one paved with asphalt and the other paved with cobblestone, situations with high intrinsic risk were rarely encountered in both cases. On the other hand, it can be seen that the cumulative risk of the traversability training data sets shows different aspects between different environments. This variability in distribution may make it difficult to assume a specific parametric distribution.
[0088]
[0089] As can be seen in
[0090] Referring to
[0091] That is, the cumulative risk perceived by the mobile robot 100 is different when the mobile robot 100 traverses only on a flat and smooth terrain and when the mobile robot 100 traverses on a rough terrain, and by utilizing this, the risk of the terrain is reevaluated on the basis of the traversing experience of the mobile robot 100, enabling adaptive data processing depending on the environment.
[0092] Meanwhile, the risk calculation part 150 according to the embodiment of the present disclosure may calculate an importance weight by using the intrinsic risk and the cumulative risk in S17.
[0093] Here, the importance weight is a quantification of the risk of a terrain, wherein the quantification is performed by combining the intrinsic risk of terrain attributes and the cumulative risk that reflects the traversing experience of the mobile robot 100.
[0094] In an embodiment, the risk calculation part 150 may calculate the importance weight via [Mathematical expression 3].
[0095] That is, the importance weight may be calculated by the product of the intrinsic risk and the cumulative risk.
[0096] Meanwhile, the risk-aware self-training part 160 according to the embodiment of the present disclosure may train the traversability evaluation model 161 for traversability evaluation through self-training thereof which uses the plurality of traversability training data sets and the plurality of non-traversability training data sets in S18.
[0097] Here, the risk-aware self-training part 160 trains the traversability evaluation model 161 by reflecting the traversal risk in the self-training of the traversability evaluation model 161. In an embodiment, the risk-aware self-training part 160 includes reflecting the importance weight and the cumulative risk in the self-training of the traversability evaluation model 161, wherein the importance weight and the cumulative risk are reflected in a loss function.
[0098] In an embodiment, the risk-aware self-training part 160 may reflect the importance weight in a weighted loss function and may reflect the cumulative risk in a regularization loss function.
[0099] In an embodiment, the weighted loss function may be defined as [Mathematical expression 4].
[0100] Here, w.sub.i refers to the importance weight, p.sub.i refers to a multi-layer perceptron (MLP) with learnable parameters , and y.sub.i refers to the labeling value. In [Mathematical expression 4], the notation of x.sub.i is omitted to simplify the formula.
[0101] The mobile robot 100 of the present disclosure incorporates entropy regularization to prevent prediction distribution from being overly concentrated. The mobile robot 100 of the present disclosure integrates the navigation experience of the mobile robot 100 into an entropy regularization framework. In particular, .sub.cum derived from the previous navigation of the mobile robot 100 is used to adjust the effects of the entropy regularization. The prediction reliability of a model may be adjusted according to the familiarity of a terrain. After that, regularization loss function for each sample xi may be defined as [Mathematical expression 5].
[0102] Finally, the learning objective function may be defined as [Mathematical expression 6].
[0103] In [Mathematical expression 6], N refers to the total number of traversability training data sets and the non-traversability training data sets, and refers to the contribution of the regularization loss function. , adjusts the contribution of the entropy regularization. Here, navigation experience of the mobile robot 100 is integrated into the entropy regularization formula, as illustrated by [Mathematical expression 5] and [Mathematical expression 6].
[0104] The traversability evaluation model 161, which is self-trained as described above, later evaluates traversability by taking as input the attribute data sets generated by the attribute data generation part 130 during the actual driving of the mobile robot 100 and outputs a result.
[0105] Hereinafter, the experimental results of the mobile robot 100 according to the embodiment of the present disclosure will be described with reference to
[0106] First, the mobile robot 100 is applied as a two-wheeled differential robot. The mobile robot 100 is designed for urban living environments and may operate in a variety of outdoor terrains. Experiments were conducted in two different environments. The first experiment was conducted in an urban college campus environment, and the second experiment was conducted in a farm road environment.
[0107] [Table 1] shows training data sets collected in each environment. Since training data sets are collected from each grid cell on the elevation map, a significant amount of data may be acquired in a short period of time while performing manual traversing.
TABLE-US-00001 TABLE 1 Dataset No. of training sample(N.sub.l/N.sub.) Label noise rate Urban campus 92,213/232,682 2.7% Farm road 30,835/260,091 5.1%
[0108] The urban campus environment shown in
[0109] In the urban campus environment, a total of 324,895 data samples were collected, of which 28.4% were labeled and created as a training data set. For labeling as non-traversable, the attribute threshold for the step height value was set to 0.1 m.
[0110]
[0111] Ramps, bollards, and speed bumps were identified as major urban environmental elements that restrict the traversing direction of the mobile robot 100, as illustrated in
[0112] As illustrated in
[0113] Tree roots, cobblestone-paved roads, and grassy terrain in urban environments pose serious challenges to a trained model because they do not appear frequently during training. In
[0114] In particular, low-height terrain obstacles that were not present in a labeled data set were accurately predicted as non-traversable. This is because the use of pseudo-labels during self-training enables the traversability evaluation model 161 to predict situations beyond those explicitly represented in the labeled training data set.
[0115] The result of predicting traversability in the farm road environment is shown in
[0116] Although some embodiments of the present disclosure have been illustrated and described, it will be apparent to those skilled in the art that modifications may be made to the embodiments without departing from the principles or spirit of the present disclosure. The scope of the present disclosure will be determined by the appended claims and equivalents thereof.