Real-time structural damage detection by convolutional neural networks
11630040 · 2023-04-18
Assignee
Inventors
Cpc classification
E04B1/2403
FIXED CONSTRUCTIONS
Y02B10/30
GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
International classification
Abstract
Certain embodiments may generally relate to structural damage detection. An embodiment may be directed to method for identifying a presence and a location of structural damage. Such method may include training a convolutional neural network (CNN) for a joint of a structure, sending instructions to a modal shaker to induce an input to the structure, receiving, as a result of the induced input, a raw acceleration signal at the joint, computing, based on the trained CNN and the raw acceleration signal, an index value of the joint, and identifying, according to the index value, a presence of a location of structural damage of the structure. In a further embodiment, the index value represents a likelihood of damage at the joint.
Claims
1. A method for identifying a presence and a location of structural damage, comprising: training a one-dimensional convolutional neural network (CNN) applied a joint of a structure; sending instructions to a modal shaker to induce an input to the structure; receiving, as a result of the induced input, a raw acceleration signal at the joint; computing, based on the trained one-dimensional CNN and the raw acceleration signal, an index value of the joint; and identifying, according to the index value, a presence of a location of structural damage of the structure, wherein the index value represents a likelihood of damage at the joint, and wherein the computing of the index value comprises dividing the acceleration signal to a number of frames that each include a total number of n.sub.s samples; normalizing the frames between −1 to 1; feeding the normalized frames measured at the joint to the one-dimensional CNN; and determining a probability of damage (PoD) at the joint by dividing a number of frames classified s damaged by a total number of frames processed by the one-dimensional CNN.
2. The method of claim 1, wherein a high PoD value within a range of about 0.8 to about 1.0 provides an indication that the joint is likely to be damaged, and a low PoD value within a range of about 0.0 to about 0.5 provides an indication that the joint is likely to be undamaged.
3. The method of claim 1, wherein the acceleration signal is measured by an accelerometer that is disposed at the location of the joint.
4. The method of claim 1, wherein the training of the one-dimensional CNN comprises conducting a plurality of experiments to generate a training data set for training the one-dimensional CNN, wherein each of the plurality of experiments comprises measuring acceleration signals at an undamaged joint as a result of an application of a random shaker excitation at the undamaged joint, and wherein each of the plurality of experiments comprises introducing damage at the undamaged joint to create a damaged joint, and measuring acceleration signals at the damaged joint as a result of the application of the random shaker excitation at the damaged joint.
5. An apparatus, comprising: at least one memory comprising computer program code; and at least one processor; wherein the at least one memory and the computer program code are configured, with the at least one processor, to cause the apparatus at least to: train a one-dimensional convolutional neural network (CNN) applied a joint of a structure; send instructions to a modal shaker to induce an input to the structure; receive, as a result of the induced input, a raw acceleration signal at the joint; compute, based on the trained one-dimensional CNN and the raw acceleration signal, an index value of the joint; and identify, according to the index value, a presence of a location of structural damage of the structure, wherein the index value represents a likelihood of damage at the joint, and wherein the apparatus is further caused to, in the computation of the index value, divide the acceleration signal to a number of frames that each include a total number of n.sub.s samples; normalize the frames between −1 to 1; feed the normalized frames measured at the joint to the one-dimensional CNN; and determine a probability of damage (PoD) at the joint by dividing a number of frames classified s damaged by a total number of frames processed by the one-dimensional CNN.
6. The apparatus of claim 5, wherein a high PoD value within a range of about 0.8 to about 1.0 provides an indication that the joint is likely to be damaged, and a low PoD value within a range of about 0.0 to about 0.5 provides an indication that the joint is likely to be undamaged.
7. The apparatus of claim 5, wherein the acceleration signal is measured by an accelerometer that is disposed at the location of the joint.
8. The apparatus of claim 5, wherein the at least one memory and the computer program code are further configured, with the at least one processor, to cause the apparatus at least to, in the training of the one-dimensional CNN: conduct a plurality of experiments to generate a training data set for training the one-dimensional CNN, wherein each of the plurality of experiments includes measuring acceleration signals at an undamaged joint as a result of an application of a random shaker excitation at the undamaged joint, and wherein each of the plurality of experiments includes introducing damage at the undamaged joint to create a damaged joint, and measuring acceleration signals at the damaged joint as a result of the application of the random shaker excitation at the damaged joint.
9. A computer program, embodied on a non-transitory computer readable medium, the computer program, when executed by a processor, causes the processor to: train a one-dimensional convolutional neural network (CNN) applied a joint of a structure; send instructions to a modal shaker to induce an input to the structure; receive, as a result of the induced input, a raw acceleration signal at the joint; compute, based on the trained one-dimensional CNN and the raw acceleration signal, an index value of the joint; identify, according to the index value, a presence of a location of structural damage of the structure, wherein the index value represents a likelihood of damage at the joint, and wherein the processor is further caused to, in the computation of the index value, divide the acceleration signal to a number of frames that each include a total number of n.sub.s samples; normalize the frames between −1 to 1; feed the normalized frames measured at the joint to the one-dimensional CNN; and determine a probability of damage (PoD) at the joint by dividing a number of frames classified s damaged by a total number of frames processed by the one-dimensional CNN.
10. The computer program of claim 9, wherein a high PoD value within a range of about 0.8 to about 1.0 provides an indication that the joint is likely to be damaged, and a low PoD value within a range of about 0.0 to about 0.5 provides an indication that the joint is likely to be undamaged.
11. The computer program of claim 9, wherein the acceleration signal is measured by an accelerometer that is disposed at the location of the joint.
12. The computer program of claim 9, wherein the computer program, when executed by the processor, further causes the processor to, in the training of the one-dimensional CNN: conduct a plurality of experiments to generate a training data set for training the one-dimensional CNN, wherein each of the plurality of experiments includes measuring acceleration signals at an undamaged joint as a result of an application of a random shaker excitation at the undamaged joint, and wherein each of the plurality of experiments includes introducing damage at the undamaged joint to create a damaged joint, and measuring acceleration signals at the damaged joint as a result of the application of the random shaker excitation at the damaged joint.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate preferred embodiments of the invention and together with the detailed description serve to explain the principles of the invention. In the drawings:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15) In the following detailed description of the illustrative embodiments, reference is made to the accompanying drawings that form a part hereof. These embodiments are described in sufficient detail to enable those skilled in the art to practice the invention, and it is understood that other embodiments may be utilized and that logical or structural changes may be made to the invention without departing from the spirit or scope of this disclosure. To avoid detail not necessary to enable those skilled in the art to practice the embodiments described herein, the description may omit certain information known to those skilled in the art. The following detailed description is, therefore, not to be taken in a limiting sense.
DETAILED DESCRIPTION
(16) The features, structures, or characteristics of the invention described throughout this specification may be combined in any suitable manner in one or more embodiments. For example, the usage of the phrases “certain embodiments,” “some embodiments,” or other similar language, throughout this specification refers to the fact that a particular feature, structure, or characteristic described in connection with the embodiment may be included in at least one embodiment of the present invention.
(17) In the following detailed description of the illustrative embodiments, reference is made to the accompanying drawings that form a part hereof. These embodiments are described in sufficient detail to enable those skilled in the art to practice the invention, and it is understood that other embodiments may be utilized and that logical or structural changes may be made to the invention without departing from the spirit or scope of this disclosure. To avoid detail not necessary to enable those skilled in the art to practice the embodiments described herein, the description may omit certain information known to those skilled in the art. The following detailed description is, therefore, not to be taken in a limiting sense.
(18) Certain embodiments are described herein for using various tools and procedures used by a software application to detect and determine a location of structural damage of a structure. The examples described herein are for illustrative purposes only. As will be appreciated by one skilled in the art, certain embodiments described herein, including, for example, but not limited to, those shown in
(19) Any combination of one or more computer usable or computer readable medium(s) may be utilized with various embodiments described herein. For the purposes of this description, a computer-usable or computer readable medium can be any apparatus that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. The computer-usable or computer-readable medium may be, for example but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, device, or propagation medium. More specific examples (a non-exhaustive list) of the computer-readable medium may independently be any suitable storage device, such as a non-transitory computer-readable medium. Suitable types of memory may include, but not limited to: a portable computer diskette; a hard disk drive (HDD), a random access memory (RAM), a read-only memory (ROM); an erasable programmable read-only memory (EPROM or Flash memory); a portable compact disc read-only memory (CDROM); and/or an optical storage device.
(20) The memory may be combined on a single integrated circuit as a processor, or may be separate therefrom. Furthermore, the computer program instructions stored in the memory may be processed by the processor can be any suitable form of computer program code, for example, a compiled or interpreted computer program written in any suitable programming language. The memory or data storage entity is typically internal, but may also be external or a combination thereof, such as in the case when additional memory capacity is obtained from a service provider. The memory may also be fixed or removable.
(21) The computer usable program code (software) may be transmitted using any appropriate transmission media via any conventional network. Computer program code, when executed in hardware, for carrying out operations of certain embodiments may be written in any combination of one or more programming languages, including, but not limited to, an object oriented programming language such as Java, Smalltalk, C++, C# or the like and conventional procedural programming languages, such as the “C” programming language or similar programming languages. Alternatively, certain embodiments may be performed entirely in hardware.
(22) A classical signal-based SHM approach may include a continuous acquisition of signals by sensors, extraction of certain (hand-crafted) features and feature classification by a classifier. In certain cases, it may be important to extract damage-sensitive features correlated with the severity of the damage in the monitored structure, and a well-configured and trained classifier that has the utmost ability to discriminate those features. This may be why the choice of both features extracted and the classifier used usually depends on a trial-and-error process for a particular SHM application. Such fixed features/classifiers that are either manually selected or hand-crafted, may not optimally characterize the acquired signal and, thus, cannot accomplish a reliable performance level for SHM. In other words, it may be challenging to determine which is the optimal choice for a particular signal (SHM data) and classifier. In addition, feature extraction may turn out to be a computationally costly operation which eventually may hinder the usage of such methods for a real-time SHM application.
(23) Certain embodiments may implement Convolutional Neural Networks (CNNs). CNNs are generally known as a standard for “Deep Learning” tasks such as object recognition in large image archives as they achieved the state-of-the-art performances, with a significant performance gap. Typically, CNNs are feed-forward artificial neural networks (ANNs) that have both alternating convolutional and subsampling layers. As the convolutional layers basically model the cells in the human visual cortex, CNNs are developed primarily for 2D signals such as images and video frames. However, 1D CNNs have successfully been used for the classification of electrocardiogram (ECG) beats achieving the state-of-the-art performance in terms of both accuracy and speed. Furthermore, 1D CNNs have achieved the fastest solution with an elegant accuracy for fault detection in high power engines. A reason behind such superiority may be in the configuration of CNNs. Convolutional layers generally use linear filters, whose parameters are optimized during the training process. These filters may extract crucial information (features), which characterize the object/pattern in an image/signal. The convolutional layers may be followed by a feed-forward and fully connected layers, which are identical to the hidden layers of multi-layer perceptrons (MLPs) where the classification task is mainly realized. As a result, regardless of the variations in the signal characteristics and patterns, CNNs may have the natural ability to learn the optimal features and the classifier parameters in a combined optimization process, known as back-propagation (BP).
(24) Furthermore, CNNs are biologically inspired feed-forward ANNs that present a simple model for the mammalian visual cortex.
(25) According to certain embodiments, an adaptive 1D CNN configuration may be used in order to fuse feature extraction and learning (damage detection) phases of raw accelerometer data. The adaptive CNN topology may allow the variations in the input layer dimension. In particular,
(26) As seen in
(27) According to certain embodiments, in 1D CNNs, the 1D forward propagation (FP) from the previous convolution layer, l−1, to the input of a neuron in the current layer, l, can be expressed as:
(28)
(29) where x.sub.k.sup.l is the input, b.sub.k.sup.l is a scalar bias of the k.sup.th neuron at layer l, and s.sub.i.sup.l−1 is the output of the i.sup.th neuron at layer l−1. w.sub.i,k.sup.l−1 is the kernel weight from the i.sup.th neuron at layer l−1 to the k.sup.th neuron at layer l. The intermediate output of the neuron, y.sub.k.sup.l can then be expressed from the input x.sub.k.sup.l, as follows:
y.sub.k.sup.l=f(x.sub.k.sup.l) and s.sub.k.sup.l=y.sub.k.sup.l↓ss (2)
where s.sub.k.sup.l is the output of the neuron and L ss represents the down-sampling operation with the factor, ss.
(30) In certain embodiments, the adaptive CNN configuration may require automatic assignment of the sub-sampling factor of the output CNN layer (the last CNN layer). In particular, it may be set to the size of its input array. For instance, in
(31) According to certain embodiments, a training methodology may be provided called BP. As noted above, CNNs may have the natural ability to learn the optimal features and the classifier parameters in a combined optimization process. The BP steps may be formulated as follows. The BP of the error may start from the output fully-connected layer. For instance, let l=1 and l=L be the input and output layers, respectively. Also, let N.sub.L be the number of classes in the database. For an input vector p, and its corresponding target and output vectors, t.sub.i.sup.p and [y.sub.1.sup.L, . . . , y.sub.N.sub.
(32)
(33) An objective of the BP is to minimize the contributions of the network parameters to this error. Therefore, it is needed to compute the derivative of the MSE with respect to an individual weight (connected to that neuron, k) w.sub.ik.sup.l−1, and bias of the neuron k, b.sub.k.sup.l, so that a gradient descent method can be performed to minimize their contributions and hence the overall error in an iterative manner. Specifically, the delta of the k.sup.th neuron at layer l, Δ.sub.k.sup.l may be used to update the bias of that neuron and all weights of the neurons in the previous layer connected to that neuron as:
(34)
(35) Thus, from the first MLP layer to the last CNN layer, the regular (scalar) BP may simply be performed as:
(36)
(37) Once the first BP is performed from the next layer l+1, to the current layer l, then it may be possible to further back-propagate it to the input delta Δ.sub.k.sup.l. In such a case, let zero order up-sampled map be us.sub.k.sup.l=up (s.sub.k.sup.l), then one can write:
(38)
where β=(ss).sup.−1 since each element of s.sub.k.sup.l was obtained by averaging ss number of elements of the intermediate output, y.sub.k.sup.l. The inter BP of the delta error
(39)
can be expressed as:
(40)
where rev(.) reverses the array and conv 1Dz(. , .) performs full convolution in 1D with K−1 zero padding. Finally, the weight and bias sensitivities can be expressed as:
(41)
(42) As a result, the iterative flow of the BP algorithm can be written as the following: 1. Initialize all weights (usually randomly, U(−α, α)) 2. For each BP iteration DO: a. For each item (or a group of items or all items) in the dataset, DO: i. FP: Forward propagate from the input layer to the output layer to find outputs of each neuron at each layer, y.sub.i.sup.l, ∀i ∈ [1, N.sub.l] and ∀ l ∈ [1, L]. ii. BP: Compute delta error at the output layer and back-propagate it to first hidden layer to compute the delta errors, Δ.sub.k.sup.l, ∀k ∈ [1, N.sub.l] and ∀ l ∈ [2, L−1]. iii. PP: Post-process to compute the weight and bias sensitivities. iv. Update: Update the weights and biases with the (accumulation of) sensitivities found in (c) scaled with the learning factor, ϵ:
(43)
(44) According to certain embodiments, it may be possible to accurately detect damages (if any), and identify the location of the damaged areas of a structure. To do so, in certain embodiments, may require designing and training a unique 1D CNN for joints of an example structural simulator.
(45) As shown in
(46) The CNN-based algorithm according to certain embodiments may require designing and training a unique 1D CNN for each one of the 30 joints instrumented with accelerometers in the structural simulator. However, the CNN-based algorithm is not limited to the simulator structure shown in
(47)
(48) According to certain embodiments, in each experiment, E=k+1, damage may be introduced at the joint J=k (by loosening its connection bolts in this study), and the n acceleration signals may be measured under random excitation. The measured signals may be denoted as U.sub.E=k+1,J=1, . . . , D.sub.E=k+1,J=k, . . . , U.sub.E=k+1,J=n, where the notation D indicates that this signal was measured at the damaged joint k. After conducting the n+1 experiments, the signals measured at each joint i may be grouped together as follows in order to create the damaged/undamaged vectors required to train the corresponding CNN, CNN.sub.i:
Undamaged.sub.i=[U.sub.E=1,J=i U.sub.E=2,J=i . . . U.sub.E=i,J=i U.sub.E=i+2,J=i . . . U.sub.E=n+1,J=i] (11)
Damaged.sub.i=[D.sub.E=i+1,J=i] (12)
(49) According to certain embodiments, the undamaged data set for a particular joint i may include signals measured while, the other joints are undamaged (U.sub.E=1,J=i) as well as signals measured while one of the other joints are damaged. The data generation and collection process may ensure that the effect of damaging a particular joint on the response of the other joints will not cause the CNNs to misclassify the undamaged joints as damaged.
(50) Next, the aforementioned undamaged and damaged vectors may be divided to a large number of frames, where each frame contains a certain number of samples n.sub.s. The result of this operation for joint i can be written as:
UF.sub.i=[UF.sub.i,1 UF.sub.i,2 . . . UF.sub.i,n.sub.
DF.sub.i=[DF.sub.i,1 DF.sub.i,2 . . . DF.sub.i,n.sub.
where UF.sub.i and DF.sub.i are vectors containing the undamaged and damaged frames for the joint i, respectively, and n.sub.uf and n.sub.df are the total number of undamaged and damaged frames, respectively.
(51) Given the total number of samples in each acceleration signal n.sub.T; n.sub.uf and n.sub.df can be computed as:
(52)
(53) From Equations (15) and (16), it can be seen that for a structure with a large number of joints (accelerometers) n, the number of undamaged frames for a particular joint may be significantly larger than the number of damaged frames. Training the CNN using extremely unbalanced undamaged/damaged frames may degrade the classification performance. Therefore, the frames corresponding to joint i may be balanced according to the following procedure: (1) the n.sub.uf frames in UF.sub.i are randomly shuffled to yield a new vector, UFS.sub.i; and (2) the shuffled vector UFS.sub.i is truncated by taking only the first n.sub.df and remove the remaining frames resulting in a new undamaged vector UFN.sub.i that contains a total of n.sub.df frames.
(54) In certain embodiments, UF.sub.i may be shuffled in order to make sure that the undamaged frames from all experiments have an equal chance of being selected. In other embodiments, all frames in vectors UFN.sub.i and DF.sub.i may be normalized between −1 to 1, resulting in the final vectors UFF.sub.i and DFF.sub.i that may be used to train the CNN, CNN.sub.i.
(55) To train CNN.sub.i, in certain embodiments, its parameters may be specified such as the number of convolutional layers and neurons, the number of hidden fully-connected layers and neurons, the kernel size, K, and the sub-sampling factor, ss. Finally, the CNN training may be carried out based on the data in UFF.sub.i and DFF.sub.i using BP as explained above. The entire data generation and CNNs training process according to certain embodiments is illustrated in
(56) According to certain embodiments, once all of the CNNs have been trained, they can be directly used to assess the condition of the structure. For example, each CNN.sub.i may be utilized to compute an index that reflects the likelihood of damage at joint i directly from the raw acceleration signal measured at its location. This can be done by: (1) inducing damage at one or more locations (or keep the structure undamaged); (2) applying a random shaker input; (3) measuring the acceleration signal at each joint; (4) dividing each acceleration signal to a number of frames, each containing a total of n.sub.s samples; (5) normalizing the frames between −1 to 1; (6) feeding the normalized frames measured at each joint to the corresponding CNN (CNN.sub.i); and (7) computing the probability of damage (PoD.sub.i) at the i.sup.th joint as below:
(57)
where T.sub.i is the total number of frames processed by CNN.sub.i, and D.sub.i is the number of frames classified as “damaged”. The PoD computed at damaged joints may be significantly higher than the values for the undamaged joints. This gives a clear indication regarding both the presence and the location of a structural damage. For example, in certain embodiments, a PoD value close to 0.0 indicates that the corresponding location is undamaged, while a PoD value close to 1.0 indicates that the corresponding location is damaged. Further, PoD values within a range of about 0.0 to about 0.5 may indicate that the corresponding location is undamaged, while PoD values within a range of about 0.80 to about 1.0 may indicate that the corresponding location is damaged.
Experimental Results
(58) The efficiency of the damage detection procedures described above may be determined. For instance, according to certain embodiments, in a first phase, only a single beam on the steel frame (n=5 joints) of the structural simulator was monitored. In a second phase, the performance of the damage detection approach was tested utilizing the entire structural simulator (n=30 joints).
(59) Experimental Setup
(60) In a setup for determining the efficiency of the damage detection procedures according to certain embodiments, the horizontal girder at the middle of the structural simulator was considered for the first phase of the experimental work. A total of 6 experiments were conducted to generate the data required for training. In each experiment, the acceleration signals were collected under a 0-512 Hz band-limited white noise shaker excitation at a sampling frequency of 1024 Hz. The signals were recorded for 256 s, so that each signal contains n.sub.τ=262144 samples. The shaker control and data acquisition operations were conducted using ME'ScopeVES software. Further, a Matlab code was used to group, divide into frames, balance, and normalize the data sets. The frame length n.sub.s was taken as 128 samples, therefore, vectors UFF.sub.i and DFF.sub.i contain a total of 2048 frames for each joint i. Only 50% of these frames were used for the training process (i.e., 1024 undamaged frames and 1024 damaged frames for each CNN).
(61) Additionally, five CNNs were trained for the five joints along the tested girder. All the CNNs were selected to have a compact configuration with only two hidden convolution layers and two hidden fully-connected layers. According to certain embodiments, it may be possible to accomplish a high computational efficiency required, particularly for real-time detection. This also demonstrates that deep and complex CNN configurations are not really needed to achieve the desired detection performance. The structure and parameters of the 1D CNNs were obtained by trial-and-error. The 1D CNN configuration used in all experiments has (64, 32) neurons on the two hidden convolution layers and (10, 10) neurons on the two hidden fully-connected layers. The output (MLP) layer size was 2, which is the same as the number of classes. In addition, each CNN has a single input neuron which takes the input signal as the 128 time-domain samples of each frame in the training data set. The kernel size K, and the sub-sampling factor ss for all CNNs were set to 41 and 2, respectively.
(62) For all experimental results, a two-fold stopping criteria for BP training was assigned: (1) the train classification error (CE) of 1%; and (2) maximum 100 BP iterations. Whenever either criterion is met, the BP training stops. The learning factor, ϵ, may initially be set as 0.001 and the global adaptation is performed at each BP iteration. At each iteration, if the trained MSE decreases in the previous iteration, ϵ is increased by 5%; otherwise, ϵ is reduced by 30%.
(63) Phase-1 Evaluation
(64)
(65) The results of the first phase indicates that the algorithm successfully evaluated the condition of the monitored girder as undamaged. For Cases 2 to 6 (single damage cases), the algorithm successfully assigned high PoD values (i.e., close to 1.0) to the damaged joints, while the computed PoD values for the intact joints were very low. Further, for both double damage cases (Cases 7 and 8), high PoD values were obtained for the two damaged joints, and much lower PoD values were assigned to the remaining joints.
(66) Phase—2 Evaluation
(67) For the second phase of the experimental work, the entire steel frame consisting of n=30 joints was monitored. Therefore, a total of 31 experiments were needed to generate the training data. The same data generation and CNN training parameters used in the first phase were used again for the second phase. The average classification error of the resulting 30 CNNs over the training data was found to be about 0.54%.
(68) To test the performance of the resulting 30 CNNs, the algorithm was tested against 24 structural cases (undamaged case +18 single damage cases +5 double damage cases). The PoD distributions obtained for 24 structural cases are illustrated in
(69) In Case 23, the performance of the algorithm was tested when two adjacent joints were damaged. The results obtained for this case were satisfactory, however, a high PoD value was incorrectly assigned to an undamaged joint (i.e., joint (6,3)). Also, for Case 24 where two joints along the structure's line of symmetry were damaged, the PoD map did not reflect the damage location accurately. Considering the very slight damage introduced to the structural system (only loosening the bolts of the connections), the results of the two phases demonstrate an elegant performance of the damage detection algorithm in assessing the condition of structures and locating single damages.
(70) The adaptive 1D CNN classifier, according to certain embodiments, may be implemented in C++ using MS Visual Studio 2013 in 64-bit. This program may be capable of carrying out the forward and back-propagation operations required for training and using the CNNs. Also, a Matlab code may be written and used to extract vectors UFF.sub.i and DFF.sub.i directly from the signals collected in the experiments as detailed above. Another Matlab code may be used to generate the PoD distribution directly from the raw acceleration signals using the trained CNNs as explained above. The experiments were conducted using a computer with I7-4910 MQ at 2.9 GHz (8 cores) and 32-Gb memory.
(71)
(72) In certain embodiments, computing the index value may include dividing the acceleration signal to a number of frames that each include a total number of n.sub.s samples. The computation of the index value may also include normalizing the frames between −1 to 1, and feeding the normalized frames measured at the joint to the CNN. The computation of the index value may further include determining a probability of damage (PoD) at the joint by dividing a number of frames classified s damaged by a total number of frames processed by the CNN. When the PoD value is high, then it may provide an indication that the joint is likely to be damaged. Alternatively, when the PoD value is low, it may provide an indication that the joint is likely to be undamaged.
(73) For example, in certain embodiments, a PoD value close to 0.0 indicates that the corresponding location is undamaged, while a PoD value close to 1.0 indicates that the corresponding location is damaged. Further, PoD values within a range of about 0.0 to about 0.5 may indicate that the corresponding location is undamaged, while PoD values within a range of about 0.80 to about 1.0 may indicate that the corresponding location is damaged.
(74)
(75) Apparatus 10 may include a processor 22 for processing information and executing instructions or operations. Processor 22 may be embodied by any computational or data processing device, such as a central processing unit (CPU), digital signal processor (DSP), application specific integrated circuit (ASIC), programmable logic devices (PLDs), field programmable gate arrays (FPGAs), digitally enhanced circuits, or comparable device or a combination thereof. Processor 22 may be implemented as a single controller, or a plurality of controllers or processors.
(76) For firmware or software, the implementation may include modules or unit of at least one chip set (for example, procedures, functions, and so on). Memory 14 may independently be any suitable storage device such as those described above. The memory and the computer program instructions may be configured, with the processor for the particular device, to cause a hardware apparatus such as apparatus 10, to perform any of the processes described above (see, for example,
(77) According to certain embodiments, memory 14 may be coupled to processor 22, for storing information and instructions that may be executed by processor 22. Memory 14 may be one or more memories and of any type suitable to the local application environment, and may be implemented using any suitable volatile or nonvolatile data storage technology such as a semiconductor-based memory device, a magnetic memory device and system, an optical memory device and system, fixed memory, and removable memory. For example, memory 14 can be comprised of any combination of random access memory (RAM), read only memory (ROM), static storage such as a magnetic or optical disk, or any other type of non-transitory machine or computer readable media.
(78) Apparatus 10 may also include one or more antennas (not shown) for transmitting and receiving signals and/or data to and from apparatus 10. Apparatus 10 may further include a transceiver 28 that modulates information onto a carrier waveform for transmission by the antenna(s), demodulates information received via the antenna(s) for further processing by other elements of apparatus 10. In other embodiments, transceiver 28 may be capable of transmitting and receiving signals or data directly.
(79) According to certain embodiments, processor 22 may perform functions associated with the operation of apparatus 10 including, without limitations, any of the procedures described above and illustrated in the figures.
(80) In other embodiments, memory 14 may store software modules that provide functionality when executed by processor 22. The modules may include an operating system 15 that provides operating system functionality for apparatus 10. Memory 14 may also store one or more functional modules 18, such as an application or program, to provide additional functionality for apparatus 10. The components of apparatus 10 may be implemented in hardware, or as any suitable combination of hardware and software.
(81) According to certain embodiments, it may be possible to provide an adaptive implementation of 1D CNNs, which may demonstrate a high performance level for real-time SHM and structural damage detection processes. For instance, the results described herein demonstrated the superior ability of the compact 1D CNNs to learn the extraction of optimal features automatically and directly from the raw accelerometer data, not requiring any feature extraction, and any pre- or post-processing. The approach according to certain embodiments not only achieves a high level of generalization, but also eliminates the need for manual model or parameter tuning on any hand-crafted feature extraction. This fact makes the algorithm applicable for monitoring almost any civil infrastructure.
(82) Due to the simple structure of the 1D CNNs that requires only 1D convolutions (scalar multiplications and additions), a mobile and low-cost hardware implementation of the described approach is quite feasible. Moreover, since the CNN-based method is computationally inexpensive, it can be easily applied for real-time structural health monitoring of any engineering structure (e.g., civil, mechanical, or aerospace).
(83) According to other embodiments, the CNNs were capable of learning directly from the acceleration data measured under random excitations. In all training and testing steps, the excitation input was assumed to be unknown. Therefore, the described algorithm may very promising for monitoring civil structures under ambient vibration (i.e., damage detection using output-only data).
(84) Further, conventional centralized algorithms require the signals measured at all locations to be collected and transferred to a single processing unit. Thus, transferring and synchronizing large amount of data can be problematic especially when a wireless sensor network is used for SHM. On the other hand, the algorithm according to certain embodiments may be decentralized, which means that a unique classifier (CNN) may be assigned to each location. Each CNN may process only the locally-available data to assess the structural condition at its location. Hence, the method according to certain embodiments offers an effective solution to overcome this problem.
(85) According to certain embodiments therefore, it may be possible to provide a fast and highly accurate nonparametric vibration-based algorithm for structural damage detection based on adaptive 1D CNN. It may also be possible to identify and locate any structural damage in real-time by processing raw vibration signals acquired by a network of accelerometers. With a proper adaptation over the traditional CNNs, certain embodiments can directly classify the accelerometer signal without requiring any feature extraction, pre- or post-processing. Consequently, this leads to an efficient system in terms of speed, allowing a real-time application. Due to the CNNs ability to learn to extract the optimal features, with a proper training, the system according to certain embodiments can achieve superior damage detection and localization accuracy despite the noise-like and uncorrelated patterns of the accelerometer signal. Some samples of the latter are shown in
(86) According to certain embodiments, with the CNN-based damage detection technique, it may be possible to significantly reduce the computational time and effort required to classify the signals. To illustrate this feature, the same CNN configuration may be used. The acceleration signal used for this illustration may be acquired at a sampling frequency of 1024 Hz, and therefore it may include 1024 samples. The signal may be divided to eight frames, each having 128 samples. Accordingly, the total time required for the classification of 1-sec signal was only 22 msec. Further, this speed was about 45× faster than the real-time requirement.
(87) According to other embodiments, it may be possible to use the CNN-algorithm described herein in any SHM and damage detection system for any large-scale structure including, for example, but not limited to: buildings; stadia; bridges; tunnels; off-shore platforms; towards; pipeline networks; dams; wind turbines; airplanes; ships; aerospace structures; and more. Certain embodiments may also serve as an automated damage detection and classification system for civil engineers, mechanical engineers, aerospace engineers, and experts in this field. The CNNs, according to certain embodiments, may be capable of learning directly from the acceleration data measured under random excitations. In all training and testing steps, the excitation input may be assumed to be unknown. Thus, the algorithm according to certain embodiments may be quite promising for monitoring civil structures under ambient vibration (i.e., damage detection using output-only data).
(88) Although the foregoing description is directed to the preferred embodiments of the invention, it is noted that other variation and modifications will be apparent to those skilled in the art, and may be made without departing from the spirit or scope of the invention. Moreover, features described in connection with one embodiment of the invention may be used in conjunction with other embodiments, even if not explicitly stated above.