Effective feature set-based high impedance fault detection
11656263 · 2023-05-23
Assignee
Inventors
- Qiushi Cui (Tempe, AZ, US)
- Yang Weng (Chandler, AZ, US)
- Khalil El-Arroudi (Montreal, CA)
- Syed Muhammad Yousaf Hashmy (Tempe, AZ, US)
Cpc classification
H02H1/0092
ELECTRICITY
G01R31/085
PHYSICS
H02H7/22
ELECTRICITY
International classification
G01R31/08
PHYSICS
H02H3/38
ELECTRICITY
Abstract
Effective feature set-based high impedance fault (HIF) detection is provided. Systems, methods and devices described herein present a systematic design of power feature extraction for HIF detection and classification. For example, power features associated with HIF events are extracted according to when a fault happens, how long it lasts, and the magnitude of the fault. Complementary power expert information is also integrated into feature pools. In another aspect, a ranking procedure is deployed in a feature pool for balancing information gain and complexity in order to avoid over-fitting of features. In aspects described herein, a logic-based HIF detector implements HIF feature extraction. To determine when an HIF occurs, the HIF detector calculates different quantities, such as active power and reactive power, based on a voltage and current time series, and uses the derivative of these quantities to tell when there is a potential change due to HIF.
Claims
1. A high impedance fault (HIF) detector, comprising: a system characteristic averager configured to store instantaneous power characteristic values and provide averaged power characteristic values of the instantaneous power characteristic values; a decision circuit configured to determine occurrence of an HIF based on the instantaneous power characteristic values and the averaged power characteristic values; and a Kalman filter configured to provide a first portion of the instantaneous power characteristic values to the system characteristic averager after filtering a received voltage signal and a received current signal, wherein the first portion of the instantaneous power characteristic values comprises at least one of an estimated in-phase component of a third harmonic of the received voltage signal (KF.sub.V.sub.
2. The HIF detector of claim 1, further comprising a comparison circuit configured to compare the instantaneous power characteristic values with the averaged power characteristic values to provide comparison values to the decision circuit.
3. The HIF detector of claim 1, further comprising a discrete Fourier transform (DFT) circuit configured to provide a second portion of the instantaneous power characteristic values to the system characteristic averager based on a DFT of the received voltage signal and the received current signal.
4. The HIF detector of claim 3, further comprising a comparison circuit configured to: receive the instantaneous power characteristic values from the Kalman filter and the DFT circuit; receive the averaged power characteristic values from the system characteristic averager; and compare the instantaneous power characteristic values with the averaged power characteristic values to provide comparison values to the decision circuit.
5. The HIF detector of claim 1, further comprising a discrete Fourier transform (DFT) circuit configured to provide at least a portion of the instantaneous power characteristic values to the system characteristic averager based on a DFT of a received voltage signal and a received current signal.
6. The HIF detector of claim 5, wherein the at least the portion of the instantaneous power characteristic values provided by the DFT circuit to the system characteristic averager comprises a negative sequence voltage (V.sub.2) and a negative sequence current (I.sub.2).
7. The HIF detector of claim 6, wherein the at least the portion of the instantaneous power characteristic values provided by the DFT circuit further comprises an angle difference between the negative sequence voltage and a zero sequence voltage (θ.sub.V.sub.
8. The HIF detector of claim 1, wherein the instantaneous power characteristic values comprise an effective feature set.
9. The HIF detector of claim 8, wherein the effective feature set comprises at least one of: a negative sequence voltage (V.sub.2), a negative sequence current (I.sub.2), an angle difference between the negative sequence voltage and a zero sequence voltage (θ.sub.V.sub.
10. A method for detecting a high impedance fault (HIF), comprising: receiving power measurements from a power distribution system; extracting an angle difference between a negative sequence voltage and a zero sequence voltage (θ.sub.V.sub.
11. The method of claim 10, wherein extracting the angle difference θ.sub.V.sub.
12. The method of claim 11, further comprising averaging the angle difference θ.sub.V.sub.
13. The method of claim 10, further comprising: extracting an effective feature set (EFS) comprising the angle difference θ.sub.V.sub.
14. The method of claim 13, wherein extracting the EFS comprises performing a set of discrete Fourier transforms (DFTs) on a received voltage signal and a received current signal.
15. The method of claim 14, wherein extracting the EFS further comprises Kalman filtering the received voltage signal and the received current signal.
16. The method of claim 13, further comprising averaging the EFS to produce an averaged EFS, wherein the occurrence of the HIF is determined based on comparing the averaged EFS with the EFS.
17. A protective relay for a power distribution line, the protective relay comprising: a power coupler; and a high impedance fault (HIF) detector connected to the power coupler, comprising: feature extraction logic configured to extract instantaneous power characteristic values from a signal of the power coupler, the feature extraction logic comprising a Kalman filter that is configured to provide a first portion of the instantaneous power characteristic values after filtering a received voltage signal and a received current signal, the first portion of the instantaneous power characteristic values comprising at least one of an estimated in-phase component of a third harmonic of the received voltage signal (KF.sub.V.sub.
Description
BRIEF DESCRIPTION OF THE DRAWING FIGURES
(1) The accompanying drawing figures incorporated in and forming a part of this specification illustrate several aspects of the disclosure, and together with the description serve to explain the principles of the disclosure.
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
DETAILED DESCRIPTION
(19) The embodiments set forth below represent the necessary information to enable those skilled in the art to practice the embodiments and illustrate the best mode of practicing the embodiments. Upon reading the following description in light of the accompanying drawing figures, those skilled in the art will understand the concepts of the disclosure and will recognize applications of these concepts not particularly addressed herein. It should be understood that these concepts and applications fall within the scope of the disclosure and the accompanying claims.
(20) It will be understood that, although the terms first, second, etc. may be used herein to describe various elements, these elements should not be limited by these terms. These terms are only used to distinguish one element from another. For example, a first element could be termed a second element, and, similarly, a second element could be termed a first element, without departing from the scope of the present disclosure. As used herein, the term “and/or” includes any and all combinations of one or more of the associated listed items.
(21) It will be understood that when an element is referred to as being “connected” or “coupled” to another element, it can be directly connected or coupled to the other element or intervening elements may be present. In contrast, when an element is referred to as being “directly connected” or “directly coupled” to another element, there are no intervening elements present.
(22) The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the disclosure. As used herein, the singular forms “a,” “an,” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms “comprises,” “comprising,” “includes,” and/or “including” when used herein specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
(23) Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. It will be further understood that terms used herein should be interpreted as having a meaning that is consistent with their meaning in the context of this specification and the relevant art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
(24) Effective feature set (EFS)-based high impedance fault (HIF) detection is provided. Systems, methods and devices described herein present a systematic design of power feature extraction for HIF detection and classification. For example, power features associated with HIF events are extracted according to when a fault happens, how long it lasts, and the magnitude of the fault. Complementary power expert information is also integrated into feature pools. In another aspect, a ranking procedure is deployed in a feature pool for balancing information gain and complexity in order to avoid over-fitting of features.
(25) In aspects described herein, a logic-based HIF detector implements HIF feature extraction. To determine when an HIF occurs, the HIF detector calculates different quantities, such as active power and reactive power, based on a voltage and current time series. The HIF detector uses the derivative of these quantities to tell when there is a potential change due to HIF. To determine duration of the HIF, the HIF detector uses a discrete Fourier transform (DFT) to quantify the harmonics of the voltage and current, so that suspicious harmonics can be recorded for later inspection. To determine magnitude of the HIF, the HIF detector estimates coefficients of Kalman Filter (KF)-based harmonics. Numerical methods show the proposed HIF detector has very high dependability and security performance under multiple fault scenarios compared with traditional methods.
(26) I. High Impedance Fault Modeling
(27)
(28) TABLE-US-00001 TABLE I Value Values Component Range change every V.sub.p 5~6 kV 0.1 ms V.sub.n 7~8 kV 0.1 ms R.sub.p 30~1500 Ω 0.1 ms R.sub.n 30~1500 Ω 0.1 ms
(29)
(30)
(31) With reference to
(32) II. Feature Selection Method for High Impedance Fault
(33) Feature selection helps HIF detection identify a key feature set and reduce data amount/layers, which increases the applicability of the HIF detection approach described herein. Therefore, an approach to selecting the key features is elaborated in this section. A power distribution system which may be subject to HIF events is described, after which the variable-importance in feature evaluation is explained. The pool of features and selected features are then described, followed by a way of obtaining the feature pool data.
(34) A. Benchmark System
(35)
(36) TABLE-US-00002 TABLE II System Type Location A Location B Synchronous-machine- SG N/A based system Inverter-based system WF N/A Hybrid system SG WF
(37) B. Variable-Importance in Feature Evaluation
(38) A decision-tree-based algorithm in machine learning provides protection engineers with optimal relay logic and settings in distribution network protection. However, locating the key features of HIF is a significant challenge, given its randomness and irregularity. In other words, an effective and unbiased feature evaluator is required to calculate the merit of each tested feature before the classification between HIF event and non-HIF event. Embodiments described herein take advantage of an information gain and minimum description length (MDL)-based discretization algorithm to select important features during HIF.
(39) The MDL-based method relies on information gain (also referred to as entropy). Once the information gain of each feature is calculated for a classification variable, those features that contribute more information will have a higher information gain value over others, whereas those that do not add much information will have a lower score and can be removed.
(40) The score of variable-importance is one type of selection measure in machine learning. The problem of selecting the best attribute can be stated as the problem of selecting the most compressive attribute. Assuming that all features are discrete, the objective is to find the best features that maximize the selection measure. Given C classes, the MDL can be defined as follows using the logarithm of all possible combinations of class labels:
(41)
(42) in which n.sub... denotes the number of training instances and n.sub.i. is the number of training instances from class C.sub.i, n.sub..j is the number of instances with the j-th value of the given attribute, and n.sub.ij is the number of instances from class C.sub.i and with the j-th value of the given attribute. More details regarding how the variable-importance approach has been used in the feature evaluation for the HIF detection can be found in Section V.
(43) C. The Pool of Candidate Features
(44) An exemplary pool of candidate features for use in HIF detection is illustrated below in Table III. The feature pool in Table III is designed in four steps. First, time series data of voltage and current is obtained with a DFT-based technique. Second, the feature pool is greatly expanded with multiple physical quantities through calculation using the times series data. These calculated measurements range from basic values (e.g., df, frequency) to first order derivatives (e.g., df/dt, the rate of change of frequency), considering both the absolute value and its changing rate. Third, the harmonic coefficients are estimated through a KF-based technique, presenting the in-phase and in-quadrature components (e.g., KF|sin H1). Finally, in order to capture some unconventional phenomena, some features are invented in the category of “other feature.” For example, θ.sub.V2−θ.sub.V0, the angle difference between the negative and zero sequence voltage, is a good indicator of the unbalance level in distribution grids. Note that harmonic phase angles are in harmonic degrees and are the phase difference between the zero crossing of the fundamental frequency reference and the next zero crossing in the same direction of the harmonic.
(45) TABLE-US-00003 TABLE III Feature Type Designed Feature Other Feature DFT- df, df/dt, P, dP/dt, pf, dpf/dt, Q, dQ/dt, df/dP, df/dQ, based ϕ, dϕ/dt, H.sub.V1~H.sub.V6, H.sub.I1~H.sub.I6, V.sub.abc, V.sub.012, dV/dP, dV/dQ, I.sub.abc, I.sub.012, dI/dt, dV/dt, V.sub.ph, V.sub.ll, θ.sub.V012, dH.sub.V1/dt, dH.sub.I1/dt, θ.sub.I012, θ.sub.H.sub.
(46) Embodiments disclosed herein can include a number of important and implementable features, and may attempt to maximize the number of candidate features. For instance, the parameters of “when” a HIF occurs, such as the rate of change of active power (dP/dt), form a certain group of features in the feature pool. It may not matter if some non-HIF cases, such as capacitor bank switching, lead to similar changes since these cases only trigger certain features but not all features together in the proposed feature set or any well-trained statistical machine learning model. The reason for such choice is the adopted machine learning model might be so complicated that it needs the assistance of the “when” feature group at different thresholds for decision-making.
(47) D. Events for Feature Selection
(48) Returning in greater detail to
(49) TABLE-US-00004 TABLE IV Number of Event Category Event Type Events System Operating Loading Condition (30%-100%) 8 Condition DER Tech. (SG, inverter, hybrid) 3 Fault Event Type 1: SLG, LLG, LL, LLLG 10 Type 2: Downed conductor 3 Fault impedance 6 Inceptioin Angle 4 (0°, 30°, 60°, 90°) Fault location 3 Non-fault Event Normal State 1 Load Switching 6 Capacitor Switching 2
(50) The 10 events from Type 1 in Table IV are associated with the undowned conductor, where 3 single-line-to-ground (SLG) (AG, BG, CG), 3 line-to-line-to-ground (LLG) (ABG, ACG, BCG), 3 line-to-line (LL) (Aft BC, AC), and 1 three-line-to-ground (LLLG) (ABCG) faults are included. The 3 events of Type 2 fault are the downed conductor for each phase. The fault impedance values include 50, 150, 250, 350, 450, and 550 ohms (Ω) for illustrative purposes, though other values may be used. In load switching, the 6 types of non-fault events include 4 single load switching (L-4, L-9, L-19, L-23) and 2 combinational load switching ((L-2, L-4, L-5) and (L-9, L-10)) events. The system loading for the normal state is shown in Table V. The 2 capacitor switching events have both the on and off status of the capacitor bank near bus B-15.
(51) TABLE-US-00005 TABLE V Active Reactive Phase power (kW) power (kVar) A 3,297 745 B 3,052 671 C 4,425 987 Total 10,774 2,403
(52) Moreover, the event category is flexible and can be tailored for other special systems by adding or deleting some of the event categories/types. In an exemplary aspect, comprehensive scenarios are considered in the event category (refer to Table IV). A loading condition ranging from 30% to 100%, in a step of 10%, is simulated. Furthermore, eight loading conditions and three distributed generation (DG) technologies are examined respectively on top of the base case scenario. Therefore, the number of fault and non-fault events are calculated as follows: Fault event: since two types of fault, summing up to 13 cases, are included, the number of fault events with one fault impedance, one fault inception angle, and one fault location (given the 8 loading conditions and 3 DG technologies) is (10+3)×8×3=312. Given 6 simulated fault impedances, 4 fault inception angles, and 3 fault locations, the total number of fault events adds up to 312×6×4×3=22464. Non-fault event: it comprises normal state, load switching (adding and shedding) and capacitor switching events. Therefore the total number of non-fault events equals to (1+6+2)×8×3=216.
(53) The above event number results in an imbalanced dataset, where the number of data points belonging to the minority class (“non-fault”) is far smaller than the number of the data points belonging to the majority class (“fault”). Under this circumstance, an algorithm gets insufficient information about the minority class to make an accurate prediction. Therefore, the synthetic minority over-sampling technique (SMOTE) is employed to generate synthetic samples and shift the classifier learning bias towards the minority class.
(54) In some examples, the HIF detection technique includes spatial data by implementing current and voltage transformers and measurement devices at a substation and downstream of the feeder. Knowledge extracted from these measurements is able to serve data from the spatial dimension for better detection coverage. Moreover, the proposed HIF detector can be installed along the distribution feeder and may be supplementary to the devices installed near the substation. This can address signal sensitivity and accuracy issues, since the further an HIF event is from the substation, the lower the signal magnitude becomes if the HIF detector were installed near the substation.
(55) E. Effective Feature Set
(56) Table VI presents an exemplary effective feature set (EFS) for HIF detection in three types of distribution systems. The EFS in Table VI was produced after mining the collected data, applying the feature ranking algorithm and selecting the EFS by considering the comprehensive performance in different distribution systems in Table II. The cut-off point was determined through a simple descending search process in the variable-importance list. A trade-off is realized between the detection performance improvement and the complexity of the detection logic.
(57) TABLE-US-00006 TABLE VI Fault Type Proposed Feature SLG, LL, LLG V.sub.2, I.sub.2, θ.sub.V.sub.
(58) The search process for the EFS began with testing the first feature that has the highest score, then the first two features with the highest scores, then the first three features, and so on. Then two stopping criteria were set: (1) the incremental of detection accuracy (A, defined in Section VI) of two adjacent tests is larger than 0.1%, and (2) the number of features is smaller than a practical number N (N=15 is used for illustrative purposes herein). Extensive tests showed that the increase of the performance indices becomes marginal when the variable-importance threshold is selected at 0.787. Meanwhile, the complexity of the detection logic and the number of signals are within an acceptable level (6 signals ignoring phases, three categories, suitable for unbalanced faults as shown in Table VI).
(59) According to the mathematical formulation and physical interpretation described above, these features are used for fault detection because: (1) some physical quantities are statistically more relative to the classification results than others, and (2) based on the merit of each feature, the features in Table IV contribute more information gain than others. For example, θ.sub.V.sub.
(60) III. High Impedance Fault Detector
(61)
(62) Generally, the proposed HIF detection scheme of the HIF detector 14 updates its comparison logic 22 and decision logic 24 according to the obtained decision tree structure. As indicated in
(63) The HIF detector 14 of
(64) A. System Characteristic Averager
(65) The input of the system characteristic averager 20 includes the extracted instantaneous signals (e.g., instantaneous power characteristic values) after the DFT and KF circuits 16, 18. Meanwhile, the time duration
(66) After each batch of average value calculation, the system will automatically overwrite the earliest records once the storage capacity has been reached. The five minutes interval is subject to change depending on the case-specific analysis. Normally, the five-minute data is feasible for the distribution system condition evaluation and for modern digital relay implementation. For example, the HIF solution by SEL Inc. also deploys a memory function to record unusual signal changes related to system HIF. In-depth simulation or experimental results can be conducted to validate the effectiveness of this time constant over a large time scale. In the end, each time constant is either increased or decreased depending on the signal's slow or fast dynamic process.
(67) To avoid signal spikes, a limiter is implemented at the beginning of each signal channel. Meanwhile, the time constant
(68) B. Comparison Logic
(69)
(70) The sensitivity gain of k.sub.i is incorporated in order to 1) set the margin of detection and 2) add a handle to the detection sensitivity. Where the undefined parameter of
(71) The sensitivity gain
(72) The output of the comparison logic 22 is the comparison assertion bit of b.sub.i (i=1, 2, . . . , 10), the
(73) C. Decision Logic
(74)
(75) A time delay 28 of T.sub.D is implemented because an appropriate selection of T.sub.D can effectively avoid the false operation resulting from normal switching, which sometimes contributes to third harmonics. The output of the HIF detector 14 is either alarming or tripping signal.
(76) D. Performance Test of the HIF Detector
(77) With reference to
(78) Testing Environment:
(79) The HIF detector 14 was tested under 7884 new scenarios: 7776 unbalanced faults and 108 non-faults. The fault locations under testing include faults near B-3, B-11, and B-19 of the power distribution system 12 of
(80) Testing Criteria:
(81) In order to compare the proposed technique with some existing ones in the field, adopted criteria include accuracy (A), dependability (D), security (S), speed (V), objectivity (OBJ), and completeness (COM). The detailed definition of these evaluation criteria is shown in Section VI. The performance of the proposed method was compared with four representative HIF detection methods: 1) D. C. Yu and S. H. Khan, “An adaptive high and low impedance fault detection method,” Power Delivery, IEEE Transactions on, vol. 9, no. 4, pp. 1812-1821, 1994 (hereinafter “Yu”); 2) A.-R. Sedighi, M.-R. Haghifam, O. Malik, and M.-H. Ghassemian, “High impedance fault detection based on wavelet transform and statistical pattern recognition,” Power Delivery, IEEE Transactions on, vol. 20, no. 4, pp. 2414-2421, 2005 (hereinafter “Sedhighi”); 3) A. Ghaderi, H. A. Mohammadpour, H. L. Ginn, and Y.-J. Shin, “High-impedance fault detection in the distribution network using the time-frequency-based algorithm,” Power Delivery, IEEE Transactions on, vol. 30, no. 3, pp. 1260-1268, 2015 (hereinafter “Ghaderi”); 4) S. Sahoo and M. E. Baran, “A method to detect high impedance faults in distribution feeders,” in T&D Conference and Exposition, 2014 IEEE PES, IEEE, 2014, pp. 1-6 (hereinafter “Sahoo”); as well as the combined conventional relay elements (including frequency, over/under voltage, over current) in Table VII. The methods in the comparison group cover the logic-gate based HIF detector 14 of
(82) TABLE-US-00007 TABLE VII A D S Solution under test (%) (%) (%) V OBJ COM HIF Detector 97.0 98.3 95.7 0.13 No Yes (with EFS) Yu N/A 69.0 90.7 N/A No Yes Sedhighi 96 90 100 0.25 No No Ghaderi 93.6 100 81.5 1.00 Yes Yes Sahoo 94.9 90.0 90.9 0.11 No Yes Combined 49.1 0.0 98.2 N/A Yes Yes conventional relay elements
(83) Performance Comparison:
(84) Comparing with the other five methods in Table VII, it is indicated that the proposed method has a superior overall performance in terms of the six evaluation criteria. For example, the detection accuracy of the proposed method is the highest among the solutions under test; its detection speed (1/60/0.126=0.13, according to Section VI) is not the fastest but fits well in the HIF detector 14 requirements on response time. The detection time of less than 1 second, which means the minimum speed of 0.017 in a 60 Hz network, is viewed as a conservative setting.
(85) Security Performance Under Inrush Currents:
(86) Inrush currents resulting from transformer energization and motor starting are investigated in this subsection. The power distribution system 12 in
(87) TABLE-US-00008 TABLE VIII Number S Non-fault Event Type of Events (%) Transformer energization 96 100 Motor stating 72 100
(88) Noise Immunity Capability:
(89) Environmental noise can potentially cause false tripping for the HIF detection techniques. The performance of the HIF detector 14 of
(90) The obtained results under these conditions closely match those in Table VII. The proposed technique is designed to be immune to noise for the following reasons. First, the signal processing techniques in both the DFT circuit 16 and the KF circuit 18 in
(91) Third, both the proposed detection logic and the machine learning model to be tested in the next section are immunized to the noise by itself because they are statistically reliable and the spikes whether from noise or from derivatives cannot trigger the whole detection logic or the machine learning model. Fourth, since the high frequency noise is typically very short (few cycles), the time delay that is implemented in the decision logic 24 of
(92) E. Method for Detecting High Impedance Fault
(93)
(94) The process begins at operation 800, with receiving power measurements from a power distribution system. The process optionally continues at operation 802, with performing a DFT of one or more received signals (e.g., a received voltage signal and/or current signal). The process optionally continues at operation 804, with Kalman filtering the one or more received signals (e.g., the received voltage signal and/or the received current signal).
(95) The process continues at operation 806, with extracting an angle difference between a negative sequence voltage and a zero sequence voltage (θ.sub.V.sub.
(96) Although the operations of
(97) IV. Performance Analysis
(98) Performance analysis includes the most commonly occurring single-line-to-ground-fault, the fault scenario analysis, and the testing results.
(99) A. Single-Line-to-Ground Fault Analysis
(100)
(101) B. Fault Scenario Analysis
(102) The EFS has been evaluated in terms of different fault impedances, fault inception angles, and fault locations. The quantifier for evaluation is the variable of importance explained above in Section II-B.
(103)
(104) Fault Impedance:
(105) To be practical, a fault impedance up to 500Ω was evaluated to cover typical HIFs whose fault currents are as low as 10 A. The variable-importance performances of each feature in EFS upon SLG fault, LL fault, LLG fault, and LLLG fault are all depicted. It is concluded that: The negative sequence of voltage and current are the most reliable features that can keep unaffected during any unbalanced fault upon a varying fault impedance (see
(106) Furthermore, the proposed algorithm is applicable to unbalanced power systems. Since the employed feature selection method is based on the information gain, what is captured by the information gain is the incremental or variation of the negative sequence signal. Only when the variation pattern of the negative sequence feature contributes to the information gain given the output label belongs to the HIF, does this feature get selected by the proposed algorithm. As a result, the proposed method is applicable to an already unbalanced system.
(107)
(108) Fault Inception Angle: The effect of fault inception angle was also examined. The results of unbalanced faults and three phase faults are selectively shown in
(109) The fault inception angle is an insignificant factor that can perturb variable importance. The angles of 30° and 60° result in a subtle decrease in the variable of importance of the KF estimated third harmonic, but the change is limited. For an LLLG fault, the first order harmonic components of current estimated by KF have a performance drop in non-zero angles.
(110) Fault Location: The variable-importance of the features in EFS is presented at three fault locations (bus numbers refer to
(111) The result is demonstrated in
(112) C. Testing Results of the Effective Feature Set
(113) The proposed EFS was tested under the aforementioned conditions in Table IV, but with dataset on different events. 1944 HIF events and another 1944 non-HIF events (they do not have to be the same number) are simulated for the training of the HIF detector 14 of
(114) The results with the proposed EFS under different classifiers, shown in Table IX, reveal the effectiveness of the feature selection. To limit problems such as over-fitting and inaccuracy in prediction, each classifier model was acquired through 10-fold cross-validation. The performance with Naive Bayes presented the lowest values. The accuracy of the other five classifiers were all above 90%, and remarkably, the performance of the selected artificial neural network (ANN) classifier was exceptionally good. This shows that the proposed EFS works well with most of the non-linear classifiers in HIF detection. However, due to the limited interpretability and debuggability of these classifiers, some linear logic from the tree based classifiers were adopted.
(115) TABLE-US-00009 TABLE IX Classifier A (%) D (%) S (%) Naïve Bayes 78.0 73.0 82.9 SVM 91.9 89.6 94.1 k-nearest neighbor 98.0 97.7 98.3 Decision Tree 99.4 99.5 99.3 Random Forest 99.7 99.7 99.8 ANN 100.0 100.0 100.0
(116) D. The HIF Detection Logic and Classifiers
(117) Derived from the tree structure of the machine learning classifier model, the proposed HIF detection logic aims to simplify the HIF detection process and be implemented in microprocessor-based relays. The logic complexity of the proposed technique in
(118) The most computationally expensive parts in a machine learning-based HIF detection algorithm are usually the feature extraction and classifier. In the proposed algorithm, both the features and classifiers are obtained from offline simulation. Therefore, the offline simulation is computationally expensive. An Intel i7 CPU suffices for the majority of HIF detection simulation tasks. If the electric network and operational complexity increase significantly compared to the system under study, a GPU is required to conduct the offline simulation and obtain the settings. The proposed HIF detector 14 of
(119) E. Detection Under Low Current Magnitude Levels
(120) The proposed HIF detection technique relies on time domain features. Therefore, its performance can be influenced when the fault current magnitude is low. In the 25 kV benchmark distribution network, the current magnitude in the case study of 500Ω fault impedance goes down to (25000/√3)/500=28.87 A. The proposed EFS and HIF detection logic can still maintain a high detection performance (97.0% accuracy). Further tests indicate that the proposed method can detect a fault current down to 14 A (fault impedance up to 1000Ω) with the detection accuracy of 87.9%. However, as the fault impedance rises, the variable importance of some features like θ.sub.V.sub.
(121) V. Derivation of the Minimum Description Length
(122) Assuming that all features are discrete, the objective is to find the best features that maximize the selection measure. Let C, A, and V denote the number of classes, the number of features, and the number of values of the given feature. With this notation, the following equations show the entropy of the classes (H.sub.C), the values of the given feature (H.sub.A), the joint events class-feature value (H.sub.CA), and the classes given the value of the attribute (H.sub.C|A).
H.sub.C=−Σ.sub.ip.sub.i. log p.sub.i. H.sub.A=−Σ.sub.jp.sub..j log p.sub..j
H.sub.CA=−Σ.sub.iΣ.sub.jp.sub.ij log p.sub.ij H.sub.C|A=H.sub.CA−H.sub.A
where p.sub.ij=n.sub.ij/n.sub..., p.sub.i.=n.sub..., and p.sub..j=n.sub..j/n.sub....Math.n.sub..., denotes the number of training instances and n.sub.i. is the number of training instances from class C.sub.i, n.sub..j is the number of instances with the j-th value of the given attribute, and n.sub.ij is the number of instances from class C.sub.i and with the j-th value of the given attribute.
(123) The approximation of the total number of bits that are needed to encode the classes of n.sub... is:
(124)
and the approximation of the number of bits to encode the classes of examples in all subsets corresponding to all values of the selected attribute is:
(125)
(126) The last term (log A) is needed to encode the selection of an attribute among A attributes. However, this term is constant for a given selection problem and can be ignored. The first term equals n.sub...H.sub.C|A. Therefore, the MDL′ measure evaluates the average compression (per instance) of the message by an attribute. The measure is defined by the difference Prior MDL′−Post MDL′, normalized with n.sub...:
(127)
(128) However, entropy H.sub.C can be used to derive MDL′ if the messages are of arbitrary length. If the length of the message is known, the more optimal coding uses the logarithm of all possible combinations of class labels for given probability distribution:
(129)
(130) Similarly, if the priori minus the posterior of the MDL is used, Equation 1 is obtained. The MDL value in Equation 1 is the evaluation index we deployed for the variable-importance approach.
(131) VI. Definition to the HIF Detection Evaluation Criteria
(132) The concepts of true positive (TP), true negative (TN), false positive (FP), and false negative (FN) are borrowed from statistical classification. These concepts and their related evaluation criteria are defined as follows: TP: the number of correctly detected fault events. TN: the number of correctly detected non-fault events. FP: the number of incorrectly detected fault events that are actually non-fault events. FN: the number of incorrectly detected non-fault events that are actually fault events. Accuracy:
(133)
(134)
(135)
(136)
(137) Those skilled in the art will recognize improvements and modifications to the preferred embodiments of the present disclosure. All such improvements and modifications are considered within the scope of the concepts disclosed herein and the claims that follow.