FEATURE GENERATION BASED ON EIGENFUNCTIONS OF THE SCHRÖDINGER OPERATOR
20220133242 · 2022-05-05
Inventors
Cpc classification
A61B5/7285
HUMAN NECESSITIES
G16H50/20
PHYSICS
G16H50/70
PHYSICS
A61B5/7278
HUMAN NECESSITIES
International classification
Abstract
A method for generating a feature associated with input data includes receiving the input data; projecting the input data with a set of square functions ψ.sub.nh.sup.2 of the Schrödinger operator; selecting the feature to be a number of the negative eigenvalues λ.sub.nh of the Schrödinger operator; and classifying the input data based on the feature.
Claims
1. A method for generating a feature associated with input data, the method comprising: receiving the input data; projecting the input data with a set of square functions ψ.sub.nh.sup.2 of the Schrödinger operator; selecting the feature to be a number of the negative eigenvalues λ.sub.nh of the Schrödinger operator; and classifying the input data based on the feature.
2. The method of claim 1, wherein the set of square functions ψ.sub.nh.sup.2 is associated with the negative eigenvalues λ.sub.nh of the Schrödinger operator.
3. The method of claim 1, further comprising: splitting the input data into frames.
4. The method of claim 3, further comprising: concatenating plural signals from a frame to form a single signal.
5. The method of claim 4, further comprising: using the single signal as a potential for the Schrödinger operator.
6. The method of claim 5, further comprising: reconstructing the single signal using the set of square functions ψ.sub.nh.sup.2 of the Schrödinger operator and the number of the negative eigenvalues λ.sub.nh of the Schrödinger operator.
7. The method of claim 6, further comprising: identifying a peak of the reconstructed single signal.
8. The method of claim 7, wherein the step of classifying comprises: classifying the input data based on the peak.
9. The method of claim 1, wherein the input data is a magnetoencephalography signal.
10. The method of claim 9, wherein the step of classifying comprises: identifying a signal from the input data that indicates an epileptic patient.
11. The method of claim 1, wherein the feature is a minimum number of negative eigenvalues for each of the frames.
12. A computing device for generating a feature associated with input data, the computing device comprising: an interface for receiving the input data; and a processor connected to the interface and configured to, project the input data with a set of square functions ψ.sub.nh.sup.2 of the Schrödinger operator; select the feature to be a number of the negative eigenvalues λ.sub.nh of the Schrödinger operator; and classify the input data based on the feature.
13. The computing device of claim 12, wherein the set of square functions ψ.sub.nh.sup.2 is associated with the negative eigenvalues λ.sub.nh of the Schrödinger operator.
14. The computing device of claim 12, wherein the processor is further configured to: split the input data into frames; and concatenate plural signals from a frame to form a single signal.
15. The computing device of claim 14, wherein the processor is further configured to: use the single signal as a potential for the Schrödinger operator; and reconstruct the single signal using the set of square functions ψ.sub.nh.sup.2 of the Schrödinger operator and the number of the negative eigenvalues λ.sub.nh of the Schrödinger operator.
16. The computing device of claim 15, wherein the processor is further configured to: identify a peak of the reconstructed single signal; and classify the input data based on the peak.
17. The computing device of claim 12, wherein the input data is a magnetoencephalography signal and wherein the processor is further configured to identify a signal from the input data that indicates an epileptic patient.
18. A non-transitory computer readable medium including computer executable instructions, wherein the instructions, when executed by a processor, implement instructions for generating a feature associated with input data, the instructions comprising: receiving the input data; projecting the input data with a set of square functions ψ.sub.nh.sup.2 of the Schrödinger operator; selecting the feature to be a number of the negative eigenvalues λ.sub.nh of the Schrödinger operator; and classifying the input data based on the feature.
19. The medium of claim 18, wherein the set of square functions ψ.sub.nh.sup.2 is associated with the negative eigenvalues λ.sub.nh of the Schrödinger operator.
20. The medium of claim 18, further comprising: splitting the input data into frames; concatenating plural signals from a frame to form a single signal; using the single signal as a potential for the Schrödinger operator; reconstructing the single signal using the set of square functions ψ.sub.nh.sup.2 of the Schrödinger operator and the number of the negative eigenvalues λ.sub.nh of the Schrödinger operator; identifying a peak of the reconstructed single signal; and classifying the input data based on the peak.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0012] The accompanying drawings, which are incorporated in and constitute a part of the specification, illustrate one or more embodiments and, together with the description, explain these embodiments. In the drawings:
[0013]
[0014]
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
DETAILED DESCRIPTION
[0023] The following description of the embodiments refers to the accompanying drawings. The same reference numbers in different drawings identify the same or similar elements. The following detailed description does not limit the invention. Instead, the scope of the invention is defined by the appended claims. For simplicity, the following embodiments are discussed with regard to MEG signals. However, the methods and systems discussed herein are equally applicable to any signal that exhibits peaks. For example, the methods discussed herein can be applied to water peak estimation, water suppression signal in magnetic resonance spectroscopy (MRS) signals, MRS signal denoising, pulse-shaped signal decomposition and denoising, etc. The novel methods can be integrated in any processing unit to process biomedical signals such as MRS signals, electroencephalogram (EEG) signals, or any other pulse-shaped signal.
[0024] Reference throughout the specification to “one embodiment” or “an embodiment” means that a particular feature, structure or characteristic described in connection with an embodiment is included in at least one embodiment of the subject matter disclosed. Thus, the appearance of the phrases “in one embodiment” or “in an embodiment” in various places throughout the specification is not necessarily referring to the same embodiment. Further, the particular features, structures or characteristics may be combined in any suitable manner in one or more embodiments.
[0025] According to an embodiment, there is a method that introduces a new characterization of signals. The signals may be spectrum data, biomedical signals or any other type of signal. This new characterization generates new features that can be used for the classification of the signals. The proposed feature generation technique is based on the semi-classical analysis (SCSA) method, which includes the projection of the input signal into a set of functions given by the squared eigenfunctions of the Schrödinger operator associated to the negative eigenvalues, and whose potential is given by the input signal. The computed eigenvalues, eigenfunctions and their different combinations introduce new types of features, which can be used all together or in different combination forms to provide a suitable and accurate discrimination of the data signals.
[0026] More specifically, as illustrated in
[0027] The process illustrated in
[0028] A signal 303 from the input data 302 (i.e., an MEG signal from a healthy subject) and a signal 305 from the input data 304 (i.e., an MEG signal from an epileptic subject) are shown in
[0029] In step 202, the input signal 300 is split into frames, for example, using sliding frames. In one application, a sliding frame 306 includes 100 sample points and the sliding frame slides with a step of 2 samples, i.e., the frame 306 is moved 2 samples and another 100 samples points are considered for a second sliding frame, and so on. The inset of
[0030] The signals from each frame 306 are then concatenated in step 204 to build a classification dataset 310. Note that for this specific embodiment, the MEG signals include 26 channels, i.e., 26 different sensors have been used to collect each MEG signal. For other types of signals, the number of sensors may be fewer or more. Regardless of the number of sensors, the signals in each channel are concatenated for each given frame, to generate a single signal. Two different classes are defined in this embodiment, the negative samples 312 and the positive samples 314. The negative samples 312 include the frames from the healthy subjects 302 and the positive samples 314 include the frames from the epileptic subjects 304. Each class includes the same number of frames.
[0031] In step 206, one or more features is generated for these signals. For this step, the semi-classical signal analysis (SCSA) 320 is applied. The SCSA analysis is now discussed in more detail. The SCSA uses signal-dependent functions given by the squared eigenfunctions of the Schrödinger operator to decompose the signal (see, for example, [4] and [5]). The potential V of the Schrödinger operator H(y), in this case, is given by the positive function y(t) representing the signal. The Schrödinger operator is written as follows:
where H is the Schrödinger operator, V is the potential, h is a constant, t is the time, and d indicates a derivative. For the SCSA analysis, the potential V is selected to be the positive function y(t) representing the signal, which means that equation (1) becomes:
The Schrödinger equation for the SCSA analysis is given by:
H(y)ψ(t)=λψ(t), (3)
where ψ(t) is the eigenfunction of the Schrödinger operator, and is the eigenvalue of the Schrödinger operator.
[0032] Based on the SCSA analysis, a real positive input signal y(t) can be approximated by plural signals y.sub.h(t), which are given by:
where n varies from 1 to N.sub.h, and n represents the negative eigenvalues λ.sub.nh of the Schrödinger operator H(y). In addition, λ.sub.1h< . . . <λ.sub.nh<0. The accuracy of the reconstructed signal y.sub.h(t) depends on the value of h and also on how many negative eigenvalues λ.sub.nh are used. Equation (4) provides an exact reconstruction of the original signal y(t) when h converges to zero. When the value of h decreases, the number of eigenvalues λ.sub.nh increases and the reconstruction improves. However, as the h converges to zero, the calculation amount increases and may become unpractical for a practical application for which the computer power is limited. Thus, for a real situation, a balance between the value of h and the amount of computational power necessary to reconstruct the signal needs to be found.
[0033] The SCSA analysis is used in step 206 to generate a feature associated with the signals 302 and 304. The SCSA analysis has been used for reconstruction and de-noising of some biomedical signals such as the Magnetic Resonance Spectroscopy (MRS) spectra and the Arterial Blood Pressure (ABP) [6], [7]. Due to the localized and shape-dependent structure of the squared eigenfunctions ψ.sub.nh.sup.2 of the Schrödinger operator, the SCSA analysis introduces an effective analysis tool for pulse shaped signals (signals with peaks).
[0034] In this embodiment, the feature 330 is related to the number of negative eigenvalues that are used to reconstruct the signal for each frame 306. For this purpose, the parameter N.sub.h.sup.* is introduced as being the lower feature size and it is defined as:
N.sub.h.sup.*=min(N.sub.h1,N.sub.h2,N.sub.hM), (5)
where N.sub.hi is the number of negative eigenvalues of the i.sup.th frame for a given value h, and M is number of frames. This means that for each frame i, a corresponding number N.sub.hi of negative eigenvalues λnh is selected in step 206 to reconstruct the signal for that frame, and then, based on equation (5), the minimum number of negative eigenvalues is selected for all the frames. Thus, for each frame of the M frames used in these calculations, only N.sub.h.sup.* negative eigenvalues are used for reconstructing the signals, and the N.sub.h.sup.* negative eigenvalues is the generated feature 330.
[0035] One way to select the number N.sub.hi of negative eigenvalues λ.sub.nh that is used to reconstruct the signal for each frame, is to define a set threshold value. Then, for a given instant, reconstruct the signal with a given number of negative eigenvalues λ.sub.nh and calculate a different between the original signal and the reconstructed signal at the given instant. If the difference is smaller than the set threshold value, the given number of negative eigenvalues corresponds to N.sub.hi. If not, increase the given number of negative eigenvalues and evaluate again the difference between the original signal and the reconstructed signal. Repeat this process until the difference is smaller than the set threshold value and that is the value of the given number of negative eigenvalues. This is only one way to determine the N.sub.hi for each frame. Other criteria may be used for selecting the negative eigenvalues λ.sub.nh that reproduce the original signal for each frame.
[0036] The feature 330 is fed in step 208 to a classifier 340 for classifying the reconstructed signals. The classifier 340 may be, for example, a Support Vector Machine (SVM) predictive model. The SVM model may be developed in 5-fold cross-validation (CV) process with the following subjects: 1734 spiky frames and 1734 healthy frames from different MEG test sessions of the eight healthy and eight epileptic patients.
[0037] The performance of the classifier 340 has been measured using the average accuracy, the sensitivity, the specificity and other metrics defined as follows:
where TP.sub.n, FP.sub.n, TN.sub.n, and FN.sub.n are the True Positive, False Positive, True Negative, and False Negative values, respectively, for the n.sup.th fold. Note that these values are calculated by comparing the results of the classifier 340 made in step 208, and the actual peaks determined by the expert neurologists based on the input data 300.
[0038] Table 1 (see
[0039] The results of the method illustrated in
[0040] A method for generating a feature having a small size is now discussed with regard to
[0041] The methods discussed herein advantageously present a new feature generation and dimensionality reduction algorithm. While the algorithm has been presented as a specific application for epileptic spikes detection in MEG signals, the same algorithm may be used for any signal that requires spike identification. The algorithm projects an input signal into the discrete spectrum of the Schrödinger operator and then selects a number of eigenvalues to be used for regenerating the signal from the eigenfunctions of the Schrödinger operator. As illustrated in FIGS. 6 and 7, the novel algorithm obtains the highest sensitivity up to 92.52% with a specificity of 89.10% for a given dataset.
[0042] The above-discussed procedures and methods may be implemented in a computing device as illustrated in
[0043] Computing device 1000 suitable for performing the activities described in the embodiments discussed above may include a server 1001. Such a server 1001 may include a central processor (CPU) 1002 coupled to a random access memory (RAM) 1004 and to a read-only memory (ROM) 1006. ROM 1006 may also be other types of storage media to store programs, such as programmable ROM (PROM), erasable PROM (EPROM), etc. Processor 1002 may communicate with other internal and external components through input/output (I/O) circuitry 1008 and bussing 1010 to provide control signals and the like. Processor 1002 carries out a variety of functions as are known in the art, as dictated by software and/or firmware instructions.
[0044] Server 1001 may also include one or more data storage devices, including hard drives 1012, CD-ROM drives 1014 and other hardware capable of reading and/or storing information, such as DVD, etc. In one embodiment, software for carrying out the above-discussed steps may be stored and distributed on a CD-ROM or DVD 1016, a USB storage device 1018 or other form of media capable of portably storing information. These storage media may be inserted into, and read by, devices such as CD-ROM drive 1014, disk drive 1012, etc. Server 1001 may be coupled to a display 1020, which may be any type of known display or presentation screen, such as LCD, plasma display, cathode ray tube (CRT), etc. A user input interface 1022 is provided, including one or more user interface mechanisms such as a mouse, keyboard, microphone, touchpad, touch screen, voice-recognition system, etc.
[0045] Server 1001 may be coupled to other devices, such as medical instruments, detectors, sensors, etc. The server may be part of a larger network configuration as in a global area network (GAN) such as the Internet 1028, which allows ultimate connection to various landline and/or mobile computing devices.
[0046] The disclosed embodiments provide a method and system that is capable to detect and classify peaks associated with one or more signals. The embodiments are intended to cover alternatives, modifications and equivalents, which are included in the spirit and scope of the invention as defined by the appended claims. Further, in the detailed description of the embodiments, numerous specific details are set forth in order to provide a comprehensive understanding of the claimed invention. However, one skilled in the art would understand that various embodiments may be practiced without such specific details.
[0047] Although the features and elements of the present embodiments are described in the embodiments in particular combinations, each feature or element can be used alone without the other features and elements of the embodiments or in various combinations with or without other features and elements disclosed herein.
[0048] This written description uses examples of the subject matter disclosed to enable any person skilled in the art to practice the same, including making and using any devices or systems and performing any incorporated methods. The patentable scope of the subject matter is defined by the claims, and may include other examples that occur to those skilled in the art. Such other examples are intended to be within the scope of the claims.
REFERENCES
[0049] [1] A. Ossadtchi, S. Baillet, J. C. Mosher, D. Thyerlei, W. Sutherling, and R. M. Leahy, “Automated interictal spike detection and source localization in magnetoencephalography using independent components analysis and spatiotemporal clustering,” Clinical Neurophysiology, vol. 115, no. 3, pp. 508-522, 2004. [0050] [2] M. I. Khalid, T. Alotaiby, S. A. Aldosari, S. A. Alshebeili, M. H. Al-Hameed, F. S. Y. Almohammed, and T. S. Alotaibi, “Epileptic MEG Spikes Detection Using Common Spatial Patterns and Linear Discriminant Analysis,” IEEE Access, vol. 4, pp. 4629-4634, 2016. [0051] [3] M. I. Khalid, T. N. Alotaiby, S. A. Aldosari, S. A. Alshebeili, M. H. Alhameed, and V. Poghosyan, “Epileptic MEG Spikes Detection Using Amplitude Thresholding and Dynamic Time Warping,” IEEE Access, vol. 5, pp. 11 658-11 667, 2017. [0052] [4] T.-M. Laleg-Kirati, E. Crepeau, and M. Sorine, “Semi-classical signal analysis,” Mathematics of Control, Signals, and Systems, vol. 25, no. 1, pp. 37-61, 2013. [0053] [5] B. Helffer and T.-M. Laleg-Kirati, “On semi-classical questions related to signal analysis,” Asymptotic Analysis. [0054] [6] T.-M. Laleg, E. Crepeau, Y. Papelier, and M. Sorine, “Arterial blood pressure analysis based on scattering transform i,” in Engineering in Medicine and Biology Society, 2007. EMBS 2007. 29th Annual International Conference of the IEEE. IEEE, 2007, pp. 5326-5329. [0055] [7] T. Laleg-Kirati, J. Zhang, E. Achten, and H. Serrai, “Spectral data de-noising using semi-classical signal analysis: application to localized MRS,” NMR in Biomedicine, vol. 29, no. 10, pp. 1477-1485, 2016.