METHODS AND SYSTEMS TO QUANTIFY AND REMOVE ASYNCHRONOUS NOISE IN BIOPHYSICAL SIGNALS
20220142583 · 2022-05-12
Inventors
- Michael Garrett (Wilmette, IL, US)
- Timothy William Fawcett Burton (Toronto, CA)
- Shyamlal Ramchandani (Kingston, CA)
- Abhinav Doomra (Toronto, CA)
Cpc classification
A61B5/7282
HUMAN NECESSITIES
A61B5/7221
HUMAN NECESSITIES
A61B5/349
HUMAN NECESSITIES
A61B5/242
HUMAN NECESSITIES
G16H50/70
PHYSICS
A61B5/02007
HUMAN NECESSITIES
A61B5/7278
HUMAN NECESSITIES
International classification
A61B5/00
HUMAN NECESSITIES
A61B5/349
HUMAN NECESSITIES
G06F16/28
PHYSICS
Abstract
The exemplified methods and systems described herein facilitate the quantification and/or removal of asynchronous noise, such as muscle artifact noise contamination, to more accurately assess complex nonlinear variabilities in quasi-periodic biophysical-signal systems such as those in acquired cardiac signals, brain signals, etc.
Claims
1. A method to filter asynchronous noise from an acquired biophysical-signal data set, the method comprising: receiving, by a processor, a biophysical-signal data set of a subject; determining, by the processor, at least one template-signal vector data set characteristic of a representative quasi-periodic signal pattern of the subject from a plurality of detected quasi-periodic cycles detected in the received biophysical-signal data set; applying, by the processor, the at least one determined template-signal vector data set to one or more denoising vector data sets,; and generating a filtered biophysical-signal data set of the biophysical-signal data set, or a portion thereof, by merging the portion of the received biophysical-signal data set to be filtered and the one or more generated denoising vector data sets.
2-30. (canceled)
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0076] The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments and together with the description, serve to explain the principles of the methods and systems contained herein. The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
[0077] Embodiments of the present invention may be better understood from the following detailed description when read in conjunction with the accompanying drawings. The drawings include the following figures:
[0078]
[0079]
[0080]
[0081]
[0082]
[0083]
[0084]
[0085]
[0086]
[0087]
[0088]
[0089]
[0090]
[0091]
[0092]
[0093]
[0094]
[0095]
[0096]
[0097]
DETAILED SPECIFICATION
[0098] Each and every feature described herein, and each and every combination of two or more of such features, is included within the scope of the present invention provided that the features included in such a combination are not mutually inconsistent.
[0099] Example System
[0100]
[0101] In
[0102] In some embodiments, and as shown in
[0103] The measurement system 102, in some embodiments, includes a biopotential-based measurement system configured to acquire wide-band biopotential biophysical signals. In the electrocardiography context, the measurement system 102 is configured to capture cardiac-related biopotential or electrophysiological signals of a mammalian subject (such as a human) as wide-band cardiac phase gradient signals. An example of the measurement system 102 is described in U.S. Publication No. 2017/0119272 and in U.S. patent application Ser. No. 15/248,838, each of which is incorporated by reference herein in its entirety.
[0104] In some embodiments, the wide-band biopotential biophysical signals are captured as unfiltered mammalian electrophysiological signals such that the spectral component(s) of the signals are not altered. Indeed, the wide-band biopotential biophysical signals are captured, converted, and even analyzed without having been filtered (via, e.g., hardware circuitry and/or digital signal processing techniques, etc.) (e.g., prior to digitization) that otherwise can affect the phase linearity of the biophysical signal of interest. In some embodiments, the wide-band biopotential biophysical signals are captured in in microvolt or sub-microvolt resolutions that are at, or significantly below, the noise floor of conventional electrocardiographic and other biophysical-signal acquisition instruments. In some embodiments, the wide-band biopotential biophysical signals are simultaneously sampled having a temporal skew or “lag” of less than about 1 microseconds, and in other embodiments, having a temporal skew or lag of not more than about 10 femtoseconds. Notably, the exemplified system minimizes non-linear distortions (e.g., those that can be introduced via certain filters) in the acquired wide-band phase gradient signal to not affect the information therein.
[0105] As noted above, the measurement system 102 may be used to capture other mammalian biopotential or electrophysiological signals, such as, e.g., cerebral/neurological biopotential signals or other mammalian biopotential signals associated with various biological systems as described elsewhere herein.
[0106] Referring still to
[0107] In some embodiments, the measurement system 102 is configured to assess the signal quality of the acquired biophysical signal and to reject the acquired signal data set based on such assessment.
[0108] Because the clinical analysis of the acquired biophysical signal 108 is performed, in some embodiments, on a separate system (e.g., by the assessment system 110) from the measurement system 102, a signal quality check ensures that the acquired biophysical-signal data set 108 is suitable for subsequent clinical analysis. The near real-time operation may facilitate the prompting of the re-acquisition of the biophysical-signal data set by the non-invasive measurement system 102, thus, ensuring that the acquired biophysical-signal data set is not contaminated by asynchronous noise (such as skeletal-muscle-related noise) prior to the biophysical-signal data set being subjected, or made available, to further processing and analysis for a clinical assessment.
[0109] In some embodiments, the signal quality assessment operation is performed in near real-time, e.g., less than about 1 minute or less than about 5 minutes, to which the system can prompt for the re-acquisition of the biophysical-signal data set. The near real-time assessment allows the re-acquisition of the biophysical-signal data set prior to the patient leaving the testing room where the test is conducted.
[0110] In some embodiments, the non-invasive measurement system 102 is configured to generate a notification 126 (labeled in
[0111] In some embodiments, the rejected biophysical-signal data set may be stored (128) for further troubleshooting analysis (132) of defects that led to the rejection of the acquired signal. To this end, the rejected biophysical-signal data set is not used in subsequent analysis (e.g., 120, 124) to yield the clinical output 112.
[0112]
[0113] Asynchronous Noise Removal
[0114]
[0115] Other conventional electrode sets, and electrographic acquisition methodologies may be used to which the method and system disclosed herein can be applied.
[0116] Referring still to
[0117] Referring still to
[0118] Referring still to
[0119] In other embodiments, the merging operation is performed in the time domain.
[0120]
[0121] Referring still to
[0122] Step 306 includes populating, by a processor, the template vector data set with the representative vector data set(s). That is, in some embodiments, for each detected cycle in the raw signal data set, method 118 includes placing or duplicating the representative vector data set 602 in the template vector data set. Each of the representative vector data set 602 is placed such that a determined peak (e.g., R-peak) of the representative vector data set 602 is aligned to a same, or similar, time-index as a corresponding peak (e.g., R-peak) of each detected cycle.
[0123]
[0124] It is possible that more than one representative vector data sets may exist with each corresponding to an assessed quasi-periodic signal pattern (e.g., heat-beat pattern for cardiac signals). When there are more than one representative cycle data sets, then method 118, in some embodiments, further includes placing a representative cycle data set selected to correspond (i.e., more closely matches) to a given current cycle in the raw signal data set.
[0125] Referring still to
[0126] Once the template vector data set 702 has been created, method 118 includes initiating (step 310 in
[0127] In some embodiments, each window has a window size of about 0.25 seconds. In other embodiments, a window size less than about 0.25 seconds is used. In yet other embodiments, a window size greater than about 0.25 seconds is used.
[0128] In some embodiments, a static value for the interpolation coefficient is about 0.75 (that is, the influence attributed to the template vector data set is about three times that of the raw signal data set). In other embodiments, the interpolation coefficient has a value less than about 0.75. In yet other embodiments, the interpolation coefficient has a value greater than about 0.75. The values of the interpolation coefficient and the window size can be assessed based on the need to eliminate the noise versus that to maintain signal variability. In some embodiments, the window size or the interpolation coefficients are allowed to vary dynamically, e.g., based on an assessment of the signal, e.g., change with respect to an automatically quantified level of contamination (e.g., skeletal-muscle related contamination) in the signal.
[0129] Referring still to
[0130] Referring still to
[0131] Referring still to
[0132] Step 316 includes performing, by a processor, a Fast Fourier Transform (e.g., discrete FFT) of each of the modified template vector envelope data set and the modified raw signal data set to transform each of them into the frequency domain.
[0133] Step 318 includes merging, by a processor, the modified template vector envelope data set and the modified raw signal data set in the frequency domain. In some embodiments, a weighted average operation of the modified template vector envelope data set and the modified raw signal data set in the frequency domain is performed. In some embodiments, the weights used in the interpolation are the interpolation coefficients that was initially set to control the influence of the template vector against the raw signal.
[0134] Step 320 includes performing, by a processor, an inverse Fourier Transform operation to transform the resultant data back to the time domain. The resultant data is assigned as a current window of the denoised signal data set. The process is repeated for all the windows, or a portion thereof, to populate the remaining portion of the denoised signal data set.
[0135]
[0136] As shown in
[0137] Indeed, methods described herein involve generating a filtered cardiac signal (namely, the denoised signal) of the cardiac signal, or a portion thereof, by merging the portion of the received biophysical signal to be filtered (e.g., as the modified raw signal) and the one or more generated denoising vectors (e.g., as the modified template vector envelope).
[0138] Determination of a Representative Cycle of a Quasi-Periodic Signal Pattern
[0139] As noted above,
[0140] As shown in
[0141] In some embodiments, the system is configured to assess the number of cycles and boundaries of the cycles in the biophysical-signal data set (e.g., data set 108) to which subsets of the cycles in determined groups of neighboring cycles are subsequently used to determine template-signal vector data sets of representative cycles. In some embodiments, the system is configured to assess the boundaries of the cycles in the entire biophysical-signal data set, or the portion desired to be analyzed. In other embodiments, the system is configured to assess the boundaries of the cycles for a pre-defined number of neighboring cycles in the portion of the biophysical-signal data set (e.g., data set 108) of interest.
[0142] Neighborhood/neighboring cycles may be defined as, in some embodiments, as ±1, 2, . . . 10 cycles around a middle cycle of a set of determined cycles. In other embodiments, the neighborhood/neighboring cycles may be defined as, +1, 2, . . . 20 cycles with respect to a beginning cycle of a set of determined cycles. In other embodiments, the neighborhood/neighboring cycles may be defined as −1, 2, . . . 20 cycles with respect to a last cycle of a set of determined cycles.
[0143] To this end, multiple template-signal vector data sets may be generated to which each template-signal vector data set is respectively applied to the cycles used to generate it. For example, where neighborhood group 1 is composed of cycles 1 . . . 10 and derives template vector # 1, analysis of cycles 1-10 (e.g., as discussed herein) are evaluated against only template vector #1; where neighborhood group 2 is composed of cycles 5 . . . 15 (e.g., having some and derives template vector # 2), analysis of cycles 5-10 (e.g., as discussed herein) are evaluated against template vector #1 and template vector #2 (e.g., by an average of vector #1 and #2); where neighborhood group 3 is composed of cycles 10 . . . 20 (e.g., having some and derives template vector # 2), analysis of cycles 10-15 (e.g., as discussed herein) are evaluated against template vector #2 and template vector #3 (e.g., by an average of vector #2 and #3).
[0144] Indeed, in some embodiments, the analysis is performed until all cardiac cycles (e.g., 3.5-minute PSR recording×60 BPM=210), or portions of the biophysical-signal data set of interest, have been evaluated.
[0145] This neighborhood approach may reduce sensitivity to long-term variation by only incorporating local cycles into the template though may also reduce robustness to noise because fewer component cycles are used or analyzed. Indeed, using the full recording may capture natural cardiac variation in the entire data set but may also create non-noise-based deviations between the template and the test cardiac cycles. By using all of the signals in neighborhoods, but in neighboring groups, all of the signals (and inherent variation in the acquired signal) are still assessed and sensitivity is locally improved.
[0146] The number of neighborhood size may be 10, e.g., as discussed above, or it may be user-defined parameter. In some embodiments, the neighborhood size is determined based on some assessed variation in the signals. Indeed, the number of neighborhood size may be 2, 3, 4, 5, 6, 7, 8, 9, 10. In some embodiments, the number of number of neighborhood size may be greater than 10, e.g., between 10 or 15. In some embodiments, the number of number of neighborhood size may be greater 15, e.g., between 15-25. In some embodiments, the number of number of neighborhood size may be greater 25, e.g., between 25-50.
[0147] In some embodiments, the neighborhood or groupings of cycles are defined by an offset size. In some embodiments, the offset size is the distance in the index count from a reference point in one cycle to the next cycle. The reference point may be a middle point, a beginning point, or an ending point in the cycle. In the example above, where cycles are defined from 1 . . . 10, 5 . . . 15, 10 . . . 20, etc., the offset size is 5 (per the reference point being at the beginning, middle, or end).
[0148] In some embodiments, depending on the offset size and neighborhood size, each given cycle may have one or more template-signal vector data sets compared to it to determine a metric (e.g., mean, median, mode, among others as discussed herein) for that template-signal vector data set. Then the metric can be combined to provide a revised score for that template-signal vector data set.
[0149] For example, a template-signal vector data set may be defined as preceding and tailing neighboring points of a reference point defined in the middle of a given defined cycle. The template-signal vector data set can be generated (e.g., based on mean, mode, median, etc.) based only on the preceding and tailing neighboring points (and not on the reference point defined in the middle of the cycle). Once the template-signal vector data set is generated, the template-signal vector data set is compared to the middle of a given defined cycle to determine a score for that score.
[0150] In some embodiments, the analysis can be iterative where the score for subsequent cycles are combined. For example, in cycle #1, a score #1 is determined for cycle #2. Then, for cycle #2, a score #2 is determined based on a local score determined from only cycle #2 and then having that local score combined with the score from cycle #1. Then, for cycle #3, a score #3 is determined based on a local score determined from only cycle #3 and then having that local score combined with the score from cycles #1 and #2. This iterative analysis can be applied for all or portion of the input data set of interest.
[0151] Indeed, the system may choose to only apply the template to the single cycle at, e.g., the exact center of the neighborhood. As discussed, every single cycle is then assessed against a single template, and that single template is unique across all the possible templates. This type of analysis provides different vantages of viewing local effects of the cycles.
[0152] Method 400 includes using (step 404), by a processor, the detected peak locations to determine a median peak-to-peak interval (e.g., median R-R peak for a cardiac signal) and to set a cycle region around each peak (e.g., R-peak for a cardiac signal). For cardiac signals, the cycle region is set around the R-peak and includes both the P wave and completion of the T wave.
[0153] Referring to
[0154] In some embodiments, the centering operation includes the operation of time-aligning the same feature (e.g., peaks) among the waveforms. Examples of these features include, for cardiac signals, an initiation of the Q wave, a peak of the R wave, or a delay estimate determined by a cross correlation operation, among others. In some embodiments, the amplitude normalization operation uses features of the QRS waveform as a basis to determine gain term (e.g., a short average may be taken just prior to the QRS).
[0155] In other embodiments, each cycle is normalized according to z-scores. Z-score value for a given data point in the template signal vector data set can be calculated as a difference between the value of the given data point and a mean of a set of cycles in which the difference is then normalized by the standard deviation of that given data point to the same indexed data value of the set of cycles. In some embodiments, the z-score may be outputted as a cycle variability score. Cycle variability may refer to the degree of variability between cycles in an acquired biophysical data set that may be attributed to asynchronous noise, among others.
[0156] Referring still to
[0157] Referring still to
[0158] Referring still to
[0159] As noted above,
[0160] Put another way, the mean beat can be used to generate a “cleaner” representation of the cycle (i.e., less high-frequency content, where that high-frequency content includes both signal and noise characteristics), whereas the median beat contains that high-frequency content. Either one of these approaches may be more desired depending on the situation. For example, the median beat may be used when i) it is desired to ensure that the high-frequency component of the biophysical signal characteristics is captured and maintained for analysis even if there are some high-frequency noise present that could cloud the analysis or ii) there is little or low high-frequency noise in the signal.
[0161] The process of
[0162] In some embodiments, portions of the resulting windows that are neighbors within a set of windows are combined and assessed (e.g., to generate the template signal or to reject a signal).
[0163] Once is cycle is identified (e.g., in each of these cases), the identified cycles can be stacked (i.e., plotted or arranged on top of each other). For example, cycle 1 data point 1 is placed at x=1 and cycle 1 data point 6000 is placed at x=6000; then cycle 150 (for example) data point 1 is also placed at x=1 and cycle 150 data point 6000 is also placed at x=6000.
[0164] In
[0165] Notably,
[0166] Quantification of Skeletal-muscle Artifact Noise Contamination in a Biophysical Signal
[0167]
[0168] Method 500 includes steps 402-412 as discussed in relation to
[0169] Method 500 further includes comparing each detected cycle in the raw signal data set cycle to the representative cycle data set. The comparison is performed by, first, phase-aligning (step 502) the representative cycle with each of the cycles under examination. In some embodiments, a method such as finding the maximum of the cross-correlation is used.
[0170] The comparison further includes determining (step 504) a difference between the representative cycle data set and the phase-aligned cycle under examination. In some embodiments, a method such as correlation between the two signals is used. In other embodiments, a median absolute error is used. In yet other embodiments, a mean absolute error is used. If there is more than one representative cycle data set (as, e.g., detected through clustering on the two-dimensional PCA output), then corresponding representative cycle data set that most match a given cycle is used.
[0171] The comparison further includes differentiating (step 506) outlying cycles and inlying cycles based on a difference score determined, e.g., using a distribution-based filter. In some embodiments, the distribution-based filter is configured to identify cycles having a standard-deviation greater than one from the mean.
[0172] Without wishing to be bound to a particular theory, the presence of outlying cycles can be attributed to several factors, including noise introduced by physiological variability of the biophysical signals and underlying physiological system under study. For cardiac signals, the outlying cycles may be due to variability in the length and/or energy of depolarization or repolarization cycles, among others.
[0173] Discussion
[0174] As noted above, quantification of asynchronous noise contamination such as skeletal-muscle-related artifact and noise contamination in a biophysical signal (such as a cardiac signal) can be complex. Skeletal-muscle-related artifact and noise, for example, can appear as in-band noise with respect to the biophysical signal—that is, it can occur in the same frequency range as the dominant components of the biophysical signal, typically around 0.5 Hz-80 Hz for cardiac signals and around 0.1-50 Hz for brain signals. Further, EMG can also have a similar amplitude as typical cardiac or brain waveform.
[0175] Similarity of skeletal-muscle-related artifact and noise contamination to the biophysical signal can cause issues for automated diagnostic analysis of such signals, and therefore, quantifying the level of skeletal-muscle-related artifact and noise contamination in a signal can facilitate the automated rejection of signals that are likely to be unsuccessful in subsequent analyses and/or the compensation for such contamination in subsequent analyses.
[0176] When quantifying the level of skeletal-muscle-related artifact and noise in a biophysical signal, particularly for cardiac signals, it is observed that skeletal-muscle-related artifact and noise is not in synchrony with the biophysical signal. Because the sources of the two are different (i.e., whereby the cardiac signal is derived from the summation of the action potentials of the cardiac myocytes, while the EMG is derived from the summation of the action potentials of the originating muscle (such as the pectoral muscles, biceps, triceps, etc.)), the sources are unlikely to share a deeper common source that could create synchronicity. Indeed, skeletal-muscle-related artifact and noise can be quantified by comparing each cardiac cycle to the idealized cardiac cycle for that patient in which the gross differences can be accounted by the presence of skeletal-muscle-related artifact and noise contamination in the biophysical signal.
[0177] In the same way that skeletal-muscle-related artifact and noise quantification is a problem (e.g., skeletal-muscle-related artifact and noise being in-band with the physical signal), so is the challenge of skeletal-muscle-related artifact and noise denoising.
[0178] By leveraging the same insight from skeletal-muscle-related artifact and noise quantification, a time-series data set of the representative cycles can be generated to which a frequency-based analysis or time-based analysis can be performed to remove, or reduce, the skeletal-muscle-related artifact and noise and other asynchronous contamination.
[0179] Indeed, in some embodiments, a sample-by-sample comparison of the original signal in the frequency domain can be performed followed by a frequency domain denoising operation between the signals to derive the denoised signals based on a spectral mask determined from the representative cycle vector and using that to mask noise features in the original signal in frequency domain. The exemplified denoising approach leverages the robust information contained in the representative cycle along with the information on the variation of the biophysical signal contained in the raw data.
[0180] The exemplified methods and systems is demonstrated above in relation to cardiac signals. It is noted that exemplified methods and systems can be applied to brain signals and other biophysical signals.
[0181] Device Normalization Process
[0182] In another aspect, the asynchronous contamination removal operation as described herein can be used to normalize cardiac signals acquired from multiple and different acquisition platforms; e.g., prior to subjecting data acquired from such platform for machine-learning-based disease association. The normalization is driven, at least in part, by knowledge of theoretical topological differences and insights from deep learning. The device normalization process can be applied to data acquired from multiple acquisition devices, e.g., that are from two or more different generations to increase similarity (as guided by both machine learning and electrical engineering theory) between the groups of signals that can improve the machine learning training process.
[0183]
[0184] In some embodiments, the first set of cardiac signal data sets (e.g., 1502) is processed with a processor to remove asynchronous noise contamination as described in relation to
[0185] In some embodiments, the second set of cardiac signal data sets (e.g., 1508) is processed with a processor to remove asynchronous noise contamination as described in relation to
[0186] Experimental Results
[0187] A Coronary Artery Disease—Learning Algorithm Development (CADLAD) study was undertaken involving two distinct stages to support development and testing of machine-learned algorithms. In stage 1, paired clinical data were used to guide the design and development of the pre-processing, feature extraction and machine learning steps. That is, the collected clinical study data is split into three cohorts: Training (50%), validation (25%), and verification (25%). Similar to the steps described above for processing signals from a patient for analysis, each signal is first pre-processed, to clean and normalize the data. Following these processes, a set of features are extracted from the signals in which each set of features is paired with a representation of the true condition—for example, the binary classification of the presence or absence of significant CAD. The final output of this stage is a fixed algorithm embodied within a measurement system. In Stage 2 of the CADLAD study, the machine-learned algorithms will be used to provide a determination of significant CAD against a pool of previously untested clinical data. The criteria for disease is established as that defined in the American College of Cardiology (ACC) clinical guidelines, specifically as greater than 70% stenosis by angiography or less than 0.80 fraction-flow by flow wire.
[0188] For part of the study, a first set of cardiac signal data sets associated with an earlier acquisition hardware (e.g., “Gen 1”) (e.g., measurement system 104) is processed to remove the asynchronous noise contamination as described in relation to
[0189] The assessment system 110, in some embodiments, automatically and iteratively explores combinations of features in various functional permutations with the aim of finding those combinations which can successfully match a prediction based on the features. To avoid overfitting of the solutions to the training data, the validation set is used as a comparator. Once candidate predictors have been developed, they are then manually applied to a verification data set to assess the predictor performance against data that has not been used at all to generate the predictor. Provided that the data sets are sufficiently large, the performance of a selected predictor against the verification set will be close to the performance of that predictor against new data.
[0190]
[0191] In some implementations, six simultaneously sampled signals are captured from a resting subject as the raw differential channel signal data set in which the signals embed the inter-lead timing and phase information of the acquired signals, specific to the subject. Geometrical contrast arising from the interference in the phase plane of the depolarization wave with the other orthogonal leads can be used which can facilitate superimposition of phase space information on a three-dimensional representation of the heart. Noiseless subspaces further facilitate the observation of the phase of these waves. That is, the phase of the orthogonal leads carries the information about the structure and generates geometrical contrast in the image. Phase-contrast takes advantage of the fact that different bioelectric structures have different impedances, and so spectral and non-spectral conduction delays and bends the trajectory of phase space orbit through the heart by different amounts. These small changes in trajectory can be normalized and quantified beat to beat and corrected for abnormal or poor lead placement, and the normalized phase space integrals can be mapped to a geometric mesh for visualization.
[0192] In some implementations, the raw differential channel signal data set is normalized, and baseline wander are removed using a modified moving average filter. For example, in some implementations, the baseline wander is extracted from each of the raw differential channel signals using a median filter with an order of 1500 milliseconds, smoothed with a 1-Hz low-pass filter, and subtracted from the signals. The bias is then removed from the resulting signals by subtracting estimations of the signals using maximums of probability densities calculated with a kernel smoothing function. All of the signals, or a portion thereof, may be divided by their respective interquartile ranges to complete the normalization process.
[0193] The method 1600 then includes, in some embodiments, reconstructing (operation 1604) a noiseless model signal by decomposing and selecting sets of candidate basis functions to create a sparse mathematical model. In some implementations, a Modified Matching Pursuit (MMP) algorithm is used to find a noiseless model of the raw differential channel signals. Other sparse approximation algorithms can be used, including, and not limited to, evolvable mathematical models, symbolic regression, orthogonal matching pursuit, LASSO, linear models optimized using cyclical coordinate descent, orthogonal search, fast orthogonal search, and cyclical coordinate descent. In some implementations, the reconstructing operation 504 generates a model as a function with a weighted sum of basis functions in which basis function terms are sequentially appends to an initially empty basis to approximate a target function while reducing the approximation error.
[0194] The method 1600 then includes, in some embodiments, selecting (operation 506) subspace components (e.g., low energy frequency subspace components) from the selected basis functions and coefficients. The low-energy subspace components comprise a model reconstructed by using only the X% low magnitude subset coefficients (frequency content) contributing least to the modelling error. Low-energy subspace components, in some implementations, includes higher order candidate terms that are later selected, in the phase space coordinates, as part of the sparse representation of a signal. That is, the last 5 percent, 10 percent, 15 percent, 20 percent, 25 percent, 30 percent of the candidate terms (as the higher order candidate terms) last selected via the sparse approximation is used. Other percentage values can be used.
[0195] The method 1600 then includes, in some embodiments, reconstructing (operation 1608) a pre-defined set of n.sup.th order fractional-calculus result set (e.g., via a numeric fractional-calculus operation) to generate a three-dimensional point cloud defining, in part, the phase space volumetric object 122. In some implementations, the fractional-calculus operation is based on Grunwald-Letnikov fractional-derivative method. In some implementations, the fractional derivative operation is based on the Lubich's fractional linear multi-step method. In some implementations, the fractional-calculus operation is based on the fractional Adams-Moulton method. In some implementations, the fractional-calculus operation is based on the Riemann-Liouville fractional derivative method. In some implementations, the fractional derivative operation is based on Riesz fractional derivative method. Other methods of performing a fractional calculus may be used.
[0196] The method 1600 then includes, in some implementations, performing (1610) triangulation operation to generate surface features (i.e., face) of the point cloud. In some implementations, Alpha Hull triangulation with a pre-predetermined radius (a) is performed on the reconstructed noiseless model signals. In other implementations, Delaunay triangulation, alpha shapes, ball pivoting, Mesh generation, Convex Hull triangulation, and the like, is used.
[0197] The method 1600 then includes, in some implementations, computing (1612) one or more values for each of the vertices in the point cloud. The vertex values, in some implementations, are scaled over a presentable color range. The vertex values, in some implementations, is a variance between a modeled channel data set (e.g., X-axis data set, Y-axis data set, or Z-axis data set) a base-line raw channel data set (e.g., corresponding X-axis data set, Y-axis data set, or Z-axis data set). In some implementations, the variance is determined by subtracting data points of the base-line raw channel data set with the corresponding data points of the modeled channel data set. The modeled channel data set, in some implementations, is based on a sparse approximation of the base-line raw channel data set to generate a reconstructed noiseless signal of the base-line raw channel data. In some implementations, each face of the phase space volumetric object 122 is assigned a face color value triangularly interpolated among neighboring bounding vertex color values (e.g., 3 bounding vertex colors).
[0198] In some implementations, various views of the phase space volumetric object 122 are captured for presentation as computed phase space tomographic images, e.g., via a web portal, to a physician to assist the physician in the assessment of presence or non-presence of pulmonary arterial hypertension. In some implementations, the phase space volumetric object or the computed phase space tomographic images are assessed by a trained neural network classifier configured to assess for presence or non-presence of pulmonary arterial hypertension. In some implementations, the computed tomographic images are presented (e.g., a set of two-dimensional views) alongside the results of a machine-generated predictions to assist in the physician in making a diagnosis.
[0199] In other implementations, the phase space volumetric object 122 is analyzed in subsequent machine learning operations (e.g., image-based machine learning operations or feature-based machine learning operations) to determine the one or more coronary physiological parameters. In some implementations, the assessment system 110 is configured to determine a volume metric (e.g., alpha hull volume) of the phase space volumetric object 122. In some implementations, the assessment system 110 is configured to determine a number of distinct bodies (e.g., distinct volumes) of the generated phase space volumetric object 122. In some implementations, the assessment system 110 is configured to assess a maximal color variation (e.g., color gradient) of the generated phase space volumetric object 122. In some implementations, all these features are assessed from phase space volumetric object 122 as a mathematical feature.
[0200] In some implementations, the mathematical features of the phase space volumetric object 122 are extracted along with hundreds of other distinct mathematical features that represent specific aspects of the biophysical signals collected. A feature extraction engine of the assessment system 110 may extract each feature as a specific formula/algorithm. In some implementations, when the feature extraction process is applied to an incoming biophysical signal, the output is a matrix of all calculated features which includes a list, for example, of over hundreds of real numbers; one number per feature in which each feature represents one or more aspects of the signal's dynamical, geometrical, fractional calculus, chaotic, and/or topological properties.
[0201] A machine learning algorithm (e.g., meta-genetic algorithm), in some implementations, is used to generate predictors linking aspects of the phase space model (e.g., pool of features) across a population of patients representing both positive (i.e., have disease) and negative (i.e., do not have disease) cases to detect the presence of myocardial tissue associated with pulmonary arterial hypertension. In some implementations, the performances of the candidate predictors are evaluated through a verification process against a previously unseen pool of patients. In some implementations, the machine learning algorithm invokes a meta-genetic algorithm to automatically select a subset of features drawn from a large pool. This feature subset is then used by an Adaptive Boosting (AdaBoost) algorithm to generate predictors to diagnose pulmonary arterial hypertension across a population of patients representing both positive and negative cases. The performances of the candidate predictors are determined through verification against a previously unseen pool of patients. A further description of the AdaBoost algorithm is provided in Freund, Yoav, and Robert E. Schapire, “A decision-theoretic generalization of on-line learning and an application to boosting,” European conference on computational learning theory. Springer, Berlin, Heidelberg (1995), which is incorporated by reference herein in its entirety.
[0202] In some implementations, the assessment system 110 generates one or more images of a representation of the phase space volumetric object 122 in which the vertices, face triangulations, and vertex colors are presented. In some implementations, multiple views of the representation are generated and included in a report. In some implementations, the one or more images are presented as a three-dimensional object that can be rotated, scaled, and/or panned based on user's inputs. Indeed, such presentation can be used to be assessed visually by a skilled operator to determine whether a subject has presence of non-presence of pulmonary arterial hypertension.
[0203] Neural Network Classification
[0204] The three-dimensional phase-space volumetric object or the computed phase-space tomographic images can be directly evaluated by a trained neural network classifier to determine presence or non-presence of pulmonary arterial hypertension. In some implementations, the neural network classifier may be a neural network trained on a set of grayscale tomographic images which are paired with coronary angiography results assessed for presence and non-presence of pulmonary arterial hypertension. In some implementations, a neural network-based nonlinear classifier is used. In some implementations, the neural network-based non-linear classifier is configured to map individual pixels from the generated tomographic images to a binary disease-state prediction (i.e., the condition exists or does not exist) or an estimated physiological characteristic. In some implementations, the neural network's weights, which govern this mapping, is optimized using gradient descent techniques.
[0205] Examples of a disease state prediction can include, but not limited to, presence/non-presence of significant coronary arterial disease, presence/non-presence of pulmonary hypertension, presence/non-presence of pulmonary arterial hypertension, presence/non-presence of pulmonary hypertension due to left heart disease, presence/non-presence of pulmonary hypertension due to lung disease, presence/non-presence of pulmonary hypertension due to chronic blood clots, etc.
[0206] Examples of an estimated physiological characteristic can include, but not limited to, fractional flow reserve, degree of stenosis, degree of ischemia, blood glucose levels, cardiac chamber size and mechanical function, etc.
[0207] Further examples of processing that may be used with the exemplified method and system are described in: U.S. Pat. No. 9,289,150, entitled “Non-invasive Method and System for Characterizing Cardiovascular Systems”; U.S. Pat. No. 9,655,536, entitled “Non-invasive Method and System for Characterizing Cardiovascular Systems”; U.S. Pat. No. 9,968,275, entitled “Non-invasive Method and System for Characterizing Cardiovascular Systems”; U.S. Pat. No. 8,923,958, entitled “System and Method for Evaluating an Electrophysiological Signal”; U.S. Pat. No. 9,408,543, entitled “Non-invasive Method and System for Characterizing Cardiovascular Systems and All-Cause Mortality and Sudden Cardiac Death Risk”; U.S. Pat. No. 9,955,883, entitled “Non-invasive Method and System for Characterizing Cardiovascular Systems and All-Cause Mortality and Sudden Cardiac Death Risk”; U.S. Pat. No. 9,737,229, entitled “Noninvasive Electrocardiographic Method for Estimating Mammalian Cardiac Chamber Size and Mechanical Function”; U.S. Pat. No. 10,039,468, entitled “Noninvasive Electrocardiographic Method for Estimating Mammalian Cardiac Chamber Size and Mechanical Function”; U.S. Pat. No. 9,597,021, entitled “Noninvasive Method for Estimating Glucose, Glycosylated Hemoglobin and Other Blood Constituents”; U.S. Pat. No. 9,968,265, entitled “Method and System for Characterizing Cardiovascular Systems From Single Channel Data”; U.S. Pat. No. 9,910,964, entitled “Methods and Systems Using Mathematical Analysis and Machine Learning to Diagnose Disease”; U.S. Patent Publication No. 2017/0119272, entitled “Method and Apparatus for Wide-Band Phase Gradient Signal Acquisition”; PCT Publication No. WO2017/033164, entitled “Method and Apparatus for Wide-Band Phase Gradient Signal Acquisition”; U.S. Patent Publication No. 2018/0000371, entitled “Non-invasive Method and System for Measuring Myocardial Ischemia, Stenosis Identification, Localization and Fractional Flow Reserve Estimation”; PCT Publication No. WO2017/221221, entitled “Non-invasive Method and System for Measuring Myocardial Ischemia, Stenosis Identification, Localization and Fractional Flow Reserve Estimation”; U.S. Pat. No. 10,292,596, entitled “Method and System for Visualization of Heart Tissue at Risk”; U.S. patent application Ser. No. 16/402,616, entitled “Method and System for Visualization of Heart Tissue at Risk”; U.S. Patent Publication No. 2018/0249960, entitled “Method and System for Wide-band Phase Gradient Signal Acquisition”; U.S. patent application Ser. No. 16/232,801, entitled “Method and System to Assess Disease Using Phase Space Volumetric Objects”; PCT Application No. IB/2018/060708, entitled “Method and System to Assess Disease Using Phase Space Volumetric Objects”; U.S. Patent Publication No. US2019/0117164, entitled “Methods and Systems of De-Noising Magnetic-Field Based Sensor Data of Electrophysiological Signals”; U.S. patent application Ser. No. 16/232,586, entitled “Method and System to Assess Disease Using Phase Space Tomography and Machine Learning”; PCT Application No. PCT/IB2018/060709, entitled “Method and System to Assess Disease Using Phase Space Tomography and Machine Learning”; U.S. patent application Ser. No. __/______, entitled “Method and System to Assess Disease Using Phase Space Tomography and Machine Learning” (having attorney docket no. 10321-034pv1 and claiming priority to U.S. Patent Provisional Application No. 62/784,984); U.S. patent application Ser. No. 16/429,593, entitled “Method and System to Assess Pulmonary Hypertension Using Phase Space Tomography and Machine Learning”; U.S. patent application Ser. No. __/______, entitled “Method and System for Automated Quantification of Signal Quality” (having attorney docket no. 10321-036pv1 and claiming priority to U.S. Patent Provisional Application No. 62/784,962); U.S. patent application Ser. No. __/______, entitled “Method and System to Configure and Use Neural Network To Assess Medical Disease” (having attorney docket no. 10321-037pv1 and claiming priority to U.S. Patent Provisional Application No. 62/784,925); U.S. patent application Ser. No. __/______, entitled “Method and System to Assess Disease Using Phase Space Volumetric Object and Machine Learning” (having attorney docket no. 10321-038pv1 and claiming priority to U.S. Patent Provisional Application No. 62/785,158); U.S. patent application Ser. No. 15/653,433, entitled “Discovering Novel Features to Use in Machine Learning Techniques, such as Machine Learning Techniques for Diagnosing Medical Conditions”; U.S. patent application Ser. No. 15/653,431, entitled “Discovering Genomes to Use in Machine Learning Techniques”; U.S. Provisional Application No. __/______, filed concurrently herewith (having attorney docket no. 10321-041pv1), entitled “Method and System to Assess Disease Using Dynamical Analysis of Cardiac and Photoplethysmographic Signals”, each of which is incorporated by reference herein in its entirety.
[0208] Unless otherwise expressly stated, it is in no way intended that any method set forth herein be construed as requiring that its steps be performed in a specific order. Accordingly, where a method claim does not actually recite an order to be followed by its steps or it is not otherwise specifically stated in the claims or descriptions that the steps are to be limited to a specific order, it is no way intended that an order be inferred, in any respect. This holds for any possible non-express basis for interpretation, including: matters of logic with respect to arrangement of steps or operational flow; plain meaning derived from grammatical organization or punctuation; the number or type of embodiments described in the specification.
[0209] While the methods and systems have been described in connection with certain embodiments and specific examples, it is not intended that the scope be limited to the particular embodiments set forth, as the embodiments herein are intended in all respects to be illustrative rather than restrictive.
[0210] The methods, systems and processes described herein may be used generate stenosis and FFR outputs for use in connection with procedures such as the placement of vascular stents within a vessel such as an artery of a mammalian (e.g., human) subject, and other interventional and surgical system or processes. In one embodiment, the methods, systems and processes described herein can be configured to use the FFR/stenosis outputs to determine and/or modify, intra operation, a number of stents to be placed in a mammalian (e.g., human), including their optimal location of deployment within a given vessel, among others.
[0211] Examples of other biophysical signals that may be analyzed in whole, or in part, using the exemplary methods and systems include, but are not limited to, an electrocardiogram (ECG) data set, an electroencephalogram (EEG) data set, a gamma synchrony signal data set; a respiratory function signal data set; a pulse oximetry signal data set; a perfusion data signal data set; a quasi-periodic biological signal data set; a fetal ECG data set; a blood pressure signal; a cardiac magnetic field data set, and a heart rate signal data set.
[0212] The exemplary analysis can be used to identify various pathologies and condition including, but are not limited to heart disease, cardiac arrhythmia, diabetic autonomic neuropathy, Parkinson's disease, forms of epilepsy, brain injury, altered state of cognition, stability of a heart at different heart rates, effectiveness of medication, ischemic, silent ischemia, atrial fibrillation, ventricular fibrillation, ventricular tachycardia, blood vessel block, pulmonary hypertension, attention deficit disorder, etc.