DEEP NEURAL NETWORK (DNN) ASSISTED SENSOR FOR ENERGY-EFFICIENT ELECTROCARDIOGRAM (ECG) MONITORING
20210174961 · 2021-06-10
Inventors
Cpc classification
G16H50/20
PHYSICS
A61B5/7232
HUMAN NECESSITIES
International classification
G16H50/20
PHYSICS
A61B5/00
HUMAN NECESSITIES
Abstract
The present invention is directed to an energy-efficient method of monitoring a physiological signal while maintaining high accuracy. The method may comprise a Deep Neural Network (DNN) receiving an uncompressed sample of a continuous ECG signal from a sensor. The method may further comprise the DNN determining a first probability that the received sample is abnormal and a second probability that the received sample is normal. Finally, the method may further comprise the DNN determining to transmit the uncompressed sample if a threshold of abnormality is less than or equal to the difference between the first probability and the second probability. In some embodiments, the DNN may be a Convolutional Neural Network (CNN).
Claims
1. An energy-efficient method of monitoring a physiological signal while maintaining high accuracy, the method comprises: a. receiving, by a deep neural network (DNN), an uncompressed sample of a continuous physiological signal from a sensor; b. determining, by the DNN, a first probability that the received sample is abnormal and a second probability that the received sample is normal; and c. determining, by the DNN, to transmit the uncompressed sample if a threshold of abnormality is less than or equal to the difference between the first probability and the second probability.
2. The method of claim 1, wherein the continuous physiological signal comprises an ECG signal.
3. The method of claim 1, wherein the DNN is a convolutional neural network (CNN).
4. The method of claim 3, wherein the CNN has been trained with a dataset comprising at least 205 records.
5. The method of claim 4, wherein the CNN has 91,920 parameters and 1% or less of samples determined to be normal are abnormal.
6. The method of claim 5, wherein the CNN has been trained with a dataset comprising records of a plurality of heart cycles.
7. An energy-efficient system of monitoring an ECG sensor while maintaining high accuracy, wherein the system comprises: a. an ECG sensor; b. a transmitter; c. a processor; d. a RAM; and e. a memory, wherein the memory comprises instructions for: i. receiving, by a DNN, an uncompressed sample of a continuous ECG signal from a sensor, ii. determining, by the DNN, a first probability that the received sample is abnormal and a second probability that the received sample is normal, and iii. determining, by the DNN, to transmit the uncompressed sample if a threshold of abnormality is less than or equal to the difference between the first probability and the second probability.
8. The system of claim 7, wherein the continuous physiological signal comprises an ECG signal.
9. The system of claim 7, wherein the DNN is a CNN.
10. The system of claim 9, wherein the CNN has been trained with a dataset comprising at least 205 records.
11. The system of claim 10, wherein the CNN has 91,920 parameters and 1% or less of samples determined to be normal are abnormal.
12. The system of claim 11, wherein the CNN has been trained with a dataset comprising records of a plurality of heart cycles.
13. A non-transitory computer-readable storage medium having computer-executable instructions recorded thereon for an energy-efficient DNN-driven ECG monitor that, when executed by at least one processing circuit, perform a computer process, wherein the computer process comprises: a. receiving, by the DNN, an uncompressed sample of a continuous ECG signal from a sensor; b. determining, by the DNN, a first probability that the received sample is abnormal and a second probability that the received sample is normal; and c. determining, by the DNN, to transmit the uncompressed sample if a threshold of abnormality is less than or equal to the difference between the first probability and the second probability.
14. The non-transitory computer-readable storage medium of claim 13, wherein the continuous physiological signal comprises an ECG signal.
15. The non-transitory computer-readable storage medium of claim 13, wherein the DNN is a CNN.
16. The non-transitory computer-readable storage medium of claim 15, wherein the CNN has been trained with a dataset comprising at least 205 records.
17. The non-transitory computer-readable storage medium of claim 16, wherein the CNN has 91,920 parameters and 1% or less of samples determined to be normal are abnormal.
18. The non-transitory computer-readable storage medium of claim 17, wherein a duration of the sample comprises a plurality of heart cycles.
Description
BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWING(S)
[0008] This patent application contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the office upon request and payment of the necessary fee.
[0009] The features and advantages of the present invention will become apparent from a consideration of the following detailed description presented in connection with the accompanying drawings in which:
[0010]
[0011]
[0012]
[0013]
[0014]
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
DETAILED DESCRIPTION OF THE INVENTION
[0026] Following is a list of elements corresponding to a particular element referred to herein: A sensor may acquire an ECG signal and wirelessly transmits it to a server. Additionally, it is assumed that only the abnormal rhythms need to be transmitted for further interpretation by either an expert or a machine learning model. Intuitively, transporting the whole signal to a server does not require any computation at the sensor, but imposes a high traffic load to the wireless channel, whose capacity is often limited. Compression can reduce traffic. However, compression in a narrow sense assumes lossless recovery of all the rhythms whose completeness may not be necessary in the application. Arguably, adopting lossy compression can aggressively reduce channel usage while taking a toll on the overall quality of the recovered signal and the final classification accuracy on the server end.
[0027] The technique proposed whose schematic is represented in
[0028] Formal definitions of the signals and performance metrics mentioned earlier are provided below. Signal label pairs consisting of a digitized ECG signal and samplewise anomaly labels annotated by ECG specialists are considered:
x(t)=[x.sub.1,x.sub.2,x.sub.3,x.sub.4,x.sub.5, . . . ,x.sub.n] (1)
L(t)=[(1,0),(0,1),(1,0),(0,1),(0,1), . . . ,(1,0)] (2)
where brackets denote a time series of discrete samples. Labels in the following discussion are expected to take a 2-tuple format where abnormality and normality are indicated by the first and the second element, respectively. For abnormal samples, say x.sub.1 in Eq. 3, the label is (1,0). On the other hand, (0,1) are tagged on normal samples such as x.sub.3. Note that not all samples are annotated. Therefore, some samples may have (0,0) meaning they are neither tagged positive nor negative. Acquisition is followed by decision logic, denoted by {⋅} which selects positive samples from x(t) and output transmitting time series x.sub.τ(t). For example, if samples x.sub.2, x.sub.4, and x.sub.5 are predicted to be associated with normal rhythms,
{⋅} replaces them with a special symbol ϕ to command wireless transmitter not to transmit those samples. The following equation shows the corresponding output of decision logic if the realization of x(t) in Equation (1) is the input:
x.sub.τ(t)={x(t)}=[x.sub.1,ϕ,x.sub.3,ϕ,ϕ, . . . ,x.sub.n], (3)
Intuitively, the bandwidth utilization b.sub.tx of an ECG frame is defined as a ratio of valid samples between raw signal x.sub.τ(t) and transmitted signal x(t)
To evaluate the loss of sensitivity caused by dropped positive samples, the ground truth L(t) is used to define the mistransmission rate ϵ.sub.mt as the ratio of positive samples between x.sub.τ(t) and in x(t), that is,
[0029] To justify the new system architecture, the following question arises: Will the energy savings in RF transmission be greater than the cost of running an extra CNN?
[0030] To address this question, this section proposes an analytical model for energy consumption that takes into consideration both RF transmission and CNN inference[11].
TABLE-US-00001 TABLE I NORMALIZED ENERGY COSTS FOR CRITICAL OPERATIONS IN UNIFIED ENERGY MODEL (ASSUME a AND b ARE 8-BIT NUMBERS) Operation Symbol Norm. Energy Energy DRAM R/W E.sub.d 4000× 320 Global Buffer R/W E.sub.g 15× 1.2
Inter
PE Comm. E
5× 400
Register File R/W E
2.5× 200
a
b E
.sup. 10.sup.−2× 0.8
a × b E.sub.m 1× 80
RF Transmission E
10.sup.6× 80
indicates data missing or illegible when filed
[0031] If the RF transmitter energy model described in [12] is considered, transmitter energy consumption E.sub.tx can be broken down into two components: transmitted energy E.sub.t, and dissipated energy E.sub.h, i.e. waste heat, as in
E.sub.tx=E.sub.t+E.sub.h=(P.sub.t+P.sub.h)T.sub.tx=P.sub.txT.sub.tx (6)
where T.sub.tx is the accumulated time for active signaling, i.e. x.sub.τ(t)≠ϕ. Since a body area network (BAN) application is targeted, the transmitter-receiver distance is set to d=10 m. For simplicity, 8-DPSK is selected, which transports 3 bits per symbol, as the modulation scheme. Transmission energy cost is approximately 10 μJ/bit, in the case of the same set of system design parameters as in [12].
[0032] In order to estimate the energy consumption per inference, an energy model analogous to proposed by Chen et. al. in [13] is used. The energy costs associated with each operation are listed in Table I where 8-bit fixed-point numbers are assumed in each operation. Then, the average energy ε for the proposed system to process an ECG sample can, then, be computed as
where ⋅ denotes vector inner product, I.sub.f is the length of filter; {right arrow over (l)}.sub.l denotes an all one vector of length l; {right arrow over (n)}.sub.c is the number of channels in the CNN. Data and weight reuses are modeled by γ factors: the first character in subscript denotes data (d) and weight (w), while the second subscript denotes a particular memory hierarchy. Specifically γ.sub.dd, γ.sub.dg, γ.sub.dp, γ.sub.dr and corresponds to data reuse at DRAM (d), global buffer (g), array (p), and register files (r), respectively. Note Eq. 7 represents the energy cost to generate a single output.
[0033] The relative reduction in energy over a naive system is
a metric that will be used in Section V to assess the savings in energy consumption through the proposed approach.
[0034] In the following, the components of the system are described in detail.
[0035] Intuitively, a neural network as complex as that used at the remote device to perform final classification would enable a “perfect” selection of anomalous signal sections. In fact, as the two classifiers have the same output, only positive cycles would be selected for transmission, thus minimizing channel usage without any accuracy degradation. Clearly, such a complex neural network cannot be executed within wearable sensors. Therefore, a range of parameters corresponding to different levels of complexity are explored—and thus energy consumption and selection accuracy—for DNNs deployed on wearable sensors.
[0036] A total of 26 variants of customized sequence were trained to sequence CNN (S2SCNN) models to translate single-lead ECG signals into sample-wise annotation of arrhythmia.
[0037] All CNNs are implemented in Keras with Tensorflow backend. Training is performed de novo with random initialization of weights. The Adam optimizer is used with gradient clipping enabled and a mini-batch size of 128. The initial learning rate is set to 1×10-3 and reduced by tenfold when validation loss levels off. Convergence can be reached within 20 training epochs in half an hour on the nVidia 2080Ti GPU.
[0038] Signal compression is realized by the decision logic, which takes the raw signal x(t) and the outputs of the neural network to determine whether sections of the signal are transmitted or not. The logic will selectively transmit samples if the output of the neural network exceeds threshold T. The transfer function of the decision logic can be formulated by
[0039] Threshold τ is a system design parameter which allows adjustment on sensitivity. Lower τ is more conservative in generating negative predictions. Also, reducing τ implies more data will be transmitted.
TABLE-US-00002 TABLE II Performance Summary of CNN Variants. Parameters #CONV Channels {right arrow over (n)}.sub.C AUC TPR 1460160 4 [106, 106, 106, 2] 0.862 0.683 1224441 4 [97, 97, 97, 2] 0.863 0.799 1032321 4 [89, 89, 89, 2] 0.868 0.729 856585 4 [81, 81, 81, 2] 0.867 0.782 800010 5 [64, 64, 64, 64, 2] 0.849 0.737 697233 4 [73, 73, 73, 2] 0.864 0.759 537546 4 [64, 64, 64, 2] 0.870 0.742 413010 4 [56, 56, 56, 2] 0.880 0.759 343495 4 [51, 51, 51, 2] 0.863 0.762 330360 4 [50, 50, 50, 2] 0.870 0.779 317481 4 [49, 49, 49, 2] 0.860 0.777 304858 4 [48, 48, 48, 2] 0.867 0.813 280380 4 [46, 46, 46, 2] 0.845 0.575 268525 4 [45, 45, 45, 2] 0.869 0.676 256926 4 [44, 44, 44, 2] 0.864 0.753 245583 4 [43, 43, 43, 2] 0.855 0.778 234496 4 [42, 42, 42, 2] 0.844 0.759 213090 4 [40, 40, 40, 2] 0.882 0.795 137706 4 [32, 32, 32, 2] 0.874 0.724 91920 4 [26, 26, 26, 2] 0.864 0.769 72010 3 [32, 32, 2] 0.865 0.836 60805 4 [21, 21, 21, 2] 0.863 0.731 48526 3 [26, 26, 2] 0.877 0.716 36090 4 [16, 16, 16, 2] 0.870 0.724 32476 3 [21, 21, 2] 0.842 0.819 19626 3 [16, 16, 2] 0.858 0.719 AUC: Area under ROC curve; TPR: true positive rate when τ = 0
TABLE-US-00003 TABLE III RELATIVE ENERGY SAVINGS ON DIFFERENT RECORDS IF 1% IS GUARANTEED Record Δε (%) τ b
(%) 105 33.91 −0.14 59.71 1121a 18.12 0.03 75.50 210 13.47 −0.44 80.15 114 8.48 −0.56 85.15 ECG-P18-02 −3.65 −0.72 97.28 107 −4.75 −0.14 98.37 124 −5.65 −0.81 99.28 Mean 8.56 −0.40 85.06 Max 33.91 −0.03 99.28 Min −5.65 −0.81 59.71
indicates data missing or illegible when filed
[0040] To evaluate the proposed system, energy efficiency in recording arrhythmia which is evident from uneven heartbeats in the is maximized ECG. In some embodiments, the CNN may be trained with a dataset comprising records of a plurality of heart cycles. Multiple ECG datasets were obtained from PhysioNet and Telemetric and Holter ECG warehouse (THEW)[15]. Five specific datasets are combined to obtain a 205-record dataset for CNN training and system evaluation. Each of the records has associated labeling information that is compliant to ANSI/AAMI EC57 standard (AAMI). The dataset is then split into training, development, and test subsets. Each of them has 184, 10, and 11 records, respectively. Records from the same subject do not overlap or duplicate across different datasets. According to Luz et. al. [16], (modified) lead II (II/MLII) is selected to be the most informative feature for arrhythmia detection.
[0041] Abnormal rhythms are derived from disjunction of S, V, F, and Q categories; Normal rhythms are marked by label N. Indexes of both types of heartbeats are converted into two impulse trains δ.sub.a(t) and δ.sub.n(t), respectively. To account for average PR interval and QRS complex duration, the intervals advancing and lagging R-peak 0.2 seconds and 0.1 seconds are both assumed to share the same label assigned to corresponding R-peak.
[0042] Three system parameters, namely 1) topology of CNN p, 2) computation resource parameters γ, and 3) communication system parameters t, determine the overall system performance which is defined by application performance ϵ.sub.mt and energy cost ε. Here, 2) and 3) are assumed to be fixed and orthogonal to the CNN parameters. On dataset , the problem of finding the optimal system parameters can be formulated as
where r.sub.p[CNN()] and σ[CNN(
)] denote the rate of positive calls of CNN and the total amount of computations, respectively. Grid search on parameter space
was used to determine the best configuration of p which is derived numerically in the next section.
[0043] Table III summarizes the energy savings as computed in Eq. 8. Only the best performing CNN (indicated by a * in
[0044] Contour plots of the mistransmission rate are shown in
[0045] As summarized in
[0046] A new technique that leverages a lightweight DNN is proposed to minimize energy consumption on wearable ECG sensors by selectively transmitting only the informative parts of an acquired ECG signal. A unified energy model for DNN-assisted embedded systems is also formulated. The extensive experiments on real-world datasets demonstrate that the proposed methodology locates the optimal CNN hyperparameters that minimizes the overall energy cost, while maintaining classification performance above a predefined threshold, making this a viable approach for deploying DNNs on resource-constrained sensors.
[0047] The following description sets forth numerous specific details (e.g., specific configurations, parameters, examples, etc.) of the disclosed embodiments, examples of which are illustrated in the accompanying drawings. It should be recognized, however, that such description is not intended as a limitation on the scope of the disclosed embodiments, but is intended to elaborate upon the description of these embodiments. It will be evident to a person of ordinary skill in the art that the present invention can be practiced without every specific detail described infra. Moreover, well-known methods, procedures, components, and circuits have not been described in detail so as not to unnecessarily obscure aspects of the embodiments of the present invention.
[0048] It is fully contemplated that the features, components, and/or steps described with respect to one embodiment may be combined with the features, components, and/or steps described with respect to other embodiments of the present disclosure. To avoid needless descriptive repetition, one or more components or actions described in accordance with one exemplary embodiment can be used or omitted as applicable from other embodiments. For the sake of brevity, the numerous iterations of these combinations will not be described separately. The same reference numbers may be used to refer to the same or similar elements in different drawings. Alternately, different reference numbers may be used to refer to the same or similar elements in the drawings of different embodiments. Any distinction of an element's reference number in one embodiment from another is not limiting in any way, does not suggest that elements of one embodiment could not be combined with or substituted for elements in another embodiment, and (most importantly) is specifically intended only to facilitate the matching of elements in the disclosure to their corresponding claim recitations.
[0049] Embodiments within the scope of the present invention may comprise non-transitory computer-readable media for storing computer-executable instructions. Instructions that cause at least one processing circuit to perform one or more operations are “computer-executable.” The term “non-transitory” is used herein to distinguish two distinctly different kinds of computer-readable media: physical storage media that stores computer-executable instructions and transmission media that carries computer-executable instructions. Physical storage media includes RAM and other volatile types of memory; ROM, EEPROM and other non-volatile types of memory; CD-ROM, CD-RW, DVD-ROM, DVD-RW, and other optical disk storage; magnetic disk storage or other magnetic storage devices; and any other tangible medium that can store computer-executable instructions that can be accessed and processed by at least one processing circuit.
[0050] Transmission media can include signals carrying computer-executable instructions over a network to be received by a general-purpose or special-purpose computer. Embodiments of the present invention expressly (by exemplary recitation such as “non-transitory”) exclude signals carrying computer-executable instructions. However, it should be understood that once a signal carrying computer-executable instructions is received by a computer, the type of computer-readable storage media transforms automatically from transmission media to physical storage media. This transformation may even occur early on in intermediate memory such as (by way of example and not limitation) a buffer in the RAM of a network interface card, regardless of whether the buffer's content is later transferred to less volatile RAM in the computer. Thus, devices that merely repeat a signal are contemplated by the embodiments of the present invention, even though the media that carry the signal between such devices and the signal itself are expressly not included within the claim scope. Thus, it should be understood that “non-transitory computer-readable storage media” is used herein instead of simply “physical storage media” or “physical computer-readable storage media” in order to underscore that even transmission media necessarily involves eventual transformation into physical storage media and to therefore capture all embodiments where the computer-readable instructions are stored in physical storage media—even if only temporarily before transforming back into transmission media.
[0051] Where two or more elements are said to be “coupled,” the meaning shall include (in addition to configurations where the elements directly operate with each other because they are joined) configurations where the elements indirectly operate with each other (e.g., through one or more intermediate elements) so long as there is a link.
[0052] In some embodiments, a wearable echocardiographic (ECG) device may comprise an ECG sensor, a transmitter, a processor for executing instructions, RAM, and hard disk memory upon which are stored computer-executable instructions for a convoluted neural network (CNN). In some embodiments, when executed by the processor, the instructions for the CNN may cause the processor to perform operations. The operations may comprise receiving an uncompressed sample of a continuous ECG signal from a sensor. In some embodiments, the CNN may assess the probability that the received sample is abnormal as well as assess the probability that the received sample is normal. In some embodiments, the CNN may determine whether to transmit the uncompressed sample based on whether a threshold of abnormality is less than or equal to the difference between the first probability and the second probability.
[0053] In some embodiments, the CNN may be trained with a dataset comprising at least 205 records of heart cycles. In some embodiments, the CNN may have 91,920 parameters. In some embodiments, 1% or less of the samples determined by the CNN to be normal may actually be abnormal. In some embodiments, the sample may encompass a plurality of heart cycles.
[0054] Although there has been shown and described the preferred embodiment of the present invention, it will be readily apparent to those skilled in the art that modifications may be made thereto which do not exceed the scope of the appended claims. Therefore, the scope of the invention is only to be limited by the following claims. In some embodiments, the figures presented in this patent application are drawn to scale, including the angles, ratios of dimensions, etc. In some embodiments, the figures are representative only and the claims are not limited by the dimensions of the figures. In some embodiments, descriptions of the inventions described herein using the phrase “comprising” includes embodiments that could be described as “consisting essentially of” or “consisting of”, and as such the written description requirement for claiming one or more embodiments of the present invention using the phrase “consisting essentially of” or “consisting of” is met.
[0055] The reference numbers recited in the below claims are solely for ease of examination of this patent application, and are exemplary, and are not intended in any way to limit the scope of the claims to the particular features having the corresponding reference numbers in the drawings.