METHOD OF ANALYSING THE MOTIONAL ACTIVITY OF PARTICLES
20250198902 ยท 2025-06-19
Assignee
Inventors
- Grzegorz JOZWIAK (Muttenz, CH)
- Alexander STURM (Muttenz, CH)
- Eric DELARZE (Muttenz, CH)
- Gino Sebastian CATHOMEN (Muttenz, CH)
- Danuta CICHOCKA (Muttenz, CH)
Cpc classification
International classification
Abstract
A method of analysing a motional activity of particles with a motion detector including the steps of bringing at least one particle into contact with a flexible support, detecting at least one time-dependent signal that is indicative of deflections of the flexible support due to motions of the at least one particle, and evaluating the detected time-dependent signal. The evaluation includes analysing a time-dependency of the signal to derive a plurality of signal parameters from the signal that characterise a variation of the signal as a function of time, and executing a linking algorithm that has, as input variables, an input vector having at least a selection of the signal parameters, and has, as an output variable, at least one activity indicator that is indicative of a motional activity of the particle.
Claims
1. A method of analysing a motional activity of particles with a motion detector, wherein the motion detector comprises: a flexible support being configured to deflect, a detection device for detecting signals being generated upon the deflection of the flexible support, and an evaluation device for evaluating the detected signals, wherein the method comprises the steps of: bringing at least one particle into contact with the flexible support, detecting at least one time-dependent signal that is indicative of deflections of the flexible support due to motions of the at least one particle, using the detection device, and evaluating the detected time-dependent signal with the evaluation device, wherein the evaluation of the detected time-dependent signal comprises: analysing a time-dependency of the signal to derive a plurality of signal parameters from the signal that characterise a variation of the signal as a function of time, and executing a linking algorithm that has, as input variables, an input vector comprising at least a selection of said signal parameters, and has, as an output variable, at least one activity indicator that is indicative of a motional activity of the particle.
2. The method according to claim 1, wherein, in the step of analysing the time-dependency of the signal, a plurality of time intervals are defined, wherein the time-dependency of the signal is analysed within each time interval separately in order to obtain a value for each of a plurality of signal estimators for each time interval, and wherein a time series of each signal estimator is analysed in order to obtain each of the signal parameters.
3. The method according to claim 2, wherein at least one of: i) each signal estimator value is determined by at least one of: fitting a noise model to the signal or by applying at least one statistical algorithm to the signal, or ii) each signal estimator is selected from the group consisting of estimators of moments of probability density functions, geometrical properties of power spectral density functions, percentiles of probability density functions, correlation parameters, partial correlation parameters, parameters of auto-regressive moving average models and parameters of nonlinear auto-regressive moving average models.
4. The method according to claim 2, wherein each signal estimator is a geometrical property of a power spectral density function.
5. The method according to claim 2, Determine, for each time interval, a plurality of periodograms, and For all periodograms, determine at least one quantile for at least one frequency range, whereby the signal estimator is obtained.
6. The method of claim 5, wherein the time series of at least one of the signal estimators is extended so as to form an extended time series by at least one of: Combining at least two signal estimators into at least one of differences or ratios thereof; Transforming a plurality of quantiles into values of an empirical probability density function, and wherein the at least one signal parameter is obtained from the extended time series.
7. The method according to claim 2, wherein at least one signal parameter is derived from at least one of: the time series of at least one of the signal estimators and/or from the extended time series of at least one of the signal estimators by at least one of: Defining a plurality of sub-series for at least one of the time series or for the extended time series and, for each sub-series, determine statistical properties, Fitting one or more parametrized curves comprising one or more parameters to at least one of the time series or to the extended time series, and Defining a plurality of sub-series for at least one of the time series or for the extended time series and, for each sub-series, fit one or more parametrized curves comprising one or more parameters in order to obtain fitted parameters.
8. The method according to claim 1, wherein evaluating the detected time-dependent signal with the evaluation device comprises detecting the time-dependent signal during a time t, and wherein analysing the time-dependency of the signal comprises the steps of: Dividing the signal of time t into time intervals of length T, For each time interval j=1, . . . t/T, performing the steps of: calculating the digital integral of the signal within each time interval; divide the time interval into at least one number of subintervals; for each subinterval, detrend the digital integral of the signal by applying a detrending algorithm to the digital integral of the signal; for the at least one number of subintervals, combine the subintervals into a detrended interval; and determine at least one generalized mean of the detrended signal of the detrended interval for the at least one number of subintervals, whereby at least one signal parameter is obtained.
9. The method according to claim 8, wherein at least one signal parameter is derived from at least one of ratios or differences between at least two signal parameters.
10. The method according to claim 1, further comprising executing a feature selection algorithm to automatically select a subset of signal parameters so as to obtain the input vector being fed to the linking algorithm.
11. The method according to claim 1, wherein a drift cancellation algorithm is applied to the signal prior to the derivation of the signal parameters from the signal.
12. The method according to claim 1, wherein the linking algorithm is a regression algorithm or a classification algorithm.
13. The method according to claim 1, wherein the linking algorithm is a machine learning algorithm.
14. The method according to claim 1, wherein the particle is at least one of a cell such as a prokaryotic cell or an eukaryotic cell, a spore, a virus, a phage, a vesicle, a peptide, a protein, a polysaccharide, a lipid, a glucide, a nucleic acid or a co-polymer thereof, a protein-RNA co-polymer, a RNA-DNA co-polymer, a protein-DNA co-polymers, a RNA/DNA-protein co-polymer, a protein-protein co-polymer, a capsule, an organelle, or a nanodevice.
15. The method according to claim 1, wherein the activity indicator is indicative of at least one of a metabolic activity of the particle, a physiological state of the particle, a respiration level of the particle, an ATP level of the particle, a ratio of NADH/NAD+ of the particle, a membrane potential of the particle, a growth rate of the particle, a kill rate of the particle, an interaction of the particle with other particles, and an interaction of the particle with an environmental factor of the particle or a combination of any of those.
16. The method according to claim 1, wherein the particle is subjected to at least one of: at least one chemical stimulus and/or at least one physical stimulus.
17. The method according to claim 16, wherein the activity indicator is determined at least one of before, during or after the particle is subjected to at least one of the chemical stimulus or the physical stimulus, and wherein the thus determined activity indicators are compared with one another.
18. A method of generating a training data set of a machine learning algorithm comprising: a) Bringing at least one particle into contact with a flexible support, b) Detecting at least one time-dependent signal that is indicative of deflections of the flexible support due to motions of the at least one particle, using a detection device, and c) Evaluating the detected time-dependent signal with an evaluation device, wherein the evaluation of the detected time-dependent signal comprises analysing a time-dependency of the signal in order to derive a plurality of signal parameters from the signal that characterise a variation of the signal as a function of time, d) Performing an independent measurement in order to derive at least one associated activity indicator that is indicative of a motional activity of the particle, e) Saving the plurality of signal parameters and the at least one associated activity indicator in the training data set, and f) Repeating steps a) to e) for a plurality of particles exhibiting different motional activities.
19. A method of training a machine learning algorithm, wherein the machine learning algorithm is trained with a training data set generated by the method of claim 18.
20. The method according to claim 4, wherein the geometrical property is an extreme point of the power spectral density function, an inflection point of the power spectral density function, an area under a curve described by the power spectral density function or at least one of a horizontal or a vertical distance between them.
21. The method according to claim 5, wherein, for all periodograms, at least one percentile is determined for at least one frequency range, whereby the signal estimator is obtained.
22. The method of claim 6, wherein at least one of: at least one distance between the values of the empirical probability density function and values of a theoretical probability density function is determined; or at least one of: at least one geometrical property or various moments of the empirical probability density function is determined.
23. The method according to claim 7, wherein at least one of: for each sub-series, determine furthermore at least one of ratios, differences and ratios of differences of the statistical properties, or for each sub-series, furthermore determine ratios, differences and ratios of differences of the fitted parameters.
24. The method according to claim 10, wherein the feature selection algorithm is configured to select the subset of signal parameters in such a manner that the subset is optimal to a multi-objective criterion.
25. The method according to claim 11, wherein at least one of: the drift cancellation algorithm comprises a curve-fitting procedure that models a drift component of the signal or a high-pass filtering, or the drift cancellation algorithm is a detrending algorithm.
26. The method according to claim 16, wherein at least one of: the particle is a cell and wherein the chemical stimulus is an addition of a drug such as an antibiotic or a compound affecting the metabolism or viability of the cell or a change in an environmental condition such as a culture condition, or the physical stimulus is an application of stress.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0082] Preferred embodiments of the invention are described in the following with reference to the drawings, which are for the purpose of illustrating the present preferred embodiments of the invention and not for the purpose of limiting the same. In the drawings,
[0083]
[0084]
[0085]
[0086]
[0087]
[0088]
[0089]
[0090]
[0091]
[0092]
[0093]
[0094]
[0095]
[0096]
[0097]
[0098]
[0099]
[0100]
[0101]
[0102]
[0103]
[0104]
[0105]
[0106]
[0107]
[0108]
[0109]
[0110]
[0111]
[0112]
[0113]
[0114]
[0115]
[0116]
[0117]
[0118]
[0119]
[0120]
[0121]
[0122]
[0123]
[0124]
[0125]
[0126]
[0127]
[0128]
[0129]
[0130]
DESCRIPTION OF PREFERRED EMBODIMENTS
[0131] Various aspects of the invention shall now be illustrated with reference to the figures.
[0132] As mentioned earlier, the present invention enables an improved analysis of the motional activity of particles with a motion detector 1 comprising a flexible support 2 being in contact with the particles to be analysed and being configured to deflect due to motions of the particles, see
[0133] For illustrative purposes only, preferred steps as well as a preferred ordering of these steps that allow the analysis of the motional activity of the particles according to the method of the invention are summarised hereinbelow and with reference to
[0134] S0: The starting point of the analysis is at least one time-dependent signal being indicative of the deflections of the flexible support of the motion detector. Here, said signal is referred to as raw input signal, RIS.
[0135] In order to cancel any drifts in the RIS, the method preferably subjects the RIS to a drift cancellation.
[0136] S1: From the signal, either from the RIS but preferably after its drift cancellation, the method calculates a time series of signal estimators. The calculation of the time series of signal estimators can be done in the time domain or in the frequency domain.
[0137] If signal estimators in the time-domain are to be calculated, the signal is split into time intervals, wherein the time-dependency of the signal is analysed within each time interval separately in order to obtain signal estimators, and wherein a time series of the signal estimators is analysed in order to obtain the signal parameters. For instance, the signal estimators can be determined by using at least one of the following: [0138] Estimators of moments of probability density functions; [0139] Percentiles of probability density functions; [0140] Correlations, partial correlations, or results of auto-regressive moving average (ARMA) and nonlinear auto-regressive moving average (NARMA) modelling.
[0141] The signal estimators, for instance the values of particular statistical algorithms or noise models such as time-varying noise model parameters, preferably become samples of a new time series. These new time series of signal estimators preferably consist of fewer points, reveal characteristic patterns, and can be used as input for a determination or an extraction of the signal parameters, see further below.
[0142] If signal estimators in the frequency-domain are to be calculated, the signal is preferably split into intervals for which power spectra are estimated. The method of estimation and windowing is preferably optimised for particular applications. After that, various noise models are preferably fitted to a given spectrum and the signal estimators, here the fitting parameters, are preferably used as samples of new time series. Apart from signal estimators being parameters of noise models, also geometrical properties of the estimated spectrum could be measured (e.g. extreme points, inflection points, area under different parts of the curve, horizontal and vertical distances between them).
[0143] The result is a set of time series, each one for a signal estimator being a particular fitting parameter or property. Also in this way, the number of points can be significantly reduced, allowing identification of patterns or shapes of the time series characteristic for a given metabolic activity by the extraction of signal parameters, see further below.
[0144] S2: After the calculation of the time series of signal estimators, the times series of the signal estimators in the time-domain or in the frequency-domain are transformed to vectors of signal parameters. The transformation is preferably done by means of one of the following three methods. [0145] The estimated time series are split into intervals. The statistical properties such as percentiles or moments are calculated for each interval. They form a vector of basic signal parameters. After that, ratios, differences, and ratios of differences of the basic signal parameters can be utilised to calculate vectors of extended signal parameters. In this way, the signal parameters become more robust to environmental conditions. [0146] The estimated time series are fitted using curves of type and parameterization preferably being optimised for the particular application. The type of curves might be exponential, polynomial, trigonometric, and their linear and nonlinear mixtures. [0147] The third method of extraction of signal parameters is a combination of the first two methods. To this end, it is preferred that the fitting is performed not to the whole signal but within intervals only and that a mixing using ratios and differences is performed.
[0148] The result of the extraction of the signal parameters is a large vector of real numbers (usually more than 200). This vector is preferably shortened by selecting a subset of the signal parameters by executing a feature selection algorithm.
[0149] S3: Hence, after the extraction of the signal parameters, a selection of signal parameters is performed by means of a feature selection algorithm. In this way, a robustness and a generalisation level of classification or, in a sense, prediction algorithm are provided. The feature selection algorithm selecting the signal parameters is preferably optimised for a particular application and might be one of the following: [0150] Univariate selection (like F-test, mutual information, chi chi-squared), [0151] Penalty based methods (L1, L2, elastic net), [0152] Forward or backward selection of signal parameters based on properties of a particular model or based on cross-validation or bootstrapping.
[0153] The model Pareto optimality concept can be used to select signal parameters' vectors optimal to a multi-objective criterion in forward or backward selection. The number of signal parameters should be a minimum while accuracy is a maximum. Usually, it is not possible to decrease the number of used signal parameters without decreasing accuracy. From these Pareto optimal algorithms, the final linking algorithm comprising the optimal input vector of signal parameters is therefore preferably selected based on domain-specific criteria with human expert assistance, for instance. This additional criterion could be the performance for a special test dataset or additional requirements defined by an expert.
[0154] S4: The input vector obtained after the selection of the signal parameters is fed to a linking algorithm. The linking algorithm uses the input vector of signal parameters to output the activity indicator of the particles. This activity indicator may be discrete, where the input vector of signal parameters extracted and selected in the previous steps is used as an input for a classification algorithm. That is, the linking algorithm corresponds in this case to a classification algorithm. The classification algorithm can be one of the following: [0155] Logistic regression, [0156] Decision tree, [0157] Random forest, [0158] Gradient boosting trees, [0159] Support vector machines, [0160] Multiperceptron neural networks, [0161] Radial basis functions (RBF) neural networks.
[0162] The type of linking algorithm and values of its hyper-parameters are preferably optimised for particular applications.
[0163] The activity indicator may also be a real number, where the input vector of signal parameters is also used as an input for a regression algorithm. That is, the linking algorithm corresponds in this case to a regression algorithm. The regression algorithm is preferably one of the following: [0164] Linear regression, [0165] Decision tree regression, [0166] Random forest regression, [0167] Gradient boosting trees regression [0168] Kernel regression, [0169] Multiperceptron neural network, [0170] RBF neural network regression.
[0171] The type of linking algorithm and values of its hyper-parameters are preferably optimised for particular applications.
[0172] More practically speaking, the particles being in contact with the flexible support cause random signals buried in a significant noise background with useful information coded in both time and frequency domain. Particles such as organisms evolve in time under different stimuli (like nutrients, antibiotics, chemicals) that affect the properties of the random signals. The signal properties that provide for instance diagnostic information can be extracted using the method according to the invention by observing their changes in time under properly optimised measurement conditions, for instance. Due to the biological variability between tested organisms and between the different conditions measured, the herein proposed method has proven as a powerful tool that allows separating diagnostic information from measurement artefacts and noise background. Relating this example to the analytic outline depicted in
[0173]
[0174]
[0175]
[0176]
[0177] In this experiment, the reference strain ATCC-25922 of the Gram-negative bacterium E. coli was cultured to late logarithmic phase (OD.sub.600=1=10.sup.9 cells/mL) under standard laboratory conditions (i.e., nutrient-rich Miller's Luria Bertani media (LB), aerobic, 37 C.) and harvested using centrifugation (2000 g, 3). The pellet was separated and split into two groups for the subsequent analysis. The first group of bacteria was kept alive at room temperature while the second group was exposed to 60 C. for 20 min to generate non-viable bacteria due to the denaturation of enzymatic complexes and other cellular essential processes. As expected and confirmed by an independent measurement, in this latter group, we were unable to detect metabolic activity using the redox-dependent fluorescent dye resazurin that shifts its emission/excitation spectrum due to respiratory activity, a main indicator for the metabolic state of a cell.
[0178] Prior to the start of the experiment, the microcantilever of the motion detector was functionalized with a linking agent required for attaining a sustainable cell attachment that reliably lasts throughout the nanomotion recording. The attachment was done as described in WO 2021/130339 A1. In fact, in this experimental setup, positively charged Poly-D-Lysine (PDL) was used as it can facilitate the attachment of E. coli cells exhibiting a negatively charged lipopolysaccharide surface. In fact, most cells exhibit a negative net charge on their surface and in cases where this is not the case the functionalizing agent can be adapted, see also WO 2021/130339A1. Here, a solution of 0.1 mg/ml Poly-D-Lysine in ultrapure water was applied on the microcantilever for 20 min, rinsed off again with ultrapure water and dried. Using the motion detector, the deflections of the functionalized bare microcantilever were recorded in concentrated LB and served as the Blank for the subsequent measurements of the two groups of E. coli cells. Both viable and heat-killed groups of E. coli cells were then attached to a microcantilever in parallel recordings. That is, two phases of a signal recording were done: Blank phase, where the deflections of an empty cantilever are measured followed by the Bac phase where the deflections of a cantilever with attached bacteria are measured.
[0179] From the deflection of the microcantilever and after drift cancellation using linear detrending (1.sup.st order polynomial) of 10s long intervals a time series of a signal estimator, in this case the variance (second moment of probability density function), is calculated and plotted for 20 min, first for the bare microcantilever (Blank phase) and second for the microcantilever with attached ATCC-25922 for each group of viable and non-viable cells (Bac phase). That is, in
[0180] From the variance, i.e., the signal estimator, the signal parameter was calculated: the median of the variance over 20 min of recording for the Blank and the Bac phases. In the Blank phase the mean was 1.610.sup.6 V.sup.2/V.sup.2. After attachment of viable cells, in the Bac phase, the variance increased by one order of magnitude to 1.510.sup.5 V.sup.2/V.sup.2, whereas the median of the variance of heat-killed, non-viable bacteria in the Bac phase stayed at the level of the Blank phase (.sub.viable=1.510.sup.5 V.sup.2/V.sup.2 vs .sub.heatkilled=2.510.sup.6 V.sup.2/V.sup.2, p=0.0013, t-test, p of the variance was calculated over 20 min) in concordance with other metabolic activity assessments as, e.g., using the fluorescent dye resazurin, mentioned earlier.
[0181]
[0182] With reference to the further figures, more complex scenarios with less extreme differences between groups (in regard to biological qualities or phenotypes) are analysed by the method according to the invention.
[0183] To this end,
[0184] In fact, for the analysis presented in
[0185] The two patterns recognizable in
[0186] For this reason, signal parameters were calculated and thresholded to determine if a given strain is susceptible or resistantthat is, if the value of a signal parameter is greater than a threshold value, the strain is determined as resistant, and if the value is not greater than the threshold value, the strain is determined as susceptible. To determine whether cells with different reactions to antibiotic stress can be classified, the signal of two additional susceptible clinical isolates RN-26 and RN-49, as well as of two additional resistant clinical isolates B1 and B15 from a Swiss hospital were detected with the motion detector. In total 67 experiments with these strains exposed to ceftriaxone were recorded. Signal parameters were extracted from single signal estimators that gave one of the best separations between the two different groups as for example the Median of the first 30 min of the Drug phase normalised to the Median of the entire Drug phase or the last 30 min of the Drug phase normalised to the Median of the entire Drug phase.
[0187]
[0188] From
[0189] In
[0190] In practical applications however, the extraction and selection of signal parameters need to be optimised carefully. The large number of signal parameters results in so-called algorithm overfitting. In this case, high performance for the dataset used for fitting is not maintained when the algorithm is used in real conditions. For this reason, the cross-validation procedure is applied, where the dataset is split into k subsets. The algorithm is fitted on k1 subsets and validated on the last one not used for fitting. The process is repeated k times every time with the new subset excluded. In this way the estimated mean accuracy reveals the problem of overfitting (an overfitted algorithm has worse accuracy than not overfitted).
[0191] The cross-validation helps to select only those signal parameters that handle useful information. The selection starts from one signal parameter and continuously increases the number of parameters one by one (forward selection) and after that starts to remove signal parameters (backward selection) up to a single one. The cross-validation helps to assess how addition or removal of signal parameters affects the indicator performance measured by specified single or multi-objective criterion. This procedure allows the extraction of many signal parameters from many time series of signal estimators and the selection of only the most important ones.
[0192] In the just presented example, a rather small dataset was used. In the following and with reference to
[0193] In a first step, a drift cancellation of the detected RIS was performed. To this end the detected RIS was split into 10 s intervals. Thereafter, a linear fitting of a polynomial of the 1.sup.st order to the RIS for 10 second intervals was applied to calculate the fitting error being the input for the calculation of the signal estimator, herein below of the variance (estimator of the 2.sup.nd moment of probability density function). In this way the signal estimator over time, i.e. a variance signal s(t) with 0.1 Hz sampling frequency is obtained.
[0194] In the frequency domain, the 85 seconds intervals are used for power spectrum estimation and fitting by mixture of white and second order band noise models with common equation (1),
where f means frequency, A, f.sub.0, and Q are parameters of the band noise, and B is a parameter of the white noise (a constant). This results in additional four signals (A(t), f.sub.0(t), Q(t) and B(t)) with sampling frequency around 0.01 Hz ( 1/85). Together with the variance signal s(t) these (time series of statistical estimators) are the input for the determination of the signal parameters.
[0195] In said determination of the signal parameters, the signal parameters are calculated. The signals s(t), A(t), f.sub.0(t), Q(t), B(t) are divided into intervals of 20 minutes length for the Bac phase and 30 minutes for the Drug phase, see
where m.sub.i, m.sub.j are medians within the i.sup.th and j.sup.th interval as defined in
[0196] For the variance signal in the drug phase, the linear and exponential fitting are also applied, and the fitted parameters are additional four signal parameters (the slope a, the intersection b, the growth/decay rate , and the magnitude c). Equation (3) defines the exact mathematical formulas,
where coefficients of determination of both fits and ratios c/T and b/a are among the set of extracted signal parameters. The illustration of linear and exponential fit is shown in
[0197] As a result, a set of 192 signal parameters has been determined or extracted and has been subjected to signal parameters selection stage, i.e., has been fed to the feature selection algorithm.
[0198] In the table depicted in
Correlation to the Kill Rate of an Organism
[0199] With reference to
[0200] That is, susceptible cells exposed to a toxin or antibiotic exhibit a decrease in viability resulting eventually in death. In the case of bacteria exposed to a bactericidal antibiotic, a kill rate can be measured by determining the change of the number of colony-forming units (CFUs) at different time points. A CFU is thereby defined as a bacterium (=a unit), that through multiple replication cycles forms a visible colony on growth permitting medium. The number of these CFUs is therefore a measure of the concentration of viable bacteria in the bacterial suspension of interest. Multiple samplings over time assess the change in CFU numbers of the surviving bacteria and therefore serve to calculate a kill rate. This method has been a very reliable standard technique for more than a century already applied by for instance Robert Koch.
[0201] However, one of the major drawbacks is its dependence on growth and thus the time a bacterium needs to form a visible colony for analysis. Fast growing bacteria like K. pneumoniae form colonies visible by eye in ten hours, slow growing pathogens like M. tuberculosis however take several weeks. In any case, it is a very reliable but slow method.
[0202] In the present example depicted in
[0203] However, and as follows from
Classification of Metabolic Activity
[0204] With reference to
[0205] In the example depicted in
[0206]
[0207] As mentioned earlier and as illustrated in
[0208] In order to structuralize the description of signal parameters the polish notation together with some formal structure of signal parameters names are used.
[0209] The signal parameters names have the following structure
TABLE-US-00001 TIME_SERIES_NAME.sub.PHASE_NAME.sub.START_TIME- END_TIME.sub.STATISTICS_USED
(e.g f0_Bac__80-90_p50means f0 signal estimator for which 50.sup.th percentile is calculated for bac phase for time interval from 80 minute to 90 minute). Statistics used are percentiles, means, standard deviation (std), and slope.
[0210] Polish notation is a way to code arithmetic expressions without using parentheses e.g.
[0211] /+a ba b is equivalent to (a+b)/(ab). It is used, to automatically code signal parameters being ratios and differences of other signal parameters.
[0212] The feature selection algorithm selected the following noise signal parameters as pareto optimal based on accuracy and number of signal parameters:
TABLE-US-00002 - f0.sub.Bac.sub.80-90_p50 f0.sub.Bac.sub.0-10_p50 - A.sub.Bac.sub.110-120_p50 A.sub.Bac.sub.0-120 / Q.sub.Bac.sub.80-90.sub.p50 f0.sub.Bac.sub.20-30.sub.p50 / f0.sub.Bac_110-120_p50 Q.sub.Bac.sub.70-80_p50 - f0.sub.Bac.sub.0-10_p50 f0.sub.Bac.sub.0-120.sub.p50.
[0213]
[0214] As follows from
[0215] The following embodiments explore three aspects of the time-dependency of the signal being evaluated by the evaluation device: the complex aspects of power spectral densities not covered by theoretical noise models, the related to the particle vibrations scarce events in the signal that might vanish during power spectral density estimation and self-affinity of cell vibrations.
[0216] In particular, a typical spectrum and signal estimators that could be derived from a signal being detected with a motion detector and being analysed with the method according to the invention are presented in
[0223] The normalised power normN.sub.i and normNE.sub.i for every i N.sub.i/(N.sub.1+N.sub.2+N.sub.3+N.sub.4+N.sub.5) and NE.sub.i/(N.sub.1+N.sub.2+N.sub.3+N.sub.4+N.sub.5) These time series of signal estimators can be used to calculate signal parameters and/or their ratios and differences.
[0224] As follows from
[0225] KLPQKullback-Leibler divergence between Chi.sup.2 probability density function and empirical probability density function calculated basing on percentiles,
[0226] KLQPKullback-Leibler divergence between empirical probability density function calculated basing on percentiles and Chi.sup.2 probability density function,
[0227] Kurtosisis kurtosis (a moment) of empirical probability density function,
[0228] JSis Jensen-Shannon distance between Chi.sup.2 probability density function and empirical probability density functions,
[0229] MO(p.sub.60p.sub.40)/(p.sub.90p.sub.10) (ratios and differences)
[0230] EEPDFis mean (a moment) of the empirical probability density function
[0231] They provide every 5 min samples of 112 time series of signal estimators from which signal parameters and their ratios and differences for different percentiles, frequency ranges and time intervals are calculated. They are objects of the feature selection algorithm.
[0232] The current findings about cell vibrations revealed that their power spectral density function has strong 1/f characteristic. This kind of spectrum is often connected with signal self-affinity. Self-affinity can be analysed by detrended fluctuations analysis (DFA) and its generalisation multifractal detrended fluctuations analysis (MF-DFA). In these methods the dependence of the signal statistical properties on the time scale is investigated. In the presented embodiments, the MF-DFA based signal parameters were used. In particular, the following 21 generalized mean exponents/powers are used: 10.0, 8.0, 6.0, 4.0, 2.0, 1.0, 0.8, 0.6, 0.4, 0.2, 0.0, 0.2, 0.4, 0.6, 0.8, 1.0, 2.0, 4.0, 6.0, 8.0, 10.0. Together with 9 numbers of subintervals (4, 8, 16, 32, 64, 128, 256, 512, 1024) results in 189 signal parameters, from which the ratios and or differences are further calculated.
[0233] In
[0234] Analogous to the examples consisting of relatively small datasets depicted in
[0235] In
[0236] In this, True positives are considered correctly classified experiments with susceptible strains, True negatives are correctly classified experiments with resistant strains, False positives are experiments with falsely classified resistant strains, and False negatives refer to experiments with falsely classified susceptible strains.
[0237] Classification model no. 1 (classification model is an instance of a linking algorithm being a classification algorithm) is based on a single SP (F128) with arguably the biggest impact, leading to a classification accuracy of 85.8%. Model no. 2 with two SPs (F128, F61) achieved 89.7% accuracy, with three SPs (F128, F61, F146_F129) 91.4%, and finally, 93.1% accuracy was reached with model no. 4 combining four SPs (F128, F61, F146_F129, F7_F0)summarised in
[0238] The score, i.e. the classification indicator for each of the 233 experiments is benchmarked against the reference MICs that are either considered susceptible (S REFERENCE) or resistant (R REFERENCE) in
[0239] In conclusion, one SP, that can be considered a single aspect of the nanomotion signal, is less suited than 4 SPs for describing the diverse response to the drug found in such a diverse dataset of 83 different strains. The increase of SPs and their combination significantly improves the performance of the classification algorithm.
[0240] In another example, an even bigger and more diverse dataset of 487 samples comprising 160 clinical isolates of two different bacterial species, E. coli and K. pneumoniae, were exposed to one concentration of ceftriaxone (indicated as Conc.,
TABLE-US-00003 / NE0.sub.Drug.sub.90-120_p50 NE0.sub.Drug.sub.0-30.sub.p50 / normNE19.sub.Drug.sub.90-120.sub.p50 normNE19.sub.Drug.sub.60-90.sub.p50 / normNE13.sub.Bac.sub.90-120.sub.p50 normNE13.sub.Bac.sub.90-120.sub.p50 / normNE1.sub.Drug.sub.90-120.sub.p50 normNE1.sub.Drug.sub.0-120.sub.p50
[0241] The linking algorithm being classification algorithm with a high performance of separating experiments with resistant strains from experiments with susceptible strains is based on the combination of the four signal parameters described above resulting in a score, i.e., classification indicator that assumes positive values for experiments with predicted susceptibility (S INVENTION) and negative values with experiments of predicted resistance (R INVENTION). The accuracy reached 91.6%, sensitivity 89.0% and the specificity 94.6% (
[0242] A similar analysis of 210 samples of 127 E. coli strains exposed to 8 g/ml ciprofloxacin (CIP, indicated as Conc. in
TABLE-US-00004 / / / - p90_0-10.sub.Drug.sub.90-120.sub.mean p50_0-10.sub.Drug.sub.90-120.sub.mean - p50_0- 10.sub.Drug.sub.90-120.sub.mean p10_0-10.sub.Drug.sub.90-120.sub.mean / - p90_5000- 6000.sub.Drug.sub.90-120.sub.mean p50_5000-6000.sub.Drug.sub.90-120.sub.mean - p50_5000- 6000.sub.Drug.sub.90-120.sub.mean p10_5000-6000.sub.Drug.sub.90-120.sub.mean / / - p90_0- 10.sub.Drug.sub.0-30.sub.mean p50_0-10.sub.Drug.sub.0-30.sub.mean - p50_0-10.sub.Drug.sub.0- 30.sub.mean p10_0-10.sub.Drug.sub.0-30.sub.mean / - p90_5000-6000.sub.Drug.sub.0-30.sub.mean p50_5000-6000.sub.Drug.sub.0-30.sub.mean - p50_5000-6000.sub.Drug.sub.0-30.sub.mean p10_5000- 6000.sub.Drug.sub.0-30.sub.mean / / p20_200-400.sub.Drug.sub.90-120.sub.mean p20_10-100.sub.Drug.sub.90-120.sub.mean / p20_200- 400.sub.Drug.sub.30-60.sub.mean p20_10-100.sub.Drug.sub.30-60.sub.mean / p30_10-100.sub.Bac.sub.60-90_mean p30_0-10.sub.Bac.sub.60-90_mean F75
[0243] The final linking algorithm being classification algorithm was based on four SPs and from the 210 samples it correctly classified 190. Thus, it achieved an accuracy of 90.5%, a sensitivity of 89.8% and a specificity of 91.2% (
[0244] Yet, in another analysis, 155 samples of 125 diverse E. coli strains from different strain collections were exposed to a third antibiotic cefotaxime (CTX). In the same nanomotion measurement setup as described in 7a and 7b, we used 32 g/ml cefotaxime (indicated as Conc. in
TABLE-US-00005 / / p50_400-1000.sub.Drug.sub.90-120.sub.mean p50_100-200.sub.Drug.sub.90-120.sub.mean / p50_400-1000.sub.Drug.sub.0-30.sub.mean p50_100-200.sub.Drug.sub.0-30.sub.mean / / / - p50_200-400.sub.Drug.sub.30-60.sub.mean p10_200-400.sub.Drug.sub.30-60.sub.mean - p90_200-400.sub.Drug.sub.30-60.sub.mean p50_200-400.sub.Drug.sub.30-60.sub.mean / - p50_100- 200.sub.Drug.sub.30-60.sub.mean p10_100-200.sub.Drug.sub.30-60.sub.mean - p90_100- 200.sub.Drug.sub.30-60.sub.mean p50_100-200.sub.Drug.sub.30-60.sub.mean / / - p50_200- 400.sub.Drug.sub.0-30.sub.mean p10_200-400.sub.Drug.sub.0-30.sub.mean - p90_200-400.sub.Drug_0- 30.sub.mean p50_200-400.sub.Drug.sub.0-30.sub.mean / - p50_100-200.sub.Drug.sub.0-30.sub.mean p10_100-200.sub.Drug.sub.0-30.sub.mean - p90_100-200.sub.Drug.sub.0-30.sub.mean p50_100- 200.sub.Drug.sub.0-30.sub.mean F186_F64 F45_F38 / / p70_1000-2000.sub.Drug.sub.90-120.sub.mean p70_100-200.sub.Drug.sub.90-120.sub.mean / p70_1000-2000.sub.Drug.sub.00-30.sub.mean p70_100-200.sub.Drug.sub.0-30.sub.mean
[0245] The linking algorithm being classification algorithm based on five SPs led to only 11 false classifications, resulting in an accuracy of 92.9%, a sensitivity of 91.7% and a specificity of 94% (
[0246] In summary, for each of the four bacteria-drug combinations in
[0247] The two aforementioned drugs cefotaxime and ciprofloxacin impede different cellular processes of the bacterial cell. Cefotaxime belongs to the family of beta-lactam antibiotics interfering with the cell wall metabolism while ciprofloxacin as a quinolone binds topoisomerases involved in DNA folding, a process that impact replication and effectively all processes in which the DNA is involved. The proper functioning of both drug targets is essential. While for both, their impediment is in the long run detrimental for the cell, both their triggered cellular stress responses differ.
[0248] On a total dataset size of 404 experiments, of which 202 were performed with susceptible E. coli strains and 32 g/ml cefotaxime and an equal number of experiments with susceptible E. coli strains and 8 g/ml ciprofloxacin, the combination of nanomotion recordings and machine learning was applied to develop linking algorithms being classification algorithms to separate the information entailed in the nanomotion response to cefotaxime from ciprofloxacin. All 404 experiments were used simultaneously to develop the classification model. If an experiment was performed with cefotaxime and afterwards correctly predicted as such by the method, it was considered correctly classified. The same accounted for ciprofloxacin. The experimental setup was again identical to the one described for
[0249] Quantile signal parameters are used and linking algorithm being classification algorithm being a support vector machine algorithm with radial basis functions. The feature selection algorithm selected the following pareto optimal signal parameters on the basis of accuracy and number of signal parameters:
TABLE-US-00006 - KLQP_200-400.sub.Drug.sub.90-120.sub.p95 KLQP_200-400.sub.Drug.sub.0-30.sub.p95 / EEPDF_0-10.sub.Drug.sub.090-120.sub.p95 EEPDF_0-10.sub.Drug.sub.0-30.sub.p95 / Kurtosis_0-10.sub.Drug.sub.90-120.sub.p10 JS_0-10.sub.Drug.sub.90-120.sub.p10 - EEPDF.sub.Drug.sub.90-120.sub.mean JS_2000-4000.sub.Drug.sub.0-30.sub.mean MO_0-10.sub.Drug.sub.90-120.sub.p75 MO_100-200.sub.Drug.sub.30-60.sub.std KLQP_0-10.sub.Drug.sub.0-30.sub.std
[0250] In a linking algorithm being classification algorithm the predicted susceptible response to ciprofloxacin was assigned negative score values, while the predicted response to cefotaxime was assigned positive score values (similar to a classification algorithm for R and S phenotypes). Wrongly classified experiments assumed positive values for ciprofloxacin and negative values for cefotaxime, accordingly. On a high-performing classification algorithm based on seven SPs the accuracy for both drugs reached 85.4%. 87.0% of the ciprofloxacin experiments were correctly classified and 83.7% for cefotaxime. The impact of every additional signal parameter is presented in
[0251] Besides chemical stressors, as shown for different drugs (
TABLE-US-00007 / NE17.sub.Bac.sub.0-15.sub.p25 NE12.sub.Bac.sub.0-15.sub.p25 / NE17.sub.Bac.sub.15-30.sub.p10 NE13.sub.Bac.sub.15-30.sub.p10 / NE16.sub.Bac.sub.15-30.sub.p75 NE15.sub.Bac.sub.15-30.sub.p75 - NE19.sub.Bac.sub.15-30.sub.p10 NE19.sub.Bac.sub.0-15.sub.p10 N1.sub.Bac.sub.0-15.sub.std
[0252] Comparing the nanomotions at two different temperatures and thus expectedly a rather global change in the cell's metabolic activity, the classification algorithm's accuracy in separating both conditions ranges from 97% with a single SP up to 99% for five SP. In
Drug Sensitivity Testing on Cancer Cells
[0253]
[0254] The doxorubicin susceptible SW480 was cultured under standard laboratory conditions (i.e., cell culture medium (Dulbecco's Modified Eagle Medium (DMEM) containing 10% heat-inactivated fetal calf serum (FCS), 37 C., 5% CO.sub.2) in cell culture flasks. In preparation of nanomotion experiments, cells were detached, collected in cell culture medium, and washed in DMEM containing 10% FCS. The cell suspension was used for the attachment to the cantilever, and all nanomotion measurements with the device were performed in DMEM containing 10% FCS. The colon cancer cells were attached to the cantilever. Three phases of signal recording were performed: (i) Blank phase, where the deflections of the bare cantilevers are measured, (ii) the Medium phase, where the deflections of the cantilever with attached SW480 were measured followed by (iii) the Drug phase, where the deflections of the cantilever with attached SW480 after exposure to a drug were recorded or a second Medium phase without doxorubicin. The recordings presented in the following were conducted in a motion detector installed in a CO.sub.2 supplied incubator to allow optimal culture conditions for cancer cells.
[0255]
[0256] The signal parameters selected by the feature selection algorithm basing on pareto optimality, accuracy and number of signal parameters are as follows:
TABLE-US-00008 / NE8.sub.Drug.sub.0-30.sub.std NE2.sub.Drug.sub.0-30.sub.std - NE0.sub.Drug.sub.90-120.sub.p75 NE0.sub.Drug.sub.0-30.sub.p75 / NE10.sub.Drug.sub.90-120.sub.slope NE3.sub.Drug.sub.90-120.sub.slope - NE19.sub.Drug.sub.90-120.sub.std NE13.sub.Drug.sub.30-60.sub.std - NE9.sub.Drug.sub.60-90.sub.std NE5.sub.Drug_0-30.sub.std - NE7.sub.Drug.sub.0-30.sub.std NE1.sub.Drug.sub.0-30.sub.std NE4.sub.Drug.sub.0-30.sub.p95
[0257] Indeed, the accuracy of pareto optimal linking algorithms ranges from 83.2% for one SP up to 91.6% for seven SPs. In
Quantification of Metabolic Activity
[0258] Besides qualitative assessments of metabolic states of a cell by using classification models, the impact of interference at a cell's metabolic activity by a drug can be quantified using regression models. We used the clinical E. coli isolate IMHA-2155385, which is susceptible to the drug combination ceftazidime-avibactam. This cephalosporin/beta-lactamase inhibitor combination attacks bacterial cell wall synthesis and simultaneously blocks the beta-lactamase-mediated resistance mechanism. In
TABLE-US-00009 / / - p50_8000-10000.sub.Drug.sub.90-120.sub.mean p10_8000-10000.sub.Drug.sub.90-120.sub.mean - p50_1000-2000.sub.Drug.sub.90-120.sub.mean p10_1000-2000.sub.Drug.sub.90-120.sub.mean / - p50_8000-10000.sub.Drug.sub.0-30.sub.mean p10_8000-10000.sub.Drug.sub.0-30.sub.mean - p50_1000-2000.sub.Drug.sub.0-30.sub.mean p10_1000-2000.sub.Drug.sub.0-30.sub.mean / / p10_6000-7000.sub.Bac.sub.90-120.sub.mean p10_0-10.sub.Bac.sub.90-120_mean / p10_6000- 7000.sub.Bac.sub.60-90_mean p10_0-10.sub.Bac.sub.60-90_mean / / p20_6000-7000.sub.Drug.sub.90-120.sub.mean p20_5000-6000.sub.Drug.sub.90-120.sub.mean / p20_6000-7000.sub.Drug.sub.60-90.sub.mean p20_5000-6000.sub.Drug.sub.60-90.sub.mean / / / - p90_7000-8000.sub.Drug.sub.90-120.sub.mean p10_7000-8000.sub.Drug.sub.90-120.sub.mean - p60_7000-8000.sub.Drug.sub.90-120.sub.mean p40_7000-8000.sub.Drug.sub.90-120.sub.mean / - p90_2000-4000.sub.Drug.sub.90-120.sub.mean p10_2000-4000.sub.Drug.sub.90-120.sub.mean - p60_2000-4000.sub.Drug.sub.90-120.sub.mean p40_2000-4000.sub.Drug.sub.90-120.sub.mean / / - p90_7000-8000.sub.Drug.sub.0-30.sub.mean p10_7000-8000.sub.Drug.sub.0-30.sub.mean - p60_7000- 8000.sub.Drug.sub.0-30.sub.mean p40_7000-8000.sub.Drug.sub.0-30.sub.mean / - p90_2000- 4000.sub.Drug.sub.0-30.sub.mean p10_2000-4000.sub.Drug.sub.0-30.sub.mean - p60_2000- 4000.sub.Drug.sub.0-30.sub.mean p40_2000-4000.sub.Drug.sub.00-30.sub.mean / / p30_6000-7000.sub.Drug.sub.60-90.sub.mean p30_5000-6000.sub.Drug.sub.60-90.sub.mean / p30_6000-7000.sub.Bac.sub.60-90.sub.mean p30_5000-6000.sub.Bac.sub.60-90.sub.mean
[0259] The presented results confirm that the invention allows measuring the increased impact of metabolic activity related to increasing drug concentration.
TABLE-US-00010 LIST OF REFERENCE SIGNS 1 motion detector 4 evaluation device 2 flexible support 5 source of radiation 3 detection device