BREATHING ADAPTATION SYSTEM AND METHOD FOR INFLUENCING A BREATHING PARAMETER
20210386319 · 2021-12-16
Inventors
- MARA HOUBRAKEN (EINDHOVEN, NL)
- Raymond Van Ee (Geldrop, NL)
- CLIFF JOHANNES ROBERT HUBERTINA LASCHET (GULPEN, NL)
- IRIS TIMMERS (BEST, NL)
- GIUSEPPE COPPOLA (EINDHOVEN, NL)
- BAS ARNOLD JAN BERGEVOET (EINDHOVEN, NL)
Cpc classification
A61B5/0004
HUMAN NECESSITIES
A61B5/055
HUMAN NECESSITIES
G16H40/20
PHYSICS
A61B5/4836
HUMAN NECESSITIES
A61B5/7264
HUMAN NECESSITIES
A61B5/0816
HUMAN NECESSITIES
International classification
A61B5/08
HUMAN NECESSITIES
A61B5/00
HUMAN NECESSITIES
A61B5/055
HUMAN NECESSITIES
G16H40/20
PHYSICS
Abstract
It is an object of the invention to improve a user's breathing for a specific application (e.g. diagnostic imaging, falling asleep improvement (shortening sleep onset), delivery, tinnitus control therapy, CORD, blood pressure control). This object is achieved by a breathing adaptation system configured for influencing a breathing parameter of a user's breathing pattern in order to meet a goal of decreasing or increasing the influenced breathing parameter to a certain extent in a user specific manner. The breathing adaptation system comprises a sensor, configured for monitoring a current value of the influenced breathing parameter of the user. The breathing adaptation system further comprises a feedback unit configured for providing feedback to the user about a preferred value of the influenced breathing parameter, wherein the preferred value is different from the current value of the influenced breathing parameter (310). The breathing adaptation system further comprises a control unit, comprising program code means for determining the preferred value of the influenced breathing parameter in a user specific manner by means of an intelligent agent, wherein the control unit is implemented such that the intelligent agent is rewarded (330) by means of a reward function after performing the determination of the preferred value (320) of the influenced breathing parameter, wherein the reward function is such that it balances the reward for obtaining the goal and the ability of the user to breathe according to a breathing pattern having the preferred value for the influenced breathing parameter.
Claims
1. A breathing adaptation system configured for influencing a breathing parameter of a user's breathing pattern based on provision of feedback to the user, in order to meet a goal comprising decreasing or increasing the influenced breathing parameter, wherein the breathing adaptation system comprising: a sensor configured to monitor a current value of the breathing parameter of the user; a feedback unit configured to provide sensory feedback to the user about a determined preferred value of the breathing parameter for achieving the goal, wherein the preferred value is different from the current value of the breathing parameter and; a control unit configured to determine the preferred value of the influenced breathing parameter by an artificial intelligence intelligent agent, and the control unit further configured for implementing a reward function configured to reward the intelligent agent after performing the determination of the preferred value of the breathing parameter, the rewarding being based on the monitored values of the breathing parameter, and wherein the reward is higher in response to the breathing parameter moving closer to the goal and lower in response to the parameter moving further from the goal, wherein the reward function is configured such that it balances reward for obtaining the goal and a detected ability of the user to breathe according to a breathing pattern having the preferred value for the breathing parameter.
2. A breathing adaptation system according to claim 1, configured for providing the feedback in the form of a multi-sensory feedback, wherein the frequencies of different sensory feedbacks are equal.
3. A breathing adaptation system according to claim 1, wherein the feedback unit is configured for providing a current value of the influenced breathing parameter to the user for a period of time before starting to provide the preferred value of the influenced breathing parameter.
4. A breathing adaptation system according to claim 1, wherein the feedback unit is configured for providing a current or preferred value of the influenced breathing parameter to the user by providing a breathing pattern, which is based on the current or preferred value of the influenced breathing parameter.
5. A breathing adaptation system according to claim 1, configured for repeating the determination of the preferred value of the influenced breathing parameter and providing this to the user until a predefined criterion is met.
6. A breathing adaptation system according to claim 1, wherein the feedback signal provided has a frequency below 1 Hz.
7. A breathing adaptation system according to claim 1, wherein the influenced breathing parameter is representative of breathing amplitude, breathing frequency and/or repeatability of breathing amplitude and/or breathing frequency, and/or breathe hold duration.
8. A breathing adaptation system according to claim 1, configured to be used for the purpose of improving diagnostic imaging or treatment delivery.
9. A breathing adaptation system according to claim 1, further comprising a diagnostic imaging system and/or a treatment delivery system.
10. A breathing adaptation system according to claim 9, wherein the breathing adaptation system comprises a diagnostic imaging system and the diagnostic imaging system is an MRI system.
11. A breathing adaptation system according to claim 5, a communication module configured to communicate with a remote computer of a diagnostic imaging or treatment center and configured in use for communicating a final preferred value of the influenced breathing parameter with the diagnostic imaging or treatment center.
12. A breathing adaptation system according to claim 10, further comprising a patient scheduling module configured for scheduling a timeslot for diagnostic imaging, wherein the length of the scheduled timeslot is dependent on the final preferred value of the influenced breathing parameter.
13. A breathing adaptation system according to claim 10, further comprising an image protocol optimizer, wherein the image protocol optimizer is configured for optimizing one imaging parameter of a diagnostic imaging system at least partly based on the final preferred value of the influenced breathing parameter, and configured to provide the optimized parameter as an input to a diagnostic imaging system.
14. A method for influencing a breathing parameter of a user's breathing pattern in order to meet a goal comprising decreasing or increasing the breathing parameter, wherein the method comprises the steps of monitoring a current value of the breathing parameter of the user and; providing sensory feedback to the user about a determined preferred value of the breathing parameter for achieving the goal, wherein the preferred value is different from the current value of the breathing parameter and; determining the preferred value of the breathing parameter by an artificial intelligence intelligent agent, wherein the intelligent agent is rewarded by a reward function after performing the determination of the preferred value of the breathing parameter, the rewarding based on the monitored values of the breathing parameter and wherein the reward is higher in response to the breathing parameter moving closer to the goal and lower in response to the parameter moving further from the goal, and wherein the reward function is such that it balances reward for obtaining the goal and a detected ability of the user to breathe according to a breathing pattern having the preferred value for the influenced breathing parameter.
15. A method according to claim 14, wherein the method is used as part of a diagnostic imaging or treatment delivery procedure.
16. A method according to claim 15, further comprising sharing a final preferred value of the preferred influenced breathing parameter with a diagnostic imaging or treatment center.
17. A method according to claim 14, further comprising scheduling a timeslot for diagnostic imaging, wherein the length of the scheduled timeslot is dependent on a final preferred value of the influenced breathing parameter.
18. A method according to claim 14, further comprising optimizing at least one imaging parameter at least partly based on a final preferred value of the breathing parameter.
19. A computer program product comprising program code configured to perform the method according to claim 14.
Description
BRIEF DESCRIPTION OF THE FIGURES
[0044]
[0045]
[0046]
[0047]
[0048]
DETAILED DESCRIPTION OF THE INVENTION
[0049] Reinforcement learning is a field in machine learning that is inspired by behaviorist psychology, and is concerned with how an intelligent agent can take actions in an environment so as to maximize its cumulative reward.
[0050]
[0051] To reach an optimal policy, the intelligent agent will often first go through a stage called exploration, where it will explore different actions regardless of the estimate of the value for each action. However, once this estimate has reached an acceptable accuracy, the agent should switch to a stage called exploitation. In this stage, it will only select the action with the highest value. In reality, there is no such clear boundary between the exploration and exploitation stages. It is more likely that the agent will start with a fully explorative policy and slowly move towards a more exploitative policy. However, preferably the agent should always keep some explorative actions in its policy.
[0052] Usually, the agent initiates the estimates of the values randomly. In order to improve on that, domain knowledge about the environment can be added to the reasoning to form an initial policy. This ensures that the agent will take less time to converge to an optimal policy.
[0053] There are several reasoning algorithms available in the prior art among one is Q-learning, which utilizes the Bellman equation to estimate the value of executing a certain action in a certain state (a state-action pair). This algorithm assigns a Q-value to each state-action pair by taking the reward obtained in the state into account, plus the expected Q-value of the best action in the next state, multiplied by a decay factor. This decay factor is a numeric value between 0 and 1. The expected Q-value of the next state is multiplied by the decay factor since the agent has to take one more action at a certain cost to reach this Q-value.
[0054] The best action in a certain state is computed by calculating the Q-values for all the possible actions, and choosing the action with the highest Q-value. The optimal policy corresponds to a policy where the expected value of the total reward return over all successive steps, starting from the current state, is the maximum achievable.
[0055] An alternative method to Q-learning is to search directly in policy space, instead of estimating a value for each possible action. Policy gradient is such a known method, based on optimizing parametrized policies by gradient descent.
[0056] In order to estimate an approximately correct value of a certain state, approximators can be used. These could for example be decision trees or random forest, both are known in the art.
[0057] When using an intelligent agent for influencing breathing parameters it is preferable to smooth the data that is sensed by the sensor in such a way that relevant patterns are captured while white noise is filtered out. Smoothing methods are known in the art among which are rectangular and triangular smoothing. It is further preferable to normalize the data sensed by the sensor.
[0058] Preferably, the data sensed by the sensor is modeled in order to extract the important breathing parameters. However, depending on the type of sensor and the goal, the one or more breathing parameters may also be measured directly. The breathing pattern could for example be modeled by sine or cosine function having parameters describing the amplitude, period and/or phase shift of the breathing pattern sensed by the sensor.
[0059] The current state of the breathing parameter(s) may be monitored by regular intervals. These intervals should not be too short, such that the user or patient has sufficient time to act upon the determined preferred value of the influenced breathing parameter. However, preferably this interval is neither too long, because this would make the method take too long.
[0060] The actions that the intelligent agent can take are preferably defined by the same parameters as a state is defined, which are for example amplitude, period and/or phase. Preferably, the actions that the intelligent agent is allowed to perform are dependent of the current state. In order to prevent too strong hyperventilation or too slow breathing, the value for the breathing frequency or interval may be constrained.
[0061] The reward function used is such that the reward balances the reward for obtaining the goal and the ability of the user to breathe according to a breathing pattern having the preferred value for the influenced breathing parameter. For example, the goal could be reducing the breathing parameter breathing frequency. In this example, the reward function should be designed such that it rewards lower breathing frequencies more than higher breathing frequencies. Also, alternatively or additionally the reward function in this example may be designed such that it rewards a decrease in breathing frequency compared to a previous state.
[0062] The ability of the user to breathe according to the breathing pattern having the preferred value for the influenced breathing parameter has to be taken into account in the reward function as well. This could for example be achieved by taking into account the difference between the preferred value of the influenced breathing parameter and the actually measured or sensed value for this influenced breathing parameter.
[0063] Alternatively, the ability of the user to breathe according to the breathing pattern could be determined in a broader sense. For example the breathing adaptation system could calculate an “ideal” breathing pattern using the measured values for the user's breathing frequency, amplitude and/or phase shifts. This could for example be achieved by using the above mentioned (co)sine function.
[0064] When calculating the ideal breathing pattern, preferably constraints could be provided to the non-influenced breathing parameters, such that they remain in a range that is assumed to be comfortable for the patient. When the measured non-influenced breathing parameters are within the allowed range, the breathing adaptation system will use the user's values when calculating the ideal breathing pattern. However, if the measured values are above or below the values set by the constraints, values taken from the allowed range will be used for calculating the ideal breathing pattern. Usually these values will be the upper or lower limit of the allowed range.
[0065] The control unit could then determine the difference between the calculated ideal breathing pattern and the actual breathing pattern of the patient/user. The difference could for example be expressed in a sum of squared differences, but alternatives are possible. In this way not only the breathing parameter that is being optimized (e.g. frequency) is taken into account, but also other parameters that may have an effect on the patient's comfort (e.g. breathing amplitude) when assessing the patient's comfort. Also, other irregularities in the measured breathing parameter could be detected in this way and be fed back to the intelligent agent by means of the reward function.
[0066] In order to obtain a value for the measured breathing parameter(s) it is advantageous to take into account a (weighted) average over multiple breathing cycles of the user or patient.
[0067]
[0068] When the preferred value of the influenced breathing parameter is provided to the user, the sensor keeps on measuring the values of the influenced breathing parameter achieved by the user. Depending on how well the user is capable in adapting to the preferred value of the influenced breathing parameter, the control unit 250 will not change the value for the preferred value of the influenced breathing parameter at all or will increase or decrease it in a faster or slower fashion.
[0069]
[0070]
[0071] The diagnostic imaging system (e.g. MRI system) 360 will receive input from the image protocol optimizer. The diagnostic imaging system does also comprise the breathing adaptation system according to claim 1 and will hence be configured for influencing the patient's breathing pattern.
[0072] Of course, the breathing pattern at home may not be the same as the breathing pattern during actual diagnostic imaging, but the output from the computer program product may give an indication of the patient's breathing pattern during diagnostic imaging or treatment delivery.
[0073]
[0077] Whilst the invention has been illustrated and described in detail in the drawings and foregoing description, such illustrations and description are to be considered illustrative or exemplary and not restrictive; the invention is not limited to the disclosed embodiments.