A METHOD AND SYSTEM FOR MONITORING AND ANALYSING COUGH

20230008906 · 2023-01-12

    Inventors

    Cpc classification

    International classification

    Abstract

    The method and system for monitoring cough comprises receiving audio signals or audio recordings, where said signals or audio recordings comprises one or more of silent segments, cough sound segments, speech segments and extraneous noise. The processing of said received sound signals or sound recordings comprise one or more of removing one or more speech components from speech segments to render the speech unintelligible and clipping said silent segments, wherein one or more speech components include vowel sounds. Further processing of said received audio signals or audio recordings further comprises compressing said audio signals or audio recordings. In the alternative, processing of audio signals or audio recordings comprises compressing a resultant signal after said removal of one or more speech components and/or clipping of silent segments from said audio signals.

    Claims

    1. A cough monitor for a subject, comprising: a processor; a microphone module, having a first microphone and a second microphone, operatively coupled to the processor; a memory operatively coupled to the processor; said processor configured to: receive signals from said first microphone and second microphone, said signals comprising audio, wherein the audio comprises cough sound segments and speech segments to define a cough audio event; process said received signals from said microphone module by removing one or more speech components from speech segments to render the speech unintelligible, such that only speech utterances are removed from the cough audio event; and store said processed signals in said memory.

    2. The cough monitor of claim 1, wherein processing said received signals further comprises: clipping said silent segments; and/or removing extraneous non-cough noise.

    3. The cough monitor of claim 1, wherein specific audio features are extracted to detect speech utterances from an audio signal received from said first microphone.

    4. The cough monitor of claim 1, wherein the specific audio features extracted comprises a measure of periodicity in the audio signal relating to the vibration of the vocal folds within a specific frequency range using a custom autocorrelation function.

    5. The cough monitor of claim 1, wherein said processing comprises the step of using values of surrounding audio frames to determine a voiced threshold value for detecting speech over a specific frequency range.

    6. (canceled)

    7. The cough monitor of claim 1, wherein processing said signals comprises measuring the changes in acoustic energy over time using an energy ratio to discriminate between speech utterances and cough events,

    8. The cough monitor of claim 1, wherein an energy ratio comprises a measure of the ratio of acoustic energy between the first microphone and the second microphone to discriminate between cough events and third party speech.

    9. The cough monitor of claim 1, wherein processing said received signals further comprises: compressing said signals comprising said audio; or compressing a resultant signal after said removal of one or more speech components and/or clipping of silent segments from said signals comprising audio.

    10. The cough monitor of claim 1, wherein one or more speech components include vowel sounds.

    11. The cough monitor of claim 1, further comprising an accelerometer operatively coupled to said processor to obtain the severity of cough from said accelerometer readings, wherein said accelerometer is mechanically coupled to the chest of the subject.

    12. The cough monitor of claim 1, further comprising a gyroscope operatively coupled to said processor to obtain the severity of cough from said gyroscope readings.

    13. The cough monitor of claim 1, further comprising a wireless transceiver for transmitting said processed signals to a server or for wireless communication with one or more sensors.

    14. The cough monitor of claim 1, wherein the first microphone comprises an air microphone configured to be attached to a lapel of the subject; and the second microphone comprises a contact microphone configured to be attached to the chest of the subject.

    15. The cough monitor of claim 14, wherein an air microphone and contact microphone are connected to the cough monitor via a single connection port or a wireless connection.

    16. The cough monitor of claim 1, wherein said first microphone module comprises: an air microphone built into the cough monitor; and said second microphone comprises a contact microphone built into the cough monitor, said cough monitor and said contact microphone configured to be attached to the chest of the subject.

    17. A method for cough monitoring, comprising the steps of: receiving signals from a first microphone and a second microphone, said signals comprising audio, wherein the audio comprises cough sound segments and speech segments to define a cough audio event; processing said received signals by removing one or more speech components from speech segments to render the speech unintelligible, such that only speech utterances are removed from the cough audio event; and storing said processed signals in memory.

    18. The method of claim 17, wherein processing said received signals comprise one or more of: removing one or more speech components from speech segments to render the speech unintelligible; clipping said silent segments; and removing extraneous non-cough noise.

    19. The method of claim 17, wherein specific audio features are extracted to detect speech utterances from an audio signal received from said first microphone.

    20. The method of claim 17, wherein specific audio features extracted comprises measuring a periodicity in the audio signal relating to the vibration of the vocal folds within a specific frequency range using a custom autocorrelation function.

    21. The method of claim 17 comprises the step of using values of surrounding audio frames to determine a voiced threshold value for detecting speech over a specific frequency range.

    22. (canceled)

    23. The method of claim 17 comprising the step of measuring the changes in acoustic energy over time using an energy ratio to discriminate between speech utterances and cough events,

    24. The method of claim 17, wherein an energy ratio comprises a measure of the ratio of acoustic energy between the first microphone and the second microphone to discriminate between cough events and third party speech.

    25. The method of claim 17, wherein processing said received signals further comprises: compressing said signals comprising said audio; or compressing a resultant signal after said removal of one or more speech components and/or clipping of silent segments from said signals comprising audio.

    26. The method of claim 17, further comprising detection of one or more fault conditions, wherein the one or more conditions comprises low battery, battery door removal, faulty sensors, short circuit across sensors, open circuit across sensors, insufficient memory, memory absent, and/or clock reset.

    27. The method of claim 17, further comprising monitoring the status of the module by determining the status of energy harvesting parameters during use.

    28. A cough monitor for a subject, comprising: a processor; a microphone module, having a first microphone and a second microphone, operatively coupled to the processor; a memory operatively coupled to the processor; said processor configured to: receive signals from said first microphone and second microphone, said signals comprising audio, wherein the audio comprises cough sound segments and speech segments to define a cough audio event; process said received signals from said microphone module by synthesising one or more speech components from speech segments to render the speech unintelligible, such that only speech utterances are synthesised from the cough audio event; and store said processed signals in said memory.

    Description

    BRIEF DESCRIPTION OF THE DRAWINGS

    [0049] The invention will be more clearly understood from the following description of an embodiment thereof, given by way of example only, with reference to the accompanying drawings, in which:—

    [0050] FIG. 1 exemplarily illustrates a flowchart of the method for cough monitoring;

    [0051] FIG. 2 exemplarily illustrates a block diagram of the cough monitor device; and

    [0052] FIGS. 3-6 illustrates a number of speech and cough audio signals outputted by the algorithm according to an embodiment of the present invention.

    DETAILED DESCRIPTION OF THE DRAWINGS

    [0053] The present invention relates to a method and system for monitoring a cough. More specifically, the present invention relates to efficiently recording and analysing a cough.

    [0054] FIG. 1 exemplarily illustrates a flowchart of the method for cough monitoring. The method for cough monitoring comprises receiving 101 signals from a microphone module, said signals comprising audio. In one embodiment the microphone module comprises a first microphone and a second microphone each configured with a separate channel. Alternatively, audio recordings may be received 101 via a network or physically on a removable memory device such as a secure digital card. The audio signals or recordings comprise of one or more of silent segments, cough sound segments and speech segments and/or extraneous noise. The audio signals or recordings are processed 102 and the processed signals or recordings stored 103 in memory thereafter. The processing of said received sound signals or sound recordings comprise of removing one or more speech components from speech segments to render the speech unintelligible and clipping said silent segments, wherein one or more speech components include vowel sounds.

    [0055] Further, processing of said received audio signals or audio recordings comprises compressing said audio signals or audio recordings. In the alternative, processing of audio signals or audio recordings comprises compressing a resultant signal after said removal of one or more speech components and/or clipping of silent segments from said signals comprising audio.

    [0056] In an embodiment, the processing of the audio signals is carried out by a cough monitoring device and said processed signals are then transmitted to a server via a wireless network. The method further comprises detection of one or more fault conditions comprising low battery, battery door removal, faulty sensors, short circuit across sensors, open circuit across sensors, insufficient memory, memory absent, and/or clock reset, by the cough monitoring device. It will be appreciated that in the context of the present invention the microphone can be interpreted as a sensor.

    [0057] The processed audio signals or audio recordings are then reviewed by semi-skilled personnel. The personnel thereafter identifies the cough sounds by listening to said processed audio signals or audio recordings. A person skilled in the art would appreciate that cough sounds are generally divided into three phases namely the explosive phase, the intermediate phase and the voiced phase. The personnel tags the explosive phase of each cough sound to finally generate cough data for a subject. A person skilled in the art would appreciate that various cross checks or quality assurance audits or checks may be carried out to rule out human error in identification of coughs. For example, the timeline of each recording is maintained across the process, with cough tags and events marked/timestamped. The cough tags and events can be ascertained from measurements obtained from the accelerometer or a gyroscope. In the context of the present invention coughs tagged by the skilled person are events. Other events can be from the subject pressing event marker buttons. Measurements from the accelerometer/gyroscope can indicate severity of cough and support skilled person in identify a sound signal as a cough. These events can be marked as a timed event.

    [0058] Also, a person skilled in the art would appreciate that by clipping the silent audio segments from the audio recordings and compressing the remaining audio segments significantly reduces the play time a semi-skilled personnel needs to review/analyse for obtaining the objective cough information of the subject/patient. Further, since the speech in the remaining segments would be rendered unintelligible, hence, the privacy of the subject is maintained.

    [0059] The system for monitoring cough comprises a cough monitor 200 of a subject and FIG. 2 exemplarily illustrates a block diagram of the cough monitor device 200.

    [0060] The cough monitor comprises a processor 201, a microphone module operatively coupled to the processor and a memory 202 operatively coupled to the processor. The processor 201 is configured to receive signals from said microphone module, said signals comprising audio, said audio comprising one or more of silent segments, cough sound segments and speech segments.

    [0061] Further, the processor 201 is configured to process said received signals from said microphone module and store said processed signals in said memory 202. Processing said received signals comprise one or more of removing one or more speech components from speech segments to render the speech unintelligible and clipping said silent segments where one or more speech components include vowel sounds. Also, processing said received signals further comprises compressing said signals comprising said audio, or compressing a resultant signal after said removal of one or more speech components and/or clipping of silent segments from said signals comprising audio.

    [0062] In a preferred embodiment of the invention a signal processing algorithm processes the audio signals from two microphones (two channels) to obfuscate speech within the recorded audio signals. The microphone module can comprise a non-contact microphone 203 and a contact microphone 204, suitably configured to be attached to the chest of the subject. Specific audio features are extracted from each of the two separate microphone channels to ensure that speech utterances are made unintelligible while leaving whole cough events intact. From the non-contact microphone channel, several specific audio features are extracted from the said audio signal to detect speech utterances.

    [0063] Features are extracted from a number of audio frames and subsequently overlapped, for example 40 ms audio frames are overlapped by 20 ms. These features can comprise one or more of the following:

    [0064] Non-contact microphone audio features (for detecting speech utterances) such as an adaptive voiced feature. The adaptive voiced feature can be defined as a measure of periodicity in the audio signal relating to the vibration of the vocal folds within a specific frequency range. For example, a frequency range of 45-500 Hz using a custom autocorrelation function can be used. A threshold value can be used, which uses values of surrounding audio frames to determine a voiced threshold value for detecting speech. A spectral centroid feature can also be used using measure of centre of mass of the frequency spectrum.

    [0065] In relation to the contact microphone audio features (for detecting speech utterances) an Energy Slope feature is used in the processing. A measure of the changes in acoustic energy over time to define an energy slope. The energy slope feature is notably different when comparing speech utterances and cough events, hence, both microphone channels are employed in the detection of speech utterances.

    [0066] An important aspect of the processing is the use of dual channel audio features obtained from the microphones where an energy ratio can be calculated. The energy ratio is a measure of the ratio of acoustic energy between the contact microphone and the non-contact microphone. This feature is advantageous in discriminating between cough events and third party speech.

    [0067] It will be appreciated that the algorithm and features are specifically designed to detect not only adult speech, but also child speech, third party speech, and speech coming from a loudspeaker (such as a loudspeaker on a mobile phone). Loudspeaker speech has different acoustic properties compared to natural speech.

    [0068] In an alternative embodiment of the invention is that the processing can be used to implement synthesising one or more speech components from speech segments to render the speech unintelligible, such that only speech utterances are synthesised from the cough audio event. In other words the invention can provide the option to “synthesise” the speech utterances rather than remove them from the said audio signals. The voiced segment of speech is synthesised by extracting specific features from the said audio frame and transforming it into a synthesised waveform.

    [0069] The advantage of this approach is that the audio signal visually resembles the original audio signal however, the voiced segment of speech signals are made completely unintelligible. This approach can be useful, for example, for determining events such as sleep events from the cough audio recordings.

    [0070] It can show that the subject wearing the device may be speaking (showing the subject is awake) but with sensitive information contained in the speech obfuscated. The synthetic voiced signal is generated by extracting the fundamental frequency relating to the original pitch of the voiced signal. The acoustic energy of the audio frame is also extracted.

    [0071] A synthetic signal is then generated using the extracted fundamental frequency (with the first two harmonics), acoustic energy and with some random noise added to it. This synthetic signal can be constructed as:


    xsynthetic(t)=A.Math.(sin(2πf0t)+B.Math.sin(2πf1t)+C.Math.sin(2πf2t)+γ(t)) [0072] Where xsynthetic(t) is the generated synthetic signal [0073] A is the amplitude of the original speech audio frame [0074] B and C are between 0 and 1 [0075] f0 is the fundamental frequency of the original speech audio frame [0076] f1 is the first harmonic of the original speech audio frame [0077] f2 is the second harmonic of the original speech audio frame [0078] γ(t) is random white noise.

    [0079] The algorithm synthesises voiced frames of audio and is specifically designed to suit both human manual counting of cough events through visual and aural assessment, and automatic detection of cough events using an audio-based cough detection algorithm.

    [0080] FIGS. 3-6 illustrates a number of speech and cough audio signals outputted by the algorithm according to an embodiment of the present invention showing both original and synthesised versions from the algorithm output. The speech example shown in FIGS. 3 and 4 is a snippet of a conversation between the patient wearing the device and a third party speaker on the other end of the phone where the third party speech is audible in the original file. The cough example in FIGS. 5 and 6 contains two separate cough events.

    [0081] The cough monitor further comprises an accelerometer 205 operatively coupled to said processor 201 to obtain the severity of cough from said accelerometer readings, wherein said accelerometer 205 is mechanically coupled to the chest of the subject. The cough monitor further comprises a wireless transceiver 206 for transmitting said processed signals to a server.

    [0082] The microphone module comprises an air microphone 203 configured to be attached to the lapel and a contact microphone 204 configured to be attached to the chest of the subject. The air microphone 203 and said contact microphone 204 are connected to the cough monitor 200 via a single connection port. In an embodiment, the microphone module comprises an air microphone 203 built into the cough monitor and a contact microphone 204 built into the cough monitor 200, said cough monitor 200 and said contact microphone 204 configured to be attached to the chest of the subject using a biodegradable adhesive.

    [0083] In an embodiment, the processor 201 is configured to store the processed signals in a selected format selected from a set of predetermined formats. In an embodiment, memory 202 comprises a solid state drive, a removable secure digital card, and/or an encrypted memory encrypted with Advanced Encryption Standard 256 bit.

    [0084] In an embodiment, the cough monitor 200 switches/powers on when a removable secure digital card is inserted in a secure digital card slot of the cough monitor and switches/powers off in absence thereof.

    [0085] In an embodiment, the processor 201 is configured to detect one or more fault condition comprising low battery, battery door removal, faulty sensors, short circuit across sensors, open circuit across sensors, insufficient memory, memory absent, and/or clock reset or other fault condition.

    [0086] The cough monitor, further comprises a user interface to allow the subject to mute said air microphone, or indicate waking time, or indicate sleeping time, or indicate medication dosing time or programmable events.

    [0087] In an embodiment, the system comprises a server, where the server is configured to receive one or more audio recordings from a cough monitor via a network. The server may also receive one or more audio recordings physically on a secure digital card. The audio recordings comprising one or more of silent segments, cough sound segments and speech segments. The server is configured to process said audio recordings where processing of said received audio recordings comprise one or more of removing one or more speech components from speech segments to render the speech unintelligible and clipping said silent segments where one or more speech components include vowel sounds. Also, processing said received signals further comprises compressing said audio recordings comprising said audio, or compressing a resultant audio recording after said removal of one or more speech components and/or clipping of silent segments from said audio recordings comprising audio.

    [0088] Thereby, the method and system for recording using a microphone for monitoring cough for extended periods without affecting the privacy of the subject individual or of individuals surrounding said subject individual and only requiring a fraction of duration of recorded time for a semi-skilled person to monitor cough of a subject/patient.

    [0089] Further, a person ordinarily skilled in the art will appreciate that the various illustrative logical/functional blocks, modules, circuits, and process steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, or a combination of hardware and software. To clearly illustrate this interchangeability of hardware and a combination of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or a combination of hardware and software depends upon the design choice of a person ordinarily skilled in the art. Such skilled artisans may implement the described functionality in varying ways for each particular application, but such obvious design choices should not be interpreted as causing a departure from the scope of the present invention.

    [0090] The process described in the present disclosure may be implemented using various means. For example, the apparatus described in the present disclosure may be implemented in hardware, firmware, software, or any combination thereof. For a hardware implementation, the processing units, or processors(s) or controller(s) may be implemented within one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable gate arrays (FPGAs), processors, controllers, micro-controllers, microprocessors, electronic devices, other electronic units designed to perform the functions described herein, or a combination thereof.

    [0091] For a firmware and/or software implementation, software codes may be stored in a memory and executed by a processor. Memory may be implemented within the processor unit or external to the processor unit. As used herein the term “memory” refers to any type of volatile memory or non-volatile memory.

    [0092] In the specification the terms “comprise, comprises, comprised and comprising” or any variation thereof and the terms include, includes, included and including” or any variation thereof are considered to be totally interchangeable and they should all be afforded the widest possible interpretation and vice versa.

    [0093] A person skilled in the art would appreciate that the above invention provides a robust and economical solution to the problems identified in the prior art.

    [0094] The invention is not limited to the embodiments hereinbefore described but may be varied in both construction and detail.