MULTIMODAL CONVERSATIONAL PLATFORM FOR REMOTE PATIENT DIAGNOSIS AND MONITORING
20230018524 · 2023-01-19
Inventors
- Vikram Ramanarayanan (Berkeley, CA, US)
- Oliver Roesler (Weyhe, DE)
- Michael Neumann (Waiblingen, DE)
- David Pautler (San Francisco, CA, US)
- Doug Habberstad (Hilton Head, SC, US)
- Andrew Cornish (Gore, NZ)
- Hardik Kothare (Burlingame, CA, US)
- Vignesh Murali (Cambridge, MA, US)
- Jackson Liscombe (New Marlborough, MA, US)
- Dirk Schnelle-Walka (Rhineland-Palatinate, DE)
- Patrick Lange (San Francisco, CA, US)
- David Suendermann-Oeft (San Francisco, CA, US)
Cpc classification
A61B5/097
HUMAN NECESSITIES
G16H50/30
PHYSICS
A61B5/4803
HUMAN NECESSITIES
A61B5/7465
HUMAN NECESSITIES
A61B5/7278
HUMAN NECESSITIES
International classification
A61B5/00
HUMAN NECESSITIES
Abstract
A virtual agent instructs a responding person to perform specific verbal exercises. Audio and image inputs from the responding person's performance of the exercises are used to identify speech, video, cognitive, and/or respiratory biomarkers, which are then used to evaluate speech motor function and/or neurological health. Contemplated exercises include test aspects of oral motor proficiency, sustained phonation, diadochokinesis, reading speech, spontaneous speech, spirometry, picture description, and emotion elicitation. Metrics from evaluation of the responding person's performance are advantageously produced automatically, and are presented in spreadsheet format.
Claims
1. A method of assessing a medical or psychological condition of a responding person, comprising configuring a processor to execute instructions that operate a virtual agent configured to: instruct the responding person to perform specific verbal exercises; utilize audio and image inputs from the responding person's performance of the exercises, to identify at least one of speech, video, cognitive, and respiratory biomarkers with respect to at least one of speech motor function and neurological health; and providing metrics corresponding to the responding person's performance with respect to at least some of the exercises.
2. The method of claim 1, wherein at least one of the exercises is selected to test aspects of oral motor proficiency.
3. The method of claim 1, wherein at least one of the exercises is selected to test aspects of sustained phonation.
4. The method of claim 1, wherein at least one of the exercises is selected to test aspects of diadochokinesis
5. The method of claim 1, wherein at least one of the exercises is selected to test aspects of reading speech.
6. The method of claim 1, wherein at least one of the exercises is selected to test aspects of spontaneous speech.
7. The method of claim 1, wherein at least one of the exercises is selected to test aspects of spirometry.
8. The method of claim 1, wherein at least one of the exercises is selected to test aspects of picture description.
9. The method of claim 1, wherein at least one of the exercises is selected to test aspects of emotion elicitation
10. The method of claim 1, further comprising rendering the metrics in a spreadsheet format.
11. The method of claim 1, wherein the utilizing the audio and image inputs is completely automatic.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
DETAILED DESCRIPTION
[0021] The following discussion provides many example embodiments of the inventive subject matter. Although each embodiment represents a single combination of inventive elements, the inventive subject matter is considered to include all possible combinations of the disclosed elements. Thus if one embodiment comprises elements A, B, and C, and a second embodiment comprises elements B and D, then the inventive subject matter is also considered to include other remaining combinations of A, B, C, or D, even if not explicitly disclosed.
[0022] As used herein, and unless the context dictates otherwise, the term “coupled to” is intended to include both direct coupling (in which two elements that are coupled to each other contact each other) and indirect coupling (in which at least one additional element is located between the two elements). Therefore, the terms “coupled to” and “coupled with” are used synonymously.
[0023] As used in the description herein and throughout the claims that follow, the meaning of “a,” “an,” and “the” includes plural reference unless the context clearly dictates otherwise. Also, as used in the description herein, the meaning of “in” includes “in” and “on” unless the context clearly dictates otherwise.
[0024] All methods described herein can be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. The use of any and all examples, or exemplary language (e.g. “such as”) provided with respect to certain embodiments herein is intended merely to better illuminate the invention and does not pose a limitation on the scope of the invention otherwise claimed. No language in the specification should be construed as indicating any non-claimed element essential to the practice of the invention. Unless a contrary meaning is explicitly stated, all ranges are inclusive of their endpoints, and open-ended ranges are to be interpreted as bounded on the open end by commercially feasible embodiments.
[0025] Groupings of alternative elements or embodiments of the invention disclosed herein are not to be construed as limitations. Each group member can be referred to and claimed individually or in any combination with other members of the group or other elements found herein. One or more members of a group can be included in, or deleted from, a group for reasons of convenience and/or patentability. When any such inclusion or deletion occurs, the specification is herein deemed to contain the group as modified thus fulfilling the written description of all Markush groups used in the appended claims.
[0026]
[0027]
[0028] Although virtual agent 120 can be presented simplistically to the responding person 130 as a disembodied voice, or perhaps a still image or cartoon (not shown), virtual agent 120 is preferably presented in a more realistic approximation of a live person. In
[0029] Virtual agent 120 should be interpreted as including one or more processors storing and executing instructions on one or more computer readable, non-transitory storage devices. Contemplated computing and storage devices include one or more computers operating as a web server, database server, or other type of computer server, and related storage devices, and can be physically local to one another, or more likely are distributed in different cities and even different countries. Although virtual agent 120 is depicted as interacting with a single responding person 130, virtual agent 120 should be interpreted as being configured in a cloud or other computing environment that allows virtual agent 120 to concurrently assess multiple responding persons.
[0030] Cloud 110 should be viewed generically as any suitable communications network, over which are traveling communications between the virtual agent 120 and the responding person 130.
[0031] In
[0032] Although responding person 130 is depicted as sitting at a desk, it is contemplated that responding person 130 could be interacting in any suitable posture, including for example, walking about, sitting on a couch, or lying in bed. However, it is important that responding person 130 is situated with respect to the camera and microphone such that the virtual agent can obtain sufficient information from the responding person's lip and other facial movements, and speech characteristics.
[0033] Although responding person 130 is shown as an older man,
[0034]
[0035] Contemplated oral motor exercises include, but are not limited to, measurements of facial extremes, range of motion probes like spreading of lips (smiling), puckering (with the jaw closed) and combinations thereof.
[0036] Contemplated sustained phonation exercises include, but are not limited to, taking a deep breath and voicing and holding different vowels such as “aa”, “ii” and “uu” for specified amounts of time.
[0037] Contemplated diadochokinesis exercises include, but are not limited to, speaking certain mono- or poly-syllabic utterances such as “pa-pa-pa” or “pa-to-ka” repeatedly and continuously until one runs out of breath.
[0038] Contemplated read speech exercises include, but are not limited to, reading out loud various standardized read speech passages, such as the Bamboo Passage or the Rainbow Passage.
[0039] Contemplated spontaneous speech exercises include, but are not limited to, speaking for specified amounts of time about various topics, such as hobbies, vacations or favorite foods.
[0040] Contemplated spirometry exercises include, but are not limited to, guided inhalation, exhalation and coughing exercises.
[0041] Contemplated picture description exercises include, but are not limited to, spoken descriptions of different pictures presented to the participant or patient.
[0042] Contemplated emotion elicitation exercises include, but are not limited to, elicitation of pitch glides and acted vocal readings of various sentences with different evoked emotional affect.
[0043]
[0044] It should also be appreciated that practice of the concepts disclosed herein are especially valuable when communication with responding persons is executed entirely or almost entirely automatically, and assessment of the various performances to produce metrics as in
[0048]
[0049] It should be apparent to those skilled in the art that many more modifications besides those already described are possible without departing from the inventive concepts herein. The inventive subject matter, therefore, is not to be restricted except in the spirit of the appended claims. Moreover, in interpreting both the specification and the claims, all terms should be interpreted in the broadest possible manner consistent with the context. In particular, the terms “comprises” and “comprising” should be interpreted as referring to elements, components, or steps in a non-exclusive manner, indicating that the referenced elements, components, or steps may be present, or utilized, or combined with other elements, components, or steps that are not expressly referenced. Where the specification refers to at least one of something selected from the group consisting of A, B, C . . . and N, the text should be interpreted as requiring only one element from the group, not A plus N, or B plus N, etc.