Electrolarynx devices and uses thereof
11076950 · 2021-08-03
Assignee
Inventors
Cpc classification
A61F2/20
HUMAN NECESSITIES
A61F2002/206
HUMAN NECESSITIES
International classification
A61F2/20
HUMAN NECESSITIES
Abstract
The present disclosure relates to electrolarynx devices, systems, and their use. In particular, the present disclosure relates to methods and compositions (e.g., devices) that provide electrolarynx (EL) users with improved speech quality.
Claims
1. An electrolarynx system, comprising; a) a sensor configured to be placed in the stoma of a human subject; b) a control element controlled by said subject's breath or movement of said subject's jaw configured to obtain signal from said sensor and translate said signal into speech, wherein said control element is configured to turn said speaker on when said subject is talking and turn said speaker off when said subject is silent; and c) a speaker configured to amplify said speech.
2. The system of claim 1, wherein said control element comprises one or more of an electrical circuit, a computer processor, and computer memory.
3. The system of claim 1, wherein said control element is further configured to perform articulatory pattern recognition of said subject's speech.
4. The system of claim 1, wherein said signal is transmitted from said sensor to said control element via an electrical cable.
5. A method of generating artificial speech, comprising: contacting the system of claim 1 with a subject's stoma under conditions such that said device generates speech and transmit said speech through the speaker of said device.
6. The method of claim 5, wherein said subject lacks a larynx.
7. The method of claim 6, wherein said subject has previously been diagnosed with oral cancer.
8. The method of claim 6, wherein method is performed hands free.
Description
DESCRIPTION OF THE FIGURES
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
DETAILED DESCRIPTION
(11) The present disclosure relates to electrolarynx devices, systems, and their use. In particular, the present disclosure relates to methods and compositions (e.g., devices) that provide electrolarynx (EL) users with improved speech quality.
(12) With the surgical removal of the natural larynx, there is a loss of the vocal cord and the connection thereof with the trachea. Laryngectomees therefore lack phonation due to this missing air-pressure-modulating functionality and hence have to undergo speech or voice rehabilitation.
(13) There are three methods of voice rehabilitation after laryngectomy: esophageal speech, tracheo-esophageal speech, and electrolaryngeal speech. Considering its ease of use compared to the other alternatives, the electronic larynx device (electrolarynx) is the most widely adopted for voice rehabilitation after laryngectomy. Its limitations include distortion and mechanical perception demonstrated by the robotic nature of the speech produced with the device (this leads to difficulty in comprehension for the listener and unwanted attention to the user), confusion with voiced and voiceless pairs especially at the start of sentences hence reduced speech intelligibility, inability to track and match speech prosody as the electrolaryngeal speech sounds monotonous without the needed inflections (this is especially evident when asking questions or in heavily tonal languages e.g. the Igbo language and many East Asian languages), limited speech intensity (the amplitude is relatively lower than laryngeal speech and can be easily drowned in noisy environment), and inconvenience from the hand-held nature (EL users cannot simultaneously talk while using both of their hands).
(14) The anatomy changes after a laryngectomy. After a total laryngectomy, the individual is breathing through the stoma where the tracheostomy opens in the neck. There is no longer a connection between the trachea and the mouth and nose. These individuals are termed total neck breathers. After a partial laryngectomy, the individual breathes mainly through the stoma, but a connection still exists between the trachea and upper airways; these individuals are able to breathe air through the mouth and nose. They are therefore termed partial neck breathers. The extent of breathing through the upper airways in these individuals varies and a tracheostomy tube is present in many of them. Ventilation and resuscitation of total and partial neck breathers is through their stoma; however, in these individuals the mouth should be kept closed and the nose sealed to prevent air escape.
(15) People do not keep track of their pitch, volume, and on/off or pauses in-between words when speaking. Existing devices use a capacitive slider or pressure knob that require an extreme level of manual dexterity and consciousness which is not quite reasonable in addition to other disadvantages (e.g. charge buildup). In addition, existing devices do not mimic pitch. Pitch is a parameter that is very difficult to estimated externally hence it is important to use the human's signal as control signals for the pitch adjustments.
(16) One of the distinguishing features of devices described herein is that they are not a corrective measure as many other techniques that aim to reverse-engineer the electrolarynx in order to make it sound better. The devices described herein instead generate accurate information from the sensor in the stoma to accurately form the speech. This can also eliminate the use of square waves for modulation, which is by itself a source of distortion to the speech produced.
(17) Accordingly, provided herein is an electrolarynx that overcomes many of the limitations of existing devices. In some embodiments, the non-ideal excitation sound of the electrolarynx is corrected. In some embodiments, devices described herein utilize a subsystem to fix the voice/voiceless indistinguishability by creating a synchronous on/off circuitry using human breath to turn on the electrolarynx at the onset of speech and turn it off when speech is over. By so doing, the buzz which would have otherwise filled in the spectral valley of the region of silence is eliminated that makes a “voiceless sound” be perceived as “voiced’ in existing devices. In addition, in some embodiments, the sensed-human breath “amplitude information” is used to control the speech volume.
(18) In some embodiments, the frequency part of the sensed human breath is used to control speech pitch (See e.g., UEMI et al. (1994)).
(19) In some embodiments, devices described herein encode speech articulatory patterns into bits recognizable as inputs to ASR systems for clearer communication. (e.g., flex-sensors with resistors are used to measure jaw excursions (e.g., by use of an accelerometer placed underneath the jaw) and then the info is encoded into bits and fed into a microcontroller which is used to “switch” the speech into “normal sounding speech”).
(20) An exemplary device is described in
(21) The control element 5 senses via information from the stroma sensor 1 when speech starts and stops and silences the speaker 6 accordingly.
(22)
(23) A flow chart of an exemplary process for using the devices described herein is shown in
(24) All publications, patents, patent applications and accession numbers mentioned in the above specification are herein incorporated by reference in their entirety. Although the disclosure has been described in connection with specific embodiments, it should be understood that the disclosure as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications and variations of the described compositions and methods of the disclosure will be apparent to those of ordinary skill in the art and are intended to be within the scope of the following claims.