Patent classifications
G10L2013/083
SYSTEMS AND METHODS FOR PROVIDING NON-LEXICAL CUES IN SYNTHESIZED SPEECH
Systems and methods are disclosed for providing non-lexical cues in synthesized speech. An example system includes one or more storage devices including instructions and a processor to execute the instructions. The processor is to execute the instructions to: generate first and second non-lexical cues to enhance speech to be synthesized from text; determine a first insertion point of the first non-lexical cue in the text; determine a second insertion point of the second non-lexical cue in the text; and insert the first non-lexical cue at the first insertion point and the second non-lexical cue at the second insertion point. The example system also includes a transmitter to communicate the text with the inserted first non-lexical cue and the inserted second non-lexical cue over a network.
SYSTEMS AND METHODS FOR PROVIDING NON-LEXICAL CUES IN SYNTHESIZED SPEECH
Systems and methods are disclosed for providing non-lexical cues in synthesized speech. An example system includes one or more storage devices including instructions and a processor to execute the instructions. The processor is to execute the instructions to: determine a user tone of the user input; generate a response to the user input based on the user tone; and identify a response tone associated with the user tone. The example system also includes a transmitter to communicate the response and the response tone over a network.
INTERNET-ENABLED AUDIO-VISUAL GRAPHING CALCULATOR
A method of graphically representing mathematical expressions in both audio and visual formats on a user device is described. Embodiments of the present invention include an Internet-enabled audio-visual graphing calculator that receives input from a user device in at least one of at least one of audio, visual, or Braille formats. An embodiment of the present invention interprets input received from the user device as a typeset mathematical expression, parses the typeset mathematical expression into an interpreted mathematical expression and compiles the interpreted mathematical expression into an evaluation function. At least one point is sampled on the evaluation function. The sampled evaluation function is rendered as a graph in a tactile graphic using a Braille display or embosser, or on a visual display. In an embodiment of the invention, an audible representation of the rendered graph is generated for playback on the user device.
AIR WRITING TO SPEECH SYSTEM USING GESTURE AND WRIST ANGLE ORIENTATION FOR SYNTHESIZED SPEECH MODULATION
A gesture to speech conversion device may receive indications of user gestures via at least one sensor, the indications identifying movement in three dimensions. A 2-dimensional (2D) plane on which a beginning of the movement and an end of the movement is substantially planar and a third dimension orthogonal to the 2D plane may be determined. A change of the movement in a direction of the third dimension in a course of the movement occurring on the 2D plane is detected. The change of the movement in the third dimension is mapped to an emphasis in the movement. The movement is transformed into speech with emphasis on a part of the speech corresponding to a part of the movement having the detected change.
Immersive electronic reading
Electronic reading devices provide readers with text on a display, and enhancements to their functionality and efficiency are discussed herein. Text is provided to the reader in an enhanced contrast mode that highlights the active word and line of the text as well as words of interest in the text so as to improve the functionality of the electronic reading device itself as a provider of textual content.
Systems and methods for providing non-lexical cues in synthesized speech
Systems and methods are disclosed for providing non-lexical cues in synthesized speech. Original text is analyzed to determine characteristics of the text and/or to derive or augment an intent (e.g., an intent code). Non-lexical cue insertion points are determined based on the characteristics of the text and/or the intent. One or more non-lexical cues are inserted at insertion points to generate augmented text. The augmented text is synthesized into speech, including converting the non-lexical cues to speech output.
Methods and systems for teaching a hebrew bible trope lesson
A method for teaching a Hebrew Bible trope lesson, comprising: accessing, using a computing device, a symbolic representation of the Hebrew Bible trope lesson, comprising either Torah tropes or Haftorah tropes or Five Scrolls tropes, each according to a lesson plan; accessing, using a computing device, an audio recording of a human chanting the Hebrew Bible trope lesson; accessing, using a computing device, a first boundary time, denoting a word boundary within said audio recording; accessing, using a computing device, a second boundary time, denoting a word boundary within said audio recording; playing, using an electronic device, said audio recording of said human at the first boundary time, until the second boundary time; displaying the symbolic representation, said displaying comprising: visually distinguishing said symbolic representation, said visually distinguishing being synchronized with the playing of said audio recording of said human using said first boundary time and said second boundary time.
INTERNET-ENABLED AUDIO VISUAL GRAPHING CALCULATOR
A method of graphically representing mathematical expressions in both audio and visual formats on a user device is described. Embodiments of the present invention include an Internet-enabled audio-visual graphing calculator that receives input from a user device in at least one of at least one of audio, visual, or Braille formats. An embodiment of the present invention interprets input received from the user device as a typeset mathematical expression, parses the typeset mathematical expression into an interpreted mathematical expression and compiles the interpreted mathematical expression into an evaluation function. At least one point is sampled on the evaluation function. The sampled evaluation function is rendered as a graph on a visual display of a user device. In an embodiment of the invention, an audible representation of the rendered graph is generated for playback on the user device.
TEXT NORMALIZATION AND INVERSE TEXT NORMALIZATION USING WEIGHTED FINITE-STATE TRANSDUCERS AND NEURAL LANGUAGE MODELS
Systems and methods provide for text normalization or inverse text normalization using a hybrid language system that combines rule-based processing with neural or learned processing. For example, a hybrid rule-based and neural approach identifies semiotic tokens within a textual input and generates a set of potential plain-text conversions of the semiotic tokens. The plain-text conversions are weighted and evaluated by a trained language model that rescores the plain-text conversion based on context to identify a highest scoring plain-text conversion for further processing within a language system pipeline.
SYSTEMS AND METHODS FOR PROVIDING NON-LEXICAL CUES IN SYNTHESIZED SPEECH
Systems and methods are disclosed for providing non-lexical cues in synthesized speech. An example system includes processor circuitry to generate a breathing cue to enhance speech to be synthesized from text; determine a first insertion point of the breathing cue in the text, wherein the breathing cue is identified by a first tag of a markup language; generate a prosody cue to enhance speech to be synthesized from the text; determine a second insertion point of the prosody cue in the text, wherein the prosody cue is identified by a second tag of the markup language; insert the breathing cue at the first insertion point based on the first tag and the prosody cue at the second insertion point based on the second tag; and trigger a synthesis of the speech from the text, the breathing cue, and the prosody cue.