G06F40/263

CAPTIONED TELEPHONE SERVICE SYSTEM HAVING TEXT-TO-SPEECH AND ANSWER ASSISTANCE FUNCTIONS
20230239401 · 2023-07-27

A captioned telephone service system having the text-to-speech and answer assistance functions includes a captioner, a text-to-speech system, and an answer assistance system. The captioner provides captions to a user during a phone call between the user and a peer by receiving the peer’s voice from a peer device, transcribing the peer’s voice into caption data, and transferring the caption data to the user device. The text-to-speech system is configured to receive text data from the user device, convert the text data into speech, and transfer the voice of the speech to the peer device via the voice path in real time. The answer assistance system is configured to receive the caption data from the captioner, analyze the caption data to identify a question, analyze the question to generate answer suggestions, and forward the answer suggestions to the user device for review, editing, and selection.
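The answer assistance flow described above (identify a question in the caption data, then generate suggestions for the user to review) can be illustrated with a minimal sketch. The abstract does not say how questions are detected or how suggestions are produced, so the word lists, the `find_questions` and `suggest_answers` functions, and the canned suggestions below are all assumptions made for illustration only.

```python
import re

# Hypothetical word lists; the abstract does not specify a detection method.
WH_WORDS = {"who", "what", "when", "where", "why", "how", "which"}
AUX_WORDS = {"is", "are", "do", "does", "did", "can", "could", "will", "would"}


def find_questions(caption: str) -> list[str]:
    """Return the sentences of a caption that look like questions."""
    sentences = re.split(r"(?<=[.?!])\s+", caption.strip())
    questions = []
    for sentence in sentences:
        words = sentence.lower().split()
        if not words:
            continue
        # Assumed heuristic: a trailing "?" or a leading interrogative word.
        if sentence.endswith("?") or words[0] in WH_WORDS | AUX_WORDS:
            questions.append(sentence)
    return questions


def suggest_answers(question: str) -> list[str]:
    """Return canned suggestions (placeholders) for the user to edit or select."""
    first_word = question.lower().split()[0]
    if first_word in AUX_WORDS:
        # Yes/no style question.
        return ["Yes.", "No.", "I'm not sure."]
    return ["Let me check and get back to you.", "Could you repeat that?"]


caption = "Hello there. Are you free on Friday? Great."
for q in find_questions(caption):
    print(q, suggest_answers(q))
```

In a real system the selected suggestion would then be passed to the text-to-speech path; that step is omitted here.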

SYSTEMS AND METHODS FOR AUTOMATED AUDIO TRANSCRIPTION, TRANSLATION, AND TRANSFER FOR ONLINE MEETING

The present invention discloses systems and methods for multimedia processing. For example, the present invention provides systems and methods for receiving spoken audio, converting the spoken audio to text, and transferring the text to a user. As desired, the speech or text can be translated into one or more different languages. Systems and methods for real-time conversion and transmission of speech and text are provided, including systems and methods for large scale processing of multimedia events.

ELECTRONIC DEVICE FOR MANAGING INAPPROPRIATE ANSWER AND OPERATING METHOD THEREOF
20230027222 · 2023-01-26

An electronic device is provided. The electronic device includes a processor and a memory that stores instructions. The instructions, when executed by the processor, cause the electronic device to receive a user input, to identify a natural language input corresponding to the user input, to identify a first natural language output corresponding to the natural language input, to identify at least one specified word from at least one word included in the first natural language output, to identify a second natural language output based on the at least one specified word being identified, and to output the second natural language output such that the second natural language output is provided to a user.
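The check described above can be sketched as a simple word filter applied to a candidate answer. This is only an illustration under stated assumptions: the `SPECIFIED_WORDS` set, the `FALLBACK_OUTPUT` text, and the `manage_output` function are hypothetical, and returning the first output unchanged when no specified word is found is an assumption the abstract does not state.

```python
import re

# Hypothetical list of specified words and a hypothetical replacement answer.
SPECIFIED_WORDS = {"stupid", "idiot"}
FALLBACK_OUTPUT = "Sorry, I can't help with that."


def manage_output(first_output: str,
                  specified_words: set[str] = SPECIFIED_WORDS,
                  fallback: str = FALLBACK_OUTPUT) -> str:
    """Return a second output if the first output contains a specified word."""
    words = re.findall(r"[a-z']+", first_output.lower())
    if any(word in specified_words for word in words):
        # A specified word was identified: provide the second output instead.
        return fallback
    # Assumption: with no specified word, the first output is passed through.
    return first_output


print(manage_output("That is a stupid idea."))
print(manage_output("That is a fine idea."))
```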

Systems and methods for determining the impact of issue outcomes
11562453 · 2023-01-24

A system for predicting and prescribing actions for impacting policymaking outcomes may include at least one processor configured to access first information scraped from the Internet to identify, for a particular pending policy, information about a plurality of policymakers slated to make a determination on the pending policy. The processor may parse the scraped first information to determine an initial prediction relating to an outcome of the pending policy. The processor may access second information to identify an action likely to change at least one of the initial prediction and the propensity of at least one policymaker, to thereby generate a subsequent prediction corresponding to an increase in a likelihood of achieving the desired outcome. The processor may display to the system user a recommendation to take the action in order to increase the likelihood of achieving the desired outcome.

Speech-to-text transcription with multiple languages

One embodiment provides a method that includes obtaining a default language corpus. A second language corpus is obtained based on a second language preference. A first transcription of an utterance is received using the default language corpus and natural language processing (NLP). At least one problem word in the first transcription is determined based on an associated grammatical relevance to neighboring words in the first transcription. Upon determining that a first probability score is below a first threshold, an acoustic lookup is performed for an audible match for the problem word in the first transcription based on an associated acoustical relevance. Upon determining that a second probability score is below a second threshold, it is determined whether a match for the problem word exists in the second language corpus. Upon determining that the match exists in the second language corpus, a second transcription for the utterance is provided.
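The two-threshold fallback described above can be sketched as follows. This is a toy illustration, not the claimed method: the `Word` record with precomputed `grammar_score` and `acoustic_score` fields, the default thresholds of 0.5, and the representation of the second language corpus as a dictionary from a misrecognized word to its second-language match are all assumptions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Word:
    text: str
    grammar_score: float   # assumed: fit with neighboring words (first score)
    acoustic_score: float  # assumed: best audible match in default corpus (second score)


def second_transcription(words: list[Word],
                         second_corpus: dict[str, str],
                         first_threshold: float = 0.5,
                         second_threshold: float = 0.5) -> Optional[str]:
    """Return a revised transcription, or None if no word was replaced."""
    output = []
    changed = False
    for word in words:
        text = word.text
        # Problem word: low grammatical relevance, then no good acoustic match.
        if (word.grammar_score < first_threshold
                and word.acoustic_score < second_threshold):
            match = second_corpus.get(word.text)
            if match is not None:
                text = match
                changed = True
        output.append(text)
    return " ".join(output) if changed else None


first = [Word("she", 0.9, 0.9), Word("said", 0.9, 0.9), Word("mercy", 0.2, 0.3)]
print(second_transcription(first, {"mercy": "merci"}))
```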

Constructing a computer-implemented semantic document

Technologies pertaining to electronic document understanding are described herein. A document is received, wherein the document includes a section of a type. An image of the document is generated, and a candidate region is identified in the image of the document, wherein the candidate region encompasses the section. A label is assigned to the candidate region based upon text of the section, wherein the label identifies the type of the section. An electronic document understanding task is performed based upon the label assigned to the candidate region.
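The labeling step described above (assigning a section type to a candidate region based on the section's text) might look like the sketch below. The `Region` record, the `KEYWORDS` table, and the rule of matching the first line of text against known headings are hypothetical; the abstract does not specify how the label is derived, and region detection in the image is assumed to have happened already.

```python
from dataclasses import dataclass


@dataclass
class Region:
    box: tuple[int, int, int, int]  # assumed (left, top, right, bottom) in pixels
    text: str                       # text of the section inside the region
    label: str = "unknown"


# Hypothetical mapping from a leading heading word to a section type.
KEYWORDS = {
    "abstract": "abstract",
    "introduction": "introduction",
    "references": "references",
}


def label_region(region: Region) -> Region:
    """Assign a section-type label based on the first line of the region's text."""
    stripped = region.text.strip()
    first_line = stripped.splitlines()[0].lower() if stripped else ""
    for keyword, label in KEYWORDS.items():
        if first_line.startswith(keyword):
            region.label = label
            break
    return region


region = label_region(Region((0, 0, 600, 200), "References\n[1] A. Author, Title."))
print(region.label)
```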
