Patent classifications
G10L13/00
AUTOMATED PROCESS FOR GENERATING NATURAL LANGUAGE DESCRIPTIONS OF RASTER-BASED WEATHER VISUALIZATIONS FOR OUTPUT IN WRITTEN AND AUDIBLE FORM
Generating specific and contextualized natural language descriptions based upon raster-based weather visualizations for a defined geographic region. The generated natural language descriptions are provided in a written and/or audible form. In some cases, these natural language descriptions are generated based on weather forecast data sets that indicate a relative motion of certain weather-related events.
Integrated System and Related Methods for Learning, Collaboration, Tournament Hosting, and Business Management
The present disclosure provides a system for hosting an online platform with multiple functionalities, separated into a plurality of interfaces, but all hosted within an integrated system to increase the immersion of a user in the learning experience. Multimedia content streaming, educational course, history, and tracking, and business management functions are provided on the various interfaces that quickly educate a user about a given industry. The platform is industry agnostic but can also be provided with specific functionalities such as competitive tournament hosting for the e-sports industry. Also provided herein is a method of translating an educational lecture from a first language into a plurality of second languages.
Transportation vehicle control with phoneme generation
A transportation vehicle having a navigation system and an operating system connected to the navigation system for data transmission via a bus system. The transportation vehicle has a microphone and includes a phoneme generation module for generating phonemes from an acoustic voice signal or the output signal of the microphone; the phonemes are part of a predefined selection of exclusively monosyllabic phonemes; and a phoneme-to-grapheme module for generating inputs to operate the transportation vehicle based on monosyllabic phonemes generated by the phoneme generation module.
Provision of targeted advertisements based on user intent, emotion and context
An electronic device and method are disclosed herein. The electronic device includes a microphone, a camera, an output device, a memory, and a processor. The processor implements the method, including receiving a voice input and/or capturing an image, and analyze the first voice input or the image to determine at least one of a user's intent, emotion, and situation based on predefined keywords and expressions, identifying a category based on the input, selecting first information based on the category, selecting and outputting a first query prompting confirmation of output of the first information, detect a first responsive input to the first query, and when a condition to output the first information is satisfied, output a second query, detecting a second input responsive to the second query, and selectively outputting the first information based on the second input.
Provision of targeted advertisements based on user intent, emotion and context
An electronic device and method are disclosed herein. The electronic device includes a microphone, a camera, an output device, a memory, and a processor. The processor implements the method, including receiving a voice input and/or capturing an image, and analyze the first voice input or the image to determine at least one of a user's intent, emotion, and situation based on predefined keywords and expressions, identifying a category based on the input, selecting first information based on the category, selecting and outputting a first query prompting confirmation of output of the first information, detect a first responsive input to the first query, and when a condition to output the first information is satisfied, output a second query, detecting a second input responsive to the second query, and selectively outputting the first information based on the second input.
System and method to retrieve a secure message when a display of a mobile device is inaccessible
Systems and methods are described for providing a security code to a second device. A first device receives a textual representation of a security code that is required for authorization of a second device with a remote application server. The first device checks if the textual representation of the security code is accessed during a predefined time period. If not, the first device also checks if the second device is within an output range of the first device. If so, the first device outputs an audio representation of the security code.
Systems and Methods for Voice Based Audio and Text Alignment
The present disclosure relates to systems and methods for temporally aligning media elements. Example methods include providing an audio input waveform based on an audio input and receiving a text input. The example method also includes converting the text input to a text-to-speech input waveform and extracting, with an audio feature extractor, characteristic audio features from the audio input waveform and the text-to-speech input waveform. The example method yet further includes comparing audio input waveform features and text-to-speech waveform features and, based on the comparison, temporally aligning a displayed version of the text input with the audio input.
SYSTEM FOR DECISIONING RESOURCE USAGE BASED ON REAL TIME FEEDBACK
Embodiments of the invention are directed to systems, methods, and computer program products for advising users on resource decisioning based on real-time user feedback. The invention utilized advanced machine learning technology in order emulate the voice patterns of familiar figures and generate text-to-speech audio files containing relevant recommendations to one or more users as determined by their user resource account history or indicated preferences. The invention may further account for the user's response in resource usage patterns after the recommendation is provided via continuous monitoring of the user's resource usage history, and may use this data to adapt over time to learn which voices or emulations the user prefers.
ARTIFICIAL INTELLIGENCE (AI) LIFELIKE 3D CONVERSATIONAL CHATBOT
A 3D conversational chatbot is disclosed. The conversational chatbot is embodied in an avatar to provide a human-like experience for end-users. The chatbot is an artificial intelligence-based chatbot. The chatbot is configured with the knowledge of the chatbot owner. The knowledge may depend on the owner, such as the products and/or services provided by the owner. For example, the chatbot is customized with AI for the specific needs of its owner. The avatar communicates with the user, such as a customer, to answer questions with life-like speech and facial movement.
Communications network security for handling proxy voice calls
Concepts and technologies are disclosed herein for communications network security for handling proxy voice calls that employ a voicebot. According to one aspect disclosed herein, a call handling system can intercept, from a communications network, a call request that is directed to a called target device. The call handling system can determine that the call request was generated by a voicebot on behalf of a user equipment. The call handling system can suspend the call request from being routed to the called target device. The call handling system can generate a voicebot confirmation request that identifies the voicebot and the user equipment. The call handling system can provide the voicebot confirmation request to the called target. The call request can be suspended while the voicebot confirmation request is provided to the called target device.