Patent classifications
G06F40/00
Template-based intelligent document processing method and apparatus
A blank template form generation method and system may employ synthetically generated blank template forms, differing from each other in one or more respects, to train a neural network to recognize relevant differences between otherwise similar forms, including types and locations of keywords and potential locations of values corresponding to the keywords. In an embodiment, filled or partly filled forms as well as blank template forms may be used later in training. Forms are input in pairs to identify differences between the two. Depending on the differences, weights of a neural network may be adjusted. After training, when a form is input into the system, whether the form is filled or blank, a blank template may be generated for future use.
Multi-turn dialogue response generation with autoregressive transformer models
Machine classifiers in accordance with embodiments of the invention capture long-term temporal dependencies in the dialogue data better than the existing RNN-based architectures. Additionally, machine classifiers may model the joint distribution of the context and response as opposed to the conditional distribution of the response given the context as employed in sequence-to-sequence frameworks. Machine classifiers in accordance with embodiments further append random paddings before and/or after the input data to reduce the syntactic redundancy in the input data, thereby improving the performance of the machine classifiers for a variety of dialogue-related tasks. The random padding of the input data may further provide regularization during the training of the machine classifier and/or reduce exposure bias. In a variety of embodiments, the input data may be encoded based on subword tokenization.
Form template matching to populate forms displayed by client devices
A server includes a memory and a processor to receive from a client device a screenshot of an application page from an application. The application page includes a form requiring data to be filled in by a user of the client device. A form template is extracted from the screenshot, with the extracted form template not including form field values. The extracted form template is compared to a private form template database for a match. The private form template database includes private form templates from different applications, with each private form template having form field values previously filled in for the user. Form field values from a matched private form template are provided to the client device for the client device to populate the form in the screenshot.
System and method for automatic detection of webpage zones of interest
A system and method for detecting webpage zones of interest. A method includes receiving at least one webpage analysis request, wherein the received at least one webpage analysis request includes at least one webpage in a website; identifying, in the at least one webpage, at least one zone, wherein the at least one zone is a content element of a webpage; classifying the at least one zone into a category of interest, wherein the classification is based on a trained machine learning model configured to classify DOM elements of the least one webpage, and wherein a category of interest is a category determined based on a functionality of the website; and storing the classification by indicating the category of interest for each zone.
METHOD AND SYSTEM FOR EVALUATING AND IMPROVING LIVE TRANSLATION CAPTIONING SYSTEMS
Methods, systems, and apparatus, including computer programs encoded on computer storage media for evaluating and improving live translation captioning systems. An exemplary method includes: displaying a word in a first language; receiving a first audio sequence, the first audio sequence comprising a verbal description of the word; generating a first translated text in a second language; displaying the first translated text; receiving a second audio sequence, the second audio sequence comprising a guessed word based on the first translated text; generating a second translated text in the first language; determining a matching score between the word and the second translated text; determining a performance score of the live translation captioning system based on the matching score.
Systems and methods for a visual interface for grid-based programs
The current disclosure provides techniques for visualizing text expressions in spreadsheet cells in a more intuitive and user friendly manner by mapping syntactic elements of the text expressions to two-dimensional (2D) configurations of 2D elements, and displaying the 2D configurations in a graphical user interface, wherein the syntactic relationships between syntactic elements in the text expressions are rendered as spatial relationships between the 2D elements in the 2D configuration. In one embodiment, a method for converting a text expression into a 2D configuration comprises selecting a spreadsheet cell based on input received from a user input device, wherein the spreadsheet cell comprises a text expression, parsing the text expression, using a logic subsystem, into at least a first syntactic element, mapping the first syntactic element, using the logic subsystem, to a first two-dimensional (2D) element, and displaying the first 2D element in a graphical user interface via a display subsystem.
Time Optimized Communications
A time optimizing communications system and method is provided because “loose lips sink ships”. Orders get “do by” parameters, “deliver by” times and may be broken into parts according to “do by” parameters, and/or by prioritization for delivery only when the recipient has the need-to-know. Time sensitive and most secret parts are communicated just in time, some data may be sent at randomized times that may bias traffic on communications infrastructure towards bandwidth optimization. Reducing risk of decryption by adversaries occurring quickly enough to frustrate the purposes of orders. Parts may be broken into data blocks and routed and/or stored randomly. An array of pointers records details of their creation and/or storage locations to provide a key for retrieving data blocks and/or reconstructing messages; timing is managed according to mission needs, and priorities. May also reduce peak demand on communications bandwidth.
Time Optimized Communications
A time optimizing communications system and method is provided because “loose lips sink ships”. Orders get “do by” parameters, “deliver by” times and may be broken into parts according to “do by” parameters, and/or by prioritization for delivery only when the recipient has the need-to-know. Time sensitive and most secret parts are communicated just in time, some data may be sent at randomized times that may bias traffic on communications infrastructure towards bandwidth optimization. Reducing risk of decryption by adversaries occurring quickly enough to frustrate the purposes of orders. Parts may be broken into data blocks and routed and/or stored randomly. An array of pointers records details of their creation and/or storage locations to provide a key for retrieving data blocks and/or reconstructing messages; timing is managed according to mission needs, and priorities. May also reduce peak demand on communications bandwidth.
Systems and methods for formatting informal utterances
Methods and systems are presented for translating informal utterances into formal texts. Informal utterances may include words in abbreviation forms or typographical errors. The informal utterances may be processed by mapping each word in an utterance into a well-defined token. The mapping from the words to the tokens may be based on a context associated with the utterance derived by analyzing the utterance in a character-by-character basis. The token that is mapped for each word can be one of a vocabulary token that corresponds to a formal word in a pre-defined word corpus, an unknown token that corresponds to an unknown word, or a masked token. Formal text may then be generated based on the mapped tokens. Through the processing of informal utterances using the techniques disclosed herein, the informal utterances are both normalized and sanitized.
Systems and Methods Involving a Hub Platform and Communication Network Configured for Processing Data Involving Time-Stamped/Time-Sensitive Aspects and/or Other Features
Systems and methods involving a hub platform, communication network, and memory configured for processing data involving time-stamped time-sensitive aspects and other features are disclosed. In one example, an illustrative system may comprise a hub computer platform and associated computing components configured to generate a plurality of portals including at least first and second portals, including aspects such as automatically updating information displayed therein in real-time between portals, automatically attaching and or processing timestamps and identifier information that are attached to orders upon receipt and acceptance thereof, automatically generating and or processing order book data, generating, updating and or interactively displaying various tabular and or graphical information such as order information that is automatically processed based on timestamps and or other inputs and data, and or generating other GUI features that, for example, may graphically display and automatically update level-of-involvement information.