G06F40/194

MULTI-MODEL APPROACH TO NATURAL LANGUAGE PROCESSING AND RECOMMENDATION GENERATION
20230046851 · 2023-02-16

In some implementations, a device may monitor a set of data sources to generate a set of language models corresponding to the set of data sources. The device may determine a plurality of sets of keyword groups. The device may generate a plurality of sets of skill catalogs. The device may receive a source document for processing. The device may process the source document to extract a key phrase set and to determine a first similarity distance. The device may select a corresponding skill catalog and an associated language model based on a relevancy value. The device may determine second similarity distances between the source document and one or more target documents using the corresponding skill catalog and the associated language model. The device may output information associated with one or more target documents based at least in part on the second similarity distances.

Identifying similar documents in a file repository using unique document signatures
11593439 · 2023-02-28

Methods, systems, and non-transitory computer readable storage media are disclosed for determining clusters of similar digital documents using unique document signatures. Specifically, the disclosed system processes digital text in a digital document to tokenize character strings (e.g., words) in the digital document by combining a subset of character values and string lengths in the character strings. Additionally, the disclosed system generates a document signature for the digital document by combining subsets of tokens generated for the digital document into a token sequence indicative of the digital text in the digital document. The disclosed system determines a cluster of similar digital documents including the digital document by comparing the document signature of the digital document to document signatures corresponding to a plurality of digital documents.
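The signature idea in this abstract can be illustrated with a minimal sketch. The concrete choices here are assumptions made for illustration and are not taken from the patent: each word's token is its first character, last character, and length; the signature keeps every second token; signatures are compared with Jaccard similarity; and clustering is a simple greedy pass. All function names are hypothetical.

```python
from __future__ import annotations


def tokenize_word(word: str) -> str:
    """Build a token from a subset of character values plus the string length.

    Assumed scheme: first character + last character + word length.
    """
    w = word.lower()
    return f"{w[0]}{w[-1]}{len(w)}"


def document_signature(text: str, every: int = 2) -> tuple[str, ...]:
    """Combine a subset of the tokens (every `every`-th one) into an ordered sequence."""
    words = [w for w in (raw.strip(".,;:!?()\"'") for raw in text.split()) if w]
    tokens = [tokenize_word(w) for w in words]
    return tuple(tokens[::every])


def signature_similarity(a: tuple[str, ...], b: tuple[str, ...]) -> float:
    """Jaccard similarity over the distinct tokens of two signatures (an assumed metric)."""
    sa, sb = set(a), set(b)
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)


def cluster_documents(docs: dict[str, str], threshold: float = 0.5) -> list[list[str]]:
    """Greedy clustering: a document joins the first cluster whose
    representative signature is at least `threshold` similar to its own."""
    clusters: list[tuple[tuple[str, ...], list[str]]] = []
    for name, text in docs.items():
        sig = document_signature(text)
        for representative, members in clusters:
            if signature_similarity(representative, sig) >= threshold:
                members.append(name)
                break
        else:
            clusters.append((sig, [name]))
    return [members for _, members in clusters]


docs = {
    "a": "The quick brown fox jumps over the lazy dog",
    "b": "The quick brown fox jumps over the lazy cat",
    "c": "Quarterly revenue grew sharply in Europe",
}
print(cluster_documents(docs))
```

Because the signature is far shorter than the text, comparing signatures is cheap; the trade-off is that unrelated words can collide on the same token.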

SYSTEM FOR COMPARISON AND MERGING OF VERSIONS IN EDITED WEBSITES AND INTERACTIVE APPLICATIONS

A device implementable in a website design program includes a component based version comparer to compare at least two versions of a website, the at least two versions having components in a set of hierarchies and to generate a difference tree representing the differences in the components between the at least two versions of the same website, where the version comparer preprocesses the components in each single version of the at least two versions of the website to determine at least one of internal geometric, semantic, content and attribute relationships before comparing the components and the relationships between the at least two versions of the website using at least one of semantic, geometrical, content and attribute analysis. The device also includes a version merger to create an integrated version of the two versions of the website based on the difference tree.
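A difference tree over two component hierarchies might be sketched as follows. This is an illustrative simplification under stated assumptions: components are plain dictionaries with hypothetical `id`, `attrs`, and `children` keys, and components are matched across versions by `id` only, whereas the abstract describes matching through semantic, geometric, content, and attribute analysis.

```python
from __future__ import annotations


def diff_components(old: dict | None, new: dict | None) -> dict:
    """Return a difference-tree node comparing two component subtrees.

    Assumption: a component in one version corresponds to the component
    with the same "id" under the same parent in the other version.
    """
    if old is None:
        return {"id": new["id"], "status": "added", "children": []}
    if new is None:
        return {"id": old["id"], "status": "removed", "children": []}

    old_attrs = old.get("attrs", {})
    new_attrs = new.get("attrs", {})
    changed = sorted(
        key
        for key in old_attrs.keys() | new_attrs.keys()
        if old_attrs.get(key) != new_attrs.get(key)
    )

    old_kids = {child["id"]: child for child in old.get("children", [])}
    new_kids = {child["id"]: child for child in new.get("children", [])}
    # Keep the old ordering first, then append components that only exist in the new version.
    ids = list(old_kids) + [i for i in new_kids if i not in old_kids]

    return {
        "id": old["id"],
        "status": "changed" if changed else "same",
        "changed_attrs": changed,
        "children": [diff_components(old_kids.get(i), new_kids.get(i)) for i in ids],
    }


v1 = {"id": "page", "attrs": {"theme": "light"}, "children": [
    {"id": "header", "attrs": {"text": "Hi"}, "children": []},
    {"id": "footer", "attrs": {}, "children": []},
]}
v2 = {"id": "page", "attrs": {"theme": "light"}, "children": [
    {"id": "header", "attrs": {"text": "Hello"}, "children": []},
    {"id": "sidebar", "attrs": {}, "children": []},
]}
difference_tree = diff_components(v1, v2)
for child in difference_tree["children"]:
    print(child["id"], child["status"])
```

A merger could then walk this tree and decide, node by node, which version's component to keep.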

Method and system for suggesting revisions to an electronic document

A method for suggesting revisions to a document-under-analysis from a seed database, the seed database including a plurality of original texts each respectively associated with one of a plurality of final texts, the method for suggesting revisions including selecting a statement-under-analysis (“SUA”), selecting a first original text of the plurality of original texts, determining a first edit-type classification of the first original text with respect to its associated final text, generating a first similarity score for the first original text based on the first edit-type classification, the first similarity score representing a degree of similarity between the SUA and the first original text, selecting a second original text of the plurality of original texts, determining a second edit-type classification of the second original text with respect to its associated final text, generating a second similarity score for the second original text based on the second edit-type classification, the second similarity score representing a degree of similarity between the SUA and the second original text, selecting a candidate original text from one of the first original text and the second original text, and creating an edited SUA (“ESUA”) by modifying a copy of the first SUA consistent with a first candidate final text associated with the first candidate original text.

Text Analysis System, and Characteristic Evaluation System for Message Exchange Using the Same
20230237258 · 2023-07-27

Aspects of this disclosure provide a device, system, and method for analyzing text. In an embodiment, a system is configured to convert characters of the text into a numerical time series signal. The numerical time series signal includes a time series conversion of the characters in numerical format. The system is further configured to generate a waveform with extracted information from the numerical time series signal. The extracted information has features based on politeness in language, a quantifiable use of punctuation, a quantifiable use of conjunctions, use of idioms, or a combination thereof. The system is additionally configured to determine whether the text is written by a specific user based on an analysis of the waveform against a threshold.
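As a rough illustration of the pipeline in this abstract, the sketch below converts text to a numeric series of Unicode code points, extracts two of the named feature types (punctuation use and conjunction use), and compares them with a stored author profile against a threshold. The conjunction list, the feature definitions, the distance measure, and all function names are assumptions; the waveform generation step is not modeled.

```python
import string

# Assumed, deliberately small conjunction list for illustration only.
CONJUNCTIONS = {"and", "but", "or", "so", "because", "although"}


def to_time_series(text: str) -> list[int]:
    """Convert each character to a number (its Unicode code point), in order."""
    return [ord(ch) for ch in text]


def extract_features(text: str) -> dict[str, float]:
    """Compute punctuation and conjunction rates from the text."""
    series = to_time_series(text)
    words = [w.strip(string.punctuation).lower() for w in text.split()]
    words = [w for w in words if w]
    punct = sum(1 for code in series if chr(code) in string.punctuation)
    conj = sum(1 for w in words if w in CONJUNCTIONS)
    return {
        "punct_rate": punct / len(series) if series else 0.0,
        "conj_rate": conj / len(words) if words else 0.0,
    }


def written_by(text: str, profile: dict[str, float], threshold: float = 0.05) -> bool:
    """Return True when every feature is within `threshold` of the author's profile."""
    features = extract_features(text)
    distance = max(abs(features[key] - profile[key]) for key in profile)
    return distance <= threshold


sample = "Hi, there! Well, and so it goes."
profile = extract_features(sample)  # hypothetical stored profile for one user
print(written_by(sample, profile))
print(written_by("no punctuation here at all", profile))
```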

NAVIGATING BUILDING PLANS
20230025398 · 2023-01-26

Embodiments of the present disclosure relate generally to modifying and navigating building plans. In particular, one or more embodiments relate to a system that improves the ease and efficiency with which a user, through a client device, may view and use building plans composed of a plurality of content items. The system may receive a plurality of content items associated with a building plan, combine sets of two or more content items representing a substructure, form locational links between the plurality of content items, and provide a navigational interface for display on a client device associated with a user through which the client device may navigate through the plurality of content items of the building plan. Further, alternate content items may be received and overlaid on corresponding content items. Any differences between the corresponding content items may be highlighted.

METHOD AND APPARATUS FOR CONTRACT ANALYSIS

A method is provided comprising: obtaining a counterparty contract, the counterparty contract including a contract that is being proposed by a counterparty to a user; performing a segmentation of the counterparty contract to identify a plurality of sentence clusters, each of the sentence clusters corresponding to a different provision in the counterparty contract; generating a plurality of counterparty provision vectors based on the counterparty contract, each of the counterparty provision vectors being generated based on a different one of the plurality of sentence clusters; retrieving a user provision vector, the user provision vector corresponding to a user provision; calculating a plurality of similarity scores for the user provision vector; detecting whether the plurality of similarity scores satisfies a condition that is associated with the user provision; and outputting a notification associated with the user provision when the condition is satisfied.
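The claimed steps can be sketched in a few lines. The details below are assumptions chosen for illustration: provisions are segmented on blank lines, provision vectors are simple word-count vectors, similarity is cosine similarity, and the condition tested is "no counterparty provision reaches the threshold", which would suggest that the user's provision is missing from the counterparty contract. The function names are hypothetical.

```python
import math
import re
from collections import Counter


def segment(contract: str) -> list[str]:
    """Assumed segmentation: one provision per blank-line-separated paragraph."""
    return [part.strip() for part in re.split(r"\n\s*\n", contract) if part.strip()]


def provision_vector(text: str) -> Counter:
    """Bag-of-words vector; a stand-in for whatever embedding a real system uses."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def check_user_provision(user_provision: str, contract: str, threshold: float = 0.5):
    """Score the user provision against every counterparty provision.

    Returns (scores, notify), where notify is True when no counterparty
    provision is at least `threshold` similar (the assumed condition).
    """
    user_vec = provision_vector(user_provision)
    scores = [cosine(user_vec, provision_vector(p)) for p in segment(contract)]
    notify = all(score < threshold for score in scores)
    return scores, notify


contract = (
    "The supplier shall deliver goods within thirty days.\n\n"
    "Payment is due within sixty days of invoice."
)
scores, notify = check_user_provision(
    "Each party shall keep confidential information secret.", contract
)
if notify:
    print("Notification: no matching confidentiality provision found.")
```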

ACCURATE AND EFFICIENT RECORDING OF USER EXPERIENCE, GUI CHANGES AND USER INTERACTION EVENTS ON A REMOTE WEB DOCUMENT

The present disclosure describes how to capture events (e.g., changes and user interactions) of a document and combine those changes with the originally displayed tree data structure, so that a replay engine can accurately and efficiently redisplay the tree data structure, changes, and user interactions that occurred at the client device. The data collected from a client-side capture engine can be combined with a minimal amount of contextual information and passed to a replay engine so as to accurately and efficiently replay a session of a plurality of documents.
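The capture-and-replay idea can be illustrated with a minimal sketch: keep one snapshot of the initial tree, record only small change events, and rebuild each later state by applying the events in time order. The event format (`t`, `type`, `path`, and so on) and the function names are hypothetical and are not taken from the disclosure.

```python
import copy


def apply_event(tree: dict, event: dict) -> None:
    """Apply one captured event to the tree in place.

    `path` is a list of child indexes leading from the root to the target node.
    """
    node = tree
    for index in event["path"]:
        node = node["children"][index]
    if event["type"] == "set_attr":
        node["attrs"][event["name"]] = event["value"]
    elif event["type"] == "add_child":
        node["children"].append(copy.deepcopy(event["node"]))
    elif event["type"] == "remove_child":
        del node["children"][event["index"]]
    else:
        raise ValueError(f"unknown event type: {event['type']}")


def replay(initial_tree: dict, events: list[dict]) -> list[tuple[int, dict]]:
    """Rebuild the tree state after each event, ordered by timestamp `t`.

    The initial snapshot is copied, so the caller's tree is left untouched.
    """
    tree = copy.deepcopy(initial_tree)
    frames = []
    for event in sorted(events, key=lambda e: e["t"]):
        apply_event(tree, event)
        frames.append((event["t"], copy.deepcopy(tree)))
    return frames


initial = {"tag": "body", "attrs": {}, "children": [
    {"tag": "input", "attrs": {"value": ""}, "children": []},
]}
events = [
    {"t": 2, "type": "add_child", "path": [],
     "node": {"tag": "p", "attrs": {}, "children": []}},
    {"t": 1, "type": "set_attr", "path": [0], "name": "value", "value": "hi"},
]
for timestamp, state in replay(initial, events):
    print(timestamp, [child["tag"] for child in state["children"]])
```

Sending only the initial snapshot plus small events keeps the captured data compact compared with sending a full copy of the tree after every change.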