Patent classifications
G06F40/194
METHOD AND SYSTEM FOR OBTAINING SIMILARITY RATES BETWEEN ELECTRONIC DOCUMENTS
A method is disclosed for calculating similarity rates between electronic documents. The similarity rate is calculated based on a count of matching phrases between the electronic documents and distances between subsequent matching phrases in each of the electronic documents. A system is also disclosed for comparing the electronic documents to obtain their similarity rates. A computing device determines at least one first proximity parameter based on the number of matched words in a matching phrase and at least one second proximity parameter based on distances between the subsequent matching phrases in each of the electronic documents. The similarity rate is determined based on the first and second proximity parameters.
METHOD AND SYSTEM FOR OBTAINING SIMILARITY RATES BETWEEN ELECTRONIC DOCUMENTS
A method is disclosed for calculating similarity rates between electronic documents. The similarity rate is calculated based on a count of matching phrases between the electronic documents and distances between subsequent matching phrases in each of the electronic documents. A system is also disclosed for comparing the electronic documents to obtain their similarity rates. A computing device determines at least one first proximity parameter based on the number of matched words in a matching phrase and at least one second proximity parameter based on distances between the subsequent matching phrases in each of the electronic documents. The similarity rate is determined based on the first and second proximity parameters.
AUTOMATIC HANDLING OF SECURITY DRIFT IN CLOUD ENVIRONMENTS
Security drift can be automatically handled in cloud environments. A security audit engine can be configured to extract security configuration datasets from cloud resources and create text sentences from the datasets as well as from a golden configuration. These text sentences can be encoded as vectors in an n-dimensional space. Probability distributions can then be generated using the vectors such as by using an unsupervised clustering algorithm. Distance matrixes can then be generated from the probability distributions. A probability distribution pertaining to a dataset and a probability distribution pertaining to the golden configuration can then be compared and normalized using a transport to thereby yield a security drift score representing a divergence of the corresponding security settings from the golden configuration. When a security drift score exceeds a threshold, the security audit engine can take appropriate action.
FINDING EXPRESSIONS IN TEXTS
A technique for presenting a text is disclosed. In the technique, a target text is obtained and a difference summary between the target text and a set of similar texts similar to the target text is prepared. The difference summary includes one or more variable text parts in the target text each varied in at least one text in the set of similar texts and a statistic of varying of each variable text part over the set of similar texts. The one or more variable text parts are marked in the target text based on the statistic of each variable text part. The target text is shown with the one or more variable text parts marked.
Accurate and efficient recording of user experience, GUI changes and user interaction events on a remote web document
The present disclosure describes how to capture events (e.g., changes and user interactions) of a Web document and combine those changes with the original DOM displayed to accurately and efficiently enable a replay engine to redisplay the DOM, changes, and user interactions which occurred within a user's browser. The data collected from a client-side HTML DOM capture engine can be combined with a minimal amount of contextual information to a replay engine so as to accurately and efficiently replay a session of a plurality of web documents.
Accurate and efficient recording of user experience, GUI changes and user interaction events on a remote web document
The present disclosure describes how to capture events (e.g., changes and user interactions) of a Web document and combine those changes with the original DOM displayed to accurately and efficiently enable a replay engine to redisplay the DOM, changes, and user interactions which occurred within a user's browser. The data collected from a client-side HTML DOM capture engine can be combined with a minimal amount of contextual information to a replay engine so as to accurately and efficiently replay a session of a plurality of web documents.
SYSTEM AND METHOD FOR COMPARING DOCUMENTS
The present invention relates to a system and a method for comparing information contained on at least two documents belonging to an entity. The present invention includes at least one device configured to receive information from at least one first document and at least one second document; then, compare at least one first document information and at least one second document information; and determine whether at least one second document contains at least one first document information. The present invention then outputs a result of whether the at least one second document contains at least one first document information.
SYSTEM AND METHOD FOR COMPARING DOCUMENTS
The present invention relates to a system and a method for comparing information contained on at least two documents belonging to an entity. The present invention includes at least one device configured to receive information from at least one first document and at least one second document; then, compare at least one first document information and at least one second document information; and determine whether at least one second document contains at least one first document information. The present invention then outputs a result of whether the at least one second document contains at least one first document information.
TEXT DUPLICATE CHECKING METHOD, ELECTRONIC DEVICE AND COMPUTER-READABLE STORAGE MEDIUM
Provided are a text duplicate checking method, an electronic device and a computer-readable storage medium. The method includes storing a fingerprint set and a corresponding text ID in a byte data manner to obtain a fingerprint library; acquiring a target text and creating a target fingerprint; obtaining a comparison fingerprint set from map memories according to the target fingerprint, and calculating a similarity between the target fingerprint and each comparison fingerprint in the comparison fingerprint set separately; and based on a determination result that a number of 1 s in binary values of one similarity is less than or equal to a preset value, querying a text ID corresponding to the one similarity, to implement duplicate checking of the target text.
TEXT DUPLICATE CHECKING METHOD, ELECTRONIC DEVICE AND COMPUTER-READABLE STORAGE MEDIUM
Provided are a text duplicate checking method, an electronic device and a computer-readable storage medium. The method includes storing a fingerprint set and a corresponding text ID in a byte data manner to obtain a fingerprint library; acquiring a target text and creating a target fingerprint; obtaining a comparison fingerprint set from map memories according to the target fingerprint, and calculating a similarity between the target fingerprint and each comparison fingerprint in the comparison fingerprint set separately; and based on a determination result that a number of 1 s in binary values of one similarity is less than or equal to a preset value, querying a text ID corresponding to the one similarity, to implement duplicate checking of the target text.