G06F40/221

Systems and methods for classification of data streams

Disclosed herein are embodiments of systems, methods, and apparatus that execute classification techniques to enable high-quality analysis of ingest data by interpreting and categorizing disparate data points of the ingest data. The execution of the classification techniques leads to isolation of intrinsic properties of each data point to represent the essence of what the overall ingest data indicates. The classification techniques further enable classification of the ingest data that is unencumbered by any ingest data format changes, such as ordering of data components, encoding, or properties associated with the ingest data that are likely to change without altering the meaning conveyed by the ingest data.

SYSTEMS AND METHODS FOR CONVERSION OF DOCUMENTS TO REUSABLE CONTENT TYPES
20230196004 · 2023-06-22 ·

Embodiments of systems and methods for the conversion of documents to reusable content types are disclosed herein. Embodiments may extract the content and metadata of the original document and identify a set of reusable resources from the content and metadata. These reusable resources can each be one of a set of content types common across a plurality of document authoring platforms. Each of the content types may be represented using a content type object associated with that content type. The reusable resources identified by the parsing of the content and metadata of the original document may thus be represented with corresponding reusable objects in a content type format common to a plurality of document authoring tools.

METHOD AND APPARATUS FOR PREVENTING INJECTION-TYPE ATTACK IN WEB-BASED OPERATING SYSTEM

The present disclosure relates to a communication technique for fusing a 5G communication system for supporting a high data transmission rate after a 4G system with the IoT technology, and a system thereof. The present disclosure can be applied to an intelligent service (e.g., a smart home, a smart building, a smart city, a smart car or connected car, healthcare, digital education, retail business, security and safety related service, etc.) based on the 5G communication technology and the IoT related technology. In accordance with an embodiment of the present disclosure, a method for detecting a malicious code which is injected into the command stream of a widget running on a web-based OS in a device by a web server in a wireless communication system is provided. The method includes: analyzing the widget in the web server; determining at least one invariant condition constantly maintained and conserved while the widget is running, on the basis of a result of the analyzing; generating a metadata file including data satisfying the at least one invariant condition; and associating the metadata file with the widget and providing the widget in a state in which the associated metadata file is included in the widget.

METHODS AND SYSTEMS FOR REDIRECTING A USER FROM A THIRD PARTY WEBSITE TO A PROVIDER WEBSITE

Disclosed are methods, systems, and non-transitory computer-readable media for redirecting a user. For instance, the method may include: determining whether there is a presence of one or a combination of: a particular webpage of a third party website and particular DOM element(s); performing a first DOM analysis on the particular webpage to extract an entity and first data from at least one of the particular DOM element(s); determining whether an entity website is mapped based on the entity and a mapping of entities to entity websites; performing a navigation process to interact with the entity website and extract second data; and performing a comparison analysis on the first data and the second data to determine whether at least one difference is present.

Advertisement Filtering Method and Device
20170351644 · 2017-12-07 ·

An advertisement filtering method and device. The method comprises: accessing a web page by using a browser, acquiring a selector of an advertisement element according to a domain name of the web page, and adding a rule statement for hiding the advertisement element after the selector to generate a CSS style of a specific category (S10); injecting the CSS style of the specific category into the browser (S11); setting a cascading priority of the CSS style of the specific category to a highest cascading priority (S12); and performing, by the browser, cascading on the CSS styles in order of cascading priority so that the CSS style of the specific category takes effect (S13). In this way, even when a page author uses a technique to counter advertisement filtering, an advertisement from the author can still be effectively filtered.
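
Step S10 can be illustrated with a minimal Python sketch. The selector table, the domain name, and the function name `build_hiding_css` are hypothetical and are not taken from the abstract. The `!important` flag only stands in for the priority step (S12); the abstract does not say how the browser sets the cascading priority.

```python
# Hypothetical per-domain table of advertisement selectors (illustrative only).
AD_SELECTORS = {
    "example.com": [".ad-banner", "#sponsored"],
}


def build_hiding_css(domain, selector_table):
    """Return a CSS rule that hides every known ad element for a domain.

    The hiding rule statement is appended after the selector list, as in S10.
    An empty string is returned when no selectors are known for the domain.
    """
    selectors = selector_table.get(domain, [])
    if not selectors:
        return ""
    # "!important" is an assumption standing in for "highest cascading priority".
    return ", ".join(selectors) + " { display: none !important; }"


if __name__ == "__main__":
    print(build_hiding_css("example.com", AD_SELECTORS))
```

Injecting the generated rule into the page (S11) depends on the browser and is left out of the sketch.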

CLASSIFICATION CODE PARSER
20230186027 · 2023-06-15 ·

A classification code parser and method can include: reading a classification code having a description; reading a required keyword, and a total number of keywords associated with the classification code; reading text of a note; tokenizing the text of the note to create a note token stream, the note token stream having a note token and a position of the note token within the note token stream; creating a keyword map including a total number of matched keywords; determining a match ratio from the total number of the matched keywords and the total number of the keywords; determining a proximity factor based on a shortest span of tokens within the note token stream containing all the matched keywords; and determining a strength of a match between the classification code and the note based on the match ratio being multiplied by the proximity factor.
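
The scoring steps listed in this abstract can be sketched in Python as follows. The abstract does not give a formula for the proximity factor, so the sketch assumes it is the number of matched keywords divided by the length of the shortest token span containing all of them. It also assumes single-word keywords and a score of 0.0 when the required keyword is absent. All function names are hypothetical.

```python
import re


def tokenize(text):
    """Split note text into a stream of (token, position) pairs."""
    return [(tok, pos) for pos, tok in enumerate(re.findall(r"[a-z0-9]+", text.lower()))]


def shortest_span(keyword_map):
    """Length of the smallest token window containing every matched keyword."""
    events = sorted((pos, kw) for kw, positions in keyword_map.items() for pos in positions)
    need, counts, best, left = len(keyword_map), {}, None, 0
    for right_pos, kw in events:
        counts[kw] = counts.get(kw, 0) + 1
        # Shrink the window from the left while it still covers all keywords.
        while len(counts) == need:
            left_pos, left_kw = events[left]
            span = right_pos - left_pos + 1
            if best is None or span < best:
                best = span
            counts[left_kw] -= 1
            if counts[left_kw] == 0:
                del counts[left_kw]
            left += 1
    return best


def match_strength(keywords, required, note):
    """Match ratio multiplied by an assumed proximity factor."""
    keyword_map = {}
    for tok, pos in tokenize(note):
        if tok in keywords:
            keyword_map.setdefault(tok, []).append(pos)
    if required not in keyword_map:
        return 0.0  # assumption: a missing required keyword means no match
    match_ratio = len(keyword_map) / len(keywords)
    proximity = len(keyword_map) / shortest_span(keyword_map)  # assumed formula
    return match_ratio * proximity


if __name__ == "__main__":
    note = "Patient has a fracture of the left femur after fall"
    print(match_strength(["fracture", "left", "femur"], "fracture", note))
```

Under these assumptions, the proximity factor is 1.0 when the matched keywords are adjacent and shrinks as they spread further apart in the note.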

COMPUTER READABLE RECORDING MEDIUM, EXTRACTION APPARATUS, AND EXTRACTION METHOD

An extraction apparatus includes an input-data analysis unit that, when an extraction process is performed on input data containing a plurality of XBRL files using a combination of a plurality of extraction criteria, each of the extraction criteria directly specifying an element and an aspect of each of the plurality of XBRL files, calculates, from the input data, distribution information containing distribution of values of individual aspects of a plurality of elements that are individually provided by the plurality of XBRL files, and an application-sequence determining unit that determines an application sequence of the plurality of extraction criteria by referring to the calculated distribution information. Hence, the extraction apparatus can extract XBRL data pieces containing data items to be validated against a validation rule from the input data rapidly.

Browser extension for field detection and automatic population

Methods and systems for a browser extension application are disclosed. In some embodiments, a browser extension application is configured to receive from a browser extension server a regular expression configured to detect a plurality of fields in a web page, execute the regular expression to detect a transaction field in the web page, and automatically populate the transaction field with stored data. The application is further configured to detect an unrecognized field in the web page, provide suggested transaction data, and detect manual population of the unrecognized field with the suggested transaction data. The application is further configured to provide to the browser extension server an indication of the unrecognized field and receive from the browser extension server an updated regular expression configured to detect the unrecognized field in the web page.
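
The field detection step can be sketched in Python as follows. The regular expression, the field names, and the stored values are all hypothetical, since the abstract does not specify them. The sketch treats each field's name attribute as the text the expression is run against, which is also an assumption.

```python
import re

# Hypothetical expression of the kind a server might supply; each named
# group identifies one kind of transaction field.
FIELD_PATTERN = re.compile(
    r"(?P<card_number>card[-_]?(?:number|num|no))"
    r"|(?P<expiry>exp(?:iry|iration)?[-_]?(?:date|month|year))"
    r"|(?P<cvv>cvv|cvc)",
    re.IGNORECASE,
)


def classify_fields(field_names, pattern):
    """Split field names into recognized {name: kind} and unrecognized names."""
    recognized, unrecognized = {}, []
    for name in field_names:
        match = pattern.search(name)
        if match:
            recognized[name] = match.lastgroup  # name of the group that matched
        else:
            unrecognized.append(name)
    return recognized, unrecognized


def populate(recognized, stored):
    """Map each recognized field to its stored value, where one exists."""
    return {name: stored[kind] for name, kind in recognized.items() if kind in stored}


if __name__ == "__main__":
    fields = ["cardNumber", "cc-exp-date", "cvv", "promo_code"]
    recognized, unrecognized = classify_fields(fields, FIELD_PATTERN)
    print(populate(recognized, {"card_number": "4111111111111111", "cvv": "123"}))
    print(unrecognized)  # candidates to report back to the server
```

In this sketch the unrecognized names are what the application would report to the server, which could then return an updated expression with an additional named group.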