Patent classifications
G06F40/117
EXTRACTIVE METHOD FOR SPEAKER IDENTIFICATION IN TEXTS WITH SELF-TRAINING
A method, computer program, and computer system is provided for identifying a speaker in at text based work. Labeled and unlabeled instances corresponding to one or more speakers are extracted. Pseudo-labels are inferred for the extracted unlabeled instances based on the labeled instances. One or more of the unlabeled instances are labeled based on the inferred pseudo-labels.
Quality-aware data interfaces
A set of unstructured data is analyzed to infer structural elements from the unstructured data, and quantized data quality levels, indicative of data quality in the structural elements, are assigned to the structural elements. A set of structured data is generated to include the structural elements inferred from the unstructured data and associations between respective ones of the structural elements in the set of structured data and the corresponding quantized quality levels assigned to the structural elements. The set of structured data, including the associations between respective ones of the structural elements and the corresponding quantized quality levels assigned to the structural elements, is provided to a user interface application to enable the user interface application to visually display varying data qualities in the set of structured data.
Automated honeypot creation within a network
Systems and methods for managing Application Programming Interfaces (APIs) are disclosed. Systems may involve automatically generating a honeypot. For example, the system may include one or more memory units storing instructions and one or more processors configured to execute the instructions to perform operations. The operations may include receiving, from a client device, a call to an API node and classifying the call as unauthorized. The operation may include sending the call to a node-imitating model associated with the API node and receiving, from the node-imitating model, synthetic node output data. The operations may include sending a notification based on the synthetic node output data to the client device.
Automated honeypot creation within a network
Systems and methods for managing Application Programming Interfaces (APIs) are disclosed. Systems may involve automatically generating a honeypot. For example, the system may include one or more memory units storing instructions and one or more processors configured to execute the instructions to perform operations. The operations may include receiving, from a client device, a call to an API node and classifying the call as unauthorized. The operation may include sending the call to a node-imitating model associated with the API node and receiving, from the node-imitating model, synthetic node output data. The operations may include sending a notification based on the synthetic node output data to the client device.
Intelligent text annotation
Text is intelligently annotated by first creating a topic map summarizing topics of interest of the user. A data structure is created. The topic map is used to create two linked user dictionaries, a topic dictionary reflecting topic names and a traversal dictionary reflecting the knowledge structure of a topic. Actions may be linked with topic types. When the text to be annotated is being read, the topic data structure of the topics found in the text are automatically instantiated using the dictionaries and any actions previously linked to topic types. Instantiated topic data structures are automatically attached to the text being annotated. A user GUI may be created to allow the user to access and interact with the text annotations.
Intelligent text annotation
Text is intelligently annotated by first creating a topic map summarizing topics of interest of the user. A data structure is created. The topic map is used to create two linked user dictionaries, a topic dictionary reflecting topic names and a traversal dictionary reflecting the knowledge structure of a topic. Actions may be linked with topic types. When the text to be annotated is being read, the topic data structure of the topics found in the text are automatically instantiated using the dictionaries and any actions previously linked to topic types. Instantiated topic data structures are automatically attached to the text being annotated. A user GUI may be created to allow the user to access and interact with the text annotations.
Combinatorial inflight analysis of multipart data
A method of enhanced data orchestration (EDO). The method comprises receiving a first message by a mediation application executing on a computer system and analyzing the first message by the mediation application based at least in part on invoking a machine learning (ML) model by the mediation application. The analyzing determines that a feature of the first message is a probable first component of an item of personally identifiable information (PII). The method further comprises receiving a second message by the mediation application, determining by the mediation application that a feature of the second message when combined with the feature of the first message constitutes an item of PII, and treating the first message and the second message in accordance with predefined PII handling protocols by the mediation application.
Combinatorial inflight analysis of multipart data
A method of enhanced data orchestration (EDO). The method comprises receiving a first message by a mediation application executing on a computer system and analyzing the first message by the mediation application based at least in part on invoking a machine learning (ML) model by the mediation application. The analyzing determines that a feature of the first message is a probable first component of an item of personally identifiable information (PII). The method further comprises receiving a second message by the mediation application, determining by the mediation application that a feature of the second message when combined with the feature of the first message constitutes an item of PII, and treating the first message and the second message in accordance with predefined PII handling protocols by the mediation application.
Content extraction system
A system includes a content extraction engine comprising at least one processor and configured to receive a content page for a target product including product data for the target product and noise content unrelated to the target product, identify noise content pertaining to data unrelated to the target product, remove noise content from the content page, thereby generating a remainder content page containing target product data usable to enable product comparison between multiple sources.
Content extraction system
A system includes a content extraction engine comprising at least one processor and configured to receive a content page for a target product including product data for the target product and noise content unrelated to the target product, identify noise content pertaining to data unrelated to the target product, remove noise content from the content page, thereby generating a remainder content page containing target product data usable to enable product comparison between multiple sources.