G06F40/279

Speaker identity and content de-identification

One embodiment of the invention provides a method for speaker identity and content de-identification under privacy guarantees. The method comprises receiving input indicative of privacy protection levels to enforce, extracting features from a speech recorded in a voice recording, recognizing and extracting textual content from the speech, parsing the textual content to recognize privacy-sensitive personal information about an individual, generating de-identified textual content by anonymizing the personal information to an extent that satisfies the privacy protection levels and conceals the individual's identity, and mapping the de-identified textual content to a speaker who delivered the speech. The method further comprises generating a synthetic speaker identity based on other features that are dissimilar from the features to an extent that satisfies the privacy protection levels, and synthesizing a new speech waveform based on the synthetic speaker identity to deliver the de-identified textual content. The new speech waveform conceals the speaker's identity.
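The text side of this method (parsing content for personal information and anonymizing it to a chosen protection level) can be illustrated with a minimal sketch. The regex patterns, the category names, and the `low`/`high` level mapping below are illustrative assumptions, not details from the abstract; a real system would likely use a trained recognizer, and the speech synthesis steps are not covered here.

```python
import re

# Hypothetical patterns for privacy-sensitive information (assumption:
# simple regexes stand in for the recognizer described in the abstract).
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "PHONE": re.compile(r"\b\d{3}-\d{3}-\d{4}\b"),
    "DATE": re.compile(r"\b\d{4}-\d{2}-\d{2}\b"),
}

# Hypothetical mapping from a privacy protection level to the categories
# that must be anonymized at that level.
LEVELS = {
    "low": ["EMAIL"],
    "high": ["EMAIL", "PHONE", "DATE"],
}


def deidentify(text: str, level: str) -> str:
    """Replace every match of the categories required by `level` with a tag."""
    for category in LEVELS[level]:
        text = PATTERNS[category].sub(f"[{category}]", text)
    return text


sample = "Call 555-123-4567 or mail bob@example.com on 2023-02-14."
print(deidentify(sample, "low"))
print(deidentify(sample, "high"))
```

Raising the level only widens the set of categories that are replaced, which is one simple way to make the degree of anonymization depend on the requested protection level.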

System and method thereof for determining vendor's identity based on network analysis methodology
11580304 · 2023-02-14 ·

A system and method for classifying digital images are presented. The method includes extracting a plurality of descriptive data items of a transaction evidence from a digital image indicating a plurality of purchased items; searching a data source for informative data based on the extracted plurality of descriptive data items, wherein the informative data includes a price; determining a correlated amount for each of at least one of the plurality of descriptive data items, wherein the correlated amount determined for one of the descriptive data items defines a paid price for the descriptive data item; determining, based on at least one expense type classification rule, a primary expense type of the transaction evidence, wherein the at least one expense type classification rule is applied to the plurality of descriptive data items and each of the correlated amounts; and classifying the digital image based on the primary expense type.
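One possible expense type classification rule can be sketched as follows. The abstract does not state the rule itself, so this example assumes a hypothetical one: each item is mapped to an expense type through a lookup table, and the primary type is the one with the largest total paid amount. The table `EXPENSE_TYPES` and the function name are illustrative only.

```python
from collections import defaultdict

# Hypothetical lookup from an item description to an expense type.
EXPENSE_TYPES = {"coffee": "meals", "sandwich": "meals", "taxi": "transport"}


def primary_expense_type(items):
    """Return the expense type with the largest total correlated amount.

    `items` is a non-empty list of (description, paid_amount) pairs, an
    assumed representation of the extracted descriptive data items and
    their correlated amounts.
    """
    totals = defaultdict(float)
    for description, amount in items:
        expense_type = EXPENSE_TYPES.get(description.lower(), "other")
        totals[expense_type] += amount
    return max(totals, key=totals.get)


receipt = [("Coffee", 3.5), ("Taxi", 20.0), ("Sandwich", 7.0)]
print(primary_expense_type(receipt))
```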

Method and system for hybrid entity recognition

A hybrid entity recognition system and accompanying method identify composite entities based on machine learning. An input sentence is received and is preprocessed to remove extraneous information, perform spelling correction, and perform grammar correction to generate a cleaned input sentence. A POS tagger tags parts of speech of the cleaned input sentence. A rule-based entity recognizer module identifies first-level entities in the cleaned input sentence. The cleaned input sentence is converted and translated into numeric vectors. Basic and composite entities are extracted from the cleaned input sentence using the numeric vectors.

Automated malware analysis that automatically clusters sandbox reports of similar malware samples

A system and a method for automatically clustering sandbox analysis reports of similar malware samples are provided. An automated malware analysis process includes receiving, from a sandbox server, the sandbox analysis reports of the similar malware samples at an application programming interface (API) of the clustering server, clustering similar Uniform Resource Locators (URLs) together, and clustering the sandbox analysis reports of events into sandbox report clusters (1-n) based on the URL clustering, static properties of the malware samples, and dynamic properties of the malware samples.
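The URL clustering step can be sketched under simplifying assumptions. Here each report is assumed to be an (id, URL) pair, and two URLs are treated as similar when they share a host and a path once digit runs are masked. This similarity rule is an illustrative choice, not the one in the abstract, and the static and dynamic malware properties are not modeled.

```python
import re
from collections import defaultdict
from urllib.parse import urlsplit


def url_key(url: str) -> str:
    """Normalize a URL to host plus path, with digit runs replaced by 'N'."""
    parts = urlsplit(url)
    path = re.sub(r"\d+", "N", parts.path)
    return f"{parts.hostname}{path}"


def cluster_reports(reports):
    """Group report ids whose URLs share the same normalized key."""
    clusters = defaultdict(list)
    for report_id, url in reports:
        clusters[url_key(url)].append(report_id)
    return dict(clusters)


reports = [
    ("r1", "http://evil.example/payload/123.bin"),
    ("r2", "http://evil.example/payload/456.bin?x=1"),
    ("r3", "http://other.example/index.html"),
]
print(cluster_reports(reports))
```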

Method and device for keyword extraction and storage medium

A method and device for keyword extraction and a storage medium. The method includes receiving, at a terminal, an original document, acquiring, at the terminal, a candidate set by extracting at least one candidate phrase from the original document, acquiring, at the terminal, an association degree between the at least one candidate phrase in the candidate set and the original document, acquiring, at the terminal, a divergence degree of the at least one candidate phrase in the candidate set, and updating, at the terminal, a key phrase set of the original document by selecting the at least one candidate phrase from the candidate set as at least one key phrase based on the association degree and the divergence degree.
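A minimal sketch of selecting key phrases from both scores follows. The abstract does not define how the association degree or divergence degree is computed, so this example assumes simple stand-ins: association is the normalized frequency of the phrase's words in the document, and divergence is one minus the highest Jaccard word overlap with the phrases already selected. The greedy loop and the `weight` parameter are also assumptions.

```python
def tokens(text: str):
    return text.lower().split()


def association(phrase: str, document: str) -> float:
    """Assumed association degree: normalized frequency of phrase words."""
    doc = tokens(document)
    words = tokens(phrase)
    return sum(doc.count(w) for w in words) / (len(doc) * len(words))


def divergence(phrase: str, selected) -> float:
    """Assumed divergence degree: 1 minus max Jaccard overlap with selected."""
    if not selected:
        return 1.0
    a = set(tokens(phrase))
    overlaps = [len(a & set(tokens(s))) / len(a | set(tokens(s))) for s in selected]
    return 1.0 - max(overlaps)


def select_key_phrases(document: str, candidates, k: int, weight: float = 0.5):
    """Greedily pick k phrases, balancing association against divergence."""
    selected = []
    remaining = list(candidates)
    while remaining and len(selected) < k:
        best = max(
            remaining,
            key=lambda p: weight * association(p, document)
            + (1 - weight) * divergence(p, selected),
        )
        selected.append(best)
        remaining.remove(best)
    return selected


doc = "neural network training uses gradient descent and neural network pruning"
print(select_key_phrases(doc, ["neural network", "network pruning", "gradient descent"], k=2))
```

With these stand-in scores, a candidate that overlaps an already chosen phrase is penalized, so the selected set tends to cover different parts of the document.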

Selectively activating a resource by detecting emotions through context analysis

A method selectively activates a resource to accommodate an advanced emotion. A supervisor computer receives a first piece of content, and then applies an emotion classifier to the first piece of content in order to create a first concept/emotion/sentiment/time tuple. The supervisor computer creates a second concept/emotion/sentiment/time tuple for a second piece of content, and compares the first and second tuples. If the concept in the first piece of content matches the concept in the second piece of content, but at least one of the emotion, sentiment, and time of the first piece of content does not match the emotion, sentiment, and time of the second piece of content, the supervisor computer determines that the emotion of the second piece of content is an advanced emotion that is not expressed by the first or second pieces of content, and activates a resource that accommodates the advanced emotion.
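The tuple comparison described above can be sketched as a small predicate. The field types (plain strings) and the names `ContentTuple` and `indicates_advanced_emotion` are assumptions for illustration; the emotion classifier that produces the tuples and the resource activation are outside this sketch.

```python
from typing import NamedTuple


class ContentTuple(NamedTuple):
    concept: str
    emotion: str
    sentiment: str
    time: str  # assumed to be a simple label such as a date string


def indicates_advanced_emotion(first: ContentTuple, second: ContentTuple) -> bool:
    """True when the concepts match but emotion, sentiment, or time differs."""
    same_concept = first.concept == second.concept
    other_fields_differ = first[1:] != second[1:]
    return same_concept and other_fields_differ


a = ContentTuple("layoffs", "fear", "negative", "2023-01")
b = ContentTuple("layoffs", "anger", "negative", "2023-02")
print(indicates_advanced_emotion(a, b))
```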

NATURAL LANGUAGE PROCESSING COMPREHENSION AND RESPONSE SYSTEM AND METHODS
20230044048 · 2023-02-09 ·

An automatic, system-generated, multi-faceted comprehension and response capability, using Natural Language Processing, to provide value-specific answers from available unstructured data, documents and text. Questions and queries are interpreted by the system's capability to determine the type of questions and provide a response or answer based on the data or information available. If the answer is in the ingested data, a response is provided that is one of: a list of documents, a list of document snippets with the answer contained in the snippets, a formalized and templated response, or a highly relevant hand-curated response.

QUESTION-AND-ANSWER PROCESSING METHOD, ELECTRONIC DEVICE AND COMPUTER READABLE MEDIUM
20230039496 · 2023-02-09 ·

The embodiment of the present disclosure provides a question-and-answer processing method, including: acquiring a to-be-answered question; determining standard questions meeting a preset condition as a plurality of candidate standard questions, from a plurality of preset standard questions, according to a text similarity with the to-be-answered question, based on a text statistical algorithm; determining a candidate standard question with the highest semantic similarity with the to-be-answered question as a matching standard question, from the plurality of candidate standard questions, based on a deep text matching algorithm; and determining an answer to the to-be-answered question at least according to the matching standard question. The embodiment of the present disclosure also provides an electronic device and a computer readable medium.
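The two-stage matching can be sketched as a filter followed by a re-ranker. The abstract names neither algorithm, so this example makes assumptions: Jaccard word overlap stands in for the text statistical algorithm, a fixed threshold stands in for the preset condition, and `placeholder_semantic_score` (character-level matching from the standard library) stands in for the deep text matching model, which a real system would replace with a trained model.

```python
from difflib import SequenceMatcher
from typing import Callable, Dict, Optional


def jaccard(a: str, b: str) -> float:
    """Stage 1 stand-in: word-overlap similarity between two questions."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0


def placeholder_semantic_score(a: str, b: str) -> float:
    """Stage 2 stand-in; NOT a deep model, only a placeholder for one."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def answer(
    question: str,
    faq: Dict[str, str],
    threshold: float = 0.2,
    semantic_score: Callable[[str, str], float] = placeholder_semantic_score,
) -> Optional[str]:
    """Filter standard questions by text similarity, then pick the best match."""
    candidates = [q for q in faq if jaccard(question, q) >= threshold]
    if not candidates:
        return None
    match = max(candidates, key=lambda q: semantic_score(question, q))
    return faq[match]


faq = {
    "how do i reset my password": "Use the reset link.",
    "how do i change my email": "Open account settings.",
    "what are your opening hours": "9 to 5.",
}
print(answer("how do i reset my password please", faq))
```

Running the cheap statistical filter first keeps the more expensive semantic scorer limited to a small candidate set.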