Patent classifications
G06F40/289
DOCUMENT SPLITTING TOOL
Various embodiments disclosed relate to automated docketing of incoming electronic communications and documents. The present disclosure includes methods and systems for identifying omnibus documents containing more than one event, and splitting those omnibus documents into the individual events or documents.
DOCUMENT SPLITTING TOOL
Various embodiments disclosed relate to automated docketing of incoming electronic communications and documents. The present disclosure includes methods and systems for identifying omnibus documents containing more than one event, and splitting those omnibus documents into the individual events or documents.
SYSTEMS AND METHODS FOR DYNAMICALLY REMOVING TEXT FROM DOCUMENTS
Disclosed are techniques for building a dynamic dictionary and using the dictionary to remove phrases or words appearing in and out of context in a document. The techniques include, for example, receiving electronic health record (EHR) data, determining, using natural language processing (NLP), an instance of a personal health information (PHI) phrase in the EHR data based on a NLP system confidence metric being above a threshold, determining another instance of the PHI phrase in the EHR data that does not have the same context as the first context, removing the instances of the PHI phrase from the EHR data to produce cleaned EHR data, and taking an action based on the cleaned EHR data. The confidence metric can indicate likelihood that the PHI phrase is a PHI phrase and the metric can be based at least in part on a first context of the PHI phrase.
SYSTEMS AND METHODS FOR DYNAMICALLY REMOVING TEXT FROM DOCUMENTS
Disclosed are techniques for building a dynamic dictionary and using the dictionary to remove phrases or words appearing in and out of context in a document. The techniques include, for example, receiving electronic health record (EHR) data, determining, using natural language processing (NLP), an instance of a personal health information (PHI) phrase in the EHR data based on a NLP system confidence metric being above a threshold, determining another instance of the PHI phrase in the EHR data that does not have the same context as the first context, removing the instances of the PHI phrase from the EHR data to produce cleaned EHR data, and taking an action based on the cleaned EHR data. The confidence metric can indicate likelihood that the PHI phrase is a PHI phrase and the metric can be based at least in part on a first context of the PHI phrase.
WORD MINING METHOD AND APPARATUS, ELECTRONIC DEVICE AND READABLE STORAGE MEDIUM
The present disclosure provides a word mining method and apparatus, an electronic device and a readable storage medium, and relates to the field of artificial intelligence technologies, such as natural language processing technologies, deep learning technologies, cloud service technologies, or the like. The word mining method includes: acquiring search data; taking first identification information, a search sentence and second identification information in the search data as nodes, and taking a relationship between the first identification information and the search sentence, a relationship between the first identification information and the second identification information and a relationship between the search sentence and the second identification information as sides to construct a behavior graph; obtaining a label vector of each search sentence in the behavior graph according to a search sentence with a preset label in the behavior graph; determining a target search sentence in the behavior graph according to the label vector; and extracting a target word from the target search sentence, and taking the target word as a word mining result of the search data.
WORD MINING METHOD AND APPARATUS, ELECTRONIC DEVICE AND READABLE STORAGE MEDIUM
The present disclosure provides a word mining method and apparatus, an electronic device and a readable storage medium, and relates to the field of artificial intelligence technologies, such as natural language processing technologies, deep learning technologies, cloud service technologies, or the like. The word mining method includes: acquiring search data; taking first identification information, a search sentence and second identification information in the search data as nodes, and taking a relationship between the first identification information and the search sentence, a relationship between the first identification information and the second identification information and a relationship between the search sentence and the second identification information as sides to construct a behavior graph; obtaining a label vector of each search sentence in the behavior graph according to a search sentence with a preset label in the behavior graph; determining a target search sentence in the behavior graph according to the label vector; and extracting a target word from the target search sentence, and taking the target word as a word mining result of the search data.
METHODS AND SYSTEMS FOR EXPANDING VOCABULARY
The present disclosure provides a method and a system for expanding vocabulary. The method includes: obtaining a target vocabulary, the target vocabulary including a single word or a phrase composed of two or more words; obtaining at least one candidate text associated with the target vocabulary; determining a plurality of candidate vocabularies from the at least one candidate text, the plurality of candidate vocabularies including words from the at least one candidate text and a phrase formed by at least two consecutive words in position; and determining at least one expansion vocabulary of the target vocabulary from the plurality of candidate vocabularies.
METHODS AND SYSTEMS FOR EXPANDING VOCABULARY
The present disclosure provides a method and a system for expanding vocabulary. The method includes: obtaining a target vocabulary, the target vocabulary including a single word or a phrase composed of two or more words; obtaining at least one candidate text associated with the target vocabulary; determining a plurality of candidate vocabularies from the at least one candidate text, the plurality of candidate vocabularies including words from the at least one candidate text and a phrase formed by at least two consecutive words in position; and determining at least one expansion vocabulary of the target vocabulary from the plurality of candidate vocabularies.
USING EMAIL HISTORY TO ESTIMATE CREDITWORTHINESS FOR APPLICANTS HAVING INSUFFICIENT CREDIT HISTORY
In some implementations, a credit decision platform may receive a credit request from an applicant and obtain domestic historical data associated with the applicant from a credit bureau device. The credit decision platform may obtain access to an email account associated with the applicant based on determining that the domestic historical data associated with the applicant is insufficient to process the credit request. The credit decision platform may identify, using one or more machine learning models, a set of email messages included in the email account that are relevant to the credit request and may analyze content included in the set of email messages to generate non-domestic historical data associated with the applicant. The credit decision platform may generate a decision on the credit request based on an estimated creditworthiness of the applicant, which may be determined based on the non-domestic historical data.
USING EMAIL HISTORY TO ESTIMATE CREDITWORTHINESS FOR APPLICANTS HAVING INSUFFICIENT CREDIT HISTORY
In some implementations, a credit decision platform may receive a credit request from an applicant and obtain domestic historical data associated with the applicant from a credit bureau device. The credit decision platform may obtain access to an email account associated with the applicant based on determining that the domestic historical data associated with the applicant is insufficient to process the credit request. The credit decision platform may identify, using one or more machine learning models, a set of email messages included in the email account that are relevant to the credit request and may analyze content included in the set of email messages to generate non-domestic historical data associated with the applicant. The credit decision platform may generate a decision on the credit request based on an estimated creditworthiness of the applicant, which may be determined based on the non-domestic historical data.