G06F40/284

Computerized smart inventory search methods and systems using classification and tagging

A method and system operable for: receiving a search query including search terms; using a machine learning module, selecting features of the search terms and mapping an association between the search terms and a domain object, thereby generating a domain object classification; tagging the domain object with the domain object classification; and using the domain object tagged with the domain object classification to conduct a subsequent search. Conducting the subsequent search includes: receiving a subsequent search query including subsequent search terms; tokenizing the subsequent search terms; finding permutations of the tokenized subsequent search terms; matching the subsequent search terms to the domain object tagged with the domain object classification; and displaying subsequent search results via a user interface.

Computerized smart inventory search methods and systems using classification and tagging

A method and system operable for: receiving a search query including search terms; using a machine learning module, selecting features of the search terms and mapping an association between the search terms and a domain object, thereby generating a domain object classification; tagging the domain object with the domain object classification; and using the domain object tagged with the domain object classification to conduct a subsequent search. Conducting the subsequent search includes: receiving a subsequent search query including subsequent search terms; tokenizing the subsequent search terms; finding permutations of the tokenized subsequent search terms; matching the subsequent search terms to the domain object tagged with the domain object classification; and displaying subsequent search results via a user interface.

Tibetan Character Constituent Analysis Method, Tibetan Sorting Method And Corresponding Devices
20180011836 · 2018-01-11 ·

The present invention discloses a Tibetan character constituent analysis method, a Tibetan sorting method and corresponding devices, and relates to the field of natural language processing. The present invention is proposed to solve the problem that the existing Tibetan sorting methods have no universality or compatibility, which is inconvenient for the use of automatic computer Tibetan sorting. The technical solution provided by the present invention includes: S10, acquiring a Tibetan text to be analyzed; S20, using Tibetan characters in the Tibetan text as the input of a preset finite state automaton group; and S30, acquiring the constituents of the Tibetan characters according to a target finite state automaton, when the target finite state automaton in the finite state automaton group determines that the Tibetan characters in the Tibetan text are correctly spelled.

COMPUTER IMPLEMENTED METHODS AND SYSTEMS FOR COMPREHENSIVELY IDENTIFYING DECLINED SERVICES FROM SERVICE WRITE UP RECORDS
20180012266 · 2018-01-11 ·

Computer implemented methods and systems are disclosed for automatically identifying declined services from service records by extracting information from fields in the service record, analyzing the extracted information to identify issues found and issues addressed in the service record, comparing the issues found and issues addressed to identify issues found in the service record unrelated to the issues addressed, and inferring the issues found unrelated to the issues addressed to be declined services.

Systems and methods for targeted annotation of data

There is provided a system and a method of generating an annotated structured dataset, comprising: receiving a medical classification term, searching over the unstructured patient data for extracting unclassified unstructured text fragments, presenting a subset of the unclassified unstructured text fragments, receiving an indication of a selection of none or at least one of the text fragments, and one of: (i) classifying non-selected unclassified unstructured text fragments according to the medical classification term, and classifying selected text fragments as not satisfying the medical classification term, and (ii) classifying selected unclassified unstructured text fragments according to the medical classification term, and classifying non-selected unclassified unstructured text fragments as not satisfying the medical classification term, and iterating the searching, and/or the presenting, until no text fragments are obtained by the search, wherein the annotated structured dataset is created by the classification of unclassified unstructured text fragments into the medical classification term.

Hybrid approach to approximate string matching using machine learning

Systems, apparatuses, and methods are provided for identifying a corresponding string stored in memory based on an incomplete input string. A system can analyze and produce phonetic and distance metrics for a plurality of strings stored in memory by comparing the plurality of strings to an incomplete input string. These similarity metrics can be used as the input to a machine learning model, which can quickly and accurately provide a classification. This classification can be used to identify a string stored in memory that corresponds to the incomplete input string.

Hybrid approach to approximate string matching using machine learning

Systems, apparatuses, and methods are provided for identifying a corresponding string stored in memory based on an incomplete input string. A system can analyze and produce phonetic and distance metrics for a plurality of strings stored in memory by comparing the plurality of strings to an incomplete input string. These similarity metrics can be used as the input to a machine learning model, which can quickly and accurately provide a classification. This classification can be used to identify a string stored in memory that corresponds to the incomplete input string.

Method and system for suggesting revisions to an electronic document

A method for suggesting revisions to a document-under-analysis from a seed database, the seed database including a plurality of original texts each respectively associated with one of a plurality of final texts, the method for suggesting revisions including selecting a statement-under-analysis (“SUA”), selecting a first original text of the plurality of original texts, determining a first edit-type classification of the first original text with respect to its associated final text, generating a first similarity score for the first original text based on the first edit-type classification, the first similarity score representing a degree of similarity between the SUA and the first original text, selecting a second original text of the plurality of original texts, determining a second edit-type classification of the second original text with respect to its associated final text, generating a second similarity score for the second original text based on the second edit-type classification, the second similarity score representing a degree of similarity between the SUA and the second original text, selecting a candidate original text from one of the first original text and the second original text, and creating an edited SUA (“ESUA”) by modifying a copy of the first SUA consistent with a first candidate final text associated with the first candidate original text.

Systems and methods for modeling item similarity and correlating item information

Disclosed herein are systems and methods for correlating item data. A system for correlating item data may comprise a memory storing instructions and at least one processor configured to execute instructions to perform operations comprising: receiving reference text data associated with a reference item from a device; receiving reference image data associated with the reference item from the remote device; determining candidate text data and candidate image data associated with at least one candidate item; selecting a text correlation model; determining a first similarity score by applying the text correlation model to the reference text data and the candidate text data; selecting an image correlation model; determining a second similarity score by applying the image correlation model to the reference image data and the candidate image data; calculating a confidence score based on the first and second similarity scores; and performing a responsive action based on the calculated confidence score.

SPEECH ENDPOINTING BASED ON WORD COMPARISONS

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing based on word comparisons are described. In one aspect, a method includes the actions of obtaining a transcription of an utterance. The actions further include determining, as a first value, a quantity of text samples in a collection of text samples that (i) include terms that match the transcription, and (ii) do not include any additional terms. The actions further include determining, as a second value, a quantity of text samples in the collection of text samples that (i) include terms that match the transcription, and (ii) include one or more additional terms. The actions further include classifying the utterance as a likely incomplete utterance or not a likely incomplete utterance based at least on comparing the first value and the second value.