G06F2216/11

Methods, systems, and storage media for automatically identifying relevant chemical compounds in patent documents

Methods, systems, and non-transitory media for training a chemical entity recognition system to extract chemical compounds from a patent document and determine a relevance of the chemical compounds to the patent document are disclosed. A method includes obtaining patent documents from patent databases, normalizing each patent document into a unified format, and generating a chemical patent corpus. The chemical patent corpus includes chemical entities, each having relevancy annotations that indicate a relevance to the patent document from which the chemical entity is extracted. The method further includes providing the chemical patent corpus to the chemical entity recognition system, which tags the one or more chemical entities in a corresponding normalized patent document, extracts additional chemical entities, assigns a confidence score to each additional chemical entity, and labels each additional chemical entity as relevant or irrelevant to an associated patent document based on information contained in the chemical patent corpus.

SOFTWARE-AIDED CONSISTENT ANALYSIS OF DOCUMENTS

The present technology pertains to a system for automatic analysis and segregation of documents. The system provides a graphical user interface for receiving inputs pertaining to a first document of a plurality of documents in a document analysis project. For example, the graphical user interfaces may receive a classification input classifying the first document with a first classification. The system automatically analyzes other documents in the plurality of documents to identify a subset of documents that are similar to the first document, and automatically classify the subset of the documents that are similar to the first document with the first classification. Further, the present technology pertains to conducting a patent analysis project by a team of analysts, including presenting a detailed analysis user interface for reviewing patent-related documents, where the detailed analysis user interface includes text of a first patent-related document to be analyzed and categories and related subcategories.

User interface for providing docketing data

Methods and systems for receiving docketing data are disclosed. The methods and systems perform operations comprising: obtaining, by a first party, a patent file wrapper from a publicly accessible database of patent records, the patent file wrapper including a plurality of patent documents; receiving, from the first party, user input that tags a patent document of the plurality of patent documents in the patent file wrapper, wherein the patent document that is tagged is associated with a patent activity that occurred within a threshold period of time; and transmitting, to a second party by the first party, a communication that includes the tagged patent document.

AUTOMATIC INDUSTRY CLASSIFICATION METHOD AND SYSTEM
20220374462 · 2022-11-24 · ·

An automatic industry classification method comprises: determining a scope of target patents, defining a target industry tree; generating marks on the target industry tree; performing a rough classification for the target patents by using the marks; performing a fine classification for the target patents according to a result of the rough classification. The automatic industry classification method and system provided by the present invention uses a transductive learning method, so that full mining of small annotation quantity information is realized. The automatic industry classification method and system uses information of IPC, so that information dimension is enriched, and calculation amount needed in the classification is reduced. The automatic industry classification method and system further uses the hierarchical vectors generated by the abstract, the claims and the description, so that the information of word order relation is reserved, and the patent text is deeply mined.

Intellectual property recommending method and system

An intellectual property (IP) recommending method and an IP recommending system are provided. In the method, a plurality of IP portfolios respectively designated for a plurality of product designs are retrieved and usage data of a plurality of IPs included in each of the plurality of IP portfolios are extracted. A machine learning (ML) model is trained by using a portion of the retrieved IP portfolios and the extracted usage data. In response to receiving at least one criterion for a desired product design from a user, a plurality of IPs adapted for the desired product design are predicted based on the ML model and recommended for the user.

DYNAMIC DATA SET MODIFICATION AND MAPPING
20230077956 · 2023-03-16 ·

Systems and methods for generating a data map based on a dynamically updated data set. The system includes a client-side device. The client-side device includes a controller and is operably connected to a communication network. The controller includes a processor and a non-transitory computer readable data storage medium, the processor is configured to retrieve from the medium and execute computer readable instructions to receive a data set including one or more data assets and generate the data map based on the received data set. Each data asset includes four or more attributes. The data map includes one or more segments, and the one or more segments illustrate each of the four or more attributes of the one or more data assets.

SEARCH SYSTEM AND SEARCH METHOD
20230078094 · 2023-03-16 ·

A search system capable of searching for an image with a similar represented concept is provided. The search system includes an input unit, a text extraction unit, a tag obtaining unit, and a tag similarity calculation unit. When image data to which an image label is assigned and document data including the image label are supplied to the input unit, the text extraction unit is configured to extract tag-obtaining-purpose text data from the document data on the basis of the image label. The tag obtaining unit is configured to obtain a tag including at least a part of words included in the tag-obtaining-purpose text data. The tag similarity calculation unit is configured to calculate similarity between tags. It is possible to search for an image having a greatly different feature value of the image itself but having a similar represented concept.

Dynamic data set modification and mapping
11475037 · 2022-10-18 ·

Systems and methods for generating a data map based on a dynamically updated data set. The system includes a client-side device. The client-side device includes a controller and is operably connected to a communication network. The controller includes a processor and a non-transitory computer readable data storage medium, the processor is configured to retrieve from the medium and execute computer readable instructions to receive a data set including one or more data assets and generate the data map based on the received data set. Each data asset includes four or more attributes. The data map includes one or more segments, and the one or more segments illustrate each of the four or more attributes of the one or more data assets.

METHOD AND APPARATUS FOR DERIVING KEYWORDS BASED ON TECHNICAL DOCUMENT DATABASE
20230126421 · 2023-04-27 ·

A method includes searching a technical document including a first data field, a second data field and a third data field based on search terms and search year ranges related to a technical field, generating a keyword set using the first data field, the second data field and the third data field of the searched technical document, scoring a plurality of keywords included in the keyword set, and selecting some of the plurality of keywords, re-searching the technical document related to the technical field, using the selected keywords, scoring the re-searched technical document to derive a representative document representing the technical field, and deriving a representative keyword representing the technical field, using the second data field included in the representative document, wherein the first data field includes a title of the technical document, the second data field includes a summary of the technical document, and the third data field includes keywords of the technical document.

TECHNOLOGY MATURITY JUDGMENT METHOD AND SYSTEM BASED ON SCIENCE AND TECHNOLOGY DATA

A technology maturity judgment method based on science and technology data comprises establishing a database, an algorithm library, and an index library, and further comprises: performing a data retrieval in the database; performing a data calculation and organization on a retrieval result; performing a regression calculation on organized data to obtain a technical maturity index; and obtaining a judgment conclusion according to the technical maturity index. The technology maturity judgment method is mainly based on a patent analysis method, which assisted by technical data analysis of papers and projects. Technical points are placed in a technology cluster associated with the technology to perform a comprehensive analysis by multi-dimensional evaluation indexes and algorithm. Mapping between science and technology data indexes and technology maturity is established and automatic judgment is achieved. The method does not need a large amount of artificial subjective work.