System and Method for Natural Language Translation and Reconciliation of Colloquial Chemical Terminology
20230342547 · 2023-10-26
Inventors
Cpc classification
International classification
Abstract
Disclosed is a system and method for natural language translation and reconciliation of colloquial chemical terminology.
Claims
1. A method of automatically generating a colloquial complete definitions table for chemical compounds, the method comprising the steps of: (A) providing an algorithmically technology that incorporates both the chemical structure and the textual colloquial definitions; (B) reconciling the ecosystem literature for chemical and active pharmaceutical ingredients as well as cannabinoids, psychedelics, and other types of treatments; (C) executing an enhanced capability for performing sentiment analysis relating to, but not limited to, pharmaceutical and non-pharmaceutical; and (D) executing an augmented NLP algorithm for reconciling scientific definitions, and utilizing, but is not limited to, a graph DB structure, wherein implementation can be activated through SQL, NoSQL, and other relational and non-relational data structures.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0004]
DETAIL DESCRIPTIONS OF THE INVENTION
[0005] All illustrations of the drawings are for the purpose of describing selected versions of the present invention and are not intended to limit the scope of the present invention.
[0006] As can be seen in
[0007] The present inventor has invented a Two-part data structure where the base of the structure (initialization node) is the seed. This seed contains a meta-data interaction definition of a molecule that is exogenous and unaffected by linguistic translations because it's anchored in a language-agnostic chemical definition.
[0008] From this seed definition that is non-linguistic, the present inventor then activates a combing process for navigating a pool of documents and creating an interactive, nested Ranking system that utilizes breadth-first search in order to associate and identify terms immediately associated with the molecule. Propagating a colloquial crawler to then find subsequent terms that may have a further geometric distance in the dataset from the seed molecule and its most immediate definition in the particular language or alphabet.
[0009] The documents in the “document pool” can include by not limited to: academic publications, public regulator announcements, conversation transcripts, blogs, forums, social media platforms, online menus, scanned documents, metadata, computer code, etc, and can span any existing, or yet to be developed symbolic language.
[0010] The outcome of all of this is a deterministic process that identifies, ranks, and consolidates all of the necessarily associated terminologies that can then enable commercial and academic explorers to have a more comprehensive and detailed view of the literary ecosystem that contains the terminology associated with the chemical.
[0011] The outcome is also a weighted average nodal representation which can then be used to calculate colloquial sentiment surrounding the chemical in real time. The present invention also works to help to overcome social inequity and racial barriers given the necessity and use of slang in identifying and naming these molecules. This is historical because of the disjointed regulatory space both at a national and international level with regards to the lack of agreement with regards to access to these substances. By creating a methodology to reconcile these terminologies we can jumpstart research that may have slowed due to the fragmented environment. It may be possible to enhance health outcomes and accelerate research processes and workflows by creating a single point of understanding with regards to how we as biological and social creatures interact, and report their interactions, with these molecules.
[0012] The prior art for the present invention includes the following: [0013] 1. U.S. Pat. No. 8,990,066B2 [0014] 2. U.S. Pat. No. 8,631,126B2 [0015] 3. WO2006091528A3 [0016] 4. WO2009110991A2 [0017] 5. U.S. Pat. No. 10,909,328B2 [0018] 6. US20160343004A1 [0019] 7. U.S. Pat. No. 8,838,633B2 [0020] 8. “Markets With Memory”, by Nima Veiseh, 2019.ISBN: 9781700165121
[0021] Although the invention has been explained in relation to its preferred embodiment, it is to be understood that many other possible modifications and variations can be made without departing from the spirit and scope of the invention.