INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND INFORMATION PROCESSING PROGRAM
20230326560 · 2023-10-12
Inventors
Cpc classification
International classification
Abstract
An information processing apparatus performs a first search for searching structure data indicating a structure of a chemical substance, which is a processing target, for a partial structure included in partial structure data in which the partial structure of the chemical substance and an index value indicating performance or a structure of the partial structure are associated with each other, performs a second search for searching past data for a chemical substance including the partial structure extracted by the first search, derives a reliability degree of the index value of the partial structure based on a total number of the chemical substances extracted by the second search and an index value corresponding to the chemical substance, and performs control of displaying the partial structure extracted by the first search and the reliability degree of the partial structure on a display device.
Claims
1. An information processing apparatus comprising: at least one processor, wherein the processor performs a first search for searching structure data indicating a structure of a chemical substance, which is a processing target, for a partial structure included in partial structure data in which the partial structure of the chemical substance and an index value indicating performance or a structure of the partial structure are associated with each other, performs a second search for searching past data in which the structure of the chemical substance and an index value indicating performance or the structure of the chemical substance obtained by an experiment are associated with each other for a chemical substance including the partial structure extracted by the first search, derives a reliability degree indicating reliability of the index value of the partial structure extracted by the first search based on a total number of the chemical substances extracted by the second search and an index value corresponding to the chemical substance, and performs control of displaying the partial structure extracted by the first search and the reliability degree of the partial structure on a display device.
2. The information processing apparatus according to claim 1, wherein the processor performs control of displaying the reliability degree on the partial structure of the chemical substance, which is the processing target, in a discriminable manner.
3. The information processing apparatus according to claim 1, wherein the processor performs control of displaying the reliability degree in descending order of the reliability degree in a case in which there are a plurality of the partial structures extracted by the first search.
4. The information processing apparatus according to claim 1, wherein the processor performs weighting according to the total number of the chemical substances extracted by the second search in a case in which the reliability degree is derived.
5. An information processing method executed by a processor provided in an information processing apparatus, the method comprising: performing a first search for searching structure data indicating a structure of a chemical substance, which is a processing target, for a partial structure included in partial structure data in which the partial structure of the chemical substance and an index value indicating performance or a structure of the partial structure are associated with each other; performing a second search for searching past data in which the structure of the chemical substance and an index value indicating performance or the structure of the chemical substance obtained by an experiment are associated with each other for a chemical substance including the partial structure extracted by the first search; deriving a reliability degree indicating reliability of the index value of the partial structure extracted by the first search based on a total number of the chemical substances extracted by the second search and an index value corresponding to the chemical substance; and performing control of displaying the partial structure extracted by the first search and the reliability degree of the partial structure on a display device.
6. A non-transitory computer-readable storage medium storing an information processing program causing a processor provided in an information processing apparatus to execute a process comprising: performing a first search for searching structure data indicating a structure of a chemical substance, which is a processing target, for a partial structure included in partial structure data in which the partial structure of the chemical substance and an index value indicating performance or a structure of the partial structure are associated with each other; performing a second search for searching past data in which the structure of the chemical substance and an index value indicating performance or the structure of the chemical substance obtained by an experiment are associated with each other for a chemical substance including the partial structure extracted by the first search; deriving a reliability degree indicating reliability of the index value of the partial structure extracted by the first search based on a total number of the chemical substances extracted by the second search and an index value corresponding to the chemical substance; and performing control of displaying the partial structure extracted by the first search and the reliability degree of the partial structure on a display device.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
DETAILED DESCRIPTION
[0023] Hereinafter, with reference to the drawings, an embodiment for performing the technology of the present disclosure will be described in detail.
[0024] First, with reference to
[0025] The storage unit 22 is realized by a hard disk drive (HDD), a solid state drive (SSD), a flash memory, or the like. An information processing program 30 is stored in the storage unit 22 as a storage medium. The CPU 20 reads out the information processing program 30 from the storage unit 22, develops the read out information processing program 30 in the memory 21, and executes the developed information processing program 30.
[0026] In addition, partial structure data 32 and past data 34 are stored in the storage unit 22. With reference to
[0027] As shown in
[0028] In addition, as shown in
[0029]
[0030]
[0031] As described above, since the past data 34 is the actual measurement value obtained by the past experiment, the reliability of the index value included in the past data 34 is relatively high. On the other hand, since the index value included in the partial structure data 32 is based on the experience and knowledge of the expert, the reliability may be relatively low. Therefore, the information processing apparatus 10 according to the present embodiment has a function of deriving a reliability degree of the index value of the partial structure constituting the chemical substance, which is the processing target, by using the partial structure data 32 and the past data 34.
[0032] Next, with reference to
[0033] The reception unit 40 receives the structure data indicating the structure of the chemical substance, which is the processing target and is input by a user. The user inputs the structure data indicating the structure of the chemical substance by using, for example, a molecular design editor, such as ChemDraw (registered trademark).
[0034] The first search unit 42 performs a first search for searching the structure data received by the reception unit 40 for the partial structure included in the partial structure data 32. The second search unit 44 performs a second search for searching the past data 34 for the chemical substance including the partial structure extracted by the first search by the first search unit 42.
[0035] The derivation unit 46 derives the reliability degree indicating the reliability of the index value of the partial structure extracted by the first search by the first search unit 42 based on a total number of the chemical substances extracted by the second search by the second search unit 44 and the index values corresponding to the chemical substances. In a case in which the reliability degree is derived, the derivation unit 46 performs weighting according to the total number of the chemical substances extracted by the second search by the second search unit 44. Specifically, the derivation unit 46 derives the reliability degree according to Expression (1).
Reliability degree=The number of matches÷(10×√{square root over (Total number of chemical substances)}) (1)
[0036] In Expression (1), “the number of matches” means the number of the chemical substances in which the type of the index value and the index value match the type of the index value of the partial structure and the index value extracted by the first search among the chemical substances extracted by the second search. As a specific example, a case will be described in which “partial graph B” is included in the structure data received by the reception unit 40, and “toxicity” and “HIGH” are associated with “partial graph B” in the partial structure data 32. In addition, in this case, a case will be described in which 99 chemical substances including “partial graph B” are included in the past data 34, and the number of the chemical substances of which “toxicity” is “HIGH” is 90 and the number of the chemical substances of which “toxicity” is “LOW” is 9 among the 99 chemical substances.
[0037] In this case, the number of matches in Expression (1) is “90”, and the total number of the chemical substances is “99”. Therefore, in this case, the derivation unit 46 derives the reliability degree of the index value of “partial graph B” as “0.9” as shown in Expression (2).
Reliability degree=90÷(10×√{square root over (99)})=90÷99.5=0.9 (2)
[0038] It should be noted that the method of deriving the reliability degree is not limited to the method using Expression (1). For example, the reliability degree may be derived using an expression other than Expression (1) in which a higher reliability degree is derived as a ratio of the number of matches to the total number of the chemical substances is higher and the number of matches is larger.
[0039] The display control unit 48 performs control of displaying the partial structure extracted by the first search and the reliability degree of the partial structure derived by the derivation unit 46 on the display 23. In this case, the display control unit 48 performs control of displaying the reliability degree on the partial structure of the chemical substance, which is the processing target, in a discriminable manner. In addition, in a case in which there are a plurality of partial structures extracted by the first search, the display control unit 48 performs control of displaying the reliability degree in descending order of the reliability degree.
[0040]
[0041] In the display region A2, for each of the partial structures extracted by the first search, the type of the index value associated with the partial structure and the index value, the reliability degree of the index value, the number of matches used in a case in which the reliability degree is derived, and the total number of the chemical substances are displayed. In addition, in the display region A2, in a case in which there are the plurality of partial structures extracted by the first search, the reliability degree is displayed in descending order of the reliability degree.
[0042] Next, with reference to
[0043] In step S10 of
[0044] In step S14, the second search unit 44 performs the second search for searching the past data 34 for the chemical substance including the partial structure extracted by the first search in step S12. In step S16, as described above, according to Expression (1), the derivation unit 46 derives the reliability degree of the index value of the partial structure extracted by the first search in step S12 based on the total number of the chemical substances extracted by the second search in step S14 and the index value corresponding to the chemical substance.
[0045] In step S18, as described above, the display control unit 48 performs control of displaying the partial structure extracted by the first search in step S12 and the reliability degree derived in step S16 for the partial structure on the display 23. The reliability degree display screen shown in
[0046] By viewing the reliability degree display screen displayed on the display 23, the user can understand, for example, the partial structure having a high risk and high reliability for the high risk among the chemical substances designed by the user himself/herself.
[0047] As described above, according to the present embodiment, it is possible to support appropriate screening.
[0048] It should be noted that, in the embodiment described above, various processors shown below can be used as the hardware structure of processing units that execute various pieces of processing, such as the reception unit 40, the first search unit 42, the second search unit 44, the derivation unit 46, and the display control unit 48. As described above, the various processors include, in addition to the CPU that is a general-purpose processor that executes software (program) to function as various processing units, a programmable logic device (PLD) that is a processor of which a circuit configuration can be changed after manufacture, such as a field programmable gate array (FPGA), and a dedicated electric circuit that is a processor having a circuit configuration that is designed for exclusive use in order to execute specific processing, such as an application specific integrated circuit (ASIC).
[0049] One processing unit may be configured by one of the various processors, or may be configured by a combination of two or more processors of the same type or different types (for example, a combination of a plurality of FPGAs, or a combination of the CPU and the FPGA). Further, a plurality of processing units may be configured by one processor.
[0050] A first example of the configuration in which the plurality of processing units are configured by one processor is a form in which one processor is configured by a combination of one or more CPUs and the software and this processor functions as the plurality of processing units, as represented by computers, such as a client and a server. A second example thereof is a form of using a processor that realizes the function of the entire system including the plurality of processing units by one integrated circuit (IC) chip, as represented by a system on chip (SoC) or the like. In this way, the various processing units are configured by using one or more of the various processors described above, as the hardware structure.
[0051] Further, more specifically, as the hardware structure of the various processors, an electric circuit (circuitry) in which circuit elements, such as semiconductor elements, are combined can be used.
[0052] In addition, in the embodiment described above, an aspect has been described in which the information processing program 30 is stored (installed) in the storage unit 22 in advance, but the present disclosure is not limited to this. The information processing program 30 may be provided in a form of being recorded in a recording medium, such as a compact disc read only memory (CD-ROM), a digital versatile disc read only memory (DVD-ROM), and a universal serial bus (USB) memory. Moreover, the information processing program 30 may be provided in a form being downloaded from an external device via a network.
[0053] The disclosure of JP2020-216936 filed on Dec. 25, 2020 is incorporated in the present specification by reference in its entirety. Also, all documents, patent applications, and technical standards described in the present specification are incorporated in the present specification by reference to the same extent as a case in which each document, patent application, and technical standard are specifically and individually described to be incorporated by reference.