Material Development Support Apparatus, Material Development Support Method, and Material Development Support Program
20230037015 · 2023-02-02
Inventors
Cpc classification
G16C20/30
PHYSICS
G06N3/0442
PHYSICS
International classification
G16C60/00
PHYSICS
G16C20/30
PHYSICS
Abstract
An embodiment includes a materials development support apparatus including an input data acquisition device configured to acquire input data including a material of a base forming a thin film and a function of the thin film, a candidate data generator configured to provide a preset verification target material as an input to a first learning, output a plurality of candidates for a function provided by the verification target material, an inverse analyzer configured to select a material that provides the function of the thin film included in the input data from the plurality of candidates for the function included in the candidate data, provide the material of the base included in the input data and the selected material as inputs to a second learning model, output a candidate for structure of the thin film, and a presenter configured to present the candidate for the structure of the thin film output.
Claims
1-7. (canceled)
8. A materials development support apparatus comprising: an input data acquisition device configured to acquire input data including a material of a base forming a thin film and a function of the thin film; a candidate data generator configured to provide a preset verification target material as an input to a first learning model in which a relationship between an individual one of a plurality of materials used for forming a thin film and a function provided by the material is previously learned, perform an operation of the first learning model, output a plurality of candidates for a function provided by the verification target material, and generate candidate data; an inverse analyzer configured to select a material that provides the function of the thin film included in the input data from the plurality of candidates for the function included in the candidate data, provide the material of the base included in the input data and the selected material as inputs to a second learning model in which compatibility with the base forming the thin film is previously acquired by learning, perform an operation of the second learning model, and output a candidate for structure of the thin film; and a presenter configured to present the candidate for the structure of the thin film output by the inverse analyzer
9. The materials development support apparatus according to claim 8, further comprising: a first extractor configured to extract a plurality of preset function names indicating the function of the thin film from an individual one of a plurality of document data; and a second extractor configured to extract a plurality of preset material names indicating the material used for forming the thin film from an individual one of a plurality of document data.
10. The materials development support apparatus according to claim 9, further comprising: a first learning data generator configured to generate first learning data in which a material and a function provided by the material are associated with each other for each of the plurality of material names, based on the plurality of function names extracted by the first extractor and the plurality of material names extracted by the second extractor; and a second learning data generator configured to generate second learning data in which the individual material indicated by the plurality of material names and compatibility with the base forming the thin film are associated with each other, based on the plurality of function names extracted by the first extractor, the plurality of material names extracted by the second extractor, and extraction-source document data.
11. The materials development support apparatus according to claim 10, further comprising: a first learning processor configured to train a preset first machine learning model by using the first learning data and construct the first learning model in which a relationship between a material and a function provided by the material is learned; a second learning processor configured to train a preset second machine learning model by using the second learning data and construct the second learning model in which compatibility with the base forming the thin film is acquired by learning; a first learning model storage device configured to store the trained first learning model; and a second learning model storage device configured to store the trained second learning model.
12. A materials development support method comprising: acquiring input data including a material of a base forming a thin film and a function of the thin film; providing a preset verification target material as an input to a first learning model in which a relationship between an individual one of a plurality of materials used for forming a thin film and a function provided by the material is previously learned; performing an operation of the first learning model; outputting a plurality of candidates for a function provided by the verification target material; generating candidate data; selecting a material configured to provide the function of the thin film included in the input data from the plurality of candidates for the function included in the candidate data; providing the material of the base included in the input data and the selected material as inputs to a second learning model in which compatibility with the base forming the thin film is previously acquired by learning; performing an operation of the second learning model; outputting a candidate for structure of the thin film; and presenting the candidate for the structure of the thin film output.
13. The materials development support method according to claim 12, comprising: extracting a plurality of preset function names indicating the function of the thin film from an individual one of a plurality of document data; and extracting a plurality of preset material names indicating the material used for forming the thin film from an individual one of a plurality of document data.
14. The materials development support method according to claim 13, comprising: generating first learning data in which a material and a function provided by the material are associated with each other for each of the plurality of material names, based on the plurality of function names extracted in the first extraction process and the plurality of material names extracted in the second extraction process; and generating second learning data in which the individual material indicated by the plurality of material names and compatibility with the base forming the thin film are associated with each other, based on the plurality of function names extracted in the first extraction process, the plurality of material names extracted in the second extraction process, and the extraction-source document data.
15. The materials development support method according to claim 14, comprising: training a preset first machine learning model by using the first learning data and constructs the first learning model in which a relationship between a material and a function provided by the material is learned; training a preset second machine learning model by using the second learning data and constructs the second learning model in which compatibility with the base forming the thin film is acquired by learning; storing the trained first learning model in a first learning model storage device; and storing the trained second learning model in a second learning model storage device.
16. A materials development support program that causes a computer to execute: an input data acquisition process that acquires input data including a material of a base forming a thin film and a function of the thin film; a candidate data generation process that provides a preset verification target material as an input to a first learning model in which a relationship between an individual one of a plurality of materials used for forming a thin film and a function provided by the material is previously learned, performs an operation of the first learning model, outputs a plurality of candidates for a function provided by the verification target material, and generates candidate data; an inverse analysis process that selects a material that provides the function of the thin film included in the input data from the plurality of candidates for the function included in the candidate data, provides the material of the base included in the input data and the selected material as inputs to a second learning model in which compatibility with the base forming the thin film is previously acquired by learning, performs an operation of the second learning model, and outputs a candidate for structure of the thin film; and a presentation process that presents the candidate for the structure of the thin film output in the inverse analysis process.
17. The materials development support program according to claim 16 that causes the computer to further execute: a first extraction process that extracts a plurality of preset function names indicating the function of the thin film from an individual one of a plurality of document data; and a second extraction process that extracts a plurality of preset material names indicating the material used for forming the thin film from an individual one of a plurality of document data;
18. The materials development support program according to claim 17 that causes the computer to further execute: a first learning data generation process that generates first learning data in which a material and a function provided by the material are associated with each other for each of the plurality of material names, based on the plurality of function names extracted in the first extraction process and the plurality of material names extracted in the second extraction process; and a second learning data generation process that generates second learning data in which the individual material indicated by the plurality of material names and compatibility with the base forming the thin film are associated with each other, based on the plurality of function names extracted in the first extraction process, the plurality of material names extracted in the second extraction process, and the extraction-source document data.
19. The materials development support program according to claim 18 that causes the computer to further execute: a first learning processing process that trains a preset first machine learning model by using the first learning data and constructs the first learning model in which a relationship between a material and a function provided by the material is learned; and a second learning processing process that trains a preset second machine learning model by using the second learning data and constructs the second learning model in which compatibility with the base forming the thin film is acquired by learning.
20. The materials development support program according to claim 19 that causes the computer to further execute: a first learning model storage process that stores the trained first learning model in a first learning model storage device; and a second learning model storage process that stores the trained second learning model in a second learning model storage device.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS
[0035] Hereinafter, a preferred embodiment of the present invention will be described in detail with reference to
[0036] Outline of Embodiments of the Invention
[0037] First, an outline of a materials development support apparatus 1 according to an embodiment of the present invention will be described. The materials development support apparatus 1 according to the present embodiment extracts preset function names indicating a function of a thin film and preset material names indicating a material used for forming the thin film from a plurality of document data such as papers and generates learning data used in machine learning based on the extracted data.
[0038] The materials development support apparatus 1 trains a machine learning model (a first machine learning model) prepared in advance based on the learning data and constructs a first learning model in which a relationship between a material and a function provided by the material is learned. In addition, the materials development support apparatus 1 trains a preset machine learning model (a second machine learning model) by using the learning data and constructs a second learning model in which compatibility with a base forming the thin film is acquired by learning. Further, the materials development support apparatus 1 outputs the first learning model and the second learning model that have been trained to the outside.
First Embodiment
[0039] First, an outline of a configuration of the materials development support apparatus 1 according to a first embodiment of the present invention will be described. The materials development support apparatus 1 according to the first embodiment performs learning processing using machine learning and constructs a trained first learning model and a trained second learning model.
[0040] Functional Block of Materials Development Support Apparatus
[0041] The materials development support apparatus 1 includes a document DB 10, a first extraction unit 11, a second extraction unit 12, a learning data generation unit 13, a learning processing unit 14, a storage unit 15, a first learning model storage unit 16, a second learning model storage unit 17, and a presentation unit 18.
[0042] The document DB 10 stores text information such as papers. In the document DB 10, a plurality of documents related to a specific technique, for example, a thin film, is stored in advance. The document DB 10 can store document data in a specific language, for example, in English. For example, in a case of a paper, the document data stored in the document DB lo includes text data other than image data, such as titles, summaries, experimental methods, results, and consideration.
[0043] Hereinafter, a “sentence” means text data. Further, the “sentence” refers to text data of a character string divided by a punctuation mark or a period, and a “document” refers to a file of text data in a natural language including text composed of a plurality of “sentences”.
[0044] The first extraction unit 11 extracts a plurality of preset function names indicating a function of a thin film from an individual one of the plurality of document data stored in the document DB 10. In the present embodiment, the “function” includes, for example, not only a function that can be represented by energy calculation or the like in a mathematically uniform manner, such as thermoelectric conversion, but also information having relatively low mathematical relevance. For example, durability, transparency, liquid repellency, and flexibility can be listed as the function of the thin film. Words related to these preset functions are stored in the storage unit 15. For example, the first extraction unit 11 extracts a word indicating the function stored in the storage unit 15, such as “wettability” and “conductivity”, from the document data. In the present embodiment, the first extraction unit 11 can extract a word indicating the function from each of the document data sets.
[0045] The second extraction unit 12 extracts a plurality of preset material names indicating a material used for forming the thin film from an individual one of the plurality of document data stored in the document DB 10. The “material” includes, for example, a functional group such as “methyl”, “ethyl”, “vinyl”, and “fluoro”, a metal composition, and the material of a substrate (base) such as “glass” and “cellulose”. The second extraction unit 12 extracts words indicating the materials stored in the storage unit 15 from the document data. The second extraction unit 12 can extract the word indicating the material from each of the document data sets.
[0046] The first extraction unit 11 and the second extraction unit 12 can use a known character string search algorithm such as the Boyer-Moore (BM) algorithm and the Knuth-Morris-Pratt (KMP) algorithm when detecting a specific word from the document data. The extraction data including the “material” and the “function” extracted from each of the document data sets by the first extraction unit 11 and the second extraction unit 12 is stored in the storage unit 15.
[0047] The learning data generation unit 13 generates learning data based on the extraction data in which words indicating the preset “function” and “material” are extracted by the first extraction unit 11 and the second extraction unit 12.
[0048] More specifically, based on the plurality of function names extracted by the first extraction unit 11 and the plurality of material names extracted by the second extraction unit 12, the learning data generation unit (first learning data generation unit) 13 generates first learning data in which a material and a function provided by the material are associated with each other for each of the plurality of material names. Compatibility between the materials is a reference that reflects the material properties, which are taken into consideration when forming a thin film.
[0049] For example, among the materials used in the consecutive processes or the same process, the materials that have good compatibility in terms of the order of manufacturing a thin film and that have actually been used in similar procedures are defined as having good compatibility. In contrast, the materials that have poor compatibility in terms of the order of manufacturing a thin film and that have never been actually used in similar procedures are defined as having poor compatibility. There is a certain ordering in selecting film-forming materials, and information reflecting this ordering is the compatibility between the materials. The first learning data is, for example, data in which information indicating compatibility is added to a combination of two materials as a correct answer label.
[0050] The learning data generation unit 13 divides text data that is included in the document data and that indicates a plurality of consecutive processes related to the film-forming process into segments each constituting one process. Further, when a material A in the preceding stage and a material B in the subsequent stage appear in the same process or the consecutive processes, the learning data generation unit 13 adds a label indicating good compatibility to the material A and the material B. The consecutive processes refer only to a case where a layer is first formed with the material A in the preceding stage, and the next layer is formed with the material B in the subsequent stage. If a layer is first formed with the material B in the subsequent stage, and the next layer is formed with the material A in the preceding stage in the consecutive processes, these materials are not deemed to have good compatibility. For example, while it is common to have a glass substrate as the material in the preceding stage and an etching solution as the material in the subsequent stage, it is impossible to have an etching solution as the material in the preceding stage and a glass substrate as the material in the subsequent stage as the manufacturing order.
[0051] Further, the learning data generation unit (second learning data generation unit) 13 generates a second learning data in which the individual material indicated by the plurality of material names and compatibility with the base (substrate) forming the thin film are associated with each other, based on the plurality of function names extracted by the first extraction unit 11, the plurality of material names extracted by the second extraction unit 12, and extraction-source document data. For example, a conductive material is used for a heater film by Joule heat. Further, the same conductive material may be used as an electromagnetic shielding film. Each material contributes to achieving a function in accordance with an intended use.
[0052] As described above, the second learning data is data in which the function of each material extracted by the first extraction unit 11 is added to the material extracted by the second extraction unit 12 as a correct answer label. The first learning data and the second learning data generated by the learning data generation unit 13 are stored in the storage unit 15.
[0053] The learning processing unit 14 trains a learning model such as a machine learning model prepared in advance by using the learning data generated by the learning data generation unit 13 and constructs a trained model. For example, the learning processing unit 14 can perform supervised learning on a known machine learning model such as a multi-layer neural network including a recurrent neural network (RNN), an autoencoder, a convolutional neural network (CNN), and an LSTM network. Alternatively, the machine learning model to be trained can be set as desired, and not only supervised learning but also semi-supervised learning or the like can also be adopted.
[0054] More specifically, the learning processing unit (first learning processing unit) 14 trains a preset machine learning model using the first learning data and constructs a first learning model in which a relationship between a material and a function provided by the material is learned. For example, the learning processing unit 14 trains the multi-layer neural network to update and adjust a feature amount representing the compatibility between two materials, that is, a value of the configuration parameter of the multi-layer neural network and determines a final value. The first learning model constructed by the learning using the first learning data is stored in the first learning model storage unit 16.
[0055] Further, the learning processing unit (second learning processing unit) 14 trains a preset machine learning model using the second learning data and constructs a second learning model in which compatibility with the base forming the thin film is acquired by the learning.
[0056] The storage unit 15 stores the extraction data including the functions and materials of the thin film extracted from the document data by the first extraction unit 11 and the second extraction unit 12. In addition, the storage unit 15 stores the first learning data and the second learning data generated by the learning data generation unit 13. Further, the storage unit 15 stores information about preset machine learning models used by the learning processing unit 14 as learning targets.
[0057] The first learning model storage unit 16 stores the trained first learning model constructed by the learning processing unit 14. More specifically, the first learning model storage unit 16 stores values of weight parameters of the multi-layer neural network determined in the learning processing by the learning processing unit 14, etc.
[0058] The second learning model storage unit 17 stores the trained second learning model constructed by the learning processing unit 14.
[0059] The presentation unit (output unit) 18 can present the extraction data indicating the “material” and the “function” extracted from each of the document data sets by the first extraction unit 11 and the second extraction unit 12 and the trained first learning model and second learning model obtained in the learning processing by the learning processing unit 14 to an external server (not illustrated) or the like.
[0060] Hardware Configuration of Materials Development Support Apparatus
[0061] Next, an example of a computer configuration that implements the materials development support apparatus 1 having the above-described functions will be described with reference to
[0062] As illustrated in
[0063] A program for causing the processor 102 to perform various controls and calculations is stored in the main storage device 103 in advance. The processor 102 and the main storage device 103 implement each function of the materials development support apparatus 1 including the first extraction unit ii, the second extraction unit 12, the learning data generation unit 13, and the learning processing unit 14 illustrated in
[0064] The communication I/F 104 is an interface circuit for performing communication with various external electronic devices via a communication network NW.
[0065] As the communication I/F 104, for example, a communication control circuit and an antenna corresponding to wireless data communication standards such as 3G, 4G, 5G, a wireless LAN, and Bluetooth (registered trademark) are used.
[0066] The auxiliary storage device 105 is composed of a readable and writable storage medium and a drive device for writing and reading various kinds of information such as programs and data to and from the storage medium. A semiconductor memory such as a hard disk or a flash memory can be used as the storage medium of the auxiliary storage device 105.
[0067] The auxiliary storage device 105 has a program storage area for storing programs for causing the materials development support apparatus 1 to perform material development support processing including extraction processing, learning data generation processing, and learning processing. The auxiliary storage device 105 implements the storage unit 15, the first learning model storage unit 16, and the second learning model storage unit 17 described with reference to
[0068] The input-output I/O 106 is composed of I/O terminals that input a signal from the external device and output a signal to the external device.
[0069] The input device 107 is composed of a keyboard, a touch panel, or the like, receives an operation input from the outside, and generates a signal corresponding to the operation input.
[0070] The display device 108 is implemented by a liquid crystal display or the like.
[0071] Example of Specific Configuration of Materials Development Support Apparatus
[0072] An example of a specific configuration of the materials development support apparatus 1 having the above-described configuration will be described with reference to a block diagram in
[0073] The server 100 includes, for example, the document DB 10, the first extraction unit ii, the second extraction unit 12, and the learning data generation unit 13 described with reference to
[0074] The server 200 includes, for example, the learning processing unit 14, the first learning model storage unit 16, and the second learning model storage unit 17 described with reference to
[0075] The servers 100 and 200 are implemented by a computer configuration including a processor, a main storage device, a communication I/F, and an auxiliary storage device as described with reference to
[0076] As described above, the materials development support apparatus 1 according to the present embodiment can be implemented by the configuration in which each function illustrated in
[0077] Materials Development Support Method
[0078] Next, an operation performed by the materials development support apparatus 1 having the above-described configuration will be described with reference to
[0079] The materials development support apparatus 1 according to the present embodiment trains individually two machine learning models such as multi-layer neural network and constructs a trained first learning model and a trained second learning model. As illustrated in
[0080] Outline of Materials Development Support Method
[0081] First, an outline of the operation performed by the materials development support apparatus 1 according to the present embodiment will be described with reference to a flowchart in
[0082] As illustrated in
[0083] Next, the learning data generation unit 13 generates first learning data indicating the function provided by the material and second learning data indicating the compatibility between two materials based on the words indicating the “materials” and the “functions” extracted in step S1 and the extraction-target document data (step S2).
[0084] Next, the learning processing unit 14 trains a predetermined machine learning model using the first learning data generated in step S2 and outputs a trained first learning model, and the learning processing unit 14 also trains a predetermined machine learning model using the second learning data and outputs a trained second learning model (step S3). More specifically, the learning processing unit 14 constructs a first learning model in which the compatibility between the materials is learned and a second learning model in which the relationship between the material and the function is learned.
[0085] Next, the trained first learning model and the trained second learning model are stored in the first learning model storage unit 16 and the second learning model storage unit 17, respectively (step S4).
[0086] Extraction Processing
[0087] Next, a specific example of extraction processing performed by the first extraction unit 11 and the second extraction unit 12 will be described with reference to
[0088] As illustrated in
[0089] As illustrated in
[0090] Since a plurality of processes are performed when a multi-layer film is formed, the second extraction unit 12 extracts a material name used in each process and creates the extraction data in the intermediate file. The second extraction unit 12 performs the extraction processing on a paragraph of “experimental method” or the like included in paper data.
[0091] The first extraction unit 11 extracts a word related to a preset function, for example, “wettability”, “conductivity”, and the like (“liquid repellency (F1)”, “transparency (F3)”, etc. illustrated in
[0092] Hereinafter, the extraction processing performed by the first extraction unit 11 and the second extraction unit 12 and implemented by the processor 102 will be described with reference to a flowchart illustrated in
[0093] First, the processor 102 opens the intermediate file in which the extraction results are recorded (step S100). Next, the processor 102 starts 100p processing in which the processing from step S102 to step S113 are repeatedly performed on all of the plurality of paper data stored in the document DB 10 (step S10i).
[0094] Next, the processor 102 acquires one of the paper data sets from the document DB 10 and edits the intermediate file opened in step S100 (step S102). More specifically, as illustrated in “intermediate file Dim” in
[0095] Next, the processor 102 identifies a paragraph related to an experiment included in the paper data and repeatedly performs the processing from step S104 to step S109 on each sentence from the first to the last in the paragraph (step S103). For example, information that can identify the paragraph of “experimental method” and the paragraph of “summary” is previously given to the corresponding paragraph in each of the paper data sets stored in the document DB 10.
[0096] Next, the processor 102 identifies the paragraph of the experiment included in the paper data and extracts a sentence related to film formation (step S104). For example, the processor 102 performs the extraction in order from the first sentence of the paragraph of “experimental method” included in the paper data.
[0097] If the extraction target sentence includes a preset word related to film formation (step S104: YES), the processor 102 increments (+1) the value of the P column in the intermediate file (step S105). In contrast, if the extraction target sentence does not include a preset word related to film formation (step S104: NO), the processing proceeds to step S111 via connector B.
[0098] Next, the processor 102 repeatedly performs the processing in step S107 and step S108 until the end of one extraction target sentence (step S106). More specifically, the processor 102 converts the film formation-related material name included in one extraction target sentence into a uniform material name such as an IUPAC name (step S107).
[0099] Next, the processor 102 edits the intermediate file (step S108). More specifically, the processor 102 adds one row to the intermediate file and writes a material number corresponding to the material in the M column as illustrated in
[0100] When a plurality of materials are included in one sentence, the processor 102 adds a row for each of the materials and edits the intermediate file. For example, the second and third rows of the intermediate file illustrates in
[0101] [our] After the processor 102 repeatedly performs the processing in step S107 and S108 until the end of one sentence (step S109), the processing proceeds to step Silo via connector A, and the processing from step S104 to step S109 is further performed until the end of the paragraph of “experimental method” included in the paper data (step Silo).
[0102] Next, the processor 102 searches a specified paragraph such as the paragraph of “summary” in the paper data, from which the material names have been extracted, for a function name corresponding to a search condition, and if the matching function name is found (step S112: YES), the processor 102 edits the intermediate file (step S113).
[0103] More specifically, the processor 102 writes 1 in the F column indicating the function in the processing target paper data set having the same title. If no function name is hit in the search (step S112: NO), the value in the F column is set to 0. For example, as illustrated in
[0104] Next, the processor 102 executes searches for all of the plurality of preset function names (step S114). Further, when the above processing has been performed on all the paper data sets stored in the document DB 10 (step S115), the processor 102 closes the intermediate file (step S116).
[0105] [Learning Data Generation Processing]
[0106] Next, a specific example of learning data generation processing by the learning data generation unit 13 implemented by the processor 102 will be described with reference to
[0107] As illustrated in
[0108] The first learning data is learning data in which the materials and the functions are stored in association with each other. The learning data generation unit 13 extracts the material number (M), the material composition (C), and the function (F) stored in the intermediate file to generate the first learning data.
[0109] As the data structure of the second learning data, a material number (M) and material composition (C) of two materials and compatibility are set. The “compatibility” is defined as 1 for two materials used in the consecutive processes or the same process and 0 for the other cases. The “compatibility” reflects, for example, the properties of the material to be considered during the film formation.
[0110] Specific examples are as follows: i) a film of a negatively charged material can be formed on a positively charged surface so that this combination is likely to be used consecutively, whereas, a film of a positively charged material is difficult to be formed on a positively charged surface so that this combination is rarely used consecutively; ii) in addition, a hydrophobic material is easily adopted to a hydrophobic surface due to hydrophobic group-hydrophobic group interaction so that this combination is likely to be used simultaneously; iii) a material having a thiol group and a material having a vinyl group are likely to be used consecutively due to thiol-ene reaction. The compatibility between the two materials reflects a certain ordering applied when such a film-forming material is selected.
[0111] Next, the generation processing of the second learning data illustrated in
[0112] As illustrated in
[0113] Next, the processor 102 randomly selects two materials from the N materials and repeats processing in which one of the materials is set as a material A in a preceding stage and the other is set as a material B in a subsequent stage for (NC2×2!) times (step S202). The processor 102 generates second learning data illustrated in
[0114] Next, if the value of the compatibility between the material A in the preceding stage and the material B in the subsequent stage selected in step S202 is 0 in the second learning data (step S203: YES), the processor 102 determines whether the material A in the preceding stage and the material B in the subsequent stage are used in the same process or the consecutive processes based on the values in the P column of the intermediate file (step S204). If the material A and the material B have the P-column values indicating the same process or the consecutive processes (step S204: YES), the value of the “compatibility” of the corresponding row and column in the second learning data is changed to “1” (step S205).
[0115] In contrast, if the compatibility between the material A and the material B is 1 in the second learning data (step S203: NO), the processing proceeds to step S206. In addition, in step S204, if the material A in the preceding stage and the material B in the subsequent stage are not in the same process or consecutive processes in the intermediate file (step S204: NO), the processing also proceeds to step S206. That is, the processor 102 does not change the value of the compatibility between the material A in the preceding stage and the material B in the subsequent stage in the second learning data.
[0116] Next, the processor 102 repeatedly performs the processing from step S203 to step S205 on the N materials for (NC2×2!) times, which is the total number of combinations (step S206). Further, after the values of the compatibility between the two materials have been updated for all the title numbers (numbers “1, 2, . . . ” in the T column) of the paper data sets in the intermediate file (step S207), the processing ends.
[0117] Learning Processing
[0118] Next, learning processing performed by the learning processing unit 14 will be described with reference to
[0119] The learning processing unit 14 trains a neural network NN2 by using the second learning data. As described above, the second learning data is data in which two materials and the compatibility between these two materials are associated with each other. In an example in
[0120] The learning processing unit 14 performs an operation of the neural network NN2 based on the material composition in the preceding stage given as an input, and adjusts, updates, and determines values of parameters such as weights so that the compatibility, which is a correct answer label, is output. In this way, the trained second learning model is obtained. The trained second learning model is a model in which the compatibility between the two materials in terms of a film-forming process is learned. The data structure of the input and output of the neural network NN2 is not limited to the example in
[0121] As illustrated in
[0122] The learning processing unit 14 performs an operation of the neural network NM based on the material composition (C) given as an input, and adjusts and determines parameters such as weights so that the function (F), which is a correct answer label, is output. In this way, the trained first learning model is obtained. The first learning model is a model in which the function corresponding to the material is learned. The data structure of the input and output of the neural network NM is not limited to the example in
[0123] As described above, the materials development support apparatus 1 according to the first embodiment extracts preset words indicating a film formation-related “material” and a “function” of the “material” from a large number of paper data sets related to film formation and generates extraction data. Further, the materials development support apparatus 1 generates second learning data indicating the compatibility between the two materials in terms of the film forming process based on the extraction data. Further, the materials development support apparatus 1 generates first learning data indicating the function corresponding to the material based on the extraction data.
[0124] Further, the materials development support apparatus 1 trains a machine learning model prepared in advance by using the first learning data to obtain a trained first learning model in which the function corresponding to the material is learned.
[0125] The materials development support apparatus 1 trains a machine learning model prepared in advance by using the second learning data to obtain a trained second learning model in which the compatibility between the two materials in terms of the film forming process is learned.
[0126] As described above, the materials development support apparatus 1 more effectively collects information about the film formation from a large amount of text data and learns the compatibility between the materials and the function corresponding to the material. Thus, the materials development support apparatus 1 can support the user to develop the film formation materials.
[0127] In addition, the materials development support apparatus 1 learns the feature amount of the function with relatively low mathematical relevance, such as transparency, liquid repellency, and conductivity, as the function corresponding to the material. Thus, the materials development support apparatus 1 can support the user to develop the film forming materials more effectively.
[0128] Further, the materials development support apparatus 1 generates the learning data from “experimental method”, “summary”, and the like included in paper data so that the materials development support apparatus 1 can easily generate the learning data.
Second Embodiment
[0129] Next, a second embodiment of the present invention will be described. In the following description, the same components as those in the first embodiment described above will be denoted by the same reference characters, and description thereof will be omitted.
[0130] In the first embodiment, the learning processing in which the first learning model in which the compatibility between materials related to film formation is learned and the second learning model in which a function corresponding to a material is learned are acquired by training the machine learning models prepared in advance has been described. In the second embodiment, inference processing is performed by using the first learning model and the second learning model that have been obtained by the learning processing.
[0131] In the inference processing performed by a materials development support apparatus 1A according to the present embodiment, as illustrated in
[0132] In this respect, in a conventional method for acquiring a design guideline for the multi-layer film mainly by experiment, as illustrated in
[0133] Functional Block of Materials Development Support Apparatus
[0134]
[0135] In addition to the functional units constituting the learning processing apparatus described in the first embodiment, the materials development support apparatus 1A includes a candidate data generation unit 19, an input data acquisition unit 20, an inverse analysis unit 21, a storage unit 22, and an output data generation unit 23 that constitute an inference processing apparatus. Hereinafter, a configuration different from that of the first embodiment will be mainly described.
[0136] The candidate data generation unit 19 inputs verification data including a preset verification target material to the trained first learning model, performs an operation of the first learning model, checks the function of each material, outputs a plurality of candidates for the function provided by the verification target material, and generates candidate data (“Dc” in
[0137] The input data acquisition unit 20 is data including information about a material of a substrate specified by the user and desired functions of the thin film that are received by the input device 107. The acquired input data (“Di” in
[0138] The inverse analysis unit 21 provides the input data and data of the material randomly selected from the candidate data as inputs to the second learning model, performs an operation of the second learning model, and outputs the materials that are likely to satisfy the user request, the order of layers, and a manufacturing method as outputs.
[0139] The storage unit 22 stores the candidate data generated by the candidate data generation unit 19. The storage unit 22 also stores the output by the inverse analysis unit 21.
[0140] The output data generation unit 23 generates data indicating the candidate for the structure of the multi-layer film output from the inverse analysis unit 21.
[0141] The presentation unit 18 can display the output data (“Dout” in
[0142] Inference Processing
[0143] Next, inference processing performed by the materials development support apparatus 1A having the above-described functional configuration will be described with reference to a flowchart in
[0144] As illustrated in
[0145] The candidate data is stored in the storage unit 22.
[0146] In addition, a material that is not stored in the intermediate file which is the extraction data, that is, a material that is not included in the paper data may be added to the verification data, and candidates for the function of such a material may be output in the candidate data. This may allow a completely new film to be presented as a candidate for the material development. The present embodiment makes it possible to present such a new film candidate since the material related to the film formation is grasped from various aspects, for example, by the functional group or the like.
[0147]
[0148] As described above, by generating the candidate data by using the trained first learning model, the material that is relatively less likely to satisfy the function specified in the input data acquired by the input data acquisition unit 20 can be eliminated in advance. Of course, a single material can have a plurality of functions, and if so, a machine learning algorithm that calculates the probability of each function can be used. In that case, since the probabilities are presented per function, determination processing can be performed by using a predetermined threshold. In this way, the candidate data generation unit 19 obtains candidate data, which are items of the materials corresponding to the function, by performing the operation of the trained first learning model.
[0149] Returning to
[0150] Next, the presentation unit 18 displays the output data generated by the output data generation unit 23 on the display screen (step S22).
[0151] Inverse Analysis Processing
[0152] First, an outline of inverse analysis processing will be described with reference to
[0153] As illustrated in
[0154] The random data selected from the candidate data includes the material randomly selected from the materials satisfying the functions specified by the user in the input data and is input to the trained second learning model as the material to serve as the first layer constituting the multi-layer film.
[0155] The neural network NN2 illustrated in
[0156] As described above, the substrate material specified by the user is input from the input data to the first layer L1 of the neural network NN2, and the material selected from the materials satisfying the functions specified by the user is input from the candidate data to the first layer L1 of the neural network NN2 as the material of the first layer of the multi-layer film. The neural network NN2 is a learning model that has learned the compatibility between the materials and outputs the compatibility between the input material of the substrate and the input material of the first layer of the multi-layer film by performing an operation of the neural network NN2.
[0157] When the inference result indicating that the input material of the substrate has good compatibility with the input material of the first layer of the multi-layer film is obtained from the output of the first layer L1 of the neural network NN2, an operation of the second layer L2 of the neural network NN2 is performed. In the second layer L2, the material of the first layer of the multi-layer film, which has good compatibility with the substrate material, and the material randomly selected from the materials satisfying the functions specified by the user in the candidate data to serve as the material of the second layer of the multi-layer film are provided as inputs. Likewise, the compatibility between the material of the first layer and the material of the second layer of the multi-layer film is output as the operation result of the neural network NN2, and if the output indicating that the compatibility between these materials is good is obtained, the operation of the neural network NN2 is repeatedly performed on each of the materials from the third layer until the N-th layer of the multi-layer film.
[0158] Next, the inverse analysis processing by the inverse analysis unit 21 implemented by the processor 102 will be described with reference to a flowchart in
[0159] First, the processor 102 acquires information indicating a material X of the substrate specified by the user from the input data (step S300). Next, the processor 102 repeatedly performs the inverse analysis processing from step S302 to step S305 a predetermined number of times (step S301). More specifically, the processor 102 acquires information indicating, for example, a material Y of the multi-layer film from the candidate data (step S302).
[0160] Next, the processor 102 provides the material X as the preceding stage process and the material Y as the subsequent stage process as inputs to the second learning model that has previously learned the material composition (C) of each material (step S303).
[0161] Next, the processor 102 performs an operation of the second learning model and obtains probability values for respective classes of “good compatibility” and “poor compatibility” between the material X and the material Y as outputs, and if the probability value of “good compatibility” is higher than the probability value of “poor compatibility” (step S304: YES), the processor 102 performs the operation of the second learning model by using the material Y in the subsequent stage as the material in the preceding stage (step S30.sub.5).
[0162] Next, the processor 102 performs the inverse analysis processing a predetermined number of times (step S306), and then generates output data (step S307). In contrast, in step S304, if the probability value of “poor compatibility” between the two materials is higher (step S304: NO), the processing proceeds to steps S307, and the processor 102 generates output data (step S307).
[0163] By performing the above processing, sequential candidates for the materials in the vertical direction from the substrate can be obtained as output data. In the example of the inverse analysis processing described with reference to
[0164] Further, in view of the temperature at the time of film formation, the solubility in the solvent, or the like, by giving constraints to the material selected from the candidate data in step S302, the materials to be input as candidates may be narrowed down in advance. The constraints are previously stored in the storage unit 22.
[0165] In addition to the above constraints, for example, the film thickness, roughness of the surface, porosity, etc. are also important factors for allowing the multi-layer film to exhibit the specified functions. Therefore, such information can be arranged to be taken into consideration upon selecting the material from the candidate data.
[0166] Specific Example of Configuration of Materials Development Support Apparatus
[0167] An example of a specific configuration of the materials development support apparatus 1A having the above-described configuration will be described with reference to the block diagram in
[0168] In addition, a flow indicated by a dashed line in
[0169] The server 100 includes, for example, the document DB 10, the first extraction unit 11, the second extraction unit 12, and the learning data generation unit 13 described with reference to
[0170] The server 200 includes, for example, the learning processing unit 14, the first learning model storage unit 16, the second learning model storage unit 17, the candidate data generation unit 19, the storage unit 22, and the inverse analysis unit 21 described with reference to
[0171] The servers 100, 200, and the communication terminal device 300 are implemented by a computer configuration including the processor, the main storage device, the communication I/F, and the auxiliary storage device described with reference to
[0172] As described above, the materials development support apparatus 1A according to the present embodiment can be implemented by the configuration in which each function illustrated in
[0173] Effects of Materials Development Support Apparatus
[0174] Next, effects of the materials development support apparatus 1A according to the present embodiment will be described with reference to
[0175]
[0176] The upper portion of
[0177] In the lower portion of
[0178] Further, as a result of the inverse analysis, output data (“output.txt”) suggesting that a film be formed with “trichlorovinylsilane”, “1H, 1H, 2H, 2H-perfluorodecanethiol”, and “perfluoroalkylether” in the vertical direction from the substrate can be obtained. This is a material selection result close to the manufacturing method in the one paper not used for the learning. Therefore, it can be said that this is a highly feasible solution.
[0179] In contrast, in the conventional example illustrated in the upper portion of
[0180] In other words, it can be said that the materials development support apparatus 1A according to the present embodiment is a technique that imitates one of the thinking methods that a human uses to develop a new technique by means of the inverse analysis using machine learning. Furthermore, not only imitating but also more rational material selection without depending on subjectivity or detection of the user can be achieved, and a comprehensive search can be performed even on a volume of the material combinations that is deemed to be impossible to handle manually.
[0181] As described above, according to the second embodiment, since the inverse analysis processing is performed by using the trained first learning model and the trained second learning model, a candidate for the design of a multi-layer film having a plurality of functions can be more easily presented.
[0182] In the embodiment described above, the case where the materials development support apparatus 1A includes the learning processing apparatus and the inference processing apparatus has been described with reference to
[0183] While the embodiments of the materials development support apparatus, the materials development support method, and the materials development support program according to embodiments of the present invention have thus been described, the present invention is not limited to the embodiments described above, and various modifications conceivable by those skilled in the art can be made within the scope of the invention recited in the claims. For example, the order of each step in the materials development support method is not limited to that described above.
REFERENCE SIGNS LIST
[0184] 1, 1A Materials development support apparatus
[0185] 10 Document DB
[0186] 11 First extraction unit
[0187] 12 Second extraction unit
[0188] 13 Learning data generation unit
[0189] 14 Learning processing unit
[0190] 15, 22 Storage unit
[0191] 16 First learning model storage unit
[0192] 17 Second learning model storage unit
[0193] 18 Presentation unit
[0194] 19 Candidate data generation unit
[0195] 20 Input data acquisition unit
[0196] 21 Inverse analysis unit
[0197] 23 Output data generation unit
[0198] 100, 200 Server
[0199] 300 Communication terminal device
[0200] 101 Bus
[0201] 102 Processor
[0202] 103 Main storage device
[0203] 104 Communication I/F
[0204] 105 Auxiliary storage device
[0205] 106 Input-output I/O
[0206] 107 Input device
[0207] 108 Display device