METHOD OF OBTAINING ARRAY OF PLURALITY OF SHOT REGIONS ON SUBSTRATE, EXPOSURE METHOD, EXPOSURE APPARATUS, METHOD OF MANUFACTURING ARTICLE, NON-TRANSITORY COMPUTER-READABLE STORAGE MEDIUM, AND INFORMATION PROCESSING APPARATUS
20230108056 · 2023-04-06
Inventors
Cpc classification
G03F7/705
PHYSICS
International classification
Abstract
A method including calculating, using an objective function, which includes a regression model used to estimate an array of a plurality of regions on a substrate and a regularization term used to limit a value of a coefficient of the regression model, a value of each of a plurality of coefficients included in the regression model, with which the objective function becomes not more than a reference value, extracting, based on the calculated values, the coefficient having the value not less than a threshold value from the plurality of coefficients, and obtaining, using a regression model including only the extracted coefficient, an array of a plurality of regions on a substrate.
Claims
1. A method of obtaining an array of a plurality of regions on a substrate, comprising: calculating, using an objective function, which includes a regression model used to estimate the array and a regularization term used to limit a value of a coefficient of the regression model, and position measurement data of a mark assigned to a sample region among a plurality of regions on a first substrate, a value of each of a plurality of coefficients included in the regression model, with which the objective function becomes not more than a reference value; extracting, based on the values calculated in the calculating, the coefficient having the value not less than a threshold value from the plurality of coefficients; and obtaining, using a regression model including only the coefficient extracted in the extracting and position measurement data of a mark assigned to a sample region among a plurality of regions on a second substrate, an array of the plurality of regions on the second substrate.
2. The method according to claim 1, wherein in the calculating, the value of each of the plurality of coefficients included in the regression model, with which the objective function is minimized, is calculated.
3. The method according to claim 1, wherein the number of sample regions on the second substrate is smaller than the number of sample regions on the first substrate.
4. The method according to claim 1, wherein the first substrate and the second substrate are different substrates.
5. The method according to claim 1, wherein the first substrate and the second substrate are the same substrate.
6. The method according to claim 1, wherein the regularization term includes an Lp norm.
7. The method according to claim 6, wherein the Lp norm includes one of an L1 norm and an L2 norm.
8. The method according to claim 1, wherein the regression model includes a polynomial model that includes a position of the substrate as a variable.
9. A method of obtaining an array of a plurality of regions on a substrate, comprising obtaining, using an objective function, which includes a regression model used to estimate the array and a regularization term used to limit a value of a coefficient of the regression model, the array from position measurement data assigned to a sample region among the plurality of regions.
10. The method according to claim 9, wherein in the obtaining, the array is obtained by calculating, using the objective function and the position measurement data, a value of each of a plurality of coefficients included in the regression model, with which the objective function becomes not more than a reference value.
11. The method according to claim 9, wherein in the obtaining, the array is obtained by calculating, using the objective function and the position measurement data, a value of each of a plurality of coefficients included in the regression model, with which the objective function is minimized.
12. The method according to claim 9, wherein the regularization term includes an Lp norm.
13. The method according to claim 12, wherein the Lp norm includes one of an L1 norm and an L2 norm.
14. The method according to claim 9, wherein the regression model includes a polynomial model that includes a position of the substrate as a variable.
15. An exposure method of exposing a substrate via an original, comprising: obtaining an array of a plurality of regions on the substrate using a method defined in claim 1; and transferring a pattern of the original to each of the plurality of regions while positioning the substrate based on the array obtained in the obtaining.
16. An exposure apparatus for exposing a substrate via an original, comprising: a processing unit configured to obtain an array of a plurality of regions on the substrate; and a stage configured to position the substrate based on the array obtained by the processing unit, wherein the processing unit calculates, using an objective function, which includes a regression model used to estimate the array and a regularization term used to limit a value of a coefficient of the regression model, and position measurement data of a mark assigned to a sample region among a plurality of regions on a first substrate, a value of each of a plurality of coefficients included in the regression model, with which the objective function becomes not more than a reference value, extracts, based on the calculated value, the coefficient having the value not less than a threshold value from the plurality of coefficients, and obtains, using a regression model including only the extracted coefficient and position measurement data of a mark assigned to a sample region among a plurality of regions on a second substrate, an array of the plurality of regions on the second substrate.
17. An exposure apparatus for exposing a substrate via an original, comprising: a processing unit configured to obtain an array of a plurality of regions on the substrate; and a stage configured to position the substrate based on the array obtained by the processing unit, wherein the processing unit obtains, using an objective function, which includes a regression model used to estimate the array and a regularization term used to limit a value of a coefficient of the regression model, the array from position measurement data assigned to a sample region among the plurality of regions.
18. A method of manufacturing an article, comprising: exposing a substrate using an exposure method defined in claim 15; developing the exposed substrate; and manufacturing the article from the developed substrate.
19. A non-transitory computer-readable storage medium storing a program configured to cause a computer to execute a method of obtaining an array of a plurality of regions on a substrate, the program causing the computer to execute calculating, using an objective function, which includes a regression model used to estimate the array and a regularization term used to limit a value of a coefficient of the regression model, and position measurement data of a mark assigned to a sample region among a plurality of regions on a first substrate, a value of each of a plurality of coefficients included in the regression model, with which the objective function becomes not more than a reference value; extracting, based on the values calculated in the calculating, the coefficient having the value not less than a threshold value from the plurality of coefficients; and obtaining, using a regression model including only the coefficient extracted in the extracting and position measurement data of a mark assigned to a sample region among a plurality of regions on a second substrate, an array of the plurality of regions on the second substrate.
20. A non-transitory computer-readable storage medium storing a program configured to cause a computer to execute a method of obtaining an array of a plurality of regions on a substrate, the program causing the computer to execute obtaining, using an objective function, which includes a regression model used to estimate the array and a regularization term used to limit a value of a coefficient of the regression model, the array from position measurement data assigned to a sample region among the plurality of regions.
21. An information processing apparatus executing a method defined in claim 1.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0009]
[0010]
[0011]
[0012]
[0013]
[0014]
[0015]
[0016]
DESCRIPTION OF THE EMBODIMENTS
[0017] Hereinafter, embodiments will be described in detail with reference to the attached drawings. Note, the following embodiments are not intended to limit the scope of the claimed invention. Multiple features are described in the embodiments, but limitation is not made to an invention that requires all such features, and multiple such features may be combined as appropriate. Furthermore, in the attached drawings, the same reference numerals are given to the same or similar configurations, and redundant description thereof is omitted.
[0018]
[0019] As shown in
[0020] The control unit CN is formed by, for example, a computer (information processing apparatus) including a CPU, a memory, and the like, and generally controls the units of the exposure apparatus 1 in accordance with a program stored in the storage unit SU or the like. In this embodiment, in addition to controlling exposure processing of exposing the substrate 4 via the original 2, the control unit CN functions as a processing unit configured to obtain an array (a shot array or a region array) of a plurality of shot regions on the substrate (a plurality of regions on the substrate).
[0021] The storage unit SU stores a program and various kinds of information (data) necessary to execute exposure processing of exposing the substrate 4 by controlling the units of the exposure apparatus 1. The storage unit SU also stores a program and various kinds of information (data) necessary for the control unit CN to obtain a shot array.
[0022]
[0023] Light from the light source 8 is reflected by the beam splitter 9 and illuminates, via the lens 10, an alignment mark 11 or 12 provided on the substrate 4. The light diffracted by the alignment mark 11 or 12 is received by the sensor 14 via the lens 10, the beam splitter 9, and the lens 13.
[0024] Exposure processing by the exposure apparatus 1 will be described with reference to
[0025] In step S103, fine alignment is executed. More specifically, first, based on the result of pre-alignment, the substrate stage 6 is driven to a position where the alignment mark 12 for fine alignment provided on the substrate 4 can be detected by the alignment optical system 7. Then, the alignment mark 12 provided in each of the plurality of shot regions on the substrate 4 is detected by the alignment optical system 7, thereby accurately obtaining the shift and the first-order linear component (magnification or rotation) of the entire substrate 4. At this time, the high-order deformation component of the substrate 4 can accurately be obtained by obtaining the positions of a number of shot regions. This makes it possible to obtain the accurate position of each shot region on the substrate 4, that is, the shot array.
[0026] In step S104, the substrate 4 is exposed. More specifically, after the fine alignment is executed, the pattern of the original 2 is transferred to each shot region on the substrate 4 via the projection optical system 3. In step S105, the substrate 4 is unloaded from the exposure apparatus 1.
[0027] In this embodiment, if a distortion occurs in the substrate 4, the high-order deformation component is corrected in the fine alignment of step S103. As a regression model used to estimate the shot array, a cubic polynomial model will be described as an example. However, the present invention is not limited to this. For example, as the regression model, an arbitrary degree model may be used, or a model (a triangle function model or a logarithmic model) other than a polynomial may be used.
[0028] If the deformation of the substrate 4 is expressed by a cubic polynomial model, the position deviations (ShiftX, ShiftY) of each shot region are represented by equations (1) below. Note that a position deviation of each shot region can also be considered as a correction value used to correct the position deviation.
where x and y are the positions of a shot region on the substrate 4. Coefficients k.sub.1 to k.sub.20 in equations (1) are decided from the actual position measurement data of each shot region on the substrate 4. Then, the position deviation of each shot region is obtained based on equations (1) in which the coefficients are decided.
[0029] To obtain the position measurement data, for example, as shown in
[0030] Processing of optimizing a regression model used to estimate the shot array on the substrate and obtaining the shot array using the regression model in this embodiment will described below with reference to
[0031] In step S502, a test substrate (first substrate) is used to acquire the first position measurement data. More specifically, the number of sample shot regions for the test substrate is set to a number which does not cause overfitting relative to the degree of freedom of the function model and is sufficient relative to the degree of freedom of the model. Then, the alignment mark assigned to each sample shot region on the test substrate is detected by the alignment optical system 7 to acquire the first position measurement data.
[0032] In step S504, coefficients of the regression model used to estimate the shot array are calculated (first step). More specifically, by using the first position measurement data acquired in step S502 and a regularized regression model, the value of each of a plurality of coefficients included in the regression model used to estimate the shot array, with which the regularized regression model becomes equal to or smaller than a reference value, for example, the regularized regression model is minimized, are calculated. Here, the regularized regression model is a function (objective function) including the regression model used to estimate the shot array and a regularization term for limiting the values of the coefficients of the regression model.
[0033] In step S506, based on the values of the coefficients of the regression model calculated in step S504, the coefficient having a value equal to or larger than a preset threshold value is extracted (selected) from the plurality of coefficients included in the regression model (second step). Note that as the threshold value, the same (common) threshold value may be set for all the coefficients included in the regression model, or a threshold value may be set for each of the plurality of coefficients included in the regression model.
[0034] In step S508, the second position measurement data is acquired using the substrate 4 (second substrate) as a substrate to be actually exposed. More specifically, the number of sample shot regions for the substrate 4 is set to be smaller than the number of sample shot regions set for the test substrate. Then, the alignment mark 12 assigned to each sample shot region on the substrate 4 is detected by the alignment optical system 7 to acquire the second position measurement data.
[0035] In step S510, the shot array (the array of the plurality of shot regions) on the substrate 4 is obtained (third step). More specifically, by using the regression model including only the coefficients extracted in step S506 and the second position measurement data acquired in step S508, the shot array of the substrate 4 (that is, the position deviation of each shot region on the substrate 4) is obtained. Here, the regression model including only the coefficients extracted in step S506 includes only important coefficients (models corresponding thereto) with unnecessary coefficients (models corresponding thereto) removed by regularized regression. Therefore, overfitting can be suppressed even with the small number of the sample shot regions (measurement points) set for the substrate 4.
[0036] As has been described above, according to this embodiment, overfitting can be suppressed and the shot array on the substrate 4 can be obtained with high accuracy while using a small number of measurement points and a function model of a high degree of freedom. Hence, the high-order deformation component can be corrected. Note that in this embodiment, the test substrate is used in step S502, but the substrate 4 as the substrate to be actually exposed may be used in place of the test substrate. In other words, instead of using different substrates in steps S502 and S508, the same substrate (the substrate to be actually exposed) may be used in steps S502 and S508.
[0037] The regularized regression model in this embodiment will be more specifically described below. First, as a comparative example, a general regression model including no regularization term, that is, without regularization, and its problem will be described.
[0038] As a simple example, assume a regression model up to the second order of the substrate coordinates. In order to obtain optimal coefficients (k.sub.1, k.sub.3, k.sub.5, k.sub.7, k.sub.9, k.sub.11) of the regression model, as represented by equations (2) below, the sum of the differences, each of which is the difference between position measurement data Yi and a predicted value Ti from the regression model, at all the data points is calculated. Note that in equations (2), i is an integer between 1 and n, and n is the number of data points:
Σ.sub.i=1.sup.n(Y.sub.i−T.sub.i).sup.2*T.sub.i=k.sub.1+k.sub.3x.sub.i+k.sub.5y.sub.i+k.sub.7x.sub.i.sup.2+k.sub.9x.sub.iy.sub.i+k.sub.11y.sub.i.sup.2 (2)
[0039] Equations (2) are generally called a loss function. Optimization of each coefficient of a general regression model without regularization can be replaced with an optimization problem of minimizing, as an objective function, the loss function including the above-described coefficients k.sub.1 to k.sub.11. However, in such optimization, since the magnitude of the variable is not limited, the coefficient serving as the variable (parameter) becomes infinitely large when minimizing the objective function. As the variable becomes larger, the possibility of overfitting generally increases, and correction errors of unmeasured regions increase.
[0040] To prevent this, this embodiment uses the regularized regression model. More specifically, as represented by equation (3) below, the regularized regression model is obtained by adding a regularization term with a model variable as a function to the loss function as a general regression model without regularization. In equation (3), the regularization term includes the monotonically increasing function of variables.
[0041] By providing the regularization term in the objective function as described above, a penalty is given to the larger variable. Thus, it is possible to suppress (prevent) the excessively large variable. In other words, overfitting can be suppressed in the objective function including the loss function and the regularization term.
[0042] An arbitrary monotonically increasing function of variables can be set as the regularization term. This embodiment will be described assuming that the regularization term includes an Lp norm of the variable vector, particularly, an L1 norm with p=1. Note that the regularization term is not limited to an L1 norm with p=1. For example, the regularization term can include the L2 norm with p=2. Further, the regularization term can include an Lp norm with p as an arbitrary number. Equations (4) below represent the Lp norm, the L1 norm, and the L2 norm. In equations (4), D is the number of variables to be used, and k is the value of each variable.
Lp norm: ∥k∥.sub.p=(Σ.sub.d=1.sup.D|k.sub.d|.sup.p).sup.1/p
L1 norm: ∥k∥.sub.1=(Σ.sub.d=1.sup.D|k.sub.d|)
L2 norm: ∥K∥.sub.2=(Σ.sub.d=1.sup.D(k.sub.d).sup.2).sup.1/2 (4)
[0043] The variable regularization effect depends on the type of norm. For the L1 norm, due to its nature, unnecessary coefficients tend to drop to zero, but the coefficients tend to be sparse.
[0044] With reference to
[0045] With reference to
[0046] In the case of using all assumed coefficients of the regression model, as shown in
[0047] In this embodiment, the example has been described in which the L1 norm is used as the regularization term, but the present invention is not limited to this. For example, an Lp norm or a linear combination of Lp norms may be used as the regularization term.
[0048] As shown in
[0049] A method of manufacturing an article according to the embodiment of the present invention is suitable for manufacturing an article, for example, a liquid crystal display element, a semiconductor element, a flat panel display, or a MEMS. The manufacturing method includes a process of exposing, using the above-described exposure apparatus 1 or exposure method, a substrate to which a photoresist is applied, and a process of developing the exposed photoresist. In addition, an etching process, an ion implantation process, and the like are performed for the substrate using the pattern of the developed photoresist as a mask, thereby forming a circuit pattern on the substrate. By repeating the processes of exposure, development, etching, and the like, a circuit pattern formed by a plurality of layers is formed on the substrate. In the post-process, dicing (processing) is performed for the substrate on which the circuit pattern is formed, and chip mounting, bonding, and inspection processes are performed. The manufacturing method can also include other known processes (for example, oxidation, deposition, vapor deposition, doping, planarization, and resist removal). The method of manufacturing an article according to this embodiment is advantageous in at least one of the performance, quality, productivity, and production cost of the article, as compared to conventional methods.
[0050] Embodiment(s) of the present invention can also be realized by a computer of a system or apparatus that reads out and executes computer executable instructions (e.g., one or more programs) recorded on a storage medium (which may also be referred to more fully as a ‘non-transitory computer-readable storage medium’) to perform the functions of one or more of the above-described embodiment(s) and/or that includes one or more circuits (e.g., application specific integrated circuit (ASIC)) for performing the functions of one or more of the above-described embodiment(s), and by a method performed by the computer of the system or apparatus by, for example, reading out and executing the computer executable instructions from the storage medium to perform the functions of one or more of the above-described embodiment(s) and/or controlling the one or more circuits to perform the functions of one or more of the above-described embodiment(s). The computer may comprise one or more processors (e.g., central processing unit (CPU), micro processing unit (MPU)) and may include a network of separate computers or separate processors to read out and execute the computer executable instructions. The computer executable instructions may be provided to the computer, for example, from a network or the storage medium. The storage medium may include, for example, one or more of a hard disk, a random-access memory (RAM), a read only memory (ROM), a storage of distributed computing systems, an optical disk (such as a compact disc (CD), digital versatile disc (DVD), or Blu-ray Disc (BD)™), a flash memory device, a memory card, and the like.
[0051] While the present invention has been described with reference to exemplary embodiments, it is to be understood that the invention is not limited to the disclosed exemplary embodiments. The scope of the following claims is to be accorded the broadest interpretation so as to encompass all such modifications and equivalent structures and functions.
[0052] This application claims the benefit of Japanese Patent application No. 2021-163069 filed on Oct. 1, 2021, which is hereby incorporated by reference herein in its entirety.