Determination of Imaging Transfer Function of a Charged-Particle Exposure Apparatus Using Isofocal Dose Measurements
20240427254 ยท 2024-12-26
Assignee
Inventors
- Christoph Spengler (Vienna, AT)
- Wolf Naetar (Vienna, AT)
- Johannes Leitner (Vienna, AT)
- Elmar Platzgummer (Vienna, AT)
Cpc classification
G03F7/705
PHYSICS
G03F7/70508
PHYSICS
G03F7/706837
PHYSICS
G03F7/70625
PHYSICS
G03F7/706849
PHYSICS
G03F7/706839
PHYSICS
G03F7/706845
PHYSICS
International classification
Abstract
A method for determining parameters of an imaging transfer function (point spread function) is presented. With regard to a model that describes the imaging transfer function including a number of model parameters, a test substrate is exposed and developed using a test pattern which comprises multiple sub-patterns that are based on the same sub-pattern template but with varying control width of a feature in the template, such as the width of a line or a distance between lines. On the test substrate, isofocal dose measurements are performed using the structures thus formed on a test substrate with varying control and imaging parameters. The isofocal dose thus determined are utilized to determine the model parameters of the imaging transfer function.
Claims
1. A method for determining an imaging transfer function of a charged-particle exposure apparatus during exposure of a target positioned in a target plane of said apparatus, said imaging transfer function describing the distribution of dose or energy generated at the target plane resulting from a single active element in a pattern definition device of the charged-particle exposure apparatus when said single active element is imaged to a substrate in the charged-particle exposure apparatus, the method comprising the steps of i. providing a model of the imaging transfer function, said model including at least one function parameter to be determined, ii. selecting a set of imaging properties, including at least one of a beam blur and a beam focus, which are adjustable through modifying pre-defined imaging parameters of the charged-particle apparatus, other than a base exposure dose describing an overall intensity of the imaging transfer function; iii. exposing, using the exposure apparatus, a test substrate with a test pattern and developing the test substrate to produce a test structure on said at least one test substrate, wherein the test pattern comprises a plurality of sub-patterns each of which is a copy of a sub-pattern template modified according to at least one control parameter, said at least one control parameter varying across the sub-patterns of the plurality of sub-patterns within a defined parameter range, and wherein the test pattern is exposed to the test substrate a number of times with the base exposure dose and at least one imaging parameter of the charged-particle apparatus being varied, to produce a number of test pattern copies on the substrate, the test structure thus produced comprising a plurality of sub-structures, each sub-structure being associated with specific values of imaging parameters, the base exposure dose, and said at least one control parameter; iv. evaluating the sub-structures with respect to at least one measurable quantity, including a critical dimension of features in the sub-structure; v. determining, for each value of the at least one control parameter, the variation of said at least one measurable quantity between the sub-structures as a function of the imaging parameters, and determining, from said variation, a respective value of isofocal dose where the variation is minimally variant with respect to the changes in the imaging parameters, vi. calculating, using the values of isofocal dose determined in step v as function of the at least one control parameter the at least one function parameter of the imaging transfer function.
2. The method of claim 1, wherein the measurable quantity in steps iv and v includes a critical dimension of a feature of interest in the sub-structures.
3. The method of claim 1, wherein the imaging transfer function is modeled as weighted sum of radially symmetric Multi-Gaussian functions, said sum including at least three Gaussian components as summands, and in step vi the weights and/or length scales of at least one of said summands are determined.
4. The method of claim 3, wherein the imaging transfer function includes a Multi-Gaussian function comprising at least one mid-range component having a weight and a length scale as parameters that are determined in step vi, wherein the length scale corresponds to a width constrained to a range between 200 nm and 2 m.
5. The method of claim 1, wherein the method further includes a step of ii. calculating, in terms of the model provided in step i and the at least one function parameter thereof, a model calculation of said at least one measurable quantity as a function of said subset of the imaging and control parameters and determining the values of the parameters of said subset where said model calculation predicts said at least one measurable quantity to be stationary with respect to said parameters, which step ii is performed before step vi, and step vi includes performing a least-squares fit of said model calculation to a course of minimal variation to obtain final parameters of the imaging transfer function.
6. The method of claim 5, wherein the fitting in step v is performed by finding an optimal value of an evaluation function including a weighted sum of squares of differences between the values of parameters in the model calculation and the course of minimal variation.
7. The method of claim 6, wherein the evaluation function is augmented with a regularization term, said regularization term including the first and/or second radial derivatives of the imaging transfer function and/or the magnitude (L2) or sum of absolute values of a vector of imaging transfer functions (L1).
8. The method of claim 1, wherein different values of beam blur are generated by physically defocusing the beam by means of modulation of appropriate electrostatic voltages of lens and/or multi-pole lens components of an imaging system of the charged-particle exposure apparatus.
9. The method of claim 1, wherein different values of beam blur are generated by modulating the pattern to emulate an increased blur.
10. The method of claim 1, wherein the sub-pattern template is selected from one of the following: a single line, wherein the control parameter is the width of line; a triple line structure comprising a center line surrounded by two outer lines, wherein the control parameter is the width of the two outer lines; a triple line structure comprising a center line surrounded by two outer lines, wherein the control parameter is the distance of the two outer lines from the center line; or a combination of thereof, and the measurable quantity in the resulting sub-structure is the width of the single line or center line, respectively.
11. An exposed substrate comprising a test structure on at least one test substrate exposed in a charged-particle exposure apparatus according to steps i to iii of the method of claim 1, the test structure comprising a plurality of sub-structures, said sub-structures being formed using copies of the same underlying sub-pattern template modified according to a control parameter varying across the sub-patterns.
12. The substrate of claim 11, further comprising multiple sub-structures which have been formed in said charged-particle exposure apparatus by applying respective values of imaging parameters, said values being different between each of said multiple sub-structures.
13. The substrate of claim 11, wherein the underlying sub-pattern template comprises one of the following: a single line, wherein the control parameter is the width of line; a triple line structure comprising a center line surrounded by two outer lines, wherein the control parameter is the width of the two outer lines; a triple line structure comprising a center line surrounded by two outer lines, wherein the control parameter is the distance of the two outer lines from the center line; or a combination of thereof, and the measurable quantity in the resulting sub-structure is the width of the single line or center line, respectively.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0024] In the following, the present invention is illustrated by several embodiments described below in more detail with reference to the attached drawings. It is emphasized that the embodiments shown here are of illustrative character and are not to be construed as limiting the scope of the invention. The drawings schematically show:
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
DETAILED DESCRIPTION OF EMBODIMENTS OF THE INVENTION
[0036] The detailed discussion given herein is intended to illustrate the invention and exemplary embodiments thereof, as well as further advantageous developments. It will be evident to the skilled person to freely combine several or all of the embodiments and aspects discussed here as deemed suitable for a specific application of the invention. Throughout this disclosure, terms like advantageous, exemplary or preferred indicate elements or dimensions which are particularly suitable (but not essential) to the invention or an embodiment thereof, and may be modified wherever deemed suitable by the skilled person, except where expressly stated otherwise. It will be appreciated that the invention is not restricted to the exemplary embodiments discussed in the following, which are given for illustrative purpose and merely present suitable implementations of the invention.
[0037] In particular, even though the invention can be used in combination with virtually any charged particle lithographic apparatus, it will be discussed in the exemplary context of electron-beam devices for lithographic mask manufacturing. In particular, in the lithography apparatus of
[0038] The invention aims at certain improvements of the correction of the proximity effect in electron beam lithography, which is caused by the interaction of the electron beam and the resist and substrate employed for the writing process. In particular, the invention aims at a method to determine a point spread function that is suitable for use in proximity effect correction.
[0039] Proximity effect correction (PEC), which adjusts the pattern to be exposed or its exposure dose amount to account for additional dose from backscattered electrons, is a well-established technique in electron beam lithography, see, for instance, U.S. Pat. No. 5,241,185, 6,815,693 or 7,511,290.
[0040] For this purpose, it is known to model the electron-substrate interaction as an exposure intensity distribution function (or point spread function), which describes an imaging transfer function from a single element of a pattern definition device (which single element has minimal lateral extension, ideally point-like) to the target plane; this exposure intensity distribution function is then convolved with the pattern to obtain the dose distribution on the target.
[0041] For CoG (Chrome on Glass) or OMOG (Opaque MoSi on Glass) photomasks used in 193 nm immersion lithography, a typical choice of point spread function is a two-component Multi-Gaussian
where G.sub. is a forward-scattering component with range (in the order of 20 nm) and weight normed to 1 and G.sub. a backscattering component with range (in the order of 10 m) with weight or backscattering ratio it (with typical values in the range of 0.3-0.8) and
a (rotationally symmetric) Gaussian with integral normed to 1.
[0042] For reticles used in Extreme Ultraviolet (EUV) lithography in particular, a two-Gaussian model is usually not sufficient, due to more complex backscattering effects generated by the thick Mo/Si multilayer structures found on EUV mask blanks (see H. Tanabe et al. in Proc. SPIE Vol. 7748, Photomask and Next-Generation Lithography Mask Technology XVII, 774823; available at https://doi.org/10.1117/12.862641). Instead, a model with more Gaussian components is suitable. For instance, a triple Gaussian model
is utilized, where G.sub. is a mid-range scattering component with range (typically in the order of 400 nm) with corresponding weight (around 0.2).
[0043] One prior-art approach determines the point spread function by imposing and fitting an exposure model (which may include development and etch effects) to values of critical dimensions (CDs) generated and measured for variable dose or pattern (see, for instance, P. Hudek et al. in J. Micro/Nanopattern. Mats. Metro. 20(4) 041402; available at https://doi.org/10.1117/1.JMM.20.4.041402).
[0044] Furthermore, in prior art, the use of an isofocal dose measurement has been suggested to determine and compare process windows in electron beam lithography (see e.g. K. Keil et al. in Microelectronic Engineering, Volume 85, Issues 5-6, pp. 778-781; available at https://doi.org/10.1016/j.mee.2008.01.042); however, this approach does not lend itself to determining a point spread function, in particular in relation to proximity effect correction.
[0045] The inventors suggest a novel method to determine an imaging transfer function, which is illustrated below relating to an example of a multi-component point spread function (PSF) in electron-beam lithography, using for instance a Multi-Gaussian PSF as introduced above, based on measurements of the isofocal dose for a test pattern containing a range of sub-patterns.
[0046] The flow-chart of
Test Patterns
[0047] This section discusses several examples of test patterns which each comprise a set of respective sub-patterns suitable for the invention, with emphasis on lines of variable width and distance; it is to be noted, however, that the concepts introduced by the inventors can readily be translated to other types of test patterns such as dots/contacts/rectangles, or even more complex patterns.
[0048]
[0049]
[0050] In
[0051] In all variants, the length of the lines should favorably be longer than 3 backscattering ranges, which simplifies the model calculations in that they can be performed one-dimensionally in this case. The variable widths and spaces should favorably correspond to the range of the point spread function to be determined, and a higher measurement density of control widths typically leads to more accurate results. To determine the isofocal dose for each sub-pattern, it may be necessary to expose multiple copies of the sub-patterns with variable configurations of dose and blur. Furthermore, it may increase the accuracy to obtain several such measurements from the features of interest (e.g. on different positions along each feature of interest), which can be averaged to ensure low measurement noise or, in the case of a multi-beam exposure apparatus, sample over the beam field (over which the blur may vary).
[0052] Also, several types of sub-patterns (e.g. of the types shown in
[0053] In all variants the distance between the sub-patterns P.sub.1, . . . , P.sub.N will advantageously be chosen sufficiently large so as to avoid mutual influence by backscattering or other unwanted effects.
[0054]
[0055] Without loss of generality, we stipulate the lines to be oriented along the y-direction (vertically in the drawings) in the embodiments of the invention discussed here and below.
Isofocal Dose
[0056] The concept of isofocal dose is explained with reference to
[0057] In U.S. Pat. Nos. 9,520,268 and 9,373,482, in the context of a charged-particle multi-beam mask writer, the applicant introduced a technique to emulate physical beam blur by adjustment of the exposure pattern. Using this method, as illustrated in
[0058] One suitable procedure for determining, experimentally, the isofocal dose for a given pattern or sub-pattern (which often will also depend on the pattern density) uses so-called Bossung plots, which plot the variation of CD, against change of blur for a plurality of candidate dose values. The procedure is explained with reference to
[0059] The procedure described above may also be performed with emulated pixel-based blur instead of physical beam blur, as illustrated in
[0060] It is to be noted that, generally, there are further parameters which may influence the value of the isofocal dose, such as the pattern density, which may be expressed or simulated by means of a suitable control width of test patterns. Thus, the Bossung plots of
[0061] The inventors found that it is often advantageous because of reduced complexity to use global dose modulations for the above, so a global isofocal dose is determined, i.e. the whole sub-patterns including the background generating features are exposed at a constant dose level (so the dose background due to backscattering also scales with the chosen dose level). Variations of the described method may also determine local isofocal doses, in which only the dose at the edge of the feature of interest is modulated; this change, however, has to be accounted for in the post-processing steps as further discussed below.
Point Spread Functions
[0062] In most embodiments, the invention starts from assuming that an imaging transfer function such as a point-spread function, which is to be determined by the invention, is rotationally symmetric and comprises forward- and back-scattering components, that is
[0063] In many embodiments corresponding a typical use-case of the invention, a Multi-Gaussian PSF is determined. Then, we have
for Gaussians with weights .sub.1, . . . , .sub.K and ranges .sub.1, . . . , .sub.K, which, alongside the forward scattering range , are the unknown parameters (summarized under the symbol C) of the point spread function. The integral of the PSF over the target area is normalized to 1. Furthermore, the forward-scattering weight is fixed to 1, and thus it corresponds to the D50-dose for lines and spaces of equal width (50% pattern density), which is independent of blur, width and spacing. The D50-dose is usually determined separately and then used to normalize other dose values after measurement to obtain relative doses. For instance, the D50-dose may be determined by including lines and spaces of equal width (or other suitable features of uniform width) in the test pattern and performing a determining procedure for the isofocal dose as described above based on the corresponding CD measurements.
[0064] The method of the invention can also be utilized to determine point spread functions defined by a piecewise polynomial of the radius, for instance, a cubic spline PSF. The use of point spread functions of this type for proximity effect correction has been suggested, e.g. in U.S. Pat. No. 10,553,394. For least-squares fitting purposes, the use of B-Splines (a set of Basis functions for a given space of spline functions) is favorable. B-Splines are readily constructed with routines in standard numerical libraries, such as scipy or PPPACK. Splines and B-Splines may be used to model PSF functions that are more general than Gaussians or Multi-Gaussians. To define a spline basis B.sub.1, . . . , B.sub.K of polynomial degree M basis with K=LM1 degree of freedom one first determines a set R of radial grid points r.sub.1r.sub.2 . . . r.sub.L. Sufficiently, the grid points are chosen uniformly spaced (cardinal B-Splines) with their knots lying in the range of the point spread function of interest (outside it is zero). The B-Spline functions are combined with a weighted sum
where the coefficients C=(c.sub.1, . . . , c.sub.K) are PSF parameters, to form a PSF component.
[0065]
[0066] To form a full PSF, multiple spline components of the above type with individual grids can be combined by summation and normalizing the integral to 1. For instance, a fine grid R.sub. (and coefficients C.sub.) for the forward scattering component S.sub.C.sub.
[0067] Here, (S)=S denotes the weight of a spline component, C=(C.sub.,C.sub.m,C.sub.b) is the vector of combined PSF coefficients. Multi-Gaussian and Spline PSFs can also be combined to form composite point spread functions, e.g. using Gaussians components G.sub., G.sub. for forward and long-range backscattering (with weights 1 and ) and a spline component S.sub.C.sub.
with parametrization C=(,,,C.sub.m).
[0068] In some embodiments of the invention, parts of some type of point spread function (or, equivalently, some its parameters) may be known already (e.g. the backscattering range ) or turn out to be not recoverable (independent) from the measured range of isofocal doses (e.g. the forward scattering range , which only modulates the isofocal dose for very small control widths w.sub.k<2). In such a case, it is generally a sufficient approach to insert values that are known from experiment or literature, as the skilled person will deem appropriate, and continue the procedure with these inserted values.
Numerical Isofocal Dose
[0069] Using an exposure model such as the threshold model mentioned above, it is possible to determine the isofocal dose for a given point spread function and sub-pattern, which can then be matched with the measured isofocal doses. For a threshold exposure model, the exposed dose profile d(x,y) may be simulated by convolution
of the binary indicator function of the sub-pattern P (which is 1 if in the pattern and 0 if not) with the PSF denoted as F. For the patterns presented above (see
where =.sub..sup.Fdy (also called marginal point spread function) and p(x)=P(x,y.sub.0) is the one-dimensional pattern in x-direction with fixed arbitrary y.sub.0. For a Multi-Gaussian PSF, the marginal point spread function is a one-dimensional Multi-Gaussian; for a spline or composite PSF, the marginal PSF can be determined numerically (e.g. by numerical integration).
[0070] The continuous convolution of eq. (6) may suitably be approximated by a discrete convolution of samples of the functions p and F in a sufficiently fine computational grid, e.g. with 0.1 nm resolution. The computation should, suitably, be performed over at least 3 times the maximum range of the point spread function (for a Multi-Gaussian) or over the support of a spline function.
[0071] In the threshold model, the isofocal dose D.sub.is can then be determined by choosing the dose under which the measured structure width varies the least under blur fluctuations, that is,
where .sub. is the marginal PSF with variable forward scattering range (e.g. taken in the set of test blurs A, which were also applied experimentally to the test pattern), and is a symbol for evaluating the convoluted pattern (p*.sub.a) to determine the area or width of the structure feature that is exposed when the exposure dose D is applied to the relevant sub-pattern. For instance, in terms of a threshold model, the evaluation
will yield the dimension (area or width) of those portions of the exposed structure that are at doses above the dose threshold D.sub.T. The minimum (dose of least variation) can be determined over a set of candidate doses either by numerical minimization or direct calculation.
Analytical Evaluation of Isofocal Dose
[0072] In some cases, it is possible to calculate the isofocal dose D.sub.is for the (one-dimensional) sub-pattern p analytically from the marginal PSF without having to calculate multiple exposure dose profiles by convolution (which is typically slow). To do so, in a first step, the dose background (for unit dose) at the line edge x.sub.e (which edge does not matter due to the left/right symmetry of the test pattern) of the feature of interest (e.g. isoline 51 or central line width of line triple 52, 53) is calculated by
that is, by integrating the backscattering part .sub.b=.sub..sup.F.sub.bdy of the marginal point spread function , localized at the pattern edge, over the pattern . For a generic point spread function, this calculation can be suitably performed on a computer by choosing a set of uniform grid points, summing sampled function values for grid points in the pattern, and multiplying with the grid step (or by other types of numerical integration).
[0073] For some types of PSFs a test pattern (9) can be evaluated using standard functions. For a Multi-Gaussian PSF (2) and isolines 51 as test pattern, for instance, we have
where erf is the Gauss error function.
[0074] In a second step, the isofocal dose is determined from the dose background. For lines of width w that are large relative to the forward scattering range (e.g. 3<w for a Multi-Gaussian PSF), it is well known (compare, e.g. M. Yu et al in Proc. SPIE 5853, Photomask and Next-Generation Lithography Mask Technology XII, available at https://doi.org/10.1117/12.617058) that the isofocal dose D.sub.is at the line edge x.sub.e is given by
in the threshold model, which is illustrated in
so the isofocal dose D.sub.is can be determined from the unit dose background b with
compare the graph in
Inverse Problem
[0075] The existence of an isofocal dose for a range of sub-patterns with variable control widths is a signature property of the point spread function, which allows its determination by solving an inverse problem. For a Multi-Gaussian PSF, for instance, each parameter uniquely changes the shape of the isofocal dose trend. This is illustrated in
[0076] The simulation approach of eq. (8), or alternatively the combination of the formulas (9) and (13), allows the determination of the isofocal dose
as a function of the given PSF F or its coefficients C (or a subset thereof, if they are partially known or cannot be determined from the measurement range) and sub-pattern control width w (such as one of the widths w.sub.1, . . . , w.sub.N of
[0077] This forward measurement function can be formally inverted to estimate the PSF coefficients C.sub.est from measured isofocal doses {circumflex over (D)}.sub.w1, . . . , {circumflex over (D)}.sub.w.sub.
[0078] To increase the stability of the fit procedure in the presence of noise (or if insufficiently many measurements are available), it may be advantageous to augment eq. (16) using a suitable regularization term, for instance of L1/L2-type
where e.g. p=1 (Lasso regularization) or p=2 (Tikhonov regularization), is a regularization parameter which determines the amount of regularization, and s a selection vector which chooses which PSF coefficients are regularized (e.g. weights only). If there are large number of parameters in the PSF model, Lasso regularization will usually be advantageous, since the regularization term typically forces the least significant parameters to be 0, so unweighted terms can be removed from the model.
[0079] A different variant penalizes variation or curvature of the point spread function F with
where m=1 or 2. This variant approach may be useful to avoid overfitting to measurement noise.
[0080] The minimization prescriptions (16), (17), (18) above are readily performed by state-of-the-art numerical packages such as the routines included in scipy (using least-squares routines or general purpose minimization methods).
[0081] An example of the fitting procedure is shown in