Method and apparatus for measuring asymmetry of a microstructure, position measuring method, position measuring apparatus, lithographic apparatus and device manufacturing method
09939742 ยท 2018-04-10
Assignee
Inventors
- Patricius Aloysius Jacobus Tinnemans (Hapert, NL)
- Arie Jeffrey Den Boef (Waalre, NL)
- Simon Gijsbert Josephus Mathijssen (Den Bosch, NL)
Cpc classification
International classification
Abstract
A lithographic apparatus includes a sensor, such as an alignment sensor including a self-referencing interferometer, configured to determine the position of an alignment target including a periodic structure. An illumination optical system focuses radiation of different colors and polarizations into a spot which scans the structure. Multiple position-dependent signals are detected and processed to obtain multiple candidate position measurements. Asymmetry of the structure is calculated by comparing the multiple position-dependent signals. The asymmetry measurement is used to improve accuracy of the position read by the sensor. Additional information on asymmetry may be obtained by an asymmetry sensor receiving a share of positive and negative orders of radiation diffracted by the periodic structure to produce a measurement of asymmetry in the periodic structure.
Claims
1. A method of measuring a geometric property, other than a position, of a structure, the method comprising: illuminating the structure with radiation and detecting radiation diffracted by the structure using a detector; processing signals representing the diffracted radiation to obtain a plurality of results related to the position of the structure, each result having the same form but being influenced in a different way by a variation in the property; and calculating a measurement of the property of the structure that is at least partially based on a difference observed among the plurality of results, wherein the calculating the measurement of the property uses the difference in combination with another result obtained using radiation diffracted by the structure, the other result not related to the position of the structure.
2. The method of claim 1, wherein the plurality of results includes results based on illumination and detection of radiation at different wavelengths.
3. The method of claim 1, wherein the plurality of results includes results based on illumination and detection of radiation at different polarizations.
4. The method of claim 1, wherein the plurality of results includes results based on different spatial frequencies within a position-dependent signal received by the detector.
5. The method of claim 4, wherein the structure has a form that is substantially periodic in one or more directions, and the different spatial frequencies correspond to different orders of diffraction by the periodic structure.
6. The method of claim 1, wherein the other result is obtained using another detector processing a different portion of the radiation diffracted by the structure at the same time as the detecting the radiation diffracted by the structure using the detector.
7. The method of claim 1, wherein the other result includes a result obtained from the same signals as the results related to the position of the structure.
8. The method of claim 1, wherein the property is an asymmetry related parameter of the structure.
9. A method of measuring a position of a periodic structure, the method comprising: illuminating the structure with radiation and detecting radiation diffracted by the structure using a detector; processing signals representing the diffracted radiation to obtain a plurality of results related to the position of the structure, each result having the same form but being influenced in a different way by a variation in a geometric property, other than the position, of the structure; calculating a measurement of the property of the structure that is at least partially based on a difference observed among the plurality of results; and calculating a measurement of the position of the structure using one or more of the plurality of results corrected in accordance with the measurement of the property.
10. The method of claim 9, wherein calculating the measurement of the position comprises applying corrections to two or more of the plurality of the results using the measurement of the property, followed by calculating the position measurement using one or more of the corrected results.
11. The method of claim 9, wherein calculating the measurement of the position comprises calculating a quality measure for each of the plurality of results and using the quality measures to determine to what degree each result contributes to the position measurement.
12. A method of manufacturing devices wherein a device pattern is applied to a substrate using a lithographic process, the method including positioning the applied pattern by reference to a measured position of a periodic structure formed on the substrate, the measured position obtained by the method of claim 9.
13. The method of claim 9, wherein the plurality of results includes results based on illumination and detection of radiation at different wavelengths.
14. The method of claim 9, wherein the plurality of results includes results based on illumination and detection of radiation at different polarizations.
15. The method of claim 9, wherein the plurality of results includes results based on different spatial frequencies within a position-dependent signal received by the detector.
16. The method of claim 15, wherein the different spatial frequencies correspond to different orders of diffraction by the periodic structure.
17. The method of claim 9, wherein the calculating the measurement of the property uses the difference in combination with another result obtained using radiation diffracted by the structure, but not related to the position of the structure.
18. The method of claim 17, wherein the other result is obtained using another detector processing a different portion of the radiation diffracted by the structure at the same time as the detecting the radiation diffracted by the structure using the detector.
19. The method of claim 17, wherein the other result includes a result obtained from the same signals as the results related to the position of the structure.
20. The method of claim 9, wherein the property is an asymmetry related parameter of the structure.
21. A lithographic apparatus comprising: a patterning subsystem configured to transfer a pattern to a substrate; a measuring subsystem configured to measure a position of the substrate in relation to the patterning subsystem, wherein the patterning subsystem is arranged to use the position measured by the measuring subsystem to apply the pattern at a desired position on the substrate, and wherein the measuring subsystem is configured to measure the position of the substrate using a periodic structure on the substrate and measure the position of the periodic structure by: illuminating the periodic structure with radiation and detecting radiation diffracted by the periodic structure using a detector; processing signals representing the diffracted radiation to obtain a plurality of results related to a position of the periodic structure, each result having the same form but being influenced in a different way by a variation in a geometric property, other than the position, of the structure; calculating a measurement of the property of the periodic structure that is at least partially based on a difference observed among the plurality of results; and calculating a measurement of the position of the periodic structure using one or more of the plurality of results corrected in accordance with the measurement of the property.
22. The apparatus of claim 21, wherein the property is an asymmetry related parameter of the structure.
23. An apparatus to measure a position of a structure, the apparatus comprising: a detecting arrangement configured to detect radiation diffracted by the structure using a detector; a processing arrangement configured to process signals representing the diffracted radiation to obtain a plurality of results related to a position of the structure, each result having the same form but being influenced in a different way by variation in a geometric property, other than the position, of the structure; a calculating arrangement configured to calculate a position of the structure using one or more of the results obtained by the processing arrangement, wherein the calculating arrangement is configured to include a correction in the calculated position in accordance with a measurement of the property of the structure, and wherein the calculating arrangement is configured to calculate the measurement of the property of the structure at least partially on the basis of a difference observed among the plurality of results.
24. The apparatus of claim 23, further comprising an illuminating arrangement arranged to illuminate the structure with radiation of a plurality of wavelengths, and wherein the detecting arrangement is configured to detect separately the radiation of the plurality of wavelengths and wherein the plurality of results obtained by the processing arrangement include a plurality of results obtained using radiation of different wavelengths.
25. The apparatus of claim 23, wherein the plurality of results obtained by the processing arrangement include a plurality of results corresponding to different diffraction orders in the diffracted radiation.
26. The apparatus of claim 25, arranged to scan the structure with the radiation and wherein the detecting arrangement includes an interferometer configured to generate a position dependent signal that varies as the structure is scanned with the radiation, and wherein the plurality of results corresponding to different diffraction orders are obtained by extracting different spatial frequency components from the position dependent signal.
27. The apparatus of claim 23, wherein the property is an asymmetry related parameter of the structure.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) Embodiments of the invention will now be described, by way of example only, with reference to the accompanying schematic drawings in which:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS
(15)
(16) The illumination system may include various types of optical components, such as refractive, reflective, magnetic, electromagnetic, electrostatic or other types of optical components, or any combination thereof, for directing, shaping, or controlling radiation.
(17) The support structure holds the patterning device in a manner that depends on the orientation of the patterning device, the design of the lithographic apparatus, and other conditions, such as for example whether or not the patterning device is held in a vacuum environment. The support structure can use mechanical, vacuum, electrostatic or other clamping techniques to hold the patterning device. The support structure may be a frame or a table, for example, which may be fixed or movable as required. The support structure may ensure that the patterning device is at a desired position, for example with respect to the projection system. Any use of the terms reticle or mask herein may be considered synonymous with the more general term patterning device.
(18) The term patterning device used herein should be broadly interpreted as referring to any device that can be used to impart a radiation beam with a pattern in its cross-section such as to create a pattern in a target portion of the substrate. It should be noted that the pattern imparted to the radiation beam may not exactly correspond to the desired pattern in the target portion of the substrate, for example if the pattern includes phase-shifting features or so called assist features. Generally, the pattern imparted to the radiation beam will correspond to a particular functional layer in a device being created in the target portion, such as an integrated circuit.
(19) The patterning device may be transmissive or reflective. Examples of patterning devices include masks, programmable mirror arrays, and programmable LCD panels. Masks are well known in lithography, and include mask types such as binary, alternating phase-shift, and attenuated phase-shift, as well as various hybrid mask types. An example of a programmable mirror array employs a matrix arrangement of small mirrors, each of which can be individually tilted so as to reflect an incoming radiation beam in different directions. The tilted mirrors impart a pattern in a radiation beam which is reflected by the mirror matrix.
(20) The term projection system used herein should be broadly interpreted as encompassing any type of projection system, including refractive, reflective, catadioptric, magnetic, electromagnetic and electrostatic optical systems, or any combination thereof, as appropriate for the exposure radiation being used, or for other factors such as the use of an immersion liquid or the use of a vacuum. Any use of the term projection lens herein may be considered as synonymous with the more general term projection system.
(21) As here depicted, the apparatus is of a transmissive type (e.g. employing a transmissive mask). Alternatively, the apparatus may be of a reflective type (e.g. employing a programmable mirror array of a type as referred to above, or employing a reflective mask).
(22) The lithographic apparatus may be of a type having two (dual stage) or more substrate tables (and/or two or more patterning device tables). In such multiple stage machines the additional tables may be used in parallel, or preparatory steps may be carried out on one or more tables while one or more other tables are being used for exposure. The two substrate tables WTa and WTb in the example of
(23) The lithographic apparatus may also be of a type wherein at least a portion of the substrate may be covered by a liquid having a relatively high refractive index, e.g. water, so as to fill a space between the projection system and the substrate. An immersion liquid may also be applied to other spaces in the lithographic apparatus, for example, between the mask and the projection system Immersion techniques are well known in the art for increasing the numerical aperture of projection systems. The term immersion as used herein does not mean that a structure, such as a substrate, must be submerged in liquid, but rather only means that liquid is located between the projection system and the substrate during exposure.
(24) Referring to
(25) The illuminator IL may comprise an adjuster AD for adjusting the angular intensity distribution of the radiation beam. Generally, at least the outer and/or inner radial extent (commonly referred to as -outer and -inner, respectively) of the intensity distribution in a pupil plane of the illuminator can be adjusted. In addition, the illuminator IL may comprise various other components, such as an integrator IN and a condenser CO. The illuminator may be used to condition the radiation beam, to have a desired uniformity and intensity distribution in its cross-section.
(26) The radiation beam B is incident on the patterning device (e.g., mask) MA, which is held on the support structure (e.g., mask table) MT, and is patterned by the patterning device. Having traversed the patterning device MA, the radiation beam B passes through the projection system PS, which focuses the beam onto a target portion C of the substrate W. With the aid of the second positioner PW and position sensor IF (e.g. an interferometric device, linear encoder or capacitive sensor), the substrate table WTa/WTb can be moved accurately, e.g. so as to position different target portions C in the path of the radiation beam B Similarly, the first positioner PM and another position sensor (which is not explicitly depicted in
(27) The depicted apparatus could be used in at least one of the following modes: 1. In step mode, the support structure MT and the substrate table WTa/WTb are kept essentially stationary, while an entire pattern imparted to the radiation beam is projected onto a target portion C at one time (i.e. a single static exposure). The substrate table WTa/WTb is then shifted in the X and/or Y direction so that a different target portion C can be exposed. In step mode, the maximum size of the exposure field limits the size of the target portion C imaged in a single static exposure. 2. In scan mode, the support structure MT and the substrate table WTa/WTb are scanned synchronously while a pattern imparted to the radiation beam is projected onto a target portion C (i.e. a single dynamic exposure). The velocity and direction of the substrate table WTa/WTb relative to the support structure MT may be determined by the (de-)magnification and image reversal characteristics of the projection system PS. In scan mode, the maximum size of the exposure field limits the width (in the non-scanning direction) of the target portion in a single dynamic exposure, whereas the length of the scanning motion determines the height (in the scanning direction) of the target portion. 3. In another mode, the support structure MT is kept essentially stationary holding a programmable patterning device, and the substrate table WTa/WTb is moved or scanned while a pattern imparted to the radiation beam is projected onto a target portion C. In this mode, generally a pulsed radiation source is employed and the programmable patterning device is updated as required after each movement of the substrate table WTa/WTb or in between successive radiation pulses during a scan. This mode of operation can be readily applied to maskless lithography that utilizes programmable patterning device, such as a programmable mirror array of a type as referred to above.
(28) Combinations and/or variations on the above described modes of use or entirely different modes of use may also be employed.
(29) Lithographic apparatus LA is of a so-called dual stage type which has two tables WTa and WTb and two stationsan exposure station and a measurement stationbetween which the tables can be exchanged. For example, while one substrate on one substrate table is being exposed at the exposure station, another substrate can be loaded onto the other substrate table at the measurement station so that various preparatory steps may be carried out. In an embodiment, one table is a substrate table and another table is a measurement table including one or more sensors. Preparatory steps may be performed at the measurement station such as mapping the surface of the substrate using a level sensor LS and/or measuring the position of one or more alignment markers on, for example, the substrate using an alignment sensor AS. Such preparatory steps enable a substantial increase in the throughput of the apparatus. If the position sensor IF is not capable of measuring the position of the table while it is at the measurement station as well as at the exposure station, a second position sensor may be provided to enable the positions of the table to be tracked at both stations.
(30) The apparatus further includes a lithographic apparatus control unit LACU which controls the movements and measurements of the various actuators and sensors described. Control unit LACU also includes signal processing and data processing capacity to implement desired calculations relevant to the operation of the apparatus. In practice, control unit LACU may be realized as a system of many sub-units, each handling the real-time data acquisition, processing and control of a subsystem or component within the apparatus. For example, one processing subsystem may be dedicated to servo control of the positioner PW. Separate units may even handle coarse and fine actuators, or different axes. Another unit might be dedicated to the readout of the position sensor IF. Overall control of the apparatus may be controlled by a central processing unit, communicating with these sub-systems processing units, with operators and with other apparatuses involved in the lithographic manufacturing process.
(31)
(32) Coarse and fine marks may be provided, so that the alignment sensor can distinguish between different cycles of the periodic signal, as well as the exact position (phase) within a cycle. Marks of different pitches can also be used for this purpose. These techniques are known to the person skilled in the art, and will not be detailed herein. The design and operation of such a sensor is known in the art, and each lithographic apparatus may have its own design of sensor. For the purpose of the present description, it will be assumed that the alignment sensor AS is generally of the form described in U.S. Pat. No. 6,961,116.
(33)
(34) Radiation scattered by mark 202 is picked up by objective lens 224 and collimated into an information-carrying beam 226. A self-referencing interferometer 228, such as of the type disclosed in U.S. Pat. No. 6,961,116 mentioned above, processes beam 226 and outputs separate beams (for each wavelength) onto a sensor array 230. Spot mirror 223 serves conveniently as a zero order stop at this point, so that the information carrying beam 226 comprises only higher order diffracted radiation from the mark 202 (this is not essential to the measurement, but improves signal to noise ratios). Intensity signals 232 from one or more individual sensors in sensor grid 230 are provided to a processing unit PU. By a combination of the optical processing in the block 228 and the computational processing in the unit PU, values for X- and Y-position of the substrate relative to the reference frame RF are output. Processing unit PU may be separate from the control unit LACU shown in
(35) As mentioned already, a single measurement of the type illustrated fixes the position of the mark within a certain range corresponding to one pitch of the mark. Coarser measurement techniques are used in conjunction with this to identify which period of the sine wave is the one containing the marked position. The same process at coarser and/or finer levels can be repeated at different wavelengths for increased accuracy, and for robust detection of the mark irrespective of the materials from which the mark is made, and in, on and/or below which it sits. The wavelengths can be multiplexed and demultiplexed optically so as to be processed simultaneously, and/or they may be multiplexed by time division or frequency division. Examples in the present disclosure will exploit measurement at several wavelengths to provide a practical and robust measurement apparatus (alignment sensor) with reduced sensitivity to mark asymmetry.
(36) Referring to the measurement process in more detail, an arrow labeled v.sub.W in
(37) As discussed in U.S. patent application publication no. US 2012-0212749, incorporated by reference herein in its entirety, high productivity requirements of the lithographic apparatus means that measurement of the alignment marks at numerous positions on the substrate should be performed as quickly as possible, which implies that the scanning velocity v.sub.W is fast, and the time T.sub.ACQ available for acquisition of each mark position is correspondingly short. In simplistic terms, the formula T.sub.ACQ=L/v.sub.W applies. US 2012-0212749 describes a technique to impart an opposite scanning motion of the spot, so as to lengthen the acquisition time. The same scanning spot technique can be applied in a sensor and method of the type newly disclosed herein, if desired.
(38) There is interest in aligning on marks with smaller grating pitches. The measured overlay in real production can be generally significantly larger than under controlled test conditions. This may be due to the alignment marks on product substrates becoming asymmetric to varying degrees during processing. Reducing the pitch of the alignment marks decreases the effect of some types of asymmetry on the measured alignment position.
(39) Some options to allow reduction of the pitch of an alignment grating include (i) shortening the wavelength of radiation used, (ii) increasing the numerical aperture (NA) of the alignment sensor optics and/or (iii) using off-axis illumination. A shorter wavelength is not always possible since alignment gratings are often located underneath an absorbing film (for example an amorphous carbon hard mask). Increasing the NA is in general possible but may not be preferred since there is a desire for a compact objective with a safe distance from the substrate. Therefore using off-axis illumination is attractive.
(40) Position Measurement with Off-axis Illumination
(41)
(42) An optical axis O which has several branches is indicated by a broken line running throughout the optical system 400. For ease of comparison with the schematic diagram of
(43) Additional components illustrated in this more detailed schematic diagram are as follows. In an illumination subsystem 440, radiation from source 420 is delivered via an optical fiber 442 to an illumination profiling optic 446. This delivers input beam 422 via beam splitter 454 to objective lens 424 having a pupil plane P. Objective lens 424 forms a spot 406 on alignment mark 202/204/210. Information-carrying beam 426, diffracted by the mark, passes through beam splitter 454 to interferometer 428. Interferometer 428 splits the radiation field into two parts with orthogonal polarization, rotates these parts about the optical axis by 180 relative to one another, and combines them into an outgoing beam 482. A lens 484 focuses the entire field onto a detector 430, which is an arrangement similar to the alignment sensor of
(44) Included in the present example is an asymmetry measuring arrangement 460. Arrangement 460 receives a part 464 of the information carrying beam 426 through a second beam splitter 462 positioned in advance of the interferometer 428. In the present disclosure, a novel technique for the measurement of asymmetry using position information obtained through the detector 430 is described. In principle, a dedicated asymmetry measuring arrangement 460 could be eliminated. However, in the particular embodiments described herein, the techniques are used to obtain additional information on asymmetry, that can be combined with the results of dedicated asymmetry measuring arrangement 460. This allows the apparatus user to improve further the accuracy of asymmetry information available, and thereby to enable a more accurate and/or more measurement of position.
(45) Illumination profiling optic 446 can take various forms, some of which are disclosed in more detail in US patent application no. U.S. 61/623,391, filed Apr. 12, 2012, the contents of which is incorporated herein its entirety by reference. In the examples disclosed therein, an alignment sensor (more generally, a position measuring apparatus) is shown which may allow the use of a reduced grating pitch without the need for spatial resolution on the detector side. By use of one or more novel illumination modes, the apparatus may be able to measure the position of a mark with a wide range of different pitches, for example from less than 1 m to about 20 microns, without changing the current detector design. A particular feature common to the examples described in U.S. 61/623,391 mentioned above, is the option to use off-axis illumination at a limited range of incidence angles (limited radial extent in the pupil plane). By off-axis illumination, it is meant that source regions of radiation are confined to a peripheral portion of the pupil, that is to say, some distance away from the optical axis. Confining the illumination to an extreme periphery of the pupil reduces the smallest possible pitch of the alignment mark from substantially /NA to substantially /2NA, where , is the wavelength of radiation used, and NA is the numerical aperture of an objective lens of the instrument (e.g. the alignment sensor or more generally the position measuring apparatus). The examples described in U.S. 61/623,391 also use a particular distribution of spot mirrors in a beam splitter of the apparatus, which can both provide the desired illumination and act as a field stop for zero order diffracted radiation. A universal illumination profile can be designed that allows for aligning on any of the X, Y and XY marks without changing the illumination mode, although this inevitably brings some compromise in performance and/or some complication in the apparatus. Alternatively, dedicated modes can be designed and made to be selectable for use with the different mark types. Different polarizations of illumination can be selected also.
(46) A primary function of the illumination profiling optic 446 is such to supply coherent radiation from first and second source regions within a pupil of the objective lens 424. The first and second regions are confined to a peripheral portion of the pupil (in the sense of at least being away from the optical axis). They are each limited in angular extent and are positioned essentially diametrically opposite one another with respect to the optical axis. As will be seen from the examples in U.S. 61/623,391, the source regions may take the form of very small spots, or may be more extended in form. Further source regions may be provided, in particular third and fourth source regions may be provided rotated at about 90 from the first and second regions. A particular embodiment of illumination profiling optics 446 comprises a self-referencing interferometer of the same general form as interferometer 428. The apparatus as a whole need not be limited to providing these particular off-axis illumination profiles. It may have other modes of use, both known or yet to be developed, which favor the use of different profiles. A particular alternative profile, included in discussions below, is one having a single, on-axis region.
(47) It should be noted that in the example shown in
(48) Referring to
(49) The horizontal dotted line in
(50) When off-axis illumination is used, bright spots of coherent radiation can be produced at peripheral positions, also illustrated in
(51) The spots and spot mirrors are likely to be much smaller in practice than the large spots illustrated schematically here. For example, for a pupil diameter of a few centimeters, the spot size may be less than 1 millimeter. The optical system as shown is only presented for the discussion of an embodiment of the present invention, and additional components can be added in a practical implementation. As one example, one or more additional beam splitters can be provided in the path of information carrying beam 426, to collect portions of the radiation for other purposes. For example, another splitter with part-silvered spot mirrors could be placed between splitters 454 and 462, to collect some radiation for measurement of intensity. Alternatively or in addition, portions of the radiation can be collected in the arrangement 460 for similar purposes.
(52)
(53) If the pitch of the grating were to increase, additional orders 2 and +2 etc. may fall within the pupil. Because of the offset mentioned already, the diffraction orders of each spot remain separate from those of the other spot, irrespective of the pitch of the grating. An apparatus can be envisaged in which the offset is not present, and the illumination spots lie exactly on the X, Y and/or XY axes. However such an arrangement may place constraints on the combinations of mark pitches and radiation wavelengths that can be used, if one is to avoid unwanted overlap between diffraction orders, and to avoid wanted diffraction orders being blocked. In an embodiment where broadband or polychromatic radiation is used, the higher order diffraction signals will not be a single spot, as shown here, but rather will be spread into a first order spectrum, second order spectrum and so forth. The potential for unwanted overlap between orders is thereby greater. The orders will be represented as spots here for simplicity only.
(54)
(55) The directions in which the higher order spots will be found in the diffracted radiation field are indicated for the X, Y and XY marks by white dotted lines on the profiles 448 and 448(0) as illustrated in
(56) The prior application further illustrates the diffraction patterns and interferometer outputs for illumination modes designed for a Y-direction mark (204 in
(57) The illumination profiles can be produced in a number of ways to form a practical instrument, bearing in mind that the opposed segments should be coherent for the interferometer 428 to produce the desired signal. Particularly when a broadband source is involved, the coherence length/time of the source radiation will be short. Even with a monochromatic laser source, U.S. Pat. No. 6,961,116 teaches that a short coherence time is desired, for example to eliminate interference from undesired multiple reflections. Consequently, optical path lengths from the source to each segment should be closely matched. An aperture corresponding directly to the desired profile could be placed in a widened parallel beam, but that would result in a relatively large radiation loss. To circumvent the loss of radiation, various alternative solutions in the U.S. 61/623,391 mentioned above are proposed.
(58) The illumination emerging from the illumination source 442 may be monochromatic but is typically broadband in nature, for example white light, or polychromatic. A diversity of wavelengths in the beam increases the robustness of the measurement. The sensor may use, for example, a set of four wavelengths named green, red, near infrared and far infrared. In a sensor implementing an embodiment of the present invention, the same four wavelengths could be used, or a different four, or more or fewer than four wavelengths might be used.
(59) The mark may need to be scanned more than once if it is desired for example to measure position using two different polarizations. Also the illumination mode may be switched midway through scanning the XY mark. In other embodiments, however, multiplexing of optical signals is used so that two measurements can be made simultaneously. Similarly, multiplexing can be applied so that different portions of the XY mark can be scanned and measured without switching illumination mode. A simple way to perform such multiplexing is by frequency division multiplexing. In this technique, radiation from each pair of spots and/or polarization is modulated with a characteristic frequency, selected to be much higher than the frequency of the time-varying signal that carries the position information. The diffracted and processed optical signals arriving at detector 430 will be a mixture of two signals, but they can be separated electronically using one or more filters tuned to the respective frequencies of the source radiation. Time division multiplexing could also be used, but this would involve accurate synchronization between source and detector. The modulation at each frequency can be a simple sine or square wave, for example.
(60) If it is desired to illuminate a mark with circular polarization, whether for position sensing or some other form of metrology, a quarter wave plate (not shown) can be inserted between beam splitter 454 and objective 424. This has the effect of turning a linear polarization into a circular one (and changing it back again after diffraction by the mark). The spot positions are chosen as before according to the mark direction. The direction of circular polarization (clockwise/counterclockwise) can be changed by selecting a different linear polarization in the illumination source 420, fiber 422 or illumination profiling optic 446.
(61) Referring briefly to
(62) While the examples described herein concentrate on 0.sup.th order and +/1.sup.st order diffraction signals, it will be understood that the disclosure extends to the capture and analysis of higher orders, for example +/2.sup.nd orders, more generally +/n.sup.th orders. In the examples, the 1.sup.st orders only are shown and discussed, for simplicity.
(63)
(64) The four colors are transported by polarization maintaining fiber to a multiplexer 502, where they are combined into a single four-color beam. The multiplexer maintains linear polarization, as indicated by arrows 504. The arrows 504 and similar arrows throughout the diagram are labeled G and R to indicate polarization of the green and red components. The N and F components are oriented the same as the G and R components, respectively.
(65) This combined beam goes via suitable delivery optic 506 into beam splitter 454. As already described, the beam then reflects from a partially- or fully reflecting surface (e.g. a 0.5 mm dia spot mirror), which is inside the beam splitter. The objective lens 424 focuses the beam to a narrow beam which is reflected and diffracted by the grating formed by alignment mark 202 on the substrate. Radiation is collected by the objective, with for example numerical aperture NA=0.6. This NA value may allow at least ten orders of diffraction to be collected from a grating with 16 m pitch, for each of the colors.
(66) The reflected and diffracted radiation forming information carrying beam 426 is then transported to the self-referencing interferometer 428. In this example, as already described, the beam is split 462 to supply a portion 464 of the information carrying beam to the asymmetry measuring arrangement 460, when provided. Signals 466 conveying asymmetry measurement information are passed from arrangement 460 to the processing unit PU. Just before the interferometer, polarization is rotated by 45 by a half wave plate 510. From this point on, polarization arrows are shown for only one color, for clarity. The interferometer, as already described above and in U.S. Pat. No. 6,961,116, comprises a polarizing beam splitter, where half of each color is transmitted, and half of each color is reflected. Each half then is reflected three times inside the interferometer, rotating the radiation field by +90 and 90, giving a relative rotation of 180. The two fields are then superimposed on top of each other and allowed to interfere. A phase compensator 512 is present to compensate for path differences of the 90 and 90 image. The polarization is then rotated 45 by another half wave plate 514 (having its major axis set at 22.5 to the X or Y axis). The half wave plates 510, 514 are substantially wavelength insensitive, so that polarizations of all four wavelengths are rotated by 45.
(67) A further beam splitter 516 (not shown in
(68) Note that this arrangement chooses to use one polarization for illumination in each color. Measurements with two polarizations per color could be made, by changing the polarization between readings (or by time division multiplexing within a reading). However, to maintain high throughput while benefiting from some diversity in color and polarization, a set of different colors with single, but different, polarizations represents a good compromise between diversity and measurement throughput. To increase diversity without impacting throughput, one can envisage an implementation similar to the four-color scheme presented here, but using more colors, for example eight or sixteen, with mixed polarizations.
(69) The radiation for each path A and B is collected by a respective collector lens assembly 484A and 484B. It then goes through an aperture 518A or 518B that eliminates most of the radiation from outside the spot on the substrate. Multimode fiber 520A and 520B transports the collected radiation of each path to a respective demultiplexer 522A and 522B. The demultiplexer splits each path in the original four colors, so that a total of eight optical signals are delivered to detectors 430A and 430B. In one practical embodiment, fiber goes from the demultiplexer to eight detector elements on a detector circuit board. The detectors provide no spatial resolution, but deliver time-varying intensity signals I.sub.A and I.sub.B for each color, as the apparatus scans the mark 202. The signals are actually position-dependent signals, but received as time-varying signals (waveforms) synchronized with the physical scanning movement between the apparatus and the mark (recall
(70) Processing unit PU receives the intensity waveforms from the eight detectors and processes them to provide a position measurement POS. Because there are eight signals to choose from, based on different wavelengths and incident polarizations, the apparatus can obtain useable measurements in a wide variety of situations. In this regard it should be remembered that the mark 202 may be buried under a number of layers of different materials and structures. Some wavelengths will penetrate different materials and structures better than others. Processing unit PU conventionally processes the waveforms and provides a position measurement based on the one which is providing the strongest position signal. The remaining waveforms may be disregarded. In a simple implementation, the recipe for each measurement task may specify which signal to use, based on advance knowledge of the target structure, and experimental investigations. In more advanced systems, for example as described in the paper by Huijbregtse et al. mentioned above, an automatic selection can be made, using Color Dynamic or Smooth Color Dynamic algorithms to identify the one or more best signals without prior knowledge.
(71) Discarded waveforms, when considered together as a set, may contain useful information about the structure and materials. In particular, they may contain information about asymmetry of the structure, which will be exploited to provide an alternative or additional asymmetry measurement technique as described further below. In addition, the set of signals can contain other information on the stack, that is the sequence of layers lying on top of the mark, and possibly beneath it as well. It will be appreciated that by using more of the information present in these existing signals, the proposed technique makes more efficient use of the total amount of photons reflected and diffracted by the substrate.
(72) Also described in the Huijbregtse et al. paper is the use of multiple gratings in a composite target. Each grating has a different profile, enhancing for example higher diffraction orders (3.sup.rd, 5.sup.th, 7.sup.th). Position measurements can be derived from different ones of these gratings, as well as from different color signals on an individual grating. In the present disclosure, it will be assumed that there is a single grating with a simple bar pattern. The skilled reader can readily expand the disclosure to envisage embodiments having multiple gratings with different patterns.
(73) Asymmetry MeasurementIntroduction
(74) As described so far, the position measurement apparatus is used for example to obtain an alignment position in a lithographic apparatus such as that shown in
(75) Metrology tools are available commercially to measure asymmetry. However, these may neither be integrated with the alignment sensor nor may they be fast enough to operate with the alignment sensor without harming throughput of a lithographic process. One such apparatus is an angle-resolved scatterometer that uses a CCD-array in a conjugate pupil plane to measure the intensity asymmetry in a diffraction spectrum. The scatterometer measures asymmetry sequentially for a number of colors. In the alignment sensor, the positions signals from different colors may be measured in parallel for speed. Additionally, speed, noise and power (heat) dissipation may present challenges to the asymmetry measuring arrangement, if it is to be integrated in an alignment senor.
(76) Several different approaches are possible for adding an asymmetry measuring function to the position measuring apparatus. As mentioned already, an asymmetry measuring arrangement 460 may be included in the apparatus, which processes a portion 464 of the information carrying beam 426 diverted by beam splitter 462. The form of the asymmetry measuring apparatus 460 can vary.
(77) In U.S. 61/623,391 mentioned above, there is mentioned an asymmetry measuring arrangement that includes a camera to capture pupil plane images of the diffracted radiation. These images can be used for angle-resolved scatterometry. By comparing intensities of image portions corresponding to positive and negative orders of diffraction, asymmetry can be measured. The option to add such a pupil image camera as an asymmetry measuring arrangement in an alignment sensor is discussed U.S. 61/623,391. U.S. 61/623,391 mentions another technique for measuring asymmetry through the interferometer and detector 430. This uses illumination profiles in which off-axis illumination is provided from one side only at a time, allowing the apparatus to measure the intensity of the +1 order and 1 order separately from one another.
(78) In US patent application no. U.S. 61/684,006, filed 16 Aug. 2012, the contents of which is incorporated herein in its entirety by reference, a further form of asymmetry measuring arrangement 460 is proposed. In this form of arrangement, the illumination spot on the substrate is imaged onto a detector. Special optical elements are included in the optical path prior to imaging, which deflect positive and negative diffraction orders so that radiation of the different diffraction orders is separated and used to image the spot onto separate detectors.
(79) Any of the arrangements just mentioned can be used to implement an asymmetry measuring arrangement 460 in the present apparatus. The following description concerns a further technique for measuring asymmetry, using the existing position measuring hardware. This technique may be used instead of or in combination with the arrangement 460, which may take either (or both) of the forms described in the mentioned prior applications, or may take another form entirely.
(80) Asymmetry Measurement from Position Signals
(81)
(82) In step S1, the mark is scanned as described above, and multiple waveforms are recorded, according to the different colors, polarizations and/or the like that are accessible in the optical system. Referring to the example of
(83) In step S3 asymmetry information is obtained from the asymmetry measuring arrangement 460 (asymmetry sensor for short). Asymmetry information may be obtained alternatively or additionally from some source external to the position measuring apparatus.
(84) In step S4, rather than discarding additional information from the multiple position signals derived from the waveforms captured in step S1, information from multiple signals is used to obtain a refined measurement of asymmetry or an asymmetry-dependent parameter. The manner in which this is done can vary, and examples will be explained further below. Increasing the measured information used can be beneficial to help break unknown correlations between measured alignment position and various parameters of the target grating parameter. It can also increase the total number of photons used, and hence the signal to noise ratio will be improved.
(85) At step S5, the refined asymmetry measurement derived in step S4 is used to apply a correction to each of the positions measured in step S2. In step S6, a best measured position is calculated by selecting and/or combining results from among the multiple corrected position measurements. This measurement, which has improved accuracy due to reduced asymmetry sensitivity, is output S7 either for use in a lithographic process, or more generally as a metrology result.
(86)
(87) In step S41 the waveforms (position-dependent intensity signals) from the eight elements (in this example) of detectors 430A, 430B are received. At step S42, each waveform is decomposed into separate components. For example, a discrete Fourier transform (DFT) may be used to decompose the waveform into a set of component waveforms that are essentially harmonics of the period of the grating forming the mark 202. If the waveform were purely sinusoidal with period P/2, then only a first order component would have any magnitude. In a real target and a real instrument, however, several odd and even harmonics may be present in different phases and amplitudes. As described in the Huijbregtse et al. paper mentioned above, different target gratings may even be designed specifically to introduce strong higher-order signals. These multiple orders will be exploited to learn more about the structure of the target (including overlying stack layers). The result of step S42 is thus a set of numerous components, of different orders, but also of different colors/polarization combinations. Each of these components in principle can yield a position measurement. Therefore taking for example five orders for each of the eight waveforms will yield 40 different position measurements.
(88) In step S43 a position measurement is calculated from each of the multiple components (color/polarization and order), which in practice is a matter of sharing the results already calculated in step S2 (
(89) In step S47, the numerous different position measurements are combined with a model of the target structure (mark 202) to identify best fitting parameters of that model. In particular, for the purposes of asymmetry measurement, an asymmetry-dependent parameter is included in the model. The variance calculated in step S44 is used as a measure of the quality of the corresponding position measurement obtained in step S43. Similarly, the variance calculated in step S46 is used as a measure of the quality of the corresponding intensity-based measure obtained in step S45. All of these results in turn are weighed against the asymmetry measurement per color/polarization and order that was obtained from the asymmetry sensor to obtain a single best measurement of asymmetry, which is then output at step S48.
(90) Exemplary Implementation
(91) A particular implementation of the above method steps will now be illustrated in mathematical detail. It should be understood that the above method steps are not the only way to implement asymmetry measurement and position measurement in accordance with an embodiment of the present invention. Moreover the mathematical detail below is not the only way to implement the above method steps in practice.
(92) To facilitate the description and implementation of the asymmetry measurement technique, an alignment sensor model is defined and will be used throughout this document as an example. For convenience the same coordinate systems will be used as for the basic operation of the position measuring apparatus and associated asymmetry measuring arrangement 460. Firstly, the spatial coordinate system at target (substrate) level and the polarization coordinate level at pupil level are defined. Let {circumflex over (x)}.sub.P, .sub.P and {circumflex over (z)}.sub.P denote the pupil spatial Cartesian coordinate system unit vectors. Let {circumflex over (x)}.sub.T, .sub.T and {circumflex over (z)}.sub.T denote the target spatial Cartesian coordinate system unit vectors. Note that the intensity detectors 430A, 430B are located in planes conjugate to the target plane. Depending on the design of asymmetry measuring arrangement 460, intensity detectors there may be in a pupil plane or a target plane.
(93)
(94)
(95) With reference to the notation introduced in
(96)
(97) Similar notation with subscript P instead of T applies to the pupil coordinate system illustrated in
(98) In a real implementation, allowances may need to be made for tilt relative to the apparatus coordinate system, and rotation about the Z axis.
cos(.sub.P).Math..sub.P=cos(.sub.T).Math..sub.T+.sub..sub.
(99) In
sin(.sub.P).Math..sub.P=sin(.sub.T).Math..sub.T.sub.{circumflex over (x)}.sub.
(100)
(101)
(102) In
(103) Coordinate system transformations from the pupil spatial polar coordinate system to the target spatial spherical coordinate system and vice versa can be derived. Without going into the detailed derivation, the mapping SP2T: (.sub.P,.sub.P).fwdarw.(.sub.T,.sub.T) (Spatial Pupil To Target) can be shown to be
(104)
(105) The mapping ST2P: (.sub.T,.sub.T).fwdarw.(.sub.P,.sub.P) (Spatial Target To Pupil) can be shown to be
(106)
(Note that the subscript P here indicates pupil plane, and is not to be confused with the variable P that is the period of the target grating.)
(107) Further, coordinate system transformations from the incident ray in the target spherical coordinate system to the reflected/diffracted ray in the target spherical coordinate system can be derived. The unknown mapping SI2RD: (.sub.T,.sub.T,n).fwdarw.(.sub.T,.sub.T) (Spatial Incident To Reflected/Diffracted) can be derived as the following:
(108)
(109) In this transformation: {N,N} denotes the diffraction order (noting that =0 refers to the reflected order, and 0 refers to the diffracted (higher) orders); P>0 again denotes the target grating pitch and .sub.0>0 denotes the incident plane wave wavelength in vacuum (typical values for this application are 400 nm.sub.01100 nm).
(110) Note that the radial coordinate {square root over (f.sub.T.sup.2+g.sub.T.sup.2)} is clipped to one.
(111) Coordinate system transformations from the pupil polarization coordinate system to the target polarization coordinate system and vice versa can be derived. First, the counterclockwise rotation matrix is defined as:
(112)
(113) The unknown mapping PP2T: ({circumflex over (p)}.sub.P,.sub.P).fwdarw.({circumflex over (p)}.sub.T,.sub.T) (Polarization Pupil To Target) can be derived as:
(114)
(115) The mapping PT2P: ({circumflex over (p)}.sub.T,.sub.T).fwdarw.({circumflex over (p)}.sub.P,.sub.P) (Polarization Target To Pupil) can be derived as:
(116)
(117) The mapping PPPS2XY: ({circumflex over (p)}.sub.P,.sub.P).fwdarw.({circumflex over (x)}.sub.P,.sub.P) (Polarization Pupil Parallel Senkrecht To X Y) can be derived as:
(118)
(119) The mapping PPXY2PS: ({circumflex over (x)}.sub.P,.sub.P).fwdarw.({circumflex over (p)}.sub.P,.sub.P) (Polarization Pupil X Y To Parallel Senkrecht) can be derived as:
(120)
(121) Having defined coordinate systems and transformations, it is assumed that the (complex) pupil plane electric field amplitudes are known/given. These (complex) pupil plane electric field amplitudes can be computed using any suitable approach. In the present implementation, a Jones calculus model is used to compute these fields given the illumination field (in terms of wavelength, angle and polarization) and given coefficients of reflection and diffraction of the target structure (alignment mark, substrate and overlying stack). These coefficients can be computed by solving Maxwell's equations for a model of the target and surrounding materials. The equations can be solved for example by the well-known technique of RCWA (rigorous coupled-wave analysis). This (complex) pupil plane electric field amplitude, in the {circumflex over (x)}.sub.P and .sub.P polarization coordinate system, can be denoted by the following equation
(122)
(123) Again, {N,N} denotes the diffraction order, =0 refers to the reflected order, and 0 refers to the diffracted orders. How to calculate the intensity as seen by the detectors 430A, 430B in the alignment sensor will now be discussed. As mentioned already, the case of off-axis illumination will be considered. The case of on-axis illumination can be derived as a special case. Note that in the off-axis illuminated alignment sensor, both illumination rays (spots in the pupil plane) are mutually coherent and in phase. Hence the (complex) electric field amplitudes, of positive and negative diffraction orders, as summed below, may originate from a different incident plane wave. Note it will be assumed here that the target tilts are zero.
(124) The (complex) pupil plane electric field amplitude, as a function of the (stage) scan position equals
(125)
where x.sub.stage denotes the scan x-position of, for example, the substrate table WT. Note that it is assumed here that the scanning movement is pointing in the direction of (i.e. parallel to) the {circumflex over (x)}.sub.P direction. Again, P>0 denotes the target grating pitch.
(126) Note that the phase term exp
(127)
as introduced above, can be derived from a Fourier optics treatment (i.e. a Fourier series expansion) of the alignment target. The counterclockwise rotation matrix is defined again to equal
(128)
The (complex) electric field amplitudes, after passing, in order, the half-wave plate 510, the self-referencing interferometer 428 and the phase compensator 512, are the following:
(129)
(130) Note that the indices and are no longer applicable after passing the self-referencing interferometer, and hence have been replaced by U and L. Referring again to
(131) Note that it is assumed above that linear x- and y-polarized illumination radiation is being supplied to the target. The rotation of 45 is then effected by half-wave plate 510 at the input side of interferometer 428 (both shown in
(132) Generalizing the above expressions to allow for an arbitrary polarization rotation to be applied by half-wave plate 510 results in:
(133)
where denotes the (counter-clockwise) the half-wave plate 510 (fast axis) located before the self-referencing interferometer. Introducing shorthand matrices
(134)
the expressions for the detector-level amplitudes can rewritten as:
(135)
Further, by applying the above-defined expression for the complex pupil plane electric field amplitude, as a function of the scan position and by introducing shorthand notation for electric fields E.sub.A,=.sub.A,.Math.E.sub.P,(x.sub.stage=0) and E.sub.B,=.sub.B,.Math.E.sub.P,(x.sub.stage=0), the following expressions can be derived:
(136)
(137) Recall from
(138)
(139) The sum and difference (alignment) detector intensities can now be computed by summing the contributions from the different diffraction orders as shown in the following equations:
(140)
which can be expanded to
(141)
(142) Whichever form of expression is used, it will be seen that each of these intensity values, corresponding to the position-varying waveform recorded as the spot 202 scans a target, is the summation of N different orders, corresponding to the diffraction orders . Within each order , there are two constant terms, representing a DC component, and a periodic term with spatial frequency 4 f P. Comparing the sum and difference signals, it can be seen that they are identical except that their periodic components are in antiphase. Note that it is assumed that the zeroth diffraction order (i.e. the reflected order with =0) is blocked somewhere along the path from objective 424 to the interferometer, as already described above. It is further assumed that the detector surface is large relative to the pitch of the target grating. This means that the electric field amplitudes at the detector surface, resulting from pairs of two plane waves incident on the detector surface, are all orthogonal on the interval defined by the detector surface area. Hence these electric field amplitudes at the detector surfaces, due to the different order pairs, may be summed incoherently, as has been done.
(143) Referring to steps S42-S43 of the method in
(144) Given for example the position-dependent sum intensity signal I.sub.D,sum(x.sub.stage) received from alignment sensor detector, one can estimate the (relative) phase of the term
(145)
using for example a projection or a fit approach. Using this projection or fit approach, the alignment sensor detector intensity signal (for each color) is decomposed by the processing unit PU using for example a Fourier transform as follows:
(146)
(147) In this equation u.sub.0 is a zero order (DC) coefficient, while u.sub. is generally a th order Fourier coefficient. For each order there is a cosine coefficient u.sub.,cos and a sine coefficient u.sub.,sin. The relationship between these two corresponds to the phase of the periodic component of that order. In physical terms, each value of corresponding to a diffraction order in the diffraction spectrum of the target grating gives rise directly to a corresponding order (harmonic component) in the position-dependent detector waveform I (x). Based on the above decomposition (step S42) there are computed a phase .sub. for each order and consequently (step S43) a (relative) alignment position x.sub.align, using the formulae:
(148)
(149) So it can be seen that one computes (or at least can compute) multiple (relative) alignment positions from each waveform, one for each positive value of , i.e. {1, . . . , N}. Note that in case of rigorous modeling of the alignment mark (as opposed to a strictly Fourier optics model), (relative) alignment positions can be estimated for even orders, i.e. {2, 4, . . . }, as the (complex) electric field amplitudes E.sub.P, and E.sub.P, are (in general) non-zero for these even orders. Also, when asymmetry occurs in the alignment mark, the complex electric field amplitudes are (in general) non-zero for these even orders, and the even orders may carry particular information about asymmetry. Note that one can derive the phase and hence the alignment position equally using either the sum signal I.sub.D,sum(x.sub.stage) or the difference signal I.sub.D,diff(x.sub.stage), provided one takes account of the minus sign in front of the periodic component. One can also use both sum and difference signals in combination. Using both signals can improve signal to noise ratios, as they use different sets of photons and therefore their noise components (or at least those due for example to photon shot noise and detector noise) should be uncorrelated.
(150) Referring to step S44, the influence on the estimated alignment position of photon Poisson noise at the level of the alignment sensor detectors 430A, 430B is now discussed. This noise estimation allows the best signals to be selected for use in calculating the position measurement. In order to compute the noise sensitivity of the estimated alignment position for a given color, order etc., the following derivatives are computed:
(151)
(152) It is assumed that the total number of photons at detector level within a detector integration time interval is (very) large, so that the Poisson distribution (which describes the number of photons arriving at the detector) is approximated well by the normal (i.e. Gaussian) distribution. It is also assumed that the noise is white noise. Note that if a discrete Fourier transform is made of a white noise signal, then all spectral components will have an expected value that equals zero, and will have an identical variance. As the periodic components
(153)
will be mutually orthogonal on the scan trajectory interval, it can be concluded that cov(u.sub.,cos,u.sub.,sin)=0. Hence the following result can be derived:
(154)
(155) In order to simplify the computation of .sub.u.sub.
(156) From the derivation presented earlier, the sum alignment detector intensity can be derived as follows (simplified according to those simplifying assumptions):
(157)
(158) Note that one could equally start with the different detector intensity signal. It turns out that the conclusion as to the variance .sub.x.sub.
(159) A detector gain scaling constant G can be defined as follows:
(160)
where N denotes a number of photon-electrons, by which it meant the proportion of photons that are converted into electrons so as to give rise to a signal in the detector. As the photon-electron arrival is a Poisson process, the instantaneous variance of the detector signal equals the number of instantaneous photoelectrons at the detector. This property allows computations of the variance .sub.I.sub.
(161)
(162) Recalling the following expression:
(163)
and combining it with this one:
(164)
it can be observed that the following two identities hold (in general):
(165)
(166) Hence U.sub.0 and {square root over (u.sub.,cos.sup.2+u.sub.,sin.sup.2)} can be used an estimator for the intensity when actually measuring signals. If the above two identities are simplified for the particular case described above (i.e. where hold and) then it yields:
(167)
(168) For later use, there is introduced a convenient shorthand notation for the maximum number of photoelectrons in the sum detector intensity signal as:
(169)
(170) The variances .sub.u.sub.
(171)
(172) The above two integrals can be evaluated indirectly by means of numerical Monte Carlo computations, to compute the variance .sub.x.sub.
(173) As an alternative to the numerical solution, it may be useful to have an analytical rule of thumb expressing the relationship between the number of photons and the variance of the estimated, relative alignment position. To obtain this rule of thumb, it is also assumed that the alignment signal consists of first order diffraction information only. In this particular case, the above integrals can be simplified into:
(174)
(175) In conclusion, there can be now stated a final variance of the estimated alignment position, for the particular case in which the alignment signal consists of only the first diffraction order information, and the alignment mark is symmetrical about a zero value of the stage position. The variance of the estimated, relative alignment position equals:
(176)
As noted above, this last result holds for the intensity alignment signals from both the sum and difference detectors.
(177) Note that as
(178)
will be mutually orthogonal on the scan trajectory interval, it can be concluded that cov(u.sub.,cos,u.sub.,cos)=0, cov(u.sub.,sin,u.sub.,sin)=0 and cov(u.sub.,cos,u.sub.,sin)=0, for {1, . . . , N} and {1, . . . , N}, and . Hence the covariance matrix C.sub.x.sub.
(179)
To this, one can make use of the identity
(180)
which is valid if =22.5 and is purely x-polarized or y-polarized., , , . This result can be verified numerically to confirm that the rule of thumb calculation agrees with the full numerical solution.
(181) Incidentally, in a case where the asymmetry measuring arrangement 460 and the alignment sensor both work in parallel and share the same illuminator, the same detector integration time (i.e. effective scan length) applies. A lot of the calculations and derivations of results can be common to the different arrangements. It can also be arrange that the asymmetry sensor and the alignment sensor have the same scaling constant. Other noise sources can be taken into account, if they are known. For example, sensor electronics noise and/or mechanical vibration can be taken into account.
(182) As seen above, a plethora of different alignment position measurements are in fact obtained from the position-varying intensity signals captured by detectors 430A, 430B. A different measurement x.sub.align(.sub.0,E.sub.S,) can be obtained for each combination of color (.sub.0), polarization (E.sub.S) decomposed order (). There are other ways to derive a single position measurement x.sub.align from these multiple measurements, besides just selecting a best one of the waveforms and orders. Rather than discarding all but the best, one can use an average of the measurements as the single result. Various different averages can be used, which can also be referred to as location estimators. These include means, medians, weighted means, or weighted medians. Outliers can be discarded also. Rank based estimators such as Hodges-Lehmann estimators may be used. The average can be weighted in some way, if the relative quality of the different measurements x.sub.align(.sub.0,E.sub.S,) can be identified. Computed above are the accompanying variance of these measurements .sub.align.sup.2(.sub.0,E.sub.S,), which can be used for such weighting. Recall that the measurements are (assumed to be) uncorrelated. In the present apparatus, asymmetry corrections are applied to obtain corrected versions of the numerous position measurements, before the best single position measurement is calculated. While in principle the concept just described is to use all the measurements instead of discarding all but the best one, hybrid approaches are possible in which multiple measurements are used in the calculation after discarding some number of measurements that are judged to be the worst. This may be done for example to reduce processing effort. In addition, one or more statistical techniques may be applied such as trimming (discarding outliers) or Winsorizing (adjusting outliers to fall within a predetermined percentile), before an average result is calculated.
(183) Referring to steps S45 and S46, the present embodiment obtains additional information from the detector sum and/or difference waveforms to supplement the information used to reconstruct the alignment target asymmetry. In particular, there is disclosed the optional use of (estimated) intensity |E.sub.L,.sup.H.Math.E.sub.U,| of the periodic components of various orders. This is not to be confused with the intensity as seen by either detector 430A, 430B. The following result from the above:
(184)
allows us to calculate the intensity of each periodic component as measured by the apparatus, which can be compared with the modeled intensity to refine the model. The steps S45, S46 are optional and further discussion of their implementation is deferred until later in this description.
Calculation of Refined Asymmetry Measurement
(185)
(186) All of these parameters of the materials and geometry of the layers and the grating structure forming the alignment target in combination constitute the model that is the basis of calculating the measured alignment position, and one or more properties of the target structure. Parameters of the model can be set to fixed values, while others are allowed to float for the purposes of reconstruction. Parameters can be derived from combinations of other parameters. Critical dimension, side wall angle and/or the like are all parameters that can be derived from the vertex positions (x.sub.,z.sub.). A particular derived parameter is what it is called asymmetry, and can be defined in a variety of ways, to suit the application. Whatever one or more parameters are used, the one or more floating (unknown) parameters can be summarized by a column vector denoted as p.
(187) Referring now to step S47 of
(188) An asymmetry estimation/reconstruction problem can be expressed by defining a residual function of measurements made in step S3 by the asymmetry measuring arrangement 460. The asymmetry measurement arrangement 460 (referred to as the asymmetry sensor for short) can be of any type, for example of the type described in U.S. 61/684,006 mentioned above, or of a type forming a pupil image for angle-resolved scatterometry. A detailed understanding of the asymmetry sensor is not necessary for an understanding of the present subject.
(189) The residual function can be defined as follows:
(190)
where the column vector I.sub.D,asymm,meas denotes all measured intensities from detectors in the asymmetry sensor, the column vector I.sub.D,asymm,model (p) denotes all modeled intensities of the same detectors, the column vector p denotes the unknown (floating) parameters of the alignment target model, the (diagonal) matrix C.sub.I.sub.
(191) The column vector pairwise differences of measured alignment positions x.sub.align,meas is defined as
(192)
in which the difference x.sub.align,,meas of measured alignment positions between any two component signals is defined as
x.sub.align,,meas=x.sub.align,,meas(.sub.0,j,E.sub.S,j)x.sub.align,,meas(.sub.0,m,E.sub.S,m),
where .sub.0,j denotes the illumination wavelength, for measurement j, and
(193)
denotes the electric field at source level, for measurement j, in the {circumflex over (x)}.sub.P and .sub.P polarization coordinate system. Note that one could also compute all pairwise differences of the alignment position, between different diffraction orders .sub.j.sub.m. This will increase the total number of differences, but the implementer should be aware that much of this information is correlated, and hence expanding the number of differences above a certain point may be of limited use.
(194) In addition to the measured positions, one then takes account of predictions of measured positions, obtained from the model. The covariance matrix C.sub.x.sub.
(195)
where
(196)
denotes the Jacobian matrix of derivatives with respect to the alignment positions, of the pairwise differences of modeled alignment positions. Typically this matrix will be a sparse matrix with one 1 and one 1 entry per row only. Hence C.sub.x.sub.
(197) To perform the step S47, the asymmetry estimation/reconstruction problem can be posed in the following terms:
(198)
In other words, the task is to use the calculated covariance matrices, as weighting matrices, to minimize the residual function R(p), and hence to obtain the result p.sub.asymm,estimated which is a best estimate of the set of parameters of the target model (model of the periodic structure forming the alignment mark 202, etc.). When the model is defined to include one or more asymmetry related parameters, the vector p.sub.asymm,estimated includes our estimate of asymmetry. This non-linear minimization problem can be solved efficiently, using algorithms known to those skilled in the art, for example Newton minimization approaches. The resulting set of parameters p includes the refined asymmetry measurement as one of the parameters, in whatever form of expression is desired. Needless to say, any other parameters that are unknown can also be measured by allowing them to float in the model while the minimization of the residual is performed. For example, the target tilts .sub..sub.
(199) Referring again to steps S45 and S46, if it is desired to make use of the intensity
(200)
to provide additional information for the alignment target asymmetry estimation, the residual function used in S47 can be modified to be as follows:
(201)
where
(202)
denotes the covariance matrix of the column vector
(203)
Note that this matrix is not a diagonal matrix as x.sub.align,meas and
(204)
are mutually correlated.
To avoid costly and complex computations to compute the covariance matrix, one can approximate it by its diagonal. If so, only .sub.x.sub.
(205)
are computed. The computation of .sub.x.sub.
(206) In order to compute
(207)
in step S46 as follows, the following derivatives are computed:
(208)
(209) Following the same reasoning as in step S44 the following result for the variance of the estimated intensity
(210)
can be derived:
(211)
where the variances .sub.u.sub.
(212)
into:
(213)
(214) Referring then to step S48 in
(215) Calculation of Corrected Position Measurements
(216) Returning to
(217) Given all measured alignment positions x.sub.align,meas and all modeled alignment positions x.sub.align,model (p) asymmetry corrected alignment positions x.sub.align,corrected can be computed as follows:
(218)
(219) In this equation, Q{1, 2, 3, 4, . . . } denotes the total number of alignment position measurements (i.e. for all combinations of illumination color and polarization and all Fourier components) and x.sub.align,model,reference (x.sub., z.sub.) denotes an alignment reference point x-position. This alignment reference point (x.sub.align,model,reference,z.sub.align,model,reference) can be defined as a function of the grating vertices (x.sub.,z.sub.) in the model (
(220) Now in step S6, given the set of corrected alignment positions x.sub.align,corrected, one can compute one single, efficient and robust alignment position estimate, using an appropriate statistical technique for selection or averaging of the candidate measurements. For this, the variances calculated in steps S44 and S46 can be used to assign a higher weighting or rank to the measurements with the highest reliability. Various different averages can be used, which can also be referred to as location estimators. These include means, medians, weighted means, or weighted medians. Outliers can be discarded also. Rank based estimators such as Hodges-Lehmann estimators may be used. Note that this functionality is comparable to the Color Dynamics functionality. As a further refinement, a weighted Hodges-Lehmann location estimator will result in an estimation of the alignment position estimate, in which all information is being used (i.e. all photons are being used), but which is robust against outliers.
(221) To end this description, computation of the alignment position measurement variances .sub.align,corrected.sup.2 as a measure of the improved quality of the position measurements obtained by the method herein is discussed. Start by recalling the following equation from above:
x.sub.align,corrected=x.sub.align,meas(x.sub.align,model(p)x.sub.align,model,reference(x.sub.,z.sub.))x.sub.align,corrected=x.sub.align,measx.sub.align,correction.
where use of the following shorthand notation has been made:
x.sub.align,correction=x.sub.align,model(p)x.sub.align,model,reference(x.sub.,z.sub.)
(222) Now recall from the discussion of step S44 above that all alignment position measurements x.sub.align,meas are mutually uncorrelated (at least for the photon Poisson noise component of the uncertainty). It also assumed that x.sub.align,meas and (x.sub.align.model (p)x.sub.align,model,reference (x.sub.,z.sub.)) are not correlated. This is a reasonable assumption in the above embodiment where the asymmetry measurement comes predominantly from the asymmetry sensor (arrangement 460) and therefore makes use of different photons than the position measurement. Hence the variance of x.sub.align,corrected can be computed using .sub.align,corrected.sup.2=.sub.align,meas.sup.2+.sub.align,correction, where .sub.align,correction.sup.2 denotes the variance of x.sub.align.correction=x.sub.align.model(p)x.sub.align,model,reference(x.sub.,z.sub.). In other words, the variance of the measured position after correction is greater than before correction. At first sight, it would appear that consequently the corrected measurement is inferior to the uncorrected one. However, it should be remembered that the variance relates only to the reproducibility of the measurement, and the greater aim is to eliminate or at least reduce systematic errors in the position measurements, caused by absent or inaccurate knowledge of asymmetry in the target grating. Therefore, provided that the additional variance is smaller than the systematic gain in accuracy, an overall benefit is achieved.
(223) In order to quantify the additional variance, some calculations and simulations are made for different stacks. These indicate that, for the color(s) that already has the best reproducibility (i.e. lowest standard deviation .sub.x.sub.
(224)
Therefore the reduction in reproducibility of the final measurement is only modest, and this disadvantage can easily be outweighed by the reduction in systematic error. Note that it is assumed that the optics transmission of the asymmetry branch and the optics transmission of the alignment branch are equal. Note that it is also assumed here that 16 wavelengths are used to estimate the target asymmetry, while only one wavelength (i.e. the one with the best signal quality for the particular target) is used to estimate the alignment target position.
(225) Note that the systematic gain in accuracy and the additional deviation can be calculated and compared, before deciding to use the corrected measurement. In other words, in circumstances where the additional variance is larger than the systematic gain in accuracy, the correction can be discarded. The decision to discard the correction is something that can be determined either beforehand (i.e. when defining the recipe for particular targets), or in real time (i.e. in response to data observed while measuring).
CONCLUSION
(226) The above disclosure described how measurements of a property such as asymmetry can be derived by comparing a number of different results that all are derivable from position dependent signals existing in the alignment sensor. Some of these signals are results related to the position of the mark, and may for example be position measurements produced using different colors, polarizations and/or different spatial frequency components of position-dependent optical signals detected in the alignment sensor. Other results can be considered, for example the intensity values of the signals related to position, to obtain further information on the structure property. The information from these results may be combined with other measurements of the property, for example made by a separate measuring branch operating with the same illumination arrangement as the alignment sensor.
(227) It should be understood that the processing unit PU which controls alignment sensor, processes signals detected by it, and calculates from these signals position measurements suitable for use in controlling the lithographic patterning process, will typically involve a computer assembly of some kind, which will not be described in detail. The computer assembly may be a dedicated computer external to the apparatus, it may be a processing unit or units dedicated to the alignment sensor and/or it may be a central control unit LACU controlling the lithographic apparatus as a whole. The computer assembly may be arranged for loading a computer program product comprising computer executable code. This may enable the computer assembly, when the computer program product is downloaded, to control aforementioned uses of a lithographic apparatus with the alignment sensor AS.
(228) Although specific reference may be made in this text to the use of lithographic apparatus in the manufacture of ICs, it should be understood that the lithographic apparatus described herein may have other applications, such as the manufacture of integrated optical systems, guidance and detection patterns for magnetic domain memories, flat-panel displays, liquid-crystal displays (LCDs), thin-film magnetic heads, etc. The skilled artisan will appreciate that, in the context of such alternative applications, any use of the terms wafer or die herein may be considered as synonymous with the more general terms substrate or target portion, respectively. The substrate referred to herein may be processed, before or after exposure, in for example a track (a tool that typically applies a layer of resist to a substrate and develops the exposed resist), a metrology tool and/or an inspection tool. Where applicable, the disclosure herein may be applied to such and other substrate processing tools. Further, the substrate may be processed more than once, for example in order to create a multi-layer IC, so that the term substrate used herein may also refer to a substrate that already contains multiple processed layers.
(229) Although specific reference may have been made above to the use of embodiments of the invention in the context of optical lithography, it will be appreciated that the invention may be used in other applications, for example imprint lithography, and where the context allows, is not limited to optical lithography. In imprint lithography a topography in a patterning device defines the pattern created on a substrate. The topography of the patterning device may be pressed into a layer of resist supplied to the substrate whereupon the resist is cured by applying electromagnetic radiation, heat, pressure or a combination thereof. The patterning device is moved out of the resist leaving a pattern in it after the resist is cured.
(230) The terms radiation and beam used herein encompass all types of electromagnetic radiation, including ultraviolet (UV) radiation (e.g. having a wavelength of or about 365, 355, 248, 193, 157 or 126 nm) and extreme ultra-violet (EUV) radiation (e.g. having a wavelength in the range of 5-20 nm), as well as particle beams, such as ion beams or electron beams.
(231) The term lens, where the context allows, may refer to any one or combination of various types of optical components, including refractive, reflective, magnetic, electromagnetic and electrostatic optical components.
(232) While specific embodiments of the invention have been described above, it will be appreciated that the invention may be practiced otherwise than as described. For example, the invention may take the form of a computer program containing one or more sequences of machine-readable instructions describing a method as disclosed above, or a data storage medium (e.g. semiconductor memory, magnetic or optical disk) having such a computer program stored therein.
(233) The descriptions above are intended to be illustrative, not limiting. Thus, it will be apparent to one skilled in the art that modifications may be made to the invention as described without departing from the scope of the claims set out below.