Substrate polishing apparatus and method
11195729 · 2021-12-07
Assignee
Inventors
Cpc classification
B24B37/105
PERFORMING OPERATIONS; TRANSPORTING
B24B37/013
PERFORMING OPERATIONS; TRANSPORTING
H01L21/67
ELECTRICITY
International classification
H01L21/67
ELECTRICITY
B24B49/05
PERFORMING OPERATIONS; TRANSPORTING
Abstract
A substrate polishing apparatus includes a top ring for pressing a substrate against a polishing pad to perform substrate polishing; a spectrum generating unit that directs light onto a surface of the substrate of interest for polishing, receives reflected light, and calculates a reflectivity spectrum corresponding to the wavelength of the reflected light; and a storage that stores a plurality of thickness estimating algorithms for estimating the thickness of the polished surface in accordance with the reflectivity spectrum. A plurality of thickness estimating algorithms is selected among the thickness estimating algorithms stored in the storage, and a switching condition is set. The thickness of the polished surface is estimated by using the set thickness estimating algorithms, and if the switching condition is satisfied, the thickness estimating algorithm to be applied is switched.
Claims
1. A substrate polishing apparatus comprising: a top ring for pressing a substrate against a polishing pad to perform substrate polishing; a phototransmitter arranged to direct light onto a surface of the substrate; a photoreceptor arranged to receive the light reflected from the surface of the substrate, and a processor configured to calculate a reflectivity spectrum corresponding to the wavelength of the reflected light received from a spectroscope in communication with the photoreceptor; a memory that stores a plurality of thickness estimating algorithms for estimating a thickness of a polished surface in accordance with the reflectivity spectrum; a controller configured to select a first thickness estimating algorithm and a second thickness estimating algorithm among the plurality of thickness estimating algorithms stored in the memory, and set a switching condition; and the controller further configured to estimate the thickness of the polished surface during polishing based on the reflectivity spectrum calculated by the processor by using the first and the second thickness estimating algorithms, and configured to switch the thickness estimating algorithm from the first thickness estimating algorithm to the second thickness estimating algorithm to be used for determining polishing end when the set switching condition is satisfied.
2. The substrate polishing apparatus according to claim 1, wherein the controller is configured to output a control signal indicating an end of the substrate polishing when the estimated thickness of the polished surface reaches a predetermined target value.
3. The substrate polishing apparatus according to claim 1, wherein the switching condition is that the estimated thickness of the polished surface by use of the first thickness estimating algorithm reaches a set value.
4. The substrate polishing apparatus according to claim 1, wherein the switching condition is that a difference between the thicknesses estimated using the first and second thickness estimating algorithms is below a predetermined value.
5. The substrate polishing apparatus according to claim 1, wherein before the thickness estimating algorithm to be used for determining polishing end is switched, the thickness of the polished surface is estimated through a weighting function using the first and second thickness estimating algorithms.
6. The substrate polishing apparatus according to claim 1, wherein the first thickness estimating algorithm is a first algorithm that estimates the thickness of the polished surface by using Fast Fourier Transform (FFT) and the second thickness estimating algorithm is a second algorithm that estimates the thickness of the polished surface by using reference spectrums (Fitting Error).
7. A substrate polishing method for pressing a substrate against a polishing pad to perform substrate polishing, comprising the steps of: directing light onto a surface of the substrate of interest for polishing, receiving reflected light, and calculating a reflectivity spectrum corresponding to the wavelength of the reflected light; selecting a first thickness estimating algorithm and a second thickness estimating algorithm by using a storage that stores a plurality of thickness estimating algorithms for estimating a thickness of a polished surface in accordance with the reflectivity spectrum, and setting a switching condition; and estimating the thickness of the polished surface during polishing based on the reflectivity spectrum calculated by a processor by using the first and the second thickness estimating algorithms, and when the switching condition is satisfied, automatically switching the thickness estimating algorithm from the first thickness estimating algorithm to the second thickness estimating algorithm to be used for determining polishing end.
Description
BRIEF DESCRIPTION OF DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
DETAILED DESCRIPTION OF NON-LIMITING EXAMPLE EMBODIMENTS
(11) A substrate processing apparatus according to one embodiment of the present invention will now be described with reference to the accompanying drawings. It should be noted that the same or corresponding components will be denoted by the same reference numeral and overlapping description will be omitted.
(12)
(13) The polishing table 13 is joined to an underlying table motor 17 through a table shaft 13a, and the table motor 17 allows the polishing table 13 to rotate in the direction indicated by the arrow. The polishing pad 11 is attached to the top surface of this polishing table 13, and the top surface of the polishing pad 11 constitutes the polishing surface 11a for polishing the wafer W. The polishing head 15 is joined to the bottom end of a polishing head shaft 16. The polishing head 15 is configured to hold the wafer W at the bottom surface by vacuum suction. The polishing head shaft 16 is configured to vertically move through a vertical move mechanism not shown in the drawing.
(14) The wafer W is polished in the following manner. The polishing head 15 and the polishing table 13 are rotated in the respective directions indicated by the arrows, and a polishing solution (slurry) is supplied from the polishing solution supply nozzle 14 onto to the polishing pad 11. In this state, the polishing head 15 presses the wafer W against the polishing surface 11a of the polishing pad 11. The surface of the wafer W is polished by mechanical effects of abrasive grain contained in the polishing solution and chemical effects of the polishing solution.
(15)
(16) The elastic film 21 has a plurality of ring-shaped partition walls 21a, and the pressure chambers D1, D2, D3, and D4 are separated by these partition walls 21a. The pressure chamber D1 disposed in the central portion has a circular shape, and the other pressure chambers D2, D3, and D4 have a ring shape. These pressure chambers D1, D2, D3, and D4 are arranged concentrically.
(17) The pressure chambers D1, D2, D3, and D4 are connected to fluid lines G1, G2, G3, and G4 so that a pressure-adjusted pressurized fluid (e.g., pressurized air or other pressurized gases) can be supplied into the pressure chambers D1, D2, D3, and D4 through the fluid lines G1, G2, G3, and G4. The fluid lines G1, G2, G3, and G4 are connected to vacuum lines U1, U2, U3, and U4 so that negative pressure can be formed in the pressure chambers D1, D2, D3, and D4 through the vacuum lines U1, U2, U3, and U4.
(18) The internal pressures on the pressure chambers D1, D2, D3, and D4 can be changed independently of each other through a processing unit 32, which will be described later, and the polishing control unit 12; thus, the polishing pressures on the respective four regions of the wafer W, i.e., the central portion, inner intermediate portion, outer intermediate portion, and edge portion can be independently adjusted.
(19) The ring-shaped elastic film 21 is disposed between the retainer ring 22 and the carrier 20. The ring-shaped pressure chamber D5 is formed in the elastic film 21. The pressure chamber D5 is connected to the fluid line G5 so that a pressure-adjusted pressurized fluid (e.g., pressurized air) can be supplied into the pressure chamber D5 through the fluid line G5. In addition, the fluid line G5 is connected to the vacuum line U5 so that negative pressure can be formed in the pressure chamber D5 through the vacuum line U5.
(20) With changes in the pressure in the pressure chamber D5, the elastic film 21 and the entire retainer ring 22 vertically move, so that the pressure in the pressure chamber D5 is applied to the retainer ring 22 and the retainer ring 22 can directly press the polishing pad 11 independently of the elastic film 21. During polishing of the wafer W, the retainer ring 22 presses the polishing pad 11 around the wafer W while the elastic film 21 presses the wafer W against the polishing pad 11.
(21) The carrier 20 is fixed to the bottom end of a head shaft 16. The head shaft 16 is joined to the vertical move mechanism 25. This vertical move mechanism 25 is configured to move up and down the head shaft 16 and the polishing head 15, and locate the polishing head 15 at a predetermined height. A combination of a servo motor and a ball screw mechanism is used as this vertical move mechanism 25 functioning as the polishing head positioning mechanism.
(22) The vertical move mechanism 25 locates the polishing head 15 at a predetermined height. In this state, a pressurized fluid is supplied to the pressure chambers D1 to D5. Receiving the pressure in the pressure chambers D1 to D4, the elastic film 21 presses the wafer W against the polishing pad 11 and, receiving the pressure in the pressure chamber D5, the retainer ring 22 presses the polishing pad 11. In this state, the wafer W is polished.
(23) The polishing apparatus 10 includes an optical measuring device 30 that acquires the thickness of the wafer W. This optical measuring device 30 includes an optical sensor 31 that acquires an optical signal varying according to the thickness of the wafer W, and a processing unit 32 that calculates the spectrum of reflected light from the wafer W on the basis of the optical signal.
(24) The optical sensor 31 is disposed in the polishing table 13, and the processing unit 32 is connected to the polishing control unit 12. As indicated by the sign A in
(25)
(26) The optical measuring device 30 includes the optical sensor 31 and the processing unit 32. The optical sensor 31 is configured to direct light onto the surface of the wafer W, receive reflected light from the wafer W, and disperse the reflected light according to the wavelength. The optical sensor 31 includes a phototransmitter 41 that directs light onto a surface of the wafer W of interest for polishing, an optical fiber 42 that serves as a photoreceptor receiving reflected light returning from the wafer W, and a spectroscope 43 that disperses reflected light from the wafer W according to the wavelength and measures the intensity of the reflected light over a predetermined range of wavelength.
(27) The polishing table 13 has a first hole 50A and a second hole 50B opened on the top surface. In addition, the polishing pad 11 has a through-hole 51 in a position corresponding to the holes 50A and 50B. The holes 50A and 50B are communicated with the through-hole 51. The through-hole 51 is opened on the polishing surface 11a. The first hole 50A is coupled to a liquid supply source 55 through a liquid supply path 53 and a rotary joint (not shown in the drawing), and the second hole 50B is coupled to a liquid exhaust path 54.
(28) The phototransmitter 41 includes a light source 45 emitting multiple-wavelength light, and an optical fiber 46 connected to the light source 45. The optical fiber 46 is an optical transmission line that guides light emitted by the light source 45, to the surface of the wafer W. The front ends of the optical fibers 46 and 42 are located in the first hole 50A and in the vicinity of the polished surface of the wafer W. The front ends of the optical fibers 46 and 42 are disposed so as to face the wafer W held by the polishing head 15. Each time the polishing table 13 rotates, the plurality of regions of the wafer W is irradiated with light. Preferably, the front ends of the optical fibers 46 and 42 are disposed so as to pass through the center of the wafer W held by the polishing head 15.
(29) During polishing of the wafer W, water (preferably pure water) is supplied from the liquid supply source 55 to the first hole 50A through the liquid supply path 53 as a transparent liquid, and fills the space between the bottom surface of the wafer W and the front ends of the optical fibers 46 and 42. Water further flows into the second hole 50B and is exhausted through the liquid exhaust path 54. The polishing solution is exhausted together with water, thereby ensuring an optical path. The liquid supply path 53 is provided with a valve (not shown in the drawing) in synchronization with the rotation of the polishing table 13. This valve operates in such a manner that it stops the water flow or reduces the flow rate of water when the wafer W is not located over the through-hole 51.
(30) Two optical fibers 46 and 42 are disposed in parallel with each other, and their front ends are disposed perpendicularly to the surface of the wafer W. The optical fiber 46 is configured to direct light perpendicularly to the surface of the wafer W.
(31) During polishing of the wafer W, light from the phototransmitter 41 is directed onto the wafer W, and the optical fiber (photoreceptor unit) 42 receives reflected light from the wafer W. The spectroscope 43 measures the intensity of reflected light at each wavelength within a predetermined range of wavelength, and transmits the acquired light intensity data to the processing unit 32. This light intensity data is an optical signal that reflects the thickness of the wafer W and is composed of the intensity of the reflected light and the corresponding wavelength.
(32)
(33) The wafer W has a bottom-layer film and a top-layer film (e.g., a silicon layer or insulating film) formed on the bottom-layer film. Light incident on the wafer W reflects off an interface between a medium (e.g., water) and the top-layer film and an interface between the top-layer film and the bottom-layer film, and interference of lightwaves reflected off these interfaces occurs. The way of interference of lightwaves changes depending on the thickness of the top-layer film (i.e., optical length). Accordingly, the spectrum generated from the reflected light from the wafer W changes depending on the thickness of the top-layer film.
(34) The spectroscope 43 disperses reflected light according to the wavelength, and measures the intensity of the reflected light for each wavelength. Further, the processing unit 32 generates a spectrum from reflected light intensity data (an optical signal) obtained through the spectroscope 43. Hereinafter, a spectrum generated from reflected light from the polished wafer W is referred to as a measuring spectrum (reflectivity spectrum). This measuring spectrum is represented by a line graph (i.e., spectral waveform) showing a relationship between the wavelength and intensity of light. Light intensity can be represented by a reflectivity or relative value, such as a relative reflectivity.
(35)
(36) A reference intensity can be defined as, for example, a light intensity obtained when a silicon wafer with no film formed thereon (a bare wafer) is water-polished in the presence of water. In actual polishing, a corrected measured intensity is determined by subtracting a dark level (background intensity obtained under the condition where light is shut out) from a measured intensity; a corrected reference intensity is determined by subtracting the dark level from a reference intensity; and a relative reflectivity is determined by dividing the corrected measured intensity by the corrected reference intensity. To be specific, the relative reflectivity R(λ) can be determined using the equation below.
R(λ)=(E(λ)−D(λ))/(B(λ)−D(λ))
(37) Here, λ represents a wavelength, E(λ) represents the intensity of light at the wavelength λ reflected off the wafer, B(λ) represents a reference intensity at the wavelength λ, and D(λ) is a background intensity (dark level) at the wavelength λ obtained in the state where light is shut out.
(38)
(39) The memory 61 stores a program for driving the processing unit 32 and the polishing control unit 12 and various data, such as a processing program for executing a thickness estimating algorithm, which will be described below, and switching conditions in which the thickness estimating algorithm is switched.
(40) The algorithm setting unit 62 selects, as a setting, a plurality of thickness estimating algorithms to be used for polishing the wafer W and the related application conditions, among a plurality of thickness estimating algorithms stored in the memory 61. This setting can be manually input by an operator through an operation input unit, which is not shown in the drawing. Alternatively, the setting can be automatically made according to a material for the polished layer or its base layer, a target thickness obtained upon termination of the polishing, and the like.
(41) The thickness estimating unit 63 applies the thickness estimating algorithm set through the algorithm setting unit 62 to the spectrum of reflected light generated in the processing unit 32, thereby estimating the thickness of the wafer W being polished. In addition, when the thickness of the wafer W being polished has reached a predetermined value or satisfies a predetermined switching condition, the thickness estimating unit 63 switches the thickness algorithm to be applied. The end detecting unit 64 determines whether the estimated thickness has reached a predetermined target value, and if the estimated thickness has reached the predetermined target value, generates a control signal for commanding the termination of the polishing.
(42) Here, the algorithm for estimating the thickness of the wafer W may be, for example, a reference spectrum (Fitting Error) algorithm, a Fast Fourier Transform (FFT) algorithm, a Peak Valley algorithm, or a Polishing Index algorithm.
(43) In the case of the reference spectrum algorithm, a plurality of spectrum groups including a plurality of reference spectra corresponding to different thicknesses is prepared. A spectrum group including a reference spectrum having a form closest to a spectrum signal (reflectivity spectrum) from the processing unit 32 is selected. Further, during the polishing of the wafer, a measuring spectrum for measuring the thickness is generated, a reference spectrum having the closest form is selected from the selected spectrum group, and a thickness corresponding to the reference spectrum is estimated to be the thickness of the wafer being polished.
(44) In the case of the FFT algorithm, a spectrum signal (reflectivity spectrum) from the processing unit 32 is subjected to Fast Fourier Transform (FFT) to extract a frequency component and the related intensity, and the obtained frequency component is converted into the thickness of a polished layer, by using a predetermined relational expression (a function that represents the thickness of the polished layer and can be determined from a measured value and the like). Accordingly, a frequency spectrum indicating a relationship between the thickness of the polished layer and the intensity of the frequency component is generated. If the peak intensity of the spectrum corresponding to the thickness of the polished layer obtained by conversion from the frequency component exceeds a threshold, the frequency component (the thickness of the polished layer) corresponding to that peak intensity is estimated to be the thickness of the wafer being polished.
(45) In the case of the Peak Valley algorithm, for a spectrum signal (reflectivity spectrum) from the processing unit 32, a wavelength which is an extreme point indicating the extreme value (maximum value or minimum value) is extracted. As the thickness of a polished layer decreases, the wavelength which is the extreme point shifts toward the short wavelength side; therefore, monitoring the extreme point with the polishing of the wafer, the thickness of the polished layer can be estimated. Further, at a plurality of points arranged in the radial direction of the wafer, monitoring the wavelength which is the extreme point allows the profile to be obtained.
(46) In the case of the Polishing Index algorithm, in response to a spectrum signal (reflectivity spectrum) from the processing unit 32, the amount of change in the spectrum per predetermined time is calculated, and the amount of change in the spectrum is summed with polishing time, thereby determining a cumulative amount of change in spectrum. The cumulative amount of change in spectrum monotonously increases with the polishing of the wafer, while the thickness monotonously decreases; therefore, a time point at which the cumulative amount of change in spectrum reaches a predetermined target value can be determined to be the termination of the polishing.
(47)
(48) In the example shown in
(49) Accordingly, in the example shown in
(50) Upon initiation of substrate polishing, the thickness estimating unit 63 estimates the thickness of the wafer W being polished, according to the FFT algorithm (Step S14). Subsequently, whether the thickness is less than or equal to 1 μm (whether the switching condition 1 is satisfied) is determined (Step S15). If the condition is not satisfied (the estimated thickness exceeds 1 μm), the process returns to Step S14 and thickness estimation based on the FFT algorithm is repeated while the wafer W is polished.
(51) On the contrary, if the switching condition 1 is satisfied, the thickness estimating unit 63 estimates the thickness of the wafer W being polished, according to the Fitting Error algorithm (Step S16). Subsequently, whether the thickness is less than or equal to 50 nm (whether the switching condition 2 is satisfied) is determined (Step S17). If the condition is not satisfied (the estimated thickness exceeds 50 nm), the process returns to Step S16 and thickness estimation based on the Fitting Error algorithm is repeated while the wafer W is polished.
(52) If the switching condition 2 is satisfied, the thickness estimating unit 63 estimates the thickness of the wafer W being polished, according to the Polishing Index algorithm (Step S18). Subsequently, whether the thickness has reached a target value is determined (Step S19). If the target value has been reached, the end detecting unit 64 outputs a control signal for terminating substrate polishing (Step S20). On the contrary, if the thickness has not reached the target value, the process returns to Step S18 and thickness estimation based on the Polishing Index algorithm is repeated while the wafer W is polished.
(53) Since the thickness estimating algorithm to be used is switched according to the thickness of the substrate in this manner, application of a thickness estimating algorithm according to the progress of substrate polishing allows for more accurate estimation of thickness and proper detection of the end of substrate polishing.
(54) The method of switching the thickness estimating algorithm according to the present invention is not limited to the above-described example, and appropriate methods can be used in combination.
(55) In this case, if the wafer W to be polished has a thickness exceeding H2, it is difficult to estimate the thickness of the substrate by the Fitting Error algorithm. For this reason, as shown in
(56) In the example shown in
(57) In the above-described embodiment, a single thickness estimating algorithm is applied to each thickness estimation, which may make the estimated thickness discontinuous upon switching between a plurality of algorithms. For this reason, it is preferable to apply a plurality of algorithms concurrently during a certain period before switching.
(58)
(59) Subsequently, if the thicknesses estimated using these two algorithms 1 and 2 match at the H4 (or become at or below a predetermined value) at the time T5, the thickness estimating unit 63 switches the algorithm applied to the algorithm 2. This prevents the estimated thickness from being discontinuous upon switching of the algorithm.
(60)
F(w)=(1−w)×x1+w×x2
Here, x1 represents a thickness estimated using the algorithm 1, x2 represents a thickness estimated using the algorithm 2, and w represents a weighting factor expressed by the equation below where a time T is an input value.
w(T)=(T−T6)/(T7−T6)
(61) In this manner, a plurality of algorithms is concurrently used for thickness estimation while a weighting factor w is used, thereby preventing the estimated thickness from being discontinuous upon switching of the algorithm.
(62) The above embodiment has been described so that those possessing general knowledge in the technical field that the present invention belongs to can implement the present invention. Various modifications of the above embodiment can be made by those skilled in the art as a matter of course, and the technological thought of the present invention can be applied to other embodiments. The present invention should not be limited to the above-described embodiment and should be construed in the widest range according to the technological thought defined by the claims.