Systems and methods for determining film thickness using DC self-bias voltage
10876209 ยท 2020-12-29
Assignee
Inventors
Cpc classification
C23C16/52
CHEMISTRY; METALLURGY
H01J37/32091
ELECTRICITY
H01J37/32174
ELECTRICITY
International classification
C23C16/455
CHEMISTRY; METALLURGY
H01L21/67
ELECTRICITY
Abstract
A controller for a substrate processing chamber includes a film thickness estimating module configured to while a first RF power is provided to generate plasma in the substrate processing chamber, receive a first measurement of a second RF power supplied to a probe, receive a second measurement of a DC self-bias voltage associated with the probe, wherein the second measurement is indicative of a thickness of a film deposited within the substrate processing chamber, and calculate a thickness of the film using the first measurement of the second RF power and the second measurement of the DC self-bias voltage. An operating parameter adjustment module is configured to adjust at least one operating parameter of the substrate processing chamber based on the thickness of the film as calculated by the film thickness estimating module.
Claims
1. A controller for a substrate processing chamber, the controller comprising: a film thickness estimating module configured to while a first radio frequency (RF) power is provided to generate plasma in the substrate processing chamber, receive a first measurement of a second RF power supplied to a probe, receive a second measurement of a DC self-bias voltage associated with the probe, wherein the second measurement is indicative of a thickness of a film deposited within the substrate processing chamber, and calculate a thickness of the film using the first measurement of the second RF power and the second measurement of the DC self-bias voltage; and an operating parameter adjustment module configured to adjust at least one operating parameter of the substrate processing chamber based on the thickness of the film as calculated by the film thickness estimating module.
2. The controller of claim 1, wherein (i) the film thickness estimating module is configured to determine a rate of change in the thickness of the film and (ii) the operating parameter adjustment module is configured to adjust the at least one operating parameter based on the rate of change in the thickness of the film.
3. The controller of claim 1, wherein the film thickness estimating module is configured to calculate the thickness of the film based on a difference between the DC self-bias voltage and a third measurement corresponding to a voltage on a surface of the probe.
4. A system comprising the controller of claim 1 and further comprising: the probe; and a first capacitor, wherein the DC self-bias voltage is measured across the first capacitor.
5. The system of claim 4, wherein the first capacitor is connected between the probe and a power source of the second RF power.
6. The system of claim 5, wherein the second RF power is less than the first RF power.
7. The system of claim 4, further comprising a voltage sensor circuit configured to measure the DC self-bias voltage.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The present disclosure will become more fully understood from the detailed description and the accompanying drawings, wherein:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
DETAILED DESCRIPTION
(16) The present disclosure describes the use of DC self-bias voltage in substrate processing systems using capacitively coupled plasma (CCP). The present disclosure monitors changes in DC self-bias voltage and/or bleed current to estimate other system parameters, for diagnostic purposes and/or for control of the substrate processing system.
(17) Referring now to
(18) The pedestal 134 may include a chuck, a fork, or lift pins (all not shown) to hold and transfer a substrate 136 during and between deposition and/or plasma treatment reactions. The chuck may be an electrostatic chuck, a mechanical chuck or various other types of chuck.
(19) The process gases are introduced to the showerhead 110 via inlet 142. Multiple process gas lines are connected to a manifold 150. The process gases may be premixed or not. Appropriate valves and mass flow controllers (generally identified at 144-1, 144-2, and 144-3) are employed to ensure that the correct gases and flow rates are used during substrate processing. Process gases exit the process chamber 102 via an outlet 160. A vacuum pump 164 typically draws process gases out of the process chamber 102 and maintains a suitably low pressure within the reactor by a flow restriction device, such as a valve 166. A controller 168 may sense operating parameters such as chamber pressure and temperature inside the processing chamber using sensors 170 and 172. The controller 168 may control the valves and mass flow controllers 144-1, 144-2 and 144-3. The controller 168 may also control the plasma power source 120.
Measurement of Film Thickness
(20) Referring now to
(21) Application of RF energy to a plasma through a coupling (blocking) capacitor typically results in a DC self-bias voltage across that capacitor. A discussion of how this voltage is formed is given in Y. P. Song et. al., J Phys. D. Appl. Ph., V23 (1990), p. 673-681, which is hereby incorporated by reference in its entirety. The approach detailed in Song et. al. focuses on the current flows in the system and employs the fact that the time averaged DC currents through the system must equal zero. As such it may be described as a current centric approach. An alternative discussion of DC self-bias is given by K. Kohler et. al., J. Appl. Phys. 57 (1), January 1985 p. 58-66 which is hereby incorporated by reference in its entirety. This second approach notes that in the high electron mobility in all such systems forces the plasma potential to always achieve the highest positive potential in the system so that electron flow to walls or other surfaces is sufficiently low as to permit an equilibrium steady state condition. This results in several voltage conditions that must be satisfied if an equilibrium state is to occur. As such this approach may be described as a voltage centric approach. Thus a DC self-bias voltage is developed across the blocking capacitor C.sub.B2. If no deposited film is present (i.e. C.sub.film=), then the measured DC self-bias voltage is the same as the DC self-bias voltage that would be measured on the surface of the probe 180 in contact with plasma. However if a film is deposited, there will be an additional capacitance added to this system in the form of C.sub.film. For this case, the DC self-bias voltage of the surface in contact with the plasma does not change, but the measured DC self-bias voltage will change as a result of DC voltage division between C.sub.film and the blocking capacitor C.sub.B2. This situation can be described by the equations below:
(22)
where A.sub.f is the area of the electrode, d is the thickness of the film. From these equations, the change in the measured DC self-bias voltage will yield a measure of the deposited film capacitance. If the relative permittivity of this capacitance is known, the thickness of the deposited film can be determined.
(23) In
(24) In
Estimating Electrode Area Ratio, Bohm Current, and/or Rf Voltage at the Powered Electrode
(25) The discussion of Y. P. Song et. al., J Phys. D. Appl. Ph., V23 (1990), p. 673-681 does not included analysis of the effect of a DC (onlyno RF) current drawn from the RF powered electrode (be it either the showerhead or pedestal). This may be done by introducing an RF filter to prevent RF current from being drawn. However, the ideas presented by Y. P. Song et. al., J Phys. D. Appl. Ph., V23 (1990), p. 673-681 can be adapted to form an analysis that gives the DC self-bias voltage response to a DC current drawn (or bled) from a powered electrode. As will be discussed below, this relationship between the bleed current and DC self-bias voltage can be used to predict DC self-bias voltage change in terms of electrode areas and Bohm current density to each electrode. In the following analysis, a current based approach will be used as discussed by Y. P. Song et. al. In the current based approach, the DC current averaged over an RF cycle must equal zero. This is the same as imposing the condition that no net DC current can flow through this system because of the presence of the blocking capacitor (C.sub.b in
(26) As will be described more fully below, a resistance of a variable bleed resistor Rv is adjusted to vary a bleed current iR. For each value of the bleed resistor, the DC self-bias voltage and the bleed current values are recorded. Curve fitting is performed based on the pairs of values and a relationship (derived below) to estimate values for the effective electrode area ratio
(27)
Bohm current i.sub.B and RF voltage at the electrode.
(28) The Bohm current density J.sub.B (derived as i.sub.B divided by the electrode area) may also be used to estimate plasma density, which can be used as a feedback parameter to control an operating parameter of the substrate processing system and/or for diagnostic purposes. Likewise, the effective electrode area ratio and the RF voltage may be used for diagnostic purposes and/or as a feedback parameter to control an operating parameter of the substrate processing system. Changes in the effective electrode area ratios can signal hardware failures, presence of excessive parasitic plasma or an unwanted coating on the inner surface of the plasma chamber.
(29) Referring now to
(30) In
(31) For a current based approach, several conditions must be satisfied. Current over an RF cycle to an electrode averages to zero if the bleed current is zero. If the bleed current is non-zero, the current over an RF cycle averages to the bleed current.
(32) Referring now to
(33)
(34) Based on (5) and (6), the current balance relationship can be written as:
(35)
where J.sub.Ba and J.sub.Bb are Bohm current densities at sheaths a and b, respectively. Using equation (6), the current balance relationship can be rewritten as:
(36)
(37) There is a relationship between the voltage at t.sub.1 or V(t.sub.1) and the voltages V.sub.DC_bias and V.sub.RF. The floating potential is assumed to be relatively small. Therefore:
V.sub.b(t.sub.1)0=V.sub.RF sin(t.sub.1)V.sub.DC_bias;
V.sub.DC_bias=V.sub.RF sin(t.sup.1);(8)
(38) Using the relationship between t.sub.a and t.sub.b1:
(39)
Inserting (7) and (8) into (9) yields:
(40)
which can be rearranged as:
(41)
(42) Assuming that the plasma has an electron temperature T.sub.e and that the sheath edge plasma densities are the same at both sheath edges, the Bohm current densities are the same at each sheath, or J.sub.Ba=J.sub.Bb=J.sub.B. As a result, equation (10) can be rewritten as follows:
(43)
(44) As can be seen in equation (11),
(45)
depends on the ratio of the total bleed current i.sub.R to the total Bohm current i.sub.B (or Bohm current density times the electrode area J.sub.BA.sub.b) and on the electrode area ratio
(46)
For small values of bleed current i.sub.R and
(47)
the ratio of
(48)
varies nearly linearly with bleed current i.sub.R. For
(49)
the ratio of
(50)
departs from linearity as bleed current i.sub.R becomes a significant fraction of the Bohm current i.sub.B.
(51) Referring now to
(52) In one example, plasma was generated using nitrogen N.sub.2 at a pressure of 2.5 Torr and RF power of 200 Watts at 13.56 MHz. Work was done at 2.5 Torr because the plasma was observed to be well localized to the electrodes. This lack of plasma spreading provided a constant effective electrode area ratio and allowed geometric estimation of the effective electrode area ratio. A DC power supply was connected to the showerhead via a filter, which provided 35 dB attenuation at 13.56 MHz. Bleed current i.sub.R and DC self-bias voltage pairs were measured using multiple different bleed resistor values set by a variable bleed resistor R.sub.V.
(53) In
(54) In
(55) Referring now to
(56)
was 0.825 and the amplitude of the applied RF voltage V.sub.RF was 220 V. As can be seen, the model data fits with the experimental results. The model departs from linearly as the magnitude of the bleed current i.sub.R increases. The experimental electrode area ratio
(57)
was 0.825 and the model-fit electrode area ratio
(58)
was 0.899. The expected Bohm current i.sub.B was 90.70 mA and the model-fit Bohm current i.sub.B was 94.464 mA. These values for i.sub.B may divided by the electrode area A.sub.b (if known) to obtain the corresponding Bohm current density values J.sub.B.
(59) By fitting the model to the DC self-bias voltage V.sub.DC_bias and bleed current i.sub.R, the electrode area ratio
(60)
total Bohm current J.sub.B received by powered electrode, and the RF voltage at the powered electrode V.sub.RF can be determined. The RF voltage at the powered electrode V.sub.RF can also be measured using an oscilloscope. If the RF voltage at the powered electrode V.sub.RF is measured, the model reduces to two unknowns.
(61) The model does not require knowledge of plasma density or electron temperature. If the electron temperature T.sub.e, the electrode area A.sub.b and the atomic mass of the ion species is known, a Bohm argument can be used to determine plasma density from the fitted Bohm current i.sub.B. A differential bias resistance can also be derived by differentiating equation (11) with respect to bleed current i.sub.R.
(62) Referring now to
(63)
(64) The voltage-based approach also provides good results as well. The fitted slope was 0.204. The experimental area ratio
(65)
was 0.815. The implied area ratio
(66)
was 0.825. The model fit area ratio
(67)
was 0.899.
(68) With a few additional assumptions (electron temperature T.sub.e, electrode area A and mass of the ion species (M.sub.i)), the plasma density can also be estimated from the Bohm current using the relationship set forth below:
(69)
where n is the plasma density, A is the electrode area, M.sub.i is the mass of the ion species in the plasma, k is Bolzman's constant and T.sub.e is the electron temperature (in units of eV). The plasma density can be used to vary an operating parameter of the substrate processing system.
(70) In another example, the time dependent characteristics of the DC self-bias voltage are monitored as the variable resistor R.sub.v is switched into or out of the circuit or removed entirely from the circuit. For example only, a first value of R.sub.v is used to bleed current to ground. This will cause a shift in the DC self-bias voltage in accordance with the equations given above. Then, the resistor is removed from the circuit by opening the circuit. At this point, no bleed current is drawn and the DC self-bias voltage will transition over a brief period of time to a value corresponding to no bleed current as indicated in the equations given above. An example of this behavior is illustrated in
(71) Alternatively the variable resistor R.sub.v can switch from one resistance value to another resistance value and then the change in DC self-bias behavior can be monitored over time. The behavior of the DC self-bias voltage during the recovery time shows a characteristic shape and recovery time similar to (but not the same as) that seen for an RC circuit. There will be a charging of the blocking capacitance but the charging current is mediated by the Bohm sheath mechanism. For example, an expression of the form below can be fit to the recovery of the DC bias voltage:
(72)
where is a time constant and to is the onset of the voltage change. Tracking of changes in provides a parameter by which changes in the system can be detected.
(73) Referring now to
(74) Referring now to
(75) In this application, including the definitions below, the term controller or module may be replaced with the term circuit. The term module may refer to, be part of, or include an Application Specific Integrated Circuit (ASIC); a digital, analog, or mixed analog/digital discrete circuit; a digital, analog, or mixed analog/digital integrated circuit; a combinational logic circuit; a field programmable gate array (FPGA); a processor (shared, dedicated, or group) that executes code; memory (shared, dedicated, or group) that stores code executed by a processor; other suitable hardware components that provide the described functionality; or a combination of some or all of the above, such as in a system-on-chip.
(76) The term code, as used above, may include software, firmware, and/or microcode, and may refer to programs, routines, functions, classes, and/or objects. The term shared processor encompasses a single processor that executes some or all code from multiple modules. The term group processor encompasses a processor that, in combination with additional processors, executes some or all code from one or more modules. The term shared memory encompasses a single memory that stores some or all code from multiple modules. The term group memory encompasses a memory that, in combination with additional memories, stores some or all code from one or more modules. The term memory may be a subset of the term computer-readable medium. The term computer-readable medium does not encompass transitory electrical and electromagnetic signals propagating through a medium, and may therefore be considered tangible and non-transitory. Non-limiting examples of a non-transitory tangible computer readable medium include nonvolatile memory, volatile memory, magnetic storage, and optical storage.
(77) The apparatuses and methods described in this application may be partially or fully implemented by one or more computer programs executed by one or more processors. The computer programs include processor-executable instructions that are stored on at least one non-transitory tangible computer readable medium. The computer programs may also include and/or rely on stored data.
(78) The foregoing description is merely illustrative in nature and is in no way intended to limit the disclosure, its application, or uses. The broad teachings of the disclosure can be implemented in a variety of forms. Therefore, while this disclosure includes particular examples, the true scope of the disclosure should not be so limited since other modifications will become apparent upon a study of the drawings, the specification, and the following claims. For purposes of clarity, the same reference numbers will be used in the drawings to identify similar elements. As used herein, the phrase of least one of A, B, and C should be construed to mean a logical (A or B or C), using a non-exclusive logical OR. It should be understood that one or more steps within a method may be executed in different order (or concurrently) without altering the principles of the present disclosure.