Miniature ultra-low-power LNA employing current reuse and bias sharing
12095424 ยท 2024-09-17
Assignee
Inventors
Cpc classification
H03F1/26
ELECTRICITY
H03F2200/87
ELECTRICITY
H03F2200/72
ELECTRICITY
International classification
Abstract
A low noise amplifier (LNA) offering one or more of the following benefits: increased gain, reduced power consumption, and/or reduced area, while achieving a similar noise figure, is disclosed. The LNA achieves these benefits by employing an inductorless chip design, current reuse among the transistors, bias sharing, limited AC coupling capacitors, common gate input device feedback, and careful sizing of the transistors.
Claims
1. A low noise amplifier comprising: an input adapted to receive a signal; a first transistor of a first type, a source of the first transistor coupled to the input; a second transistor of the first type, a source of the second transistor coupled a drain of the first transistor; a positive output adapted to output a positive polarity of an amplified signal, the positive output coupled to a drain of the second transistor; a third transistor of the first type, a source of the second transistor coupled to a gate of the third transistor and a source of the third transistor coupled to a gate of the second transistor; a negative output adapted to output a negative polarity of the amplified signal, the negative output coupled to a drain of the third transistor; a fourth transistor of the first type, a drain of the fourth transistor coupled to the source of the third transistor and a gate of the first transistor, a gate of the fourth transistor coupled to the input; a fifth transistor of a second type, a gate of the fifth transistor coupled to the input; and a sixth transistor of the first type, a drain of the sixth transistor coupled a source of the fifth transistor, and a gate of the sixth transistor coupled to the input.
2. The low noise amplifier of claim 1, wherein the first type is NMOS, and the second type is PMOS.
3. The low noise amplifier of claim 1, further comprising an inductor coupled to the input.
4. The low noise amplifier of claim 1, wherein the fifth and sixth transistors implement an inverter function.
5. The low noise amplifier of claim 1, wherein the first, fourth, and sixth transistors have a same length.
6. The low noise amplifier of claim 1, wherein the first, fourth, and sixth transistors have a same finger width.
7. The low noise amplifier of claim 1, wherein sizes of the first, second, third, fourth, and sixth transistors are adapted to produce a voltage at the source of the second transistor that is substantially equal to a voltage at the source of the third transistor when the low noise amplifier is operating.
8. The low noise amplifier of claim 1, wherein the low noise amplifier is adapted to be biased by a current through the fifth transistor when the low noise amplifier is operating.
9. The low noise amplifier of claim 1, wherein an input impedance at the input is approximately 50? or 75?.
10. A low noise amplifier comprising: a first leg including first and second transistors, the first and second transistors having a cascode configuration, with a drain of the first transistor being coupled to a source of the second transistor; a second leg cross-coupled to the first leg, the second leg including third, fourth, fifth, and sixth transistors, the fifth and sixth transistors having an inverter configuration and being coupled to the third and fourth transistors having a cascode configuration, with a drain of the fourth transistor being coupled to a source of the third transistor, and a gate of the first transistor being coupled to the source of the third transistor; an input adapted to receive a signal, the input coupled to the first leg; a positive output adapted to output a positive polarity of an amplified signal, the positive output coupled to the first leg; and a negative output adapted to output a negative polarity of the amplified signal, the negative output coupled to the second leg.
11. The low noise amplifier of claim 10, wherein the first, second, third, fourth, and sixth transistors are NMOS transistors, and the fifth transistor is a PMOS transistor.
12. The low noise amplifier of claim 10, wherein the second and third transistors have a cross-coupled configuration.
13. The low noise amplifier of claim 10, wherein the first leg further includes an inductor coupled to the input.
14. The low noise amplifier of claim 10, wherein the first, fourth, and sixth transistors have a same length.
15. The low noise amplifier of claim 10, wherein the first, fourth, and sixth transistors have a same finger width.
16. The low noise amplifier of claim 10, wherein sizes of the first, second, third, fourth, and sixth transistors are adapted to produce a voltage at the source of the second transistor that is substantially equal to a voltage at the source of the third transistor when the low noise amplifier is operating.
17. The low noise amplifier of claim 10, wherein the low noise amplifier is adapted to be biased by a current through the fifth transistor.
18. The low noise amplifier of claim 10, wherein an input impedance at the input is approximately 50? or 75?.
Description
BRIEF DESCRIPTION OF DRAWINGS
(1) The drawings illustrate several embodiments of the invention, wherein identical reference numerals refer to identical or similar elements or features in different views or embodiments shown in the drawings. The drawings are not to scale and are intended only to illustrate the elements of various embodiments of the present invention.
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
DETAILED DESCRIPTION OF THE INVENTION
(14)
(15) In this illustrated embodiment, the input sampling stage 120 is an 8-phase switched capacitor network sampling at the full sampling rate of F.sub.S. While the input sampling stage 120 illustrated in
(16) As illustrated in
(17) While the programmable delay stage 130 illustrated in
(18) Each of the eight programmable switched capacitor banks 131.sub.0-131.sub.7 is coupled to a corresponding output reconstruction switch 142.sub.0-142.sub.7 via a corresponding optional output buffer 141.sub.0-141.sub.7 in an output reconstruction stage 140. The signals output by the eight optional output buffer 141.sub.0-141.sub.7 are termed output reconstruction signals. In this embodiment, the output reconstruction stage 140 is an 8-phase switching network operating at the full sampling rate of F.sub.S. The output reconstruction stage 140, based on the sequential switching of the output reconstruction switches 142.sub.0-142.sub.7, outputs a reconstructed delayed output RF signal 150, that corresponds to a programmed time delayed version of the input RF signal 110.
(19) The programmable delay device 100 illustrated in
(20)
(21) As reflected by the first two lines 210.sub.0, 210.sub.1, the input signal 110 is sampled at a sampling rate of F.sub.S with a period of T=1/Fs, with the first two sampling switched capacitor storage elements 121.sub.0, 121.sub.1 sampling the first ?th T and second ?th T, respectively. The first line 210.sub.0 shows the sampling of the input signal 110 that generates input sampled signals 1, 9, 17, etc., while the second line 210.sub.1 shows the sampling of the input signal 110 that generates input sampled signals 2, 10, 18, etc. The third line 220.sub.0,0 shows the storage of input sampled signal 1 by the first delay switched capacitor storage element 132.sub.0 in the first programmable switched capacitor bank 131.sub.0 as delay sampled switched signal 1. The fourth line 220.sub.0,1 shows the storage of input sampled signal 9 by the second delay switched capacitor storage element 132.sub.1 in the first programmable switched capacitor bank 131.sub.0 as delay sampled switched signal 9. The fifth line 220.sub.1,0 shows the storage of input sampled signal 2 by the first delay switched capacitor storage element 132.sub.0 in the second programmable switched capacitor bank 131.sub.1 as delay sampled switched signal 2. The sixth line 230.sub.0,0 shows the passing of delay sampled switched signal 1 stored by the first delay switched capacitor storage element 132.sub.0 in the first programmable switched capacitor bank 131.sub.0 to the output reconstruction switch 142.sub.0 via the output buffer 141.sub.0 as output reconstruction signal 1. The seventh line 230.sub.1,0 shows the passing of delay sampled switched signal 2 stored by the first delay switched capacitor storage element 132.sub.0 in the second programmable switched capacitor bank 131.sub.1 to the output reconstruction switch 142.sub.1 via the output buffer 1411 as output reconstruction signal 2.
(22) While the settling time T is 1/Fs in the input sampling stage 120, settling time expansion is created in the programmable delay stage 130 by allowing sample transfer from the input sampling stage 120 to the programmable delay stage 130 to continue during the input sampling stage 120 hold time. With the expanded sample time, the sampler bandwidth required in the programmable delay stage 130 is greatly reduced. This allows the use of much smaller input switched bank switches 133.sub.0-133.sub.185 in the delay switched capacitor storage elements 132.sub.0-132.sub.185 of the programmable delay stage 130, which in turn enables a large reduction in OFF state sample leakage. This leakage reduction enables a corresponding increase in the maximum achievable hold time, which is key to achieving more than 100 ns of delay. (The programmable delay device 100 illustrated in
(23) To reduce timing skew sensitivity, the programmable delay stage 130 input clock signal PI.sub.x,y transitions prior to the input sampling stage 120 input sample clock signal P.sub.x, where x corresponds to the path in the input sampling stage 120 (i.e., it has a value from 0 to 7) and y corresponds to the path in the programmable delay stage 130 (i.e., it has a value from 0 to 185). Thus, the programmable delay stage 130 input is static during clock transitions (e.g., PI.sub.1,0 before P.sub.1). After the programmed delay, a programmable delay stage 130 output clock signal PO.sub.x,y initiates the transfer of the delay sampled switched signal to the input of the corresponding output buffer 141.sub.0-141.sub.7, again time expanded. The output buffers 141.sub.0-141.sub.7 output the delay sampled switched signals employing the same 8-phase clock timing as the input sampling stage 120 (P.sub.x) as corresponding output reconstruction signals. Timing skew is again mitigated by transitioning the programmable delay stage 130 output clock signal PO.sub.x,y after the output reconstruction stage 140 output clock signal P.sub.x. The input and output clocks in the programmable delay stage 130 are generated by two separate, but synchronous, divide-by-186 clocks, as will be described below with reference to
(24)
(25) The LNA 304, illustrated in
(26) Details of the design of the LNA 304 are as follows. The source of a common gate transistor 304A is directly connected to the RF input signal, while the common gate transistor 304A is DC biased by a single off-chip inductor 304B to ground. To save power, the gate of the common gate transistor 304A is connected in feedback to the output of a second leg 304C having an inverter-plus-common-source configuration. The input impedance of the LNA 304 is 1/(g.sub.m*(1+g.sub.net)) where g.sub.m is the transconductance of transistor 304A and g.sub.net can be approximated as the combined transconductances of transistors 304D, 304E, and 304H divided by the transconductance of transistor 304G given that the combined transconductances of transistors 304D and 304E are much larger than the transconductance of transistor 304A. This enables, for example, an approximately 50? or 75? input impedance match with the external input circuitry (the most common desired impedances).
(27) Further, this provides significant power savings, at least a factor of 2?, via current reuse through both the PMOS and NMOS transistors 304D, 304E in the inverter, which add to the transconductance of transistor 304H. A further benefit is that the size of the common gate transistor 304A can be optimized by the feedback gain provided by the inverter path.
(28) The cross-coupled transistors 304F, 304G provide good gain balance between the differential outputs of the balun LNA 304. Differential signals see between 1/g.sub.m and 1/(2*g.sub.m) looking into the sources of the cascode transistors 304F, 304G, while common mode signals see ideally infinite impedance, i.e., they are rejected before being output. It should be noted that only two devices are stacked in the design of the LNA 304, which allows for low voltage operation, for example, 1 V or less, as may be found with advanced node CMOS.
(29) One of the key innovations of this design of the LNA 304 is the biasing structure, which saves significant area and increases the gain of the LNA 304. This enables a DC short of the output of the inverter transistors 304D, 304E and node B. Without this DC short, a DC blocking capacitor is required between these nodes, which is placed into a low impedance 1/g.sub.m node. This would necessitate a very large capacitor, which takes up considerable area and creates considerable loss in the gain of the LNA.
(30) The design of the LNA 304 requires careful sizing of the NMOS transistors 304A, 304E, 304F, 304G, 304H. Proper sizing results in the LNA 304 at DC being biased such that nodes A and B are at substantially the same voltage level across process, supply voltage, and temperature variation. By making the common gate transistor 304A, the common source transistor 304H, and the NMOS transistor 304E of the inverter with the same length and per device finger width, but varying the transistor finger number for device sizing, each of these NMOS transistors can share a gate and drain voltage at DC and have a bias current that is exactly the ratio of the number of fingers used in each transistor. This design thus allows for the entire LNA 304 to be current biased using the current in the PMOS transistor 304D in the inverter portion.
(31) The various coupling capacitors feed nodes with high impedances, making them very small in size, thereby ensuring a small overall layout size for the LNA 304.
(32) Table 1 provides simulation performance details comparing one embodiment of the present invention with various prior art LNAs. In Table 1, Bozorg 2021 corresponds to A. Bozorg et al. A 0.02-4.5-GHz LN(T)A in 28-nm CMOS for 5G Exploiting Noise Reduction and Current Reuse, Yu 2019 corresponds to H. Yu et al., A 0.044-mm.sup.2 0.5to ?7-GHz Resistor-Plus-Source-Follower-Feedback Noise-Cancelling LNA Achieving a Flat NF of 3.3?0.45 dB, and Regulagadda 2019 corresponds to S. S. Regulagadda et al., A Packaged Noise-Canceling High-Gain Wideband Low Noise Amplifier. As shown in Table 1, this implemented embodiment of the present invention yielded simulated performance after extraction of the proposed design of ?7 dB more gain, a similar noise figure, and 25-320% power savings over other prior art LNAs, while consuming 19-100? less area.
(33) TABLE-US-00001 TABLE 1 LNA performance comparison. Present Bozorg Regulagadda Invention 2021 Yu 2019 2019 Freq 0.1-6.0 GHz 0.02-4.5 GHz 0.5-7.0 GHz 0.4-2.2 GHz Process 45 nm 28 nm 65 nm 65 nm Gain 24 dB 15 dB 17 dB 16.4 dB NF 3.5 dB 3.2 dB 3.3 dB 2.5 dB IIP3 ?3.6 dBm ?4.6 dBm ?4.5 dBm ?5.0 dBm Power 3.5 mW 4.5 mW 11.3 mW 29 mW Size 0.0016 mm.sup.2 0.03 mm.sup.2 0.044 mm.sup.2 0.16 mm.sup.2
(34) The optional delay buffers 314 of the programmable switched capacitor banks each employs a dynamic inverter clocked at both VSS and VDD by POy, where one of the 186 delay buffers 314 in each path are enabled at a time and all 186 delay buffers 314 share an optional self-biased inverter load 316 for common-mode stability in each of the 8 delay paths. The optional output buffers 318, which incorporate the non-optional output reconstruction switches, are preferably closely placed in the layout for unity gain matching and employ dynamic common-source amplifiers with a shared resistive load. The output reconstruction switches 318 output corresponding output reconstruction signals. An optional device output buffer 320, coupled to the output reconstruction switches 318 provides balun and matching operation and employs a common-source amplifier and push-pull output stage (see
(35) Clocking is provided from an input clock 330 at a frequency F.sub.S through a divide-by-2 (F.sub.CLK=2F.sub.S) clock 332. While the programmable delay device 300 illustrated in
(36) The programmable delay operates as follows. The desired delay value Z (between 1 and 185 in the illustrated embodiment), corresponding to delays of 8/Fs to 1480/Fs in 8/Fs increments, is entered using a serial programming interface 350 by an external programming source. The serial programming interface 350 outputs the desired delay value Z, which generates a delay of Z*8/Fs, to a digital counter in the enable timing circuit 352. The digital counter is enabled at the same time as the input divide-by-186 clocks 340, which generate the input clock signals PI.sub.x,y. The digital counter counts up to the programmed delay value Z. Once the digital counter reaches Z, the output divide-by-186 clocks 344, which generate output clock signals PO.sub.x,y, are enabled, thereby causing the first sample to transfer to the corresponding output reconstruction switch 318. The output clock signals PO.sub.x,y continue to cause the transfer of samples to the output reconstruction switches 318 indefinitely and are delayed relative to the input clock signals PI.sub.x,y by the desired delay Z*8/Fs.
(37) A programmable delay device in accordance with at least one embodiment was implemented in a 45 nm SOI CMOS process, resulting in a 4 mm.sup.2 chip area and 1.36 mm.sup.2 active area, as illustrated in the photomicrograph of
(38) Delay performance was verified across all delay settings, as illustrated in
(39)
(40) Table 2 provides performance details comparing one embodiment of the present invention with various prior art delay devices. In Table 2, ISSCC 2021 corresponds to Nagulu, ISSCC 2012 corresponds to Garakoui, JSSC 2017 corresponds to Mondal, and SSCL 2020 corresponds to M. Li et al., An 800-ps Origami True-Time-Delay-Based CMOS Receiver Front End for 6.5-9-GHz Phased Arrays, IEEE Solid-State Circuit Letters, vol. 3, pp. 382-385 (2020), the contents of which are incorporated herein by reference. As shown in Table 2, this implemented embodiment of the present invention yielded a factor of 9? improvement in area efficiency and nearly a factor of 60? increase in maximum delay relative to the best prior art.
(41) TABLE-US-00002 TABLE 2 Delay device performance comparison. Present ISSCC ISSCC JSSC SSCL Invention 2021 2012 2017 2020 Design Delay SIC 4 Channel Delay Delay Element Receiver Beamformer Element Element + Attenuator Archi- TIMS Switched- G.sub.m-C G.sub.m-C Delay Line tecture Switched-Cap Cap Delay 0.2-2.0 GHz 0.1-1.0 GHz 1.0-2.5 GHz 0.1-2.0 GHz 6.5-9.0 GHz Frequency Range 3 dB 0.2-1.1 GHz.sup.a 0.1-0.5 GHz.sup.b 1.0-2.5 GHz 0.1-2.0 GHz 6.5-9.0 GHz Bandwidth Max Delay 448.6 ns.sup.a 7.75 ns.sup.b 0.55 ns 1.7 ns 0.8 ns Delay per 330 ns/mm.sup.2,a 37 ns/mm.sup.2,b 7.9 ns/mm.sup.2 5.9 ns/mm.sup.2 0.4 ns/mm.sup.2 Unit Area Delay 175.9x 31x.sup.b 39.3x.sup.c 6.8x 32x.sup.c Range Gain 24 dB ?18 dB.sup.b 12 dB 0.6 dB 18 dB Noise 7.1 dB 8.0 dB 23 dB 3.6 dB Figure IP1dB ?27 dBm ?21 dBm ?13 dBm ?17 dBm Power 80 mW.sup.a 7.4 mW.sup.b 90 mW.sup.d 364 mW 107 mW Tech- 45 nm 65 nm 140 nm 130 nm 65 nm nology SOI CMOS CMOS CMOS CMOS CMOS Delay 1.36 mm.sup.2 0.21 mm.sup.2,b 0.07 mm.sup.2 0.29 mm.sup.2 2.25 mm.sup.2 Active Area .sup.aFs = 3.3 GHz, .sup.bMax delay element, .sup.cBased on delay step, .sup.dSingle channel
(42) Additional characterization of the programmable delay device 300 revealed higher than desired clock feed-through to the RF output 322 and relied on off-chip currents to properly bias the on-chip amplifiers. This led to a second programmable delay device 900 in accordance with another embodiment of the present invention.
(43) Since complementary switches were employed in the switched-capacitor circuit switches and the chosen CMOS process has equal strength PMOS and NMOS devices, extremely small clock feed-through was found from this mechanism in the original programmable delay device 300. Simulation of DC offsets in the buffers, including buffers 310 and device output buffers 318, employed within the switched-capacitor circuits were found to produce the clock feed-through levels found in the original programmable delay device 300. Specifically, buffers 310 and device output buffers 318 created the largest spurious tones at F.sub.S/8 since they repeat every 8 clock cycles. The self-biased inverter load 316 also shares the F.sub.S/8 response since there are only 8 in the original programmable delay device 300. The delay buffers 314 produced negligible spurious tones since they only repeat once every 1480 clock cycles and each of the delay buffers 314 produces a random DC offset value.
(44) To reduce DC offsets in the amplifiers, two approaches were used. Since mismatch limited DC offsets are directly reduced through increased device sizing (width times length), the length of the amplifiers in both the buffers 910 and the self-biased inverter loads 920 were increased from 40 nm to 232 nm for a DC offset reduction of ?30?. The device output buffers 318 were completely removed and replaced with an intermediate buffer 940 after the output reconstruction switches 930 where a DC blocking capacitor could be employed to remove DC offset. The device output buffers 318 could be removed since the parasitic routing capacitance from the outputs of the delay buffers 314 to the output reconstruction switches 930 was significantly higher than the input capacitance of the device output buffers 318, therefore limiting memory effects and gain reduction from the removal of the output buffers 318. The intermediate buffer 940 is followed by an output buffer 950.
(45) The programmable delay device 900 includes an on-chip bandgap reference 960. This bandgap reference 960 provides all reference currents needed by the various RF circuits. The programmable delay device 900 includes the digital scan chain SPI interface 350, which enables programming of gain and calibration of the bandgap reference 960.
(46) Lastly, the divide-by-2 (F.sub.CLK=2F.sub.S) clock 332 has been upgraded. The clock divider 970 is now a programmable divide by 1/2/4 clock divider for greater flexibility. This upgraded clock divider 970 allows operation of the programmable delay device 900 over a wider range of applications operating at a wider range of clock frequencies F.sub.CLK.
(47) Characterization of the programmable delay device 900 showed several improvements over the original programmable delay device 300. F.sub.CLK was verified to properly operate from 2 GHz to 13 GHz with a minimum required input power of less than ?10 dBm. The supported ranges, combined with the on-chip programmable clock divider 970, provide system flexibility in clock frequency, delay range, and frequency coverage. At F.sub.S=3.3 GHz and F.sub.CLK=6.6 GHz, the device consumes 74 mW (3.5 mW LNA 304, 3.5 mW output buffer 950, 29 mW clocking, 38 mW delay buffers 304) from a 1V core supply and less than 250 ?W from a 1.8V supply used for digital I/O and the bandgap reference 960.
(48) Delay measurements were repeated inside a temperature chamber over a temperature range of ?40? C. to 85? C. as shown in
(49) The noise figure was measured at room temperature and gain was verified over a temperature range of ?40? C. to 85? C. during the delay measurements as illustrated in
(50)
(51) The invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described embodiments are to be considered in all respects only as illustrative and not restrictive. The scope of the invention is, therefore, indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.