CIRCUITS AND METHODS FOR MULTI-PHASE CLOCK GENERATORS AND PHASE INTERPOLATORS
20220244755 · 2022-08-04
Inventors
Cpc classification
H03B2200/0074
ELECTRICITY
H03B27/00
ELECTRICITY
International classification
H03B27/00
ELECTRICITY
Abstract
Circuits and methods for multi-phase clock generators and phase interpolators are provided. The multi-phase clock generators include a delay line and multi-phase injection locked oscillator. At each stage of the multi-phase injection locked oscillator, injection currents are provided from a corresponding stage of the delay line. Outputs of the multi-phase injection locked oscillator and provided to mixers which produce inputs to an operational transconductance amplifier which provides feedback to the delay line and the multi-phase injection locked oscillator. The phase interpolator uses a technique of flipping certain input clock signals to reduce the number of components required while still being able to interpolate phase over 360 degrees and to reduce noise.
Claims
1. A circuit for a multi-phase clock generator, comprising: a delay line comprising a first plurality of differential unit delay cells, wherein each of the first plurality of differential unit delay cells has a pair of clock inputs and a pair clock outputs, wherein the first plurality of differential unit delay cells are connected in series such that the pair of clock outputs of a first of the first plurality of differential unit delay cells is connected to the pair of clock inputs of a second of the first plurality of differential unit delay cells, and wherein each unit cell of the first plurality of differential unit delay cells outputs a pair of clock signals having different phases than each pair of clock signals output by other of the first plurality of differential unit delay cells; and a ring oscillator comprising a second plurality of differential unit delay cells, wherein each of the second plurality of differential unit delay cells has a pair of clock inputs, a pair of current injection inputs, and a pair clock outputs, wherein the pair of current injection inputs of each of the second plurality of differential unit delay cells is coupled to the pair of clock outputs of a corresponding one of the first plurality of differential unit delay cells, wherein the second plurality of differential unit delay cells are connected in series such that the pair of clock outputs of a first of the second plurality of differential unit delay cells is connected to the pair of inputs of a second of the second plurality of differential unit delay cells, wherein the pair of outputs of a last of the second plurality of differential unit delay cells are flipped and connected to the pair of inputs of a first of the second plurality of differential unit delay cells, and wherein each unit cell of the second plurality of differential unit cells outputs a pair of clock signals having different phases than each pair of clock signals output by other of the second plurality of differential unit delay cells.
2. The circuit of claim 1, wherein the delay line further comprises a dummy unit cell connected to a last of the first plurality of differential unit delay cells.
3. The circuit of claim 1, wherein the pair of current injection inputs of each of the second plurality of differential unit delay cells is coupled to the pair of output of a corresponding one of the first plurality of differential unit delay cells by a buffer.
4. The circuit of claim 1, wherein each unit cell in the first plurality of differential unit delay cells comprises: a first inverter having an input connected to a first of the pair of clock inputs of the unit cell and having an output connected to a first of the pair of clock outputs of the unit cell; a second inverter having an input connected to a second of the pair of clock inputs of the unit cell and having an output connected to a second of the pair of clock outputs of the unit cell; a third inverter having an input connected to the output of the first inverter and having an output connected to the output of the second inverter; and a fourth inverter having an input connected to the output of the second inverter and having an output connected to the output of the first inverter.
5. The circuit of claim 1, wherein each of the first plurality of differential unit cells also has a pair of current injection inputs.
6. The circuit of claim 2, wherein the pair of current injection inputs for each of the first plurality of differential unit delay cells is connected to ground.
7. The circuit of claim 4, wherein each unit cell in the first plurality of differential unit delay cells comprises: a first inverter having an input connected to a first of the pair of clock inputs of the unit cell and having an output connected to a first of the pair of clock outputs of the unit cell; a second inverter having an input connected to a second of the pair of clock inputs of the unit cell and having an output connected to a second of the pair of clock outputs of the unit cell; a third inverter having an input connected to the output of the first inverter and having an output connected to the output of the second inverter; a fourth inverter having an input connected to the output of the second inverter and having an output connected to the output of the first inverter; and a first buffer having an input connected to a first of the pair of current injection inputs of the unit cell and having an output connected to the first of the pair of clock outputs of the unit cell; and a second buffer having an input connected to a second of the pair of current injection inputs of the unit cell and having an output connected to the second of the pair of clock outputs of the unit cell.
8. The circuit of claim 7, wherein the first buffer is formed from a plurality of selectable, parallel transistors.
7. The circuit of claim 1, wherein each unit cell in the second plurality of differential unit delay cells comprises: a first inverter having an input connected to a first of the pair of clock inputs of the unit cell and having an output connected to a first of the pair of clock outputs of the unit cell; a second inverter having an input connected to a second of the pair of clock inputs of the unit cell and having an output connected to a second of the pair of clock outputs of the unit cell; a third inverter having an input connected to the output of the first inverter and having an output connected to the output of the second inverter; a fourth inverter having an input connected to the output of the second inverter and having an output connected to the output of the first inverter; and a first buffer having an input connected to a first of the pair of current injection inputs of the unit cell and having an output connected to the first of the pair of clock outputs of the unit cell; and a second buffer having an input connected to a second of the pair of current injection inputs of the unit cell and having an output connected to the second of the pair of clock outputs of the unit cell.
8. The circuit of claim 1, wherein each unit cell of the first plurality of differential unit delay cells has includes an identical set of components interconnected in an identical manner.
9. The circuit of claim 1, wherein each unit cell of the second plurality of differential unit delay cells includes an identical set of components interconnected in an identical manner.
10. The circuit of claim 1, wherein at least one unit cell of the first plurality of differential unit delay cells and at least one unit cell of the second plurality of differential unit delay cells include an identical set of components interconnected in an identical manner.
11. The circuit of claim 1, further comprising a first mixer that mixes the pairs of outputs of two of the second plurality of differential unit delay cells to produce a first mixer output signal and a second mixer that mixes the pairs of outputs of another two of the second plurality of differential unit delay cells to produce a second mixer output signal.
12. The circuit of claim 11, further comprising an operation transconductance amplifier that receives the first mixer output signal and the second mixer output signal and produce tuning feedback signal that is provided to the delay line and to the ring oscillator.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
DETAILED DESCRIPTION
[0033] In accordance with some embodiments, new circuits and methods multi-phase clock generators and phase interpolators are provided.
[0034] Turning to
[0035] MPCG 102 can be any suitable multi-phase clock generator in some embodiments. For example, as shown in
[0036] Any suitable QDLL can be used as QDLL 106 in some embodiments. For example, as shown in
[0037] PI 104 can be any suitable phase interpolator in some embodiments. For example, as shown in
[0038] In some embodiments, by tuning the delay of the stages in delay line 108, the f.sub.0 of MPILOSC 116 can be tuned. A control voltage V.sub.tune of QDLL 106 biases both the stages in delay line 108 to have a unit delay of 1/(2N f.sub.inj) and the stages in the MPILOSC to have f.sub.0=f.sub.inj, in some embodiments. In some embodiments, the delay line outputs have a high spectral purity and suppress the MPILOSC phase noise over a wide injection bandwidth. Moreover, the phase errors due to the finite QDLL loop gain, finite matching of the loading within the delay line, and layout asymmetry are corrected by the MPILOSC, in some embodiments. The two-step multi-phase clock generation scheme thus combines the advantages of low-noise delay lines and symmetric ROSCs and breaks the tradeoff between jitter and phase accuracy in two-phase IL-ROSCs in some embodiments.
[0039] Turning to
[0040] As illustrated, QDLL 206 includes a delay line 208 that receives a differential reference clock (Ref_P and Ref_N) and a Vdd_DL_tune signal, and that outputs eight clock signals (CKDL_0, CKDL_180, CKDL_45, CKDL_225, CKDL_90, CKDL_270, CKDL_135, CKDL_315). Although delay line 208 outputs eight clock signals, delay line 208 can output any suitable number (such as four, for example) of clock signals in some embodiments.
[0041] As also illustrated in
[0042] Dummy stage 234 provides loading to cell 232 that is uniform with the loading provided to cells 226, 228, and 230 by cells 228, 230, and 232, respectively.
[0043] Unit delay cells 226, 228, 230, 232, and 234 can be any suitable unit delay cells in some embodiments. For example, in some embodiments, unit delay cells 226, 228, 230, 232, and 234 can be implemented as described below in connection with
[0044]
[0045]
[0046] The per-stage gain K.sub.DL of each unit delay cell can be linearized and expressed in terms of the N-stage MPILOSC's gain K.sub.VCO as follows:
[0047] In some embodiments, the K.sub.VCO can vary from 2 to 6 GHz/V across the tuning range. In some embodiments, a waveform-shaping buffer at the delay-line input reduces the amplitude of the rail-to-rail input clock to be close to the delay-line internal voltage amplitude.
[0048] As further illustrated in
[0049] Passive mixers 110 in
[0050] OTA 112 in
[0051] The control voltage Vtune of the QDLL controls the delay of the delay stages in delay line 208.
[0052] As shown in
[0053] Turning to
[0054] As illustrated, MPILOSC 216 receives eight clock signals (CKDL_0, CKDL_180, CKDL_45, CKDL_225, CKDL_90, CKDL_270, CKDL_135, CKDL_315) and a Vtune signal from QDLL 206, and outputs eight clock signals (CKRO_0, CKRO_180, CKRO_45, CKRO_225, CKRO_90, CKRO_270, CKRO_135, CKRO_315). Although MPILOSC 216 receives and outputs eight clock signals, MPILOSC 216 can receive and output any suitable number (such as four, for example) of clock signals in some embodiments.
[0055] As also illustrated in
[0056] Unit delay cells 242, 244, 246, and 248 can be any suitable unit delay cells in some embodiments. For example, unit delay cells 242, 244, 246, and 248 can be the same as unit delay cells 226, 228, 230, 232, and 234, in some embodiments. In some embodiments, unit delay cells 226, 228, 230, 232, and 234 can be implemented as described above in connection with
[0057] As further illustrated in
[0058] The control voltage Vtune of the QDLL controls the delay of the delay stages in the MPILOSC 216.
[0059] As shown in
[0060] In some embodiments, when implemented on-chip, MPILOSC 216 can be laid out in a bowtie pattern, as illustrated for example in
[0061] Referring back to
[0062] Turning to
[0063] As shown in
[0064] In some embodiments, PI core 801 receives eight (or any other suitable number) quadrature clock signals (CKRO_0, CKRO_180, CKRO_45, CKRO_225, CKRO_90, CKRO_270, CKRO_135, CKRO_315) from MPILOSC 116 (which can be implemented using MPILOSC 216 in some embodiments), receives eight 16-bit (or any other suitable number of bits) thermometer encoded digital control words (one word for each set of layers for each differential pair 826/828 in each slice (i.e.: [0065] 1. one 16-bit word for differential pair 826 in slice 1 802 wherein each bit corresponds to one of the layers for that differential pair; [0066] 2. one 16-bit word for differential pair 828 in slice 1 802 wherein each bit corresponds to one of the layers for that differential pair; [0067] 3. one 16-bit word for differential pair 826 in slice 2 804 wherein each bit corresponds to one of the layers for that differential pair; [0068] 4. one 16-bit word for differential pair 828 in slice 2 804 wherein each bit corresponds to one of the layers for that differential pair, [0069] 5. one 16-bit word for differential pair 826 in slice 3 806 wherein each bit corresponds to one of the layers for that differential pair; [0070] 6. one 16-bit word for differential pair 828 in slice 3 806 wherein each bit corresponds to one of the layers for that differential pair; [0071] 7. one 16-bit word for differential pair 826 in slice 4 808 wherein each bit corresponds to one of the layers for that differential pair; and [0072] 8. one 16-bit word for differential pair 828 in slice 4 808 wherein each bit corresponds to one of the layers for that differential pair),
receives eight flip control signals (one for each pair clock signals received at each of the slices) from PI controller 803, and outputs differential phase interpolated clock phase interpolated clock signal.
[0073] More particularly, for example, slice 1 802 can receive CKR_0, CKR_180, CKR_45, and CKR_225, slice 2 804 can receive CKR_45, CKR_225, CKR_90, and CKR_270, slice 3 806 can receive CKR_90, CKR_270, CKR_135, and CKR_315, and slice 4 808 can receive CKR_135, CKR_315, CKR_0, and CKR_180.
[0074] While PI 800 is implemented with four slices, the PI can be implemented with any suitable arrangement of components (whether with slices or not) to achieve the same functionality as what is provided by the arrangement of
[0075] In some embodiments, the seven bits of the phase interpolator are one bit for the clock polarity (e.g., the most significant bit), two bits for the selection of one slice from four slices (e.g., the next two most significant bits), and four bits used to encode the thermometer encoded digital control words SelX and SelBX (e.g., the remaining four least significant bits).
[0076] During operation, in some embodiments, the eight-phase clock signals from the QDLL are received at the flip buffers of each slice. Depending on the flip control signal, the eight-phase clock signals will pass straight through the flip buffers (as represented by the straight lines in the flip buffer schematic symbols) or be swapped by them (as represented by the X lines in the flip buffer schematic symbols).
[0077] The CMOS-to-CML buffers (which can be implemented using 2-bit programmable inverters, in some embodiments) will then shape the clipped eight-phase clock signals to sinusoidal 250-mVpp clocks. The shaped CML signals are then received at the gates of the corresponding transistors in the differential pairs. The switches at the sources of the transistors in differential pairs 826 and 828 are controlled by the thermometer encoded digital control words received on busses SelX and SelBX, where X corresponds to the slice number for the differential pairs. These switches make active or inactive the corresponding differential pairs across the 16 layers. For example, Sel1<0> makes active differential pair 826 of layer 1 of slice 1; Sel2<1> makes active differential pair 826 of layer 2 of slice 2; SelB1<0> makes active differential pair 828 of layer 1 of slice 1; Sel2B<1> makes active differential pair 828 of layer 2 of slice 2. The currents of each active differential pair across all layers and slices sum as controlled by the shaped CIVIL signals at their gates to provide interpolated CIVIL clock signals CKPI_p and CKPI_n. In some embodiments, differential pairs will only be active in one slice at a time. The interpolated CML clock signals are then converted to CMOS signals by CML-to-CMOS buffers to provide interpolated CMOS clock signals.
[0078] Unfortunately, gate-to-drain parasitic capacitance coupling at the transistors of the differential pairs of each layer of each slice introduce non-linearities into the interpolated CML clock signals (and thus the interpolated CMOS clock signals). To counter these non-linearities, the quadrature clock signals at one or more of the flip buffers corresponding to inactive differential pairs can be swapped.
[0079] In some embodiments, the PI can use an octagonal constellation (for example, as illustrated in
[0080] In some embodiments, clock-flipping buffers 810 and 812 can flip the received clocks so that the same slice can cover the lower half-plane of the constellation diagram in
In some embodiments, this flipping arrangement can enable the PI core to interpolate over 360 degrees with only four slices.
[0089] In some embodiments, the clock-flipping scheme in the table of
wherein differential pairs being active means that the switches of the sources of the transistors of the differential pairs are not completely off.
[0098] Turning to
[0099] Turning to
[0100] In some embodiments, the flipping buffers can be omitted and double the number of slices provided such that each slice corresponds to one row in the table of
[0101] In some embodiments, the circuits described herein can be implemented in any suitable process. For example, in some embodiments, the circuits described herein can be implemented in a 65-nm bulk CMOS process.
[0102] Although the invention has been described and illustrated in the foregoing illustrative embodiments, it is understood that the present disclosure has been made only by way of example, and that numerous changes in the details of implementation of the invention can be made without departing from the spirit and scope of the invention, which is limited only by the claims that follow. Features of the disclosed embodiments can be combined and rearranged in various ways.