SPECTRAL CONTENT DETECTION FOR EQUALIZING INTERLEAVED DATA PATHS
20230035036 · 2023-02-02
Inventors
- Ray Luan Nguyen (Fountain Valley, CA, US)
- Dawood Alam (Lake Forest, CA, US)
- Nong Fan (Irvine, CA, US)
- Geoffrey Hatcher (Lake Forest, CA, US)
- Morteza Azarmnia (Irvine, CA, US)
Cpc classification
G06F17/142
PHYSICS
H04L25/14
ELECTRICITY
H03M1/1028
ELECTRICITY
International classification
Abstract
A high-speed data receiver includes interleaver circuitry configured to divide a received data stream into a plurality of interleaved paths for processing, spectral content detection circuitry configured to derive spectral content information from data on each of the plurality of interleaved paths, sorting circuitry configured to bin the derived spectral content information according to energy levels, stream attribute determination circuitry configured to determine, based on sorted spectral content, one or more of path offsets of the interleaved paths, gain mismatch among interleaved paths, signal bandwidth mismatch and pulse width mismatch, and equalization circuitry configured to correct the one or more of the determined offsets, the determined gain mismatch and the determined signal width mismatch. Equalization circuitry may be configured to equalize a gain-normalized signal by separately adjusting respective bandwidth actuators of each respective interleaved path and respective pulse width actuators of each respective interleaved path.
Claims
1. A high-speed data receiver comprising: interleaver circuitry configured to divide a received data stream into a plurality of interleaved paths for processing; spectral content detection circuitry configured to derive spectral content information from data on each of the plurality of interleaved paths; sorting circuitry configured to bin the derived spectral content information according to energy levels; stream attribute determination circuitry configured to determine, based on sorted spectral content, one or more of path offsets of the interleaved paths, gain mismatch among interleaved paths, signal bandwidth mismatch and pulse width mismatch; and equalization circuitry configured to correct the one or more of the determined offsets, the determined gain mismatch and the determined signal width mismatch.
2. The high-speed data receiver of claim 1, wherein the stream attribute determination circuitry comprises offset determination circuitry configured to determine offsets between the interleaved paths from mismatches at a lowest energy level.
3. The high-speed data receiver of claim 1 wherein the stream attribute determination circuitry comprises signal width mismatch detection circuitry configured to determine mismatch between at least one of bandwidth and pulse width using a spectral shape function.
4. The high-speed data receiver of claim 3, wherein the signal width mismatch determination circuitry comprises circuitry configured to independently sweep respective signal width actuators of each respective interleaved path.
5. The high-speed data receiver of claim 1, wherein the stream attribute determination circuitry comprises gain mismatch determination circuitry configured to determine gain mismatches between the interleaved paths from mismatches at energy levels above the lowest energy level.
6. The high-speed data receiver of claim 5 wherein the gain mismatch determination circuitry comprises circuitry configured to independently sweep respective gain actuators of each respective interleaved path.
7. The high-speed data receiver of claim 5, wherein the equalization circuitry is configured to remove gain mismatch by normalizing a predetermined number of lowest-energy bins.
8. The high-speed data receiver of claim 7, wherein the equalization circuitry is configured to equalize the gain-normalized signal by adjusting respective signal width actuators of each respective interleaved path.
9. The high-speed data receiver of claim 8, wherein the equalization circuitry is configured to equalize the gain-normalized signal by separately adjusting respective bandwidth actuators of each respective interleaved path and respective pulse width actuators of each respective interleaved path.
10. The high-speed data receiver of claim 8, wherein the equalization circuitry is configured to equalize the gain-normalized signal by adjusting sampling locations on each respective interleaved path.
11. The high-speed data receiver of claim 10, wherein the equalization circuitry is configured to equalize the gain-normalized signal by adjusting sampling locations on each respective interleaved path to set equal pulse widths on all interleaved paths.
12. The high-speed data receiver of claim 7, wherein the equalization circuitry is configured to equalize the gain-normalized signal by adjusting coefficients of a fitting polynomial.
13. The high-speed data receiver of claim 1 wherein the sorting circuitry comprises Fast Fourier Transform circuitry configured to derive the spectral content information from the data on each of the plurality of interleaved paths.
14. A method of equalizing parallel interleaved data paths in a high-speed data receiver, the method comprising: deriving spectral content information from data on each of the plurality of interleaved paths; binning the derived spectral content information according to energy levels; determining, based on sorted spectral content, one or more of path offsets of the interleaved paths, gain mismatch among interleaved paths, signal bandwidth mismatch and pulse width mismatch; and correcting the one or more of the determined offsets, the determined gain mismatch and the determined signal width mismatch.
15. The method of equalizing parallel interleaved data paths according to claim 14, wherein determining, based on sorted spectral content, one or more of path offsets of the interleaved paths, gain mismatch among interleaved paths, signal bandwidth mismatch and pulse width mismatch comprises determining offsets among the interleaved paths from mismatches at a lowest energy level.
16. The method of equalizing parallel interleaved data paths according to claim 14, wherein determining, based on sorted spectral content, one or more of path offsets of the interleaved paths, gain mismatch among interleaved paths, signal bandwidth mismatch and pulse width mismatch comprises determining mismatch in at least one of bandwidth and pulse width among the interleaved paths using a spectral shape function.
17. The method of equalizing parallel interleaved data paths according to claim 14, wherein determining mismatch in at least one of bandwidth and pulse width comprises independently sweeping respective signal width actuators of each respective interleaved path.
18. The method of equalizing parallel interleaved data paths according to claim 14, wherein determining, based on sorted spectral content, one or more of path offsets of the interleaved paths, gain mismatch among interleaved paths, signal bandwidth mismatch and pulse width mismatch comprises determining gain mismatches among the interleaved paths from mismatches at energy levels above the lowest energy level.
19. The method of equalizing parallel interleaved data paths according to claim 18, wherein determining gain mismatches comprises independently sweeping respective gain actuators of each respective interleaved path.
20. The method of equalizing parallel interleaved data paths according to claim 18, wherein correcting the determined gain mismatch comprises removing gain mismatch by normalizing a predetermined number of lowest-energy bins.
21. The method of equalizing parallel interleaved data paths according to claim 20, wherein correcting the determined signal width mismatch comprises adjusting respective signal width actuators of each respective interleaved path.
22. The method of equalizing parallel interleaved data paths according to claim 21, wherein correcting the determined signal width mismatch comprises separately adjusting respective bandwidth actuators of each respective interleaved path and respective pulse width actuators of each respective interleaved path.
23. The method of equalizing parallel interleaved data paths according to claim 21, wherein correcting the determined signal width mismatch comprises adjusting sampling locations on each respective interleaved path.
24. The method of equalizing parallel interleaved data paths according to claim 23, wherein correcting the determined signal width mismatch comprises adjusting sampling locations on each respective interleaved path to set equal pulse widths on all interleaved paths.
25. The method of equalizing parallel interleaved data paths according to claim 20, wherein correcting the determined signal width mismatch comprises adjusting coefficients of a fitting polynomial.
26. The method of equalizing parallel interleaved data paths according to claim 14 binning the derived spectral content information according to energy levels comprises applying a Fast Fourier Transform.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0032] Further features of the disclosure, its nature and various advantages, will be apparent upon consideration of the following detailed description, taken in conjunction with the accompanying drawings, in which like reference characters refer to like parts throughout, and in which:
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
[0039]
[0040]
[0041]
[0042]
[0043]
[0044]
[0045]
[0046]
[0047]
[0048]
DETAILED DESCRIPTION
[0049] In accordance with implementations of the subject matter of this disclosure, spectral techniques can be used to detect and correct various forms of mismatch, such as those of the types described above (e.g., linear offset and gain mismatch errors, bandwidth and settling mismatches, sampling pulse-width mismatches, and ISI), in time-interleaved (TI) or spatially-interleaved (SI) systems. In some implementations, a Fast Fourier Transform may be used for spectral detection and estimation.
[0050]
[0051] However, in accordance with implementations of the subject matter of this disclosure, broadband spectrum analysis over multiple ones of the sample values can be used to distinguish associated TI or SI artifacts based on frequency content. For ease of illustration, the discussion that follows will focus on spectral techniques applicable to TI systems, but the same principles apply as well to SI systems without loss of generality.
[0052] Samples of different TI elements have different spectral energy content that can be detected using, e.g., Fast Fourier Transform (FFT) circuitry. The different spectral energy components can be binned based on the FFT output. If different TI elements have different offsets, they will have different FFT Bin 0 energy content, and therefore the offsets can be discerned from gain errors. If there are interleaved gain errors aside from offsets, those interleaved gain errors can be discerned after the offsets are removed from the data samples. Offsets can be removed either by subtracting each data samples from an average value, or calibrating Bin 0 to a minimum level. Subsequently, the sum of energy in bins other than Bin 0—i.e., from Bin 1 to the Nyquist limit—can be used to calibrate for gain mismatches.
[0053] Mathematically, this can be expressed as:
|B.sub.0.sup.9|= . . . =|B.sub.0.sup.K−1| (1)
∥(B.sub.1.sup.0, . . . , B.sub.N−1.sup.9)∥.sup.m= . . . =∥(B.sub.1.sup.K−1, . . . , B.sub.N−1.sup.K−1)∥.sup.m (2)
where m=1 or m=2 to denotes either the first or second order norm of bins 1, . . . , N−1.
[0054]
[0055] After spectral analysis of the various interleaved samples, in various implementations the different attributes of each sample can be determined as follows:
[0056] First, offset mismatches between samples on different interleaver paths 113 are determined from Eq. 1 based on the content of Bin 0 of the sampled energy distribution.
[0057] Second, gain mismatches between samples on different interleaver paths 113 are determined from Eq. 2 based on the content of all bins, other than Bin 0, of the sampled energy distribution.
[0058] Third, signal width mismatches (i.e., bandwidth mismatches or pulse width mismatches) between samples on different interleaver paths 113 are determined from Eq. 3 based on spectral shape functions:
f(B.sub.1.sup.0, . . . , B.sub.N−1.sup.0)= . . . =f(B.sub.1.sup.K−1, . . . , B.sub.N−1.sup.K−1) (3)
where f( . . . ) denotes a specialized spectral shape function that depends on the nature of signal and characteristics of the system.
[0059] In one implementation, a TI system may have K=16 interleaver paths 113 sampling at 100 GHz. Each interleaver path 113 thus runs at 6.25 GHz and is assumed to have a bandwidth of 40 GHz. Each interleaver 113 (
[0060] Eq. 3 captures spectral shape as a specialized function f( . . . ) with the objective of equalizing the spectral shapes of
[0061] A further illustration is shown in
[0062] The discussion up to this point has treated pulse width variations and bandwidth variations as being the same. While this produces acceptable results in most cases because, to the first order, giving a system more time to settle is almost equivalent to increasing its bandwidth. In principle, however, mismatches resulting from different pulse width and bandwidth can produce different sensitivity functions with respect to spectral response changes. Thus, in another implementation, pulse width and bandwidth can be adjusted independently until respective sensitivity functions are equalized. This may be expressed mathematically as follows:
[0063] As an example, sensitivity functions can be obtained by dithering the actuation code (of either bandwidth or pulse width) using minimal step sizes around the current code values and measuring the corresponding changes in the spectral shape function f( . . . ). After obtaining all the sensitivity functions, equal pulse widths and bandwidths are considered equalized when all respective sensitivity functions are equalized (Eq. 4) and Eqs. 1-3 also are satisfied.
[0064] In some implementations in which pulse widths and sampling locations form a cyclic arrangement (as illustrated in
[0065] In some implementations, overall broadband ripples of the aggregated spectrum matter more than the individual interleaving elements. Alternatively, if all samplers do not see the same incoming data statistics or if the sampling errors cause ISI, the individual bandwidths or pulse widths can be adjusted until ripple energy in the aggregated spectral shape is minimized.
[0066] A spectral estimator to determine the energy distribution can be implemented based on any suitable mathematical transformation capable of estimating the time or frequency dependent nature of a system, while highlighting interleaving artifacts. Some examples include Laplace transforms, Mellin transforms, etc.
[0067] In a particular implementation, the spectral estimator may be implemented using a Fast Fourier Transform (FFT). Several different Radix configurations can be selected with different practical considerations. Table 1 below lists the number of complex multiplications as a function of different FFT lengths based on specific Radix choices.
TABLE-US-00001 TABLE 1 Number of Real Multiplications Using a 4-Mult-2-Add Scheme FFT Length Radix - 2/4 Radix - 2/8 Radix - 2/16 Radix - 4/16 16 24 24 24 24 32 84 84 84 64 248 240 248 248 128 660 636 652 256 1656 1592 1632 1624 512 3988 3812 3924 1024 9336 8896 9192 9144 2048 21396 20364 21020 4096 48248 45832 47344 47000
[0068] The complex multiplications in this implementation are based on four real multiplications and two additions. One illustrative choice can be a Radix-2/8 Decimation-In-Frequency (DIF) implementation, with 636 multiplications. If the signal input is real, the number of multiplications can further be simplified. A DIF architecture may be more desirable than a Decimation-In-Time (DIT) implementation for several reasons. For example, data may be presented in a time-ordered sequence, whereas in a DIT architecture, the samples would need to be bit-reversed, adding to routing complexity. In addition, even though the DIF architecture presents the bins in bit-reversed order, the register interface can read out the bins in any order required.
[0069]
[0070] Six pipeline stages are present—five in the core of the logic and a final pipeline stage on the output—and are illustrated by vertical black lines. As many as 100,000+ FFT bins may be accumulated outside of the core for noise suppression. Further, to minimize the growth in quantization error, true symmetric rounding may be implemented throughout, adding minimal complexity to the design. Scaling through the
[0071] FFT stages is a function of application scenarios and/or detection requirements, where dynamic-range requirements scale with accuracy. As an example, a simple symmetric rounding can take place at the output of the twiddle-factor multiplications, leading to an overall achievable dynamic range of greater than the intrinsic resolution of the transceiver.
[0072] Depending on applications, FFT results are further processed in signed or unsigned-square mode (in either interleaved or aggregated mode), depending on the design objective.
[0073] In signed mode, the real and imaginary components are taken as snapshot or separately summed while preserving their sign. For example, in a 48-bit accumulator with 2.sup.32 FFT averages each bin has a maximum range of 2.sup.16. Both magnitude and phase information are available, for applications where deterministic phase pattern in the incoming data needs to be tracked, aside from spectral shapes.
[0074] Unsigned square mode is similar to signed mode, except that the sum of the squares or absolutes of real and imaginary components are separately computed and accumulated. In a squared magnitude case, a higher theoretical maximum of accumulator bits with appropriate truncation may be required to achieve a similar accuracy. This mode may be used for high precision detection examples, where a high level of noise suppression is needed to discern different interleaving errors.
[0075] Two types of overflow detectors can be further added to detect overflow in any FFT-bin accumulator. The first type of overflow detector detects overflow in the accumulators of any DC bins, and the second type of overflow detector detects overflow in the remaining 63 bins. For each overflow detection method, one extra overflow bit is used in each accumulator. The MSB+1 of the accumulator may be monitored and any overflow events may be latched over a complete FFT processing cycle, which may include one or more successive FFTs, depending on the amount of required noise suppression.
[0076] In order to concurrently capture and process all interleavers, memory storage is also required, which may increase area footprint and routing complexity. As an example, a 128-point FFT core logic can be implemented with approximately 140,000 cells with additional storage up to 500,000 cells to accommodate interleaving parallelism and noise suppression requirements.
[0077] An example of memory requirements can be estimated as follows:
TABLE-US-00002 Total storage = Input-buffer-memory + FFT-core-storage + Bin- accumulator-memory Input-buffer-memory = Interleaving-parallelism × number-of- bits-per-sample × number of interleavers = 128 × 8 × 16 ≈ 16K bits FFT-core-storage = (stage1-bits+stage2-bits+stage3- bits+stage4-bits+stage5-bits+stage6-bits) × FFT-size = (9 + 11 + 14 + 15 + 16 + 16)×128 ≈ 10K bits Bin-accumulator-memory = 2 * (FFT-size/2) × accumulator-bit- width × number of interleavers = 2 × (128/2) × 49 × 16 ≈ 100K bits
[0078] With the design being memory-dominated, other choices such as folded FFT architectures provide minimum hardware savings over a full parallel implementation. Furthermore, a folded implementation would require programmable multipliers for the twiddle factors, thus adding complexity.
[0079] For timing purposes, an FFT cycle-time can be defined as the time to capture and perform K FFTs, which is the sum of LOAD+FFT+FLUSH time and is dependent on sampling rates. For example, the time for one full FFT computation at a sampling rate of 100 GS/s is −164 ns. The latency can be calculated as follows:
LOAD+FFT+FLUSH time=(Num_THA×ADC_parallelism)/f.sub.sample+(28×clk_div_factor)×(ADC_parallelism/f.sub.sample) =ADC_parallelism/f.sub.sample×(Number of THAs+28×clk_div_factor)
where the clk_div_factor denotes subsampling ratio (e.g. 4, 8, etc.). For example, at 100 GS/s, 100,000 FFT cycles can be performed in approximately 1.5 ms using a clock-divide ratio of 4. Additional logic may also be needed to ensure that the data-path pipeline operates in sync. An example finite-state-machine (FSM) 1400 that can be used to illustrate various control signals is shown in
[0084] As seen in
[0085] A method 1600 according to implementations of the subject matter of this disclosure, for equalizing parallel interleaved data paths in a high-speed data receiver is diagrammed in
[0086] Thus it is seen that detection of mismatches between interleaved data paths based on sorted spectral data, and correction of the detected mismatches, have been provided.
[0087] As used herein and in the claims which follow, the construction “one of A and B” shall mean “A or B.”
[0088] It is noted that the foregoing is only illustrative of the principles of the invention, and that the invention can be practiced by other than the described embodiments, which are presented for purposes of illustration and not of limitation, and the present invention is limited only by the claims which follow.