Data stream processing device with reconfigurable data stream processing resources and data stream processing method
10972318 · 2021-04-06
Assignee
Inventors
Cpc classification
G06F11/0709
PHYSICS
International classification
H04L25/03
ELECTRICITY
Abstract
A data stream processing device includes a plurality of data providing units, a plurality of processing units, and control circuitry. The data providing units are configured to output data values received via a plurality of data inputs, respectively. The processing units are configured to generate data outputs based on the data values, respectively. The control circuitry includes a mode selection input and is configured to simultaneously provide data values of different data streams to the data inputs of the data providing units, respectively, in response to the mode selection input receiving a signal indicating a first mode, and simultaneously provide a plurality of successive groups of data values of one of the data streams to the data inputs of the data providing units, respectively, in response to the mode selection input not receiving the signal indicating the first mode.
Claims
1. A data stream processing device comprising: a plurality of data providing units configured to output data values received via a plurality of data inputs, respectively; a plurality of processing units configured to generate output data based on the data values, respectively; and control circuitry including a mode selection input and configured to simultaneously provide data values of different data streams to the data inputs of the data providing units, respectively, in response to the mode selection input receiving a signal indicating a first mode, and simultaneously provide a plurality of successive groups of data values of one of the data streams to the data inputs of the data providing units, respectively, in response to the mode selection input not receiving the signal indicating the first mode.
2. The data stream processing device according to claim 1, further comprising a summation unit configured to calculate a sum of the output data generated by the processing units.
3. The data stream processing device according to claim 1, wherein corresponding pairs of the data providing units and the processing units form a plurality of FIR filters for the data streams, respectively, in response to the mode selection input receiving the signal indicating the first mode, and the data providing units and the processing units form a single FIR filter for the one of the data streams in response to the mode selection input not receiving the signal indicating the first mode.
4. The data stream processing device according to claim 1, wherein the data providing units each include a tapped delay line with one or more delayed data outputs, the one or more delayed data outputs being configured to output one or more data values received via a corresponding one of the data inputs, respectively.
5. The data stream processing device according to claim 4, wherein the processing units each include one or more delayed data inputs, the one or more delayed data inputs of each of the processing units being connected to the one or more delayed data outputs of a corresponding one of the data providing units, respectively.
6. The data stream processing device according to claim 5, further comprising one or more registers configured to store one or more coefficient values applied to the one or more data values outputted by the one or more delayed data outputs of the tapped delay lines, the processing units being configured to generate the output data as sums of products between the one or more data values outputted by the one or more delayed data outputs of the tapped delay lines and the one or more coefficient values, respectively.
7. The data stream processing device according to claim 4, wherein the control circuitry is configured to serially couple the tapped delay lines of the data providing units with respect to each other in response to the mode selection input not receiving the signal indicating the first mode.
8. The data stream processing device according to claim 4, further comprising one or more registers configured to store one or more coefficient values applied to the one or more data values outputted by the one or more delayed data outputs of the tapped delay lines, the control circuitry being configured to symmetrically apply the one or more coefficient values to the one or more data values outputted by the one or more delayed data outputs of the tapped delay lines relative to centers of the tapped delay lines, respectively, in response to the mode selection input receiving the signal indicating the first mode, and symmetrically apply the one or more coefficient values to the one or more data values outputted by the one or more delayed data outputs of the tapped delay lines relative to a center of a series of the tapped delay lines in response to the mode selection input receiving the signal indicating the first mode.
9. The data stream processing device according to claim 1, further comprising a plurality of data storages configured to store data values of the data streams, respectively, in response to the mode selection input receiving the signal indicating the first mode, and store successive groups of data values of the one of the data streams, respectively, in response to the mode selection input not receiving the signal indicating the first mode.
10. The data stream processing device according to claim 1, wherein corresponding pairs of the data providing units and the processing units form a plurality of equalizers for the data streams, respectively, in response to the mode selection input receiving the signal indicating the first mode, and the data providing units and the processing units form a single equalizer for the one of the data streams in response to the mode selection input not receiving the signal indicating the first mode.
11. The data stream processing device according to claim 1, wherein the processing units are configured to update coefficient values for generating the output data based on error values received via error inputs, respectively, the control circuitry is configured to provide error values between the output data of the processing units and reference signals to the error inputs of the processing units, respectively, in response to the mode selection input receiving the signal indicating the first mode, and provide an error value between a sum of the output data of the processing units and a reference signal to the error inputs of the processing units, respectively, in response to the mode selection input not receiving the signal indicating the first mode.
12. The data stream processing device according to claim 11, wherein the processing units are configured to update the coefficient values using a least mean squares algorithm.
13. The data stream processing device according to claim 11, wherein the processing units are configured to generate the output data by applying the coefficient values to the data values of the data streams, respectively, in response to the mode selection input receiving the signal indicating the first mode, and generate the output data by applying the coefficient values to the successive groups of the data values of the one of the data streams, respectively, in response to the mode selection input not receiving the signal indicating the first mode.
14. A data stream processing method comprising: outputting, by a plurality of data providing units, data values received via a plurality of data inputs, respectively; generating, by a plurality of processing units, output data based on the data values, respectively; simultaneously providing data values of different data streams to the data inputs of the data providing units, respectively, in response to a mode selection signal indicating a first mode; and simultaneously providing a plurality of successive groups of data values of one of the data streams to the data inputs of the data providing units, respectively, in response to the mode selection signal not indicating the first mode.
15. The data stream processing method according to claim 14, wherein corresponding pairs of the data providing units and the processing units form a plurality of FIR filters for the data streams, respectively, in response to the mode selection signal indicating the first mode, and the data providing units and the processing units form a single FIR filter for the one of the data streams in response to the mode selection signal not indicating the first mode.
16. The data stream processing method according to claim 14, further comprising symmetrically applying one or more coefficient values to one or more data values outputted by one or more delayed data outputs of tapped delay lines of the data providing units relative to centers of the tapped delay lines, respectively, in response to the mode selection signal indicating the first mode, and symmetrically applying the one or more coefficient values to the one or more data values outputted by the one or more delayed data outputs of the tapped delay lines of the data providing units relative to a center of a series of the tapped delay lines in response to the mode selection signal indicating the first mode.
17. The data stream processing method according to claim 14, wherein corresponding pairs of the data providing units and the processing units form a plurality of equalizers for the data streams, respectively, in response to the mode selection signal indicating the first mode, and the data providing units and the processing units form a single equalizer for the one of the data streams in response to the mode selection signal not indicating the first mode.
18. The data stream processing device according to claim 1, wherein the different data streams are parallel data streams.
19. The data stream processing method according to claim 14, wherein the different data streams are parallel data streams.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) Referring now to the attached drawings which form a part of this original disclosure:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
DETAILED DESCRIPTION OF EMBODIMENTS
(9) Selected embodiments will now be explained with reference to the drawings. It will be apparent to those skilled in the art from this disclosure that the following descriptions of the embodiments are provided for illustration only and not for the purpose of limiting the invention as defined by the appended claims and their equivalents.
(10) The present application is related to a device and a method to efficiently utilize and combine signal processing resources in reconfigurable, parallel arrays of FIR Filters and linear channel equalizers. Specifically, the present application illustrates a device and a method to flexibly reconfigure and combine the signal processing resources of an array of independent, parallel FIR filters into a smaller array of independent FIR filters with longer tap length. Also, the present application illustrates a device and a method to further optimize the design of the filters when the filter coefficients exhibit symmetries of an odd or even function. Here, the impulse response of an FIR filter is also its coefficient set. Furthermore, the present application illustrates that the ideas embodied in this application readily extends to the design of adaptive linear channel equalizers, and enables an array of independent equalizers to reconfigure and combine both their filtering components and their coefficient adaptation components to form a smaller array of independent equalizers with longer tap lengths and superior frequency-selective properties.
First Embodiment
(11)
(12) In the illustrated embodiment, by operation of the multiplexers Mux_A (Mux_A.sub.1, . . . , Mux_A.sub.M-1), the operation mode of the data stream processing device 10 is switched between a “first mode” in which the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) independently perform filtering operations on a plurality of (M) independent input data streams U (U.sub.0, U.sub.1, . . . , U.sub.M-1) and a “second mode” in which the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) perform a filtering operation on the input data stream U.sub.0 as a single filter.
(13) Furthermore, in the illustrated embodiment, the data stream processing device 10 also includes a plurality of (M) coefficient registers R (R.sub.0, R.sub.1, . . . , R.sub.M-1) for the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1), respectively, and a summation circuit (e.g., summation unit) P. In the illustrated embodiment, the coefficient registers R (R.sub.0, R.sub.1, . . . , R.sub.M-1) each store one or more (N) coefficient values C(0), C(1), . . . , C(N−1) (see
(14) In the illustrated embodiment, as illustrated in
(15) As illustrated in
(16) Furthermore, the digital delay line 12 also includes one or more (N) delayed data outputs T.sub.0, T.sub.1, . . . , T.sub.N-1 that output delayed output data (e.g., data values received via the data input 18) of the delays D.sub.0, D.sub.1, . . . , D.sub.N-1. Specifically, in the illustrated embodiment, the first delay D.sub.0 has an input that is coupled to the data input 18 and an output that is coupled to an input of the next delay D.sub.1 and also coupled to the delayed data output T.sub.0. Also, the last or N-th delay D.sub.N-1 has an input that is coupled to an output of the prior or (N−1)-th delay D.sub.N-2 and an output that is coupled to the delayed data output T.sub.N-1. Furthermore, except for the output of the last delay D.sub.N-1 of the last filter F.sub.M-1 (see
(17) The processing unit 14 perform a function responsive to the delayed output data received from the digital delay line 12 to generate the output data stream Y (Y.sub.0, Y.sub.1, . . . , Y.sub.M-1) from the data output 20. The function can also be referred to as a “process”, a “processing”, or a “procedure.” Performing the function can include performing a set of one or more operations, in sequence and/or combination, on the delayed output data received from the digital delay line 12. Such operations can include, but are not limited to, bitwise logical operations (such as, but not limited to, NOT, AND, OR, XOR, left shift, and right shift) and arithmetic operations (such as, but not limited to, addition, subtraction, multiplication, division, logarithmic, exponential, trigonometric, statistic, and comparison operations). In some implementations, the processing unit 14 can include a programmable processor that executes one or more program instructions to perform the function or a portion of the function. With this configuration, the processing unit 14 implements a finite impulse response (FIR) filter.
(18) More specifically, the processing unit 14 includes one or more (N) delayed data inputs S.sub.0, S.sub.1, . . . , S.sub.N-1 connected to the delayed data outputs T.sub.0, T.sub.1, . . . , T.sub.N-1, respectively. Furthermore, the processing unit 14 further includes one or more (N) multiplier circuits Q.sub.0, Q.sub.1, . . . , Q.sub.N-1 and a summation circuit V. The multiplier circuits Q.sub.0, Q.sub.1, . . . , Q.sub.N-1 calculate the products between the data values of the delayed output data received via the delayed data inputs S.sub.0, S.sub.1, . . . , S.sub.N-1 and the coefficient values C(0), C(1), . . . , C(N−1) stored in the corresponding one of the coefficient registers R (R.sub.0, R.sub.1, . . . , R.sub.M-1), respectively. Then, the summation circuit V calculates the sum (sum(N)) of the products to generate the output data stream Y (Y.sub.0, Y.sub.1, . . . , Y.sub.M-1) from the data output 20.
(19) The coefficient registers R (R.sub.0, R.sub.1, . . . , R.sub.M-1) are a register or memory element, and each store a set of the coefficient values C(0), C(1), . . . , C(N−1) for a respective one of the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1). The coefficient registers R (R.sub.0, R.sub.1, . . . , R.sub.M-1) can be formed as a single register or as separate registers. Also, the coefficient registers R (R.sub.0, R.sub.1, . . . , R.sub.M-1) can be formed as part of the processing units 14 of the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1), respectively. The coefficient values C(0), C(1), . . . , C(N−1) for the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) are preset to adjust the filter characteristics of the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1). Of course, the coefficient values C(0), C(1), . . . , C(N−1) for the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) can be adaptively changed to obtain the desired filter characteristics.
(20) As illustrated in
(21) In the illustrated embodiment, the multiplexers Mux_A (Mux_A.sub.1, . . . , Mux_A.sub.M-1) are formed as a “switch”, “multiple input, single output switch,” “switching element”, “mux”, “signal selector”, or “data selector.” In the illustrated embodiment, each of the multiplexers Mux_A (Mux_A.sub.1, . . . , Mux_A.sub.M-1) has first and second inputs. The first inputs of the multiplexers Mux_A.sub.1, . . . , Mux_A.sub.M-1 receive the input data streams U.sub.1, . . . , U.sub.M-1, respectively. The second inputs of the multiplexers Mux_A.sub.1, . . . , Mux_A.sub.M-1 are connected to the delayed data outputs T.sub.N-1 of the last delays D.sub.N-1 of the filters F.sub.0, F.sub.1, F.sub.M-2, respectively, to receive the delayed output data via the delayed data outputs T.sub.N-1, respectively.
(22) Furthermore, the multiplexers Mux_A (Mux_A.sub.1, . . . , Mux_A.sub.M-1) each have a mode selection input MS for receiving a mode selection signal having a value of either “0” or “1.” The multiplexers Mux_A (Mux_A.sub.1, . . . , Mux_A.sub.M-1) each selectively couple either the first input or the second input to the data input 18 of a respective one of the filters F.sub.1, . . . , F.sub.M-1 according to the value of the mode selection signal. In the illustrated embodiment, according to the value of the mode selection signal, the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) can function as M independent N-tap FIR filters or as a single MN-tap FIR filter. This mode selection signal can be inputted through an interface of the data stream processing device 10, for example, to change operation mode of the data stream processing device 10 between the first and second modes. Of course, this mode selection signal can be inputted according to an operation status of the data stream processing device 10.
(23)
(24) In particular, if the value of the mode selection signal is “0,” which indicates the first mode (Yes in step S12), then the multiplexers Mux_A.sub.1, . . . , Mux_A.sub.M-1 couple the first inputs to the data inputs 18 of the filters F1, . . . , F.sub.M-1, respectively, such that the input data streams U.sub.1, . . . , U.sub.M-1 are inputted to the filters F.sub.1, . . . , F.sub.M-1 via the data inputs 18 of the filters F.sub.1, . . . , F.sub.M-1, respectively. This allows the filters F.sub.0, F.sub.1, . . . , F.sub.M-1 to independently perform filtering operations on the independent input data streams U.sub.0, U.sub.1, . . . , U.sub.M-1. Thus, in this case, the control circuitry 11 (the multiplexers Mux_A (Mux_A.sub.1, . . . , Mux_A.sub.M-1)) simultaneously provides data values of the independent input data streams U.sub.0, U.sub.1, . . . , U.sub.M-1 to the data inputs 18 of the filters F.sub.0, F.sub.1, . . . , F.sub.M-1, respectively, such that the filters F.sub.0, F.sub.1, . . . , F.sub.M-1 independently performs filtering operations on the independent input data streams U.sub.0, U.sub.1, . . . , U.sub.M-1, respectively, and independently output the output data streams Y.sub.0, Y.sub.1, . . . , Y.sub.M-1 from the data outputs 20 of the filters F.sub.0, F.sub.1, . . . , F.sub.M-1, respectively. Therefore, in this case, the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) form an array of M FIR filters arranged in parallel to each other, as can be found in a device supporting parallel processing of M independent data streams. In other words, the corresponding pairs of the digital delay lines 12 and the processing units 14 form a plurality of FIR filters for the input data streams U.sub.0, U.sub.1, . . . , U.sub.M-1, respectively, in response to the mode selection signal indicating the first mode.
(25) On the other hand, if the value of the mode selection signal is “1,” which indicates the second mode (or does not indicate the first mode) (No in step S12), then the multiplexers Mux_A.sub.1, . . . , Mux_A.sub.M-1 couple the second inputs to the data inputs 18 of the filters F1, . . . , F.sub.M-1, respectively, such that the delayed output data from the last delays D.sub.N-1 of the filters F.sub.0, F.sub.1, . . . , F.sub.M-2 are inputted to the first delays Do of the filters F.sub.1, . . . , F.sub.M-1 via the data inputs 18 of the filters F.sub.1, . . . , F.sub.M-1, respectively. Thus, the digital delay lines 12 of the M filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) are serially connected to each other and are daisy-chained into a tapped delay line with MN taps in length, which multiplies the length of the FIR filter and improves its frequency selective performance, for example. Specifically, in this case, the control circuitry 11 (the multiplexers Mux_A (Mux_A.sub.1, . . . , Mux_A.sub.M-1)) simultaneously provides a plurality of (M) successive groups of N data values of the input data stream U.sub.0 to the data inputs 18 of the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1). In particular, a first group of data values of the input data u.sub.0(n), u.sub.0(n−1), . . . , u.sub.0(n−N+1) of the input data stream U.sub.0 is provided to the filter F.sub.0, a second group of data values of the input data u.sub.0(n−N), u.sub.0(n−N−1), . . . , u.sub.0(n−2N+1) of the input data stream U.sub.0 is provided to the filter F.sub.1, . . . , an M-th group of data values of the input data u.sub.0(n−(M−1)N), u.sub.0(n−(M−1)N−1), . . . , u.sub.0(n−MN+1) of the input data stream U.sub.0 is provided to the filter F.sub.M-1, The filters F.sub.0, F.sub.1, . . . , F.sub.M-1 perform filtering operations on the successive groups of data values of the input data stream U.sub.0, and output the output data streams Y.sub.0, Y.sub.1, . . . , Y.sub.M-1 from the data outputs 20 of the filters F.sub.0, F.sub.1, . . . , F.sub.M-1, respectively. Furthermore, in this case, the summation circuit P calculates the sum (sum(M)) of the data values of the output data streams Y.sub.0, Y.sub.1, . . . , Y.sub.M-1 to generate an output data stream Y.sub.s having output data y.sub.s(n) at sample period n, which is an output of the combined FIR filter with MN taps at sample period n. Therefore, in this case, the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) form a single FIR filter. In other words, the digital delay lines 12 and the processing units 14 form a single FIR filter for the input data stream U.sub.0 in response to the mode selection signal not indicating the first mode. In particular, the control circuitry 11 serially couples the digital delay lines 12 with respect to each other in response to the mode selection signal not indicating the first mode.
(26) With the data stream processing device 10, the filter configuration of the parallel FIR filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) can be reconfigurable. Thus, the signal processing (filtering) resources of the data stream processing device 10 can be efficiently utilized. In particular, efficient implementation to combine the M independent FIR filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) with N taps into a single FIR filter with MN taps can be provided.
(27) Thus, even if some of the signal processing (filtering) resources of the data stream processing device 10 (e.g., some of the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1)) cannot be utilized for processing many parallel data streams due to certain operating conditions, such as bandwidth allocation or network configuration, the unused signal processing (filtering) resources can be efficiently reallocated to process a data stream by effectively multiplying the number of taps of the signal processing (filtering) resources in use, which enhances filtering performance with minimal increases in implementation complexity, area, and power.
(28) In the illustrated embodiment, the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) are configured to be identical to each other. Specifically, the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) have the same number of taps (i.e., N taps). However, the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) can have different numbers of taps (i.e., different number of delays) with respect to each other as needed and/or desired.
(29) In the illustrated embodiment, the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) form a single FIR filter in response to all of the multiplexers Mux_A (Mux_A.sub.1, . . . , Mux_A.sub.M-1) receiving the same mode selection signal with the same value “1”. However, the multiplexers Mux_A (Mux_A.sub.1, . . . , Mux_A.sub.M-1) can be configured to independently receive different mode selection signals with different values “0” and “1” to form a plurality of groups of the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) each forming a single FIR filter. For example, if all of the multiplexers Mux_A (Mux_A.sub.1, . . . , Mux_A.sub.M-1) receive the same mode selection signal with the same value “1” except for a k-th multiplexer Mux_A.sub.k, and the k-th multiplexer Mux_A.sub.k receives the mode selection signal with the value “0,” then a first group of the filters F.sub.0, . . . , F.sub.k-1 can form a first single FIR filter for the input data stream U.sub.0 and a second group of the filters F.sub.k, . . . , F.sub.M-1 can form a second single FIR filter for the input data stream U.sub.k. In this case, the summation circuit P calculates the sum of the data values of the output data streams Y.sub.0, . . . , Y.sub.k-1 to generate a first output data stream Y.sub.s1 as an output of the first single FIR filter with kN taps, and calculates the sum of the data values of the output data streams Y.sub.k, . . . , Y.sub.M-1 to generate a second output data stream Y.sub.s2 as an output of the second single FIR filter with (M−k)N taps. In this case, the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) are grouped into two groups, but can also be grouped into more than two groups.
(30) Referring now to
(31) As illustrated in
(32) On the other hand, as illustrated in
(33) Also, as illustrated in
(34) More specifically, as illustrated in
(35) As also illustrated in
(36) On the other hand, the second inputs of the multiplexers Mux_B.sub.0, . . . Mux_B.sub.k-1 for each of the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) receive the coefficient values C(k−1), . . . , C(0) from a coefficient register R (R.sub.0, R.sub.1, . . . , R.sub.M-1) for a symmetrically corresponding one of the filters F (F.sub.M-1, F.sub.M-2 . . . , F0). Thus, when the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) are combined into a single filter in response to the value of the mode selection signal being “1,” which indicates the second mode, the single filter formed by the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) has symmetric filter coefficients. In other words, in this case (second mode), for the MN-tap combined filter, only M sets of k coefficient values C(0), C(1), . . . , C(k−1) (where N=2k) (or M sets of k+1 coefficient values C(0), C(1), C(k) (where N=2k+1)) need to be stored in total in the coefficient registers R (R.sub.0, R.sub.1, . . . , R.sub.M-1), and the M sets of k coefficient values C(0), C(1), . . . , C(k−1) are symmetrically applied to the MN data values outputted by the delayed data outputs T.sub.0, T.sub.1, . . . , T.sub.N-1 of the tapped delay lines 12 relative to the center of the series of the tapped delay lines 12 (i.e., relative to a position between the delayed data outputs T.sub.N-1 of the filter and the delayed data outputs T.sub.0 of the filter F.sub.j (where M=2j), a position between the delayed data outputs T.sub.k-1 and T.sub.k of the filter F.sub.j (where M=2j+1 and N=2k) or a position of the delayed data output T.sub.k of the filter F.sub.j (where M=2j+1 and N=2k+1) along a series of the tapped delay lines 12). Thus, symmetric arrangement (symmetric arrangement having symmetry of an even function) of the filter coefficients are preserved when the parallel filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) are combined into a single filter.
(37) More specifically, referring to
(38) In particular,
(39) Basically, for a MN-tap combined filter, MN combined filter coefficients K(0), K(1), . . . , K(MN−1) (“combined filter coefficient number” in
(40) Furthermore, in the illustrated embodiment, the symmetric filter coefficient settings having symmetry of an even function are applied to the M independent parallel filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) with N taps, respectively, which also needs only need MN/2 unique coefficient values in total. For example, a set of N/2 unique coefficient values C.sub.0(0), . . . , C.sub.0(N/2−1) is symmetrically applied as the N filter coefficients K(0), . . . , K(N−1) having symmetry of an even function for the N-tap filter F.sub.0. Similarly, other sets of N/2 unique coefficient values C.sub.1(0), . . . , C.sub.1(N/2−1), . . . , C.sub.M-1(0), . . . , C.sub.M-1(N/2−1) are also symmetrically applied as other sets of N filter coefficients K(N), . . . , K(2N−1), . . . , K(MN−N), . . . , K(MN−1) having symmetry of an even function for the N-tap filters respectively. Thus, this filter coefficient settings does not require additional storage space in the coefficient registers R (R.sub.0, R.sub.1, . . . , R.sub.M-1) whether the filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) are working independently or combined.
(41) With the data stream processing device 100, efficient implementation to combine the M independent FIR filters with N taps F (F.sub.0, F.sub.1, . . . , F.sub.M-1) each having a set of symmetric filter coefficients into a single FIR filter with MN taps having a set of symmetric filter coefficients can be provided.
(42) The signal processing (filtering) resources of two or more FIR filters can be basically combined into a single, higher tap-length FIR filter by daisy-chaining them, where the output of one filter feeds the input of the next filter. However, in case of root raised cosine (RRC) filters, the convolution of two RRC filter impulse responses results in a raised cosine (RC) impulse response. Thus, when the system requires match-filtering, as is often the case in communication systems, an RC filter cannot be used in place of an RRC filter. It is possible to split an RRC filter into two RRC filters of shorter length using numerical method. However, this adds unnecessary complexity to the system design. On the other hand, with the data stream processing device 100, this can be avoided by preserving symmetric arrangement (symmetric arrangement having symmetry of an even function or mirror symmetry) of the filter coefficients even when the parallel filters F (F.sub.0, F.sub.1, . . . , F.sub.M-1) are combined into a single filter.
Second Embodiment
(43) Referring now to
(44) As shown in
(45) Specifically, as illustrated in
(46) More specifically, in the illustrated embodiment, the digital delay line 212 is basically identical to the digital delay line 12 as illustrated in
(47) Furthermore, as illustrated in
(48) As also illustrated in
(49) The processing unit 214 includes an FIR filter sub-section 222, a dot product calculator 223, and an adaptation sub-section 224. The FIR filter sub-section 222 calculate the complex conjugate of the filter coefficients [c.sub.0(n), . . . , c.sub.N-1(n)] that have been calculated or updated by the adaptation sub-section 224. The dot product calculator 223 calculates the dot product between the complex conjugate of the filter coefficients [c.sub.0(n), . . . , c.sub.N-1(n)] (i.e., conj([c.sub.0(n), . . . , c.sub.N-1(n)])) outputted from the FIR filter sub-section 222 and a vector of the delayed output data [u(n−N+1), u(n)] outputted from the digital delay line 212, and then outputs the dot product as the output data y(n) (y.sub.0(n), y.sub.1(n), . . . , y.sub.M-1(n)) of the output data stream Y (Y.sub.0, Y.sub.1, . . . , Y.sub.M-1). The operations to calculate the output data y(n) (y.sub.0(n), y.sub.1(n), . . . , y.sub.M-1(n)) by the FIR filter sub-section 222 and the dot product calculator 223 are mathematically identical to the operations to calculate the output data y(n) (y.sub.0(n), y.sub.1(n), . . . , y.sub.M-1(n)) by the processing unit 14 (
(50) The adaptation sub-section 224 includes the error calculation circuit 226 and a coefficient update circuit 228. The error calculation circuit 226 computes the difference between known training sample d(n) and the output data y(n) to find an error e(n) at sample period n (e(n)=d(n)−y(n)). Furthermore, the coefficient update circuit 228 calculates an updated set of filter coefficients [c.sub.0(n+1), . . . , c.sub.N-1(n+1)] for sample period n+1 based on a coefficient update formula using the LMS algorithm. Specifically, the updated set of filter coefficients [c.sub.0(n+1), . . . , c.sub.N-1(n+1)] for sample period n+1 is computed as follows: [c.sub.0(n+1), . . . , c.sub.N-1(n+1)]=[c.sub.0(n), . . . , c.sub.N-1(n)]+μ[u(n−N+1), . . . , u(n)]conj(e(n)), based on a vector of the stored input data [u(n−N+1), . . . , u(n)] stored in the data storage 213, the current output data y(n), the complex conjugate of the error e(n), the current set of the filter coefficient at sampler period n [c.sub.0(n), . . . c.sub.N-1(n)], and an appropriate adaptation step size μ. The calculations of the error calculation circuit 226 and the coefficient update circuit 228 can be performed using the same clock as the clock applied to the digital delay line 212 or using multiple clocks different from the clock applied to the digital delay line 212. In the illustrated embodiment, the data storage 213 is configured as forming a flexible pipeline delay, and can hold the input data [u(n−N+1), . . . , u(n)] while the calculations of the error calculation circuit 226 and the coefficient update circuit 228 using the different clocks take time to finish.
(51) As illustrated in
(52) In the illustrated embodiment, the multiplexers Mux_A (Mux_A.sub.1, . . . , Mux_A.sub.M-1) and the multiplexers Mux_C (Mux_C.sub.0, . . . , Mux_C.sub.M-1) are formed as a “switch”, “multiple input, single output switch,” “switching element”, “mux”, “signal selector”, or “data selector.” In the illustrated embodiment, each of the multiplexers Mux_A (Mux_A.sub.1, . . . , Mux_A.sub.M-1) and the multiplexers Mux_C (Mux_C.sub.0, . . . , Mux_C.sub.M-1) has first and second inputs. The first inputs of the multiplexers Mux_A.sub.1, . . . , Mux_A.sub.M-1 receive the input data streams U.sub.1, . . . , U.sub.M-1, respectively. The second inputs of the multiplexers Mux_A.sub.1, . . . , Mux_A.sub.M-1 are connected to the digital delay lines 212 of the equalizers E.sub.0, E.sub.1, . . . , E.sub.M-2, respectively, to receive the past input data inputted via the input data stream U.sub.0. On the other hand, the first inputs of the multiplexers Mux_C.sub.0, . . . , Mux_C.sub.M-1 receive the errors e.sub.0(n), . . . , e.sub.M-1(n) from the error calculation circuits 226 of the equalizers E.sub.0, E.sub.1, . . . , E.sub.M-2, respectively. The second inputs of the multiplexers Mux_C.sub.0, . . . , Mux_C.sub.M-1 receive the error e.sub.s(n) from the error calculation circuit ER.
(53) In particular, the multiplexers Mux_A (Mux_A.sub.1, . . . , Mux_A.sub.M-1) and Mux_C (Mux_C.sub.0, . . . , Mux_C.sub.M-1) each have a mode selection input MS for receiving a mode selection signal. The multiplexers Mux_A (Mux_A.sub.1, . . . , Mux_A.sub.M-1) each selectively couple either the first input or the second input to the data input 218 of a respective one of the equalizers E (E.sub.1, . . . , E.sub.M-1) in response to the mode selection signal. Furthermore, the multiplexers Mux_C (Mux_C.sub.0, . . . , Mux_C.sub.M-1) each selectively couple either the first input or the second input to an error input 230 of the coefficient update circuit 228 of a respective one of the equalizers E (E.sub.1, . . . , E.sub.M-1) in response to the mode selection signal.
(54) The data stream processing algorithm (e.g., data stream processing method) of the data stream processing device 200 will be described by reference to
(55) In particular, if the value of the mode selection signal is “0,” which indicates the first mode, then the multiplexers Mux_A.sub.1, . . . , Mux_A.sub.M-1 couple the first inputs to the data inputs 218 of the equalizers E.sub.1, . . . , E.sub.M-1, respectively, such that the input data streams U.sub.1, . . . , U.sub.M-1 are inputted to the equalizers E.sub.1, . . . , E.sub.M-1 via the data inputs 218 of the equalizers E.sub.1, . . . , E.sub.M-1, respectively. Furthermore, in this case, the multiplexers Mux_C.sub.0, . . . , Mux_C.sub.M-1 couple the first inputs to the coefficient update circuits 228 of the equalizers E.sub.1, . . . , E.sub.M-1, respectively, such that the errors e.sub.0(n), . . . , e.sub.M-1(n) from the error calculation circuits 226 are inputted to the coefficient update circuits 228, respectively.
(56) This allows the equalizers E.sub.0, E.sub.1, . . . , E.sub.M-1 to independently perform equalizing operations on the independent input data streams U.sub.0, U.sub.1, . . . , U.sub.M-1, respectively. In this case, as illustrated in
(57) In other words, the corresponding pairs of the digital delay lines 212 and the processing units 214 form a plurality of equalizers for the input data streams U.sub.0, U.sub.1, . . . , U.sub.M-1, respectively, in response to the mode selection signal indicating the first mode. Furthermore, in the illustrated embodiment, the processing units 214 update the coefficient values for generating the output data streams Y.sub.0, Y.sub.1, . . . , Y.sub.M-1 based on the errors (e.g., error values) received via the error inputs 230, respectively. The control circuitry 211 provides the errors e(n) (e.g., error values) between the output data y(n) of the processing units 214 and the training samples or reference signals d(n) to the error inputs 230 of the processing units 214, respectively, in response to the mode selection signal indicating the first mode. Furthermore, in the illustrated embodiment, the processing units 214 generate the output data streams Y.sub.0, Y.sub.1, . . . , Y.sub.M-1 by applying the coefficient values to the data values of the independent input data streams U.sub.0, U.sub.1, . . . , U.sub.M-1, respectively, in response to the mode selection signal indicating the first mode.
(58) On the other hand, if the value of the mode selection signal is “1,” which indicates the second mode (or does not indicate the first mode), then the multiplexers Mux_A.sub.1, . . . , Mux_A.sub.M-1 couple the second inputs to the data inputs 218 of the equalizers E.sub.1, . . . , E.sub.M-1, respectively, such that the digital delay lines 212 of the equalizers E.sub.0, E.sub.1, . . . , E.sub.M-1 are cascaded to each other. Thus, the digital delay lines 212 of the equalizers E.sub.0, E.sub.1, . . . , E.sub.M-1 are serially connected to each other to output a set of MN input data u.sub.0(n−MN+1), . . . , u.sub.0(n) inputted via the input data stream U.sub.0. In particular, the digital delay line 212 of the equalizer E.sub.0 outputs a set of N input data u.sub.0(n−N+1), . . . , u.sub.0(n) of the input data stream U.sub.0, the digital delay line 212 of the equalizer E.sub.1 output a set of N input data u.sub.0(n−2N+1), . . . , u.sub.0(n−N) of the input data stream U.sub.0, . . . , and the digital delay line 212 of the equalizer E.sub.M-1 outputs a set of N input data u.sub.0(n−MN+1), . . . , u.sub.0(n−(M−1)N) of the input data stream U.sub.0. Similarly, the data storage 213 of the equalizer E.sub.0 stores a history of N input data u.sub.0(n−N+1), . . . , u.sub.0(n) of the input data stream U.sub.0, the data storage 213 of the equalizer E.sub.1 stores a history of N input data u.sub.0(n−2N+1), . . . , u.sub.0(n−N) of the input data stream U.sub.0, . . . , and the data storage 213 of the equalizer E.sub.M-1 stores a history of N input data u.sub.0(n−MN+1), . . . , u.sub.0(n−(M−1)N) of the input data stream U.sub.0. Furthermore, in this case, the multiplexers Mux_C.sub.0, . . . , Mux_C.sub.M-1 couple the second inputs to the coefficient update circuits 228 of the equalizers E.sub.0, E.sub.1, . . . , E.sub.M-1, respectively, such that the coefficient update circuits 228 receive the error e.sub.s(n) from the error calculation circuit ER.
(59) In this case, in each of the equalizers E (E.sub.0, E.sub.1, . . . , E.sub.M-1), the coefficient update circuit 228 independently calculates an updated set of filter coefficients [c.sub.0(n+1), . . . , c.sub.N-1(n+1)] based on the coefficient update formula using the LMS algorithm. Specifically, the updated set of filter coefficients [c.sub.0(n+1), . . . , c.sub.N-1(n+1)] is computed based on a vector of the stored input data stored in the data storage 213, the current output data y(n), the complex conjugate of the error e.sub.s(n), the current set of the filter coefficient at sampler period n [c.sub.0(n), . . . c.sub.N-1(n)], and an appropriate adaptation step size μ. Also, in each of the equalizers E (E.sub.0, E.sub.1, . . . , E.sub.M-1), the processing unit 214 independently calculates the current output data y(n) based on the same formula illustrated in
(60) In other words, the digital delay lines 212 and the processing units 214 form a single equalizer for the input data stream U.sub.0 in response to the mode selection signal not indicating the first mode. Furthermore, the control circuitry 211 provides the error e.sub.s(n) (e.g., error value) between the sum of the data values (e.g., output data) of the output data streams Y.sub.0, Y.sub.1, . . . , Y.sub.M-1 of the processing units 214 and the training sample or reference signal d.sub.s(n) to the error inputs 230 of the processing units 214, respectively, in response to the mode selection signal not indicating the first mode. Furthermore, in the illustrated embodiment, the processing units 214 generate the output data streams Y.sub.0, Y.sub.1, . . . , Y.sub.M-1 by applying the coefficient values to the successive groups of the data values of the input data stream U.sub.0, respectively, in response to the mode selection signal not indicating the first mode.
(61) In the illustrated embodiment, the M equalizers E (E.sub.0, E.sub.1, . . . , E.sub.M-1) with N taps are combined into a single equalizer of MN taps in a manner similar to the first embodiment, in which an array of M FIR filters of N taps are combined into a single FIR filter of MN taps. To combine the M equalizers E (E.sub.0, E.sub.1, . . . , E.sub.M-1), the output data from the equalizers E (E.sub.0, E.sub.1, . . . , E.sub.M-1) are summed in a manner similar to the first embodiment, in which the M FIR filters are combined. In the illustrated embodiment, to combine the adaptation sub-sections 224 of the M individual equalizers E (E.sub.0, E.sub.1, . . . , E.sub.M-1), a common error e.sub.s(n) is produced from the difference between the training sample d.sub.s(n) and the output data y.sub.s(n) of the combined equalizer. This common error e.sub.s(n) is then used to update all coefficient sets of all M equalizers E (E.sub.0, E.sub.1, . . . , E.sub.M-1), which maintain their own independent storages for the input data or samples u(n) and the set of the filter coefficients [c.sub.0(n), . . . , c.sub.N-1(n)]. In the illustrated embodiment, the switch between the first mode in which the M equalizers E (E.sub.0, E.sub.1, . . . , E.sub.M-1) operate independently and the second mode in which the M equalizers E (E.sub.0, E.sub.1, . . . , E.sub.M-1) are combined into a single equalizer is effected by setting the value of the mode selection signal to “1” inputted to the multiplexors Mux_A and Mux_C.
(62) With the data stream processing device 200, the equalizer configuration of the linear channel equalizers E (E.sub.0, E.sub.1, . . . , E.sub.M-1) can be reconfigurable. Thus, the signal processing (equalizing) resources of the data stream processing device 200 can be efficiently utilized. In particular, an efficient implementation to combine the LMS linear channel equalizers E (E.sub.0, E.sub.1, . . . , E.sub.M-1) having N-tap filter length into a single LMS linear channel equalizer with MN-tap length filter can be provided.
(63) Thus, even if some of the signal processing (equalizing) resources of the data stream processing device 200 (e.g., some of the equalizers E (E.sub.0, E.sub.1, . . . , E.sub.M-1)) cannot be utilized for processing many parallel data streams due to certain operating conditions, such as bandwidth allocation or network configuration, the unused signal processing (equalizing) resources can be efficiently reallocated to process a data stream by effectively multiplying the number of taps of the signal processing (equalizing) resources in use, which enhances equalizing performance with minimal increases in implementation complexity, area, and power. In particular, the liner channel equalizers generally not only have a filtering section, but also a coefficient update unit. Reallocating of the signal processing (equalizing) resources of unused parallel equalizers to a single equalizer multiplies both the tap length of the filtering section and the number of the coefficient update units. Thus, the performance can be improved without slowing down the rate of the coefficient update even when the number of the filter coefficients has multiplied.
(64) Generally, when a plurality of liner channel equalizers are provided in a device, each equalizer computes an error between the output and the training sample or reference to update filter coefficients. If two equalizers are daisy-chained together, then each adaptive filter section needs to compute its own error, which is computationally inefficient. On the other hand, with the data stream processing device 200, two or more filtering sections can be combined into a single filtering section with a single coefficient set.
(65) In the illustrated embodiment, the data stream processing device 200 only additionally needs a summation (sum(M)) by the summation circuit P, a difference computation for the error e.sub.s(n) by the error calculation circuit ER, and a switching by the multiplexors Mux C to combine the M equalizers E (E.sub.0, E.sub.1, . . . , E.sub.M-1) of N taps into a more powerful equalizer of MN taps. Thus, with the data stream processing device 200, the data storages and the digital delay lines for input data and all of the coefficient update facilities can be efficiently reused.
(66) In the illustrated embodiment, the equalizers E (E.sub.0, E.sub.1, . . . , E.sub.M-1) form a single equalizer in response to all of the multiplexers Mux_A (Mux_A.sub.1, . . . , Mux_A.sub.M-1) receiving the same mode selection signal with the same value “1”. However, the multiplexers Mux_A (Mux_A.sub.1, . . . , Mux_A.sub.M-1) can be configured to independently receive different mode selection signals with different values “0” and “1” to form a plurality of groups of the equalizers E (E.sub.0, E.sub.1, . . . , E.sub.M-1) each forming a single equalizer. For example, if all of the multiplexers Mux_A (Mux_A.sub.1, . . . , Mux_A.sub.M-1) receive the same mode selection signal with the same value “1” except for a k-th multiplexer Mux_A.sub.k, and the k-th multiplexer Mux_A.sub.k receives the mode selection signal with the value “0,” then a first group of the equalizers E.sub.0, . . . , E.sub.k-1 can form a first single equalizer for the input data stream U.sub.0 and a second group of the equalizers E.sub.k, . . . , E.sub.M-1 can form a second single equalizer for the input data stream U.sub.k. In this case, the summation circuit P calculates the sum of the data values of the output data streams Y.sub.0, . . . , Y.sub.k-1 to generate a first output data stream Y.sub.s1 as an output of the first single equalizer with kN taps, and calculates the sum of the data values of the output data streams Y.sub.k, . . . , Y.sub.M-1 to generate a second output data stream Y.sub.s2 as an output of the second single equalizer with (M−k)N taps. In this case, the equalizers E (E.sub.0, E.sub.1, . . . , E.sub.M-1) are grouped into two groups, but can also be grouped into more than two groups.
(67) The foregoing data stream processing devices 10, 100 and 200 can be included in a demodulator of a communication signal receiver. For example, the demodulator can be configured to receive communication signal, and perform analog-to-digital conversion of the communication signal to produce the input data streams U (U.sub.0, U.sub.1, . . . , U.sub.M-1). Also, the foregoing data stream processing devices 10, 100 and 200 can be included in a modular of a communication signal transmitter. Likewise, the foregoing data stream processing devices 10, 100 and 200 can be included in a digital filter, such as, but not limited to, in a device for audio or radar signal processing.
(68) The foregoing data stream processing devices 10, 100 and 200 can be implemented in an application specific integrated circuit (ASIC). The foregoing data stream processing devices 10, 100 and 200 can be implemented and/or included in a field programmable gate array (FPGA) device or other such programmable hardware logic device. A bitstream can include data therein which, when received by an FPGA device or other such programmable hardware logic device, causes the receiving device to implement and/or include one or more of the foregoing data stream processing devices 10, 100 and 200. A machine-readable medium can include such a bitstream recorded therein.
(69) A machine-readable medium can include a netlist or other such hardware specification recorded thereon that when processed by suitable software and/or hardware to instantiate an ASIC device or configure an FPGA device or other such programmable hardware logic device, the device implements and/or includes one or more of the foregoing data stream processing devices 10, 100 and 200. A machine-readable medium can include a device specification in a hardware description language such as, but not limited to, VHDL or Verilog, recorded thereon that when processed by suitable software and/or hardware to instantiate an ASIC device or configure an FPGA device or other such programmable hardware logic device, the device implements and/or includes one or more of the foregoing data stream processing devices 10, 100 and 200. A computer-implemented method can automatically generate the foregoing netlist or other such hardware specification or device specification in a hardware description language. For example, the computer-implemented method can perform an algorithm to automatically determine appropriate couplings among elements of one or more of the foregoing data stream processing devices. Instructions can be recorded on a machine-readable medium which, when executed by one or more computer processors, cause the one or more computer processors to perform the computer-implemented method.
(70) In understanding the scope of the present invention, the term “comprising” and its derivatives, as used herein, are intended to be open ended terms that specify the presence of the stated features, elements, components, groups, integers, and/or steps, but do not exclude the presence of other unstated features, elements, components, groups, integers and/or steps. The foregoing also applies to words having similar meanings such as the terms, “including”, “having” and their derivatives. Also, the terms “part,” “section,” “portion,” “member” or “element” when used in the singular can have the dual meaning of a single part or a plurality of parts. Also, the term “detect” as used herein to describe an operation or function carried out by a component, a section, a device or the like includes a component, a section, a device or the like that does not require physical detection, but rather includes determining, measuring, modeling, predicting or computing or the like to carry out the operation or function. The term “configured” as used herein to describe a component, section or part of a device includes hardware and/or software that is constructed and/or programmed to carry out the desired function. The terms of degree such as “substantially”, “about” and “approximately” as used herein mean a reasonable amount of deviation of the modified term such that the end result is not significantly changed.
(71) While only selected embodiments have been chosen to illustrate the present invention, it will be apparent to those skilled in the art from this disclosure that various changes and modifications can be made herein without departing from the scope of the invention as defined in the appended claims. For example, the size, shape, location or orientation of the various components can be changed as needed and/or desired. Components that are shown directly connected or contacting each other can have intermediate structures disposed between them. The functions of one element can be performed by two, and vice versa. The structures and functions of one embodiment can be adopted in another embodiment. It is not necessary for all advantages to be present in a particular embodiment at the same time. Every feature which is unique from the prior art, alone or in combination with other features, also should be considered a separate description of further inventions by the applicant, including the structural and/or functional concepts embodied by such feature(s). Thus, the foregoing descriptions of the embodiments according to the present invention are provided for illustration only, and not for the purpose of limiting the invention as defined by the appended claims and their equivalents.