Sequence detection
11205131 · 2021-12-21
Assignee
Inventors
- Hazar Yüksel (New York, NY, US)
- Giovanni Cherubini (Rueschlikon, CH)
- Roy Cideciyan (Rüschlikon, CH)
- Simeon Furrer (Altdorf, CH)
- Marcel Kossel (Reichenburg, CH)
Cpc classification
G06N7/01
PHYSICS
H03M13/3961
ELECTRICITY
G06N5/01
PHYSICS
International classification
G06N7/00
PHYSICS
Abstract
Methods and apparatus are provided for calculating branch metrics, associated with possible transitions between states of a trellis, in a sequence detector for detecting symbol values corresponding to samples of an analog signal transmitted over a channel. For each sample and each transition, the method calculates a plurality of distance values indicative of distance between that sample and respective hypothesized sample values for that transition. In parallel with calculation of the distance values, the sample is compared with a set of thresholds, each defined between a pair of successive hypothesized symbol values arranged in value order, to produce a comparison result. An optimum distance value is selected as a branch metric for the transition in dependence on the comparison result.
Claims
1. A method for calculating branch metrics, associated with possible transitions between states of a trellis, in a sequence detector for detecting symbol values corresponding to samples of an analog signal transmitted over a channel, the method comprising, for each said sample and each said transition: calculating a plurality of distance values indicative of distance between that sample and respective hypothesized sample values for that transition; in parallel with calculation of said distance values, comparing the sample with a set of thresholds, each threshold defined between a pair of successive said hypothesized sample values arranged in value order, to produce a comparison result; and selecting an optimum distance value as a branch metric for the transition in dependence on said comparison result.
2. The method as claimed in claim 1 wherein each threshold is defined as halfway between said pair of hypothesized sample values.
3. The method as claimed in claim 1 including selecting the minimum distance value as said branch metric.
4. The method as claimed in claim 1 including calculating each distance value as the modulus of the difference between the sample and the respective hypothesized sample value.
5. The method as claimed in claim 1 for use with a channel having an impulse response with L>0 interfering channel coefficients, the method including selecting said optimum distance value in dependence on said comparison result and the sign of each said channel coefficient.
6. A branch metric unit for calculating branch metrics, associated with possible transitions between states of a trellis, in a sequence detector for detecting symbol values corresponding to samples of an analog signal transmitted over a channel, the branch metric unit comprising, for each said transition: distance calculation logic adapted to calculate, for each said sample, a plurality of distance values indicative of distance between that sample and respective hypothesized sample values for that transition; comparison logic, connected in parallel with the distance calculation logic, adapted to compare each sample with a set of thresholds, each threshold defined between a pair of successive said hypothesized sample values arranged in value order, to produce a comparison result; and selection logic adapted to select, for each said sample, an optimum distance value as a branch metric for the transition in dependence on said comparison result for that sample.
7. The branch metric unit as claimed in claim 6 wherein each threshold is defined as halfway between said pair of hypothesized sample values.
8. The branch metric unit as claimed in claim 6 wherein the selection logic is adapted to select the minimum distance value as said branch metric.
9. The branch metric unit as claimed in claim 6 wherein the distance calculation logic is adapted to calculate each distance value as the modulus of the difference between the sample and the respective hypothesized sample value.
10. The branch metric unit as claimed in claim 6 for use with a channel having an impulse response with L>0 interfering channel coefficients, wherein the selection logic is adapted to select said optimum distance value in dependence on said comparison result and the sign of each said channel coefficient.
11. The branch metric unit as claimed in claim 6, the unit being adapted to calculate branch metrics in a two-state 4-PAM Viterbi detector.
12. The branch metric unit as claimed in claim 6, the unit being adapted to calculate branch metrics in an eight-state 4-D 5-PAM Viterbi detector.
13. A sequence detector for detecting symbol values corresponding to a sequence of samples of an analog signal transmitted over a channel, the sequence detector comprising: a branch metric unit as claimed in claim 6 for calculating branch metrics for each said sample and each said transition, said branch metric unit comprising, for each said transition: distance calculation logic adapted to calculate, for each said sample, a plurality of distance values indicative of distance between that sample and respective hypothesized sample values for that transition; comparison logic, connected in parallel with the distance calculation logic, adapted to compare each sample with a set of thresholds, each threshold defined between a pair of successive said hypothesized sample values arranged in value order, to produce a comparison result; and selection logic adapted to select, for each said sample, an optimum distance value as a branch metric for the transition in dependence on said comparison result for that sample; a path metric unit, arranged to receive branch metrics from the branch metric unit, adapted to calculate path metrics for respective survivor paths to each state of said trellis and to select, for each state, a latest symbol value in the survivor path to that state in dependence on the branch metrics; and a survivor memory unit arranged to receive said latest symbol value in the survivor path to each state from the path metric unit and adapted to select, at the end of said sequence of samples, a survivor path corresponding to said sequence.
14. The sequence detector as claimed in claim 13 wherein, in the branch metric unit, each said threshold is defined as halfway between said pair of hypothesized sample values.
15. The sequence detector as claimed in claim 13 wherein, in the branch metric unit, the selection logic is adapted to select the minimum distance value as said branch metric.
16. The sequence detector as claimed in claim 13 wherein, in the branch metric unit, the distance calculation logic is adapted to calculate each distance value as the modulus of the difference between the sample and the respective hypothesized sample value.
17. The sequence detector as claimed in claim 13 for use with a channel having an impulse response with L>0 interfering channel coefficients, wherein, in the branch metric unit, the selection logic is adapted to select said optimum distance value in dependence on said comparison result and the sign of each said channel coefficient.
18. The sequence detector as claimed in claim 13 wherein the branch metric unit, path metric unit and survivor memory unit are adapted to collectively implement a two-state 4-PAM Viterbi detector.
19. The sequence detector as claimed in claim 13 wherein the branch metric unit, path metric unit and survivor memory unit are adapted to collectively implement an eight-state 4-D 5-PAM Viterbi detector.
20. A computer program product comprising a non-transitory computer readable storage medium having program instructions embodied therein, the program instructions being executable by a processing device to cause the processing device to perform a method for calculating branch metrics, associated with possible transitions between states of a trellis, during sequence detection for detecting symbol values corresponding to samples of an analog signal transmitted over a channel, said method comprising, for each said sample and each said transition: calculating a plurality of distance values indicative of distance between that sample and respective hypothesized sample values for that transition; in parallel with calculation of said distance values, comparing the sample with a set of thresholds, each defined between a pair of successive said hypothesized sample values arranged in value order, to produce a comparison result; and selecting an optimum distance value as a branch metric for the transition in dependence on said comparison result.
Description
BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
DETAILED DESCRIPTION
(11) The present invention may be a system, a method, and/or a computer program product. The computer program product may include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out aspects of the present invention.
(12) The computer readable storage medium can be a tangible device that can retain and store instructions for use by an instruction execution device. The computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. A non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing. A computer readable storage medium, as used herein, is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
(13) Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network. The network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
(14) Computer readable program instructions for carrying out operations of the present invention may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C++ or the like, and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider). In some embodiments, electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present invention.
(15) Aspects of the present invention are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer readable program instructions.
(16) These computer readable program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
(17) The computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of operational steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
(18) The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts or carry out combinations of special purpose hardware and computer instructions.
(19)
(20) The detector 1 comprises a branch metric unit (BMU) 2, a path metric unit (PMU) 3 and a survivor memory unit (SMU) 4. The BMU 2 receives successive input samples z and calculates, for each input sample, branch metrics associated with possible transitions between states χ of a trellis as explained in detail below. The PMU 3 receives branch metrics λ for each sample from BMU 2. Based on the branch metrics for successive samples, the PMU calculates path metrics for respective survivor paths to each state of the trellis and selects, for each state, a latest symbol value û in the survivor path to that state. This involves, for each input sample z, updating of previous path metrics by addition of current branch metrics to obtain partial path metrics for each state, and then selecting an optimum (e.g. smallest) path metric from the partial path metrics for each state. The optimum path metric for each state corresponds to the most likely path to that state. This optimum path metric thus decides the preceding state χ in the current survivor path to the state, and also the latest symbol value û in that survivor path. The state decisions χ and symbol decisions û are output to SMU 4 which stores the symbol decisions for the survivor paths. (In some embodiments, the state decisions χ and symbol decisions û are also fed back to BMU 2 for use in selection of hypothesized symbol values as explained below). At the end of the input sample sequence, the SMU 4 selects an optimum (most likely) one of the survivor paths for the sequence, e.g. the survivor path with the smallest path metric. This optimum path defines the symbol sequence output by SMU 4 and corresponds to the most-likely sequence of symbols at the channel input.
(21) The component units 2, 3 and 4 of detector 1 are implemented as a series of pipeline stages which process input samples in a succession of time-steps k=0, 1, . . . , (K−1) corresponding to a sequence of K samples z.sub.k produced at the channel output. An ISI channel has a discrete-time impulse response with L+1 channel coefficients where L>0. In particular, the channel is modelled by its discrete-time impulse-response sequence h=(h.sub.0, h.sub.1, . . . , h.sub.L) where L is the number of interfering channel coefficients (channel memory). For a symbol u.sub.k input to the channel at time k, the corresponding channel output y.sub.k can be expressed as y.sub.k=Σ.sub.i=0.sup.Lh.sub.iu.sub.k−i and is thus a function of u.sub.k and the L previous symbols u.sub.k−1 to u.sub.k−L. This output is corrupted by additive white Gaussian noise w.sub.k, whereby the resulting input sample at detector 1 is given by z.sub.k=y.sub.k+w.sub.k.
(22) The BMU 2 receives the input samples z.sub.k and also receives the channel coefficient vector h=(h.sub.0, h.sub.1, . . . , h.sub.L) described above. For each input sample z.sub.k, branch metrics λ.sub.k are calculated based on the difference between the input sample and a set of hypothesized sample values, denoted here by {tilde over (y)}.sub.k, calculated for each possible transition between states χ.sub.k, χ.sub.k+1 of the trellis. For example, with two post-cursor per-survivor decision-feedback taps {h.sub.1, h.sub.2}, i.e. L=2, the hypothesized sample values {tilde over (y)}.sub.k are calculated by taking the inner product of the symbols û.sub.k−1, û.sub.k−2 in each survivor path with the post-cursor discrete-time channel impulse-response sequence {h.sub.1, h.sub.2} and adding h.sub.0u.sub.k to the result:
{tilde over (y)}.sub.k=u.sub.k+h.sub.1û.sub.k−1+h.sub.2û.sub.k−2∀u.sub.k∈
where is the symbol constellation of the transmission scheme and we assume here, without loss of generality, that the main-cursor tap h.sub.0=1. The hypothesized sample values {tilde over (y)}.sub.k are what the input sample z.sub.k would be for certain permutations of transmitted input symbols {u.sub.k, u.sub.k−1, u.sub.k−2} in the absence of noise.
(23) The symbol u.sub.k, transmitted in time-step k, determines the state χ.sub.k+1 of a survivor path at the end of that time-step. For example, in a trellis with two states χ=0 and χ=1, there are four possible transitions (χ.sub.k, χ.sub.k+1) at time-step k, i.e., (0, 0), (0, 1), (1, 0) and (1, 1). For each of these transitions, there will be a number of hypothesized sample values {tilde over (y)}.sub.k.sup.j, j=0, 1, . . . , depending on the number of possible permutations of symbol values in Σ.sub.i=0.sup.Lh.sub.iu.sub.k−i for the path terminating in that transition. This is explained in more detail below. For each sample z.sub.k and each transition (χ.sub.k, χ.sub.k+1), the BMU 2 calculates distance values, denoted here by d.sub.k.sup.j, indicative of distance between that sample and respective hypothesized sample values {tilde over (y)}.sub.k.sup.j for that transition. Various distance metrics, such as Euclidean distance or squared Euclidean distance, may be used here. In the preferred embodiments below, each distance value d.sub.k.sup.j, is calculated as the modulus of the difference between the sample and the respective hypothesized sample value:
d.sub.k.sup.j(χ.sub.k,χ.sub.k+1)=|z.sub.k−{tilde over (y)}.sub.k.sup.j(χ.sub.k,χ.sub.k+1)| (1)
(24) The branch metric λ(χ.sub.k, χ.sub.k+1) for each transition is selected as the optimum (here smallest) distance value for that transition:
λ.sub.k(χ.sub.k,χ.sub.k+1)=min.sub.jd.sub.k.sup.j(χ.sub.k,χ.sub.k+1) (2)
(25) The index of the selected distance value is thus given by:
d.sub.k.sup.argmin=argmin.sub.jd.sub.k.sup.j(χ.sub.k,χ.sub.k+1) (3)
(26) In a conventional BMU, the distance values for a given transition are first calculated, and the resulting distance values are then compared to identify the minimum value which is selected as the branch metric. In contrast,
(27) By comparing the input sample with the thresholds (step 12) in parallel with the distance calculation (step 11), the implementation complexity, power consumption, and propagation delay of the BMU can all be reduced compared to a conventional BMU implementation. The comparison operation is performed directly with the input sample instead of the distance metric calculated in the BMU. In doing so, the propagation delay of one comparator can be eliminated from the longest path of the BMU, and the total number of comparators required for level discrimination can be reduced. The branch metric calculation method does not require additional pipeline stages, so no extra latency is incurred. Embodiments of the invention thus offer a significant increase in speed of the branch metric calculation.
(28) Particular embodiments of BMU 2 are described in more detail below for two transmission schemes: uncoded 4-PAM (four-level pulse-amplitude modulation) and 4-D (four-dimensional) 5-PAM TCM (Trellis Coded Modulation) with eight states. The PMU 3 and SMU 4 of detector 1 can be implemented in conventional manner for these embodiments. The various circuit elements of the embodiments described can be implemented by hard-wired logic circuits of generally known form. In general, however, functionality of components can be implemented in hardware or software or a combination thereof.
(29) The following notation will be used: the signal constellation;
(i) information symbol in
: i∈
, 0≤i≤|
|;
.sub.s subset in
: s∈{0,1},
.sub.0∩
.sub.1=0,
.sub.0∪
.sub.1=
, and intra-subset Euclidean distance is maximized; u.sub.k transmitted symbol at time k, u.sub.k ∈
; χ.sub.k state at time k.
(30) In the first embodiment, the sequence detector 1 is a reduced-state sequence detector (RSSD), and BMU 2 calculates branch metrics for transitions between states (also known as “substates”) of a reduced-state trellis. The reduced-state trellis is constructed via mapping by set partitioning. The symbol constellation used in the transmission scheme is partitioned into subsets, and the subset to which a symbol u.sub.k, transmitted in time-step k, belongs determines the state χ.sub.k+1 of a survivor path at the end of that time-step. In this example, the BMU 2, PMU 3 and SMU 4 of RSSD 1 collectively implement a two-state 4-PAM Viterbi detector. The discrete-time channel impulse-response sequence is taken as h=(1, h.sub.1) with |h.sub.1|<1. A symbol u.sub.k transmitted over the channel at time k∈{0, 1, . . . , K−1} is drawn from a 4-PAM signal constellation
containing four symbols centered on the origin:
={−3, 1, 1, 3}, whereby
(0)=−3,
(1)=−1,
(2)=1, and
(3)=3. The constellation is partitioned into two subsets
.sub.0={
(0),
(2)}={−3, 1}, and
.sub.1={
(1),
(3)}={−1, 3} such that the intra-subset Euclidean distance is maximized. The reduced-state subset trellis has two states χ=0 and χ=1. The subset to which the symbol u.sub.k belongs determines the state χ.sub.k+1 at time k+1 according to: χ.sub.k+1=0 if u.sub.k ∈
.sub.0 and χ.sub.k+1=1 if u.sub.k ∈
.sub.1.
(31) The BMU 2 comprises four component units (sub-BMUs) for calculating the branch metrics λ.sub.k(0, 0), λ.sub.k(0,1), λ.sub.k(1, 0), λ.sub.k(1,1) respectively for the four possible transitions in the reduced-state trellis of
(32) For the χ.sub.k=0 to χ.sub.k+1=0 transition, the possible trellis transitions for {u.sub.k, u.sub.k−1} are shown in .sub.0={
(0),
(2)} and û.sub.k ∈
.sub.0={
(0),
(2)}. There are four hypothesized sample values {tilde over (y)}.sub.k.sup.j(0, 0)=û.sub.k+h.sub.1û.sub.k−1 as follows:
{tilde over (y)}.sub.k.sup.0(0,0)=(0)+h.sub.1
(0);
{tilde over (y)}.sub.k.sup.1(0,0)=(0)+h.sub.1
(2);
{tilde over (y)}.sub.k.sup.2(0,0)=(2)+h.sub.1
(0);
{tilde over (y)}.sub.k.sup.3(0,0)=(2)+h.sub.1
(2). (4)
(33) When 0<h.sub.1<1, the intervals of the hypothesized sample values {tilde over (y)}.sub.k.sup.j(0, 0) are as follows:
{tilde over (y)}.sub.k.sup.0(0,0)∈(2(0),
(0));
{tilde over (y)}.sub.k.sup.1(0,0)∈((0),
(0)+
(2));
{tilde over (y)}.sub.k.sup.2(0,0)∈((0),+
(2),
(2));
{tilde over (y)}.sub.k.sup.3(0,0)∈((2),2
(2).
(34) Therefore, the hypothesized sample values can be arranged in value order as follows:
{tilde over (y)}.sub.k.sup.0(0,0)<{tilde over (y)}.sub.k.sup.1(0,0)<{tilde over (y)}.sub.k.sup.2(0,0)<{tilde over (y)}.sub.k.sup.3(0,0). (5)
(35) The set of thresholds {θ} used in step 12 of
(36)
(37) The distance values d.sub.k.sup.j(0,0) are calculated in step 11 of
d.sub.k.sup.0(0,0)=|z.sub.k−{tilde over (y)}.sub.k.sup.0(0,0)|
d.sub.k.sup.1(0,0)=|z.sub.k−{tilde over (y)}.sub.k.sup.1(0,0)|
d.sub.k.sup.2(0,0)=|z.sub.k−{tilde over (y)}.sub.k.sup.2(0,0)|
d.sub.k.sup.3(0,0)=|z.sub.k−{tilde over (y)}.sub.k.sup.3(0,0)|
(38) These distance values effectively constitute the “candidate branch metrics” from which the optimum (here minimum) value will be selected as the final branch metric λ.sub.k (0,0) in accordance with Equation (2) above. The index j of this minimum distance value d.sub.k.sup.j(0, 0) is thus determined by the index j of the hypothesized sample value {tilde over (y)}.sub.k.sup.j(0,0) to which z.sub.k is closest. It can be seen from
(39)
The optimum branch metric λ.sub.k (0, 0) can thus be determined by solving these equations simultaneously.
(40) Similarly, when −1<h.sub.1<0, the intervals of the hypothesized sample values {tilde over (y)}.sub.k.sup.j(0, 0) are as follows:
{tilde over (y)}.sub.k.sup.1(0,0)∈((0)−
(2),
(0));
{tilde over (y)}.sub.k.sup.0(0,0)∈((0),0);
{tilde over (y)}.sub.k.sup.3(0,0)∈(0,(2));
{tilde over (y)}.sub.k.sup.2(0,0)∈((2),
(2)−
(0));
(41) Therefore, the hypothesized sample values can be ordered as follows:
{tilde over (y)}.sub.k.sup.1(0,0)<{tilde over (y)}.sub.k.sup.0(0,0)<{tilde over (y)}.sub.k.sup.3(0,0)<{tilde over (y)}.sub.k.sup.2(0,0).
(42) The thresholds θ(0), θ(1) and θ(2) calculated for this ordering of the hypothesized values are the same as in Equation set (6) above. Hence, the optimum branch metric λ.sub.k (0, 0) for −1<h.sub.1<0 is given by:
(43)
(44)
(45) The sub-BMU 20 also includes comparison logic comprising three comparators, indicated generally at 24, connected in parallel with the distance calculation logic 21. A first input of each comparator receives a respective threshold value θ(0), θ(1) or θ(2). The second input of each comparator receives the input sample z.sub.k. Each comparator produces a 1-bit output whose value indicates whether or not the input sample z.sub.k exceeds the respective threshold. The three comparator output bits collectively constitute a select signal, indicating the result of the threshold comparison, which is supplied to a control input of multiplexer 23. The 3-bits of the select signal map to d.sub.k.sup.argmin in Equation (3) above, i.e. the index j of the minimum distance value d.sub.k.sup.j (0, 0) to be selected as the optimum branch metric λ.sub.k(0, 0). This mapping, defined by Equation set (7) or (8) above, depends on whether the coefficient h.sub.1 is positive or negative. Hence, a 1-bit signal sgn(h.sub.1), indicating the sign of h.sub.1, is supplied to a further control input of multiplexer 23 as indicated. Based on these control inputs, multiplexer 23 selects the optimum distance value d.sub.k.sup.j(0, 0) input from distance calculators 21 and outputs this value as the branch metric λ.sub.k (0, 0).
(46) The following example illustrates operation of sub-BMU 20 for the 4-PAM signal constellation ={−3, 1, 1, 3} with a discrete-time channel impulse-response sequence h=(1, 0.6). The hypothesized input values are:
{tilde over (y)}.sub.k.sup.0(0,0)=−3−3h.sub.1=−4.8;
{tilde over (y)}.sub.k.sup.1(0,0)=−3+h.sub.1=−2.4;
{tilde over (y)}.sub.k.sup.2(0,0)=1−3h.sub.1=−0.8;
{tilde over (y)}.sub.k.sup.3(0,0)=1+h.sub.1=1.6.
(47) The distance values are:
d.sub.k.sup.0(0,0)=|z.sub.k+4.8|;
d.sub.k.sup.1(0,0)=|z.sub.k+2.4|;
d.sub.k.sup.2(0,0)=|z.sub.k+0.8|;
d.sub.k.sup.3(0,0)=|z.sub.k−1.6|.
(48) The thresholds are:
(49)
(50) The branch metric is calculated as:
(51)
(52) It can be seen from
(53) The BMU 2 contains three further sub-BMUs for calculating the branch metrics λ.sub.k(0, 1), λ.sub.k(1, 0) and λ.sub.k(1, 1). The structure and operation of these sub-BMUs corresponds directly to that of the
(54) Sub-BMU for λ.sub.k(0,1)
(55) Hypothesized sample values:
{tilde over (y)}.sub.k.sup.0(0,1)=(1)+h.sub.1
(0);
{tilde over (y)}.sub.k.sup.1(0,1)=(1)+h.sub.1
(2);
{tilde over (y)}.sub.k.sup.2(0,1)=(3)+h.sub.1
(0);
{tilde over (y)}.sub.k.sup.3(0,1)=(3)+h.sub.1
(2).
(56) Ordering of hypothesized sample values:
{tilde over (y)}.sub.k.sup.0(0,1)<{tilde over (y)}.sub.k.sup.1(0,1)<{tilde over (y)}.sub.k.sup.2(0,1)<{tilde over (y)}.sub.k.sup.3(0,1)
(57) Thresholds:
(58)
(59) Branch metric selection:
(60)
Sub-BMU for λ.sub.k(1, 0)
(61) Hypothesized sample values:
{tilde over (y)}.sub.k.sup.0(1,0)=(0)+h.sub.1
(1);
{tilde over (y)}.sub.k.sup.1(1,0)=(0)+h.sub.1
(3);
{tilde over (y)}.sub.k.sup.2(1,0)=(2)+h.sub.1
(1);
{tilde over (y)}.sub.k.sup.3(1,0)=(2)+h.sub.1
(3).
(62) Ordering of hypothesized sample values:
{tilde over (y)}.sub.k.sup.0(1,0)<{tilde over (y)}.sub.k.sup.1(1,0)<{tilde over (y)}.sub.k.sup.2(1,0)<{tilde over (y)}.sub.k.sup.3(1,0)
(63) Thresholds:
(64)
(65) Branch metric selection:
(66)
Sub-BMU for λ.sub.k(1, 1)
(67) Hypothesized sample values:
{tilde over (y)}.sub.k.sup.0(1,1)=(1)+h.sub.1
(1);
{tilde over (y)}.sub.k.sup.1(1,1)=(1)+h.sub.1
(3);
{tilde over (y)}.sub.k.sup.2(1,1)=(3)+h.sub.1
(1);
{tilde over (y)}.sub.k.sup.3(1,1)=(3)+h.sub.1
(3).
(68) Ordering of hypothesized sample values:
{tilde over (y)}.sub.k.sup.0(1,1)<{tilde over (y)}.sub.k.sup.1(1,1)<{tilde over (y)}.sub.k.sup.2(1,1)<{tilde over (y)}.sub.k.sup.3(1,1)
(69) Thresholds:
(70)
(71) Branch metric selection:
(72)
(73) With four sub-BMU's as described above, a total of twelve comparators are eliminated from the 2-state 4-PAM RSSD compared to a conventional implementation, significantly reducing both the implementation complexity and power consumption. In general, implementation complexity and power consumption increase with both number of states in the trellis and number of time steps for which the BMU calculates branch metrics in parallel. An N-step BMU contains N parallel BMU units which calculate the branch metrics for N input samples z.sub.k in parallel. For a 2-step 2-state RSSD, for instance, the implementation complexity of the BMU is at least doubled compared to the 1-step 2-state embodiment above, whereby 24 comparators can be eliminated. Typically, 1≤N≤16 for a 2-substate 4-PAM Viterbi detector, and the saving increases dramatically with higher values of N.
(74) In the
(75) In the second embodiment, the sequence detector 1 is a full-state detector and the BMU 2, PMU 3 and SMU 4 collectively implement an eight-state 4-D 5-PAM Viterbi detector. The 5-PAM signal constellation ={−2, 1, 0, 1, 2} contains five symbols:
(0)=−2,
(1)=−1,
(2)=0,
(3)=1 and
(4)=2. With this 4-D transmission scheme, four 1-D symbols u.sub.k are transmitted in parallel and a 4-D sample, consisting of four 1-D samples z.sub.k, is received at the detector input. The signal constellation
is partitioned into two subsets
.sub.0={
(0),
(2),
(4)}, and
.sub.1={
(1),
(3)}. This results in 16 different 4-D subsets {(
.sub.0,
.sub.0,
.sub.0,
.sub.0), (
.sub.0,
.sub.0,
.sub.0,
.sub.1), . . . , (
.sub.1,
.sub.1,
.sub.1,
.sub.1)}. By uniting a 4-D subset and its complement, e.g., (
.sub.0,
.sub.1,
.sub.0,
.sub.1) and (
.sub.1,
.sub.0,
.sub.1,
.sub.0), eight new 4-D subsets {s.sub.0, s.sub.1, . . . , s.sub.7} are obtained such that the 4-D intrasubset Euclidean distance remains constant. The radix-4 trellis diagram for this embodiment has eight states χ=0 to 7 as shown in
(76) The 4-D branch metrics for each trellis transition are obtained by first calculating 1-D branch metrics for each of the four 1-D samples z.sub.k(l), where l∈{0, 1, 2, 3} denotes dimension, supplied to the detector input. In this embodiment, steps 10 to 13 of the
(77) Each 1-D sample z.sub.k (l) corresponds to a 1-D symbol u.sub.k in either subset .sub.0 or subset
.sub.1. The BMU calculates a 1-D branch metric for each sample z.sub.k (l) and each subset
.sub.0 and
.sub.1. The 1-D branch metric for subset
.sub.0 and dimension l is denoted here by λ.sub.k(
.sub.0, l). The 1-D branch metric for subset
.sub.1 and dimension l is denoted by λ.sub.k(
.sub.1, l). The BMU comprises four sub-BMUs, one for each dimension 1, each containing component units shown in
(78) .sub.0, l). The unit 30 comprises three distance calculators, indicated generally at 31, a multiplexer 32, and two comparators indicated generally at 33. The distance calculators 31 receive respective hypothesized sample values {tilde over (y)}.sub.k.sup.j, j∈{0, 1, 2}. In this example with h=(1), the hypothesized sample values {tilde over (y)}.sub.k.sup.0, {tilde over (y)}.sub.k.sup.1 and {tilde over (y)}.sub.k.sup.2 correspond respectively to the three symbols
(0),
(2) and
(4) in subset
.sub.0. Each distance calculator also receives the current sample z.sub.k (l) and calculates a respective distance value d.sub.k.sup.j indicating distance of z.sub.k (l) from {tilde over (y)}.sub.k.sup.j as described above. The distance values d.sub.k.sup.j are output to multiplexer 32. Comparators 33, connected in parallel with the distance calculators, receive respective threshold values θ(0), θ(1). The thresholds θ(0) and θ(1) are defined as halfway between respective pairs of hypothesized values {tilde over (y)}.sub.k.sup.j arranged in value order: {tilde over (y)}.sub.k.sup.0<{tilde over (y)}.sub.k.sup.1<{tilde over (y)}.sub.k.sup.2. In this example, θ(0)=
(1) and θ(1)=
(3). The second input of each comparator receives the input sample z.sub.k (l). Each comparator compares the sample z.sub.k (l) with the corresponding threshold. The two comparator output bits collectively constitute a select signal, indicating the result of the threshold comparison, which is supplied to a control input of multiplexer 32. The 2-bits of the select signal indicate the optimum (here minimum) distance value d.sub.k.sup.j to be selected as λ.sub.k(
.sub.0, l) according to:
(79)
(80) .sub.1, l). The unit 35 comprises two distance calculators 36, a multiplexer 37 and a control input 38. The distance calculators 36 receive respective hypothesized sample values {tilde over (y)}.sub.k.sup.j, j∈{0,1}. In this example, the hypothesized sample values {tilde over (y)}.sub.k.sup.0, and {tilde over (y)}.sub.k.sup.1 correspond respectively to the symbols
(1) and
(3) in subset
.sub.1. Each distance calculator calculates a respective distance value d.sub.k.sup.j indicating distance of the sample z.sub.k (l) from {tilde over (y)}.sub.k.sup.j. The distance values d.sub.k.sup.0 and d.sub.k.sup.1 are output to multiplexer 37. For this unit, there is a single threshold, θ(0)=
(2)=0, halfway between the two hypothesized values {tilde over (y)}.sub.k.sup.0=
(1), and {tilde over (y)}.sub.k.sup.1=
(3). Here therefore, the result of the threshold comparison depends on whether z.sub.k (l)>0 or z.sub.k (l)<0. The comparison logic can therefore be implemented simply by extracting the sign bit, denoted here by z.sub.k,0(l), of sample z.sub.k (l). The sign bit by z.sub.k,0(l) is supplied on control input 38 to multiplexer 37. The sign bit z.sub.k,0(l) determines the optimum (here minimum) distance value d.sub.k.sup.j to be selected as λ.sub.k(
.sub.1, l) according to:
(81)
(82) The 4-D branch metrics for transitions in the .sub.0, l) and λ.sub.k (
.sub.1, l) for each dimension and adding the four selected values in each case.
(83) The propagation delay of sub-BMU unit 30 is the sum of that of a distance calculator and a 3-to-1 multiplexer. Similarly, the propagation delay of sub-BMU unit 35 is the sum of that of a distance calculator and a 2-to-1 multiplexer. By comparison, corresponding sub-units for calculating λ.sub.k (.sub.0, l) and λ.sub.k(
.sub.1, l) in a conventional BMU are shown in
(84) In the 1-step 8-state 4-D 5-PAM detector with channel time-dispersion length |h|=1 above, there are four sub-BMU's so a total of eight comparators are eliminated compared to a conventional implementation. The principles described can be readily applied for BMU operation with a channel time-dispersion length |h|>1, resulting in even greater savings. For example, with |h|=2, the BMU needs 32 sub-BMUs whereby 64 comparators are eliminated. Implementation complexity and power consumption increase with the number of time steps for which the BMU calculates branch metrics as well as with the number of states. For a 2-step 8-state 4-D 5-PAM Viterbi detector with |h|=1, for example, the BMU complexity is at least quadrupled compared to the 1-step embodiment above, whereby 32 comparators can be eliminated. Typically, 1≤N≤4 for a N-step 8-state 4-D 5-PAM Viterbi detector, and the saving increases substantially with N.
(85) While a full-state Viterbi detector is described for the second embodiment above, the detector could be a reduced-state detector in the presence of intersymbol interference (ISI). In this case, the ISI attributable to the channel coefficients can be suppressed by embedded per-survivor decision feedback to avoid expanding the number of detector states.
(86) Numerous changes and modifications can of course be made to the exemplary embodiments described. For example, while the smallest distance value is selected as the optimum branch metric above, embodiments may be envisaged where the largest distance value is selected as optimum. Other difference metrics, such as squared Euclidean distance, may be used for calculating the distance values. Embodiments may also be envisaged where a threshold θ used for level discrimination is not halfway between a pair of hypothesized sample values. Where the signal constellation does not contain equiprobable symbols, for example, thresholds may be adapted to accommodate different symbol probabilities.
(87) The branch metric calculation method can be applied with any number of encoder and/or channel states, and with or without a coded or coded-modulation scheme, such as trellis-coded modulation. When a transmission scheme with many encoder/channel states is adopted (such as that in the IEEE 802.3ab standard), or when an architecture based on the sliding-block or systolic-array approach is chosen, the advantages described have a huge impact on overall efficiency of the detector.
(88) The descriptions of the various embodiments of the present invention have been presented for purposes of illustration, but are not intended to be exhaustive or limited to the embodiments disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments. The terminology used herein was chosen to best explain the principles of the embodiments, the practical application or technical improvement over technologies found in the marketplace, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein.