Coding Blocks of Pixels
20230051125 · 2023-02-16
Inventors
Cpc classification
H04N19/90
ELECTRICITY
H04N19/635
ELECTRICITY
International classification
Abstract
A method and decoding unit for decoding a compressed data structure that encodes a set of Haar coefficients for a 2×2 quad of pixels of a block of pixels. The set of Haar coefficients comprises a plurality of differential coefficients and an average coefficient. A first portion of the compressed data structure encodes the differential coefficients for the 2×2 quad of pixels. A second portion of the compressed data structure encodes the average coefficient for the 2×2 quad of pixels. The first portion of the compressed data structure is used to determine signs and exponents differential coefficients which are non-zero. The second portion of the compressed data structure is used to determine a representation of the average coefficient. The result of a weighted sum of the differential coefficients and the average coefficient for the 2×2 quad of pixels is determined using: (i) the determined signs and exponents for the differential coefficients which are non-zero, (ii) the determined representation of the average coefficient, and (iii) respective weights for the differential coefficients. The determined result is used to determine the decoded value. The determined decoded value is outputted.
Claims
1. A method of decoding a compressed data structure to determine a decoded value, wherein the compressed data structure encodes a set of Haar coefficients for a 2×2 quad of pixels of a block of pixels, wherein the set of Haar coefficients comprises a plurality of differential coefficients and an average coefficient, wherein a first portion of the compressed data structure encodes the differential coefficients for the 2×2 quad of pixels and comprises: (i) a first set of bits which indicates an order of the magnitudes of the differential coefficients, and (ii) a second set of bits which indicates a sign and an exponent for each of one or more of the differential coefficients which are non-zero, wherein a second portion of the compressed data structure encodes the average coefficient for the 2×2 quad of pixels, wherein the method comprises: using the first portion of the compressed data structure to determine signs and exponents for said one or more of the differential coefficients which are non-zero; using the second portion of the compressed data structure to determine a representation of the average coefficient; determining the result of a weighted sum of the differential coefficients and the average coefficient for the 2×2 quad of pixels using: (i) the determined signs and exponents for said one or more of the differential coefficients which are non-zero, (ii) the determined representation of the average coefficient, and (iii) respective weights for the differential coefficients; using the determined result to determine the decoded value; and outputting the determined decoded value.
2. The method of claim 1, wherein said determining the result of a weighted sum of the differential coefficients and the average coefficient for the 2×2 quad of pixels comprises: performing a binary shift of the weights in accordance with the determined exponents for the respective one or more of the differential coefficients which are non-zero; and summing the determined representation of the average coefficient and the shifted weights in accordance with the determined signs for the respective one or more of the differential coefficients which are non-zero.
3. The method of claim 1, wherein said determining the result of a weighted sum of the differential coefficients and the average coefficient for the 2×2 quad of pixels does not involve performing any multiplication or division operations and does comprise adding a gamma correction term.
4. The method of claim 1, wherein the respective weights for the differential coefficients are predetermined, and wherein said using the determined result to determine the decoded value comprises using said determined result as the decoded value representing a value at a pixel position of the 2×2 quad of pixels and wherein the steps of determining the result of a weighted sum of the differential coefficients and the average coefficient for the 2×2 quad of pixels and using the determined result to determine the decoded value are performed four times for the respective four pixels of the 2×2 quad of pixels, wherein the predetermined weights for the differential coefficients are different in each of the times that said steps are performed so that four decoded values representing values at the four pixel positions of the 2×2 quad of pixels are determined.
5. The method of claim 1, further comprising determining the respective weights for the differential coefficients; wherein the weights are determined based on one or more of: a position, relative to the 2×2 quad of pixels, that the decoded value is to be determined; an indication of a level of detail to be provided by the decoded value; and an indication of an amount and/or direction of anisotropy to be provided by the decoded value.
6. The method of claim 5, wherein the block of pixels is a 4×4 block of pixels comprising four 2×2 quads of pixels, wherein for each of the four 2×2 quads of pixels a result of a weighted sum of the differential coefficients and the average coefficient for that 2×2 quad of pixels is determined, and wherein the four determined results for the respective four 2×2 quads of pixels are combined to determine the decoded value for the block of pixels.
7. The method of claim 6, wherein the four determined results for the respective four 2×2 quads of pixels are combined to determine the decoded value for the block of pixels by performing bilinear interpolation on the four determined results.
8. The method of claim 6, wherein if a position at which the decoded value is to be determined is within an area that is representable by considering the pixels of a single one of the 2×2 quads of pixels in the block of pixels then the decoded value for the block of pixels is equal to the determined result for that single 2×2 quad of pixels, and if a position at which the decoded value is to be determined is not within an area that is representable by considering the pixels of a single one of the 2×2 quads of pixels in the block of pixels then the decoded value for the block of pixels is the result of a weighted sum of two or more of the determined results for the 2×2 quads of pixels.
9. The method of claim 6, wherein said determining the respective weights for the differential coefficients comprises: determining initial weights based on an indication of a level of detail to be provided by the decoded value, an indication of an amount of anisotropy and an indication of a direction of anisotropy; and multiplying the determined initial weights by respective transfer functions to determine the weights for the differential coefficients, wherein each of the transfer functions is based on a position, relative to the 2×2 quad of pixels, that the decoded value is to be determined, and wherein each of the transfer functions: (i) is zero at the centre of the 2×2 quad of pixels, (ii) has a magnitude of one at each of the pixel positions of the 2×2 quad of pixels, and (iii) is extrapolated out to the edges of the area represented by the 2×2 quad of pixels.
10. The method of claim 9, wherein beyond edges of the area represented by a 2×2 quad of pixels, the transfer functions are clamped to their values on the nearest edge of said area.
11. The method of claim 6, wherein the pixels within the 4×4 block of pixels are selectively mirrored about neither, one or both of: (i) a horizontal axis passing through the centre of the block of pixels, and (ii) a vertical axis passing through the centre of the block of pixels, so that the position at which the decoded value is to be determined is within a particular quadrant of the block of pixels.
12. The method of claim 1, wherein the second set of bits comprises: a first section which indicates a sign for each of said one or more of the differential coefficients which are non-zero; and a second section which indicates an exponent for each of said one or more of the differential coefficients which are non-zero, or the second set of bits comprises a single section which indicates a sign and an exponent for each of the one or more of the differential coefficients which are non-zero.
13. The method of claim 12, when the second set of bits comprises the first section and the second section, wherein if one or more of the differential coefficients are zero, the first section indicates a mantissa bit for the differential coefficient which has the largest magnitude, wherein said determining the result of a weighted sum of the differential coefficients and the average coefficient for the 2×2 quad of pixels comprises including two terms in the weighted sum for the differential coefficient which has the largest magnitude, the two terms being: (i) the weight for that differential coefficient shifted in accordance with the exponent for that differential coefficient, and (ii) the weight for that differential coefficient shifted in accordance with an amount that is one less than the exponent for that differential coefficient.
14. The method of claim 1, wherein each pixel in the block of pixels has a value in a plurality of channels, and wherein the encoded differential coefficients of the Haar coefficients for the 2×2 quad of pixels in the compressed data structure represent a combination of differential channel coefficients determined for the respective channels, wherein said using the first portion of the compressed data structure to determine signs and exponents for said one or more of the differential coefficients which are non-zero comprises determining signs and exponents for one or more differential channel coefficients for each of the channels.
15. The method of claim 14, wherein a third portion of the compressed data structure indicates a combining mode, from a set of predetermined combining modes, according to which the differential channel coefficients determined for the channels are combined to determine the encoded differential coefficients of the Haar coefficients for the 2×2 quad of pixels, and wherein the method comprises identifying the combining mode using the third portion of the compressed data structure and using the identified combining mode to determine the differential channel coefficients for each of the channels.
16. The method of claim 1, wherein the block of pixels is a 4×4 block of pixels comprising four 2×2 quads of pixels, wherein the compressed data structure comprises a first portion for each of the 2×2 quads of pixels which is determined using differential coefficients for that 2×2 quad of pixels; wherein the second portion of the compressed data structure encodes a set of L1 Haar coefficients for a 2×2 quad of coefficients represented by the average coefficients for each of the 2×2 quads of pixels, wherein the set of L1 Haar coefficients comprises a plurality of L1 differential coefficients and an L1 average coefficient, wherein a first sub-portion of the second portion of the compressed data structure encodes the L1 differential coefficients for the 2×2 quad of coefficients and comprises: (i) a first set of bits which indicates an order of the magnitudes of the L1 differential coefficients, and (ii) a second set of bits which indicates a sign and an exponent for each of one or more of the L1 differential coefficients which are non-zero, wherein a second sub-portion of the second portion of the compressed data structure encodes the L1 average coefficient for the 2×2 quad of coefficients, wherein the method comprises: using the first portion of the compressed data structure for each of the 2×2 quads of pixels to determine signs and exponents for one or more of the differential coefficients which are non-zero for each of the 2×2 quads of pixels; and using the second portion of the compressed data structure to determine representations of the average coefficients for each of the 2×2 quads of pixels by: using the first sub-portion of the second portion of the compressed data structure to determine signs and exponents for said one or more of the L1 differential coefficients which are non-zero; using the second sub-portion of the second portion of the compressed data structure to determine a representation of the L1 average coefficient for the 2×2 quad of coefficients; and determining the result of a weighted sum of the L1 differential coefficients and the L1 average coefficient for the 2×2 quad of coefficients using: (i) the determined signs and exponents for said one or more of the L1 differential coefficients which are non-zero, (ii) the determined representation of the L1 average coefficient, and (iii) respective weights for the L1 differential coefficients; wherein, for each of the 2×2 quads of pixels, the respective weights for the L1 differential coefficients are set such that the determined result of the weighted sum provides the determined representation of the average coefficient for that 2×2 quad of pixels.
17. The method of claim 1, wherein said using the determined result to determine the decoded value comprises: using the determined result to determine a delta value; and adding a predicted value to the delta value to determine the decoded value for the block of pixels, wherein the predicted value is determined based on one or more neighbouring blocks of pixels.
18. A decoding unit configured to decode a compressed data structure to determine a decoded value, wherein the compressed data structure encodes a set of Haar coefficients for a 2×2 quad of pixels of a block of pixels, wherein the set of Haar coefficients comprises a plurality of differential coefficients and an average coefficient; wherein a first portion of the compressed data structure encodes the differential coefficients for the 2×2 quad of pixels and comprises: (i) a first set of bits which indicates an order of the magnitudes of the differential coefficients, and (ii) a second set of bits which indicates a sign and an exponent for each of one or more of the differential coefficients which are non-zero; wherein a second portion of the compressed data structure encodes the average coefficient for the 2×2 quad of pixels; wherein the decoding unit is configured to: use the first portion of the compressed data structure to determine signs and exponents for said one or more of the differential coefficients which are non-zero; use the second portion of the compressed data structure to determine a representation of the average coefficient; determine the result of a weighted sum of the differential coefficients and the average coefficient for the 2×2 quad of pixels using: (i) the determined signs and exponents for said one or more of the differential coefficients which are non-zero, (ii) the determined representation of the average coefficient, and (iii) respective weights for the differential coefficients; use the determined result to determine the decoded value; and output the determined decoded value.
19. The decoding unit according to claim 18, further comprising a filter parameter setup module configured to determine the respective weights for differential coefficients; wherein the block of pixels is a 4×4 block of pixels comprising four 2×2 quads of pixels, and the decoder unit further comprises: four decoder instances, each of which is configured to determine, for a respective one of the four 2×2 quads of pixels, a result of a weighted sum of the differential coefficients and the average coefficient for that 2×2 quad of pixels; and a bilinear interpolation unit configured to combine the four determined results for the respective four 2×2 quads of pixels to determine the decoded value for the block of pixels by performing bilinear interpolation on the four determined results.
20. A non-transitory computer readable storage medium having stored thereon an integrated circuit dataset definition that, when processed in an integrated circuit manufacturing system, configures the integrated circuit manufacturing system to manufacture a decoding unit which is configured to decode a compressed data structure to determine a decoded value, wherein the compressed data structure encodes a set of Haar coefficients for a 2×2 quad of pixels of a block of pixels, wherein the set of Haar coefficients comprises a plurality of differential coefficients and an average coefficient, wherein a first portion of the compressed data structure encodes the differential coefficients for the 2×2 quad of pixels and comprises (i) a first set of bits which indicates an order of the magnitudes of the differential coefficients, and (ii) a second set of bits which indicates a sign and an exponent for each of one or more of the differential coefficients which are non-zero; wherein a second portion of the compressed data structure encodes the average coefficient for the 2×2 quad of pixels, wherein the decoding unit is configured to: use the first portion of the compressed data structure to determine signs and exponents for said one or more of the differential coefficients which are non-zero; use the second portion of the compressed data structure to determine a representation of the average coefficient; determine the result of a weighted sum of the differential coefficients and the average coefficient for the 2×2 quad of pixels using: (i) the determined signs and exponents for said one or more of the differential coefficients which are non-zero, (ii) the determined representation of the average coefficient, and (iii) respective weights for the differential coefficients; use the determined result to determine the decoded value; and output the determined decoded value.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0102] Examples will now be described in detail with reference to the accompanying drawings in which:
[0103]
[0104]
[0105]
[0106]
[0107]
[0108]
[0109]
[0110]
[0111]
[0112]
[0113]
[0114]
[0115]
[0116]
[0117]
[0118]
[0119]
[0120]
[0121]
[0122]
[0123]
[0124]
[0125]
[0126]
[0127]
[0128] The accompanying drawings illustrate various examples. The skilled person will appreciate that the illustrated element boundaries (e.g., boxes, groups of boxes, or other shapes) in the drawings represent one example of the boundaries. It may be that in some examples, one element may be designed as multiple elements or that multiple elements may be designed as one element. Common reference numerals are used throughout the figures, where appropriate, to indicate similar features.
DETAILED DESCRIPTION
[0129] The following description is presented by way of example to enable a person skilled in the art to make and use the invention. The present invention is not limited to the embodiments described herein and various modifications to the disclosed embodiments will be apparent to those skilled in the art.
[0130] Embodiments will now be described by way of example only. The examples described herein provide an encoding unit for encoding a block of pixels (e.g. representing a portion of a texture) using Haar coefficients. Similarly, a decoding unit is provided for decoding a compressed data structure to determine a block of pixels using Haar coefficients. The examples described herein can cheaply decompose images into downscaled images with directional derivatives (e.g. Haar wavelet coefficients). The compression format is designed to reduce the complexity of reconstructing an image whereby the wavelet coefficients are independently weighted according to an algorithm that serves as an implementation of a parametric texture filter that can deliver a range of isotropic and anisotropic filtering automatically as part of the decoding process. Furthermore, the Haar coefficients are stored using indications of their signs and exponents, so that performing a weighted sum of the Haar coefficients reduces from a ‘multiply and add’ operation to a ‘shift and add’ operation, where the weights are shifted by the exponents of the Haar coefficients. The encoding format is simple enough to allow rapid online compression of a small footprint of pixels.
[0131] Examples described herein relate to encoding a block of pixels. The block of pixels may be part of a texture, where the “pixels” are the “texels” of the texture. In some other examples, the block of pixels may be part of an image that is being rendered by a graphics processing system, so that the image can be compressed before being written out to a memory.
[0132] Examples are described herein in which a block of pixels (e.g. a 2×2 block or a 4×4 block to give just two examples) are encoded into a compressed data structure. Different implementations may achieve different compression ratios, e.g. some examples achieve a 50% compression ratio whereby the amount of data in the compressed data structure is half that of the block of pixels, and some examples achieve a 25% compression ratio whereby the amount of data in the compressed data structure is a quarter of that of the block of pixels.
[0133] The encoding methods described herein can be implemented efficiently (e.g. in terms of processing latency, power consumption and/or silicon area). Furthermore, the encoding methods encode the data in the compressed data structure in a format such that the decoding of the compressed data structure to determine a decoded block of pixels can be performed efficiently (e.g. in terms of processing latency, power consumption and/or silicon area). In particular, the encoded data indicates signed exponent values (i.e. values which are powers of 2), and the decoding process involves performing a weighted sum of the signed exponent values. Since the signed exponent values are powers of 2, the multiplication of the signed exponent values by respective weights can be implemented with respective binary shifts of the weights, wherein the directions and magnitudes of the shifts are represented by the sign and the exponent of the signed exponents. Binary shift operations are relatively cheap to implement compared to multiplication operations (e.g. in terms of processing latency, power consumption and/or silicon area).
[0134] Furthermore, the encoding methods described herein involve determining Haar coefficients for a 2×2 quad of pixels and encoding those Haar coefficients. As described in more detail below, the Haar coefficients for a 2×2 quad of pixels comprise three directional differential coefficients (dx, dy and dxy) and an average coefficient (dc). The directional differential coefficients allow different levels of detail to be represented in different directions when the compressed data structure is decoded. The decoding process involves representing a decoded block of pixels as the dc value+some differential term, which is determined by performing a weighted sum of the dx, dy and dxy coefficients. Just representing a quad as the dc value effectively blurs the whole quad so that details at a spatial frequency higher than that of the quad are lost, but then detail can be added back in using the differential terms, and by selecting the weights of the different differential terms, different levels of detail can be added in different directions. For example, if, in addition to the average coefficient (dc), the horizontal differential coefficient (dx) is used to decode a 2×2 quad of pixels (by setting its weight to a non-zero value) but the vertical and diagonal differential coefficients (dy and dxy) are not used to decode the 2×2 quad of pixels (by setting their weights to zero) then the level of detail in the decoded block of pixels in the horizontal direction will be twice that in the vertical direction. In this example, the decoding process implements anisotropic filtering with an anisotropic ratio of 2 and with the direction of anisotropy along the vertical direction (since the maximum blurring occurs along that axis). By altering the weights applied to the dx, dy and dxy differential coefficients, the anisotropic ratio and the direction of anisotropy can be varied. In this way, the encoding/decoding process functions as a parametric texture filter, whereby different parameters (e.g. a desired anisotropic ratio and direction of anisotropy) can be supplied to the decoding unit to apply different types of filtering when decoding a compressed data structure. As another example, if the weights applied to dx and dy are equal magnitude and the weight applied to dxy is equal to the product of the weights applied to dx and dy then the decoding unit will operate as an isotropic filter. In particular, when the weights applied to dx, dy and dxy all have unit magnitude and the sign of the weight applied to dxy is the sign of the product of the weights applied to dx and dy, then the decoder will reconstruct one of the original 4 pixels in the 2×2 quad i.e. the top left pixel will be recovered if the weights applied to dx and dy are both −1 (with a weight of +1 for dxy), the top right pixel will be recovered if the weight applied to dx is +1, but the weight applied to dy is −1 (with a weight of −1 for dxy), the bottom left pixel will be recovered if the weight applied to dx is −1, but the weight applied to dy is +1 (with a weight of −1 for dxy) and the bottom right pixel will be recovered if the weights applied to dx and dy are both+1 (with a weight of +1 for dxy), These cases correspond to an isotropic parametric texture filter with parametric Level of Detail (LOD) of zero with sample locations falling on the pixels themselves.
[0135]
[0136]
[0137] The block of pixel values are received at the Haar coefficient determining module 206 of the encoding unit 202. In step S302 the Haar coefficient determining module 206 (in particular, the transform module 208 in this example) determines a set of Haar coefficients for the 2×2 quad of pixels. As described above, the set of Haar coefficients comprises a plurality of differential coefficients (dx, dy and dxy) and an average coefficient (dc).
[0138]
wherein the value of dx may be rounded, e.g. to the nearest representable number in the format of the pixel values or to said format, extended by one or more bits of precision (for accuracy). The horizontal differential coefficient (dx) is represented as 404 in
wherein the value of dy may be rounded, e.g. to the nearest representable number in the format of the pixel values or to said format, extended by one or more bits of precision (for accuracy). The vertical differential coefficient (dy) is represented as 406 in
wherein the value of dxy may be rounded, e.g. to the nearest representable number in the format of the pixel values or to said format, extended by one or more bits of precision (for accuracy). It can be thought of as a derivative first in x and then in y (hence the notation), or vice-versa, and as a second order term, does not have a unique direction associated with it (but since it still plays a role in the directional filtering described herein, we refer to it elsewhere in this description as a directional derivative). The dxy differential coefficient (dxy) is represented as 408 in
wherein the value of dc may be rounded, e.g. to the nearest representable number in the format of the pixel values. The average coefficient (dc) is represented as 410 in
[0139]
[0140] The coefficient encoding module 210 determines (in steps S304 and S306) the first portion 504 of the compressed data structure using the differential coefficients (dx, dy and dxy) determined for the 2×2 quad of pixels.
[0141] In particular, in step S304 the coefficient encoding module 210 determines the first set of bits 506.sub.1 of the first portion of the compressed data structure, wherein these bits 506.sub.1 (which are denoted “encOrder” in
[0142] The first set of bits 506.sub.1 further indicates how many of the differential coefficients (dx, dy and dxy) are non-zero. In the example shown in
[0143] If the three differential coefficients are non-zero, and if they are not all of equal magnitude, then there are six permutations of the ordering of the magnitudes of the coefficients, and these are represented with the following encOrder bits:
TABLE-US-00001 encOrder first (i.e. largest) second (i.e. middle) third (i.e. smallest) bits 506.sub.1 coefficient coefficient coefficient 0000 dx dy dxy 0001 dy dx dxy 0010 dxy dx dy 0011 dx dxy dy 0100 dy dxy dx 0101 dxy dy dx
[0144] If two of the three differential coefficients are non-zero and one is zero, then there are six permutations of the ordering of the magnitudes of the coefficients, and these are represented with the following encOrder bits:
TABLE-US-00002 encOrder first (i.e. largest) second (i.e. smallest bits 506.sub.1 coefficient non-zero) coefficient 0110 Dx dy 0111 Dy dx 1000 Dxy dx 1001 Dx dxy 1010 Dy dxy 1011 Dxy dy
[0145] It should be noted that while other orderings of permutations are possible, in this example, the integer “encOrder” describes a cyclic rotation of the coefficients with an additional pairwise transposition of the smaller (including zero) elements indicated by the least significant bit, which as such, indicates the parity of the permutation. This structure can simplify the encoder and/or decoder logic.
[0146] If one of the three differential coefficients are non-zero and two are zero, then there are three permutations of the ordering of the magnitudes of the coefficients, and these are represented with the following encOrder bits (note that, in this example, these encodings extend the cyclic ordering of coefficients):
TABLE-US-00003 encOrder first (i.e. largest, and only bits 506.sub.1 non-zero) coefficient 1100 dx 1101 dy 1110 dxy
[0147] If all of the three differential coefficients are of equal magnitude (including if they are all zero), then there is one permutation of the ordering of the magnitudes of the coefficients, and this is represented with the following encOrder bits:
TABLE-US-00004 encOrder bits 506.sub.1 first coefficient second coefficient third coefficient 1111 Dx dy dxy
[0148] The differential coefficients are considered to be in a floating point format. If the differential coefficients were to be represented without loss of data, each differential coefficient would be represented with a sign bit, one or more exponent bits and one or more mantissa bits. In step S306 the coefficient encoding module 210 determines the second set of bits 506.sub.2 of the first portion of the compressed data structure, wherein these bits 506.sub.2 indicate a sign and an exponent for each of one or more of the differential coefficients which are non-zero. As shown in
[0149] The signs and the exponents are the most important pieces of information to maintain in the compressed data structure, i.e. they are more important than the mantissa bits. As described below, in some situations, e.g. if the differential coefficients are non-zero and are not all equal, the compressed data structure does not indicate any mantissa bits for the differential coefficients. In other situations, e.g. if one or more of the differential coefficients are zero, the first section 507.sub.1 of the second set of bits 506.sub.2 indicates one or more mantissa bits for the differential coefficient which has the largest magnitude. Referring to the earlier discussion, when there are fewer than maximum non-zero terms with which to perform a weighted sum, additional precision can be accommodated without increasing the arithmetic complexity (e.g. a floating point term with one bit of mantissa can be written as the sum of two signed exponent terms).
[0150] The encoding of the second set of bits 506.sub.2 depends on the number of non-zero differential coefficients (as indicated by the first set of bits 506.sub.1 (“encOrder”)).
[0151] When there are three non-zero differential coefficients and they are not all equal in magnitude (e.g. if “encOrder” is between 0000 and 0101), the first section 507.sub.1 of the second set of bits 506.sub.2 has three bits which respectively indicate the signs of the three differential coefficients, and the second section 507.sub.2, which in this example has four bits, indicates one of a set of possible combinations of exponents for the three differential coefficients. In the example shown below, in all of the possible combinations, the first coefficient has an exponent that lies within the range [4, 7] (note that these exponents correspond directly to a power of 2 except 0, which is reserved to encode the special value of zero, which cannot be represented by a finite exponent), and the exponents of the other two coefficients are encoded differentially with respect to the exponent of the first coefficient, such that they are at most 2. In the example shown below, the two least significant bits (LSBs) of the second section 507.sub.2 (“encExp”) encode the two LSBs of the exponent of the first coefficient directly. If the exponents of the two other coefficients differ, the most significant bit (MSB) of the second section 507.sub.2 (“encExp”) encodes the LSB of the difference between the exponents of the first and second coefficients and the remaining bit in the second section 507.sub.2 (“encExp”) encodes the LSB of the difference between the exponents of the first and third coefficients. The ordering of the exponents ensures that these values are unique (e.g. if the LSB of the first and second exponents difference is 1 and the LSB of the first and third exponent difference is 0, the later exponent difference must be 2 to ensure that the third coefficient is smaller than the second coefficient). If the exponents of the second and third coefficients match, this is indicated by a value of 1 in each of the two MSBs of the second section 507.sub.2 (“encExp”); note that in this example it is not possible for second and third coefficients with distinct exponents no greater than 2 smaller than the first coefficient to both have an LSB of 1. In this case, the parity of the coefficient ordering is redundant and the LSB of encOrder (which is labelled “orderSgn”) is reinterpreted to encode the LSB of the difference between the exponents of the first and second coefficients (which is the same as the LSB of the difference between the exponents of the first and third coefficients). Note that in this example, whether the second and third coefficients have matching exponents or not, encodings that correspond with three matching magnitudes are excluded from the list of options. This is summarised in the table below (where the symbol ‘X’ indicates “don't care”):
TABLE-US-00005 exponent exponent of exponent of of first second (i.e. third (i.e. (i.e. largest) middle) smallest) orderSgn encExp coefficient coefficient coefficient X 0000 4 4 2 X 0001 5 5 3 X 0010 6 6 4 X 0011 7 7 5 X 0100 4 4 3 X 0101 5 5 4 X 0110 6 6 5 X 0111 7 7 6 X 1000 4 3 2 X 1001 5 4 3 X 1010 6 5 4 X 1011 7 6 5 0 1100 4 2 2 0 1101 5 3 3 0 1110 6 4 4 0 1111 7 5 5 1 1100 4 3 3 1 1101 5 4 4 1 1110 6 5 5 1 1111 7 6 6
[0152] When there are two non-zero differential coefficients (e.g. if encOrder is between 0110 and 1011), the first section 507.sub.1 of the second set of bits 506.sub.2 has three bits, two of which respectively indicate the signs of the two non-zero differential coefficients, whilst the third of which may indicate a mantissa bit for the first coefficient. The second section 507.sub.2, which in this example has four bits, indicates one of a set of possible combinations of exponents for the two non-zero differential coefficients. In the example shown below, in all of the possible combinations, the first coefficient has an exponent that lies within the range [4, 7], if the two encoded coefficients are distinct (including the mantissae) and within the range [1, 8] if they are equal. In the example shown below, the exponent of the second coefficient is encoded differentially with respect to the exponent of the first coefficient such that it lies within the range [−1, 3]. A value of −1, however, is only valid when the mantissa bit of the first coefficient is set (i.e. when it is a 1) since otherwise the first coefficient would be smaller than the second coefficient, and a value of 3 is instead valid when the mantissa bit of the first coefficient is not set (i.e. when it is a 0). Note that this assumes the coefficients have been sorted prior to rounding and that it remains beneficial to encode the largest value with greater precision even if the second coefficient rounds with lesser precision to a greater value. The two LSBs of the second section 507.sub.2 (“encExp”) encode the two LSBs of the exponent of the first coefficient directly. The 2 MSBs of the second section 507.sub.2 encode (d1_exp-d0_exp) mod 4, where d0_exp is the exponent of the first coefficient and d1_exp is the exponent of the second coefficient. If this value is 0, the exponents match, and if in addition the mantissa bit of the first coefficient (“d0_mnt”) is not set (i.e. if it is 0), the parity of the coefficient ordering (as indicated by encOrder) is redundant and the LSB of the encOrder bits (“orderSgn”) is reinterpreted to encode the MSB of d0_exp mod 8=d1_exp mod 8. This is summarised in the table below:
TABLE-US-00006 exponent exponent of of first second (i.e. (i.e. largest) middle) d0_mnt orderSgn encExp coefficient coefficient 0 0 0000 8 8 0 0 0001 1 1 0 0 0010 2 2 0 0 0011 3 3 0 1 0000 4 4 0 1 0001 5 5 0 1 0010 6 6 0 1 0011 7 7 1 X 0000 4 4 1 X 0001 5 5 1 X 0010 6 6 1 X 0011 7 7 X X 0100 4 3 X X 0101 5 4 X X 0110 6 5 X X 0111 7 6 X X 1000 4 2 X X 1001 5 3 X X 1010 6 4 X X 1011 7 5 0 X 1100 4 1 0 X 1101 5 2 0 X 1110 6 3 0 X 1111 7 4 1 X 1100 4 5 1 X 1101 5 6 1 X 1110 6 7 1 X 1111 7 8
[0153] When there is one non-zero differential coefficient (e.g. if encOrder is between 1100 and 1110), the first section 507.sub.1 of the second set of bits 506.sub.2 has three bits, one of which indicates the sign of the non-zero differential coefficient, whilst the second and third of which may indicate two mantissa bits for the first coefficient (i.e. for the non-zero coefficient). Alternatively, one or more of the second and third bits may instead be used to increase the precision of the dc coefficient (e.g. when the exponent is small). The second section 507.sub.2 indicates one of a set of possible combinations of exponents for the non-zero differential coefficient. In the example shown below, the exponent of the first coefficient lies within the range [1, 8]. It is noted that the non-zero coefficient cannot be zero (an exponent of 0 is reserved for the value zero rather than 2.sup.0=1). The three LSBs of the second section 507.sub.2 (“encExp”) encode d0_exp mod 8. The MSB of the second section 507.sub.2 is used to increase the precision of the dc coefficient, i.e. the MSB of the second section 507.sub.2 is repurposed and used as an extra bit in the second portion 508, such that the second portion 508 can have 6 bits to represent the average coefficient (dc). This is summarised in the table below:
TABLE-US-00007 exponent of first (i.e. encExp largest) coefficient X000 8 X001 1 X010 2 X011 3 X100 4 X101 5 X110 6 X111 7
[0154] When the three coefficients are of equal magnitude (e.g. if encOrder is 1111), the first section 507.sub.1 of the second set of bits 506.sub.2 has three bits which respectively indicate the signs of the three differential coefficients unless all coefficients are zero, in which case these three bits may be repurposed and used as extra bits in the second portion 508. The second section 507.sub.2 indicates one of a set of possible combinations of exponents for the (equal magnitude) differential coefficients. In this case, when there are 3 matching differential coefficients present, the exponent lies within the range [0, 7]. As mentioned above, when the three coefficients have equal magnitude then they might all be zero. The three LSBs of the second section 507.sub.2 (“encExp”) encode this value directly. If the coefficients are nonzero the MSB of the second section 507.sub.2 is used to increase the precision of the dc coefficient, i.e. the MSB of the second section 507.sub.2 is repurposed and used as an extra bit in the second portion 508, such that the second portion 508 can have 6 bits to represent the average coefficient (dc). If the coefficients are zero, however, the three bits of the first section 507.sub.1 are used to increase the precision of the dc (such that second portion 508 can have 8 bits to represent the average coefficient) and the MSB of the second section 507.sub.2 is instead reserved for a “bypass” flag used in an optional coefficient prediction feature (described later). This is summarised in the table below:
TABLE-US-00008 exponent of exponent of second exponent of third first (i.e. largest) (i.e. middle) (i.e. smallest) encExp coefficient coefficient coefficient x000 0 0 0 X001 1 1 1 X010 2 2 2 X011 3 3 3 X100 4 4 4 X101 5 5 5 X110 6 6 6 X111 7 7 7
[0155] So when the first portion 504 of the compressed data structure has been determined, then method passes to step S308 in which the second portion 508 of the compressed data structure is determined using the average coefficient (dc) determined for the 2×2 quad of pixels. For example, the DC encoding module 212 may receive the average coefficient (dc) and may receive an indication of how the differential coefficients have been encoded. The DC encoding module 212 may round the average coefficient to determine bits representing the value of the average coefficient determined for the 2×2 quad of pixels. For example, the average coefficient (dc) may have 8 bits, and if the second portion 508 has 5 bits (as shown in
[0156] In step S310 the compressed data structure is stored in the memory 204. In particular, the first portion 504 of data is provided to the packing unit 214 from the coefficient encoding module 210. The second portion 508 of data is provided to the packing unit 214 from the DC encoding module 212. The packing unit packs the first portion 504 and the second portion 508 into the compressed data structure and outputs the compressed data structure for storage in the memory 204.
[0157] In the example described above, each pixel has a value in a single channel. In another example, each pixel in a block of pixels has a value in a plurality of channels. The plurality of channels may be Red, Green and Blue channels.
[0158]
[0159] The encoding system shown in
[0160] The block of pixel values are received at the Haar coefficient determining module 606 of the encoding unit 602. In step S302 the Haar coefficient determining module 606 determines a set of Haar coefficients for the 2×2 quad of pixels. The set of Haar coefficients comprises, for each channel, a plurality of differential coefficients (dx, dy and dxy) and an average coefficient (dc). In particular, in step S302, the Haar coefficient determining module 606 determines, for each of the channels, differential channel coefficients as described above. So for the Red channel three differential channel coefficients (dx.r, dy.r and dxy.r) and an average coefficient (dc.r) are determined; for the Green channel three differential channel coefficients (dx.g, dy.g and dxy.g) and an average coefficient (dc.g) are determined, and for the Blue channel three differential channel coefficients (dx.b, dy.b and dxy.b) and an average coefficient (dc.b) are determined. These nine differential coefficients and three average coefficients are provided to the mode encoding module 618. In other examples, the three average coefficients might not be provided to the mode encoding module 618 because the mode encoding module operates on the differential coefficients, not the average coefficients.
[0161] The mode encoding module 618 combines the differential channel coefficients determined for the channels to determine the differential coefficients of the Haar coefficients for the 2×2 quad of pixels. It is noted that in this example, the number of differential channel coefficients for each of the channels (i.e. three) is the same as the number of differential coefficients of the Haar coefficients that are determined for the 2×2 quad of pixels. In this example, the mode encoding module 618 combines, according to a combining mode, the nine channel differential coefficients (dx.r, dy.r, dxy.r, dx.g, dy.g, dxy.g, dx.b, dy.b and dxy.b) to determine three differential coefficients (dx, dy and dxy) which can be passed to the coefficient encoding module. An indication of the combining mode (“enc_mode”) is passed from the mode encoding module 618 to the DC encoding module 612 and then to the packing unit 614 so that it can be included in the compressed data structure. It is noted that in the example shown in
[0162]
[0163] The third portion 710 includes the indication of the combining mode (“encMode”). In this example, step S302 comprises determining the third portion 710 of the compressed data structure 702 which indicates the combining mode, from a set of predetermined combining modes, according to which the differential channel coefficients determined for the channels are combined to determine the differential coefficients of the Haar coefficients for the 2×2 quad of pixels. The third portion 710 of the compressed data structure 702 has a number of bits n.sub.mode, wherein the set of predetermined combining modes comprises 2.sup.n.sup.
[0164] A combining mode is chosen with the aim of preserving as much detail as possible when the nine channel differential coefficients are combined into three differential coefficients. The table below indicates a set of eight predetermined combining modes which may be used, for each combining mode and for each of the nine channel differential coefficients (dx.r, dy.r, dxy.r, dx.g, dy.g, dxy.g, dx.b, dy.b and dxy.b) it indicates which of the output differential coefficients (dx, dy and dxy) that channel differential coefficient contributes to (with a value of 0 indicating that it makes no contribution). The table also indicates the bits of the indication (encMode) used to represent each combining mode:
TABLE-US-00009 encMode dx.r dx.g dx.b dy.r dy.g dy.b dxy.r dxy.g dxy.b 000 dx dx dx dy dy dy dxy dxy dxy 001 dx dy dxy 0 0 0 0 0 0 010 0 0 0 dx dy dxy 0 0 0 011 0 0 0 0 0 0 dx dy dxy 100 dx dy dxy dx dy dxy dx dy dxy 101 dx dy dxy dx dy dxy −dx −dy −dxy 110 dx dy dxy −dx −dy −dxy dx dy dxy 111 dx dy dxy −dx −dy −dxy −dx −dy −dxy
[0165]
dx=[dx.r+dx.g+dx.b]/3
dy=[dy.r+dy.g+dy.b]/3
dxy=[dxy.r+dxy.g+dxy.b]/3
[0166] The other seven combining modes represent different options for dividing the 2×2 quad up into two colours. Block 804 corresponds to encMode 001 and represents the quad as two colours with a vertical division between the two colours. In this mode, the output differential coefficients dx, dy and dxy are determined as: dx=dx.r, dy=dx.g, dxy=dx.b.
[0167] Block 806 corresponds to encMode 010 and represents the quad as two colours with a horizontal division between the two colours. In this mode, the output differential coefficients dx, dy and dxy are determined as: dx=dy.r, dy=dy.g, dxy=dy.b.
[0168] Block 808 corresponds to encMode 011 and represents the quad as two colours with the top left and bottom right pixels being a first colour and the top right and bottom left pixels being a second colour. In this mode, the output differential coefficients dx, dy and dxy are determined as: dx=dxy.r, dy=dxy.g, dxy=dxy.b.
[0169] Block 810 corresponds to encMode 100 and represents the quad as two colours with the bottom right pixel being a first colour and the other three pixels being a second colour. In this mode, the output differential coefficients dx, dy and dxy are determined as:
dx=[dx.r+dy.r+dxy.r]/3
dy=[dx.g+dy.g+dxy.g]/3
dxy=[dx.b+dy.b+dxy.b]/3
[0170] Block 812 corresponds to encMode 101 and represents the quad as two colours with the top left pixel being a first colour and the other three pixels being a second colour. In this mode, the output differential coefficients dx, dy and dxy are determined as:
dx=[dx.r+dy.r−dxy.r]/3
dy=[dx.g+dy.g−dxy.g]/3
dxy=[dx.b+dy.b−dxy.b]/3
[0171] Block 814 corresponds to encMode 110 and represents the quad as two colours with the bottom left pixel being a first colour and the other three pixels being a second colour. In this mode, the output differential coefficients dx, dy and dxy are determined as:
dx=[dx.r−dy.r+dxy.r]/3
dy=[dx.g−dy.g+dxy.g]/3
dxy=[dx.b−dy.b+dxy.b]/3
[0172] Block 816 corresponds to encMode 111 and represents the quad as two colours with the top right pixel being a first colour and the other three pixels being a second colour. In this mode, the output differential coefficients dx, dy and dxy are determined as:
dx=[dx.r−dy.r−dxy.r]/3
dy=[dx.g−dy.g−dxy.g]/3
dxy=[dx.b−dy.b−dxy.b]/3
[0173] The values of dx, dy and dxy determining by the mode encoding module 618 are provided to the first coefficient encoding logic 620. The first coefficient encoding logic 620 operates as described above for the coefficient encoding module 210 to thereby determine the first portion 704 of the compressed data structure 702 encoding the differential coefficients. The third portion 710 (encMode) is provided from the mode encoding module 618 to the DC encoding module 612. The average coefficients (dc.r, dc.g and dc.b) are provided to the DC encoding module 612. The DC encoding module 612 determines the second portion 708 of the compressed data structure 708 as described above for the DC encoding module 212. In particular, the DC encoding module 612 rounds each of the average coefficients to determine an indication of the average coefficients (denoted encDC_R, encDC_G and encDC_B in
[0174] When deciding which combining mode to choose, the mode encoding module 618 selects the combining mode from the set of predetermined combining modes that gives the largest magnitude (or some suitable approximation) for the differential coefficients of the Haar coefficients (dx, dy and dxy) for the 2×2 quad of pixels. By selecting the combining mode that gives the largest magnitude for the differential coefficients, the least amount of data is lost by combining the differential coefficients for the three channels into a single set of dx, dy and dxy coefficients. In some examples, a slight bias for luma encoding is included since more precision is generally available in this mode (compared with palette encoding) for near-monochromatic blocks.
[0175] As mentioned above, in some examples, each pixel in the block of pixels has a further value in an Alpha channel. In this case, the transform module 608 comprises the alpha channel transform module 616.sub.A and the coefficient encoding module comprises the alpha coefficient encoding logic 622. The DC encoding module is also configured to encode a DC value for the alpha channel. In this example, when an alpha channel is present in addition to the Red, Green and Blue channels, the encoding unit is configured to encode the values in the Alpha channel independently of the values in the Red, Green and Blue channels.
[0176] The alpha channel transform module 616.sub.A determines a set of Haar coefficients for the Alpha channel of the 2×2 quad of pixels (as described above for the transform module 208). The set of Haar coefficients for the Alpha channel comprises a plurality of differential Alpha coefficients (dx.a, dy.a and dxy.a) and an average Alpha coefficient (dc.a). The differential Alpha coefficients (dx.a, dy.a and dxy.a) are provided from the alpha channel transform module 616.sub.A to the alpha coefficient encoding logic 622, and the average Alpha coefficient (dc.a) is provided from the alpha channel transform module 616.sub.A to the DC encoding module 612.
[0177] The alpha coefficient encoding logic 622 operates as described above for the coefficient encoding module 210 to thereby determining a first Alpha portion of the compressed data structure using the differential Alpha coefficients determined for the 2×2 quad of pixels. As will be apparent by analogy with the description above in relation to the coefficient encoding module 210, the first Alpha portion comprises: (i) a first set of Alpha bits which indicates an order of the magnitudes of the differential Alpha coefficients; and a second set of Alpha bits which indicates, for one or more of the differential Alpha coefficients which are non-zero, a sign and an exponent.
[0178] The DC encoding module 612 determines a second Alpha portion of the compressed data structure using the average Alpha coefficient determined for the 2×2 quad of pixels. In particular, the DC encoding module 612 determines the second Alpha portion of the compressed data structure as described above for the DC encoding module 212. That is, the DC encoding module 612 rounds the average alpha coefficient (dc.a) to determine an indication of the average coefficient, e.g. which has 5 or 6 bits, and which can be included in the compressed data structure.
[0179] In the examples described above, the block of pixels is a 2×2 block of pixels, such that it includes a single 2×2 quad of pixels. The examples described above may be referred to as “High Quality” or “HQ” encoding schemes.
[0180] However, in an example described with reference to
[0181] The encoding system shown in
[0182] The block of pixel values are received at the Haar coefficient determining module 906 of the encoding unit 902. In step S302 the Haar coefficient determining module 906 determines a set of Haar coefficients for each of the 2×2 quads of pixels within the 4×4 block of pixels. As described above, the set of Haar coefficients comprises, for each channel, a plurality of differential coefficients (dx, dy and dxy) and an average coefficient (dc). In particular, in step S302, the Haar coefficient determining module 906 determines, for each of the channels, differential channel coefficients as described above. So for each 2×2 quad, for the Red channel three differential channel coefficients (dx.r, dy.r and dxy.r) and an average coefficient (dc.r) are determined; for the Green channel three differential channel coefficients (dx.g, dy.g and dxy.g) and an average coefficient (dc.g) are determined, and for the Blue channel three differential channel coefficients (dx.b, dy.b and dxy.b) and an average coefficient (dc.b) are determined. These nine differential coefficients and three average coefficients are provided to the mode encoding module 918. In other examples, the three average coefficients might not be provided to the mode encoding module 918 because the mode encoding module operates on the differential coefficients, not the average coefficients.
[0183] Similarly to as described above with reference to the mode encoding module 618 shown in
dx=[dx.r+dx.g+dx.b]/3
dy=[dy.r+dy.g+dy.b]/3
dxy=[dxy.r+dx.g+dxy.b]/3
[0184] These three differential coefficients for each of the 2×2 quads of pixels are passed to the coefficient encoding module 910.
[0185]
[0186] In this example, the coefficient encoding module 910 implements a more aggressive compression of the differential coefficients for the 2×2 quads of pixels than described above in order to determine respective first portions 1004 of the compressed data structure 1002 for the 2×2 quads of pixels. In particular, the first set of bits 1006.sub.1 is the same as the first set of bits 706.sub.1 and 506.sub.1 described above (though it may have different behaviour in some special cases). However, the second set of bits 1006.sub.2 comprises a single section which indicates, for one or more of the differential coefficients which are non-zero, a sign and an exponent (and sometimes a mantissa bit in examples described herein). This is in contrast to the examples described above in which the second set of bits (706.sub.2 and 506.sub.2) comprise two sections.
[0187] The second set of bits 1006.sub.2 has four bits to indicate the signs and exponents of three differential coefficients. The encoding of the second set of bits 1006.sub.2 depends on the number of non-zero differential coefficients (as indicated by the first set of bits 1006.sub.1 (encOrder)).
[0188] When there are three non-zero differential coefficients and they are not all equal in magnitude (e.g. if encOrder is between 0000 and 0101), the three LSBs of the second set of bits 1006.sub.2 are used to encode the signs of the three coefficients. Since there is just one remaining available bit, the two smaller coefficients are forced to have equal magnitude such that the parity of the coefficient ordering is redundant and the LSB of encOrder (which is labelled “orderSgn”) is reinterpreted to encode the second LSB of the first coefficient's exponent. The MSB of the second set of bits 1006.sub.2 is used to encode the LSB of the first coefficient's exponent. In the example shown below, in all of the possible combinations, the first coefficient has an exponent that lies within the range [4, 7], and the exponents of the other two coefficients are equal to each other and one less than the exponent of the first coefficient. The possible combinations of exponents for the three differential coefficients in this example are summarised in the table below:
TABLE-US-00010 exponent exponent of exponent of of first second (i.e. third (i.e. (i.e. largest) middle) smallest) orderSgn encSgnExp coefficient coefficient coefficient 0 0XXX 4 3 3 0 1XXX 5 4 4 1 0XXX 6 5 5 1 1XXX 7 6 6
[0189] When there are two non-zero differential coefficients (e.g. if encOrder is between 0110 and 1011), the second and third MSBs of the second set of bits 1006.sub.2 are used to represent the two signs of the non-zero differential coefficients (the second MSB is used to encode the sign of the first i.e. larger coefficient and the third MSB is used to encode the sign of the second i.e. smaller coefficient), and the MSB and the LSB of the second set of bits 1006.sub.2 indicate one of a set of four possible combinations of exponents for the two non-zero differential coefficients. In the example shown below, in all of the possible combinations, the first coefficient has an exponent that lies within the range [4, 7], and the exponents of the other non-zero coefficient is one less than the exponent of the first coefficient. In this example, the MSB of the second set of bits (“encSgnExp”) gives the LSB of the largest coefficient exponent and the LSB of the second set of bits (“encSgnExp”) gives the second bit of the largest coefficient exponent. The possible combinations of exponents for the two non-zero differential coefficients in this example are summarised in the table below:
TABLE-US-00011 exponent exponent of of first second (i.e. (i.e. largest) middle) encSgnExp coefficient coefficient 0XX0 4 3 1XX0 5 4 0XX1 6 5 1XX1 7 6
[0190] When there is one non-zero differential coefficients (e.g. if encOrder is between 1100 and 1110), the second MSB of the second set of bits 1006.sub.2 is used to represent the sign of the non-zero differential coefficient, the MSB of the second set of bits 1006.sub.2 is used to represent the LSB of the exponent of the non-zero coefficient and the LSB of the second set of bits 1006.sub.2 is used to represent the second LSB of the exponent of the non-zero coefficient. If the two LSBs of the exponent of the non-zero coefficient are both zero, then the second LSB of the second set of bits 1006.sub.2 is used to represent the third LSB of the exponent of the non-zero coefficient, otherwise it is used to represent a single mantissa bit of the non-zero coefficient. The possible exponents for the single non-zero differential coefficient in this example are summarised in the table below:
TABLE-US-00012 exponent of first (i.e. encSgnExp largest) coefficient 0X00 8 0X10 4 1XX0 5 0XX1 6 1XX1 7
[0191] This more aggressive encoding may be referred to as a “low quality” or “LQ” encoding. In the example given above, this LQ encoding does not provide for three equal magnitude coefficients. Instead, the three equal magnitude coefficient encoding (e.g. if encOrder is 1111) is repurposed to provide additional encodings for one non-zero differential coefficient. Since there is no indication from the coefficient ordering which of the three coefficients is largest (which could be used to determine the nonzero coefficient), the second LSB of the second set of bits 1006.sub.2 is used to indicate whether the dx coefficient is nonzero (when the second LSB is 0) or whether the dy coefficient is nonzero (when the second LSB is 1); there is no encoding to select the dxy coefficient (as a second order term, it is generally less important). The third LSB of the second set of bits 1006.sub.2 represents the sign of the non-zero coefficient, the MSB of the second set of bits 1006.sub.2 represents the LSB of the exponent of the non-zero coefficient and the LSB of the second set of bits 1006.sub.2 represents the second LSB of the exponent of the non-zero coefficient. If the two LSBs of the exponent of the nonzero coefficient are both zero, the coefficient is treated as being zero (so that a null vector can still be represented with this reinterpreted encoding) and the second and third LSBs, representing the indication of which coefficient is non-zero and its sign respectively, are both redundant and the second LSB of the second set of bits is instead used to indicate a prediction bypass flag (described later). In this example, the non-zero exponents lie in the range [1,3]. The possible exponents for the non-zero differential coefficients are summarised in the table below:
TABLE-US-00013 exponent of first (i.e. encSgnExp largest) coefficient 0XX0 NA 1XX0 1 0XX1 2 1XX1 3
[0192] So, for each of the 2×2 quads of pixels the Haar coefficient determining module 906 determines a set of Haar coefficients for the 2×2 quad of pixels, wherein the set of Haar coefficients comprises a plurality of differential coefficients and an average coefficient, and then for each of the 2×2 quads of pixels, the coefficient encoding module 910 determines a respective first portion 1004 of the compressed data structure 1002 for the 2×2 quad of pixels using the differential coefficients determined for the 2×2 quad of pixels.
[0193] The second portion 1008 of the compressed data structure 1002 is determined by the encoding unit 902 using the average coefficients (dc) determined for each of the 2×2 quads of pixels. In particular, a 2×2 quad of coefficients represented by the average coefficients determined for each of the 2×2 quads of pixels is determined. As shown in
[0194] The Haar coefficient determining module 906 operates on the 2×2 quad of coefficients in the same way that the Haar coefficient determining module 606 described above operates on a 2×2 quad of pixels. In this way, the Haar coefficient determining module 906 determines a set of L1 Haar coefficients for the 2×2 quad of coefficients, wherein the set of L1 Haar coefficients comprises a plurality of L1 differential coefficients and an L1 average coefficient.
[0195] The coefficient encoding module 910 determines a first sub-portion (including the “encOrder”, “encSgnMnt” and “encExp” fields shown in bit positions 32 to 42 in
[0196] The DC encoding module 910 determines a second sub-portion (including the “encDC_R”, “encDC_G” and “encDC_B” fields shown in bit positions 46 to 63 in
[0197] The mode encoding module 918 determines a third sub-portion (including the “encMode” field shown in bit positions 43 to 45 in
[0198] In some examples, the values of the differential coefficients (i.e. the dx, dy and dxy coefficients) for a 2×2 quad of pixels are predicted based on the average coefficients (i.e. the dc coefficients) of one or more neighbouring 2×2 quads of pixels. Differences (or “delta values”) between the predicted values and the original values of the differential coefficients for the 2×2 quad of pixels can be determined and then used in place of the differential coefficients for the encoding of the differential coefficients (i.e. for determining the first portion of the compressed data structure). The delta values tend to be smaller in magnitude than the differential coefficients themselves. In general, the encoding techniques described herein will be more accurate (i.e. introduce fewer or less significant artefacts) when the values which are encoded as the differential coefficients have smaller magnitude. The average coefficients (i.e. the dc coefficients) are unmodified by the prediction. If the differential coefficients have zero magnitude, a bypass control signal may be set, and in response to this bypass control signal being set, the encoding unit might not implement this prediction technique. This avoids (potentially harmful) prediction in regions of flat colour.
[0199] Sometimes the average coefficients of the neighbouring 2×2 quads might not all be available, in which case the dc value of the current 2×2 quad may be used in the prediction. In the explanation given below, the control signals “enable_n”, “enable_e”, “enable_s” and “enable_w” indicate whether the average coefficient is available fora quad across each of the northern, eastern, southern and western axis-aligned quad boundaries. One or more of these signals may be low when encoding quads at the boundary of a block-based scheme. The original Haar coefficients before the prediction (i.e. “upstream” of the prediction) are denoted “dx_up”, “dy_up”, “dxy_up” and “dc_up”, and the coefficients after the prediction (i.e. “downstream” of the prediction) are denoted “dx_dn”, “dy_dn”, “dxy_dn” and “dc_dn”. As mentioned above, the dc coefficient is not modified by the prediction, so dc_dn=dc_up.
[0200] The bypass control signal is set (as described above) such that:
[0201] The predictor for the dx coefficient selects eastern and western dc coefficients, as available, for terms c.sub.e and c.sub.w respectively:
[0202] An unnormalised prediction δ′.sub.x is defined by δ′.sub.x=c.sub.e−c.sub.w. The prediction is normalised by the separation distance of dc terms. If only a single neighbour is available, this distance is half that of when both neighbours are available (since the missing neighbour is replaced by the central value). The (normalised) prediction δ.sub.x is given by:
where the value of the normalised prediction δ.sub.x may be rounded in accordance with a rounding mode, e.g. a round to nearest mode. It is noted that if neither the east nor the west neighbour is available, the prediction is by definition zero, such that the normalisation factor is irrelevant.
[0203] The residual dx coefficient (i.e. the delta value representing the difference between the dx coefficient and the prediction of the dx coefficient) is given as dx_dn=dx_up−δ.sub.x. The value of dx_dn may be rounded in accordance with a rounding mode, e.g. a round to nearest mode.
[0204] The predictor for the dy coefficient selects northern and southern dc coefficients, as available, for terms c.sub.n and c.sub.s respectively:
[0205] An unnormalised prediction δ′.sub.y is defined by δ′.sub.y=c.sub.s−c.sub.n. The prediction is normalised by the separation distance of dc terms. If only a single neighbour is available, this distance is half that of when both neighbours are available (since the missing neighbour is replaced by the central value). The (normalised) prediction δ.sub.y is given by:
where the value of the normalised prediction δ.sub.y may be rounded in accordance with a rounding mode, e.g. a round to nearest mode. It is noted that if neither the north nor south neighbour is available, the prediction is by definition zero, such that the normalisation factor is irrelevant.
[0206] The residual dy coefficient (i.e. the delta value representing the difference between the dy coefficient and the prediction of the dy coefficient) is given as dy_dn=dy_up−δ.sub.y. The value of dy_dn may be rounded in accordance with a rounding mode, e.g. a round to nearest mode.
[0207] The predictor for the dxy coefficient selects north-western, north-eastern, south-western and south-eastern dc coefficients, as available, for terms c.sub.nw, c.sub.ne, c.sub.sw and c.sub.sw respectively. Ideally, for each term, data across both relevant boundaries is available. Otherwise, the predictor selects which of the relevant northern, eastern, southern and western coefficients are available, if any, for a given term, as follows:
[0208] An unnormalised prediction δ″.sub.xy is defined by δ′.sub.xy=½(c.sub.nw+c.sub.se−c.sub.ne−c.sub.sw). The prediction is normalised by the separation distance of dc terms along the x and y axes. If only a single neighbour is available along a given axis, this distance is half that of when both neighbours are available (since the missing neighbour is replaced by the central value). The (normalised) prediction δ.sub.xy is given (in terms of an intermediate prediction δ′.sub.xy) by:
where the values of the intermediate prediction δ′.sub.xy and the normalised prediction δ.sub.y may be rounded in accordance with a rounding mode, e.g. a round to nearest mode. It is noted that if neither neighbour is available for a given axis, the prediction is by definition zero, such that the normalisation factor is irrelevant.
[0209] The residual dxy coefficient (i.e. the delta value representing the difference between the dxy coefficient and the prediction of the dxy coefficient) is given as dxy_dn=dxy_up−δ.sub.xy. The value of dxy_dn may be rounded in accordance with a rounding mode, e.g. a round to nearest mode.
[0210] As mentioned above, in some examples, each pixel in the block of pixels has a further value in an Alpha channel. In this case, the transform module 908 comprises the alpha channel transform module 916.sub.A and the coefficient encoding module comprises the alpha coefficient encoding logic 922. The DC encoding module is also configured to encode a DC value for the alpha channel. In this example, when an alpha channel is present in addition to the Red, Green and Blue channels, the encoding unit is configured to encode the values in the Alpha channel independently of the values in the Red, Green and Blue channels. In one example, Haar coefficients are determined for each of the 2×2 quads of pixels using the low quality encoding scheme, and the average coefficients for the alpha channel are fed back to the input of the Haar coefficient determining module 906 so they can be processed as a quad of coefficients to determine the L1 Haar coefficients for the alpha channel, using the high quality scheme. The processing then proceeds as described above for the alpha channel independently of the Red, Green and Blue channels. In another example, the alpha channel is encoded using the high quality encoding scheme only (as in
[0211] The coefficient encoding modules (210, 610, 910) described above employ a combination of deduction and heuristic to determine a suitable encoding of a block of pixels. As an example, the following conditions, applied to sorted and then suitably rounded (e.g. to the available low precision float formats) input coefficients, may be evaluated in decreasing precedence:
1. If the largest differential coefficient has zero magnitude or a bypass control signal has been set, then 0 non-zero coefficients are encoded and encOrder is set to 1111. The label “PTFC_0D” (which is used to indicate all zero coefficients as a special case of matching coefficients i.e. “PTFC_MD”; this can be useful to distinguish cases that might need remapping as in the RGB dc coefficient encoding described earlier) is assigned to this situation.
2. If the encoding is to be a high quality encoding (such that a “hq” control signal has been set) and the middle of the ordered differential coefficients has zero magnitude (which of course implies that the lowest of the ordered differential coefficients also has zero magnitude), or the hq control signal has not been set (i.e. such that the low quality encoding is used) and the largest of the ordered differential coefficients has a rounded exponent of 8, then 1 nonzero coefficient is encoded, and encOrder is set to one of 1100, 1101 or 1110 depending on the ordering of the differential coefficients as described above. The label “PTFC_1D” is assigned to this situation.
3. If the hq control signal has been set and the differential coefficients have matching exponents or the largest differential coefficient exponent lies outside the closed interval [4,7] and the range in exponents of the differential coefficients is no greater than 1, then 3 nonzero coefficients of equal magnitude are encoded, and encOrder is set to 1111. If the hq control signal has not been set and the exponent of the largest differential coefficient is smaller than 4, at most 1 nonzero coefficient is encoded (encodings ordinarily reserved for matching coefficients are repurposed for 1 coefficient due to encoding sparsity in the low quality mode). However, in either case, the label “PTFC_MD” is assigned.
4. If the hq control signal has been set and the range of the exponents of the coefficients is less than 2, then 3 nonzero coefficients of nonequal magnitude are encoded, and encOrder is set to one of 0000 to 0101 depending on the ordering of the differential coefficients as described above. If the hq control signal is set, the range of the exponents of the coefficients is 2, the mantissa of the major coefficient (rounded to three bits of mantissa) is not equal to 1.375 and the exponent of the largest coefficient lies in the interval [4,7], then 3 nonzero coefficients of nonequal magnitude are encoded, and encOrder is set to one of 0000 to 0101 depending on the ordering of the differential coefficients as described above. If the hq control signal is not set and the exponent range of coefficients is less than or equal to 2, then 3 nonzero coefficients of equal magnitude are encoded, and encOrder is set to one of 0000 to 0101 depending on the ordering of the differential coefficients as described above. The label “PTFC_3D” is assigned.
5. If the hq control signal has been set and the difference in exponent between the two largest coefficients, when the larger coefficient is rounded with 1 bit of mantissa (when available), is less than 1 (it can be negative in the high quality mode due to potentially higher precision representation of the larger coefficient, such that rounding does not preserve the ordering of terms), then 2 nonzero coefficients are encoded, and encOrder is set to one of 0110 to 1011 depending on the ordering of the differential coefficients as described above. If the hq control signal has been set, the two larger coefficients exponent difference is 1 (taking into account rounding when the larger coefficient has 1 bit of mantissa) and the larger exponent lies in the closed exponent interval [4,7], then 2 nonzero coefficients are encoded and encOrder is set to one of 0110 to 1011 depending on the ordering of the differential coefficients as described above. If the hq control signal has not been set and the non-zero coefficient exponent difference is less than 2, then 2 nonzero coefficients are encoded, and encOrder is set to one of 0110 to 1011 depending on the ordering of the differential coefficients as described above. The label “PTFC_2D” is assigned.
6. If the hq control signal has been set, but the largest coefficient exponent (rounded to one bit of mantissa as appropriate for 2 non-zero coefficients encoding) lies outside the closed exponent interval [4,7], then 1 nonzero coefficient is encoded and encOrder is set to one of 1100 to 1110 depending on the ordering of the differential coefficients as described above. If the hq control signal has not been set, then 1 nonzero coefficient is encoded (this is thus the last condition for the low quality encoding and subsequent conditions only apply when the hq control signal has been set). The label “PTFC_1D” is assigned.
7. If the difference in exponent between the two largest coefficients, when the largest coefficient is rounded with 1 bit of mantissa, is 2, then 2 nonzero coefficients are encoded, and encOrder is set to one of 0110 to 1011 depending on the ordering of the differential coefficients as described above. If the 2d exponent range is 3, the mantissa of the rounded 2d major coefficient is 1 and the mantissa of the major coefficient is neither 1.125 nor 1.625, then 2 nonzero coefficients are encoded, and encOrder is set to one of 0110 to 1011 depending on the ordering of the differential coefficients as described above. The label “PTFC_2D” is assigned.
8. If none of the above conditions are satisfied, then 1 nonzero coefficient is encoded, and encOrder is set to one of 1100 to 1110 depending on the ordering of the differential coefficients as described above. The label “PTFC_1D” is assigned.
[0212] The encoding processes described above is efficient to implement (e.g. in terms of latency, power consumption and/or silicon area).
[0213] We now go on to describe how a compressed data structure representing a block of pixels is decoded. As described above, the decoding process involves representing a 2×2 quad of pixels as the dc value+some differential term, which is determined by performing a weighted sum of the dx, dy and dxy coefficients.
[0214]
[0215]
[0216] The compressed data structure is received at the decoding unit 1102, e.g. from the memory 1104. In step S1202 the decoding unit 1102 (e.g. the decoder instance 1105) uses the first portion (e.g. 504) of the compressed data structure to determine signs and exponents for the one or more of the differential coefficients which are non-zero for the 2×2 quad of pixels. As described above, the first portion (e.g. 504) of the compressed data structure comprises: (i) a first set of bits (e.g. 506.sub.1) which indicates an order of the magnitudes of the differential coefficients, and (ii) a second set of bits (e.g. 506.sub.2) which indicates a sign and an exponent for each of one or more of the differential coefficients which are non-zero. Different implementations for how the first portion of the compressed data structure indicates the signs and exponents for the differential coefficients are described above, and it will be apparent how the decoder can determine the signs and exponents for the one or more of the differential coefficients which are non-zero from the first portion of the compressed data structure in the different implementations described above. As described above, the first set of bits also indicates whether any (and if so, which) of the differential coefficients are zero. So in step S1202 the decoding unit 1102 (e.g. the decoder instance 1105) can determine whether any (and if so, which) of the differential coefficients are zero using the first portion (e.g. 504) of the compressed data structure. It is noted that in most situations, the decoding unit 1102 (e.g. the decoder instance 1105) does not determine any mantissa bits for the differential coefficients, although in some situations one or more mantissa bits are determined (as described above).
[0217] In step S1204 the decoding unit 1102 (e.g. the decoder instance 1105) uses the second portion (e.g. 508) of the compressed data structure to determine a representation of the average coefficient for the 2×2 quad of pixels. Different implementations for how the second portion of the compressed data structure indicates the average coefficient are described above (including examples in which the first portion (e.g. 504) of the compressed data structure also contribute to the determination of the average coefficient), and it will be apparent how the decoder can determine the average coefficient from the second portion of the compressed data structure in the different implementations described above. For example, the bits of the second portion of the compressed data structure may be a rounded version of the average coefficient.
[0218] In step S1206 the decoding unit 1102 (e.g. the decoder instance 1105) determines the result of a weighted sum of the differential coefficients and the average coefficient for the 2×2 quad of pixels. This is done using: (i) the determined signs and exponents for the one or more of the differential coefficients (dx, dy and dxy) which are non-zero, (ii) the determined representation of the average coefficient (dc), and (iii) respective weights (w.sub.x, w.sub.y and w.sub.xy) for the differential coefficients.
[0219] The weighted sum of the differential coefficients and the average coefficient for the 2×2 quad of pixels may comprise adding a gamma correction term. For example, the weighted sum may be determined as: dc+w.sub.x, dx+w.sub.y dy+w.sub.xy.Math.dxy+γ, where γ is an optional pseudo gamma correction term. The gamma correction term can be determined empirically, and may be different in different implementations. For example, in some implementations γ=0. As another example, in other implementations, γ=α[(1−|w.sub.x|)|dx|+(1−|w.sub.y|)|dy|+(1−|w.sub.xy|)|dxy|], where α is some fixed constant
selected to provide the best approximation for the target application, while minimising computational complexity. If a is known at the point of compression, the transform module (e.g. 208) may be modified to perform gamma correction on the average coefficient dc, either according to the exact gamma response of the colour format or according to the following expression:
dc.sub.γ=dc+α(|dx|+|dy|+|dxy|)
where dc.sub.γ is the “gamma correct” average coefficient. With this substitution, the weighted sum simplifies to
dc.sub.γ+w.sub.x.Math.dx+w.sub.y.Math.dy+w.sub.xy.Math.dxy−α(|w.sub.x.Math.dx|+|w.sub.y.Math.dy|+|w.sub.xy.Math.dxy|)
which, assuming a is a simple power of two (or zero), can be implemented in binary arithmetic as a signed sum of shifted terms involving products of the differential coefficients dx, dy and dxy and their respective weights w.sub.x, w.sub.y and w.sub.xy.
[0220] Since the differential coefficients are represented as signed exponents, the determination in step S1206 of the result of the weighted sum of the differential coefficients and the average coefficient for the 2×2 quad of pixels does not need to involve performing any multiplication or division operations. For example, it may involve performing a binary shift of the weights in accordance with the exponents that were determined in step S1202 for the respective one or more of the differential coefficients, and summing the representation of the average coefficient that was determined in step S1202 and the shifted weights in accordance with the signs that were determined in step S1202 for the respective one or more of the differential coefficients. If a sign is negative, true negation of the corresponding shifted weight can be approximated with bit inversion (although exact inversion can alternatively be adopted without a significant increase in complexity). In particular, if the weights are supplied in a signed magnitude format (such that a value is encoded with a single bit representing its sign and a plurality of bits representing its magnitude, as in an unsigned integer format), a product of a differential coefficient, represented as a signed exponent, and its respective weight is given by (−1).sup.s.sup.
which is seen in binary arithmetic, if a is a power of two
or zero, to De a signed sum of shifted terms involving (the magnitudes of) w.sub.x, w.sub.y and w.sub.xy. Note that the above efficient representation holds whether or not gamma correction is applied (e.g. when α=0).
[0221] If there are fewer than three non-zero coefficients, the above expression simplifies further. In some examples, the largest differential coefficient may be represented with one bit of mantissa if no more than two differential coefficients are represented as being non-zero. Furthermore, in some examples, the largest differential coefficient may be represented with two bits of mantissa if only one differential coefficient (i.e. the largest) is represented as being non-zero. Such examples introduce additional precision without increasing the overall arithmetic complexity. A generalised form of the above weighted sum is given by:
where d.sub.0, d.sub.1 and d.sub.2 are the differential coefficients sorted in order of decreasing magnitude, w.sub.0, and w.sub.2 are their respective (sorted) weights and m.sub.1 and m.sub.2 are the first and second mantissa bits of the largest coefficient d.sub.0. Since m.sub.2≠0 implies that d.sub.1=d.sub.2=0 the weighted sum may be written
whenever the largest coefficient has two bits of mantissa (and therefore the remaining coefficients are zero). This expression has the same number of shifted additive terms as the pure exponent (i.e. no mantissa) expression and thus has a similar arithmetic complexity. Note that the (single bit) mantissa coefficients m.sub.1 and m.sub.2 represent values of one or zero and thus act only to include or exclude an additive term (the logic for which may anyway exist to handle zero coefficients). Otherwise, since m.sub.1≠0 implies that d.sub.2=0 the weighted sum may be written
whenever the largest coefficient has one bit of mantissa (but not two). This expression has the same number of shifted additive terms as the pure exponent expression and thus has a similar arithmetic complexity. Hence we see that additional precision can be introduced without increasing complexity by replacing terms (using relatively cheap selection logic) involving zero coefficients with partial mantissa products.
[0222] In step S1208 the decoding unit 1102 (e.g. the decoder instance 1105) uses the result determined in step S1206 to determine the decoded value. In step S1210 the decoding unit 1102 (e.g. the decoder instance 1105) outputs the determined decoded value. The outputted decoded value may be used in any suitable manner, e.g. processed by a processing unit, transmitted to another device or stored in a memory (e.g. in the memory 1304) to give just a few examples of how the decoded value may be used.
[0223] In the example shown in
[0224] In the example shown in
[0225] As mentioned above, steps S1206 and S1208 are performed four times for the respective four pixels of the 2×2 quad of pixels by the respective four decoder instances. Steps S1202 and S1204 may be performed a single time to determine the average coefficient and the signs and exponents of the differential coefficients for the 2×2 quad of pixels, with those values being provided to each of the decoder instances. Alternatively, each of the decoder instances may implement steps S1202 and S1204. As noted above, when the w.sub.x, w.sub.y and w.sub.xy weights all have unit magnitude and the sign of the w.sub.xy weight is the sign of the product of the w.sub.x and w.sub.y weights, then the decoder will reconstruct one of the original 4 pixels in the 2×2 quad. When the w.sub.x, w.sub.y and w.sub.xy weights all have unit magnitude then the pseudo gamma correction term described above is zero.
[0226] As noted above with reference to
The predetermined weights (w.sub.x, w.sub.y and w.sub.xy) used by the decoder instance 1105 are w.sub.x=−1, w.sub.y=−1 and w.sub.xy=+1. Therefore, the result of the weighted sum determined in step S1206 is given by
[0227] The predetermined weights (w.sub.x, w.sub.y and w.sub.xy) used by the decoder instance 1106 are w.sub.x=−1, w.sub.y=+1 and w.sub.xy=−1. Therefore, the result of the weighted sum determined in step S1206 is given by
[0228] The predetermined weights (w.sub.x, w.sub.y and w.sub.xy) used by the decoder instance 1108 are w.sub.x=+1, w.sub.y=−1 and w.sub.xy=−1. Therefore, the result of the weighted sum determined in step S1206 is given by
[0229] The predetermined weights (w.sub.x, w.sub.y and w.sub.xy) used by the decoder instance 1110 are w.sub.x=+1, w.sub.y=+1 and w.sub.xy=+1. Therefore, the result of the weighted sum determined in step S1206 is equivalent to:
[0230] The decoder instances shift the weights by the exponents of the corresponding differential coefficients and use the signs of the differential coefficients to sum them correctly. Binary shifts and additions/subtractions are very simple to implement, e.g. in hardware. So the processing power, silicon area and/or latency of the decoding unit 1102 can be reduced compared to conventional decoding units. Furthermore, the compression ratio that can be achieved using the scheme described herein without detrimentally affecting the quality of the data can be improved compared to conventional encoding/decoding schemes.
[0231] In some examples described above, the values of the differential coefficients (i.e. the dx, dy and dxy coefficients) for a 2×2 quad of pixels are predicted based on the average coefficients (i.e. the dc coefficients) of one or more neighbouring 2×2 quads of pixels. Differences (or “delta values”) between the predicted values and the original values of the differential coefficients for the 2×2 quad of pixels can be determined and then used in place of the differential coefficients for the encoding of the differential coefficients (i.e. for determining the first portion of the compressed data structure). In these examples, step S1208 of using the determined result to determine the decoded value comprises: (i) using the determined result to determine a delta value; and (ii) adding a predicted value to the delta value to determine the decoded value for the block of pixels, wherein the predicted value is determined based on one or more neighbouring blocks of pixels. Since these predicted values are in addition to the coefficients encoded in the compressed data structure (and may not be suited to signed exponent representation), they are typically more suited to such examples as above where filtering is not required and therefore variable weights need not be applied to the predicted terms.
[0232]
[0233]
[0234] This example is particularly useful for texture filtering. In this example, the weights are not predetermined. This allows the weights to be altered in runtime to provide different types of filtering, e.g. isotropic filtering or anisotropic filtering with different levels of detail, different anisotropic ratios and different anisotropic directions. In particular, the filter parameter setup module 1312 of the decoding unit 1302 receives an indication of a level of detail along a minor axis of the filtering (p.sub.λ), an indication of a level of detail along a major axis of the filtering (p.sub.μ) and an anisotropic direction (φ), as well as u and v coordinates which indicate a position at which a filtered sample is to be determined. In step S1401, the decoding unit 1302 (specifically the filter parameter setup module 1312) determines the respective weights for the differential coefficients. As described in more detail below, the weights can be determined based on one or more of: (i) a position, relative to the 2×2 quad of pixels, that the decoded value is to be determined (e.g. as indicated by the u and v coordinates); (ii) an indication of a level of detail to be provided by the decoded value (e.g. as indicated by the PA parameter); and (iii) an indication of an amount and/or direction of anisotropy to be provided by the decoded value (e.g. as indicated by the p.sub.λ, p.sub.μ and φ parameters).
[0235] In this example, the compressed data structure encodes a 4×4 block of pixels. As shown in
[0236] For each of the four 2×2 quads of pixels a respective one of the decoder instances (1305, 1306, 1308 and 1310) determines a result of a weighted sum of the differential coefficients and the average coefficient for that 2×2 quad of pixels as described above based on a respective set of weights (w.sub.x, w.sub.y and w.sub.xy) that it receives from the filter parameter setup module 1312. In particular, steps S1402, S1404 and S1406 are performed for each of the respective 2×2 quads (1502, 1504, 1506 and 1508) by a respective one of the decoder instances (1305, 1306, 1308 and 1310). Steps S1402 to S1406 are now described with reference to the processing of the quad 1502 by the decoder instance 1305, but it is to be understood that the other decoder instances (1306, 1308 and 13010) process their respective quads (1504, 1506 and 1508) in the same way. In step S1402 the decoder instance 1305 uses the first portion (e.g. 504) of the compressed data structure to determine signs and exponents for the one or more of the differential coefficients which are non-zero for the 2×2 quad of pixels. As described above, the first portion (e.g. 504) of the compressed data structure comprises: (i) a first set of bits (e.g. 506.sub.1) which indicates an order of the magnitudes of the differential coefficients, and (ii) a second set of bits (e.g. 506.sub.2) which indicates a sign and an exponent for each of one or more of the differential coefficients which are non-zero. Different implementations for how the first portion of the compressed data structure indicates the signs and exponents for the differential coefficients are described above, and it will be apparent how the decoder can determine the signs and exponents for the one or more of the differential coefficients which are non-zero from the first portion of the compressed data structure in the different implementations described above. As described above, the first set of bits also indicates whether any (and if so, which) of the differential coefficients are zero. So in step S1402 the decoding unit 1302 (e.g. the decoder instance 1305) can determine whether any (and if so, which) of the differential coefficients are zero using the first portion (e.g. 504) of the compressed data structure. It is noted that in most situations, the decoding unit 1302 (e.g. the decoder instance 1305) does not determine any mantissa bits for the differential coefficients, although in some situations one or more mantissa bits are determined (as described above).
[0237] In step S1404 the decoding unit 1302 (e.g. the decoder instance 1305) uses the second portion (e.g. 508) of the compressed data structure to determine a representation of the average coefficient for the 2×2 quad of pixels. Different implementations for how the second portion of the compressed data structure indicates the average coefficient are described above (including examples in which the first portion (e.g. 504) of the compressed data structure also contribute to the determination of the average coefficient), and it will be apparent how the decoder can determine the average coefficient from the second portion of the compressed data structure in the different implementations described above. For example, the bits of the second portion of the compressed data structure may be a rounded version of the average coefficient.
[0238] In step S1406 the decoding unit 1302 (e.g. the decoder instance 1305) determines the result of a weighted sum of the differential coefficients and the average coefficient for the 2×2 quad of pixels. This is done using: (i) the determined signs and exponents for the one or more of the differential coefficients (dx, dy and dxy) which are non-zero, (ii) the determined representation of the average coefficient (dc), and (iii) the respective weights (w.sub.x, w.sub.y and w.sub.xy) for the differential coefficients received from the filter parameter setup module.
[0239] Each of the decoder instances (1305, 1306, 1308 and 1310) output the determined result of their weighted sum to the bilinear interpolation unit 1314. It is noted that the different decoder instances will determine different results because the weights that they use in the weighted sums (as provided by the filter parameter setup module 1312) are different. In step S1408, the bilinear interpolation unit 1314 combines the four determined results for the respective four 2×2 quads of pixels to determine the decoded value for the block of pixels. In particular, the bilinear interpolation unit 1314 performs bilinear interpolation on the four determined results for the respective four 2×2 quads of pixels to determine the decoded value for the block of pixels. In step S1410 the decoding unit 1302 (e.g. the bilinear interpolation unit 1314) outputs the determined decoded value. The outputted decoded value may be used in any suitable manner, e.g. processed by a processing unit, transmitted to another device or stored in a memory (e.g. in the memory 1304) to give just a few examples of how the decoded value may be used.
[0240] For some locations within the area 1510, a decoded value can be determined from a single 2×2 quad of pixels. In these cases the bilinear interpolation unit 1314 applies a weight of 1 to the determined result for that single 2×2 quad of pixels and it applies a weight of 0 for the other three determined results for the other 2×2 quads of pixels. In other words, if a position at which the decoded value is to be determined is within an area that is representable by considering the pixels of a single one of the 2×2 quads of pixels in the block of pixels then the decoded value for the block of pixels is equal to the determined result for that single 2×2 quad of pixels.
[0241] For some locations within the area 1510, a decoded value cannot be determined from a single 2×2 quad of pixels. In these cases the bilinear interpolation unit 1314 blends results determined for two or more of the 2×2 quads of pixels by applying non-zero weights in its weighted sum to the determined results for those two or more of the 2×2 quad of pixels. In other words, if a position at which the decoded value is to be determined is not within an area that is representable by considering the pixels of a single one of the 2×2 quads of pixels in the block of pixels then the decoded value for the block of pixels is the result of a weighted sum of two or more of the determined results for the 2×2 quads of pixels.
[0242] The weights that are determined by the filter parameter setup module 1312 for each of the 2×2 quads vary in dependence upon the u and v parameters which define the position at which the decoded value is to be determined. For example, the weights for the differential coefficients can be determined by: (i) determining initial weights based on the LOD indication (p.sub.λ), an indication of an amount of anisotropy (which may be indicated by p.sub.λ and p.sub.μ, or which may be indicated by an anisotropic ratio η) and an indication of a direction of anisotropy (φ) as described in more detail below; and multiplying the determined initial weights by respective transfer functions to determine the weights for the differential coefficients. A transfer function can be considered to be a function which models a module's output for different inputs. As described in more detail below, each of the transfer functions is based on a position, relative to the 2×2 quad of pixels, that the decoded value is to be determined at, and each of the transfer functions: (i) is zero at the centre of the 2×2 quad of pixels, (ii) has a magnitude of one at each of the pixel positions of the 2×2 quad of pixels, and (iii) is extrapolated out to the edges of the area represented by the 2×2 quad of pixels. For example, each of the transfer functions t.sub.x and t.sub.y (which are multiplied by the initial weights for the dx and dy coefficients respectively to determine the final weights for the dx and dy coefficients) is linearly extrapolated out to the edges of the area represented by the 2×2 quad of pixels such that it lies in the range [−2, +2]. The transfer function t.sub.xy (which is multiplied by the initial weights for both of the dx and dy coefficients and then added to the initial weight for the dxy coefficient to determine the final weight for the dxy coefficient) is a product of the t.sub.x and t.sub.y transfer functions so the extrapolation of t.sub.xy out to the edges of the area represented by the 2×2 quad of pixels is not necessarily linear, and the value of t.sub.xy lies in the range [−4, +4].
[0243]
[0244] Each of the transfer functions is determined based on a position, relative to the 2×2 quad of pixels, that the decoded value is to be determined. In
[0245] In
[0246] The final weight for the dxy coefficient is given by the product of the transfer functions t.sub.x and t.sub.y, each of which has been weighted by the initial weights for dx and dy respectively as above, to which the initial dxy weight is added (which does not have any spatial (i.e. u and v) dependence). Note that the initial dxy weight is zero whenever the filter response is isotropic such that in these cases the weight for the dxy coefficient is always the product of the weights for the dx and dy coefficients (compare with the simple reconstruction scheme described with reference to
[0247] In some examples, in order to apply full weighting on any individual texel within the quad, the texels are mirrored such that the interpolation point falls within a particular quadrant, e.g. the top left quadrant. After mirroring, and depending on the LOD setting for the filter, the input interpolation weights may be modified to apply the correct amount of blending to the samples (or weighted combinations thereof) generated by the decoding unit for each quad of pixels. In general, the pixels within the 4×4 block of pixels are selectively mirrored about neither, one or both of: (i) a horizontal axis passing through the centre of the block of pixels, and (ii) a vertical axis passing through the centre of the block of pixels, so that the position at which the decoded value is to be determined is within a particular quadrant (e.g. the top left quadrant) of the block of pixels.
[0248]
[0249]
[0250]
[0251]
[0252]
[0253]
[0254] If the position at which the decoded value is to be determined is within the footprint of the “00” pixel in the 2×2 quad 1508 then two mirroring operations may be performed: (i) a mirroring operation through a horizontal axis through the centre of the 4×4 block (e.g. as described above with reference to
[0255] We now provide a detailed example of how the weights may be determined by the filter parameter setup module 1312. The minor LOD (p.sub.λ) and the major LOD (p.sub.μ) parameters received at the filter parameter setup module 1312 are shifted to determine a shifted minor LOD parameter (p′.sub.λ) and a shifted major LOD parameter (p′.sub.μ) such that
where the values of p′.sub.λ and p′.sub.μ may be rounded in accordance with a rounding mode, e.g. a round to nearest mode. The above shifting operation has been designed to better map the filtering response to the expected sampling characteristics of filtered textures (such as mipmaps) but in some examples may be skipped. In particular, since exact reconstruction of the source pixels is only possible when p′.sub.λ=p′.sub.μ=0 and p.sub.μ≥p.sub.λ≥0, an “interpolating” filter may skip the above shifting operation.
[0256] The values of p′.sub.λ and p′.sub.μ are used to define the logarithmic anisotropic ratio p.sub.α, such that p.sub.α=p′.sub.μ−p′.sub.λ. The direction of anisotropy is given in terms of the sine and cosine of the angle φ from the texture x axis:
[0257] The square of the sine s.sup.2, weighted by the logarithmic anisotropic ratio p.sub.α, determines the xy axis variance correction p.sub.δ, such that p.sub.δ=s.sup.2.Math.p.sub.α, where the value of p.sub.δ may be rounded in accordance with a rounding mode, e.g. a round to nearest mode.
[0258] With this, the weights of the xy axes closest to the minor and major axes are determined as w.sub.λ, and w.sub.μ respectively as w.sub.λ=1−(p.sub.λ+p.sub.δ) and w.sub.μ=1−(p.sub.μ−p.sub.δ).
[0259] The sign of the cosine determines the correspondence between w.sub.λ and w.sub.μ and w.sub.x and w.sub.y, the axis coefficient weights:
[0260] The sine of twice the angle φ (2sc), weighted by the logarithmic anisotropic ratio p.sub.α, determines the (scaled) xy covariance weight w.sub.xy, such that w.sub.xy=2.Math.s.Math.c.Math.p.sub.α
[0261] The three weights w.sub.x, w.sub.y and w.sub.xy may be used as the weights for the dx, dy and dxy coefficients respectively. As described above, in some examples, the weights are multiplied by transfer functions before being used as the weights for the dx, dy and dxy coefficients. An example of how this can be done by the filter parameter setup module 1312 is now described in order to determine sets of weights for the dx, dy and dxy coefficients for each pixel quad in a 4×4 block of pixels, as well as modifying the input interpolation weights (u, v) to feed to the bilinear interpolation unit 1314.
[0262] Since the interpolation patch (e.g. the area 1510 shown in
[0263] A value of s.sub.x=−1 means that the block of pixels is mirrored across a vertical axis (e.g. as shown in
where the Boolean function sgn( ) returns true for negative arguments and returns false for positive arguments.
[0264] The mirrored coordinates u.sub.mirror and v.sub.mirror are then given (in units of pixels) as:
[0265] The global coefficient weights w.sub.x, w.sub.y and w.sub.xy are used to control the strength of the dx, dy and dxy coefficients associated with each pixel quad in the 4×4 patch. As described above the transfer functions are used to linearly extrapolate the dx and dy coefficients over the patch, saturating at the vertical and horizontal mirror lines respectively:
(w.sub.x0,w.sub.x1)=(2s.sub.x.Math.w.sub.x.Math.u.sub.mirror,−2s.sub.x.Math.w.sub.x) and (w.sub.y0,w.sub.y1)=(2s.sub.y.Math.w.sub.y.Math.v.sub.mirror,−2s.sub.y.Math.w.sub.y)
[0266] Note that the s.sub.x and s.sub.y terms ensures the correct sign is applied to the dx and dy coefficients on decode.
[0267] The common dx weight for the (mirrored if mirror_x) top left and bottom left pixel quad is output as w.sub.x0.
[0268] The common dx weight for the (mirrored if mirror_x) top right and bottom right pixel quad is output as w.sub.x1.
[0269] The common dy weight for the (mirrored if mirror_y) top left and top right pixel quad is output as w.sub.y0.
[0270] The common dy weight for the (mirrored if mirror_y) bottom left and bottom right pixel quad is output as w.sub.y1.
[0271] The dxy coefficients undergo quadratic extrapolation, to which a constant contribution is added, weighted by the w.sub.xy coefficient, such that: w.sub.xy00=w.sub.xy+w.sub.x0.Math.w.sub.y0, w.sub.xy01=w.sub.xy+w.sub.x0.Math.w.sub.y1, w.sub.xy10=w.sub.xy+w.sub.x1.Math.w.sub.y0 and w.sub.xy11=w.sub.xy+w.sub.x1.Math.w.sub.y1
[0272] The dxy weights for the (mirrored if mirror_x/mirror_y) top left pixel quad, top right pixel quad, bottom left pixel quad and bottom right pixel quad are output as w.sub.xy00, w.sub.xy10, w.sub.xy01 and w.sub.xy11 respectively.
[0273] The modified bilinear interpolation factors u.sub.mirror and v.sub.mirror ensure that pixel quads are independently reconstructed at full resolution (by disabling interpolation) and that their dc coefficients employ ordinary bilinear interpolation, when dx, dy and dxy are zeroed, by applying an outward bias to the coordinates (clamping to zero) proportional to the dx and dy weighting:
[0274] The interpolation factors are output for consumption by the bilinear filter unit as the output signals u.sub.out and v.sub.out, such that
By substituting the earlier expression for |w.sub.x0|, we may write
from which we see that
when w.sub.x=0 (as in conventional (bi)linear interpolation), that u.sub.out=0 when
(indicating a region in which only the left half of the (mirrored) four quads is interpolated) and that
whenever u.sub.mirror=1 (such that a 50:50 blend is always applied on the vertical mirror line). Similarly, by substituting the earlier expression for |w.sub.y0|, we may write
from which we see that
when w.sub.y=0 (as in conventional (bi)linear interpolation), that v.sub.out=0 when
(indicating a region in which only the top half of the (mirrored) four quads is interpolated) and that
whenever u.sub.mirror=1 (such that a 50:50 blend is always applied on the horizontal mirror line).
[0275] There are many example encoding formats described above, and the way in which the compressed data structure is to be decoded to determine the signs and exponents of the differential coefficients from the first portion of the compressed data structure and to determine the representation of the average coefficient from the second portion of the compressed data structure would be apparent to a skilled person having read the above description of the way in which the data is encoded into the compressed data structure in the different examples.
[0276] For example, the first set of bits (e.g. 506.sub.1) of the first portion (e.g. 504) of the compressed data structure indicates the order of the magnitudes of the differential coefficients and may indicates how many of the differential coefficients are non-zero, so the first set of bits can be used by the decoding unit to determine how many of the differential coefficients are non-zero and to determine the ordering of the non-zero differential coefficients that are encoded in the compressed data structure in accordance with the encoding format used (as described above).
[0277] As described above, the second set of bits may comprise: (i) a first section (e.g. 507.sub.1) which indicates a sign for each of the non-zero differential coefficients, and (ii) a second section (e.g. 507.sub.2) which indicates an exponent for of the non-zero differential coefficients. So the decoding unit can use the second set of bits to determine the sign and exponent for each of the non-zero differential coefficients in accordance with the encoding format used (as described above). The first section is comprised of a first series of bits, which indicates a sign for each of said one or more of the differential coefficients which are non-zero, and the second section is comprised of a second series of bits, which indicates an exponent for each of said one or more of the differential coefficients which are non-zero, of the second set of bits.
[0278] As described above, if one or more of the differential coefficients are zero, the first section (e.g. 507.sub.1) may indicate a mantissa bit for the differential coefficient which has the largest magnitude. In this case, the determination of the result of the weighted sum of the differential coefficients and the average coefficient for the 2×2 quad of pixels that is performed in step S1406 comprises including two terms in the weighted sum for the differential coefficient which has the largest magnitude, the two terms being: (i) the weight for that differential coefficient shifted in accordance with the exponent for that differential coefficient, and (ii) the weight for that differential coefficient shifted in accordance with an amount that is one less than the exponent for that differential coefficient. In this way the weighted sum is still performed just as a sum of (up to three) shifted weights which is simple to implement in the decoding unit, and in particular does not include complex operations such as multiplication or division operations.
[0279] In some examples described above (e.g. with reference to
[0280] In some examples described above (e.g. with reference to
[0281] In some examples described above (e.g. with reference to
[0282] In some examples described above (e.g. with reference to
[0283] In some examples described above (e.g. with reference to
[0284]
[0285] The encoding and/or decoding units (202, 602, 902, 1102 and/or 1302) are shown as comprising a number of functional blocks. This is schematic only and is not intended to define a strict division between different logic elements of such entities. Each functional block may be provided in any suitable manner. It is to be understood that intermediate values described herein as being formed by an encoding unit or a decoding unit need not be physically generated by the encoding and/or decoding unit at any point and may merely represent logical values which conveniently describe the processing performed by the encoding and/or decoding unit between its input and output.
[0286] The encoding and/or decoding units described herein may be embodied in hardware on an integrated circuit. The encoding and/or decoding units described herein may be configured to perform any of the methods described herein. Generally, any of the functions, methods, techniques or components described above can be implemented in software, firmware, hardware (e.g., fixed logic circuitry), or any combination thereof. The terms “module,” “functionality,” “component”, “element”, “unit”, “block” and “logic” may be used herein to generally represent software, firmware, hardware, or any combination thereof. In the case of a software implementation, the module, functionality, component, element, unit, block or logic represents program code that performs the specified tasks when executed on a processor. The algorithms and methods described herein could be performed by one or more processors executing code that causes the processor(s) to perform the algorithms/methods. Examples of a computer-readable storage medium include a random-access memory (RAM), read-only memory (ROM), an optical disc, flash memory, hard disk memory, and other memory devices that may use magnetic, optical, and other techniques to store instructions or other data and that can be accessed by a machine.
[0287] The terms computer program code and computer readable instructions as used herein refer to any kind of executable code for processors, including code expressed in a machine language, an interpreted language or a scripting language. Executable code includes binary code, machine code, bytecode, code defining an integrated circuit (such as a hardware description language or netlist), and code expressed in a programming language code such as C, Java or OpenCL. Executable code may be, for example, any kind of software, firmware, script, module or library which, when suitably executed, processed, interpreted, compiled, executed at a virtual machine or other software environment, cause a processor of the computer system at which the executable code is supported to perform the tasks specified by the code.
[0288] A processor, computer, or computer system may be any kind of device, machine or dedicated circuit, or collection or portion thereof, with processing capability such that it can execute instructions. A processor may be or comprise any kind of general purpose or dedicated processor, such as a CPU, GPU, NNA, System-on-chip, state machine, media processor, an application-specific integrated circuit (ASIC), a programmable logic array, a field-programmable gate array (FPGA), or the like. A computer or computer system may comprise one or more processors.
[0289] It is also intended to encompass software which defines a configuration of hardware as described herein, such as HDL (hardware description language) software, as is used for designing integrated circuits, or for configuring programmable chips, to carry out desired functions. That is, there may be provided a computer readable storage medium having encoded thereon computer readable program code in the form of an integrated circuit definition dataset that when processed (i.e. run) in an integrated circuit manufacturing system configures the system to manufacture an encoding and/or decoding unit configured to perform any of the methods described herein, or to manufacture an encoding and/or decoding unit comprising any apparatus described herein. An integrated circuit definition dataset may be, for example, an integrated circuit description.
[0290] Therefore, there may be provided a method of manufacturing, at an integrated circuit manufacturing system, an encoding and/or decoding unit as described herein. Furthermore, there may be provided an integrated circuit definition dataset that, when processed in an integrated circuit manufacturing system, causes the method of manufacturing an encoding and/or decoding unit to be performed.
[0291] An integrated circuit definition dataset may be in the form of computer code, for example as a netlist, code for configuring a programmable chip, as a hardware description language defining hardware suitable for manufacture in an integrated circuit at any level, including as register transfer level (RTL) code, as high-level circuit representations such as Verilog or VHDL, and as low-level circuit representations such as OASIS® and GDSII. Higher level representations which logically define hardware suitable for manufacture in an integrated circuit (such as RTL) may be processed at a computer system configured for generating a manufacturing definition of an integrated circuit in the context of a software environment comprising definitions of circuit elements and rules for combining those elements in order to generate the manufacturing definition of an integrated circuit so defined by the representation. As is typically the case with software executing at a computer system so as to define a machine, one or more intermediate user steps (e.g. providing commands, variables etc.) may be required in order for a computer system configured for generating a manufacturing definition of an integrated circuit to execute code defining an integrated circuit so as to generate the manufacturing definition of that integrated circuit.
[0292] An example of processing an integrated circuit definition dataset at an integrated circuit manufacturing system so as to configure the system to manufacture an encoding and/or decoding unit will now be described with respect to
[0293]
[0294] The layout processing system 1904 is configured to receive and process the IC definition dataset to determine a circuit layout. Methods of determining a circuit layout from an IC definition dataset are known in the art, and for example may involve synthesising RTL code to determine a gate level representation of a circuit to be generated, e.g. in terms of logical components (e.g. NAND, NOR, AND, OR, MUX and FLIP-FLOP components). A circuit layout can be determined from the gate level representation of the circuit by determining positional information for the logical components. This may be done automatically or with user involvement in order to optimise the circuit layout. When the layout processing system 1904 has determined the circuit layout it may output a circuit layout definition to the IC generation system 1906. A circuit layout definition may be, for example, a circuit layout description.
[0295] The IC generation system 1906 generates an IC according to the circuit layout definition, as is known in the art. For example, the IC generation system 1906 may implement a semiconductor device fabrication process to generate the IC, which may involve a multiple-step sequence of photo lithographic and chemical processing steps during which electronic circuits are gradually created on a wafer made of semiconducting material. The circuit layout definition may be in the form of a mask which can be used in a lithographic process for generating an IC according to the circuit definition. Alternatively, the circuit layout definition provided to the IC generation system 1906 may be in the form of computer-readable code which the IC generation system 1906 can use to form a suitable mask for use in generating an IC.
[0296] The different processes performed by the IC manufacturing system 1902 may be implemented all in one location, e.g. by one party. Alternatively, the IC manufacturing system 1902 may be a distributed system such that some of the processes may be performed at different locations, and may be performed by different parties. For example, some of the stages of: (i) synthesising RTL code representing the IC definition dataset to form a gate level representation of a circuit to be generated, (ii) generating a circuit layout based on the gate level representation, (iii) forming a mask in accordance with the circuit layout, and (iv) fabricating an integrated circuit using the mask, may be performed in different locations and/or by different parties.
[0297] In other examples, processing of the integrated circuit definition dataset at an integrated circuit manufacturing system may configure the system to manufacture an encoding and/or decoding unit without the IC definition dataset being processed so as to determine a circuit layout. For instance, an integrated circuit definition dataset may define the configuration of a reconfigurable processor, such as an FPGA, and the processing of that dataset may configure an IC manufacturing system to generate a reconfigurable processor having that defined configuration (e.g. by loading configuration data to the FPGA).
[0298] In some embodiments, an integrated circuit manufacturing definition dataset, when processed in an integrated circuit manufacturing system, may cause an integrated circuit manufacturing system to generate a device as described herein. For example, the configuration of an integrated circuit manufacturing system in the manner described above with respect to
[0299] In some examples, an integrated circuit definition dataset could include software which runs on hardware defined at the dataset or in combination with hardware defined at the dataset. In the example shown in
[0300] The implementation of concepts set forth in this application in devices, apparatus, modules, and/or systems (as well as in methods implemented herein) may give rise to performance improvements when compared with known implementations. The performance improvements may include one or more of increased computational performance, reduced latency, increased throughput, and/or reduced power consumption. During manufacture of such devices, apparatus, modules, and systems (e.g. in integrated circuits) performance improvements can be traded-off against the physical implementation, thereby improving the method of manufacture. For example, a performance improvement may be traded against layout area, thereby matching the performance of a known implementation but using less silicon. This may be done, for example, by reusing functional blocks in a serialised fashion or sharing functional blocks between elements of the devices, apparatus, modules and/or systems. Conversely, concepts set forth in this application that give rise to improvements in the physical implementation of the devices, apparatus, modules, and systems (such as reduced silicon area) may be traded for improved performance. This may be done, for example, by manufacturing multiple instances of a module within a predefined area budget.
[0301] The applicant hereby discloses in isolation each individual feature described herein and any combination of two or more such features, to the extent that such features or combinations are capable of being carried out based on the present specification as a whole in the light of the common general knowledge of a person skilled in the art, irrespective of whether such features or combinations of features solve any problems disclosed herein. In view of the foregoing description it will be evident to a person skilled in the art that various modifications may be made within the scope of the invention.