Method and apparatus for encoding color mapping information and processing pictures based on color mapping information
10805506 · 2020-10-13
Assignee
Inventors
- Yannick OLIVIER (Cesson-Sevigne, FR)
- Sébastien Lasserre (Cesson-Sevigne, FR)
- Pierre ANDRIVON (Cesson-Sevigne, FR)
- Philippe Bordes (Cesson-Sevigne, FR)
Cpc classification
H04N1/6063
ELECTRICITY
International classification
Abstract
Color mapping information can be used to transform one color to another color. The present embodiments provide a solution for representing the color mapping information using a successive application of multiple color mapping functions. Parameters for the multiple color mapping functions can be encoded into a bitstream. In one embodiment, color mapping functions are consecutively applied on their own domains of definition only. In another embodiment, the first color mapping (CRI1) is applied on its domain of definition only, but the second color mapping is applied only on samples that have been previously color mapped by CRI1 and which are also inside the domain of definition of the second color mapping function. At the decode side, the multiple color mapping functions can be reconstructed and successively applied to a decoded picture to generate another picture.
Claims
1. A method for processing a picture, comprising: accessing a first color mapping function, input of the first color mapping function being defined on a first subset of a color space, wherein a color mapping function transforms a sample in said picture from one color to another color; accessing a second color mapping function, input of the second color mapping function being defined on a second subset of said color space, wherein the second subset is different from the first subset of said color space; applying said first color mapping function to said picture to form a mapped picture; selecting samples, in said mapped picture, belonging to said second subset of said color space; and applying said second color mapping function to said selected samples to form an output picture.
2. The method of claim 1, wherein the second subset for the second color mapping function is a subset of the first subset for the first color mapping function.
3. The method of claim 1, further comprising: selecting samples, in said mapped picture, which are modified by said application of said first color mapping function, wherein the second color mapping function is applied to said selected modified samples.
4. The method of claim 1, wherein each of said first color mapping function and said second color mapping function is represented by a first piece-wise linear function applied to each color component, followed by a three-by-three matrix and a second piece-wise linear function applied to each color component.
5. The method of claim 4, wherein each of said first piece-wise linear function and said second piece-wise linear function has uniform pivot point intervals.
6. A method for encoding color mapping information, comprising: accessing a first color mapping function and a second color mapping function, wherein a color mapping function transforms a sample in a picture from one color to another color, wherein a successive application of the first color mapping function and the second color mapping function is used to represent the color mapping information, input of said first color mapping function being defined on a first subset of a color space, and input of said second color mapping function being defined on a second subset of said color space, wherein the second subset is different from the first subset of said color space, wherein said first color mapping function is for applying to said picture to form a mapped picture, wherein samples, in said mapped picture, belonging to said second subset of said color space are selected, and wherein said second color mapping function is for applying to said selected samples; encoding a first set of parameters indicative of the first color mapping function; encoding a second set of parameters indicative of the second color mapping function; and providing a bitstream including the first and second sets of parameters as output.
7. The method of claim 6, wherein the second subset of the second color mapping function is a subset of the first subset of the first color mapping function.
8. The method of claim 6, wherein the second color mapping function is only applied to samples that are modified by the application of the first color mapping function.
9. The method of claim 6, wherein the selected samples are used to determine the second color mapping function.
10. The method of claim 6, wherein each of said first color mapping function and said second color mapping function is represented by a first piece-wise linear function applied to each color component, followed by a three-by-three matrix and a second piece-wise linear function applied to each color component.
11. An apparatus for processing a picture, comprising at least one memory and one or more processors, wherein said one or more processors are configured to: access a first color mapping function, input of the first color mapping function being defined on a first subset of a color space, wherein a color mapping function transforms a sample in said picture from one color to another color; access a second color mapping function, input of the second color mapping function being defined on a second subset of said color space, wherein the second subset is different from the first subset of said color space; apply said first color mapping function to said picture to form a mapped picture; select samples, in said mapped picture, belonging to said second subset of said color space; and apply said second color mapping function to said selected samples to form an output picture.
12. The apparatus of claim 11, wherein the second subset for the second color mapping function is a subset of the first subset for the first color mapping function.
13. The apparatus of claim 11, wherein said one or more processors are further configured to: select samples, in said mapped picture, which are modified by said application of said first color mapping function, wherein the second color mapping function is applied to said selected modified samples.
14. The apparatus of claim 11, wherein each of said first color mapping function and said second color mapping function is represented by a first piece-wise linear function applied to each color component, followed by a three-by-three matrix and a second piece-wise linear function applied to each color component.
15. The apparatus of claim 14, wherein each of said first piece-wise linear function and said second piece-wise linear function has uniform pivot point intervals.
16. An apparatus for encoding color mapping information, comprising at least one memory and one or more processors, wherein said one more processors are configured to: access a first color mapping function and a second color mapping function, wherein a color mapping function transforms a sample in a picture from one color to another color, wherein a successive application of the first color mapping function and the second color mapping function is used to represent the color mapping information, input of said first color mapping function being defined on a first subset of a color space, and input of said second color mapping function being defined on a second subset of said color space, wherein the second subset is different from the first subset of said color space, wherein said first color mapping function is for applying to a picture to form a mapped picture, wherein samples, in said mapped picture, belonging to said second subset of said color space are selected, and wherein said second color mapping function is for applying to said selected samples; encode a first set of parameters indicative of the first color mapping function; encode a second set of parameters indicative of the second color mapping function; and provide a bitstream including the first and second sets of parameters as output.
17. The apparatus of claim 16, wherein the second subset for the second color mapping function is a subset of the first subset for the first color mapping function.
18. The apparatus of claim 16, wherein the second color mapping function is only applied to samples that are modified by the application of the first color mapping function.
19. The apparatus of claim 16, wherein the selected samples are used to determine the second color mapping function.
20. The apparatus of claim 16, wherein each of said first color mapping function and said second color mapping function is represented by a first piece-wise linear function applied to each color component, followed by a three-by-three matrix and a second piece-wise linear function applied to each color component.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
DETAILED DESCRIPTION
(13) Color transform, also referred to as color mapping and color remapping in the present application, can be used in a variety of applications. For example, because of the wide range of color formats, of capture capability and of display characteristics, color mapping may be used to render decoded images onto a display device. In another example, a video may be color graded multiple times for different purposes, wherein color grading is a process of altering/enhancing the colors of the video. For instance, a colorist may color grade a movie such that the movie is represented in a wide color gamut (WCG) and has a look for theatres, and another colorist may color grade the movie such that the movie is represented in a smaller gamut and has a look for home entertainment. Each color graded version of the movie corresponds to an artistic intent and may depend on the capabilities of the targeted display or application.
(14) A transmitter may only transmit the home entertainment version and a set of color mapping information, which indicates how colors in the home entertainment version may be mapped to the theatre version. To represent the set of color mapping information, a color mapping function can be determined, in order to minimize the difference between the mapped pictures (e.g., CMF(home entertainment version)) and the target pictures (e.g., the theatre version), for example, using a psycho-visual metric. At the receiver side, the home entertainment version can be mapped to the theatre version using the color mapping information.
(15) Also a transmitter may only transmit the theatre version and a set of color mapping information, which indicates how the colors in the theater version may be mapped to the home entertainment version. At the receiver side, the theatre version can be mapped to the home entertainment version using the color mapping information. Thus, rather than transmitting both versions, only one version can be transmitted, and the other version is recovered using the color mapping information. This approach usually requires much less bandwidth than transmitting both versions, while still preserving the possibility of displaying either version at the display device.
(16) More generally, in order to enable a display device to display either version, metadata representing color mapping information may be signaled in a bitstream. Encoding such color mapping metadata makes it possible to display various versions of the content, and enhance the transmitted coded video if a display is capable of displaying data enhanced by the color mapping information. Transmitting the color mapping information also makes it possible to gracefully degrade a wide color gamut graded content while preserving the artistic intent.
(17) In a draft edition of HEVC (Edition 2 Draft Text of High Efficiency Video Coding (HEVC), JCTVC-R1013, hereinafter JCTVC-R1013), color transform information is defined in CRI (Color Remapping Information) as shown in Table 1, with line numbers added in the table for ease of reference. The CRI can be applied to HEVC, HEVC Range Extension, Scalability (SHVC) and Multi-View (MV-HEVC) Extensions. In particular, the color remapping model used in the color remapping information SEI message is composed of a pre set of syntax elements, which may be used to construct a first piece-wise linear function applied to each color component, a three-by-three matrix, which may be applied to the three color components, and a post set of syntax elements, which may be used to reconstruct a second piece-wise linear function applied to each color component.
(18) TABLE-US-00001 TABLE 1 CRI syntax as defined in a draft version of HEVC Descriptor Line # colour_remapping_info( payloadSize ) { colour_remap_id ue(v) 1 colour_remap_cancel_flag u(1) 2 if( !colour_remap_cancel_flag ) { 3 colour_remap_persistence_flag u(1) 4 colour_remap_video_signal_info_present_flag u(1) 5 if( colour_remap_video_signal_info_present_flag ) { 6 colour_remap_full_range_flag u(1) 7 colour_remap_primaries u(8) 8 colour_remap_transfer_function u(8) 9 colour_remap_matrix_coefficients u(8) 10 } 11 colour_remap_input_bit_depth u(8) 12 colour_remap_bit_depth u(8) 13 for( c = 0; c < 3; c++ ) { 14 pre_lut_num_val_minus1[ c ] u(8) 15 if( pre_lut_num_val_minus1[ c ] > 0 ) 16 for( i = 0; i<=pre_lut_num_val_minus1[ c ]; i++ ) { 17 pre_lut_coded_value[ c ][ i ] u(v) 18 pre_lut_target_value[ c ][ i ] u(v) 19 } 20 } 21 colour_remap_matrix_present_flag u(1) 22 if( colour_remap_matrix_present_flag ) { 23 log2_matrix_denom u(4) 24 for( c = 0; c < 3; c++ ) 25 for( i = 0; i < 3; i++ ) 26 colour_remap_coeffs[ c ][ i ] se(v) 27 } 28 for( c = 0; c < 3; c++ ) { 29 post_lut_num_val_minus1[ c ] u(8) 30 if( post_lut_num_val_minus1[ c ] > 0 ) 31 for( i = 0; i<=post_lut_num_val_minus1[ c ]; i++ ) { 32 post_lut_coded_value[ c ][ i ] u(v) 33 post_lut_target_value[ c ][ i ] u(v) 34 } 35 } 36 } 37 } 38
(19) Semantics
(20) colour_remap_id contains an identifying number that may be used to identify the purpose of the colour remapping information. The value of colour_remap_id shall be in the range of 0 to 2.sup.322, inclusive.
(21) Values of colour_remap_id from 0 to 255 and from 512 to 2.sup.311 may be used as determined by the application. Values of colour_remap_id from 256 to 511, inclusive, and from 2.sup.31 to 2.sup.322, inclusive are reserved for future use by ITU-T|ISO/IEC. Decoders shall ignore all colour remapping information SEI messages containing a value of colour_remap_id in the range of 256 to 511, inclusive, or in the range of 2.sup.31 to 2.sup.322, inclusive, and bitstreams shall not contain such values.
(22) colour_remap_cancel_flag equal to 1 indicates that the colour remapping information SEI message cancels the persistence of any previous colour remapping information SEI message in output order that applies to the current layer. colour_remap_cancel_flag equal to 0 indicates that colour remapping information follows.
(23) colour_remap_persistence_flag specifies the persistence of the colour remapping information SEI message for the current layer.
(24) colour_remap_persistence_flag equal to 0 specifies that the colour remapping information applies to the current picture only.
(25) colour_remap_video_signal_info_present_flag equal to 1 specifies that syntax elements colour_remap_full_range_flag, colour_remap_primaries, colour_remap_transfer_function and colour_remap_matrix_coefficients are present, colour_remap_video_signal_info_present_flag equal to 0 specifies that syntax elements colour_remap_full_range_flag, colour_remap_primaries, colour_remap_transfer_function and colour_remap_matrix_coefficients are not present.
(26) colour_remap_full_range_flag has the same semantics as specified in clause E.3.1 of JCTVC-R1013 for the video_full_range_flag syntax element, except that colour_remap_full_range_flag specifies the colour space of the remapped reconstructed picture, rather than the colour space used for the CLVS (Coded Layer-wise Video Sequence). When not present, the value of colour_remap_full_range_flag is inferred to be equal to the value of video_full_range_flag.
(27) colour_remap_primaries has the same semantics as specified in clause E.3.1 of JCTVC-R1013 for the colour_primaries syntax element, except that colour_remap_primaries specifies the colour space of the remapped reconstructed picture, rather than the colour space used for the CLVS. When not present, the value of colour_remap_primaries is inferred to be equal to the value of colour_primaries.
(28) colour_remap_transfer_function has the same semantics as specified in clause E.3.1 of JCTVC-R1013 for the transfer_characteristics syntax element, except that colour_remap_transfer_function specifies the colour space of the remapped reconstructed picture, rather than the colour space used for the CLVS. When not present, the value of colour_remap_transfer_function is inferred to be equal to the value of transfer_characteristics.
(29) colour_remap_matrix_coefficients has the same semantics as specified in clause E.3.1 of JCTVC-R1013 for the matrix_coeffs syntax element, except that colour_remap_matrix_coefficients specifies the colour space of the remapped reconstructed picture, rather than the colour space used for the CLVS. When not present, the value of colour_remap_matrix_coefficients is inferred to be equal to the value of matrix_coeffs.
(30) colour_remap_input_bit_depth specifies the bit depth of the luma and chroma components or the RGB components of the associated pictures for purposes of interpretation of the colour remapping information SEI message. When any colour remapping information SEI messages is present with the value of colour_remap_input_bit_depth not equal to the bit depth of the coded luma and chroma components or that of the coded RGB components, the SEI message refers to the hypothetical result of a transcoding operation performed to convert the coded video to a converted video with bit depth equal to colour_remap_input_bit_depth.
(31) The value of colour_remap_input_bit_depth shall be in the range of 8 to 16, inclusive. Values of colour_remap_input_bit_depth from 0 to 7, inclusive, and from 17 to 255, inclusive, are reserved for future use by ITU-T|ISO/IEC. Decoders shall ignore all colour remapping SEI messages that contain a colour_remap_input_bit_depth in the range of 0 to 7, inclusive, or in the range of 17 to 255, inclusive, and bitstreams shall not contain such values.
(32) colour_remap_bit_depth specifies the bit depth of the output of the colour remapping function described by the colour remapping information SEI message.
(33) The value of colour_remap_bit_depth shall be in the range of 8 to 16, inclusive. Values of colour_remap_bit_depth from 0 to 7, inclusive, and in the range of 17 to 255, inclusive, are reserved for future use by ITU-T|ISO/IEC. Decoders shall ignore all colour remapping SEI messages that contain a value of colour_remap_bit_depth from 0 to 7, inclusive, or in the range of 17 to 255, inclusive.
(34) pre_lut_num_val_minus1[c] plus 1 specifies the number of pivot points in the piece-wise linear remapping function for the c-th component, where c equal to 0 refers to the luma or G component, c equal to 1 refers to the Cb or B component, and c equal to 2 refers to the Cr or R component. When pre_lut_num_val_minus1[c] is equal to 0, the default end points of the input values are 0 and 2.sup.colour_remap_input_bit_depth1, and the corresponding default end points of the output values are 0 and 2.sup.color_remap_bit_depth1, for the c-th component. In bitstreams conforming to this version of this Specification, the value of pre_lut_num_val_minus1[c] shall be in the range of 0 to 32, inclusive.
(35) pre_lut_coded_value[c][i] specifies the value of the i-th pivot point for the c-th component. The number of bits used to represent pre_lut_coded_value[c][i] is ((colour_remap_input_bit_depth+7)>>3)<<3.
(36) pre_lut_target_value[c][i] specifies the value of the i-th pivot point for the c-th component. The number of bits used to represent pre_lut_target_value[c][i] is ((colour_remap_bit_depth+7)>>3)<<3.
(37) colour_remap_matrix_present_flag equal to 1 indicates that the syntax elements log 2_matrix_denom and colour_remap_coeffs[c][i], for c and i in the range of 0 to 2, inclusive, are present. colour_remap_matrix_present_flag equal to 0 indicates that the syntax elements log 2_matrix_denom and colour_remap_coeffs[c][i], for c and i in the range of 0 to 2, inclusive, are not present.
(38) log 2_matrix_denom specifies the base 2 logarithm of the denominator for all matrix coefficients. The value of log 2_matrix_denom shall be in the range of 0 to 15, inclusive. When not present, the value of log 2_matrix_denom is inferred to be equal to 0.
(39) colour_remap_coeffs[c][i] specifies the value of the three-by-three colour remapping matrix coefficients. The value of colour_remap_coeffs[c][i] shall be in the range of 2.sup.15 to 2.sup.151, inclusive. When colour_remap_coeffs[c][i] is not present, it is inferred to be equal to 1 if c is equal to i, and inferred to be equal to 0 otherwise.
(40) The variable matrixOutput[c] for c=0, 1 and 2 is derived as follows:
(41) roundingOffset=log 2_matrix_denom==0?0:1<<(log 2_matrix_denom1) matrixOutput[c]=Clip3(0, (1<<colour_remap_bit_depth)1, (colour_remap_coeffs[c][0]*matrixInput[0]+colour_remap_coeffs[c][1]*matrixInput[1]+colour_remap_coeffs [c][2]*matrixInput[2]+roundingOffset)>>log 2_matrix_denom)
where matrixInput[c] is the input sample value of the c-th colour component, and matrixOutput[c] is the output sample value of the c-th colour component.
(42) post_lut_num_val_minus1[c] has the same semantics as pre_lut_num_val_minus1[c], with pre replaced by post, except that the default end points of the input values are 0 and 2.sup.colour_remap_bit_depth1 for the c-th colour component. The value of post_lut_num_val_minus1[c] shall be in the range of 0 to 32, inclusive.
(43) post_lut_coded_value[c][i] has the same semantics as pre_lut_coded_value[c][i], with pre replaced by post, except that the number of bits used to represent post_lut_coded_value[c][i] is ((colour_remap_bit_depth+7)>>3)<<3.
(44) post_lut_target_value[c][i] has the same semantics as pre_lut_target_value[c][i], with pre replaced by post.
(45)
(46) Referring back to Table 1, syntax elements pre_lut_num_val_minus1, pre_lut_coded_value and pre_lut_target_value (lines 14-21 in Table 1) can be used to represent the first 1D LUT F1, syntax elements log 2_matrix_denom and colour_remap_coeffs (lines 22-28 in Table 1) can be used to represent matrix M, and syntax elements post_lut_num_val_minus1, post_lut_coded_value and post_lut_target_value (lines 29-36 in Table 1) can be used to represent the second 1D LUT F21.
(47) The HEVC standard defines the parameters for the CMF, but does not mandate the method of reconstructing the CMF.
(48) As shown in
(49) More generally, the color mapping may be defined in the entire color space or a subset of the color space.
(50) When the color mapping function is represented using different methods, for example, using 1D LUTs and 33 matrix, the accuracy of color mapping may be reduced. For example,
(51) Using the HEVC CRI signaling as an example, each of the pre set of syntax elements or the post set of syntax elements may support up to 33 pivot points (pre_lut_num_val_minus1[c] and post_lut_num_val_minus1[c] are in the range of 0 to 32) in the 1D LUT. In order to represent the color mapping function such that the mapped picture has a good quality (for example, the mapped picture is close to a target picture), selection of the pivot points should usually consider the constraints in the number of pivot points available and the quality of the mapped picture. For example, critical colors, such as colors human eyes are more sensitive to, or of greater statistical importance, should usually get finer representation. Generally, for the critical colors, there should be smaller intervals between pivot points in order to provide a more accurate representation.
(52) However, selecting the pivot points in consideration of statistics and in consideration of the human vision may conflict with each other. For example, we consider an image that includes blue sky (with the values of the B component ranging from 10-63) that corresponds to 95% of the B component and a blue bicycle (with the values of the B component around 56) that correspond to 1% of the B component. A piece-wise linear curve based on statistics may choose to have 32 pivot points at values 10-41 (a pivot point at each value of 10-41) and another pivot point at value 63 such that most samples get a good representation. To map the blue bicycle, the mapping for colors around 56 is interpolated, which may be quite off from the intended mapping. Since the blue bicycle is in the foreground, the distorted blue color in the bike may appear quite pronounced and affect the perceived visual quality.
(53) In another example, we consider an image where there are many red clothes, a red lip and a red nail, where red clothes corresponding to the most samples in the R component, and the red lip and red nail correspond to few samples. Similar to the previous example, pivot points chosen based on statistics cause the colors of red lip/nail to appear brownish, which becomes annoying to human eyes.
(54) In addition to the number of parameters that are available to represent the color mapping function, hardware implementation cost may impose another constraint. We observe that uniform pivot point intervals (i.e., all intervals between two adjacent pivot points have the same distance) in the domain is a preferred hardware implementation. Thus, it is also desirable to consider uniform intervals when designing parameters to represent the color mapping function. For example, when the domain is [0, 1023] for a 10-bit video, the pivot points are at 0, 31, 62, . . . 31*i, . . . , 1023. However, this uniform representation does not provide finer representation for more critical colors, for example, the colors corresponding to skin tones in a picture (for example, 31-62 in B component) may not be mapped very well.
(55) The present principles are directed to a method and apparatus for using multiple color mapping functions to improve the representation of the color mapping information. In particular, we propose different solutions that can provide finer representations for critical colors while also respecting the hardware implementation consideration. Such representation of the color mapping function may be implemented by re-using existing syntax elements. At the decoder side, the color mapping functions can be decoded and then be applied to process a decoded picture. In the following, we use two successive color mapping functions to discuss different embodiments. The present principles can also be applied when more rounds of color mappings are used.
(56) In one embodiment, a first color mapping function can be generated, based on, for example, the color mapping information obtained from two different color gradings. Then an input picture is first mapped using the first color mapping function to form Remap1, for example, one with pivot points at uniform intervals in the domain of definition [0, 1023] in the R component. The remapped picture (Remap1) may have color artifacts because some critical colors are not mapped accurately. Thus, a set of samples may be selected, for example, by an operator manually through a user interface, for further adjustments. The CMF creator generates a second color mapping function for the selected samples (in a different domain of definition, for example, 31-62 for R component). Samples in the input picture corresponding to the selected samples then go through a second color mapping to improve the quality. Subsequently, the mapping result Remap2 from the second CMF which corresponds to the selected samples, and samples from the mapping result Remap1 which correspond to the remaining samples are combined to form the output picture.
(57) In another embodiment, as shown in
(58)
(59) At step 640, it determines a second color mapping function, for example, based on samples selected for further adjustment. At step 650, it applies the second color mapping on the mapped picture (Remap1), for example, it transforms Remap1 to an SDR (Standard Dynamic Range) picture. At step 660, it encodes the first and second color mapping functions and the input picture into a bitstream. At step 670, it outputs the bitstream. Method 600 ends at step 699.
(60) Thus, according to the present embodiments, color mapping functions can be applied successively and one can encode multiple color mapping functions. Correspondingly, color mapping functions can be applied successively to process a picture at the decoder side. When applying the successive color mappings, different rules, for example, as to how the second color mapping function is applied, can be defined. Which rule is to be used can be signaled in the bitstream or known a priori at both the encoder and decoder side. In the following, we discuss two different rules for successive color mappings in further detail.
(61) For ease of notation, we denote the first color mapping as CRI1, and the first color mapping function as f.sub.CRI1, which is defined on a first domain D.sub.CRI1, and we denote the second color mapping as CRI2, and the second color mapping function as f.sub.CRI2, which is defined on a second domain D.sub.CRI2. Both D.sub.CRI1 and D.sub.CRI2 can correspond to the entire possible input colors or a subset thereof. Usually D.sub.CRI1 and D.sub.CRI2 are different, for example, D.sub.CRI2 may be a subset of D.sub.CRI1, D.sub.CRI1 and D.sub.CRI2 may overlap, or D.sub.CRI1 and D.sub.CRI2 may not overlap. Function f.sub.CRI1 or f.sub.CRI2 may be any color mapping function, for example, those we discussed above. Outside the domains of definition of the color mapping functions, an identity function can be used for color mapping (i.e., the input color is not changed after mapping).
(62) Rule 1
(63) In one embodiment, two color mapping functions are consecutively applied on their own domains of definition only.
Remap1=f.sub.CRI1(D.sub.CRI1)
(64) The domain of the second color mapping function is D.sub.CRI2, which is shown within the solid line in
Remap2=f.sub.CRI2(D.sub.CRI2)(Remap1\D.sub.CRI2).(2)
When Remap1 does not include the entire D.sub.CRI2, the second color mapping f.sub.CRI2 is applied on (Remap1D.sub.CRI2) to be strict, thus, the second color mapping can also be written as
Remap2=f.sub.CRI2(Remap1D.sub.CRI2)(Remap1\D.sub.CRI2).(3)
(65) Rule 2
(66) In another embodiment, the first color mapping CRI1 is applied on its domain of definition, but the second color mapping CRI2 is applied only on samples that have been previously color mapped by CRI1 and which are also inside the domain of definition of function f.sub.CRI2.
(67) Same as the previous rule, the output after CRI1 can be written as a combination of the mapping results from both D.sub.CRI1 and
Remap1=f.sub.CRI1(D.sub.CRI1)
(68) The domain of the second color mapping function is D.sub.CRI2, which is shown within the solid line in
Remap2=f.sub.CRI2(f.sub.CRI1(D.sub.CRI1)D.sub.CRI2)(Remap1\D.sub.CRI2)
(69) Two different rules for successively applying color transforms are discussed above. Rule 1 can be easier to implement, but affects non-mapped samples (with which the operator may be already satisfied) and may cause new problems. Rule 2 only affects mapped samples so the operator has exact control, but it needs to identify which samples are selected so the implementation is more difficult. Based on the user requirements or other inputs, the encoder may choose one rule over the other one.
(70) In the above, we mainly discussed applying color mappings in a subset of the color space. The present principles can also be applied to a spatial region of the picture. For example, the color mappings can only be applied to a spatial window within the picture. To indicate which spatial region is color mapped, additional syntax elements (xmin, ymin) and (xmax, ymax) can be used to indicate the top-left and bottom-right pixel coordinates of the spatial window, respectively. Or additional syntax elements (xmin, ymin) and (xsize, ysize) can be used to indicate the top-left pixel coordinates and the window size in the number of pixels of the spatial window respectively.
(71) Parameters related to different color mapping functions can be signaled in the bitstream. In one embodiment, several sets of HEVC CRI are encoded in the bitstream before the video coded picture(s) to which it applies (CRI applies to the reconstructed pictures of the same layer (for example, with the same layer_id) the CRI SEI belongs to), in the order that they are applied. In another embodiment, the CRI application order is derived from another syntax element such as colour_remap_id. The present principles can also be applied to other video compression standard that define parameters for color mapping functions.
(72)
(73)
(74) When the second color mapping function is defined on a domain of definition that is a subset of all possible colors, it selects (930) samples that fall within the domain of the second color mapping function from the mapped picture (Remap1). When the second rule as described in Eq. (5) is used, the samples are selected only if they are previously mapped in the first mapping. The second mapping function is applied (940) to the selected samples in Remap1, and other samples are not changed.
(75) In the above, we use WCG HDR and SRD pictures to illustrate the color mappings. The present principles can also be applied to color mappings between other formats of pictures.
(76) Advantageously, the present embodiments can use several color mapping functions in order to capture local variations in the pictures. It may be combined with local spatial window to allow the application of the mappings on the samples inside the local spatial window only. The successive applications of different color mappings also allow for correcting/improving the first color mapping with the subsequent color mapping, without developing more complex color mapping functions, and thereby reducing the implementation costs.
(77)
(78) The system 1000 may include at least one processor 1010 configured to execute instructions loaded therein for implementing the various processes as discussed above. Processor 1010 may include embedded memory, input output interface and various other circuitries as known in the art. The system 1000 may also include at least one memory 1020 (e.g., a volatile memory device, a non-volatile memory device). System 1000 may additionally include a storage device 1040, which may include non-volatile memory, including, but not limited to, EEPROM, ROM, PROM, RAM, DRAM, SRAM, flash, magnetic disk drive, and/or optical disk drive. The storage device 1040 may comprise an internal storage device, an attached storage device and/or a network accessible storage device, as non-limiting examples. System 1000 may also include an encoder/decoder module 1030 configured to process data to provide an encoded video or decoded video.
(79) Encoder/decoder module 1030 represents the module(s) that may be included in a device to perform the encoding and/or decoding functions. As is known, a device may include one or both of the encoding and decoding modules. Additionally, encoder/decoder module 1030 may be implemented as a separate element of system 1000 or may be incorporated within processors 1010 as a combination of hardware and software as known to those skilled in the art.
(80) Program code to be loaded onto processors 1010 to perform the various processes described hereinabove may be stored in storage device 1040 and subsequently loaded onto memory 1020 for execution by processors 1010. In accordance with the exemplary embodiments of the present principles, one or more of the processor(s) 1010, memory 1020, storage device 1040 and encoder/decoder module 1030 may store one or more of the various items during the performance of the processes discussed herein above, including, but not limited to the modulation value, the SDR video, the HDR video, equations, formula, matrices, variables, operations, and operational logic.
(81) The system 1000 may also include communication interface 1050 that enables communication with other devices via communication channel 1060. The communication interface 1050 may include, but is not limited to a transceiver configured to transmit and receive data from communication channel 1060. The communication interface may include, but is not limited to, a modem or network card and the communication channel may be implemented within a wired and/or wireless medium. The various components of system 1000 may be connected or communicatively coupled together using various suitable connections, including, but not limited to internal buses, wires, and printed circuit boards.
(82) The exemplary embodiments according to the present principles may be carried out by computer software implemented by the processor 1010 or by hardware, or by a combination of hardware and software. As a non-limiting example, the exemplary embodiments according to the present principles may be implemented by one or more integrated circuits. The memory 1020 may be of any type appropriate to the technical environment and may be implemented using any appropriate data storage technology, such as optical memory devices, magnetic memory devices, semiconductor-based memory devices, fixed memory and removable memory, as non-limiting examples. The processor 1010 may be of any type appropriate to the technical environment, and may encompass one or more of microprocessors, general purpose computers, special purpose computers and processors based on a multi-core architecture, as non-limiting examples.
(83) Referring to
(84) The data transmission system 1100 receives processed data and other information from a processor 1101. In one implementation, the processor 1101 generates color mapping information based on two color gradings of the same video and represents the color information using two color mapping functions, for example, using method 500. The processor 1101 may also provide metadata to 1100 indicating, for example, the rule as to how the second color mapping function is applied.
(85) The data transmission system or apparatus 1100 includes an encoder 1102 and a transmitter 1104 capable of transmitting the encoded signal. The encoder 1102 receives data information from the processor 1101. The encoder 1102 generates an encoded signal(s).
(86) The encoder 1102 may include sub-modules, including for example an assembly unit for receiving and assembling various pieces of information into a structured format for storage or transmission. The various pieces of information may include, for example, coded or uncoded video, and coded or uncoded elements. In some implementations, the encoder 1102 includes the processor 1101 and therefore performs the operations of the processor 1101.
(87) The transmitter 1104 receives the encoded signal(s) from the encoder 1102 and transmits the encoded signal(s) in one or more output signals. The transmitter 1104 may be, for example, adapted to transmit a program signal having one or more bitstreams representing encoded pictures and/or information related thereto. Typical transmitters perform functions such as, for example, one or more of providing error-correction coding, interleaving the data in the signal, randomizing the energy in the signal, and modulating the signal onto one or more carriers using a modulator 1106. The transmitter 1104 may include, or interface with, an antenna (not shown). Further, implementations of the transmitter 1104 may be limited to the modulator 1106.
(88) The data transmission system 1100 is also communicatively coupled to a storage unit 1108. In one implementation, the storage unit 1108 is coupled to the encoder 1102, and stores an encoded bitstream from the encoder 1102. In another implementation, the storage unit 1108 is coupled to the transmitter 1104, and stores a bitstream from the transmitter 1104. The bitstream from the transmitter 1104 may include, for example, one or more encoded bitstreams that have been further processed by the transmitter 1104. The storage unit 1108 is, in different implementations, one or more of a standard DVD, a Blu-Ray disc, a hard drive, or some other storage device.
(89) Referring to
(90) The data receiving system 1200 may be, for example, a cell-phone, a computer, a set-top box, a television, or other device that receives encoded video and provides, for example, decoded video signal for display (display to a user, for example), for processing, or for storage. Thus, the data receiving system 1200 may provide its output to, for example, a screen of a television, a computer monitor, a computer (for storage, processing, or display), or some other storage, processing, or display device.
(91) The data receiving system 1200 is capable of receiving and processing data information. The data receiving system or apparatus 1200 includes a receiver 1202 for receiving an encoded signal, such as, for example, the signals described in the implementations of this application. The receiver 1202 may receive, for example, a signal providing one or more of a WCG HDR video and color mapping functions, or a signal output from the data transmission system 1100 of
(92) The receiver 1202 may be, for example, adapted to receive a program signal having a plurality of bitstreams representing encoded pictures. Typical receivers perform functions such as, for example, one or more of receiving a modulated and encoded data signal, demodulating the data signal from one or more carriers using a demodulator 1204, de-randomizing the energy in the signal, de-interleaving the data in the signal, and error-correction decoding the signal. The receiver 1202 may include, or interface with, an antenna (not shown). Implementations of the receiver 1202 may be limited to the demodulator 1204.
(93) The data receiving system 1200 includes a decoder 1206. The receiver 1202 provides a received signal to the decoder 1206. The signal provided to the decoder 1206 by the receiver 1202 may include one or more encoded bitstreams. The decoder 1206 outputs a decoded signal, such as, for example, decoded video signals including video information.
(94) The data receiving system or apparatus 1200 is also communicatively coupled to a storage unit 1207. In one implementation, the storage unit 1207 is coupled to the receiver 1202, and the receiver 1202 accesses a bitstream from the storage unit 1207. In another implementation, the storage unit 1207 is coupled to the decoder 1206, and the decoder 1206 accesses a bitstream from the storage unit 1207. The bitstream accessed from the storage unit 1207 includes, in different implementations, one or more encoded bitstreams. The storage unit 1207 is, in different implementations, one or more of a standard DVD, a Blu-Ray disc, a hard drive, or some other storage device.
(95) The output data from the decoder 1206 is provided, in one implementation, to a processor 1208. The processor 1208 is, in one implementation, a processor configured for performing the HDR to SDR mapping based on color mapping information. In some implementations, the decoder 1206 includes the processor 1208 and therefore performs the operations of the processor 1208. In other implementations, the processor 1208 is part of a downstream device such as, for example, a set-top box or a television.
(96) The implementations described herein may be implemented in, for example, a method or a process, an apparatus, a software program, a data stream, or a signal. Even if only discussed in the context of a single form of implementation (for example, discussed only as a method), the implementation of features discussed may also be implemented in other forms (for example, an apparatus or program). An apparatus may be implemented in, for example, appropriate hardware, software, and firmware. The methods may be implemented in, for example, an apparatus such as, for example, a processor, which refers to processing devices in general, including, for example, a computer, a microprocessor, an integrated circuit, or a programmable logic device. Processors also include communication devices, such as, for example, computers, cell phones, portable/personal digital assistants (PDAs), and other devices that facilitate communication of information between end-users.
(97) Reference to one embodiment or an embodiment or one implementation or an implementation of the present principles, as well as other variations thereof, mean that a particular feature, structure, characteristic, and so forth described in connection with the embodiment is included in at least one embodiment of the present principles. Thus, the appearances of the phrase in one embodiment or in an embodiment or in one implementation or in an implementation, as well any other variations, appearing in various places throughout the specification are not necessarily all referring to the same embodiment.
(98) Additionally, this application or its claims may refer to determining various pieces of information. Determining the information may include one or more of, for example, estimating the information, calculating the information, predicting the information, or retrieving the information from memory.
(99) Further, this application or its claims may refer to accessing various pieces of information. Accessing the information may include one or more of, for example, receiving the information, retrieving the information (for example, from memory), storing the information, processing the information, transmitting the information, moving the information, copying the information, erasing the information, calculating the information, determining the information, predicting the information, or estimating the information.
(100) Additionally, this application or its claims may refer to receiving various pieces of information. Receiving is, as with accessing, intended to be a broad term. Receiving the information may include one or more of, for example, accessing the information, or retrieving the information (for example, from memory). Further, receiving is typically involved, in one way or another, during operations such as, for example, storing the information, processing the information, transmitting the information, moving the information, copying the information, erasing the information, calculating the information, determining the information, predicting the information, or estimating the information.
(101) As will be evident to one of skill in the art, implementations may produce a variety of signals formatted to carry information that may be, for example, stored or transmitted. The information may include, for example, instructions for performing a method, or data produced by one of the described implementations. For example, a signal may be formatted to carry the bitstream of a described embodiment. Such a signal may be formatted, for example, as an electromagnetic wave (for example, using a radio frequency portion of spectrum) or as a baseband signal. The formatting may include, for example, encoding a data stream and modulating a carrier with the encoded data stream. The information that the signal carries may be, for example, analog or digital information. The signal may be transmitted over a variety of different wired or wireless links, as is known. The signal may be stored on a processor-readable medium.