HYBRID FRAMEWORK-BASED IMAGE BIT-DEPTH EXPANSION METHOD AND DEVICE

Abstract

The present disclosure provides a hybrid framework-based image bit-depth expansion method and device. The invention fuses a traditional de-banding algorithm and a depth network-based learning algorithm, and can remove unnatural effects in an image flat area whilst more realistically restoring numerical information of missing bits. The method comprises the extraction of image flat areas, local adaptive pixel value adjustment-based flat area bit-depth expansion and convolutional neural network-based non-flat area bit-depth expansion. The present invention uses a learning-based method to train an effective depth network to solve the problem of realistically restoring missing bits, whilst using a simple and robust local adaptive pixel value adjustment method in an flat area to effectively inhibit unnatural effects in the flat area such as banding, a ringing and flat noise, improving subjective visual quality of the flat area.

Claims

1. A hybrid framework-based image bit-depth expansion method, comprising extraction of image flat areas, local adaptive pixel value adjustment-based flat area bit-depth expansion and convolutional neural network-based non-flat area bit-depth expansion; comprising the following steps: the first step, extracting flat areas of an image, dividing the image into flat areas and non-flat areas; the second step, as for the flat areas, bit-depth expansion based on local adaptive pixel value adjustment; comprising: first, determining a bit-expanded pixel that differs from its neighboring pixel value; then, according to bit-depth information of pixels around the bit-expanded pixel, a local adaptive pixel value adjustment-based method is used to adaptively adjust the value of the bit-expanded pixel, thereby eliminating unnatural effects in flat areas of a low bit-depth image; the third step, as for the non-flat areas, bit-depth expansion based on a convolutional neural network; comprising: first, a large number of low bit-depth images and corresponding high bit-depth images are used to train a convolutional neural network learned with amplified residual; then, using the trained convolutional neural network to reconstruct more accurate numerical information of missing bits; completing a non-flat area bit-depth expansion process based on the convolutional neural network; through the above steps, the bit-depth expansion of the image is realized.

2. The hybrid framework-based image bit-depth expansion method of claim 1, wherein, in the first step, dividing the image into flat areas and non-flat areas comprises the following steps: 11) measuring the degree of local numerical change using local average pixel value difference information, and the calculation method is as shown in Formula 1: $\begin{matrix} D = \frac{1}{P} .Math. {.Math.}_{i = 1}^{P} .Math. .Math. (g_{i} - \overline{g}) & (Formula .Math. .Math. 1) \end{matrix}$ wherein, D is a local pixel value difference; P is the total number of pixels in the local; g.sub.i(i=1, 2, . . . , P) is a pixel in the local; is a local average pixel value; 12) obtaining an average local pixel value difference D of the whole image according to each local pixel value difference D; 13) local pixel values of the flat areas change gradually, that is, the value of the difference D thereof is small; a coefficient is introduced, and if the local area satisfies DD, the local area is a flat area, otherwise it is a non-flat area.

3. The hybrid framework-based image bit-depth expansion method of claim 1, wherein, the second step is to use a method based on local adaptive pixel value adjustment for the flat areas of the image to eliminate unnatural effects of the flat areas in the low bit-depth image; specifically, Assuming that an input is a low bit-depth image Y with a bit-depth of 1-bits, and a high bit-depth image X obtained after bit-depth expansion has a bit-depth of h-bits; banding effect of the flat areas appear at a pixel with a bit value difference of 1, and a pixel g.sub.c satisfying |g.sub.ig.sub.c|=1 is referred to as a contour pixel, wherein g.sub.i is a neighboring pixel of the contour pixel g.sub.c; 8 pixels adjacent to a pixel are used as neighboring pixels thereof; for each contour pixel, a boosting factor + and an inhibitor factor are calculated by Formula 2 and Formula 3, respectively: $\begin{matrix} _{+} = \frac{1}{8} .Math. {.Math.}_{i = 1}^{8} .Math. .Math. S (g_{i} - g_{c} = 1), S (x) = {\begin{matrix} 1, if .Math. .Math. x .Math. .Math. is .Math. .Math. true .Math. \\ 0, if .Math. .Math. x .Math. .Math. is .Math. .Math. false \end{matrix} & (Formula .Math. .Math. 2) \\ _{-} = \frac{1}{8} .Math. {.Math.}_{i = 1}^{8} .Math. .Math. S (g_{i} - g_{c} = - 1), S (x) = {\begin{matrix} 1, if .Math. .Math. x .Math. .Math. is .Math. .Math. true .Math. \\ 0, if .Math. .Math. x .Math. .Math. is .Math. .Math. false \end{matrix} & (Formula .Math. .Math. 3) \end{matrix}$ then, the pixel value of the contour pixel after bit-depth promotion is obtained by Formula 4:
g.sub.c*=ZP(g.sub.c)+(.sub.+.sub.)2.sup.hl(Formula 4) wherein, ZP (g.sub.c) uses a zero padding method for the pixel value g.sub.c to promote the bit-depth; is an adjustment value parameter, which can be adjusted to increase or decrease the degree of denoising.

4. The hybrid framework-based image bit-depth expansion method of claim 3, wherein, the value of the adjustment value parameter is 0.125.

5. The hybrid framework-based image bit-depth expansion method of claim 1, wherein, in the third step, during the training of the convolutional neural network, a low bit-depth image Y is used as an input, and the residual (XY) of a high bit-depth image X corresponding to Y and Y is amplified by a factor , and the value (XY) is used as the ground truth value; for the input Y, the reconstruction result of the convolutional neural network F is defined as F(Y), and the final reconstructed high bit-depth image is expressed as Formula 5: $\begin{matrix} X^{*} = Y + \frac{F (Y)}{} & (Formula .Math. .Math. 5) \end{matrix}$ wherein, $= \frac{2^{h} - 1}{2^{h - l} - 1};$ is a high bit-depth image reconstructed using the convolutional neural network.

6. The hybrid framework-based image bit-depth expansion method of claim 5, wherein, the structure of the convolutional neural network includes: a. an input layer composed of 33 convolution kernels, and the input layer inputs an input image and outputs 64 feature images; b. a convolutional layer composed of 33 convolution kernels, outputting 64 feature images; c. a batch normalization layer; d. an ReLU activation function layer; e. a convolutional layer composed of 33 convolution kernels, outputting 64 feature images; f. a batch normalization layer; g. an ReLU activation function layer; h. a convolutional layer composed of 33 convolution kernels, outputting 64 feature images; i. a batch normalization layer; j. an ReLU activation function layer; k. a convolutional layer composed of 11 convolution kernels, outputting 64 feature images; l. a batch normalization layer; m. an ReLU activation function layer; n. an output layer composed of a 33 convolution kernels, outputting a reconstructed residual image F(Y).

7. A hybrid framework-based image bit-depth expansion device obtained by the hybrid framework-based image bit-depth expansion method of one of claims 1 to 6, comprising a processor, configured to execute the following program modules stored in a memory: an image flat area extraction module, a local adaptive pixel value adjustment-based flat area bit-depth expansion module and a convolutional neural network-based non-flat area bit-depth expansion module; the image flat area extraction module, for dividing an image into flat areas and non-flat areas; the local adaptive pixel value adjustment-based flat area bit-depth expansion module, for the flat areas, based on local adaptive pixel value adjustment, to eliminate unnatural effects in the flat areas of a low-bit-depth image; the convolutional neural network-based non-flat area bit-depth expansion module, used for bit-depth expansion of non-flat areas based on convolutional neural networks.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

[0032] FIG. 1 is a flowchart of the present disclosure.

[0033] FIG. 2 is a flowchart of the convolutional neural network method used in the present invention.

[0034] FIG. 3 is a comparison diagram of the subjective quality of reconstructed images when reconstructing images from 6-bits to 8-bits using the present invention and several prior methods;

[0035] Wherein, the upper left image is a schematic representation of the location of the selected flat area in the image, and the results of the bit-depth expansion of the image flat area using different methods are: (a) a zero expansion method, (b) a bit replication method, and (c) an ideal gain method, (d) the method of the present invention, and (e) the real image.

DETAILED DESCRIPTION OF EMBODIMENTS

[0036] Hereinafter, the present disclosure is further described through the embodiments, but the scope of the present disclosure is not limited in any manner.

[0037] The invention provides a hybrid framework-based image bit-depth expansion method, and by fusing a traditional de-banding algorithm and a depth network-based learning algorithm, the method can well remove unnatural effects of image flat areas whilst more realistically restore numerical information of missing bits.

[0038] FIG. 1 shows the flow of the method of the present invention, comprising: extraction of image flat areas, local adaptive pixel value adjustment-based flat area bit-depth expansion and convolutional neural network-based non-flat area bit-depth expansion, specifically comprises the following steps:

[0039] The first step, extraction of flat areas of an image.

[0040] The application measures the degree of local numerical change using local average pixel value difference information, and the calculation method is as shown in formula 1:

[00004] $\begin{matrix} D = \frac{1}{P} .Math. {.Math.}_{i = 1}^{P} .Math. .Math. (g_{i} - \overline{g}) & (Formula .Math. .Math. 1) \end{matrix}$

[0041] Wherein, P is the total number of pixels in the local, g.sub.i(=1, 2, . . . , P) is a pixel in the local, g is a local average pixel value. After calculating every local pixel value difference D, an average local pixel value difference D of the entire image can be obtained. Since local pixel values of the flat areas change gradually, that is, the value of the difference D of an flat area is small, by introducing a coefficient , if the local area satisfies DD, the local area is a flat area, otherwise it is a non-flat area.

[0042] The second step, bit-depth expansion based on local adaptive pixel value adjustment for flat areas.

[0043] In flat areas of an image, the present invention uses a method based on local adaptive pixel value adjustment to eliminate unnatural effects such as banding effect and noise in flat areas in a low bit-depth image. Assuming that an input is a low-bit-depth image Y with a bit-depth of 1-bits, and a high bit-depth image X obtained after bit-depth expansion has a bit-depth of h-bits.

[0044] Because banding effect of the flat areas appear at a pixel with a bit value difference of 1, a pixel g.sub.c satisfying |g.sub.ig.sub.c|=1 is referred to as a contour pixel, wherein g.sub.i is a neighboring pixel of the contour pixel g.sub.c. In the present invention, 8 pixels adjacent to a pixel are used as its neighboring pixels.

[0045] For each contour pixel, a boosting factor + and an inhibitor factor thereof can be calculated by Formula 2 and Formula 3, respectively:

[00005] $\begin{matrix} _{+} = \frac{1}{8} .Math. {.Math.}_{i = 1}^{8} .Math. .Math. S (g_{i} - g_{c} = 1), S (x) = {\begin{matrix} 1, if .Math. .Math. x .Math. .Math. is .Math. .Math. true .Math. \\ 0, if .Math. .Math. x .Math. .Math. is .Math. .Math. false \end{matrix} & (Formula .Math. .Math. 2) \\ _{-} = \frac{1}{8} .Math. {.Math.}_{i = 1}^{8} .Math. .Math. S (g_{i} - g_{c} = - 1), S (x) = {\begin{matrix} 1, if .Math. .Math. x .Math. .Math. is .Math. .Math. true .Math. \\ 0, if .Math. .Math. x .Math. .Math. is .Math. .Math. false \end{matrix} & (Formula .Math. .Math. 3) \end{matrix}$

[0046] Then, the pixel value of the contour pixel after bit-depth promotion can be obtained by Formula 4:

g.sub.c*=ZP(g.sub.c)+(.sub.+.sub.)1.sup.hl(Formula 4))

[0047] Wherein, ZP (g.sub.c) uses a traditional zero padding method for the pixel value g.sub.c to promote the bit-depth, is an adjustment value parameter, and the default value of is 0.125, and a can be adjusted to increase or decrease the degree of denoising.

[0048] The third step, bit-depth expansion based on a convolutional neural network for non-flat areas.

[0049] The present invention uses a convolutional neural network learned with amplified residual to learn reconstruction of missing bit-depth of an image. In the bit-depth expansion, since the missing bits and the added bits are both the last bits, the magnitude of the missing bits is small. In order to effectively train the network to reconstruct the missing bit information, during the training process of the depth network, the present invention uses a low bit-depth image Y as an input, and takes the value (XY) as the ground truth value after the residual (XY) of the corresponding high bit-depth image X and Y is amplified by the factor .

[0050] For the input Y, the reconstruction result of the convolutional neural network F is defined as F(Y), and the final reconstructed high bit-depth image is expressed as Formula 5:

[00006] $\begin{matrix} X^{*} = Y + \frac{F (Y)}{} & (Formula .Math. .Math. 5) \end{matrix}$

[0051] Wherein,

[00007] $= \frac{2^{h} - 1}{2^{h - l} - 1}; X^{*}$

is the high bit-depth image reconstructed using the convolutional neural network.

[0052] The structure of the convolutional neural network used in the present invention is shown in FIG. 2, and its structure is:

[0053] a. an input layer composed of 33 convolution kernels, and the input layer inputs an input image and outputs 64 feature images;

[0054] b. a convolutional layer composed of 33 convolution kernels, outputting 64 feature images;

[0055] c. a batch normalization layer;

[0056] d. an ReLU activation function layer;

[0057] e. a convolutional layer composed of 33 convolution kernels, outputting 64 feature images;

[0058] f. a batch normalization layer;

[0059] g. an ReLU activation function layer;

[0060] h. a convolutional layer composed of 33 convolution kernels, outputting 64 feature images;

[0061] i. a batch normalization layer;

[0062] j. an ReLU activation function layer;

[0063] k. a convolutional layer composed of 11 convolution kernels, outputting 64 feature images;

[0064] l. a batch normalization layer;

[0065] m. an ReLU activation function layer;

[0066] n. an output layer composed of a 33 convolution kernels, outputting a reconstructed residual image F(Y).

[0067] The adaptive pixel value adjustment method can better suppress unnatural effects in flat areas, but it will introduce blurring of non-flat areas and further lose high frequency detail information; the method based on a convolutional neural network can more accurately restore missing bit values, but the subjective quality of flat areas is still affected by banding effect and noise. By combining advantages of the two methods, the results based on the adaptive pixel adjustment method are used in flat areas, and the results of convolutional neural network reconstruction are used in non-flat areas, and the two methods are combined to finally obtain high bit-depth images.

[0068] Table 1 shows the comparison of the peak signal-to-noise ratio (PSNR) effect of the method of the present invention and three traditional methods (Zero Padding, Bit Replication, and Ideal Gain) on several data sets during image reconstruction from 6-bits to 8-bits, and Set5, Set14, Kodak and B100 are 4 image test sets. It can be seen from numerical comparison that by using a learning method based on a convolutional neural network, the numerical information of the missing bits can be more accurately recovered, thereby restoring more realistic high-frequency details of the texture.

[0069] Table 1 Comparison of average peak signal-to-noise ratio (PSNR) of reconstruction from 6-bits to 8-bits on different test sets.

TABLE-US-00001 TABLE 1 Method Zero Bit Ideal of the Padding Replication Gain present method method method application Set5 43.26 45.53 46.29 46.64 Set14 42.87 45.58 46.18 46.22 Kodak 42.61 45.59 46.10 46.36 B100 42.64 45.17 45.87 46.32

[0070] FIG. 3 shows the comparison of subjective quality of the reconstructed image of the present invention and several prior traditional methods when reconstructing images from 6-bits to 8-bits. It can be seen that banding effect and noise of the flat areas of the image reconstructed by the method of the present invention are both well suppressed, thus improving subjective visual quality.

[0071] It needs to be noted that the embodiments as disclosed are intended to facilitating further understanding of the present disclosure; however, those skilled in the art may understand that various substitutions and modifications are possible without departing from the spirit and scope of the present disclosure. Therefore, the present disclosure should not be limited to the contents disclosed in the embodiments, but should be governed by the appended claims.

HYBRID FRAMEWORK-BASED IMAGE BIT-DEPTH EXPANSION METHOD AND DEVICE

Inventors

Cpc classification

Classification Explorer

H04N1/58

ELECTRICITY

Classification Explorer

G06T5/00

PHYSICS

Classification Explorer

H04N1/6072

ELECTRICITY

Classification Explorer

G06T7/11

PHYSICS

Classification Explorer

G06T5/94

PHYSICS

Classification Explorer

G06N3/08

PHYSICS

Classification Explorer

G06T2207/20012

PHYSICS

Classification Explorer

G06T2207/20084

PHYSICS

Classification Explorer

G06N3/045

PHYSICS

Classification Explorer

G06T2207/20081

PHYSICS

International classification

Classification Explorer

G06T5/00

PHYSICS

Classification Explorer

G06T7/11

PHYSICS

Abstract

Claims

Description