CLASSIFICATION-BASED IMAGE MERGING, TUNING, CORRECTION, AND REPLACEMENT
20220392032 · 2022-12-08
Inventors
- Satya Mallick (San Jose, CA, US)
- Gurismar Singh (Bengaluru, IN)
- Pranav Mishra (Bengaluru, IN)
- Sowjana Konduri (Bengaluru, IN)
- Sunita Nayak (San Diego, CA, US)
- Steve Elich (San Jose, CA, US)
- William Botero (San Jose, CA, US)
- Ryan Ozubko (San Jose, CA, US)
CPC Classification
G06V10/26
PHYSICS
G06T2207/20016
PHYSICS
International Classification
Abstract
Methods for improving and modifying a High Dynamic Range (HDR) scene, captured as a series of images of the scene at different exposure levels, through classification-based image merging, tuning, correction, and replacement. The approach mixes images to improve the selection and display of both shadowed and highlighted details. The increased efficiency resulting from improvements in computational resource utilization of image processing hardware can, through the improved computational methods herein, significantly reduce the time required to generate and display a tone-mapped HDR image, a gamma-corrected HDR image, and/or a segmented and replaced HDR image.
Claims
1. A method of determining gamma correction in an image, comprising: receiving, by one or more processors, at least a first exposure image of a scene; receiving, by the one or more processors, at least a second exposure image of the scene, wherein the second exposure image of the scene has a shorter exposure time than the first exposure image of the scene; computing, by the one or more processors, a pixel value for a pixel location of a high dynamic range (HDR) image to be a sum of a pixel value of the first exposure image weighted by a first exposure weight and a pixel value of the second exposure image weighted by a second exposure weight, to produce a merged HDR image comprising Y bit data; adaptively mapping, by the one or more processors, the HDR image, to produce an output HDR image having Z bit data and a total number of pixels; applying, by the one or more processors, a range of gamma value correction levels to the output HDR image; detecting a number of pixels having a value of black level less than a predefined black level threshold; and selecting a tuned gamma value correction level.
2. The method of claim 1, further comprising: when selecting a tuned gamma value correction level, said number of pixels having a value of black level less than a predefined black level threshold is less than 0.025*(said total number of pixels).
3. A method of correcting detail obscured by brightness glare in an image, comprising: receiving, by one or more processors, at least a first exposure image of a scene; receiving, by the one or more processors, at least a second exposure image of the scene, wherein the second exposure image of the scene has a shorter exposure time than the first exposure image of the scene; computing, by the one or more processors, a refined mask, by performing a conjugation of the at least first exposure image of the scene and the at least second exposure image of the scene and selecting a number of pixels having a value of black level greater than a predefined black level threshold to form an unrefined mask of the scene, quantifying an amount of detail present in at least one portion of the second exposure image of the scene having a brightness level higher than the average brightness level of all pixels in the second exposure image of the scene, by applying a Laplacian to said second exposure image of the scene and applying a median blur denoising operation to form an intermediary laplacian mask, and selecting at least one pixel in at least one region of the intermediary laplacian mask that does not have a zero value; and computing a blended image by applying a gaussian pyramid operation and a laplacian pyramid merging of the at least second exposure image and an exposure fusion image using the refined mask.
4. A method of segmenting an image having sky by computing a pixel mask, comprising: receiving, by one or more processors, at least a first exposure image of a scene; receiving, by the one or more processors, at least a second exposure image of the scene, wherein the at least second exposure image of the scene has a shorter exposure time than the at least first exposure image of the scene; detecting, a number of pixels having a blue hue value greater than a predefined blue hue level threshold, greater than a red hue value for the number of pixels, and greater than a green hue value for the number of pixels; computing, by one or more processors, a detection mask by conjugating at least one mean blue hue mask and at least one threshold blue hue mask; and computing, by one or more processors, at least one group of pixels from the detection mask as sky, by detecting a largest group of pixels having a blue hue value greater than a predefined blue hue level threshold, greater than a red hue value for the number of pixels in the detection mask, and greater than a green hue value for the number of pixels in the detection mask, and designating pixels away from said largest group of pixels as not sky.
5. The method of claim 4, further comprising: computing, by one or more processors, a value mask by averaging the brightness value of the at least one group of pixels in a value channel; computing, by one or more processors, a threshold blue-red mask by detecting at least one pixel having a blue hue value greater than a predefined red hue level threshold; computing, by one or more processors, a threshold blue-green mask by detecting at least one pixel having a blue hue value greater than a predefined green hue level threshold; computing, by one or more processors, a combined detection mask by conjugating the threshold blue-red mask, threshold blue-green mask, and the value mask; computing, by one or more processors, a hue, saturation, and value mask by conjugating a hue threshold, a saturation threshold, and a value threshold of the at least one group of pixels of the at least second exposure image of the scene; computing, by one or more processors, a combined mask by applying a disjunction to the detection mask, combined detection mask, and hue, saturation, and value mask; computing, by one or more processors, a bright mask by conjugating at least a brightest intensity channel of the at least first exposure image of the scene and a brightest intensity channel of the at least second exposure image of the scene and retaining portions exceeding a defined brightness threshold; computing, by one or more processors, a sure sky mask by conjugating the combined mask and the bright mask and subsequently applying a morphological erosion; computing, by one or more processors, a finalized sky segmentation mask by calculating a probable foreground segment mask, calculating a probable background segment mask, iteratively segmenting a conjunction of the sure sky mask, the probable foreground segment mask, and the probable background segment mask to form a grab cut mask, and applying a disjunction to the grab cut mask and sure sky mask.
6. A method of classifying, segmenting, and replacing the sky in an image scene, comprising: classifying at least one group of pixels in an image scene as a sky portion by inputting a digitized image into a convolutional network of artificial neurons pretrained through the repeated convolution and pooling of at least one set of clear sky images and at least one set of sky images at least partially containing cloud cover; computing a pixel mask bounding the sky portion, wherein the pixel mask is calculated from the collection and convolution of at least a first exposure image of a scene and at least a second exposure image of the scene, wherein the at least second exposure image of the scene has a shorter exposure time than the at least first exposure image of the scene; and replacing the sky portion bounded by the pixel mask.
7. The method of claim 6, wherein replacing the sky portion bounded by the pixel mask occurs by applying a segmented interpolation.
8. The method of claim 6, wherein replacing the sky portion bounded by the pixel mask occurs by applying alpha blending to a whole-image replacement.
9. A method of classifying, segmenting, and replacing at least one portion of an image scene, comprising: classifying at least one group of pixels in an image scene as a relevant portion by inputting a digitized image into a convolutional network of artificial neurons pretrained through the repeated convolution and pooling of at least one image set containing at least one training replacement portion; computing a pixel mask bounding the relevant portion and calculated from the collection and convolution of at least a first exposure image of a scene and at least a second exposure image of the scene, wherein the at least second exposure image of the scene has a shorter exposure time than the at least first exposure image of the scene; and replacing the relevant portion bounded by the pixel mask.
10. The method of claim 9, wherein replacing the relevant portion bounded by the pixel mask occurs by applying a segmented interpolation.
11. The method of claim 9, wherein replacing the relevant portion bounded by the pixel mask occurs by applying alpha blending to a whole-image replacement.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0020] The above and other aspects, features and advantages of the invention will be more apparent from the following more particular description thereof, presented in conjunction with the following drawings wherein:
DETAILED DESCRIPTION
[0038] The present invention, comprising a classification-based image merging, tuning, correction, and replacement method, will now be described. In the following exemplary description numerous specific details are set forth in order to provide a more thorough understanding of embodiments of the invention. It will be apparent, however, to an artisan of ordinary skill that the present invention may be practiced without incorporating all aspects of the specific details described herein. Furthermore, although steps or processes are set forth in an exemplary order to provide an understanding of one or more systems and methods, the exemplary order is not meant to be limiting. One of ordinary skill in the art would recognize that the steps or processes may be performed in a different order, and that one or more steps or processes may be performed simultaneously or in multiple process flows without departing from the spirit or the scope of the invention. In other instances, specific features, quantities, or measurements well known to those of ordinary skill in the art have not been described in detail so as not to obscure the invention. It should be noted that although examples of the invention are set forth herein, the claims, and the full scope of any equivalents, are what define the metes and bounds of the invention.
[0039] For a better understanding of the disclosed embodiment, its operating advantages, and the specified object attained by its uses, reference should be made to the accompanying drawings and descriptive matter in which there are illustrated exemplary disclosed embodiments. The disclosed embodiments are not intended to be limited to the specific forms set forth herein. It is understood that various omissions and substitutions of equivalents are contemplated as circumstances may suggest or render expedient, but these are intended to cover the application or implementation.
[0040] The terms “first,” “second,” and the like herein do not denote any order, quantity, or importance, but rather are used to distinguish one element from another, and the terms “a” and “an” herein do not denote a limitation of quantity, but rather denote the presence of at least one of the referenced item.
[0041] As used herein, the term “substantially,” “about,” and similar terms are used as terms of approximation and not as terms of degree, and are intended to account for the inherent deviations in measured or calculated values that would be recognized by those of ordinary skill in the art. Further, the use of “may” when describing embodiments of the present invention refers to “one or more embodiments of the present invention.” As used herein, the terms “use,” “using,” and “used” may be considered synonymous with the terms “utilize,” “utilizing,” and “utilized,” respectively. Also, the term “exemplary” is intended to refer to an example or illustration.
[0042] As will be understood by one skilled in the art, for any and all purposes, such as in terms of providing a written description, all ranges disclosed herein also encompass any and all possible sub-ranges and combinations of sub-ranges thereof. Any listed range can be easily recognized as sufficiently describing and enabling the same range being broken down into at least equal halves, thirds, quarters, fifths, tenths, etc. As a non-limiting example, each range discussed herein can be readily broken down into a lower third, middle third and upper third, etc. As will also be understood by one skilled in the art all language such as “up to”, “at least”, “greater than”, “less than”, and the like include the number recited and refer to ranges which can be subsequently broken down into sub-ranges as discussed above. Finally, as will be understood by one skilled in the art, a range includes each individual member. Thus, for example, a group having 1-3 articles refers to groups having 1, 2, or 3 articles. Similarly, a group having 1-5 articles refers to groups having 1, 2, 3, 4, or 5 articles, and so forth. The phrases “and ranges in between” can include ranges that fall in between the numerical value listed. For example, “1, 2, 3, 10, and ranges in between” can include 1-1, 1-3, 2-10, etc. Similarly, “1, 5, 10, 25, 50, 70, 95, or ranges including and or spanning the aforementioned values” can include 1, 5, 10, 1-5, 1-10, 10-25, 10-95, 1-70, etc.
[0043] Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the present invention belongs. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and/or the present specification, and should not be interpreted in an idealized or overly formal sense, unless expressly so defined herein.
[0044] One or more embodiments of the present invention will now be described with reference to
[0047] In the first step of the ghosting detection process 18, the images are thresholded using two thresholds t.sub.d and t.sub.b, for each image 212, 214, and 216 where:
(I>t.sub.b or I<t.sub.d)=>I=0
Where I represents that respective image's intensity or brightness.
[0048] In the second step, for each pixel's intensity in each respective image 212, 214, and 216 check for the following condition:
I.sub.H.sup.i>I.sub.M.sup.i>I.sub.L.sup.i
where i is the pixel location, and I.sub.H, I.sub.M, I.sub.L are the high 216, medium 214, and low 212 exposure images, respectively.
[0049] In the third step, create a binary mask using the equation in step two, setting a mask pixel value to ‘1’ if the condition is ‘false’, and setting a mask pixel value to ‘0’ if the condition is ‘true.’ The white pixels in the resulting binary black/white ghosting mask indicate the pixels in the merged composite image 218 that contain a ghosting region 220.
[0050] In the final step, filter for noise and count the number of non-zero pixels in the merged composite image 218. Finally, threshold this number of non-zero pixels by dividing it by the total number of pixels in the image and testing for ghosting:
if (N.sub.non zero/N.sub.total > threshold) { deghost = true }
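The four-step ghosting test above can be sketched in NumPy as follows. The function name, the threshold defaults, and the 1% area criterion are illustrative assumptions, not values fixed by the specification:

```python
import numpy as np

def detect_ghosting(imgs, t_d=10, t_b=245, area_threshold=0.01):
    """Sketch of the ghosting test: threshold each exposure, check the
    I_H > I_M > I_L ordering per pixel, build a binary mask where that
    ordering fails, and flag ghosting when too many pixels fail."""
    # Step 1: zero out clipped pixels (too bright or too dark)
    low, med, high = [np.where((im > t_b) | (im < t_d), 0, im).astype(np.int32)
                      for im in imgs]
    # Steps 2-3: mask pixel is 1 where the exposure ordering is violated
    ghost_mask = (~((high > med) & (med > low))).astype(np.uint8)
    # Step 4: threshold the non-zero count against the total pixel count
    n_nonzero = int(ghost_mask.sum())
    return ghost_mask, n_nonzero / ghost_mask.size > area_threshold
```

In practice the noise filtering mentioned in the text (e.g. a small median filter on the mask) would precede the final count.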
[0051] Referring again to
[0053] Each of the four alignment fields 316, 318, 320, and 322 is divided into segments, and each segment is matched with a corresponding candidate segment in an alternate image. The overall displacement in alignment is calculated from the sum of L1 distances between the pixels of the reference segment and the search area, expanded by (4,4), in the alternate segment. This generates an aligned image set where, for each segment, there are offset values determining which segment to choose while merging the images 312 and 314.
[0054] After aligning the images 312 and 314, they can be merged first temporally and then spatially. In the initial temporal merging step, each segment is first merged between burst images based on an offset determined by the alignment process described above. Weighting for selection of a segment is determined based on the average distance between pixel values of aligned segments.
[0055] The temporal phase of post-alignment merging follows these equations:
O.sub.t(x,y)=Σ.sub.i=1.sup.n(W.sub.t.sup.i*I.sub.t.sup.i(x,y)/W.sub.s)+I.sub.r(x,y)/W.sub.s
Where,
W.sub.t.sup.i=1/ND.sub.t.sup.i if ND.sub.t.sup.i>290 else 0
ND.sub.t.sup.i=max(1,Σ.sub.x,y=1.sup.16,16(I.sub.t.sup.i(x,y)−I.sub.r(x,y))/256)
W.sub.s=Σ.sub.i=1.sup.3(W.sub.t.sup.i)
Where I.sub.t.sup.i is the underlying intensity value of segment ‘t’ in the i.sup.th exposure image and I.sub.r is the initially-segregated reference image.
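A per-tile sketch of this temporal merge, assuming 16x16 tiles, reading ND as an absolute tile difference, and treating the reference tile as carrying unit weight in the normalizer W_s (an assumption; the text defines W_s only over the alternate-frame weights):

```python
import numpy as np

def merge_tile(ref_tile, alt_tiles, nd_floor=290.0):
    """Weighted temporal merge of one tile across aligned burst frames."""
    ref = ref_tile.astype(np.float64)
    acc, w_s = ref.copy(), 1.0   # reference carries unit weight (assumption)
    for alt in alt_tiles:
        alt = alt.astype(np.float64)
        # ND: normalized absolute tile difference from the reference
        nd = max(1.0, np.abs(alt - ref).sum() / 256.0)
        w = 1.0 / nd if nd > nd_floor else 0.0
        acc += w * alt
        w_s += w
    return acc / w_s
```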
[0056] Following the temporal merging of images, the temporally-merged segments are now merged spatially to create an aligned and deghosted composite HDR image 26 (
[0057] Referring again to
{0: ‘BATHROOM’, 1: ‘BEDROOM’, 2: ‘CLOSET’, 3: ‘COMMON_AREA_ROOM’, 4: ‘DINING’, 5: ‘ENTRY_SHOT_FRONT_DOOR’, 6: ‘FRONT_EXTERIORS’, 7: ‘GARAGE’, 8: ‘HOA_COMMON_AREA_AMENITIES_EXTERIOR’, 9: ‘HOA_COMMON_AREA_AMENITIES_INTERIOR’, 10: ‘KITCHEN’, 11: ‘LAUNDRY’, 12: ‘REAR_EXTERIORS’, 13: ‘SPECIALTY’, 14: ‘STAIRS’, 15: ‘WHITE_BALANCE’}
The applied label is then used to select particular parameters and/or thresholds in later HDR image processing. Again, depending on the applicable context for image review and use, the classification labeling may differ from the above.
[0058] If ghosting was not detected following the application of the process 18 above, the images 12, 14, and 16 can be merged using a standard Mertens algorithm (via Exposure Fusion) 24, resulting in a manipulable HDR image 26. As part of the merge, the Exposure Fusion weights for contrast, saturation, and exposure are determined by the classification 22 of the scene as being either interior or exterior.
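A single-scale sketch of Mertens-style exposure fusion is shown below; the real algorithm blends the weighted images with Laplacian pyramids, and the exponent parameters stand in for the class-dependent contrast, saturation, and exposure weights. All names and constants here are illustrative assumptions:

```python
import numpy as np

def simple_exposure_fusion(imgs, w_con=1.0, w_sat=1.0, w_exp=1.0):
    """Per-pixel weights from contrast, saturation, and well-exposedness,
    then a normalized weighted average (single scale, for brevity)."""
    imgs = [im.astype(np.float64) / 255.0 for im in imgs]
    weights = []
    for im in imgs:
        gray = im.mean(axis=2)
        # contrast: magnitude of a discrete Laplacian
        lap = np.abs(np.gradient(np.gradient(gray, axis=0), axis=0)
                     + np.gradient(np.gradient(gray, axis=1), axis=1))
        sat = im.std(axis=2)                                  # saturation
        expo = np.exp(-((im - 0.5) ** 2 / 0.08).sum(axis=2))  # well-exposedness
        weights.append((lap + 1e-6) ** w_con * (sat + 1e-6) ** w_sat
                       * (expo + 1e-6) ** w_exp)
    w = np.stack(weights)
    w /= w.sum(axis=0, keepdims=True)
    fused = sum(wi[..., None] * im for wi, im in zip(w, imgs))
    return (fused * 255).astype(np.uint8)
```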
[0059] To improve image quality, the merged output HDR image 26 from either the classification 22 and Exposure Fusion merge 24, or the deghosted alignment and merging process 20, is tuned using gamma corrections 28 followed by correcting dark and bright regions 30 in the scene.
[0060] In this embodiment related to residential photography, the tuning algorithm is different for ‘interior’ and ‘exterior’ scenes. In interior scenes, one goal is to not allow black levels below a certain threshold. With an HDR image 26 as a starting point, the HDR image 26 can be adaptively mapped to produce an output HDR image having ‘Z’ bit data and a total number of pixels N.sub.total. A range of gamma values from 0.5 to 2.0 are then applied to alter the image 26, the number of pixels (N.sub.b) with values less than a black level threshold (t.sub.b) is determined, and then the lowest tuned gamma value correction level is selected, for which:
N.sub.b<0.025*N.sub.total
where N.sub.total is the total number of pixels.
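The interior gamma sweep can be sketched as follows, with an assumed 8-bit black-level threshold of 16 (the specification leaves t.sub.b unspecified); the sweep returns the lowest gamma whose black-pixel count satisfies the 2.5% criterion:

```python
import numpy as np

def tune_gamma(img, t_b=16, gammas=np.arange(0.5, 2.01, 0.1)):
    """Apply gammas from 0.5 to 2.0 and keep the lowest one for which
    fewer than 2.5% of pixels fall below the black-level threshold."""
    norm = img.astype(np.float64) / 255.0
    for g in sorted(gammas):
        out = norm ** g * 255.0
        n_b = int((out < t_b).sum())          # pixels below black level
        if n_b < 0.025 * out.size:
            return g, out.astype(np.uint8)
    return None, img   # no gamma satisfied the black-level criterion
```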
[0061] Further, interior scenes often include significant glare in brighter pixel regions, thus leading to an overall degradation of detail within brighter regions of the scene. To correct this, the omitted details are recovered from the lowest exposure image.
M.sub.b=AND(I.sub.b,I.sub.d)>t.sub.b, where I.sub.b is the brightest image and I.sub.d is the darkest image.
[0064] Referring again to
M.sub.i=Denoise(Laplacian(I.sub.d))
[0065] for each region R in M.sub.i: if N.sub.r<threshold, then M.sub.b[R]=0,
[0068] where N.sub.r is the number of non-zero pixels in region R.
[0069] Next is selecting 422 at least one pixel N.sub.r in at least one region R of the intermediary Laplacian mask M.sub.i 616 that does not have a zero value in comparison to the unrefined mask M.sub.b 518, and then keeping only those common regions in the unrefined mask M.sub.b 518 not having a zero value to form a refined mask M.sub.b (Refined) 618.
[0070] The final step in correcting detail obscured by brightness glare in an ‘interior’ image 410 is computing a blended image I.sub.blend 716 (
I.sub.blend=Blend(I.sub.d,I.sub.EF,M.sub.b(Refined))
The end result is a blended image I.sub.blend 716 where the brightest regions of the darker (e.g. under-exposed) second exposure image I.sub.d 514 bearing a greater amount of detail replace the overexposed regions in the simple Exposure Fusion image I.sub.EF 712 that lack detail.
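A compressed sketch of this interior glare correction follows: the bright mask, a Laplacian-based detail test on the dark exposure, and the final substitution. A hard paste stands in for the Gaussian/Laplacian pyramid blend of the text, the median-blur denoising step is omitted, and the thresholds are illustrative:

```python
import numpy as np

def glare_blend(i_bright, i_dark, i_ef, t_b=220, detail_floor=4):
    """Recover glare-obscured detail from the dark exposure I_d."""
    # unrefined mask M_b: bright in both exposures (AND read as both > t_b)
    m_b = np.minimum(i_bright, i_dark) > t_b
    # discrete Laplacian of the dark exposure as a cheap detail measure
    d = i_dark.astype(np.float64)
    lap = np.abs(4 * d
                 - np.roll(d, 1, 0) - np.roll(d, -1, 0)
                 - np.roll(d, 1, 1) - np.roll(d, -1, 1))
    # refined mask: bright regions where the dark exposure still has detail
    m_refined = m_b & (lap > detail_floor)
    out = i_ef.copy()
    out[m_refined] = i_dark[m_refined]   # paste stands in for pyramid blend
    return out, m_refined
```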
[0071] In contrast to interior images, exterior image gamma tuning can be accurately determined based on the image's colorfulness. Colorfulness (C) is generally calculated in the following way:
C=stdRoot+(0.3*meanRoot)
Where,
stdRoot=√(std.sub.B.sup.2+std.sub.YB.sup.2)
meanRoot=√(mean.sub.B.sup.2+mean.sub.YB.sup.2)
YB=absolute(0.5*(R+G)−B)
where R,G,B are the RGB channels of the exterior image, and mean.sub.X and std.sub.X are mean and standard deviations of channel x, respectively.
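The colorfulness measure can be computed exactly as the equations above define it, over the blue channel and the YB opponent channel:

```python
import numpy as np

def colorfulness(img):
    """C = stdRoot + 0.3*meanRoot, with YB = |0.5*(R+G) - B|.
    img is an H x W x 3 array in R, G, B order."""
    r, g, b = [img[..., i].astype(np.float64) for i in range(3)]
    yb = np.abs(0.5 * (r + g) - b)
    std_root = np.sqrt(b.std() ** 2 + yb.std() ** 2)
    mean_root = np.sqrt(b.mean() ** 2 + yb.mean() ** 2)
    return std_root + 0.3 * mean_root
```

For a uniform gray image YB is zero and both standard deviations vanish, so C reduces to 0.3 times the blue mean.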
[0072] In the context of exterior exposure fusion images I.sub.EF and white-balanced images I.sub.WB, Colorfulness is calculated on the exposure fusion result and the output of auto white balance on exposure fusion.
I.sub.WB=WhiteBalance(I.sub.EF)
C1=Colorfulness(I.sub.EF)
C2=Colorfulness(I.sub.WB)
If (C1<1.03*C2):
I.sub.EF=I.sub.WB
The output is gamma corrected as follows:
[0073] for gamma in [0.5,0.6,0.7,0.8,0.9,1.0,1.1,1.2]:
I.sub.gamma=GammaAdjustment(I.sub.EF,gamma)
if (maxMean.sub.gamma>128) and (minMean.sub.gamma>110):
I.sub.EF=I.sub.gamma
Where
maxMean.sub.gamma=max(Mean(R.sub.gamma),Mean(G.sub.gamma),Mean(B.sub.gamma))
minMean.sub.gamma=min(Mean(R.sub.gamma),Mean(G.sub.gamma),Mean(B.sub.gamma))
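A sketch of this exterior gamma sweep follows; each candidate gamma is applied to the original fusion image (the listing above could also be read as compounding the adjustments), and the last gamma satisfying both channel-mean conditions wins:

```python
import numpy as np

def select_gamma(i_ef, gammas=(0.5, 0.6, 0.7, 0.8, 0.9, 1.0, 1.1, 1.2)):
    """Keep the result while the brightest channel mean exceeds 128
    and the darkest channel mean exceeds 110."""
    out, chosen = i_ef.astype(np.float64), None
    norm = i_ef.astype(np.float64) / 255.0
    for gamma in gammas:
        cand = 255.0 * norm ** gamma
        means = [cand[..., c].mean() for c in range(3)]
        if max(means) > 128 and min(means) > 110:
            out, chosen = cand, gamma
    return chosen, out.astype(np.uint8)
```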
[0074] With a gamma-corrected (based on Colorfulness) exterior HDR composite image I.sub.gamma, we are then able to correct over-brightened regions of the image that lost detail due to over exposure (as done above for interior images 410). The difference in the context of an exterior image is that gradient values are not used to filter out the over-exposed regions:
M.sub.d=AND(INV(I.sub.d),INV(I.sub.b))
Where M.sub.d is the mask of the darkest regions in the exterior scene. This darkness mask is then utilized in a blend to create a brightness-corrected and gamma-tuned HDR exterior image composite:
I.sub.ext blend=Blend(I.sub.d,I.sub.gamma,M.sub.d)
[0075] After improving the visibility of details apparent in glare-obscured regions of a scene captured with HDR composite images, regions of that scene may need to be selected in their entirety for manipulation, correction, or wholesale replacement. In one embodiment of the present invention, a portion of an exterior image contains such a region, such that segmentation and replacement of that portion can be performed to replace the overcast appearance within the final image with something else, or to manipulate the appearance of the existing region by increasing blue levels or even enhancing contrast. In the present embodiment, the region for manipulation and/or replacement is an overcast sky.
[0078] Ideally in the sky region 816 (
M.sub.BR=(B−R)>th
M.sub.BG=(B−G)>th
M.sub.B=B>th
M.sub.1=AND(OR(M.sub.BR,M.sub.BG),M.sub.B)
A threshold blue hue mask M.sub.2 1010 (
meanB=mean(M.sub.1*B)
M.sub.2=B−meanB<th
M.sub.3=AND(M.sub.2,M.sub.1)
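The three blue-hue masks can be sketched as follows. Here `mean(M_1*B)` is read as the mean blue value over the M_1 pixels, and the numeric thresholds are illustrative stand-ins for the unspecified `th` values:

```python
import numpy as np

def blue_sky_masks(img, th_diff=20, th_b=120, th_mean=30):
    """Heuristic blue-sky detection masks M_1, M_2, M_3.
    img is an H x W x 3 array in R, G, B order."""
    r, g, b = [img[..., i].astype(np.float64) for i in range(3)]
    m_br = (b - r) > th_diff          # blue dominates red
    m_bg = (b - g) > th_diff          # blue dominates green
    m_b = b > th_b                    # absolutely blue enough
    m_1 = (m_br | m_bg) & m_b
    # mean blue over the M_1 pixels (one reading of mean(M_1*B) above)
    mean_b = b[m_1].mean() if m_1.any() else 0.0
    m_2 = (b - mean_b) < th_mean      # near or below the mean blue level
    m_3 = m_2 & m_1                   # detection mask
    return m_1, m_2, m_3
```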
[0079] The detection mask M.sub.3 1012 (
dist.sub.1=L2-norm([μ.sub.B,μ.sub.G,μ.sub.R],[Bmean.sup.r,Gmean.sup.r,Rmean.sup.r])
if dist.sub.1>th:
M.sub.3{r}=0
[0083] In the lowest (darkest) exposure image I.sub.dark 812 (
sky_brightness=mean(V[0:0.2*image_h])
min_brightness=mean(V[0.6*image_h:image_h])
M.sub.V[0:0.4*image_h]=V[0:0.4*image_h]>min(0.7*sky_brightness,t1)
M.sub.V[0.4*image_h:]=V[0.4*image_h:]>min(2*min_brightness,t2)
if sky_brightness<=t3:
M.sub.V=AND(M.sub.V,V<245)
[0084] We then compute thresholded masks for the blue-red 926 and blue-green 928 channels to get masks for blue-red M.sub.BR1 1014 (
M.sub.BR1=B>R+10
M.sub.BG1=B>G+10
[0085] Finally, the combined detection mask M.sub.4 1020 (
M.sub.4=AND(AND(M.sub.BR1,M.sub.BG1),M.sub.V)
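The value mask M.sub.V and the combined detection mask M.sub.4 above can be sketched as follows; t1, t2, and t3 are illustrative stand-ins for the unspecified thresholds:

```python
import numpy as np

def value_and_combined_masks(v, b, g, r, t1=160, t2=60, t3=100):
    """v is the HSV value channel of the darkest exposure; b, g, r are
    its color channels.  Rows index image height top-to-bottom."""
    v, b, g, r = [a.astype(np.float64) for a in (v, b, g, r)]
    h = v.shape[0]
    sky_brightness = v[: int(0.2 * h)].mean()   # mean of top 20% rows
    min_brightness = v[int(0.6 * h):].mean()    # mean of bottom 40% rows
    m_v = np.zeros(v.shape, dtype=bool)
    top = int(0.4 * h)
    m_v[:top] = v[:top] > min(0.7 * sky_brightness, t1)
    m_v[top:] = v[top:] > min(2 * min_brightness, t2)
    if sky_brightness <= t3:
        m_v &= v < 245          # suppress specular highlights in dim skies
    m_br1 = b > r + 10          # thresholded blue-red mask
    m_bg1 = b > g + 10          # thresholded blue-green mask
    m_4 = m_br1 & m_bg1 & m_v   # combined detection mask
    return m_v, m_4
```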
[0086] After completing the two heuristic approaches for detecting and differentiating (i.e. segmenting) the ‘sky’ region 816 (
M.sub.HSV=AND(S<t.sub.1,AND(V>t.sub.2,H<t.sub.3))
[0088] Next, the newly-formed hue, saturation, and value mask M.sub.HSV 1022 (
M.sub.combined=OR(M.sub.3,M.sub.4,M.sub.HSV)
[0089] Next, a bright mask M.sub.bright 1026 (
M.sub.bright=AND(INT.sub.bright,INT.sub.dark)>t.sub.b
[0090] Next, a sure sky mask M.sub.sure 1028 (
M.sub.sure=(M.sub.combined⊖M.sub.bright)
[0091] Next, an intermediary grab-cut mask M.sub.grab_cut 1030 (
M.sub.sure=sure foreground seed
M.sub.combined−M.sub.sure=probable foreground (M.sub.pf)
M.sub.sure_bg=Bottom 40% region=sure background
INV(M.sub.sure_bg+M.sub.combined)=probable background
M.sub.grab_cut=GRAB_CUT(M.sub.sure,M.sub.pf,M.sub.sure_bg,INV(M.sub.sure_bg+M.sub.combined))
[0092] Finally, the newly-created intermediary grab-cut mask M.sub.grab_cut 1030 (
M.sub.final=OR(M.sub.grab_cut,M.sub.sure)
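The final mask assembly above can be sketched as follows. The sure sky mask is read here as the conjunction of the combined and bright masks followed by a morphological erosion (matching the claim language), implemented as a 3x3 binary erosion; GRAB_CUT itself is left as a pluggable callable (e.g. OpenCV's grabCut) and replaced by a no-op in this sketch:

```python
import numpy as np

def final_sky_mask(m_combined, m_bright, grab_cut=None):
    """Compose M_sure, the probable fg/bg seeds, and M_final."""
    sure = m_combined & m_bright
    # 3x3 binary erosion of the sure mask (border treated as outside)
    p = np.pad(sure, 1, constant_values=False)
    eroded = np.ones_like(sure, dtype=bool)
    for dy in (0, 1, 2):
        for dx in (0, 1, 2):
            eroded &= p[dy:dy + sure.shape[0], dx:dx + sure.shape[1]]
    m_pf = m_combined & ~eroded                   # probable foreground
    m_sure_bg = np.zeros_like(sure, dtype=bool)
    m_sure_bg[int(0.6 * sure.shape[0]):] = True   # bottom 40% = sure background
    m_pb = ~(m_sure_bg | m_combined)              # probable background
    gc = grab_cut(eroded, m_pf, m_sure_bg, m_pb) if grab_cut else eroded
    return gc | eroded                            # M_final = OR(grab_cut, sure)
```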
[0093] In this embodiment, the creation of the finalized sky segmentation mask M.sub.final 1032 (
[0094] The segmentation improvement method described in the present embodiment also affords improvement in pretraining convolutional networks of artificial neurons through the repeated convolution and pooling of at least one set of clear sky images and at least one set of sky images at least partially containing cloud cover. In an alternative embodiment, clear-sky and cloud-cover images may be synthetically generated using computer graphics. In a further alternative embodiment, the improvements in pretraining can be applied to features and/or portions of indoor and outdoor scenes other than the sky (e.g. rectangular real estate signboards, ceilings, lawns, pools, carpets, etc.).
[0095] While the invention herein disclosed has been described by means of specific embodiments and applications thereof, numerous modifications and variations could be made thereto by those skilled in the art without departing from the scope of the invention set forth in the claims.