DEVICE AND METHOD FOR SKIN GLOSS DETECTION
20200305784 ยท 2020-10-01
Assignee
Inventors
Cpc classification
A61B5/0077
HUMAN NECESSITIES
A61B5/442
HUMAN NECESSITIES
G06V10/60
PHYSICS
A61B5/6898
HUMAN NECESSITIES
International classification
Abstract
The present invention relates to a device and method for skin gloss detection. To enable skin gloss detection in a simple and efficient manner, preferably with an available user device, the device comprises an illumination unit (2) for illuminating a skin area at a flashing rate, an imaging unit (3) for acquiring images of the skin area at an imaging rate which is different than said flashing rate, and a processing unit (4) for processing acquired images and detecting the amount of gloss in the skin area from at least one partial image of the skin area while being illuminated by said illumination unit and at least one partial image of the skin area while not being illuminated by said illumination unit.
Claims
1. Device for skin gloss detection, said device comprising: an illumination unit for illuminating a skin area at a flashing rate, an imaging unit for acquiring images of the skin area at an imaging rate, which is different than said flashing rate, wherein images of the skin area while the illumination by said illumination unit is on and images of the skin area while the illumination by said illumination unit is off are acquired, and a processing unit for processing acquired images by detecting the amount of gloss in the skin area from a specular reflection component and a diffuse component in at least one first partial image of the skin area representing a portion of an image acquired by the imaging unit while the skin area is illuminated by said illumination unit and at least one partial image of the skin area representing a portion of an image acquired by the imaging unit while the skin area is not being illuminated by said illumination unit.
2. Device as claimed in claim 1, wherein said imaging unit is configured to acquire images of the illuminated skin area at an imaging rate which is higher than said flashing rate.
3. Device as claimed in claim 1, wherein said illumination unit is configured to illuminate the skin area at a flashing rate which is a multiple higher than said imaging rate.
4. Device as claimed in claim 3, wherein said illumination unit is configured to illuminate the skin area at a flashing rate, which is a non-integer multiple higher than two times said imaging rate.
5. Device as claimed in claim 1, further comprising a control unit for controlling the flashing rate and/or the imaging rate.
6. Device as claimed in claim 5, wherein said control unit is configured to generate configurable temporal light patterns with modifiable duty cycles.
7. Device as claimed in claim 1, wherein said processing unit is configured to determine the specular reflection component in one or more sub-areas of said skin area and to determine the amount of skin gloss in said sub-areas by determining the ratio of specular reflection component in said one or more sub-areas to a diffuse component in said one or more sub-areas.
8. Device as claimed in claim 1, wherein said processing unit is configured to determine skin sub-areas of the illuminated skin area which are substantially perpendicular to the optical axis of the imaging unit and to determine the amount of skin gloss from the determined skin sub-areas.
9. Device as claimed in claim 8, wherein said processing unit is configured to determine said skin sub-areas by use of a deformable model evaluating skin landmarks and/or pose estimation or by detecting flash components and diffuse components.
10. Device as claimed in claim 1, wherein said processing unit is configured to compensate one or more images of the skin area while not being illuminated by said illumination unit with a contribution of the illumination to a diffuse component estimation per color channel histogram shifting.
11. Device as claimed in claim 1, wherein said processing unit is configured to determine a numerical skin gloss value per pixel, in particular by dividing a specular component value per pixel by a diffuse component value per pixel or by dividing the sum of a specular component value per pixel and a diffuse component value per pixel by the diffuse component value per pixel.
12. Device as claimed in claim 1, wherein said processing unit is configured to determine a skin gloss map of the skin area indicating the amount of skin gloss per pixel or per group of pixels.
13. Device as claimed in claim 1, wherein said device is a mobile user device, in particular a smartphone, camera, laptop, or tablet.
14. Method for skin gloss detection, said method comprising: illuminating a skin area at a flashing rate, acquiring images of the skin area at an imaging rate which is different than said flashing rate, wherein images of the skin area while the illumination is on and images of the skin area while the illumination is off are acquired, and processing acquired images by detecting the amount of gloss in the skin area from a specular reflection component and a diffuse component in at least one first partial image of the skin area representing a portion of an image acquired while the skin area is illuminated and at least one second partial image of the skin area representing a portion of an image acquired while the skin are is not illuminated.
15. Computer program comprising program code means for causing a device as claimed in claim 1 to carry out the steps of the method when said computer program is carried out on the device.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0027] These and other aspects of the invention will be apparent from and elucidated with reference to the embodiment(s) described hereinafter. In the following drawings
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
DETAILED DESCRIPTION OF THE INVENTION
[0039]
[0040] The device 1 comprises an illumination unit 2 for illuminating a skin area at a flashing rate. The illumination unit 2 may e.g. be the flash of a user device. In the following, the illumination unit is often simply referred to as flash, which shall, however, be generally understood as illumination unit.
[0041] The device 1 further comprises an imaging unit 3 for acquiring images of the skin area at an imaging rate which is different than said flashing rate. The imaging unit 3 may e.g. be the camera of a user device. In the following, the imaging unit is often simply referred to as camera, which shall, however, be generally understood as imaging unit.
[0042] The device 1 further comprises a processing unit 4 for processing acquired images and detecting the amount of gloss in the skin area from at least one partial (or complete) image of the skin area while being illuminated by said illumination unit 2 and at least one partial (or complete) image of the skin area while not being illuminated by said illumination unit 2. The processing unit 4 may e.g. be the processor of a user device. Generally, the processing unit 4 may also be an external processor, e.g. of an external computer, to which the acquired images are transmitted for processing to obtain information on the skin gloss. It is, however, preferred that the processing unit forms an integrated device with the illumination unit 2 and the imaging unit 3.
[0043] The device 1 may comprise further optional elements, such as a control unit 5 for controlling the illumination unit 2 and/or the imaging unit 3, in particular to control the flashing rate and/or the imaging rate. The task of the control unit 5 may, however, also be performed by the processor 4, or the flashing rate and/or the imaging rate may be predetermined and fixed so that an active control unit 5 may not be required.
[0044] The device 1 may further comprise a user interface 6, e.g. a display, keypad, touchscreen, etc., allowing the user to enter information, e.g. to start and stop skin gloss detection, change settings, enter personal information, etc., and enabling output of information, e.g. the detected skin gloss information or user instructions.
[0045] According to the present invention the illumination unit 2 is controlled in such a way that it is exactly known which image line is exposed with flash (i.e. illumination) on or off. Additionally, face feature tracking (or other means) may be used to accurately track the region, shape and pose of the face, in which the skin gloss of a skin area (e.g. of the forehead, the cheeks, the chin, etc.) shall be determined. Additionally, averaging (e.g. horizontal averaging) of the image content may be performed resulting in a strong 1D signal indicating the exact switching times per image line even more accurately.
[0046] In the following various embodiments will be explained in more detail. A first main embodiment extracts skin gloss values from alternating images. A second main embodiment creates and combines alternating illuminations per image line.
[0047] In the first main embodiment, for calculation of the amount of gloss (using a zero order approximation) the skin sub-areas, which are perpendicular to the imaging unit's optical axis (since at a smartphone the flash is mounted very close to the camera). These perpendicular sub-areas may e.g. be determined in one of the following two combined ways.
[0048] One way is model based. It may particularly use accurate face landmark detection and 3D face pose estimation. The fit of the 3D deformable model indicates the skin-sub-areas which are perpendicular substantially perpendicular to the imaging unit's optical axis. There are many different methods known for face landmark detection and 3D face pose estimation (e.g. as currently be found at http://blog.mashape.com/list-of-10-face-detection-recognition-apis/ or https://facedetection.com/software/). Further methods are listed by Tim Cootes at http://personalpages.manchester.ac.uk/staff/timothy.f.cootes/tfc_publications.html. These include a computer algorithm to determine the visual location of the generic face features in the image and next fit a deformable 3D shape model to these generic face features in order to robustly estimate pose and surface angles. According to an embodiment spatial image locations are derived from the detected landmarks and the face pose, which are perpendicular to the camera. The 3D shape model registered to the generic face features provides an estimation of the skin normal vectors for each part of the skin.
[0049] According to another way the areas where the flash component is much larger than the diffuse component are used. The flash component is the illumination raise everywhere in the scene due to the flash (as later indicated by S+D). The specular component (S) of the flash illumination is much higher and only present at the location where the surface is close to perpendicular. On top of this, it is assumed that the ambient light contributes mostly to the baseline of D. The value of the diffuse component is either taken from the median of the delta in luminance in the face, since the majority of the face is not perpendicular to the optical axis of the camera. This works even more accurately by taking the areas from the above mentioned first way to determine these non-perpendicular areas.
[0050]
[0051]
[0052]
[0053] In short, the ratio of the specular component and the diffuse component value may be used as an indicator for the amount of gloss.
[0054] More refinement is achieved in another embodiment by compensating a non-flash image (e.g. image 11) with the contribution of the flash to the diffuse estimation per color channel histogram shifting. This histogram matching is preferably only applied in the tracked face area, derived from the face landmarks as shown in
[0055]
[0056]
[0057] The face landmarks already give a robust registration/alignment of the face region per consecutive image frame. Additional accuracy may optionally be achieved by dense registration of image feature points inside this area.
[0058] The gloss estimation is preferably only computed for the skin sub-areas which are close to perpendicular to the camera axis and which show a significant specular component (e.g. having a value above a predetermined threshold). As mentioned, the perpendicularity estimation for each pixel in the image can be coarsely determined by using robust 3D face model based face pose estimation. This combined sub-area can e.g. be illustrated by a binary mask as illustrated as an example in
[0059] The numerical gloss value may be computed per pixel by dividing the specular component value by the diffuse component value and can be assigned to the skin area where the binary mask is valid. Alternatively, the ratio between the specular+diffuse component divided by the diffuse component may also be used as a coarse indicator for the amount of gloss.
[0060] In another embodiment, multiple camera view pairs from a video sequence (angled views on the face) are combined (e.g. by taking the maximum measured gloss) into a single gloss map, which is then assigned to the image with the most frontal face present in the sequence of view pairs. This can yield a complete gloss map for each area of the face or the whole face. An example output gloss map 50 (with heat map color coding, i.e. areas 51 for low values, areas 52 for medium values and areas 53 for high values) for a forehead that was partly greased with oil to illustrate the difference between low gloss (left part of the image) and medium/high gloss (right part of the image).
[0061] Further, in an embodiment the exposure time is automatically fixed by setting auto exposure during the first exposure while holding the flash ON and then locking the exposure. This results in a recording which is hardly ever clipped by over-exposure.
[0062] In the second main embodiment, the frame rate of the flash (i.e. the flashing rate) is a non-integer times higher than the frame rate of the camera (i.e. the imaging rate). This results in exposed images where the duty cycle of the flash is clearly visible in each single image. By deliberately choosing a flashing rate not starting in sync with each frame exposure, the phase of the illuminated image lines is shifting for every image frame. Although this scheme is more complex, it has the benefit of being able to run the flash at a modulation rate which is not perceivable by the human eye, which may be a requirement for the application of gloss measurement by a smartphone.
[0063] The timing of the flash could be exactly controlled in order to know which lines are exposed with flash on and which ones with flash off. For some existing base hardware (e.g. certain smartphones or cameras) the flash cannot be easily controlled in this much detail. When this is the case, as illustrated in this embodiment, the image line, at which the switching of the flash happens, can be extracted from the image robustly, e.g. by adding up the rows of the image and detecting the switching from this 1D modulation signal.
[0064]
[0065] The subsequent processing is rather similar to the processing explained above for the first main embodiment, while taking care of assigning each set of image lines to either flash ON or flash OFF state.
[0066]
[0067] In one embodiment a smartphone is set to record at a high frame rate (imaging rate) like e.g. 120 frames per second (fps), while (synchronously) switching the flashlight on and off at 60 fps. The recorded images now alternately consist of an ambient illuminated scene (consisting e.g. of a face) and a scene which is ambient illuminated plus the flash illumination. Using accurate face feature and pose tracking it has been shown to isolate and measure the specular reflection component with respect to the ambient component, which is a strong indication of the amount of glossiness of the skin. Additionally, in a second embodiment the camera runs a lower framerate while the flash operates at specific non-integer higher multiple of this frame rate, resulting, due to the rolling shutter effect, in different illuminations per set of image lines. Computer vision technology may be used to align the face areas despite the rolling shutter effect and measure different exposures for the same area resulting in similar gloss measurements.
[0068] While the invention has been illustrated and described in detail in the drawings and foregoing description, such illustration and description are to be considered illustrative or exemplary and not restrictive; the invention is not limited to the disclosed embodiments. Other variations to the disclosed embodiments can be understood and effected by those skilled in the art in practicing the claimed invention, from a study of the drawings, the disclosure, and the appended claims.
[0069] In the claims, the word comprising does not exclude other elements or steps, and the indefinite article a or an does not exclude a plurality. A single element or other unit may fulfill the functions of several items recited in the claims. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage.
[0070] A computer program may be stored/distributed on a suitable non-transitory medium, such as an optical storage medium or a solid-state medium supplied together with or as part of other hardware, but may also be distributed in other forms, such as via the Internet or other wired or wireless telecommunication systems.
[0071] Any reference signs in the claims should not be construed as limiting the scope.