Method for detecting image of object using convolutional neural network
11580637 · 2023-02-14
Assignee
Inventors
- Hsiang-Chen Wang (Chiayi, TW)
- Hao-Yi Syu (Chiayi County, TW)
- Tsung-Yu Yang (Chiayi County, TW)
- Yu-Sheng Chi (Chiayi County, TW)
Cpc classification
G16H50/20
PHYSICS
G06V10/25
PHYSICS
International classification
G16H50/20
PHYSICS
Abstract
The present application related to a method for detecting an object image using a convolutional neural network. Firstly, obtaining feature images by Convolution kernel, and then positioning an image of an object under detected by a default box and a boundary box from the feature image. By Comparing with the sample image, the detected object image is classifying to an esophageal cancer image or a non-esophageal cancer image. Thus, detecting an input image from the image capturing device by the convolutional neural network to judge if the input image is the esophageal cancer image for helping the doctor to interpret the detected object image.
Claims
1. A method for detecting an object image using a convolutional neural network, the steps include: Providing an input image to a host by an image capture unit, the input image including at least one detected object image and one background image; Converting the input image into a plurality of characteristic values and comparing the characteristic values with a plurality of convolution kernels to obtain at least one partial or full object image corresponding to some of the characteristic values by using the host, the convolution kernels containing the characteristic values of plural partial images and the adjacent background image in at least one object image; Capturing at least one regional image according to the region where the characteristic values corresponding to the partial or full detected object image and generating at least one default frame based on the edge of at least one regional image and overlapping the default frame on the input image, by using the host; Capturing and comparing a first center point of the default frames with a second center point of a boundary frame on the input image to obtain a center offset between the default frame and the boundary frames by using the host; Performing a regression operation according to the center offset to position the object image in the default frame on the input image by using the host, where the host performs the regression operation with a first position of the default frame, a second position of the boundary frame and a zooming factor to position the detected object image; Comparing the object image with at least one sample image to produce a comparison result by using the host; and Classifying the input image as a target object image or a non-target object image according to the comparison result by using the host; Wherein the boundary frame corresponds to the boundary of the input image, and thereby encompasses the entirety of the input image, and the default frame corresponds to the boundary of the detected object.
2. The method for detecting an object image with a convolutional neural network of claim 1, wherein in the step of Converting the input image into a plurality of characteristic values and comparing the characteristic values with a plurality of convolution kernels to obtain at least one partial or full object image corresponding to some of the characteristic values by using the host, the host sets the detection boundary of convolution cores to 3×3×p and normalizes a plurality of pixel values of input image to a plurality of pixel normal values; the host obtains the characteristic values in a convolution layer by having the convolution kernels multiplying the pixel normal values.
3. The method for detecting an object image with a convolutional neural network of claim 1, wherein in the step of capturing at least one regional image according to the region where the characteristic values corresponding to the partial or full detected object image, the host integrates the regions where the characteristic values are located, obtains the regional image of the input image, and establishes the default frame with the regional image.
4. The method for detecting an object image with a convolutional neural network of claim 1, wherein in the step of Converting the input image into a plurality of characteristic values and comparing the characteristic values with a plurality of convolution kernels to obtain at least one partial or full object image corresponding to some of the characteristic values by using the host, the host convolutes each pixel of input image according to a single shot multibox detector model to detect the characteristic values.
5. The method for detecting an object image with a convolutional neural network of claim 1, wherein in the step of comparing the detected object image with at least one sample image by using the host, the host performs classified comparison at a fully connection layer.
6. The method for detecting an object image with a convolutional neural network of claim 1, wherein in the step of classifying the input image as a target object image or a non-target object image based on a comparison result, when the host fails to identify the object image in the default frame that matches at least one sample image, the host classifies the input image as the non-target object image, else, the host classifies the input image as the target object image.
7. The method for detecting an object image with a convolutional neural network of claim 1, wherein in the step of classifying the input image as a target object image or a non-target object image according to a comparison result, when the host classifies the input image as the non-target object image, the host secondly compares at least one sample image with the object image; when the host judges that an approximation of one of the detected object images is greater than an approximation threshold, it classifies the input image into the target object image; else, the host classifies the input image into the non-target object image.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
DETAILED DESCRIPTION
(9) Due to the fact of the negligence in manual operation or the difficulty of image identification caused by the complex operation of endoscopy, the present application proposes a method for detecting the object image by using the convolutional neural network, used to solve the problems of manual operation and difficulty of image identification caused by the complex operation of endoscopy.
(10) In the following interpretation, the characteristics and matching system of a method for detecting an object image by using a convolutional neural network will be revealed as follow:
(11) First, refer to
(12) Step S10: providing input image to host by image capture unit;
(13) Step S20: converting input image into characteristic values and compared with convolution kernels to obtain partial or full detected object image corresponding to part of the characteristic values by using host;
(14) Step S30: capturing regional image according to region of characteristic values corresponding to partial or full detected object image, and generating a default frame based on edge of regional image to overlap it on input image, by using host; and
(15) Step S40: capturing and comparing first center point of default frame with second center point of boundary frame of input image to obtain center offset between default frame and boundary frame by using host;
(16) Step S50: performing regression operation according to center offset to position detected object image in default frame on input image;
(17) Step S60: comparing detected object image with sample image to produce comparison result by using host; and
(18) Step S70: classifying input image as target object image or non-target object image according to comparison result by using host.
(19) Refer to
(20) In Step S10, as shown in
(21) As shown in
(22) As shown in
(23)
(24) By using Equation (2) and (3) below, calculate the frame height and width from frame size s.sub.k:
h.sub.k=s.sub.k√{square root over (a.sub.r)} Equation (2)
w.sub.k=s.sub.k/√{square root over (a.sub.r)} Equation (3)
(25) Where h.sub.k represents the first-check frame height of the rectangle in the k.sup.th characteristic diagram; the frame height, w.sub.k represents the first-checking frame width, a.sub.r represents the aspect ratio of default frame D, a.sub.r>0.
(26) As shown in
Location of default frame D,d=(d.sup.cx,d.sup.cy,d.sup.w,d.sup.h) Equation (4)
Location of boundary frame B,b=(b.sup.cx,b.sup.cy,b.sup.w,b.sup.h) Equation (5)
Zooming factor l=(l.sup.cx,l.sup.cy,l.sup.w,l.sup.h) Equation (6)
b.sup.cx=d.sup.wl.sup.cx+d.sup.cx Equation (7)
b.sup.cy=d.sup.hl.sup.cy+d.sup.cy Equation (8)
b.sup.w=d.sup.wexp(l.sup.w) Equation (9)
b.sup.h=d.sup.hexp(l.sup.h) Equation (10)
(27) First, align the central coordinates of boundary frame B with the central coordinates of predicting detection frame D, which means that the center point of boundary frame B is “moved” to the center point of predicting detection frame D, that is, the first center point Dc and the second center point Bc shown in
(28) In order to more accurately define the position of the detected object image O1, a further matching of the loss equation is applied as Equation (8) in below
L.sub.loc(x,l,g)=Σ.sub.i∈Pos.sup.NΣ.sub.m∈{cx,cy,w,h}x.sub.ij.sup.ksmooth.sub.L1(l.sub.i.sup.m−ĝ.sub.j.sup.m) Equation (8)
(29) It thus can verify the error between the position of predicting detection fame D and the detected object image O1.
(30) In Step S60, as shown in
(31) As shown in
(32) Sum up the aforesaid statements, the method of detecting a detected object image by a convolutional neural network provides a host to execute the convolution program and allow the host to build a Convolutional Neural Network, used to convolute the input image taken by the image capture unit to screen out the filter area under detection. A predicting detection frame is set up on the input image, and the position of the detected object image is located by the boundary frame through regression operation. Finally, perform the sample image comparison, and use the comparison result to classify the target object images and non-target object images according to the comparison result.