Image-processing apparatus, image-processing system, image-processing method, and storage medium
11145068 · 2021-10-12
Assignee
Inventors
- Hidetaka OHIRA (Fuchu, JP)
- Ryohei Tanaka (Kawasaki, JP)
- Shuta Yanaga (Kawasaki, JP)
- Kenji Furukawa (Machida, JP)
- Mayumi Ikeda (Ota, JP)
Cpc classification
G08B13/19608
PHYSICS
International classification
Abstract
According to one embodiment, an image-processing apparatus includes an output unit, a tracker, a storage controller, a setter, and an output controller. The tracker tracks a target object in one or more time-series images. The storage controller stores tracking information including a movement trajectory of the object in a predetermined storage. The setter sets a detection region for detecting passing through of the object based on the movement trajectory of the object, on the image, in accordance with a user's operation. An evaluator evaluates a setting mode for the detection region based on the tracking information stored in the predetermined storage and the detection region set by the setter. The output controller outputs the tracking information stored in predetermined storage, from the output unit, in an output mode based on an evaluation result of the evaluator.
Claims
1. An image-processing apparatus, comprising: an output unit configured to output information; a tracker configured to track an object which is a target in one or more time-series images; a storage controller configured to store tracking information including movement trajectories of one or more of the objects tracked by the tracker, in a predetermined storage; a setter configured to set a detection region for detecting passing through of the object on the basis of the movement trajectory of the object, on the image, in accordance with a user's operation; an evaluator configured to evaluate a setting mode for the detection region on the basis of the tracking information stored in the predetermined storage and the detection region which is set by the setter; and an output controller configured to output the tracking information stored in the predetermined storage, from the output unit in an output mode based on an evaluation result of the evaluator, wherein the evaluator evaluates a setting mode for the detection region, for each movement trajectory selected by the user, according to a presence or absence of intersection between one or more movement trajectories and the detection region which is set by the setter, the one or more movement trajectories being selected by the user from among movement trajectories of one or more of the objects included in the tracking information stored in the predetermined storage.
2. The image-processing apparatus according to claim 1, wherein the evaluator evaluates a high setting mode for the detection region with respect to a movement trajectory that intersects the detection region among the one or more movement trajectories selected by the user, and evaluates a low setting mode for the detection region with respect to a movement trajectory that does not intersect the detection region.
3. The image-processing apparatus according to claim 2, wherein the output unit includes a display configured to display an image, and the output controller performs a display for further highlighting the movement trajectory for which a higher evaluation of the setting mode for the detection region is made by the evaluator, on the display.
4. The image-processing apparatus according to claim 1, wherein the evaluator evaluates a low setting mode for the detection region with respect to a movement trajectory that intersects the detection region among the one or more movement trajectories which are not selected by the user, and evaluates a high setting mode for the detection region with respect to a movement trajectory that does not intersect the detection region.
5. The image-processing apparatus according to claim 4, wherein the output unit includes a display configured to display an image, and the output controller performs a display for further highlighting the movement trajectory for which a lower evaluation of the setting mode for the detection region is made by the evaluator, on the display.
6. The image-processing apparatus according to claim 1, wherein the output controller further outputs the detection region, set on the image by the setter, from the output unit, in an output mode based on the evaluation result.
7. The image-processing apparatus according to claim 6, wherein in a case where one or more movement trajectories selected by the user from among movement trajectories of one or more of the objects included in the tracking information stored in the predetermined storage and the detection region intersect each other, the evaluator evaluates the setting mode for the detection region in accordance with an intersection position on the one or more movement trajectories.
8. The image-processing apparatus according to claim 7, wherein the evaluator evaluates a higher setting mode for the detection region as the intersection position approaches a center of the movement trajectory, and evaluates a lower setting mode for the detection region as the intersection position becomes farther away from the center of the movement trajectory.
9. The image-processing apparatus according to claim 6, wherein in a case where one or more movement trajectories selected by the user from among movement trajectories of one or more of the objects included in the tracking information stored in the predetermined storage and the detection region intersect each other, the evaluator evaluates the setting mode for the detection region in accordance with an intersection position in the one or more detection regions.
10. The image-processing apparatus according to claim 9, wherein the evaluator evaluates a higher setting mode for the detection region as the intersection position approaches a center of the detection region, and evaluates a lower setting mode for the detection region as the intersection position becomes farther away from the center of the detection region.
11. The image-processing apparatus according to claim 6, wherein the evaluator evaluates the setting mode for the detection region in accordance with an angle between one or more movement trajectories selected by the user from among movement trajectories of one or more of the objects included in the tracking information stored in the predetermined storage and the detection region.
12. The image-processing apparatus according to claim 11, wherein the evaluator evaluates a higher setting mode for the detection region as the angle approaches 90 degrees, and evaluates a lower setting mode for the detection region as the angle approaches 0 degrees or 180 degrees.
13. The image-processing apparatus according to claim 6, wherein the evaluator evaluates the setting mode for the detection region in accordance with a distance between one or more movement trajectories which are not selected by the user from among movement trajectories of one or more of the objects included in the tracking information stored in the predetermined storage and the detection region.
14. The image-processing apparatus according to claim 13, wherein the evaluator evaluates a lower setting mode for the detection region as the distance becomes shorter.
15. The image-processing apparatus according to claim 6, wherein the evaluator evaluates the setting mode for the detection region in accordance with an angle between one or more movement trajectories which are not selected by the user from among movement trajectories of one or more of the objects included in the tracking information stored in the predetermined storage and the detection region.
16. The image-processing apparatus according to claim 15, wherein the evaluator evaluates a lower setting mode for the detection region as the angle approaches 90 degrees.
17. The image-processing apparatus according to claim 2, wherein the detection region is a detection line.
18. The image-processing apparatus according to claim 1, wherein the setter further sets candidates for one or more of the detection regions on the image, and determines a candidate, selected by the user from among the candidates for one or more of the detection regions set on the image, as the detection region.
19. The image-processing apparatus according to claim 1, wherein the tracker derives a degree of conviction indicating a degree of probability of an object to be tracked in the one or more time-series images being an object which is the target, and the storage controller stores the tracking information, including a movement trajectory of an object of which the degree of conviction derived by the tracker is equal to or greater than a threshold among one or more of the objects tracked by the tracker, in the predetermined storage.
20. The image-processing apparatus according to claim 1, wherein the movement trajectory of the object includes a moving direction of the object, and the output controller outputs the tracking information, stored in the predetermined storage, from the output unit, in an output mode according to the moving direction of the object.
21. An image-processing system, comprising: the image-processing apparatus according to claim 1; and one or more cameras configured to generate the one or more time-series images.
22. An image-processing method comprising causing a computer to: track an object which is a target in one or more time-series images; store tracking information including movement trajectories of one or more of the tracked objects in a predetermined storage; set a detection region for detecting passing through of the object on the basis of the movement trajectory of the object, on the image, in accordance with a user's operation; evaluate a setting mode for the detection region on the basis of the tracking information stored in the predetermined storage and the set detection region; and output the tracking information, stored in the predetermined storage, from an output unit configured to output information, in an output mode based on an evaluation result of the setting mode for the detection region, evaluate a setting mode for the detection region, for each movement trajectory selected by the user, according to a presence or absence of intersection between one or more movement trajectories and the detection region, the one or more movement trajectories being selected by the user from among movement trajectories of one or more of the objects included in the tracking information stored in the predetermined storage.
23. A non-transitory computer-readable storage medium having a program stored therein, the program causing a computer to execute: a process of tracking an object which is a target in one or more time-series images; a process of storing tracking information including movement trajectories of one or more of the tracked objects in a predetermined storage; a process of setting a detection region for detecting passing through of the object on the basis of the movement trajectory of the object, on the image, in accordance with a user's operation; a process of evaluating the setting mode for the detection region on the basis of the tracking information stored in the predetermined storage and the set detection region; and a process of outputting the tracking information, stored in the predetermined storage, from an output unit configured to output information, in an output mode based on an evaluation result of the setting mode for the detection region, a process of evaluating a setting mode for the detection region, for each movement trajectory selected by the user, according to as presence or absence of intersection between one or more movement trajectories and the detection region, the one or more movement trajectories being selected by the user from among movement trajectories of one or more of the objects included in the tracking information stored in the predetermined storage.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
(19)
(20)
(21)
(22)
(23)
(24)
(25)
(26)
(27)
(28)
DETAILED DESCRIPTION
(29) According to one embodiment, an image-processing apparatus includes an output unit, a tracker (tracking unit), a storage controller (storage control unit), a setter (setting unit), and an output controller (output control unit). The output unit outputs information. The tracker tracks an object which is a target in one or more time-series images. The storage controller stores tracking information, including movement trajectories of one or more of the objects tracked by the tracker, in a predetermined storage. The setter sets a detection region for detecting passing through of the object on the basis of the movement trajectory of the object, on the image, in accordance with a user's operation. An evaluator evaluates a setting mode for the detection region on the basis of the tracking information stored in the predetermined storage and the detection region which is set by the setter. The output controller outputs the tracking information stored in the predetermined storage, from the output unit, in an output mode based on an evaluation result of the evaluator.
(30) Hereinafter, an image-processing apparatus, an image-processing system, an image-processing method, and a storage medium of an embodiment will be described with reference to the accompanying drawings.
First Embodiment
(31)
(32) The image-processing apparatus 100 is installed, for example, individually in each building, and performs image processing on a moving image generated by each of one or more cameras 10 installed in the same building, to thereby detect persons who go in (enter a room) or persons who go out (leave a room) at each entrance port. For example, the image-processing apparatus 100 sets a detection window W.sub.D for detecting a person who is a target for detection with respect to each frame constituting a moving image, and scans this detection window W.sub.D in the two-dimensional direction of a frame, to thereby detect at which position in a frame region the person is present. The image-processing apparatus 100 derives a trajectory obtained by linking detection windows W.sub.D in which a person is detected in each frame in a time direction, as a person's movement trajectory (hereinafter, referred to as a tracklet TL), and determines whether a person enters an entrance port or leaves an entrance port on the basis of this tracklet TL and a detection line LN.sub.D.
(33) The image-processing apparatus 100 may perform such processes, for example, on-line and in real time, and may store a moving image in a storage device followed by processing the moving image off-line. Meanwhile, the image-processing apparatus 100 may be installed in another building without being limited to being installed in the same building as a building in which the camera 10 is installed. In this case, the camera 10 and the image-processing apparatus 100 may be connected to each other through a wide area network (WAN) or the like. In addition, some functions of the image-processing apparatus 100 may be realized by another computer (such as, for example, a cloud server) connected to a wide area network (WAN).
(34)
(35) The communicator 102 includes a hardware interface such as a network interface card (NIC) capable of being connected to a network such as a LAN or a WAN. For example, the communicator 102 communicates with the camera 10 through a network, and acquires a moving image from the camera 10 of its communication partner.
(36) The display 104 includes a display device such as, for example, a liquid crystal display (LCD) or an organic electroluminescence (EL) display. The display 104 displays an image under control performed by the controller 110.
(37) The operator 106 includes a user interface such as, for example, a button, a keyboard or a mouse. The operator 106 accepts a user's operation, and outputs information according to the accepted operation to the controller 110. Meanwhile, the operator 106 may be a touch panel configured integrally with the display 104.
(38) The controller 110 includes, for example, an object-tracker (object-tracking unit) 112, a storage controller 114, an output controller 116, a detection region-setter (detection region-setting unit) 118, and an evaluation and analyzer 120. These components of the controller 110 are realized by a processor such as a central processing unit (CPU) or a graphics-processing unit (GPU) executing a program (software) stored in the storage 130. In addition, some or all of the components of the controller 110 may be realized by hardware (circuitry) such as a large-scale integration (LSI), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), or a graphics-processing unit (GPU), and may be realized by cooperation between software and hardware. In addition, the above program may be stored in the storage 130 in advance, and may be stored on a detachable storage medium such as a DVD or a CD-ROM and be installed in the storage 130 from the storage medium by the storage medium being mounted in the drive device of the image-processing apparatus 100.
(39) The storage 130 is realized by, for example, a hard disk drive (HDD), a flash memory, an electrically erasable programmable read-only memory (EEPROM), a read-only memory (ROM), a random-access memory (RAM), or the like. In the storage 130, moving images acquired from the camera 10, various processing results, and the like are stored in addition to a program referred to by a processor.
(40) [Process Flow]
(41) Hereinafter, the processing details of each component of the controller 110 of the first embodiment will be described with reference to a flow diagram.
(42) First, the object-tracker 112 tracks a target object in a moving image acquired from the camera 10 (step S100). The target object may be, for example, the upper half of a human body, or a part of a face, a head or the like, and may be the whole body. For example, the object-tracker 112 sets a detection window W.sub.D in each frame constituting a moving image, and uses template matching for comparing an image region overlapping this detection window W.sub.D with a template image prepared in advance, to determine whether the image region overlapping the detection window W.sub.D is an image region reflecting a target object.
(43) For example, the object-tracker 112 may perform template matching on the basis of parameters such as a hue, a luminance value, or a luminance gradient. For example, in a case where a parameter of a hue is used in template matching, the object-tracker 112 may determine that the image region overlapping the detection window W.sub.D is an image region reflecting a target object in a case where a difference between the hue of the image region overlapping of the detection window W.sub.D and the hue of the template image is equal to or less than a fixed value, and determines that the image region overlapping the detection window W.sub.D is a background image region in a case where the difference exceeds the fixed value. In addition, for example, in a case where a parameter of a luminance value or a luminance gradient is used in template matching, the object-tracker 112 may determine that the image region overlapping the detection window W.sub.D is an image region reflecting a target object in a case where a difference between the luminance value or the luminance gradient of the image region overlapping the detection window W.sub.D and the luminance value or the luminance gradient of the template image is equal to or less than a fixed value, and determines that the image region overlapping the detection window W.sub.D is a background image region in a case where the difference exceeds the fixed value.
(44) The object-tracker 112 changes the position of the detection window W.sub.D after determining whether a region in which the detection window W.sub.D is set is an image region including a target object, and determines whether the region to which the position has been changed is an image region including a target object. The object-tracker 112 repeats such processes to thereby detect a target object from each frame while scanning the detection window W.sub.D in the two-dimensional direction of a frame.
(45) The object-tracker 112 compares the positions of target objects detected in each frame between a plurality of frames consecutive in a time-series manner, determines that these target objects are the same object in a case where a distance (distance on an image plane) between the positions which are targets for comparison is less than a predetermined distance, and determines that these target objects are different from each other object in a case where a distance between the positions of the target objects which are targets for comparison is equal to or greater than the predetermined distance. The predetermined distance may, for example, be made shorter as the frame rate of a moving image becomes higher, and may be made longer as the frame rate becomes lower.
(46) In addition, in a case where the shapes of target objects which are targets for comparison are more than a certain degree similar to each other, the object-tracker 112 may determine that these target objects are the same object. For example, in a case where a difference between hues, luminance values, luminance gradients or the like is equal to or less than a fixed value, the object-tracker 112 may determine that the shapes of the target objects are similar to each other.
(47) In addition, in a case where the position of a target object is detected from each of a plurality of frames of a moving image acquired for a certain period in the past, the object-tracker 112 may obtain an optical flow on the basis of the detected positions of the target object, predict the position of a target object at a certain time in the future on the basis of the optical flow, compare the predicted position with the position of a target object detected in reality at that time, and determine whether these target objects are the same object.
(48) The object-tracker 112 links, between frames, representative points such as the centroids of detection windows W.sub.D set with respect to each frame when target objects determined to be the same object are detected, and derives a line obtained by linking the representative points between these frames as a tracklet TL. Meanwhile, in a case where a plurality of target objects are detected in each of a plurality of frames in a result of scanning the detection window W.sub.D, the object-tracker 112 may derive a plurality of tracklets TL. The tracklet TL is assumed to show, for example, the moving direction of an object.
(49) Next, the storage controller 114 superimposes one or more tracklets TL derived by the object-tracker 112 on one frame which is representative (for example, a frame of the latest time) among a plurality of frames referred to when the tracklets TL are derived, and stores information (hereinafter, called tracking information) including the representative frame on which the one or more tracklets TL are superimposed, in the storage 130 (step S102). In addition, the storage controller 114 may control the communicator 102 such that the tracking information is stored in an external storage device on a network.
(50) Next, the output controller 116 causes the display 104 to display a setting screen for the detection line LN.sub.D (step S104). The setting screen for the detection line LN.sub.D is, for example, a graphical user interface (GUI) screen for accepting various types of input in order to set the detection line LN.sub.D. For example, the output controller 116 may display a screen, as a GUI, which makes it possible to designate coordinates of a start point and an end point of the detection line LN.sub.D on a frame on which tracklets TL included in the tracking information are superimposed.
(51) Next, the detection region-setter 118 sets a detection line LN.sub.D for detecting passing through of a target object on the basis of the tracklet TL included in the tracking information, in a frame on which tracklets TL are superimposed, in accordance with a user's operation of the operator 106 (step S106).
(52) The detection line LN.sub.D may be an approximative line obtained by linking one or more pixels in an image (frame), and may be a line that does not have a width geometrically defined as a function of a straight line or a curved line. In a case of the former, the detection line LN.sub.D may be a line having at least one pixel's worth of width.
(53) For example, when a frame on which tracklets TL are superimposed is displayed as a setting screen on the display 104, a user may operate the operator 106 while viewing this screen, and designate a plurality of position coordinates on the frame on which tracklets TL are superimposed. For example, in a case where coordinates of two points are designated by a user, the detection region-setter 118 may set a straight line connecting these two points as a detection line LN.sub.D. In addition, for example, in a case where coordinates of two or more points are designated by a user, the detection region-setter 118 may set a curve in which squares of distances from these points are a minimum, as a detection line LN.sub.D. The straight line or the curved line may be a one-dimensional line that does not have a width geometrically defined as a function, and may be a linear two-dimensional region in which several pixels are formed together in a width direction.
(54) In addition, the detection region-setter 118 may set a detection region R.sub.D which is a two-dimensional region instead of setting the detection line LN.sub.D of a straight line or a curved line. The detection region R.sub.D is, for example, a region which is set as a subspace (two-dimensional space) of a frame on which tracklets TL are superimposed, and is a space in which the length and breadth of a frame are represented as dimensions. For example, in a case where coordinates of three or more points are designated by a user, the detection region-setter 118 may set a polygonal region obtained by linking each point with a straight line, as a detection region R.sub.D, using each point as a vertex. In addition, the detection region-setter 118 may set a stereoscopic three-dimensional detection region R.sub.D in which each point having coordinates designated by a user is used as a vertex. In the following description, as an example, a description will be given of a case where the detection region-setter 118 sets a one-dimensional detection line LN.sub.D.
(55) Next, the evaluation and analyzer 120 evaluates a setting mode for the detection line LN.sub.D with respect to each of one or more tracklets TL included in the tracking information, on the basis of the tracking information stored in the storage 130 or the external storage device and the detection line LN.sub.D set by the detection region-setter 118. In a case where the detection line LN.sub.D which is a one-dimensional detection region R.sub.D is set, the setting mode is, for example, the position of the detection line LN.sub.D, the length of the detection line LN.sub.D, the bending state of the detection line LN.sub.D, or the like. In addition, in a case where a two-or-more-dimensional detection region R.sub.D is set, the setting mode is, for example, the position of the detection region R.sub.D, the area (volume) of the detection region R.sub.D, the shape of the detection region R.sub.D, or the like.
(56) For example, the evaluation and analyzer 120 selects any one tracklet TL, as a processing target, from among one or more tracklets TL included in the tracking information, and determines whether this tracklet TL which is a target for processing and the detection line LN.sub.D intersect each other (step S108). The wording “intersect each other” means that, for example, the tracklet TL enters a pixel region including one or more pixels indicating the detection line LN.sub.D, and then leaves the pixel region.
(57)
(58) In addition, in a case where the detection line LN.sub.D is represented as a line having one pixel's worth of width, the evaluation and analyzer 120 determines that the tracklet TL and the detection line LN.sub.D intersect each other in a case where the fact that two pixels (for example, adjacent pixels on the left side and the lower side) located on a diagonal line among four pixels which are vertically and horizontally adjacent to a certain pixel to be focused upon among a plurality of pixels indicating the detection line LN.sub.D are pixels indicating the tracklet TL is established with respect to two or more pixels among a plurality of pixels indicating the detection line LN.sub.D.
(59)
(60) On the other hand, as illustrated in
(61) The evaluation and analyzer 120 makes different evaluations of the setting mode for the detection line LN.sub.D in a case where the tracklet TL and the detection line LN.sub.D intersect each other and a case where the tracklet TL and the detection line LN.sub.D do not intersect each other. For example, in a case where the tracklet TL and the detection line LN.sub.D intersect each other, the evaluation and analyzer 120 may evaluate a higher setting mode for the detection line LN.sub.D than in a case where the tracklet TL and the detection line LN.sub.D do not intersect each other.
(62) The output controller 116 changes a display mode for the tracking information on the basis of the evaluation result of the evaluation and analyzer 120. For example, since evaluations are different from each other in a case where the tracklet TL and the detection line LN.sub.D intersect each other and a case where the tracklet TL and the detection line LN.sub.D do not intersect each other, the output controller 116 makes the display modes for the detection line LN.sub.D different from each other in these respective cases.
(63) For example, in a case where it is determined by the evaluation and analyzer 120 that the tracklet TL and the detection line LN.sub.D intersect each other, the output controller 116 may cause the display 104 to highlight a tracklet TL intersecting the detection line LN.sub.D (step S110). The highlighting may be to change, for example, the display color of the tracklet TL to other colors. In addition, for example, instead of or in addition to changing the color of the tracklet TL, the highlighting may be to further increase an element value such as brightness, chroma, or luminance, may be to further increase the line thickness of the tracklet TL, may be to decrease the line transparency of the tracklet TL, and may be to change the texture of the tracklet TL.
(64) On the other hand, in a case where it is determined by the evaluation and analyzer 120 that the tracklet TL and the detection line LN.sub.D do not intersect each other, the output controller 116 causes the display 104 to lowlight a tracklet TL not intersecting the detection line LN.sub.D (step S112). The lowlighting is to, for example, maintain the current color without changing the display color of the tracklet TL to other colors. In addition, the lowlighting may be to, for example, reduce an element value such as brightness, chroma, or luminance of the display color of the tracklet TL compared with during highlighting, may be to reduce the line thickness of the tracklet TL compared with during highlighting, and may be to increase the line transparency of the tracklet TL compared with during highlighting.
(65)
(66) In addition, the output controller 116 may cause the display 104 to display the tracklets TL as a heat map as shown in
(67) In addition, the output controller 116 may change display modes in accordance with the directions of the tracklets TL. For example, as shown in
(68) The controller 110 terminates the processing of the present flow diagram in a case where intersection with the detection line LN.sub.D is determined with respect to all the tracklets TL, changes tracklets TL which are targets for processing in a case where intersection with the detection line LN.sub.D is not determined with respect to all the tracklets TL, and repeats the processes of S108 to S112.
(69) Meanwhile, in the processing of the flow diagram described above, the evaluation and analyzer 120 may determine whether the tracklet TL and the detection region R.sub.D intersect each other in a case where a two-dimensional or three-dimensional detection region R.sub.D is set by the detection region-setter 118 instead of the one-dimensional detection line LN.sub.D. The term “intersect” refers to, for example, a positional relationship in which the tracklet TL and the detection region R.sub.D intrude into each other.
(70) According to the first embodiment described above, a target object is tracked in a moving image, tracking information including a tracklet TL which is a movement trajectory of the tracked object is stored in the storage 130 or the external storage device, a detection line LN.sub.D or a detection region R.sub.D for detecting passing through of an object is set in a frame within the moving image on the basis of the tracklet TL included in the tracking information, a setting mode for the detection line LN.sub.D or the detection region R.sub.D is evaluated on the basis of this set detection line LN.sub.D or detection region R.sub.D and the tracklet TL, and the tracking information is output in an output mode based on the evaluation result, whereby it is possible to improve the accuracy of detection of an entering or leaving person.
(71) For example, in a technique disclosed in Patent Document 1, a user is presented with a recommended region for setting a detection line LN.sub.D with reference to the past person's tracklet TL. However, in a case where there is a person passing through a place other than an entrance port such as, for example, a person passing by the entrance port, there is a problem in that it may not be possible to present a correct recommended region, which leads to a decrease in the accuracy of detection of an object. In addition, in a case of a technique disclosed in Patent Document 2, generally, a screen end region in which there is a decrease in the accuracy of detection of a tracklet TL of a tracked person is set to a setting forbidden region. However, a problem occurs in that a situation in which there is a person passing through a place other than an entrance port can occur even in a region at the center of a screen, which leads to a decrease in the accuracy of detection of an object.
(72) On the other hand, in the present embodiment, it is possible to cause a user to confirm whether a past tracklet TL can be correctly detected using the detection line LN.sub.D or the detection region R.sub.D set in accordance with the user's operation, and thus it is possible to cause the user to set the detection line LN.sub.D or the like again, for example, so as not to intersect the tracklet TL of a person passing through a place other than an entrance port. As a result, even in a case where there is a person passing through a place other than an entrance port, it is possible to detect a person who enters an entrance port or leaves the entrance port with a good degree of accuracy.
Second Embodiment
(73) Hereinafter, a second embodiment will be described. The second embodiment is different from the first embodiment in that it is determined whether tracklets TL selected by a user among a plurality of tracklets TL intersect the detection line LN.sub.D or the detection region R.sub.D, and a setting mode for the detection line LN.sub.D or the detection region R.sub.D is evaluated for each of the tracklets TL selected by the user, on the basis of the determination result. Hereinafter, a description will be given with focus on differences from the first embodiment, and common parts with respect to those in the first embodiment will not be described.
(74)
(75) First, the object-tracker 112 tracks a target object in a moving image acquired from the camera 10 (step S200). Next, the storage controller 114 superimposes one or more tracklets TL derived by the object-tracker 112 in order to track a target object, on one frame which is representative among a plurality of frames referred to when the tracklets TL are derived, and cause the storage 130 or the like to store tracking information including the representative frame on which the one or more tracklets TL are superimposed (step S202).
(76) Next, the output controller 116 causes the display 104 to display a selection screen for tracklets TL (step S204). The selection screen for tracklets TL is, for example, a screen for causing a user to select a tracklet TL (tracklet TL which is a target for detection) desired to be detected using the detection line LN.sub.D or the detection region R.sub.D from among all the tracklets TL derived by the object-tracker 112.
(77)
(78) Next, the output controller 116 causes the display 104 to display the selection screen for tracklets TL followed by determining whether one or more tracklets TL have been selected by a user (step S206), and causes the display 104 to display a setting screen for the detection line LN.sub.D in a case where one or more tracklets TL are selected by the user (step S208).
(79)
(80) Next, the evaluation and analyzer 120 selects any one tracklet TL, as a processing target, from among one or more tracklets TL included in the tracking information, and determines whether this tracklet TL which is a target for processing and the detection line LN.sub.D intersect each other (step S212).
(81) Next, in a case where it is determined that the tracklet TL which is a target for processing and the detection line LN.sub.D intersect each other, the evaluation and analyzer 120 further determines whether the tracklet TL which is a target for processing is a tracklet TL selected as a detection target by a user (step S214).
(82) On the other hand, in a case where it is determined that the tracklet TL which is a target for processing and the detection line LN.sub.D do not intersect each other, the evaluation and analyzer 120 further determines whether the tracklet TL which is a target for processing is a tracklet TL selected as a detection target by a user (step S216).
(83) The evaluation and analyzer 120 evaluates the setting mode for the detection line LN.sub.D for each tracklet TL, on the basis of these various types of determination result.
(84) For example, the evaluation and analyzer 120 may evaluate a high setting mode for the detection line LN.sub.D with respect to tracklets TL that intersect the detection line LN.sub.D among one or more tracklets TL selected as detection targets by a user, and evaluate a low setting mode for the detection line LN.sub.D with respect to tracklets TL that do not intersect the detection line LN.sub.D.
(85) In addition, for example, the evaluation and analyzer 120 evaluates a low setting mode for the detection line LN.sub.D with respect to tracklets TL that intersect the detection line LN.sub.D among one or more tracklets TL which are targets for non-detection and are not selected as detection targets by a user, and evaluates a high setting mode for the detection line LN.sub.D with respect to tracklets TL that do not intersect the detection line LN.sub.D.
(86)
(87) In addition, the tracklet TL1 which is a target for detection does not intersect the detection line LN.sub.D, and the tracklet TL5 which is a target for non-detection does not intersect the detection line. That is, the tracklet TL1 which is a target for detection is in a “non-detected state” which is not being detected by the detection line LN.sub.D in spite of this being desired by a user, and the tracklet TL5 which is a target for non-detection is in a “correctly non-detected state” which is not being detected by the detection line LN.sub.D as desired by a user.
(88) In such a case, the evaluation and analyzer 120 evaluates a high setting mode for the detection line LN.sub.D with respect to the tracklets TL2 and TL3 which are targets for detection in the “correctly detected state” and the tracklet TL5 which is a target for non-detection in the “correctly non-detected state”, and evaluates a low setting mode for the detection line LN.sub.D with respect to the tracklet TL1 which is a target for detection in the “non-detected state” and the tracklet TL4 which is a target for non-detection in the “over-detected state”.
(89) The output controller 116 changes the display modes for tracklets TL on the basis of the evaluation results of the setting modes for the detection line LN.sub.D with respect to these respective tracklets TL.
(90) For example, in a case where it is determined by the evaluation and analyzer 120 that the tracklet TL which is a target for processing intersects the detection line LN.sub.D, and is a tracklet TL which is a target for detection, that is, in a case of being in the “correctly detected state”, the output controller 116 causes the display 104 to perform first highlighting on the tracklet TL which is a target for processing (step S218). The first highlighting refers to, for example, greater highlighting than in other display modes such as second highlighting or third highlighting described later.
(91) In addition, in a case where it is determined by the evaluation and analyzer 120 that the tracklet TL which is a target for processing intersects detection line LN.sub.D, and is not a tracklet TL which is a target for detection, that is, in a case of being in a “misdetected state”, the output controller 116 causes the display 104 to perform second highlighting on the tracklet TL which is a target for processing (step S220). The second highlighting refers to, for example, the same degree of highlighting as third highlighting, or greater highlighting than the third highlighting.
(92) In addition, in a case where it is determined by the evaluation and analyzer 120 that the tracklet TL which is a target for processing does not intersect the detection line LN.sub.D, and is a tracklet TL which is a target for detection, that is, in a case of being in the “non-detected state”, the output controller 116 causes the display 104 to perform third highlighting on the tracklet TL which is a target for processing (step S222). The third highlighting refers to, for example, greater highlighting than at least lowlighting.
(93) In addition, in a case where it is determined by the evaluation and analyzer 120 that the tracklet TL which is a target for processing does not intersect the detection line LN.sub.D, and is not a tracklet TL which is a target for detection, that is, in a case of being in the “correctly non-detected state”, the output controller 116 causes the display 104 to lowlight the tracklet TL which is a target for processing (step S224).
(94)
(95) The controller 110 terminates the processing of the present flow diagram in a case where intersection with the detection line LN.sub.D is determined with respect to all the tracklets TL, changes tracklets TL which are targets for processing in a case where intersection with the detection line LN.sub.D is not determined with respect to all the tracklets TL, and repeats the processes of S212 to S224.
(96) According to the second embodiment described above, it is determined whether tracklets TL selected by a user among a plurality of tracklets TL intersect the detection line LN.sub.D or the detection region R.sub.D, and a setting mode for the detection line LN.sub.D or the detection region R.sub.D is evaluated for each of the tracklets TL selected by the user, on the basis of the determination result. Therefore, it is possible to present to a user whether a tracklet TL of a person tracked in the past can be correctly detected or is erroneously detected with respect to the detection line LN.sub.D or the like tentatively set by the user, and possible for the user himself (or herself) to determine whether the set detection line LN.sub.D or the like operates as intended by the user.
Third Embodiment
(97) Hereinafter, a third embodiment will be described. The third embodiment is different from the first and second embodiments in that when the tracklet TL and the detection line LN.sub.D intersect each other, the setting mode for the detection line LN.sub.D is evaluated in accordance with an intersection position on the tracklet TL, an intersection position on the detection line LN.sub.D, and some or all of the angles (included angles) between the tracklet TL and the detection line LN.sub.D. Hereinafter, a description will be given with focus on differences from the first and second embodiments, and common parts with respect to those in the first and second embodiments will not be described.
(98)
(99) In addition, in a case where the end of the detection line LN.sub.D and the tracklet TL are set so as to intersect each other, a target object which is being tracked travels away from the detection line LN.sub.D, and thus the target object may not be able to be detected. Therefore, in order to stably detect passing through of a target object, it is preferable that the vicinity of the midpoint of the detection line LN.sub.D and the tracklet TL be set so as to intersect each other. For this reason, for example, in the detection line LN.sub.D, the evaluation and analyzer 120 may increase the score as the point of intersection with the tracklet TL approaches the center (midpoint) of a length D.sub.LND of the detection line LN.sub.D, and may lower the score as the point of intersection with the tracklet TL becomes farther away from the center of the length D.sub.LND of the detection line LN.sub.D.
(100) In addition, in a case where the detection line LN.sub.D is set so as to be parallel to the tracklet TL, there is a decreasing probability of a target object which is being tracked passing through the detection line LN.sub.D, and thus it is preferable that the detection line LN.sub.D be set so as to be orthogonal to the tracklet TL. For this reason, the evaluation and analyzer 120 may increase the score, for example, as an angle θ between the tracklet TL and the detection line LN.sub.D approaches 90 degrees, and may lower the score as the angle θ approaches 0 degrees or 180 degrees.
(101) The evaluation and analyzer 120 evaluates a setting mode for a final detection line LN.sub.D on the basis of some or all of a score (hereinafter, called a first score) obtained by evaluating the position of the point of intersection with the detection line LN.sub.D on the tracklet TL, a score (hereinafter, called a second score) obtained by evaluating the position of the point of intersection with the tracklet TL on the detection line LN.sub.D, and a score (hereinafter, called a third score) obtained by evaluating the angle θ between the tracklet TL and the detection line LN.sub.D. For example, in a case where all the scores are referred to, the evaluation and analyzer 120 may evaluate a highest setting mode in which a weighted sum of these scores becomes maximum. In this case, the evaluation and analyzer 120 evaluates a highest setting mode for the detection line LN.sub.D set by a user in a case where the center of the tracklet TL and the center of the detection line LN.sub.D intersect each other in a state where the tracklet TL and the detection line LN.sub.D are orthogonal to each other.
(102)
(103) In addition, for example, as shown in
(104) In addition, for example, as shown in
(105) In addition, for example, as shown in
(106) Meanwhile, in the above-described example, the representation of the degree of the accuracy of detection of a target object by the color of the detection line LN.sub.D is not limited thereto. For example, the output controller 116 may represent the degree of the accuracy of detection of a target object, by the transparency, texture, line thickness or the like of the detection line LN.sub.D, and may display characters or images indicating that the accuracy of detection of a target object is high or low, separately from the detection line LN.sub.D. In addition, the output controller 116 may output the high or low accuracy of detection of a target object, as sounds, from a speaker (not shown) or the like. In addition, the output controller 116 may display the value of a weighted sum of the first score, the second score and the third score, as a probability value indicating the accuracy of detection of a target object.
(107) According to the third embodiment described above, the tracklet TL and the detection line LN.sub.D intersect each other, but in a case where the tracklet TL is shortened without being able to sufficiently track a target object, or the like, a display method different from usual is used in a detection line LN.sub.D determined to have a low probability of the target object being able to be correctly detected, whereby it is possible for a user to confirm how accurately the set detection line LN.sub.D has been detected. Thereby, for example, since a user resets the detection line LN.sub.D again, it is possible to set the detection line LN.sub.D which makes it possible to detect a target object with a higher degree of accuracy.
Fourth Embodiment
(108) Hereinafter, a fourth embodiment will be described. The fourth embodiment is different from the first to third embodiments in that when the tracklet TL and the detection line LN.sub.D intersect each other, the setting mode for the detection line LN.sub.D is evaluated in accordance with one or both of a distance from another tracklet TL and an angle between another tracklet TL and the detection line. Hereinafter, a description will be given with focus on differences from the first to third embodiments, and common parts with respect to those in the first to third embodiments will not be described.
(109) For example, as is the case with the second embodiment, in a case where a tracklet TL selected as a detection target by a user and a tracklet TL which is a target for non-detection, not selected as a detection target by the user, are set among a plurality of tracklets TL, the detection line LN.sub.D may intersect the tracklet TL which is a target for detection, and may not intersect the tracklet TL which is a target for non-detection. That is, depending on the setting position of the detection line LN.sub.D, as desired by a user, the tracklet TL which is a target for detection may be set to be in a “correctly detected state”, and the tracklet TL which is a target for non-detection may be set to be in a “correctly non-detected state”. However, in a case where the detection line LN.sub.D is set at a position close to the tracklet TL which is a target for non-detection, or the extended line of the detection line LN.sub.D is set so as to intersect the tracklet TL which is a target for non-detection, there is an increasing probability of even the tracklet TL which is a target for non-detection being erroneously detected by the detection line LN.sub.D.
(110) Therefore, even in a case where the detection line LN.sub.D does not intersect the tracklet TL which is a target for non-detection, and a high evaluation of the setting mode is made, an evaluation and analyzer 120 of the fourth embodiment lowers the evaluation of the setting mode for the detection line LN.sub.D in a case where a distance between the tracklet TL and the detection line LN.sub.D are short, or a case where the extended line of the detection line LN.sub.D intersects the tracklet TL which is a target for non-detection at a nearly orthogonal angle.
(111) For example, when the tracklet TL which is a target for detection and the detection line LN.sub.D intersect each other, the evaluation and analyzer 120 may increase the score as a distance between (shortest distance) the tracklet TL which is a target for non-detection and the detection line LN.sub.D becomes longer, and may lower the score as the distance between the tracklet TL which is a target for non-detection and the detection line LN.sub.D becomes shorter.
(112) In addition, for example, when the tracklet TL which is a target for detection and the detection line LN.sub.D intersect each other, the evaluation and analyzer 120 may increase the score as an angle φ between the tracklet TL which is a target for non-detection and the detection line LN.sub.D approaches 0 degrees or 180 degrees, and may lower the score as the angle φ approaches 90 degrees.
(113) The evaluation and analyzer 120 evaluates a setting mode for a final detection line LN.sub.D on the basis of one or both of a score (hereinafter, called a fourth score) obtained by evaluating a distance between the tracklet TL which is a target for non-detection and the detection line LN.sub.D and a score (hereinafter, called a fifth score) obtained by evaluating an angle φ between the tracklet TL which is a target for non-detection and the detection line LN.sub.D. For example, the evaluation and analyzer 120 evaluates a higher setting mode as a weighted sum of these scores becomes larger.
(114)
(115) In addition, in the example of
(116) Meanwhile, the evaluation and analyzer 120 may evaluate the setting mode for the detection line LN.sub.D in further consideration of some or all of the intersection position on the tracklet TL, the intersection position on the detection line LN.sub.D, and the angle between the tracklet TL and the detection line LN.sub.D which are described in the aforementioned third embodiment, in addition to one or both of a distance from the tracklet TL which is a target for non-detection and an angle between the tracklet TL which is a target for non-detection and the detection line. In a case where these five elements are considered, the evaluation and analyzer 120 may evaluate a higher setting mode for the detection line LN.sub.D as a weighted sum of all the first to fifth scores becomes larger.
(117) According to the fourth embodiment described above, in a case where the tracklet TL which is a target for detection and the tracklet TL which is a target for non-detection are designated by a user, the tracklet TL which is a target for detection and the detection line LN.sub.D intersect each other, and the tracklet TL which is a target for non-detection and the detection line LN.sub.D do not intersect each other, but a display method different from usual is used in a detection line LN.sub.D (detection line LN.sub.D having the fourth score and the fifth score which are low) having a high probability of erroneously detecting passing through of a target object depicting the same movement trajectory as that of the tracklet TL which is a target for non-detection, whereby it is possible for a user to confirm how accurately the set detection line LN.sub.D has been detected. Thereby, for example, since a user resets the detection line LN.sub.D again, it is possible to set the detection line LN.sub.D which makes it possible to detect a target object with a higher degree of accuracy.
Fifth Embodiment
(118) Hereinafter, a fifth embodiment will be described. The fifth embodiment is different from the first to fourth embodiments in that candidates for the detection line LN.sub.D are presented in advance, and a user is caused to select a candidate used as the detection line LN.sub.D from among these candidates. Hereinafter, a description will be given with focus on differences from the first to fourth embodiments, and common parts with respect to those in the first to fourth embodiments will not be described.
(119) A detection region-setter 118 of the fifth embodiment sets, for example, a straight line or a curved line, having the most number of intersections with tracklets TL included in a certain first group, and the least number of intersections with tracklets TL included in a second group which is a set of tracklets TL excluding the first group, as a candidate for the detection line LN.sub.D. The first group is a set of one or more tracklets TL of which start points or end points are located in the vicinity of an entrance port.
(120)
(121) In addition, the detection region-setter 118 may set, for example, a straight line or a curved line, having the maximum number of intersections with tracklets TL facing toward the direction of the entrance port or toward the opposite direction of the entrance port, and the minimum number of intersections with other tracklets TL, as a candidate for the detection line LN.sub.D.
(122) In addition, the detection region-setter 118 may set, for example, a straight line or a curved line, having the maximum number of intersections with tracklets TL selected as detection targets by a user, and the minimum number of intersections with other tracklets TL, as a candidate for the detection line LN.sub.D.
(123) The detection region-setter 118 determines a candidate selected from among candidates for one or more detection lines LN.sub.D presented to a user, as the detection line LN.sub.D. Meanwhile, in a case where the start point, the end point or the like of a straight line or a curved line determined as the detection line LN.sub.D from among candidates for detection lines LN.sub.D is re-designated by a user, the detection region-setter 118 may change the length, the degree of curvature or the like of the detection line LN.sub.D so as to pass through this re-designated point.
(124) In addition, the detection region-setter 118 of the fifth embodiment may newly set candidates for one or more detection lines LN.sub.D on the basis of the evaluation of a setting mode for the detection line LN.sub.D set in the past in accordance with a user's operation. For example, a plurality of detection lines LN.sub.D having different evaluations of setting modes are set with respect to the patterns of tracklets TL such as a relative positional relationship between a plurality of tracklets TL, and the length or the degree of curvature of each of a plurality of tracklets TL. In this case, in a case where the pattern of a tracklet TL in this time is similar to the pattern of a tracklet TL in the past when candidates for detection lines LN.sub.D are set, the detection region-setter 118 preferentially sets a detection line LN.sub.D having a higher evaluation of a setting mode among a plurality of detection lines LN.sub.D set at the tracklet TL of this pattern, as a candidate for a detection line LN.sub.D in this time.
(125) In addition, the detection region-setter 118 of the fifth embodiment may learn a user's preferences for a setting mode for the detection line LN.sub.D, and set the detection line LN.sub.D automatically (irrespective of a user's operation) in accordance with the learned preference. For example, when a user sets a tracklet TL which is a target for detection and a tracklet TL which is a target for non-detection, in a case where a detection line LN.sub.D is often set at a position where the tracklet TL which is a target for non-detection is detected together with the tracklet TL which is a target for detection, that is, a case where a detection line LN.sub.D having a certain amount of “over-detection” allowed is often set in order to prioritize “correct detection”, the detection region-setter 118 learns that this user prefers to improve the accuracy of detection of a tracklet TL by, forcibly if necessary, allowing the over-detection. In addition, for example, when a user sets a tracklet TL which is a target for detection and a tracklet TL which is a target for non-detection, in a case where a detection line LN.sub.D is often set at a preferential position where the tracklet TL which is a target for non-detection is not detected though the tracklet TL which is a target for detection is not detected, that is, a case where a detection line LN.sub.D having a certain amount of “non-detection” allowed is often set in order to prioritize not to be “over-detected”, the detection region-setter 118 learns that this user prefers to prohibit the over-detection by, forcibly if necessary, lowering the accuracy of detection of a tracklet TL.
(126) The detection region-setter 118 automatically sets a new detection line LN.sub.D in accordance with this learned user's preferences for a setting mode for the detection line LN.sub.D. For example, in a case where a user having learned a tendency to set a detection line LN.sub.D having a certain amount of “over-detection” allowed in order to prioritize “correct detection” attempts to newly set a detection line LN.sub.D, the detection region-setter 118 may automatically set a detection line LN.sub.D which is given the highest-priority of “correct detection” while allowing the “over-detection”, on the basis of this learning result. Thereby, it is possible to save much time and effort for a user to set a detection line LN.sub.D, and to improve the user's convenience.
(127) Meanwhile, a user's preferences for a setting mode for the detection line LN.sub.D which are learned by the detection region-setter 118 may be determined by the user himself (or herself).
(128) In addition, in a case where the detection line LN.sub.D is set for each area under surveillance of each camera 10, the detection region-setter 118 may set a setting mode for the detection line LN.sub.D set with respect to a certain area under surveillance, to a setting mode for the detection line LN.sub.D set with respect to another area under surveillance.
(129)
(130) According to the fifth embodiment described above, since a user is presented candidates for detection lines LN.sub.D which are set in advance so as to pass through a certain amount of tracklets TL, the user may just select a detection line LN.sub.D from among the presented candidates, and it is possible to save much time and effort for the user to set the detection line LN.sub.D. As a result, it is possible to improve a user's convenience.
(131) In addition, according to the fifth embodiment described above, since candidates for one or more detection lines LN.sub.D are newly set on the basis of the evaluation of a setting mode for the detection line LN.sub.D set in the past in accordance with a user's operation, a detection line LN.sub.D close to a mode when the user performs a setting by himself (or herself) can be selected from among candidates for a plurality of detection lines LN.sub.D. As a result, it is possible to further save much time and effort for the user to set the detection line LN.sub.D, and to improve the user's convenience.
(132) In addition, according to the fifth embodiment described above, since a user' preferences for a setting mode for the detection line LN.sub.D are learned, it is possible to automatically set the detection line LN.sub.D in accordance with the user's preference.
(133) In addition, according to the fifth embodiment described above, since the detection line LN.sub.D is uniformly set in each area under surveillance for each camera 10 on the basis of the learned user's preferences for a setting mode for the detection line LN.sub.D, it is possible to save more time and effort than when a user sets detection lines LN.sub.D with respect to all the areas under surveillance, and to improve the user's convenience.
Sixth Embodiment
(134) Hereinafter, a sixth embodiment will be described. The sixth embodiment is different from the first to fifth embodiments in that when an object is tracked in a moving image, the degree of conviction indicating the degree of probability of the tracked object being a target object is obtained, and tracking information including only a tracklet TL which is the movement trajectory of an object of which the degree of conviction is equal to or greater than a threshold is stored in the storage 130 or the external storage device. Hereinafter, a description will be given with focus on differences from the first to fifth embodiments, and common parts with respect to those in the first to fifth embodiments will not be described.
(135) An object-tracker 112 of the sixth embodiment derives the degree of conviction for each object which is a target for tracking. For example, the object-tracker 112 may derive a similarity between an image region overlapping a detection window W.sub.D and a template image as the degree of conviction, may derive a similarity between objects detected in each of a plurality of frames as the degree of conviction, may derive a similarity between the image region overlapping the detection window W.sub.D and a background image as the degree of conviction, and may derive a difference in a hue, a luminance value, a luminance gradient or the like between a plurality of frames as the degree of conviction. In addition, the object-tracker 112 may derive a value such as a weighted sum when these parameters are combined as the degree of conviction.
(136) A storage controller 114 of the sixth embodiment sets an object of which the degree of conviction is equal to or greater than a threshold, among objects of which the degrees of conviction are derived by the object-tracker 112, to a target object (for example, person), and causes the storage 130 or an external storage device to store tracking information including only a tracklet TL which is the movement trajectory of the object of which the degree of conviction is equal to or greater than a threshold. Thereby, a detection region-setter 118 of the sixth embodiment sets the detection window W.sub.D using only a tracklet TL of an object having a high degree of conviction indicating a target object.
(137) According to the sixth embodiment described above, only a tracklet TL of an object having a high degree of conviction indicating a person is included in the tracking information, it is possible to suppress the setting of the detection window W.sub.D with reference to tracklets TL of objects other than a target object which is erroneously tracked.
(138) Meanwhile, in any of the above-described embodiments, a description has been given in which the output controller 116 changes a display mode for the tracking information on the basis of the evaluation result of the evaluation and analyzer 120, but there is no limitation thereto. For example, the output controller 116 may cause a user to perform setting feedback on the detection line LN.sub.D by causing a speaker (not shown) to output the evaluation result of the evaluation and analyzer 120 as a sound. For example, the output controller 116 may perform sound guidance on the contents such as “A detection line LN.sub.D set by a user intersects a tracklet TL having an accuracy of detection of ∘∘%”, or the contents such as “An over-detected state was detected by a detection line LN.sub.D set by a user”.
(139) (Hardware Configuration)
(140) The image-processing apparatus 100 of an embodiment described above is realized by, for example, a hardware configuration as shown in
(141) The image-processing apparatus 100 is configured such that an NIC 100-1, a CPU 100-2, a RAM 100-3, a ROM 100-4, a secondary storage device 100-5 such as a flash memory or a HDD, and a drive device 100-6 are connected to each other by an internal bus or an exclusive communication line. A portable storage medium such as an optical disc is mounted in the drive device 100-6. A program stored in the secondary storage device 100-5 or a portable storage medium mounted in the drive device 100-6 is developed in the RAM 100-3 by a DMA controller (not shown) or the like, and is executed by the CPU 100-2, whereby the controller 110 is realized. A program referred to by the controller 110 may be downloaded from other devices through a communication network NW.
(142) According to at least one embodiment described above, a target object is tracked in a moving image, tracking information including a tracklet TL which is a movement trajectory of the tracked object is stored in the storage 130 or the external storage device, a detection line LN.sub.D or a detection region R.sub.D for detecting passing through of an object is set in a frame within the moving image on the basis of the tracklet TL included in the tracking information, a setting mode for the detection line LN.sub.D or the detection region R.sub.D is evaluated on the basis of this set detection line LN.sub.D or detection region R.sub.D and the tracklet TL, and the tracking information is output in an output mode based on the evaluation result, whereby it is possible to improve the accuracy of detection of an entering or leaving person.
(143) The above embodiment can be represented as follows.
(144) An image-processing apparatus, including:
(145) an output device configured to output information;
(146) a storage configured to store a program; and
(147) a processor,
(148) wherein by executing the program, the processor
(149) tracks an object which is a target in one or more time-series images,
(150) stores tracking information including movement trajectories of one or more of the tracked objects in the storage or an external storage,
(151) sets a detection region for detecting passing through of the object on the basis of the movement trajectory of the object, on the image, in accordance with a user's operation,
(152) evaluates a setting mode for the detection region on the basis of the tracking information stored in the storage or the external storage and the set detection region, and
(153) outputs the tracking information stored in the storage or the external storage, from the output unit in an output mode based on an evaluation result of the setting mode for the detection region.
(154) While certain embodiments have been described, these embodiments have been presented by way of example only, and are not intended to limit the scope of the inventions. Indeed, the novel embodiments described herein may be embodied in a variety of other forms; furthermore, various omissions, substitutions and changes in the form of the embodiments described herein may be made without departing from the spirit of the inventions. The accompanying claims and their equivalents are intended to cover such forms or modifications as would fall within the scope and spirit of the inventions.