DIRECTIONALITY CONTROL SYSTEM, CALIBRATION METHOD, HORIZONTAL DEVIATION ANGLE COMPUTATION METHOD, AND DIRECTIONALITY CONTROL METHOD
20180160032 ยท 2018-06-07
Assignee
Inventors
- Hirotaka Sawa (Tokyo, JP)
- Toshimichi Tokuda (Fukuoka, JP)
- Shinichi Shigenaga (Tokyo, JP)
- Shuichi Watanabe (Fukuoka, JP)
- Shojiro Matsuo (Fukuoka, JP)
- Takashi Hirabara (Fukuoka, JP)
- Tadashi Maki (Fukuoka, JP)
- Hisashi TSUJI (Fukuoka, JP)
- Teppei Fukuda (Fukuoka, JP)
- Akitoshi IZUMI (Fukuoka, JP)
- Shintaro Yoshikuni (Fukuoka, JP)
- Hiroyuki Matsumoto (Fukuoka, JP)
- Kazunori Hayashi (Fukuoka, JP)
- Kosuke Shinozaki (Fukuoka, JP)
Cpc classification
H04R1/04
ELECTRICITY
H04S2420/01
ELECTRICITY
G01S3/785
PHYSICS
H04N23/661
ELECTRICITY
H04S7/00
ELECTRICITY
H04N23/69
ELECTRICITY
International classification
H04R1/04
ELECTRICITY
G01S3/785
PHYSICS
Abstract
A directionality control system includes: a camera; a microphone provided as a separate body from the camera; a display that displays video data captured by the camera; and a processor that computes a sound collection direction, which is directed from the microphone toward a sound position corresponding to a designated position in the video data. The processor computes the sound collection direction by using parameters including: a first height of the camera from a reference surface, a second height of the microphone from the reference surface, a third height of a computation reference point from the reference surface, the computation reference point being positioned in the sound collection direction at a position different from the sound position, a direction which is directed from the camera toward the sound position, and a fourth height of the sound position from the reference surface.
Claims
1. A directionality control system comprising; an imaging device that captures a video; a sound collector that collects sound, and being provided as a separate body from the imaging device; a display that displays video data captured by the imaging device; and a processor that computes a sound collection direction, which is directed from the sound collector toward a sound position corresponding to a designated position in the video data in response to a designation of a position in the video data displayed on the display, wherein the processor computes the sound collection direction by using parameters including; a first height of the imaging device from a reference surface, a second height of the sound collector from the reference surface, a third height of a computation reference point from the reference surface, the computation reference point being positioned in the sound collection direction at a position different from the sound position corresponding to the designated position in the video data, a direction which is directed from the imaging device toward the sound position corresponding to the designated position in the video data, and a fourth height of the sound position corresponding to the designated position in the video data from the reference surface.
2. The directionality control system according to claim 1, wherein; the processor forms a sound collection directionality of the sound collected by the sound collector in the computed sound collection direction.
3. The directionality control system according to claim 1, wherein the computation reference point is provided on an optical axis of the imaging device, and the parameters further includes; a horizontal component distance between the imaging device and the sound collector; a distance and a depression angle from the imaging device to the computation reference point; and a distance and a depression angle from the sound collector to the computation reference point.
4. The directionality control system according to claim 1, wherein the computation reference point is provided on an optical axis of the imaging device, and the parameters further includes: a horizontal component distance between the imaging device and the sound collector; a distance from the imaging device to the computation reference point; and a horizontal angle and a vertical angle from the sound collector to the computation reference point.
5. The directionality control system according to claim 1, wherein the processor uses a value selected from a plurality of predetermined default values as the fourth height.
6. The directionality control system according to claim 1, wherein the processor computes the fourth height by designating, in the video data, a first designation position corresponding to the sound position, and a second designation position corresponding to a position on the reference surface located in a vertically lower direction of the sound position.
7. A directionality control method in a directionality control system which comprises an imaging device that captures a video, and a sound collector that collects sound and is provided as a separate body from the imaging device, the method comprising: displaying video data captured by the imaging device on a screen; receiving a designation of a position in the video data displayed on the screen; receiving an input of parameters at least including: a first height of the imaging device from a reference surface; a second height of the sound collector from the reference surface; and a third height of a computation reference point from the reference surface, the computation reference point being positioned in a sound collection direction at a position different from a sound position corresponding to the designated position in the video data; setting a fourth height of the sound position corresponding to the designated position in the video data from the reference surface; and computing a direction which is directed from the sound collector toward the sound position corresponding to the designated position in the video data, by using the input parameters, a direction which is directed from the imaging device toward the sound position corresponding to the designated position in the video data, and the fourth height.
8. The directionality control method according to claim 7, further comprising: forming a sound collection directionality of the sound collected by the sound collector in the sound collection direction as computed.
9. The directionality control system according to claim 1, wherein the imaging device includes a camera.
10. The directionality control system according to claim 1, wherein the sound collector includes a microphone.
Description
BRIEF DESCRIPTION OF DRAWINGS
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
[0039]
[0040]
[0041]
[0042]
[0043]
[0044]
[0045]
[0046]
[0047]
[0048]
[0049]
[0050]
[0051]
[0052]
[0053]
[0054]
[0055]
[0056]
[0057]
[0058] ) output from the speaker device.
[0059]
[0060]
[0061]
[0062]
[0063]
[0064]
[0065]
[0066]
[0067]
[0068]
[0069]
[0070]
[0071]
[0072]
[0073]
[0074]
[0075]
[0076]
[0077]
[0078]
[0079]
[0080]
[0081]
[0082]
[0083]
[0084]
[0085]
[0086]
[0087]
[0088]
[0089]
[0090]
[0091]
[0092]
[0093]
[0094]
[0095]
[0096]
[0097]
[0098]
[0099]
[0100]
[0101]
[0102]
[0103]
[0104]
[0105]
[0106]
[0107]
[0108]
[0109]
[0110]
[0111]
[0112]
[0113]
[0114]
[0115]
DESCRIPTION OF EMBODIMENTS
[0116] Hereinafter, embodiments of a directionality control system and a directionality control method related to the present invention will be described with reference to the drawings. The directionality control system of the present embodiment is used as a monitoring system (including a manned monitoring system and an unmanned monitoring system) provided in, for example, a factory, a public facility (for example, a library or an event hall), or a store (for example, a retail store or a bank).
[0117] In addition, the present invention can be expressed as respective apparatuses (for example, a directionality control apparatus to be described later) constituting the directionality control system, or a directionality control method including respective operations (steps) performed by each apparatus constituting the directionality control system.
First Embodiment
Configuration of Directionality Control System
[0118]
[0119] The directionality control system 10A illustrated in
[0120] Hereinafter, a description will be made focusing on an operation of each unit of the directionality control system 10, and a description of an operation of each unit of the directionality control system 10A will be made regarding the content which is different from the operation of each unit of the directionality control system 10.
[0121] Each of the camera apparatuses 11 to 1n as at least one imaging part is a monitoring camera which is connected to the network NW1 and is installed on, for example, a ceiling surface or a wall surface (refer to
[0122] The camera apparatuses 11 to 1n capture a video (including a still image and a moving image; this is also the same for the following description) of a position of a monitoring target which is present in a predefined angle of view CAR centering on an optical axis CX. The camera apparatuses 11 to 1n transmit the captured video data, and input parameters for computing a sound collection direction (.sub.MAh,.sub.MAv) which will be described later to the directionality control apparatus 3 or the recorder apparatus 4 via the network NW1.
[0123] The microphone array apparatus 2 as a sound collection part which is connected to the network NW1 is installed on, for example, a ceiling surface or a wall surface of a room of an event hall or a predetermined stand (refer to
[0124] The microphone array apparatus 2 collects sound of a monitoring target in a sound collection direction by using each of the microphone (or microphones) 22 and 23, and transmits audio data collected by each of the microphones 22 and 23 to the directionality control apparatus 3 or the recorder apparatus 4 via the network NW1 or the network NW2.
[0125] The microphone array apparatus 2 forms sound collection directionality of each of the microphones 22 and 23 in a sound collection direction (.sub.MAh,.sub.MAv) which is derived (hereinafter, referred to as computed) by a signal processing unit 33 which will be described later in response to a directionality formation instruction from the signal processing unit 33.
[0126] Consequently, the microphone array apparatus 2 can increase a volume level of audio data which is collected from the sound collection direction (.sub.MAh,.sub.MAv) in which the sound collection directionality is formed, and can reduce a volume level of audio data which is collected from a direction in which the sound collection directionality is not formed. In addition, a method of computing the sound collection direction (.sub.MAh,.sub.MAv) will be described later.
[0127] Exteriors of the microphone array apparatus 2 will be described later with reference to
[0128] The networks NW1 and NW2 are wired communication networks (for example, an intranet or the Internet) or wireless communication networks (for example, a local area network (LAN)).
[0129] The directionality control apparatus 3 is connected to the network NW1 or the network NW2, and may be, for example, a stationery personal computer (PC) installed in a monitoring system control room (not illustrated), and may be a mobile phone, a tablet terminal, or a smart phone, which can be carried by a user.
[0130] The directionality control apparatus 3 includes at least a communication unit 31, an operation unit 32, a signal processing unit 33, a display device 36, a speaker device 37, and a memory 38. The signal processing unit 33 includes at least a coordinate transform processing section 34z and an output control section 35.
[0131] The communication unit 31 receives video data or audio data which is transmitted from the camera apparatuses 11 to 1n or the microphone array apparatus 2, and outputs the data to the signal processing unit 33 via the network NW1 or the network NW2.
[0132] The operation unit 32 is a user interface (UI) for notifying the signal processing unit 33 of the content of a user's input operation, and is, for example, a pointing device such as a mouse or a keyboard. In addition, the operation unit 32 may be configured by using a touch panel or a touch pad which is disposed so as to correspond to, for example, a screen of the display device 36 and allows an input operation to be performed with the finger FG of the user or a stylus pen.
[0133] The operation unit 32 outputs coordinate data indicating a region where the user desires to increase or decrease a volume level, that is, a designated position A or a region B illustrated in
[0134] The signal processing unit 33 is configured by using, for example, a central processing unit (CPU), a micro processing unit (MPU), or a digital signal processor (DSP), and performs a control process for collectively controlling operations of the respective units of the directionality control apparatus 3, data input and output processes with other respective units, a data computation (calculation) process, and a data storage process.
[0135] The coordinate transform processing section 34z computes, as a sound collection direction, coordinates (.sub.MAh,.sub.MAv) indicating a direction which is directed from an installation position of the microphone array apparatus 2 toward a sound collection region central position (a sound position) A by using coordinate data indicating the position A or the region B which is output from the operation unit 32. In the sound collection direction (.sub.MAh,.sub.MAv), .sub.MAh indicates a horizontal angle of a depression angle .sub.MA which is directed toward the sound collection region central position A from the microphone array apparatus 2, and .sub.MAv indicates a vertical angle of the depression angle .sub.MA which is directed toward the sound collection region central position A from the microphone array apparatus 2. In addition, a process of computing the sound collection direction (.sub.MAh,.sub.MAv) will be described later in detail with reference to the drawings.
[0136] In addition, the sound collection region central position A is a field position which corresponds to the position A designated with the finger FG of the user or a stylus pen on a screen of the display device 36 via the operation unit 32, and is an actual monitoring target.
[0137] The output control section 35 controls operations of the display device 36 and the speaker device 37, so as to cause the display device 36 to reproduce and output video data transmitted from the camera apparatuses 11 to 1n, and to cause the speaker device 37 to output audio data transmitted from the microphone array apparatus 2 as sound.
[0138] The display device 36 as a display part displays video data captured by the camera apparatuses 11 to 1n on a screen.
[0139] The speaker device 37 as a sound output part outputs, as sound, audio data collected by the microphone array apparatus 2 or audio data which is collected by the microphone array apparatus 2 after the sound collection directionality is formed in the sound collection direction (.sub.MAh,.sub.MAv) computed by the coordinate transform processing section 34z. In addition, the display device 36 and the speaker device 27 may be configured separately from the directionality control apparatus 3.
[0140] The memory 38 is configured by using, for example, a random access memory (RAM), and functions as a work memory when the respective units of the directionality control apparatus 3 operate.
[0141] The recorder apparatus 4 records video data captured by the camera apparatuses 11 to 1n and audio data collected by the microphone array apparatus 2. The recorder apparatus 4 records the video data captured by the camera apparatuses 11 to 1n and the audio data collected by the microphone array apparatus 2 in correlation with each other. In addition, the networks NW1 and NW2 may be connected to each other, and various data items may be transmitted between the directionality control systems 10 and 10A.
[0142]
[0143] The microphone array apparatus 2 illustrated in
[0144] The plurality of microphone units 22 which are disposed in the large circular shape have a wide interval therebetween, have a large diameter, and have a characteristic suitable for a low sound range. On the other hand, the respective microphone units 23 have a narrow interval therebetween, have a small diameter, and have a characteristic suitable for a high sound range.
[0145] The microphone array apparatus 2A illustrated in
[0146] The microphone array apparatus 2B illustrated in
[0147] The microphone array apparatus 2C illustrated in
[0148] The microphone array apparatus 2D illustrated in
[0149]
[0150] The sound source 80 is, for example, a conversation of subjects of the camera apparatus 11, present in a sound collection direction in which the microphone array apparatus 2 collects sound, and is present in a direction of a predetermined angle with respect to the upper surface of the casing 21 of the microphone array apparatus 2. In addition, gaps d between the respective microphones 221, 222, 223, . . . , 22(n1) and 22n are assumed to be constant.
[0151] The sound waveforms generated from the sound source 80 initially arrive at and are collected by the microphone 221, then arrive at and are collected by the microphone 222, similarly, sequentially arrive at and are collected by the microphones, and, finally, arrive at and are collected by the microphone 22n.
[0152] In addition, a direction which is directed toward the sound source 80 from each of the microphones 221, 222, 223, . . . , 22(n1) and 22n of the microphone array apparatus 2 is the same as, for example, a direction which is directed toward a sound collection region central position corresponding to a position designated by the user on a screen of the display device 36 from each microphone of the microphone array apparatus 2 in a case where the sound source 80 is the sound of conversations which people have.
[0153] Here, there are occurrences of arrival time differences 1, 2, 3, . . . and (n1) between time points at which the sound waves arrive at the microphones 221, 222, 223, . . . and 22(n1) and finally arrive at the microphone 22n. For this reason, if audio data of sound collected by the respective microphones 221, 222, 223, . . . , 22(n1) and 22n is added without change, the audio data is added in a state where a phase thereof is shifted, and thus a volume level of the sound waves is completely lowered.
[0154] In addition, 1 indicates a time difference between the time point at which the sound wave arrives at the microphone 221 and the time point at which the sound wave arrives at the microphone 22n, 2 indicates a time difference between the time point at which the sound wave arrives at the microphone 222 and the time point at which the sound wave arrives at the microphone 22n, and, similarly, (n1) indicates a time difference between the time point at which the sound wave arrives at the microphone 22(n1) and the time point at which the sound wave arrives at the microphone 22n.
[0155] In the present embodiment, the microphone array apparatus 2 includes A/D converters 241, 242, 243, . . . , 24(n1) and 24n, delay devices 251, 252, 253, . . . , 25(n1) and 25n which are respectively provided so as to correspond to the microphones 221, 222, 223, . . . , 22(n1) and 22n, and an adder 26 (refer to
[0156] In other words, in the microphone array apparatus 2, the A/D converters 241, 242, 243, . . . , 24(n1) and 24n A/D-convert analog audio data collected by the respective microphones 221, 222, 223, . . . , 22(n1) and 22n into digital audio data.
[0157] In addition, in the microphone array apparatus 2, the delay devices 251, 252, 253, . . . , 25(n1) and 25n provide delay times corresponding to the arrival time differences in the respective microphones 221, 222, 222 , . . . , 22(n1) and 22n to all phases of the sound waves so that the phases thereof are made to match each other, and then the adder 26 adds the audio data having undergone the delay process together. Accordingly, the microphone array apparatus 2 can form a directionality of the audio data in each of the microphones 221, 222, 223, . . . , 22(n1) and 22n in a direction of the predetermined angel .
[0158] For example, in
[0159] Here, L1 indicates a difference between sound wave arrival distances in the microphone 221 and the microphone 22n. L2 indicates a difference between sound wave arrival distances in the microphone 222 and the microphone 22n. L3 indicates a difference between sound wave arrival distances in the microphone 223 and the microphone 22n, and, similarly, L(n1) indicates a difference between sound wave arrival distances in the microphone 22(n1) and the microphone 22n. Vs indicates a velocity of the sound wave (sound velocity). L1, L2, L3, . . . , and L(n1), and Vs are known values. In
[0160] In the above-described way, the microphone array apparatus 2 can easily form the sound collection directionality of audio data collected by each of the microphones 22 and 23 by changing the delay times D1, D2, D3, . . . , D(n1) and Dn which are respectively set in the delay devices 251, 252, 253, . . . , 25(n1) and 25n.
[0161] In addition, for convenience of description, the directionality forming process illustrated in
[0162] Next, a summary of an operation of the directionality control system 10 of the present embodiment will be described with reference to
[0163] In the directionality control system 10, the camera apparatus 11 images the subjects (for example, two people) and the speaker device SP reflected in the predefined angle of view CAR. The microphone array apparatus 2 collects ambient sound. In ) as sound. Video data captured by the camera apparatus 11 is displayed on the display device 36 of the directionality control apparatus 3 (refer to
[0164] Here, if the position A on the display device 36, that is, a substantially central position or the region B of the two people having conversations is designated with the finger FG of the user, the directionality control apparatus 3 computes coordinates (.sub.MAh,.sub.MAv) indicating a direction which is directed from the installation position of the microphone array apparatus 2 toward the sound collection region central position A as a sound collection direction of the microphone array apparatus 2 by using coordinate data indicating the position A or the region B. The microphone array apparatus 2 forms sound collection directionality MIX in the direction which is directed from the microphone array apparatus 2 toward the sound collection region central position A by using the coordinate data (.sub.MAh,.sub.MAv) computed by the directionality control apparatus 3.
[0165] Therefore, the microphone array apparatus 2 can increase a volume level of the conversation (Hello) of the two people present in the direction in which the sound collection directionality MIX is formed more than a volume level of the music () output from the speaker device SP which is not present in the direction in which the sound collection directionality MIX is formed.
[0166] Consequently, the directionality control apparatus 3 causes the speaker device 37 to output sound with a volume level of the conversation (Hello) of the two people present in the direction in which the sound collection directionality MIX is formed higher than a volume level of the music () output from the speaker device SP which is not present in the direction in which the sound collection directionality MIX is formed (refer to
[0167] Next, initial setting (calibration) performed in the directionality control systems 10 and 10A of the present embodiment will be described with reference to
[0168] Hereinafter, for simplification of description, the directionality control system 10 will be described of the directionality control systems 10 and 10A, and the directionality control system 10 includes the single camera apparatus 11.
[0169] In
[0170] After the camera apparatus 11 and the microphone array apparatus 2 are initially installed, the signal processing unit 33 measures each input parameter which is required to compute coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction of the microphone array apparatus 2 (step ST2). The process in step ST2 includes a case where the user measures the input parameter by using a measuring device (for example, a laser range finder), a case where the camera apparatus 11 measures and acquires the input parameter by using functions of well-known techniques of the camera apparatus 11, or a case where the input parameter is acquired by using a design drawing (layout) of an installation space (for example, a building). Each input parameter in step ST2 differs in each embodiment which will be described later, and thus detailed content thereof will be described later.
[0171] After step ST2, each input parameter measured in step ST2 is input to the signal processing unit 33 of the directionality control apparatus 3 (step ST3). For example, the camera apparatus 11 transmits an input parameter acquired by using functions of well-known techniques of the camera apparatus 11, to the communication unit 31 of the directionality control apparatus 3. The communication unit 31 outputs the input parameter transmitted by the camera apparatus 11, to the signal processing unit 33.
[0172] In addition, the microphone array apparatus 2 transmits an input parameter acquired by using functions of well-known techniques of the microphone array apparatus 2, to the communication unit 31 of the directionality control apparatus 3. The communication unit 31 outputs the input parameter transmitted by the medical examination support system 2, to the signal processing unit 33.
[0173] Further, the operation unit 32 outputs an input parameter to the signal processing unit 33 in response to a user's input operation for operating the directionality control apparatus 3.
[0174] The signal processing unit 33 temporarily preserves each input parameter acquired in step ST3, in the memory 38 (step ST4). Through the above-described steps, the operation of the initial setting (calibration) in the directionality control system 10 is finished.
[0175] Next, with reference to
[0176] In
[0177] After step ST11, in a case where the camera apparatus 11 receives the notification indicating that the designation of the position A or the region B has been received from the directionality control apparatus 3, the camera apparatus 11 acquires all or some coordinates of a distance, a horizontal angle, and a vertical angle (L.sub.CA,.sub.CAh,.sub.CAv) to the sound collection region central position A corresponding to the position A or the region B on the screen designated in step ST11, with the installation position of the camera apparatus 11 as a start point (step ST12).
[0178] The camera apparatus 11 transmits all or some of the coordinates of the distance, the horizontal angle, and the vertical angle (L.sub.CA,.sub.CAh,.sub.CAv) to the sound collection region central position A corresponding to the position A or the region B on the screen designated in step ST11 with the installation position of the camera apparatus 11 as a start point, to the directionality control apparatus 3.
[0179] The signal processing unit 33 of the directionality control apparatus 3 computes coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction of the microphone array apparatus 2 by using all or some of the coordinates of the distance, the horizontal angle, and the vertical angle (L.sub.CA,.sub.CAh,.sub.CAv) to the sound collection region central position A corresponding to the position A or the region B on the screen designated in step ST11 with the installation position of the camera apparatus 11 acquired in step ST12 as a start point, and each input parameter which is temporarily preserved in the memory 38 in step ST4 illustrated in
[0180] The directionality control apparatus 3 transmits a directionality formation instruction including the coordinates (.sub.MAh,.sub.MAv) computed in step ST13 to the microphone array apparatus 2. The microphone array apparatus 2 forms the sound collection directionality of each of the microphones 22 and 23 in a sound collection direction indicated by the coordinates (.sub.MAh,.sub.MAv) computed by the directionality control apparatus 3 in response to the directionality formation instruction from the directionality control apparatus 3 (step ST14).
[0181] Consequently, the microphone array apparatus 2 can increase a volume level of audio data which is collected from the sound collection direction (.sub.MAh,.sub.MAv) in which the sound collection directionality is formed, and can reduce a volume level of audio data which is collected from a direction in which the sound collection directionality is not formed. In the above-described way, the operation is finished in which the directionality control apparatus 3 of the directionality control system 10 computes a sound collection direction of the microphone array apparatus 2.
In Case of Camera Apparatus Sends Only Panning/Tilting/Zooming Information
[0182] In the description hitherto, the camera apparatus 11 performs the operation in step ST12 illustrated in
[0183] In addition, in the directionality control system 10 of the present embodiment, a timing at which the microphone array apparatus 2 collects sound is not limited to the time right after step ST14, and may be, for example, the time after power is supplied to the microphone array apparatus 2.
Description of Generalization of Coordinate Transform Process from Camera Coordinate System to Microphone Coordinate System
[0184]
[0185] As illustrated in
[Equation 2]
{right arrow over (V.sub.MXP)}={right arrow over (V.sub.CXP)}{right arrow over (V.sub.CXMX)}(2)
[0186] However, a position of the sound source P on an image captured by the camera CX is represented by coordinates corresponding to an independent coordinate system (hereinafter, referred to as a camera coordinate system) of the camera CX, and a position of the sound source P which is a sound collection target of the omnidirectional microphone array apparatus MX is represented by coordinates corresponding to an independent coordinate system (hereinafter, referred to as a microphone coordinate system) of the omnidirectional microphone array apparatus MX. For this reason, a coordinate transform process is required to compute a position of the sound source P viewed from the omnidirectional microphone array apparatus MX by using a position of the sound source P based on the camera coordinate system, viewed from the camera CX.
[0187] Coordinates (X,Y,Z) corresponding to the microphone coordinate system are expressed by Equation (3) by using coordinates (x,y,z) corresponding to the camera coordinate system, and a predetermined coordinate transform matrix U.
[0188] As a specific example, in a case where the x axis, the y axis, and the z axis of the camera coordinate system are rotated about the z axis by degrees, rotated about the x axis by a degrees, and rotated about the y axis by degrees so that the coordinates of the camera CX are moved in parallel by (x.sub.0,y.sub.0,z.sub.0), the coordinate transform matrix U is expressed by Equation (4). In addition, in Equation (4), a parallel movement, an order of integration in a rotation matrix of the coordinate axes, or selection of a rotation axis is not limited thereto.
[0189] In order to compute the coordinate transform matrix U, a method using a design drawing (layout) of a space (for example, a room) in which the camera CX and the omnidirectional microphone array apparatus MX are installed may be employed (refer to
[0190] Here, for example, it is assumed that data regarding coordinates (x.sub.0,y.sub.0,z.sub.0) in the camera coordinate system, indicating a distance between the PTZ camera apparatus 1 as the camera CX and the omnidirectional microphone array apparatus 2 as the omnidirectional microphone array apparatus MX, and the camera coordinate system and the microphone coordinate system are predefined, and an angle formed by a horizontal reference axis of the camera is known. The user obtains data regarding coordinates of the PTZ camera apparatus 1 in an orthogonal coordinate system and the angle by using a layout diagram and inputs the data and the angle via the operation unit 32.
[0191] Hereinafter, with reference to
[0192] As illustrated in
[0193] In the camera coordinate system of the PTZ camera apparatus 1, an origin O.sub.C is a rotation center of the PTZ camera apparatus 1 in the panning direction and the tilting direction, an x-y plane is defined in parallel to a pedestal of the PTZ camera apparatus 1, and this x-y plane is parallel to the wall surface. In addition, the x axis is directed vertically upward.
[0194] First, the PTZ camera apparatus 1 transforms coordinates (r,,) in a spherical coordinate system of the camera coordinate system into coordinates (x,y,z) in the orthogonal coordinate system according to Equation (5), and transmits data regarding the coordinates (x,y,z) having undergone the transform process to the directionality control apparatus 3.
[0195] An origin .sub.M of the omnidirectional microphone array apparatus 2 viewed from the PTZ camera apparatus 1 is provided at the position of (x.sub.0,y.sub.0,z.sub.0) when viewed from the orthogonal coordinate system of the PTZ camera apparatus 1. The microphone coordinate system of the omnidirectional microphone array apparatus 2 has the origin O.sub.M as a center of the omnidirectional microphone array apparatus 2, an array formation plane of the omnidirectional microphone array apparatus 2 is an X-Y plane, and the X-Y plane is parallel to the ceiling surface. In addition, the Z axis is perpendicular to the array formation plane (X-Y plane) and is directed vertically downward.
[0196] The coordinate transform processing section 34z of the signal processing unit 33 of the directionality control apparatus 3 moves in the origin O.sub.C of the PTZ camera apparatus 1 parallel by (x.sub.0,y.sub.0,z.sub.0) by using the data regarding the coordinates (x,y,z) in the orthogonal coordinate system of the PTZ camera apparatus 1 received from the PTZ camera apparatus 1, rotates the origin O.sub.C about the y axis by 90 degrees, and then rotates the origin O.sub.C about the z axis by degrees, so as to compute data regarding the coordinates (X,Y,Z) in the orthogonal coordinate system of the omnidirectional microphone array apparatus 2 (refer to Equation (6)). In addition, in Equation (6), a parallel movement or an order of integration in a rotation matrix of the coordinate axes is not particularly limited, and the content of each vector changes depending on rotation about a rotation axis and an order of parallel movement according to an approach to coordinate transform in mathematics.
[0197] The coordinate transform processing section 34z of the signal processing unit 33 of the directionality control apparatus 3 transforms the coordinates (X,Y,Z) in the orthogonal coordinate system of the omnidirectional microphone array apparatus 2 into coordinates (R,,) in the spherical coordinate system of the omnidirectional microphone array apparatus 2 according to Equation (7) based on the computation result using Equation (6). Thus, the directionality control apparatus 3 can compute the coordinates (,) in the spherical coordinate system of the omnidirectional microphone array apparatus 2 as the sound collection direction coordinates (,). Further, the output control section 35 of the signal processing unit 33 of the directionality control apparatus 3 forms the directionality in the sound collection direction coordinates (,).
[0198] In addition, the above-described example of computing the sound collection direction coordinates (,) has been described with the position of the PTZ camera apparatus 1 as a reference, but is not limited to a case of using the position of the PTZ camera apparatus 1 as a reference as illustrated in
[0199] Further, a description has been made of a case where the PTZ camera apparatus 1 sends data which is transformed into orthogonal coordinates, but data of the camera coordinate system may be transmitted to the directionality control apparatus 3, and the directionality control apparatus 3 may perform coordinate transform on the data.
[0200] Hereinafter, a description will be made of an example of a method in which the camera apparatus 11 and the omnidirectional microphone array apparatus 2 specifically compute a sound collection direction of the omnidirectional microphone array apparatus 2 for a sound position viewed from the camera apparatus 11 based on an actually measured value from the camera apparatus 11 to the sound position without a design drawing (layout diagram) of an installation space (for example, a room).
Method of Computing Coordinates (.SUB.MAh.,.SUB.MAv.) Indicating Sound Collection Direction of Microphone Array Apparatus 2
[0201] Next, with reference to
First Computation Method
[0202] In a first computation method, a reference point O is provided in a direction of the optical axis CX of the camera apparatus 11.
[0203] The signal processing unit 33 computes the coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction of the microphone array apparatus 2 based on:
[0204] (1) a horizontal component (direction) distance L.sub.CMh of a distance L.sub.CM between the camera apparatus 11 and the microphone array apparatus 2;
[0205] (2) a distance L.sub.CO and a depression angle .sub.CO from the camera apparatus 11 to the reference point O;
[0206] (3) a distance L.sub.MO and a depression angle .sub.MO from the microphone array apparatus 2 to the reference point O;
[0207] (4) respective heights H.sub.C, H.sub.M and H.sub.O of the camera apparatus 11, the microphone array apparatus 2, and the reference point O from a horizontal surface;
[0208] (5) a horizontal angle .sub.CAh and a vertical angle .sub.CAv from the camera apparatus 11 to the sound collection region central position A; and
[0209] (6) a height H.sub.A of the sound collection region central position A from the horizontal surface.
[0210] In the first computation method, the input parameters in step ST2 illustrated in
[0211] (1) the horizontal component (direction) distance L.sub.CMh of the distance L.sub.CM from the camera apparatus 11 to the microphone array apparatus 2;
[0212] (2) the distance L.sub.CO and the depression angle .sub.CO from the camera apparatus 11 to the reference point O;
[0213] (3) the distance L.sub.MO and the depression angle .sub.MO from the microphone array apparatus 2 to the reference point O; and
[0214] (4) the respective heights H.sub.C, H.sub.M and H.sub.O of the camera apparatus 11, the microphone array apparatus 2, and the reference point O from the horizontal surface.
[0215] (1) The horizontal component (direction) distance L.sub.CMh of the distance L.sub.CM from the camera apparatus 11 to the microphone array apparatus 2 is a fixed value defined when the camera apparatus 11 and the microphone array apparatus 2 are initially installed.
[0216] (2) The distance L.sub.CO and the depression angle .sub.CO from the camera apparatus 11 to the reference point O can be easily measured, for example, by the user causing a laser range finder to be directed toward the camera apparatus 11 at the position of the reference point O.
[0217] (3) The distance L.sub.MO and the depression angle .sub.MO from the microphone array apparatus 2 to the reference point O can be easily measured, for example, by the user causing the laser range finder to be directed toward the medical examination support system 2 at the position of the reference point O.
[0218] (4) The respective heights H.sub.C, H.sub.M and H.sub.O of the camera apparatus 11, the microphone array apparatus 2, and the reference point O from the horizontal surface are fixed values defined when the camera apparatus 11 and the microphone array apparatus 2 are initially installed, and are fixed values defined when the position of the reference point O is determined.
[0219] In addition, in the first computation method, in the coordinates (L.sub.CA,.sub.CAh,.sub.CAv) in step ST12 illustrated in
[0220] (5) the horizontal angle .sub.CAh and the vertical angle .sub.CAv from the camera apparatus 11 to the sound collection region central position A are used, and are acquired by using a function of a well-known technique of the camera apparatus 11.
[0221] In addition, in the first computation method,
[0222] (6) the height H.sub.A of the sound collection region central position A from the horizontal surface is a fixed value which is set in advance, and is a selected value or an input value with a size of a person as H.sub.A, for example, in a case where there is the person around the sound collection region central position A when the position A is designated with the finger FG of the user. Alternatively, when the position A is designated with the finger FG of the user, a default value (for example, 1.5 m or 0.8 m) may be used in a case where the directionality control apparatus 3 determines that there is a person (for example, an adult or a child) at the designated position.
[0223]
[0224] In addition, in the following description of the first computation method, the position A designated with the finger FG of the user is different from the position of the reference point O and is a position in the lower right direction of the reference point O (refer to
[0225]
[0226]
[0227]
[0228] Hereinafter, a detailed description will be made of the first computation method of the coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction of the microphone array apparatus 2 in the signal processing unit 33. In the first computation method, a reference line in a direction of 0 degrees of a horizontal angle of the microphone array apparatus 2 is directed toward the camera apparatus 11. In addition, a computation operation in the following description of each computation method will be described as being performed by the signal processing unit 33, but the signal processing unit 33 may be replaced with the coordinate transform processing section 34z.
[0229] The signal processing unit 33 computes a horizontal component distance L.sub.COh of the distance L.sub.CO from the camera apparatus 11 to the reference point O according to Equation (8) by using the distance L.sub.CO and the depression angle .sub.CO from the camera apparatus 11 to the reference point O.
[Equation 8]
L.sub.COh=L.sub.COcos .sub.CO (8)
[0230] The signal processing unit 33 computes a horizontal component distance L.sub.MOh of the distance L.sub.MO from the medical examination support system 2 to the reference point O according to Equation (9) by using the distance L.sub.MO and the depression angle .sub.MO from the microphone array apparatus 2 to the reference point O.
[Equation 9]
L.sub.MOh=L.sub.MOcos .sub.MO (9)
[0231] The signal processing unit 33 computes a cosine value cos .sub.COh of a horizontal angle .sub.COh of the depression angle .sub.CO from the camera apparatus 11 to the reference point O according to Equation (10) based on the cosine theorem for the triangle COM illustrated in
[0232] The signal processing unit 33 computes a cosine value cos .sub.MOh of a horizontal angle .sub.MOh of the depression angle .sub.MO from the camera apparatus 11 to the reference point O according to Equation (11) based on the cosine theorem for the triangle COM illustrated in
[0233] The signal processing unit 33 computes a vertical component distance L.sub.COv of the distance L.sub.CO from the camera apparatus 11 to the reference point O according to Equation (12) by using the respective heights H.sub.C and H.sub.O of the camera apparatus 11 and the reference point O from the horizontal surface and the respective computation results of Equations (8) and (10).
[Equation 12]
L.sub.COv={square root over ((L.sub.COhcos .sub.COh).sup.2+(H.sub.CH.sub.O).sup.2)}(12)
[0234] The signal processing unit 33 computes a vertical component distance L.sub.MOv of the distance L.sub.MO from the microphone array apparatus 2 to the reference point O according to Equation (13) by using the respective heights H.sub.M and H.sub.O of the microphone array apparatus 2 and the reference point O from the horizontal surface and the respective computation results of Equations (9) and (11).
[Equation 13]
L.sub.MOv={square root over ((L.sub.MOhcos .sub.MOh).sup.2+(H.sub.MH.sub.O).sup.2)}(13)
[0235] The signal processing unit 33 computes a vertical component distance L.sub.CMv (=L.sub.CM) of the distance L.sub.CM from the camera apparatus 11 to microphone array apparatus 2 according to Equation (14) by using the respective heights H.sub.C and H.sub.M of the camera apparatus 11 and the microphone array apparatus 2 from the horizontal surface and the horizontal component distance L.sub.CMh of the distance L.sub.CM from the camera apparatus 11 to the microphone array apparatus 2.
Equation [14]
L.sub.CMv=L.sub.CM={square root over (L.sub.CMh.sup.2+(H.sub.MH.sub.C).sup.2)}(14)
[0236] The signal processing unit 33 computes a cosine value cos .sub.COv of the vertical angle .sub.COv of the depression angle .sub.CO from the camera apparatus 11 to the reference point O according to Equation (15) based on the cosine theorem for the triangle COM illustrated in
[0237] The signal processing unit 33 computes a sine value sin .sub. of an angle .sub. between a direction which is directed toward the microphone array apparatus 2 from the camera apparatus 11 illustrated in
[0238] Next, the signal processing unit 33 computes a distance L.sub.CA from the camera apparatus 11 to the sound collection region central position A according to Equation (17) by using the respective computation results of Equations (15) and (16), the vertical angle .sub.CAv from the camera apparatus 11 to the sound collection region central position A, the height H.sub.A of the sound collection region central position A from the horizontal surface, and the height H.sub.C of the camera apparatus 11 from the horizontal surface.
[0239] The signal processing unit 33 computes a horizontal component distance L.sub.CAh of the distance L.sub.CA from the camera apparatus 11 to the sound collection region central position A according to Equation (18) by using the respective computation results of Equations (15) to (17) and the vertical angle .sub.CAv from the camera apparatus 11 to the sound collection region central position A.
[0240] The signal processing unit 33 computes a horizontal component distance L.sub.MAh of the distance from the microphone array apparatus 2 to the sound collection region central position A according to Equation (19) based on the cosine theorem for the triangle CAM illustrated in
[0241] The signal processing unit 33 computes a cosine value cos .sub.MAh of the horizontal angle .sub.MAh of the depression angle .sub.MA from the microphone array apparatus 2 to the sound collection region central position A according to Equation (20) based on the cosine theorem for the triangle CAM illustrated in
[0242] Consequently, the signal processing unit 33 can compute the horizontal angle .sub.MAh of the depression angle .sub.MA from the microphone array apparatus 2 to the sound collection region central position A according to Equation (21).
[0243] In addition, the signal processing unit 33 computes a tangent value tan .sub.MAv of the vertical angle .sub.MAv of the depression angle .sub.MA from the microphone array apparatus 2 to the sound collection region central position A according to Equation (22) based on a tangent for the triangle MAP illustrated in
[0244] Consequently, the signal processing unit 33 can compute the vertical angle .sub.MAv of the the depression angle .sub.MA from the microphone array apparatus 2 to the sound collection region central position A according to Equation (23).
[0245] In the above-described way, in the first computation method, the reference point O is provided in the direction of the optical axis CX of the camera apparatus 11, and the directionality control apparatus 3 uses, as the respective input parameters, the distance L.sub.CO and the depression angle .sub.CO between the camera apparatus 11 and the reference point O, the distance L.sub.MO and the depression angle .sub.MO between the microphone array apparatus 2 and the reference point O, the horizontal component distance L.sub.CMh of the distance L.sub.CM between the camera apparatus 11 and the microphone array apparatus 2, and the respective heights H.sub.C, H.sub.M and H.sub.O of the camera apparatus 11, the microphone array apparatus 2, and the reference point O from the horizontal surface.
[0246] Further, the directionality control apparatus 3 computes, as a sound collection direction, the coordinates (.sub.MAh,.sub.MAv) indicating a direction which is directed toward the sound collection region central position A corresponding to the position A designated with the finger FG of the user in a video of a predetermined region captured as a monitoring target of the camera apparatus 11, that is, a direction which is directed from the microphone array apparatus 2 toward the sound collection region central position A, with the position of the microphone array apparatus 2 as a reference, by using the respective input parameters.
[0247] Consequently, according to the first computation method, the directionality control system 10 of the present embodiment can form the sound collection directionality in the direction of the sound collection region central position A designated with the position of the microphone array apparatus 2 as a reference with high accuracy and can thus collect audio data in the corresponding direction with high accuracy.
Second Computation Method
[0248] In a second computation method, instead of the optical axis CX of the camera apparatus 11, a position of a marker MAK which is suspended vertically downward from the microphone array apparatus 2 by a string STR as the reference point O.
[0249] In addition, a length of the string STR is less than the height H.sub.M of the microphone array apparatus 2 from the horizontal surface. Further, the marker MAK is, for example, a ball in a color which can be easily recognized by the user when imaged by the camera apparatus 11.
[0250] The signal processing unit 33 computes the coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction of the microphone array apparatus 2 based on:
[0251] (1) a distance L.sub.CO from the camera apparatus 11 to the marker MAK which is suspended vertically downward from the microphone array apparatus 2;
[0252] (2) a distance L.sub.MO from the microphone array apparatus 2 to the marker MAK;
[0253] (3) a horizontal angle .sub.CMh (=.sub.COh) and a vertical angle .sub.COv between a direction of the optical axis CX of the camera apparatus 11 and a direction which is directed from the camera apparatus 11 toward the marker MAK; and
[0254] (4) a distance L.sub.CA, a horizontal angle .sub.CAh, and a vertical angle .sub.CAv from the camera apparatus 11 to the sound collection region central position A.
[0255] In the second computation method, the input parameters in step ST2 illustrated in
[0256] (1) the distance L.sub.CO from the camera apparatus 11 to the marker MAK which is suspended vertically downward from the microphone array apparatus 2;
[0257] (2) the distance L.sub.MO from the microphone array apparatus 2 to the marker MAK; and
[0258] (3) the horizontal angle .sub.CMh and the vertical angle .sub.COv between the direction of the optical axis CX of the camera apparatus 11 and the direction which is directed from the camera apparatus 11 toward the marker MAK.
[0259] (1) The distance L.sub.CO from the camera apparatus 11 to the marker MAK which is suspended vertically downward from the microphone array apparatus 2 is acquired by using a function of a well-known technique of the camera apparatus 11. For example, a focal length obtained when the camera apparatus 11 focuses and images the marker MAK may be used as L.sub.CO.
[0260] (2) The distance L.sub.MO from the microphone array apparatus 2 to the marker MAK is the same as the length of the string STR.
[0261] (3) The horizontal angle .sub.CMh and the vertical angle .sub.COv between the direction of the optical axis CX of the camera apparatus 11 and the direction which is directed from the camera apparatus 11 toward the marker MAK is acquired by using a function of a well-known technique of the camera apparatus 11.
[0262] In addition, in the second computation method,
[0263] (4) the distance L.sub.CA, the horizontal angle .sub.CAh, and the vertical angle .sub.CAv from the camera apparatus 11 to the sound collection region central position A are acquired by using a function of a well-known technique of the camera apparatus 11 in step ST12 illustrated in
[0264]
[0265] In addition, in the following description of the second computation method, the position A designated with the finger FG of the user is different from the positions of the reference point O and the central point of the screen indicating the optical axis direction and is a position in the lower right direction of the reference point O (refer to
[0266]
[0267]
[0268] Hereinafter, a detailed description will be made of the second computation method of the coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction of the microphone array apparatus 2 in the signal processing unit 33. In the second computation method, a reference line in a direction of 0 degrees of a horizontal angle of the microphone array apparatus 2 is directed toward the camera apparatus 11, and the respective heights H.sub.C and H.sub.M of the camera apparatus 11 and the microphone array apparatus 2 from the horizontal surface are assumed to be the same as each other.
[0269] The signal processing unit 33 computes a sine value sin .sub.COv of an angle .sub.COv between a direction which is directed toward the marker MAK from the camera apparatus 11 and a direction which is directed toward the microphone array apparatus 2 from the camera apparatus 11 according to Equation (24) by using the distance L.sub.CO from the camera apparatus 11 to the marker MAK which is suspended vertically downward from the microphone array apparatus 2, and the distance L.sub.MO from the microphone array apparatus 2 to the marker MAK.
[0270] The signal processing unit 33 computes a distance L.sub.CM between the camera apparatus 11 and the microphone array apparatus 2 according to Equation (25) by using the computation result of Equation (24) and the distance L.sub.CO from the camera apparatus 11 to the marker MAK which is suspended vertically downward from the microphone array apparatus 2.
[Equation 25]
L.sub.CM=L.sub.COcos .sub.COv (25)
[0271] The signal processing unit 33 computes a horizontal component distance L.sub.CAh of the the distance L.sub.CA from the camera apparatus 11 to the sound collection region central position A according to Equation (26) by using the respective computation results of Equations (24) and (25), the distance L.sub.CA and the vertical angle .sub.CAv from the camera apparatus 11 to the sound collection region central position A, and the vertical angle .sub.COv between the direction of the optical axis CX of the camera apparatus 11 and the direction which is directed from the camera apparatus 11 toward the marker MAK.
[Equation 26]
L.sub.CAh=L.sub.CAcos(.sub.COv+.sub.COv+.sub.CAv) (26)
[0272] The signal processing unit 33 computes a horizontal component distance L.sub.MAh of the distance from the microphone array apparatus 2 to the sound collection region central position A according to Equation (27) based on the cosine theorem for the triangle CAM illustrated in
[0273] The signal processing unit 33 computes a cosine value cos .sub.MAh of the horizontal angle .sub.MAh of the depression angle .sub.MA from the microphone array apparatus 2 to the sound collection region central position A according to Equation (28) based on the cosine theorem for the triangle CAM illustrated in
[0274] Consequently, the signal processing unit 33 can compute the horizontal angle .sub.MAh of the depression angle .sub.MA from the microphone array apparatus 2 to the sound collection region central position A according to Equation (29).
[0275] The signal processing unit 33 computes a vertical component distance L.sub.CAv of the the distance L.sub.CA from the camera apparatus 11 to the sound collection region central position A according to Equation (30) by using the distance L.sub.CA and the horizontal angle .sub.CAh from the camera apparatus 11 to the sound collection region central position A, and the horizontal angle .sub.CMh between the direction of the optical axis CX of the camera apparatus 11 and the direction which is directed from the camera apparatus 11 toward the marker MAK.
[Equation 30]
L.sub.CAv=L.sub.CAcos(.sub.CMh+.sub.CAh) (30)
[0276] The signal processing unit 33 computes a vertical component distance L.sub.MAv from the microphone array apparatus 2 to the sound collection region central position A according to Equation (31) based on the cosine theorem for the triangle CAM illustrated in
[0277] The signal processing unit 33 computes a cosine value cos .sub.MAv of the angle .sub.MAv between the direction which is directed from the microphone array apparatus 2 toward the camera apparatus 11 and the direction which is directed from the microphone array apparatus 2 toward the sound collection region central position A according to Equation (32) based on the cosine theorem for the triangle MAP illustrated in
[0278] The signal processing unit 33 computes a sine value sin .sub.MAv for the angle .sub.MAv between the direction which is directed from the microphone array apparatus 2 toward the camera apparatus 11 and the direction which is directed from the microphone array apparatus 2 toward the sound collection region central position A according to Equation (33) based on a sine for the triangle MAS illustrated in
[0279] The signal processing unit 33 computes a tangent value tan .sub.MAv of the vertical angle .sub.MAv of the depression angle .sub.MA from the microphone array apparatus 2 to the sound collection region central position A according to Equation (34) by using the respective computation results of Equations (27) and (33).
[0280] Consequently, the signal processing unit 33 can compute the vertical angle .sub.MAv of the the depression angle .sub.MA from the microphone array apparatus 2 to the sound collection region central position A according to Equation (35).
[0281] In the above-described way, in the second computation method, the marker MAK which is suspended vertically downward from the microphone array apparatus 2 by using the string is provided as the reference point O, and the directionality control apparatus 3 uses, as the respective input parameters, the distance L.sub.CO and the vertical angle .sub.COv of the depression angle .sub.CO from the camera apparatus 11 to the marker MAK, the angle .sub.CMh between the direction of the optical axis CX of the camera apparatus 11 and the direction which is directed from the camera apparatus 11 toward the medical examination support system 2, and the distance L.sub.MO from the microphone array apparatus 2 to the marker MAK.
[0282] Further, the directionality control apparatus 3 computes, as a sound collection direction, the coordinates (.sub.MAh,.sub.MAv) indicating a direction which is directed toward the sound collection region central position A corresponding to the position A designated with the finger FG of the user in a video of a predetermined region captured as a monitoring target of the camera apparatus 11, that is, a direction which is directed from the microphone array apparatus 2 toward the sound collection region central position A, with the position of the microphone array apparatus 2 as a reference, by using the respective input parameters.
[0283] Consequently, according to the second computation method, the directionality control system 10 of the present embodiment can more easily compute the coordinates (.sub.MAh,.sub.MAv) indicating a direction which is directed toward the sound collection region central position A from the microphone array apparatus 2 since the number of input parameters is smaller than that in the first computation method. In addition, the directionality control system 10 can form the sound collection directionality in the direction of the sound collection region central position A designated with the position of the microphone array apparatus 2 as a reference with high accuracy and can thus collect audio data in the corresponding direction with high accuracy.
[0284] In addition, in the second computation method, a position of the marker MAK may be a position of an intersection between the direction of the optical axis CX of the camera apparatus 11 and the vertically downward direction of the microphone array apparatus 2. In this case, since the angle .sub.COv between the direction of the optical axis CX and the direction which is directed from the camera apparatus 11 toward the marker MAK is 0 (zero), an amount of computation of the coordinates (.sub.MAh,.sub.MAv) indicating the direction which is directed from the microphone array apparatus 2 toward the sound collection region central position A is reduced in the signal processing unit 33.
[0285] Further, in the second computation method, in a case where the microphone array apparatus 2 has a function of performing irradiation with laser light, the microphone array apparatus 2 performs irradiation with laser light vertically downward without using the marker MAK, and the camera apparatus 11 analyzes a captured image of an irradiation point (for example, a point with a certain height in the vertically downward direction of the microphone array apparatus 2) of the laser light. Thus, the signal processing unit 33 can compute the coordinates (.sub.MAh,.sub.MAv) indicating the direction which is directed from the microphone array apparatus 2 toward the sound collection region central position A in the same manner as in the above-described second computation method.
Third Computation Method
[0286] In a third computation method, a sound source is provided as the reference point O in the direction of the optical axis CX of the camera apparatus 11.
[0287] The signal processing unit 33 computes the coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction of the microphone array apparatus 2 based on:
[0288] (1) a distance L.sub.CM from the camera apparatus 11 to the microphone array apparatus 2;
[0289] (2) a distance L.sub.CO from the camera apparatus 11 to a predetermined sound source position (reference point O) in the optical axis direction;
[0290] (3) a horizontal angle .sub.MOh and a vertical angle .sub.MOv from the microphone array apparatus 2 to the sound source position (reference point O);
[0291] (4) respective heights H.sub.C, H.sub.M and H.sub.O of the camera apparatus 11, the microphone array apparatus 2, and the sound source position (reference point O) from a horizontal surface; and
[0292] (5) a distance L.sub.CA, a horizontal angle .sub.CAh, and a vertical angle .sub.CAv from the camera apparatus 11 to the sound collection region central position A.
[0293] In the third computation method, the input parameters in step ST2 illustrated in
[0294] (1) the distance L.sub.CM from the camera apparatus 11 to the microphone array apparatus 2;
[0295] (2) the distance L.sub.CO from the camera apparatus 11 to the predetermined sound source position (reference point O) in the optical axis direction;
[0296] (3) the horizontal angle .sub.MOh and the vertical angle .sub.MOv from the microphone array apparatus 2 to the sound source position (reference point O); and
[0297] (4) the respective heights H.sub.C, H.sub.M and H.sub.O of the camera apparatus 11, the microphone array apparatus 2, and the sound source position (reference point O) from the horizontal surface.
[0298] (1) The distance L.sub.CM from the camera apparatus 11 to the microphone array apparatus 2 is a fixed value defined when the camera apparatus 11 and the microphone array apparatus 2 are initially installed.
[0299] (2) The distance L.sub.CO from the camera apparatus 11 to the predetermined sound source position (reference point O) in the optical axis direction can be easily measured, for example, by the user causing a laser range finder to be directed toward the camera apparatus 11 at the position of the reference point O.
[0300] (3) The horizontal angle .sub.MOh and the vertical angle .sub.MOv from the microphone array apparatus 2 to the sound source position (reference point O) may be measured by using a function (for example, a sound source detection function) of a well-known technique of the microphone array apparatus 2.
[0301] (4) The respective heights H.sub.C, H.sub.M and H.sub.O of the camera apparatus 11, the microphone array apparatus 2, and the reference point O from the horizontal surface are fixed values defined when the camera apparatus 11 and the microphone array apparatus 2 are initially installed, and are fixed values defined when the position of the reference point O is determined.
[0302] In addition, in the third computation method,
[0303] (5) the distance L.sub.CA, the horizontal angle .sub.CAh, and the vertical angle .sub.CAv from the camera apparatus 11 to the sound collection region central position A are are acquired by using a function of a well-known technique of the camera apparatus 11 in step ST12 illustrated in
[0304] Further, in the third computation method, since the sound source serving as the reference point O is present in the direction of the optical axis CX of the camera apparatus 11, a positional relationship between the reference point O and the designated position A for computing a sound collection direction of the microphone array apparatus 2 on a screen of the display device 36 is the positional relationship illustrated in
[0305]
[0306]
[0307]
[0308] Hereinafter, a detailed description will be made of the third computation method of the coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction of the microphone array apparatus 2 in the signal processing unit 33. In the third computation method, a reference line in a direction of 0 degrees of a horizontal angle of the microphone array apparatus 2 is not directed toward the camera apparatus 11, and the respective heights H.sub.C and H.sub.M of the camera apparatus 11 and the microphone array apparatus 2 from the horizontal surface are assumed to be the same as each other.
[0309] First, the signal processing unit 33 computes a horizontal component distance L.sub.COh of the distance L.sub.CO from the camera apparatus 11 to the predetermined sound source position (reference point O) in the optical axis direction according to Equation (36) by using the distance L.sub.CO from the camera apparatus 11 to the predetermined sound source position (reference point O) in the optical axis direction, and the respective heights H.sub.C and H.sub.O of the camera apparatus 11 and the sound source position (reference point O) in the optical axis direction from the horizontal surface.
[Equation 36]
L.sub.COh={square root over ((L.sub.CO).sup.2(H.sub.CH.sub.O).sup.2)}(36)
[0310] The signal processing unit 33 computes a horizontal component distance L.sub.MOh of the distance L.sub.MO from the microphone array apparatus 2 to the predetermined sound source position (reference point O) in the optical axis direction according to Equation (37) by using the vertical angle .sub.MOv from the microphone array apparatus 2 to the sound source position (reference point O), and the respective heights H.sub.M and H.sub.O of the microphone array apparatus 2 and the sound source position (reference point O) from the horizontal surface.
[0311] The signal processing unit 33 computes a cosine value cos .sub.COh of a horizontal angle .sub.COh of the depression angle .sub.CO from the camera apparatus 11 to the sound source (reference point O) according to Equation (38) based on the cosine theorem for the triangle COM illustrated in
[0312] Consequently, the signal processing unit 33 can compute an angle .sub.MCh between the reference line for the 0-degree direction of the microphone array apparatus 2 and the direction which is directed from the microphone array apparatus 2 toward the camera apparatus 11 according to Equation (40).
[0313] Next, the signal processing unit 33 computes a vertical component distance L.sub.COv of the distance L.sub.CO from the camera apparatus 11 to the sound source position (reference point O) according to Equation (41) by using the respective computation results of Equations (36) and (38) and the respective heights H.sub.C and H.sub.O of the camera apparatus 11 and the sound source (reference point O) from the horizontal surface.
[Equation 41]
L.sub.COv={square root over ((L.sub.COhcos .sub.COh).sup.2+(H.sub.CH.sub.O).sup.2)}(41)
[0314] The signal processing unit 33 computes a vertical component distance L.sub.MOv of the distance L.sub.MO from the microphone array apparatus 2 to the sound source (reference point O) according to Equation (42) by using the respective computation results of Equations (37) and (39) and the respective heights H.sub.M and H.sub.O of the microphone array apparatus 2 and the sound source position (reference point O) from the horizontal surface.
[Equation 42]
L.sub.MOv={square root over ((L.sub.MOhcos(.sub.MOh.sub.MCh)).sup.2+(H.sub.MH.sub.O).sup.2)}(42)
[0315] The signal processing unit 33 computes a cosine value cos .sub.COv of the angle .sub.COv between the direction which is directed from the camera apparatus 11 toward sound source (reference point O) and the direction which is directed from the camera apparatus 11 toward the microphone array apparatus 2 according to Equation (43) based on the cosine theorem for the triangle COM illustrated in
[0316] Next, the signal processing unit 33 computes a horizontal component distance L.sub.CAh of the distance L.sub.CA from the camera apparatus 11 to the sound collection region central position A according to Equation (44) by using the computation result of Equation (43), and the distance L.sub.CA and the vertical angle .sub.CAv from the camera apparatus 11 to the sound collection region central position A.
[Equation 44]
L.sub.CAh=L.sub.CAcos(.sub.COv+.sub.CAv) tm (44)
[0317] The signal processing unit 33 computes a horizontal component distance L.sub.MAh of the distance L.sub.MA from the microphone array apparatus 2 to the sound collection region central position A according to Equation (45) based on the cosine theorem for the triangle CAM illustrated in
[0318] The signal processing unit 33 computes a cosine value cos(.sub.MAh.sub.MCh) of a horizontal angle (.sub.MAh.sub.MCh) of the depression angle (.sub.MA.sub.MC) in a direction which is directed toward the sound collection region central position A from the reference line for the 0-degree direction of the microphone array apparatus 2 according to Equation (46) based on the cosine theorem for the triangle CAM illustrated in
[0319] Consequently, the signal processing unit 33 can compute a horizontal angle .sub.MAh of the depression angle .sub.MA in the direction which is directed from the reference line for the 0-degree direction of the microphone array apparatus 2 toward the sound collection region central position A according to Equation (47).
[0320] Further, the signal processing unit 33 computes a vertical component distance L.sub.CAh of the distance L.sub.CA from the camera apparatus 11 to the sound collection region central position A according to Equation (48) by using the computation result of Equations (40), and the distance L.sub.CA and the horizontal angle .sub.CAh from the camera apparatus 11 to the sound collection region central position A.
[Equation 48]
L.sub.CAv=L.sub.CAcos(.sub.MCh+.sub.CAh) (48)
[0321] The signal processing unit 33 computes a vertical component distance L.sub.MAv of the distance L.sub.MA from the microphone array apparatus 2 to the sound collection region central position A according to Equation (49) based on the cosine theorem for the triangle CAM illustrated in
[0322] The signal processing unit 33 computes a cosine value cos .sub.MAv of the angle .sub.MAv between the direction which is directed from the microphone array apparatus 2 toward the sound collection region central position A and the direction which is directed from the microphone array apparatus 2 toward the camera apparatus 11 in the K-K section illustrated in
[0323] The signal processing unit 33 computes a difference (H.sub.MH.sub.A) between the heights H.sub.M and H.sub.A of the microphone array apparatus 2 and the sound collection region central position A from the horizontal surface according to Equation (51) by using the respective computation results of Equations (49) and (50).
[Equation 51]
H.sub.MH.sub.A=L.sub.MAvsin .sub.MAv (51)
[0324] The signal processing unit 33 computes a tangent value tan .sub.MAv of the vertical angle .sub.MAv of the depression angle .sub.MA which is directed toward the sound collection region central position A from the reference line for the 0-degree direction of the microphone array apparatus 2 according to Equation (52) based on a tangent for the triangle MSA illustrated in
[0325] Consequently, the signal processing unit 33 can compute the vertical angle .sub.MAv of the depression angle .sub.MA from the reference line for the 0-degree direction of the microphone array apparatus 2 to the sound collection region central position A according to Equation (53).
[0326] In the above-described way, in the third computation method, the sound source is provided as the reference point O in the direction of the optical axis CX of the camera apparatus 11, and the directionality control apparatus 3 uses, as the respective input parameters, the distance L.sub.CO from the camera apparatus 11 to the sound source (reference point O), the horizontal angle .sub.MOh and the vertical angle .sub.MOv of the depression angle .sub.MO from the microphone array apparatus 2 to the sound source (reference point O), the distance L.sub.CM from the camera apparatus 11 to the microphone array apparatus 2, and the respective heights H.sub.C, H.sub.M and H.sub.O of the camera apparatus 11, the microphone array apparatus 2, and the reference point O from the horizontal surface.
[0327] Further, the directionality control apparatus 3 computes, as a sound collection direction, the coordinates (.sub.MAh,.sub.MAv) indicating a direction which is directed toward the sound collection region central position A corresponding to the position A designated with the finger FG of the user in a video of a predetermined region captured as a monitoring target of the camera apparatus 11, that is, a direction which is directed from the microphone array apparatus 2 toward the sound collection region central position A, with the position of the microphone array apparatus 2 as a reference, by using the respective input parameters.
[0328] Consequently, according to the third computation method, even if the reference line for the 0-degree direction of the microphone array apparatus 2 is not set in advance in the direction toward the camera apparatus 11, the directionality control system 10 of the present embodiment can form the sound collection directionality in the direction of the sound collection region central position A designated with the position of the microphone array apparatus 2 as a reference with high accuracy and can thus collect audio data in the corresponding direction with high accuracy.
[0329] In addition, in each of the first to third computation methods, in order to simplify calculation of coordinates indicating a sound collection direction of the microphone array apparatus 2, a vertical angle and a horizontal angle representing the coordinates are computed in an approximate manner in some portions, but may be more accurately computed by using, for example, a geometric positional relationship.
Fourth Computation Method
[0330] In a fourth computation method, the camera apparatus 11 and the microphone array apparatus 2 are connected and fixed to each other by using a dedicated tool 50 and are installed on, for example, an indoor ceiling surface (refer to
[0331] The dedicated tool 50 connects and fixes the microphone array apparatus 2 and the camera apparatus 11 to each other so that, for example, a reference line for the 0-degree direction of the microphone array apparatus 2 opposes, for example, a reference line for the 0-degree direction of the camera apparatus 11 and heights thereof from a horizontal surface are the same as each other. In addition, a shape of the dedicated tool 50 is not particularly limited as long as the camera apparatus 11 and the microphone array apparatus 2 are connected and fixed to each other with a predetermined horizontal angle and vertical angle.
[0332] Since the camera apparatus 11 and the microphone array apparatus 2 are connected and fixed to each other by using the dedicated tool 50, in the fourth computation method, the distance L.sub.CM between the camera apparatus 11 and the microphone array apparatus 2 is a fixed value corresponding to a parameter of the dedicated tool.
[0333] The signal processing unit 33 computes the coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction of the microphone array apparatus 2 based on:
[0334] (1) a length of the dedicated tool 50; and
[0335] (2) a distance L.sub.CA, a horizontal angle .sub.CAh, and a vertical angle .sub.CAv from the camera apparatus 11 to the sound collection region central position (point A).
[0336] In the fourth computation method, the input parameters in step ST2 illustrated in
[0337] (1) the length of the dedicated tool 50, that is, information regarding the type of dedicated tool 50 to be used. For example, if the length of the dedicated tool 50 is 5 m, the distance L.sub.CM between the camera apparatus 11 and the microphone array apparatus 2 is also 5 m, and if the length of the dedicated tool 50 is 10 m, the distance L.sub.CM between the camera apparatus 11 and the microphone array apparatus 2 is also 10 m.
[0338] In addition, in the fourth computation method,
[0339] (2) the distance L.sub.CA, the horizontal angle .sub.CAh, and the vertical angle .sub.CAv from the camera apparatus 11 to the sound collection region central position A are acquired by using a function of a well-known technique of the camera apparatus 11 in step ST12 illustrated in
[0340]
[0341]
[0342] Hereinafter, a detailed description will be made of the fourth computation method of the coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction of the microphone array apparatus 2 in the signal processing unit 33.
[0343] The signal processing unit 33 computes a horizontal component distance L.sub.CAh of the the distance L.sub.CA from the camera apparatus 11 to the sound collection region central position A according to Equation (54) by using the distance L.sub.CA and the vertical angle .sub.CAv from the camera apparatus 11 to the sound collection region central position A.
[Equation 54]
L.sub.CAh=L.sub.CAcos .sub.CAv (54)
[0344] The signal processing unit 33 computes a horizontal component distance L.sub.MAh of the distance from the microphone array apparatus 2 to the sound collection region central position A according to Equation (55) based on the cosine theorem for the triangle CAM illustrated in
[Equation 55]
L.sub.MAh={square root over ((L.sub.CAh.sup.2+L.sub.CM.sup.22L.sub.CAhL.sub.CMcos .sub.CAh)}(55)
[0345] The signal processing unit 33 computes a cosine value cos .sub.MAh of the horizontal angle .sub.MAh of the depression angle .sub.MA from the microphone array apparatus 2 to the sound collection region central position A according to Equation (56) based on the cosine theorem for the triangle CAM illustrated in
[0346] Consequently, the signal processing unit 33 can compute the horizontal angle .sub.MAh of the depression angle .sub.MA from the microphone array apparatus 2 to the sound collection region central position A according to Equation (57).
[0347] The signal processing unit 33 computes a difference (H.sub.MH.sub.A) between the heights H.sub.M and H.sub.A of the microphone array apparatus 2 and the sound collection region central position A from the horizontal surface according to Equation (58) based on a sine for the triangle CAP illustrated in
[Equation 58]
H.sub.MH.sub.A=H.sub.CH.sub.A=L.sub.CAsin .sub.CAv (58)
[0348] The signal processing unit 33 computes a tangent value tan .sub.MAv of the vertical angle .sub.MAv of the depression angle .sub.MA which is directed toward the sound collection region central position A from the microphone array apparatus 2 according to Equation (59) based on a tangent for the triangle MAS illustrated in
[0349] Consequently, the signal processing unit 33 can compute the vertical angle .sub.MAv of the depression angle .sub.MA from the microphone array apparatus 2 to the sound collection region central position A according to Equation (60).
[0350] In the above-described way, in the fourth computation method, the microphone array apparatus 2 and the camera apparatus 11 are connected and fixed to each other by using the dedicated tool 50, and are installed on, for example, an indoor ceiling surface, and, in this state, the directionality control apparatus 3 uses the length of the dedicated tool 50, that is, the distance L.sub.CM between the microphone array apparatus 2 and the camera apparatus 11 as the input parameter.
[0351] Further, the directionality control apparatus 3 computes, as a sound collection direction, the coordinates (.sub.MAh,.sub.MAv) indicating a direction which is directed toward the sound collection region central position A corresponding to the position A designated with the finger FG of the user in a video of a predetermined region captured as a monitoring target of the camera apparatus 11, that is, a direction which is directed from the microphone array apparatus 2 toward the sound collection region central position A, with the position of the microphone array apparatus 2 as a reference, by using the respective input parameter.
[0352] Consequently, according to the fourth computation method, the directionality control system 10 of the present embodiment can more easily compute the coordinates (.sub.MAh,.sub.MAv) indicating a direction which is directed from the microphone array apparatus 2 toward the sound collection region central position A since the number of input parameters is smaller than that in the first to third computation methods and the reference lines for the 0-degree direction of the camera apparatus 11 and the microphone array apparatus 2 oppose each other. In addition, the directionality control system 10 can form the sound collection directionality in the direction of the sound collection region central position A designated with the position of the microphone array apparatus 2 as a reference with high accuracy and can thus collect audio data in the corresponding direction with high accuracy.
[0353] In addition, in the above-described respective embodiments, a description has been made a case where a timing at which the directionality control apparatus 3 starts to compute the coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction of the microphone array apparatus 2 is the time when any position A or region B on a screen of the display device 36 is designated with the finger FG of the user, but the timing is not limited thereto.
[0354] For example, the timing may be the time when ranges of the angles of view CAR of the camera apparatuses 11 to 1n change due to the camera apparatuses 11 to 1n being rotated in panning directions, tilting directions, or the panning and tilting directions at predetermined intervals which are predefined. Consequently, if a sound collection direction is predefined, the directionality control system 10 can form the sound collection directionality in the predefined sound collection direction without the user designating a sound collection direction of the microphone array apparatus 2.
[0355] In the above-described respective computation methods, the description has been made based on the microphone array apparatus 2 being installed along the surface parallel to the horizontal surface. However, there is a case where a ceiling surface on which the microphone array apparatus 2 is installed may be obliquely tilted.
[0356] In this case, the horizontal angle .sub.MAh and the vertical angle .sub.MAv of a sound collection direction of the microphone array apparatus 2 are required to be corrected by an angle with which the ceiling surface is tilted since values thereof computed by the signal processing unit 33 according to the first to fourth computation methods cannot be used without being changed.
[0357] Hereinafter, with reference to
[0358] For example, a vertical angle of a sound collection direction of the microphone array apparatus 2 computed by the signal processing unit 33 according to each of the above-described first to fourth computation methods is a vertical angle .sub.MAv of a sound collection direction with respect to a plane HR parallel to the horizontal surface. In
[0359] For this reason, the signal processing unit 33 computes the vertical angle .sub.MAv and further computes the tilt angle .sub.Mv of the indoor ceiling surface. Specifically, the signal processing unit 33 computes a cosine value cos .sub.Mv of the tilt angle .sub.Mv of the indoor cesf according to Equation (61) by using the horizontal component distance L.sub.CMh of the distance L.sub.CM from the camera apparatus 11 to the microphone array apparatus 2 and the respective heights H.sub.C and H.sub.M of the camera apparatus 11 and the microphone array apparatus 2 from the horizontal surface.
[0360] Consequently, the signal processing unit 33 computes the tilt angle .sub.Mv of the indoor ceiling surface by using the computation result of Equation (61) (refer to Equation (62)).
[0361] The signal processing unit 33 computes the vertical angle .sub.MAv of the sound collection direction of the microphone array apparatus 2 according to Equation (63) by using the computation result of Equation (62) and the vertical angle .sub.MAv which is computed according to each of the above-described first to fourth computation methods.
[0362] Consequently, even in a case where the indoor ceiling surface on which the microphone array apparatus 2 is installed is not parallel to the horizontal surface but is tilted with a predetermined angle, the directionality control apparatus 3 can appropriately compute a sound collection direction of the microphone array apparatus 2, can form the sound collection directionality in the direction of the sound collection region central position A designated with the position of the microphone array apparatus 2 as a reference with high accuracy, and can thus collect audio data in the corresponding direction with high accuracy.
[0363] Each of second to fifth embodiments described below relates to a calibration method of matching a reference direction of a horizontal angle of an imaging direction from the camera apparatus with a reference direction of a horizontal angle of a sound collection direction from the microphone array apparatus.
[0364] Here, there is a control apparatus which controls an operation of each of a camera apparatus and a microphone array apparatus and obtains audio data in a direction in which the camera apparatus performs imaging (for example, refer to a reference Patent Literature 1). The control apparatus disclosed in Patent Literature 1 controls an operation of each, for example, the camera apparatus which is installed in a conference room which is available to a TV conference system and can acquire omnidirectional images and the microphone array apparatus which can change a sound collection region.
[0365] The control apparatus disclosed in the reference Patent Literature 1 detects a direction of a speaker based on voice, compresses a data amount by cutting out only a region centering on the speaker from an omnidirectional image, and directs a beam direction of the microphone array apparatus toward the speaker so as to reduce noise other than conversations of the speaker.
[0366] (Reference Patent Literature 1) Japanese Patent No. 4252377
[0367] For example, in a case where the camera apparatus and the microphone array apparatus are integrally assembled and are disposed on the same axis, an optical axis of the camera apparatus and a physical central axis of the microphone array apparatus are common to each other. Therefore, in a case where the microphone array apparatus collects conversation voice of a subject in a video captured by the camera apparatus, a vertical angle of coordinates (horizontal angle, vertical angle) indicating a sound collection direction of the microphone array apparatus is the same as a vertical angle of coordinates (horizontal angle, vertical angle) indicating an imaging direction of the camera apparatus.
[0368] However, in a case where the camera apparatus and the microphone array apparatus are integrally assembled and are disposed on the same axis, and the camera apparatus and the microphone array apparatus are used through a combination of separate operations thereof, the horizontal angle of of the coordinates (horizontal angle, vertical angle) indicating the sound collection direction of the microphone array apparatus is the same as the horizontal angle of the coordinates (horizontal angle, vertical angle) indicating the imaging direction of the camera apparatus only when reference directions (for example, a 0 direction) of the horizontal angles of the microphone array apparatus and the camera apparatus match each other.
[0369] In the above-described reference Patent Literature 1, the camera apparatus and the microphone array apparatus are integrally connected to each other via a cylinder which is acoustically transparent, but it is not disclosed that reference directions of the horizontal angles of the camera apparatus and the microphone array apparatus match each other. In addition, the camera apparatus and the microphone array apparatus do not have a structure in which the apparatuses can be used separately from each other.
[0370] Therefore, when the camera apparatus and the microphone array apparatus which operate separately from each other are used integrally with each other, if the reference directions of the horizontal angles of the camera apparatus and the microphone array apparatus do not match each other, an imaging region of the camera apparatus in the panning direction and a sound collection region of the microphone array apparatus do not match each other. For this reason, there is a problem in that conversation voice of a subject in a video captured by the camera apparatus is not appropriately collected by the microphone array apparatus.
[0371] Therefore, in each of the second to fifth embodiments related to the present invention, in order to solve the above-described problem, a description will be made of examples of a calibration method of matching a reference directions of a horizontal angle of coordinates indicating an imaging region of the camera apparatus with a reference direction of a horizontal angle of coordinates indicating a sound collection direction of the microphone array apparatus in a case where the camera apparatus and the microphone array apparatus are integrally used.
[0372] Hereinafter, each of the second to fifth embodiments of a calibration method related to the present invention will be described with reference to the drawings. The following calibration method of each embodiment is, for example, a method of matching a horizontal angle of coordinates indicating an imaging region of an omnidirectional camera apparatus and a horizontal angle of coordinates indicating a sound collection direction of an omnidirectional microphone array apparatus in a sound collection system in which the omnidirectional camera apparatus and the omnidirectional microphone array apparatus are integrally disposed on the same axis. The sound collection system of each embodiment is installed on an installation surface (for example, a ceiling surface of an event hall) in a predetermined sound collection space.
Second Embodiment
[0373] A calibration method of the second embodiment will be described with reference to
[0374] The omnidirectional camera apparatus 3z includes a casing into which an optical system (for example, a fish-eye lens) and an imaging unit (for example, an image sensor) (not illustrated) are built and which is covered with a dome-shaped transparent cover 3a, and is fitted into an inner circumferential space inside an opening 13 which is formed at the center of a casing of the omnidirectional microphone array apparatus 5. The omnidirectional camera apparatus 3z, which has a function of, for example, a monitoring camera, is connected to a host computer (not illustrated) of a central control room via a network (not illustrated), and displays omnidirectional videos on a monitor (not illustrated). In addition, the omnidirectional camera apparatus 3z may cut out a video in a designated direction and may display the video on the monitor (not illustrated) in response to a remote operation from the host computer.
[0375] The omnidirectional microphone array apparatus 5 includes a ring-shaped casing 17 in which the casing of the omnidirectional camera apparatus 3z is fitted into the inner circumferential space inside the opening 13, and a plurality of microphone units 18 are disposed in a concentric shape around the opening 13 in a circumferential direction of the casing 17. The microphone unit 18 employs, for example, a high-quality small-sized electret condenser microphone (ECM), and this is also the same for the following respective embodiments. The omnidirectional microphone array apparatus 5 forms the directionality in a predetermined sound collection direction, and emphasizes and collects sound in the sound collection direction in which the directionality is formed.
[0376] In addition, in the respective embodiments including the present embodiment, configurations and operations of the omnidirectional camera apparatus and the omnidirectional microphone array apparatus are the same as the above-described content, but different content in each embodiment will be described as necessary.
[0377] Here, in a case of installing the sound collection system 1Z in which the omnidirectional microphone array apparatus 5 and the omnidirectional camera apparatus 3z are integrally attached to each other in an i direction on the same axis, it is necessary to match reference directions (for example, a 0 direction) of horizontal angles of the omnidirectional camera apparatus 3z and the omnidirectional microphone array apparatus 5 with each other in order to align an imaging region of the omnidirectional camera apparatus 3z with a sound collection direction of the omnidirectional microphone array apparatus 5.
[0378] In the present embodiment, in order to match the reference directions of the horizontal angles of the omnidirectional camera apparatus 3z and the omnidirectional microphone array apparatus 5 with each other on a plane perpendicular to the i direction on the same axis, a key 11y as an example of an engagement member which protrudes upwardly in
[0379] A key groove 15 as an engagement groove to the key 11y is fitted when the cover 3a of the omnidirectional camera apparatus 3z is inserted into the opening 13 is formed at a circumferential edge of the opening 13 of the omnidirectional microphone array apparatus 5. The key groove 15 is formed in a reference direction h of the horizontal angle of the coordinates (horizontal angle, vertical angle) indicating the sound collection direction of the omnidirectional microphone array apparatus 5, that is, in the direction of the horizontal angle of 0. In addition, in the present embodiment, a sectional shape of each of the key 11y and the key groove 15 is a rectangular shape but may be a polygonal shape or a semicircular shape other than the rectangular shape.
[0380] In a case where the omnidirectional camera apparatus 3z and the omnidirectional microphone array apparatus 5 are installed in the i direction on the same axis, the omnidirectional camera apparatus 3z is fitted into the inner circumferential space inside the opening 13 of the omnidirectional microphone array apparatus 5 so that the key 11y is fitted to the key groove 15, and thus the omnidirectional camera apparatus 3z and the omnidirectional microphone array apparatus 5 are integrally combined with each other.
[0381] In the above-described way, in the calibration method of the present embodiment, the omnidirectional camera apparatus 3z is fitted into the inner circumferential space inside the opening 13 of the omnidirectional microphone array apparatus 5 so that, for example, the key 11y is fitted to the key groove 15, and thus it is possible to easily match the reference direction g of the horizontal angle of the omnidirectional camera apparatus 3z with the reference direction h of the horizontal angle of the omnidirectional microphone array apparatus 5 and therefore to alleviate a restriction when the omnidirectional camera apparatus 3z and the omnidirectional microphone array apparatus 5 are installed. As a result, in the sound collection system 1Z of the present embodiment, the omnidirectional microphone array apparatus 5 can collect sound in the sound collection direction with high accuracy by using the same horizontal angle as the horizontal angle of the coordinates (horizontal angle, vertical angle) indicating the imaging region of the omnidirectional camera apparatus 3z.
[0382] In addition, the present embodiment relates to a case where the omnidirectional camera apparatus 3z and the omnidirectional microphone array apparatus 5 are integrally provided by fitting the omnidirectional camera apparatus 3z into the inner circumferential space inside the opening 13 of the omnidirectional microphone array apparatus 5.
[0383] Further, the omnidirectional camera apparatus 3z illustrated in the present embodiment is a camera apparatus which includes a fish-eye lens and can capture an image in all directions, but may be a camera apparatus with a hemispheric transparent dome shape which has a rotation function in a panning direction, a rotation function in a tilting direction, and a zooming function.
Third Embodiment
[0384] Next, a calibration method of the third embodiment will be described with reference to
[0385] In addition, in the respective embodiments including the present embodiment, the same constituent elements as those in the above-described second embodiment are given the same reference numerals so that description thereof will be omitted or made briefly, and different content will be described.
[0386] In the present embodiment, for example, a triangular marker 21z illustrated in
[0387] Further, a similar triangular marker 23z is added at a position facing the marker 21z when the cover 3a of the omnidirectional camera apparatus 3AZ is inserted into the opening 13 on the circumferential edge of the opening 13 of the omnidirectional microphone array apparatus 5A. In addition, a shape of the marker 23z is not limited to a triangular shape and may be, for example, a rectangular shape. The marker 23z is added in the reference direction h of the horizontal angle of the omnidirectional microphone array apparatus 5A, that is, in the direction of the horizontal angle of 0.
[0388] In a case where the omnidirectional camera apparatus 3AZ and the omnidirectional microphone array apparatus 5A are installed in the i direction on the same axis, the omnidirectional camera apparatus 3AZ is fitted into the inner circumferential space inside the opening 13 of the omnidirectional microphone array apparatus 5A so that the tip of the marker 21z faces the tip of the marker 23z, and thus the omnidirectional camera apparatus 3AZ and the omnidirectional microphone array apparatus 5A are integrally combined with each other.
[0389] In the above-described way, in the calibration method of the present embodiment, the omnidirectional camera apparatus 3AZ is fitted into the inner circumferential space inside the opening 13 of the omnidirectional microphone array apparatus 5A so that, for example, the respective tips of the markers 21z and 23 face each other, and thus it is possible to easily match the reference direction g of the horizontal angle of the omnidirectional camera apparatus 3AZ with the reference direction h of the horizontal angle of the omnidirectional microphone array apparatus 5A. Further, in the calibration method of the present embodiment, it is possible to alleviate a restriction when the omnidirectional camera apparatus 3AZ and the omnidirectional microphone array apparatus 5A are installed. As a result, in the sound collection system 1A of the present embodiment, the omnidirectional microphone array apparatus 5A can collect sound in the sound collection direction with high accuracy by using the same horizontal angle as the horizontal angle of the coordinates (horizontal angle, vertical angle) indicating the imaging region of the omnidirectional camera apparatus 3AZ.
[0390] In addition, in the present embodiment, the omnidirectional camera apparatus 3AZ may not be fitted into the inner circumferential space inside the opening 13 of the omnidirectional microphone array apparatus 5A unlike the above-described second embodiment, and, for example, a female screw portion formed in advance on the inside of the opening 13 of the omnidirectional microphone array apparatus 5A is screwed with a male screw portion formed on the outer circumference of the cover 3a of the omnidirectional camera apparatus 3AZ so that the omnidirectional camera apparatus 3AZ and the omnidirectional microphone array apparatus 5A are integrally combined with each other.
Fourth Embodiment
[0391] Next, a calibration method of the fourth embodiment will be described with reference to
[0392] In the sound collection system of each of the above-described second and third embodiments, the omnidirectional microphone array apparatus and the omnidirectional camera apparatus are directly attached to a predetermined installation surface.
[0393] In the sound collection system 1B of the present embodiment, the attachment member 7 (attachment tool) is first attached and fixed to a predetermined installation surface (for example, a ceiling surface 8), and both an omnidirectional microphone array apparatus 5B and an omnidirectional camera apparatus 3BZ are attached to the attachment member 7 (refer to
[0394]
[0395] Engagement pieces 7a which protrude in the i direction on the same axis and are used for attaching and fixing the omnidirectional camera apparatus 3BZ are formed at three concentric locations on the surface of the attachment member 7, that is, the surface of the attachment member 7 facing the ceiling surface 8. Further, engagement pieces 7b which protrude in the i direction on the same axis and are used for attaching and fixing the omnidirectional microphone array apparatus 5B are formed at three concentric locations which have diameters larger than those of the concentric locations where the engagement pieces 7a are formed, on the surface of the attachment member 7.
[0396]
[0397] Each of the engagement pieces 7a is provided with the engagement hole 71 which is engaged with the fixation pin 33z provided on a bottom of the omnidirectional camera apparatus 3BZ and is formed in a gourd shape of which a diameter of one end is larger than a diameter of the other end. Similarly, each of the engagement pieces 7b is provided with the engagement hole 73 which is engaged with the fixation pin 35z provided on a bottom of the omnidirectional microphone array apparatus 5B and is formed in a gourd shape of which a diameter of one end is larger than a diameter of the other end.
[0398] Each of the fixation pins 33z and 35z includes a head having a thickness (diameter) between one end and the other end of each of the engagement holes 71 and 73 and a body thinner than the head.
[0399] Each of fan-shaped hole portions 7c and 7d is formed at three locations so as to expand outside the engagement pieces 7a and the engagement pieces 7b on the surface of the attachment member 7. Shapes and positions of the fan-shapes hole portions 7c and 7d are designed so that the reference directions g and h of the horizontal angles of the omnidirectional camera apparatus 3BZ and the omnidirectional microphone array apparatus 5B match each other when the omnidirectional camera apparatus 3BZ and the omnidirectional microphone array apparatus 5B are attached to the attachment member 7.
[0400] Screw holes 7e into which screws 31z are inserted are formed at three locations at the center of the surface of the attachment member 7. The screws 31z are screwed with the ceiling surface 8 via the screw holes 7e and thus the attachment member 7 is fixed to the ceiling surface 8.
[0401] When the omnidirectional camera apparatus 3BZ and the omnidirectional microphone array apparatus 5B are attached to the attachment member 7, first, the omnidirectional camera apparatus 3BZ is attached to the attachment member 7. In this case, the fixation pin 33z is engaged with the engagement hole 71 formed at the engagement piece 7a (refer to
[0402] In other words, as illustrated in
[0403] After the omnidirectional camera apparatus 3BZ is attached to the attachment member 7, the omnidirectional microphone array apparatus 5B attached to the attachment member 7 from the inside of the opening 13 of the omnidirectional microphone array apparatus 5B so as to expose the omnidirectional camera apparatus 3BZ. In this case, the fixation pin 35z is engaged with the engagement hole 73 formed at the engagement piece 7b. In addition, a procedure of fixing the fixation pin 35z to the engagement hole 73 is the same as the procedure of fixing the fixation pin 33z to the engagement hole 71.
[0404] In the above-described way, in the calibration method of the present embodiment, the omnidirectional camera apparatus 3BZ and the omnidirectional microphone array apparatus 5B are integrally attached to the attachment member 7 which is attached and fixed to the ceiling surface 8. Consequently, in the calibration method of the present embodiment, the omnidirectional camera apparatus 3BZ and the omnidirectional microphone array apparatus 5B can be easily installed in a state in which the reference directions of the horizontal angles of the omnidirectional camera apparatus 3BZ and the omnidirectional microphone array apparatus 5B match each other. Therefore, in the sound collection system 1B of the present embodiment, the omnidirectional microphone array apparatus 5B can collect sound in the sound collection direction with high accuracy by using the same horizontal angle as the horizontal angle of the coordinates (horizontal angle, vertical angle) indicating the imaging region of the omnidirectional camera apparatus 3BZ.
Fifth Embodiment
[0405] Next, a calibration method of the fifth embodiment will be described with reference to
[0406] In the calibration method of the present embodiment, configurations of the omnidirectional camera apparatus 3CZ and the omnidirectional microphone array apparatus 5C in the sound collection system 1C and a procedure of attaching the apparatuses to an installation surface are the same as the configurations and the attachment procedure of the omnidirectional camera apparatus 3AZ and the omnidirectional microphone array apparatus 5A in the third embodiment, and thus description thereof will be omitted.
[0407] The tool 61 indicating the reference direction h of the horizontal angle of the omnidirectional microphone array apparatus 5C is attached to both ends which oppose to each other in a casing 17 of the omnidirectional microphone array apparatus 5C. In the present embodiment, a wire is hung so as to cross over a casing of the omnidirectional camera apparatus 3CZ as the tool 61. Both tip ends of the wire are attached to holes which are formed in advance in the casing 17 of the omnidirectional microphone array apparatus 5C, and thus the tool 61 is attached to the omnidirectional microphone array apparatus 5C.
[0408] In addition, a mark 63 indicating a reference direction (for example, a 0 direction) of a horizontal angle is added to a part of the tool 61. The mark 63 may be colored differently from other parts of the tool 61. A material of the tool 61 is not particularly limited, such as ceramics, synthetic resin (for example, plastic or elastomer), and metal, and a shape thereof is not limited to a spherical shape and may be a columnar shape or other shapes.
[0409] In the present embodiment, as illustrated in
[0410] In addition, in the present embodiment, as illustrated in
[0411] Further, the tool 61 is not limited to a single wire. As long as the mark 63 indicating a reference of a horizontal angle is fixed to a predetermined position and is recognized in a video captured by the camera apparatus, the tool may be made of a sheet metal regardless of a shape thereof, and may have a three-legged shape.
[0412] Further, the tool 61 may be formed in an opaque dome shape covering the entire casing of the omnidirectional camera apparatus 3 and including an opening which is used as a mark in a reference direction of a horizontal angle of the omnidirectional microphone array apparatus 5C.
[0413] A tool image 61A indicating the reference direction his reflected through the center of the captured image 80z in the circular captured image 80z obtained by the omnidirectional camera apparatus 3CZ illustrated in
[0414] In the captured image 80z, the reference direction g of the horizontal angle of the omnidirectional camera apparatus 3CZ shows a vertical direction in
[0415] Therefore, in a case where the omnidirectional camera apparatus 3CZ and the omnidirectional microphone array apparatus 5C are connected to each other via a network (not illustrated), the omnidirectional microphone array apparatus 5C may set the reference direction h of the horizontal angle of the omnidirectional microphone array apparatus 5 to a value which is deviated relative to the reference direction g of the horizontal angle of the omnidirectional camera apparatus 3CZ by the deviation amount . In other words, the omnidirectional microphone array apparatus 5C can appropriately compute coordinates of a sound collection direction by using an angle which is reversely returned by the deviation amount of a horizontal angle as a horizontal angle of coordinates (horizontal angle, vertical angle) indicating an actual sound collection direction.
[0416] In the above-described way, in the calibration method of the present embodiment, the omnidirectional microphone array apparatus 5C can match a horizontal angle indicating an imaging direction of the omnidirectional camera apparatus 3CZ with a horizontal angle indicating a sound collection direction of the omnidirectional microphone array apparatus 5C by using the deviation amount E of the horizontal angle computed by the omnidirectional camera apparatus 3CZ, and can thus adjust coordinates (horizontal angle, vertical angle) of the sound collection direction of the omnidirectional microphone array apparatus 5C.
[0417] Hereinafter, configurations, operations, and effects of the above-described calibration methods will be described.
[0418] According to an embodiment of the present invention, there is provided a calibration method including a step of positioning a camera apparatus which captures a video of a predetermined imaging region and a microphone array apparatus which collects sound of the imaging region on the same axis; a step of attaching the camera apparatus to a circumferential edge of an opening which is formed at a center of the casing of the microphone array apparatus; and a step of matching reference directions of horizontal angles of the camera apparatus and the microphone array apparatus with each other on a plane perpendicular to the same axis by attaching the camera apparatus to inside of the opening.
[0419] In the above-described method, it is possible to easily match the reference direction g of the horizontal angle of the omnidirectional camera apparatus 3z with the reference direction h of the horizontal angle of the omnidirectional microphone array apparatus 5 and therefore to alleviate a restriction when the omnidirectional camera apparatus 3z and the omnidirectional microphone array apparatus 5 are installed. As a result, in the sound collection system 1Z of the present embodiment, the omnidirectional microphone array apparatus 5 can collect sound in the sound collection direction with high accuracy by using the same horizontal angle as the horizontal angle of the coordinates (horizontal angle, vertical angle) indicating the imaging region of the omnidirectional camera apparatus 3z.
[0420] In addition, in the calibration method according to the embodiment of the present invention, in the matching step, the reference direction of the horizontal angle of the microphone array apparatus matches the reference direction of the horizontal angle of the camera apparatus by engaging an engagement member formed on an outer circumference of a casing of the camera apparatus with an engagement groove formed at the circumferential edge of the opening.
[0421] In the above-described method, the omnidirectional camera apparatus 3z is fitted into the inner circumferential space inside the opening 13 of the omnidirectional microphone array apparatus 5 so that, for example, the key 11y is fitted to the key groove 15, and thus it is possible to easily match the reference direction of the horizontal angle of the omnidirectional camera apparatus 3z with the reference direction of the horizontal angle of the omnidirectional microphone array apparatus 5.
[0422] Further, in the calibration method according to the embodiment of the present invention, in the matching step, the reference direction of the horizontal angle of the microphone array apparatus matches the reference direction of the horizontal angle of the camera apparatus by opposing a first marker portion added to the circumferential edge of the opening to a second marker portion added on the casing of the camera apparatus.
[0423] In the above-described method, the omnidirectional camera apparatus 3AZ is fitted into the inner circumferential space inside the opening 13 of the omnidirectional microphone array apparatus 5A so that, for example, the respective tips of the markers 21 and 23 face each other, and thus it is possible to easily match the reference direction g of the horizontal angle of the omnidirectional camera apparatus 3AZ with the reference direction h of the horizontal angle of the omnidirectional microphone array apparatus 5A.
[0424] Further, in the calibration method according to the embodiment of the present invention, in the attachment step, the casing of the camera apparatus is attached to a predetermined attachment tool, and the casing of the microphone array apparatus is attached to the predetermined attachment tool by inserting the casing of the camera apparatus attached to the predetermined attachment tool into the inside of the opening so that the casing of the camera apparatus is fitted into the inside thereof.
[0425] In the above-described calibration method, the omnidirectional camera apparatus 3BZ and the omnidirectional microphone array apparatus 5B are integrally attached to the attachment member 7 which is attached and fixed to the ceiling surface 8, and thus the reference directions of the horizontal angles of the omnidirectional camera apparatus 3BZ and the omnidirectional microphone array apparatus 5B can match each other.
[0426] In addition, according to another embodiment of the present invention, there is provided a calibration method including a step of positioning a camera apparatus which captures a video of a predetermined imaging region and a microphone array apparatus which collects sound of the imaging region on the same axis; a step of attaching the camera apparatus to a circumferential edge of an opening which is formed at a center of the casing of the microphone array apparatus; a step of attaching a tool indicating a reference direction of a horizontal angle of the microphone array apparatus to both opposing ends of a casing of the microphone array apparatus; a step of causing the camera apparatus to capture an image of the tool; a step of computing a deviation amount between a reference direction of a horizontal angle of the camera apparatus and a reference direction of a horizontal angle of the microphone array apparatus; and a step of matching the reference directions of the horizontal angles of the camera apparatus and the microphone array apparatus with each other on a plane perpendicular to the same axis by adjusting a horizontal angle of a sound collection direction of the microphone array apparatus by using the computed deviation amount.
[0427] In the calibration method of the present embodiment, the omnidirectional microphone array apparatus 5C can match a horizontal angle indicating an imaging direction of the omnidirectional camera apparatus 3CZ with a horizontal angle indicating a sound collection direction of the omnidirectional microphone array apparatus 5C by using the deviation amount E of the horizontal angle computed by the omnidirectional camera apparatus 3CZ, and can thus adjust coordinates (horizontal angle, vertical angle) of the sound collection direction of the omnidirectional microphone array apparatus 5C.
[0428] In the above-described respective embodiments, a reference direction of a horizontal angle has been described as a direction in which a horizontal angle is 0, but any angle may be set as a reference direction.
[0429] Further, each microphone disposed inside the casing of the omnidirectional microphone array apparatus in the above-described respective embodiments may employ a nondirectional microphone, a bidirectional microphone, a unidirectional microphone, a sharply directional microphone, a super-directional microphone (for example, a shotgun microphone), or a combination thereof.
[0430] A sixth embodiment described below relates to a directionality control system and a horizontal deviation angle computation method, in which a direction connecting a camera apparatus to a microphone array apparatus is set as each reference direction, a deviation angle between a front direction (0 direction) of the camera apparatus and the reference direction of the camera apparatus and a deviation angle between a front direction (0 direction) of the microphone array apparatus and the reference direction of the microphone array apparatus, and thus calibration of a horizontal angle is performed.
[0431] In the above-described monitoring system, in a case where the camera apparatus and the microphone array apparatus are disposed at different positions as separate apparatuses, an optical axis of the camera apparatus is different from a physical central axis of the microphone array apparatus. For this reason, in a case where the microphone array apparatus collects conversation voice of a subject who is present in an imaging direction of the camera apparatus, it is necessary to adjust, to appropriate values, coordinates (horizontal angle, vertical angle) indicating the direction (hereinafter, simply referred to as a sound collection direction) in which the microphone array apparatus collects the conversation voice and coordinates (horizontal angle, vertical angle) indicating the imaging direction of the camera apparatus.
[0432] In order to perform the adjustment, for example, a front direction (for example, a 0 direction) of a horizontal angle of coordinates (hereinafter, referred to as imaging direction coordinates) indicating the imaging direction of the camera apparatus is required to be a known value in the camera apparatus, and a front direction (for example, a 0 direction) of a horizontal angle of coordinates (hereinafter, referred to as sound collection direction coordinates) indicating a sound collection direction of the microphone array apparatus is required to be a known value in the microphone array apparatus.
[0433] Also in order that a positional relationship between the camera apparatus and the microphone array apparatus are required to be appropriately specified, and a direction of the microphone array apparatus with the camera apparatus as a reference and a direction of the camera apparatus with the microphone array apparatus as a reference are known values, for example, when the microphone array apparatus is installed, a front direction (for example, a 0 direction) of a horizontal angle of the microphone array apparatus is preferably directed toward the camera apparatus.
[0434] However, Patent Literature 1 does not disclose that a direction of the microphone array apparatus with the camera apparatus as a reference and a direction of the camera apparatus with the microphone array apparatus as a reference are computed in a case where a front direction (for example, a 0 direction) of a horizontal angle of the camera apparatus and a front direction (for example, a 0 direction) of a horizontal angle of the microphone array apparatus are unknown. Therefore, it is not clear an angle from which direction each horizontal angle is, and thus a positional relationship between the camera apparatus and the microphone array apparatus cannot be appropriately specified.
[0435] Therefore, for example, since a sound collection direction of the microphone array apparatus is not directed in a direction in which a subject is present while the microphone array apparatus collects voice of the subject, there is a problem in that it is hard for the microphone array apparatus to appropriately collect conversation voice of the subject who is present in an imaging direction of the camera apparatus.
[0436] In addition, in most cases where the camera apparatus is installed later at a location where the microphone array apparatus has been installed, it is hard to install the camera apparatus so that a front direction (for example, a 0 direction) of a horizontal angle of the microphone array apparatus is directed toward the camera apparatus. Further, in a case where the microphone array apparatus is additionally installed later at a location where a plurality of camera apparatuses have been installed, there is a problem in which a front direction (for example, a 0 direction) of a horizontal angle of the microphone array apparatus cannot match directions of the plurality of camera apparatuses.
[0437] Therefore, in the sixth embodiment related to the present invention, in order to solve the above-described problems, a description will be made of examples of a directionality control system and a horizontal deviation angle computation method, in which a horizontal deviation angle indicating an angle between a 0 direction of each horizontal angle of imaging direction coordinates of the camera apparatus and sound collection direction coordinates of the microphone array apparatus and mutual reference directions connecting both the apparatuses to each other is computed, and thus the microphone array apparatus can appropriately collect conversation voice of a subject who is present in an imaging direction of the camera apparatus.
[0438] Hereinafter, with reference to the drawings, a description will be made of the sixth embodiment related to the directionality control system and the horizontal deviation angle computation method according to the present invention. The directionality control system of the present embodiment is used as a monitoring system (including a manned monitoring system and an unmanned monitoring system) provided in, for example, a factory, a public facility (for example, a library or an event hall), or a store (for example, a retail store or a bank).
[0439] In addition, the present invention can be expressed as respective apparatuses (for example, a directionality control apparatus to be described later) constituting the directionality control system, or a horizontal deviation angle computation method including respective operations (steps) performed by each apparatus (for example, a directionality control apparatus to be described later) constituting the directionality control system.
Sixth Embodiment
Configuration of Directionality Control System
[0440]
[0441] The directionality control system 10 illustrated in
[0442] The omnidirectional camera apparatus 11z functions as, for example, a monitoring camera, includes a casing into which an optical system (for example, a fish-eye lens or a wide angle lens) and an imaging system (for example, an image sensor) (not illustrated) are built, and is installed on a predetermined installation surface (for example, a ceiling surface of a room of an event hall or a stand). The omnidirectional camera apparatus 11z is connected to a host computer (not illustrated) of a central control room via a network (not illustrated), and performs a panning direction operation, a tilting direction operation, a zooming operation, an imaging operation, and a distance-measuring operation and an angle-measuring operation related to an actual position corresponding to a designated position (for example, a designated position A to be described later) in a captured image in response to a remote operation from the host computer. The omnidirectional camera apparatus 11z images, for example, a subject which is present in a first imaging direction CAX1 which is directed from the omnidirectional camera apparatus 11z toward a sound collection position (or a sound position) A which will be described later (refer to
[0443] The calibration omnidirectional camera apparatus C1 functions as, for example, a calibration camera which computes front direction (for example, a 0 direction) of each horizontal angle of imaging direction coordinates of the omnidirectional camera apparatus 11z and sound collection direction coordinates of the omnidirectional microphone array apparatus 2, and a horizontal deviation angle relative to mutual reference directions connecting the omnidirectional camera apparatus 11z and the omnidirectional microphone array apparatus 2 to each other. The calibration omnidirectional camera apparatus C1 includes a casing into which an optical system (for example, a fish-eye lens or a wide angle lens) and an imaging system (for example, an image sensor) (not illustrated) are built, and is installed on a predetermined installation surface (for example, a ceiling surface of a room of an event hall or a stand). The calibration omnidirectional camera apparatus C1 is connected to the host computer (not illustrated) of the central control room via a network (not illustrated), and performs a panning direction operation, a tilting direction operation, a zooming operation, an imaging operation, and a distance-measuring operation and an angle-measuring operation related to an actual position corresponding to a designated position (for example, a designated position A to be described later) in a captured image in response to a remote operation from the host computer. The calibration omnidirectional camera apparatus C1 images, for example, a subject which is present in a second imaging direction CAX2 which is directed from the calibration omnidirectional camera apparatus C1 toward the sound collection position A which will be described later (refer to
[0444] The omnidirectional microphone array apparatus 2 of the directionality control system 10 illustrated in
[0445] On the other hand, in the directionality control system 10A illustrated in
[0446] Consequently, in the same manner as in the directionality control system 10 illustrated in
[0447] In addition, in the present embodiment, the omnidirectional camera apparatus 11z including a fish-eye lens or a wide angle lens is used, but a pan-tilt-zoom (PTZ) camera apparatus which includes a standard lens or a telephoto lens and mechanically performs operation in a panning direction and a tilting direction and a zooming operation may be used.
[0448]
[0449] In addition, the directionality control system 10A illustrated in
[0450] The omnidirectional camera apparatus 11z is connected to the network NW, measures and acquires input parameters (for example, a distance L.sub.CK between the omnidirectional camera apparatus 11z and the calibration omnidirectional camera apparatus C1) for computing a sound collection direction (.sub.MAh,.sub.MAv) of the omnidirectional microphone array apparatus 2, which will be described later, and transmits the measured input parameters and captured image data to the directionality control apparatus 3 or the recorder apparatus 4 via the network NW.
[0451] The calibration omnidirectional camera apparatus C1 is connected to the network NW, similarly, measures and acquires input parameters (for example, the distance L.sub.CK between the omnidirectional camera apparatus 11z and the calibration omnidirectional camera apparatus C1) for computing the sound collection direction (.sub.MAh,.sub.MAv) of the omnidirectional microphone array apparatus 2, which will be described later, and transmits the measured input parameters and captured image data to the directionality control apparatus 3 or the recorder apparatus 4 via the network NW.
[0452] The microphone array apparatus 2 are connected to the network NW and includes at least microphone units 22 and 23 in which microphones are provided at equal intervals (refer to
[0453] The omnidirectional microphone array apparatus 2 collects sound in a sound collection direction in which a subject serving as a sound collection target is present by using each of the microphone units 22 and 23, and transmits audio data collected by each of the microphone units 22 and 23 to the directionality control apparatus 3 or the recorder apparatus 4 via the network NW.
[0454] The omnidirectional microphone array apparatus 2 forms sound collection directionality of each of the microphone units 22 and 23 in the sound collection direction (.sub.MAh,.sub.MAv) which is computed by a coordinate computation section 34x of a signal processing unit 33 of the directionality control apparatus 3 in response to a directionality formation instruction from the directionality control apparatus 3 which will be described later.
[0455] Consequently, the omnidirectional microphone array apparatus 2 can relatively increase a volume level of audio data collected from the sound collection direction (.sub.MAh,.sub.MAv) in which the sound collection directionality is formed, and can relatively reduce a volume level of audio data collected from a direction in which the sound collection directionality is not formed. In addition, a method of computing the sound collection direction (.sub.MAh,.sub.MAv) will be described later.
[0456] An exterior of the omnidirectional microphone array apparatus 2 has been described with reference to
[0457] The directionality control apparatus 3 is connected to the network NW, and may be, for example, a stationery personal computer (PC) installed in a monitoring system control room (not illustrated), and may be a mobile phone, a tablet terminal, or a smart phone, which can be carried by a user.
[0458] The directionality control apparatus 3 includes at least a communication unit 31, an operation unit 32, a signal processing unit 33, a display device 36, a speaker device 37, and a memory 38. The signal processing unit 33 includes a horizontal deviation angle computation section 34w, a coordinate computation section 34x, and an output control section 34c.
[0459] The communication unit 31 outputs image data or audio data which is transmitted from the omnidirectional camera apparatus 11z, the calibration omnidirectional camera apparatus C1, or the microphone array apparatus 2, to the signal processing unit 33 via the network NW.
[0460] The operation unit 32 is a user interface (UI) for notifying the signal processing unit 33 of the content of a user's input operation, and is, for example, a pointing device such as a mouse or a keyboard. In addition, the operation unit 32 may be configured by using a touch panel or a touch pad which is disposed so as to correspond to, for example, a screen of the display device 36 and allows an input operation to be performed with the finger FG of the user or a stylus pen.
[0461] The operation unit 32 acquires coordinate data indicating a region where the user desires to increase or decrease a volume level, that is, a designated position A illustrated in
[0462] The signal processing unit 33 is configured by using, for example, a central processing unit (CPU), a micro processing unit (MPU), or a digital signal processor (DSP), and performs a control process for collectively controlling operations of the respective units of the directionality control apparatus 3, data input and output processes with other respective units, a data computation (calculation) process, and a data storage process.
[0463] The horizontal deviation angle computation section 34w as a deviation angle computation part computes a first horizontal deviation angle .sub.Ch of the omnidirectional camera apparatus 11z in a first front direction (horizontal angle 0 direction) and a second horizontal deviation angle .sub.Kh of the calibration omnidirectional camera apparatus C1 in a second front direction (horizontal angle 0 direction) relative to a reference position (a line K-K illustrated in
[0464] The coordinate computation section 34x computes a horizontal angle .sub.MAh and a vertical angle .sub.MAv of the sound collection direction MIX which is directed from the omnidirectional microphone array apparatus 2 toward the sound collection position A based on the first horizontal deviation angle .sub.Ch and the second horizontal deviation angle .sub.Kh computed by the horizontal deviation angle computation section 34w, as coordinates (sound collection direction coordinates) indicating a sound collection direction in which the omnidirectional microphone array apparatus 2 collects voice of a subject.
[0465] In the sound collection direction (.sub.MAh,.sub.MAv), .sub.MAh indicates a horizontal angle of the sound collection direction MIX which is directed from the omnidirectional microphone array apparatus 2 toward the sound collection position A, and .sub.MAv indicates a vertical angle of the sound collection direction MIX which is directed from the omnidirectional microphone array apparatus 2 toward the sound collection position A.
[0466] Since a relationship between coordinate axes of the omnidirectional camera apparatus 11z and coordinate axes of the microphone array apparatus 2 is known based on the first horizontal deviation angle .sub.Ch, the second horizontal deviation angle .sub.Kh, and the distance L.sub.ck therebetween, the directionality control apparatus 3 computes the sound collection direction (.sub.MAh,.sub.MAv) in response to designation of a sound collection position in image data displayed on the display device 36.
[0467] In addition, the sound collection position A is a field position which corresponds to the designated position A which is designated with the finger FG of the user or a stylus pen on a screen of the display device 36 via the operation unit 32, and is an actual monitoring target (refer to
[0468] The output control section 34c controls operations of the display device 36 and the speaker device 37, so as to cause the display device 36 to reproduce and output video data transmitted from the omnidirectional camera apparatus 11z, and to cause the speaker device 37 to output audio data transmitted from the omnidirectional microphone array apparatus 2 as sound. In addition, the output control section 34c controls an operation of the omnidirectional microphone array apparatus 2 so as to cause the omnidirectional microphone array apparatus 2 to form the sound collection directionality of audio data in the sound collection direction MIX corresponding to sound collection direction coordinates (.sub.MAh,.sub.MAv) computed by, for example, the coordinate computation section 34x.
[0469] The display device 36 as a display part displays image data captured by the omnidirectional camera apparatus 11z or the calibration omnidirectional camera apparatus C1 on a screen.
[0470] The speaker device 37 as a sound output part outputs, as sound, audio data collected by the omnidirectional microphone array apparatus 2 or audio data which is collected by the omnidirectional microphone array apparatus 2 after the sound collection directionality is formed in the sound collection direction (.sub.MAh,.sub.MAv) computed by the coordinate computation section 34x. In addition, the display device 36 and the speaker device 37 may be configured separately from the directionality control apparatus 3.
[0471] The memory 38 is configured by using, for example, a random access memory (RAM), and functions as a work memory when the respective units of the directionality control apparatus 3 operate.
[0472] The recorder apparatus 4 records image data captured by the omnidirectional camera apparatus 11z or the calibration omnidirectional camera apparatus C1 and audio data collected by the omnidirectional microphone array apparatus 2. The recorder apparatus 4 records the image data captured by the omnidirectional camera apparatus 11z and the audio data collected by the omnidirectional microphone array apparatus 2 in correlation with each other.
[0473] Next, a summary of an operation of the directionality control system 10 of the present embodiment will be described with reference to
[0474] In the directionality control system 10, the omnidirectional camera apparatus 11z images the subjects (for example, two people) illustrated in
[0475] Here, if the designated position A on the display device 36, that is, a central position or a substantially central position the two people having conversations is designated with the finger FG of the user, the directionality control apparatus 3 acquires coordinate data (.sub.CAh,.sub.CAv) of the first imaging direction CAX1 indicating the designated position A.
[0476] The directionality control apparatus 3 computes sound collection direction coordinates (.sub.MAh,.sub.MAv) indicating a direction which is directed from the installation position of the omnidirectional microphone array apparatus 2 toward the sound collection position A, that is, a sound collection direction, as a sound collection direction of the omnidirectional microphone array apparatus 2 based on a relationship between a coordinate system of the omnidirectional camera apparatus 11z and a coordinate system of the omnidirectional microphone array apparatus 2, which is computed through calibration. The omnidirectional microphone array apparatus 2 forms the sound collection directionality in the direction which is directed from the omnidirectional microphone array apparatus 2 toward the sound collection position A by using the coordinate data (.sub.MAh,.sub.MAv) computed by the directionality control apparatus 3.
[0477] Therefore, the omnidirectional microphone array apparatus 2 can increase a volume level of the conversation (Hello) of the two people present in the direction in which the sound collection directionality is formed more than a volume level of sound output from other objects which are not present in the direction in which the sound collection directionality is formed.
[0478] Consequently, the directionality control apparatus 3 causes the speaker device 37 to output sound with a volume level of the conversation (Hello) of the two people present in the direction in which the sound collection directionality is formed higher than a volume level of sound output from other objects which are not present in the direction in which the sound collection directionality is formed (refer to
[0479] Next, a specific operation procedure of the directionality control system 10 or the directionality control system 10A of the present embodiment will be described with reference to
[0480] In description of the calibration illustrated in
[0481] In
[0482] After the calibration is completed in step ST20, a position where sound is desired to be collected from a video (or an image) captured by the omnidirectional camera apparatus 11z is designated on a screen of the display device 36 (step ST21).
[0483] The coordinate computation section 34x of the directionality control apparatus 3 computes a horizontal angle and a vertical angle (.sub.CAh and .sub.CAv) of the first imaging direction CAX1 designated in step ST21 as coordinates (sound collection direction coordinates, that is, (.sub.MAh,.sub.MAv)) indicating a sound collection direction in which a voice of a subject is collected by the omnidirectional microphone array apparatus 2, by using an input parameter L.sub.CK measured in step ST12 (which will be described later) and the first horizontal deviation angle .sub.Ch and the second horizontal deviation angle .sub.Kh computed in step ST15 (step ST22).
[0484] The output control section 34c of the directionality control apparatus 3 forms the sound collection directionality of each of the microphone units 22 and 23 in the sound collection direction indicated by the sound collection direction coordinates (.sub.MAh,.sub.MAv) computed in step ST22 (step ST23).
[0485] Consequently, the omnidirectional microphone array apparatus 2 can relatively increase a volume level of audio data collected from the sound collection direction defined by the sound collection coordinates (.sub.MAh,.sub.MAv) in which the sound collection directionality is formed, and can relatively reduce a volume level of audio data collected from a direction in which the sound collection directionality is not formed.
[0486] In addition, in the directionality control system 10 of the present embodiment, a timing at which the omnidirectional microphone array apparatus 2 collects sound is not limited to the time right after step ST20, and may be, for example, the time after power is supplied to the omnidirectional microphone array apparatus 2.
[0487] The calibration in step S20 will be described in detail. In
[0488] In step ST11, the calibration omnidirectional camera apparatus C1 and the omnidirectional microphone array apparatus 2 are integrally installed as follows, for example.
[0489] Specifically, first, an attachment tool (not illustrated; for example, an attachment tool made of metal, an attachment tool made of ceramics, or an attachment tool made of a synthetic resin (for example, plastic or elastomer)) is attached and fixed to a predetermined installation surface (for example, a stand).
[0490] After the attachment tool is attached to the predetermined installation surface, both of the calibration omnidirectional camera apparatus C1 and the omnidirectional microphone array apparatus 2 are attached to the attachment tool. As described above, the calibration omnidirectional camera apparatus C1 is fitted into the inner circumferential space inside the opening 21a formed at the center of the casing of the omnidirectional microphone array apparatus 2 so that the omnidirectional microphone array apparatus 2 and the calibration omnidirectional camera apparatus C1 are integrally formed with each other. In addition, the calibration omnidirectional camera apparatus C1 and the microphone array apparatus 2 are integrally formed with each other so that a front direction (0 direction) of each horizontal angle is used in common.
[0491] Therefore, since a horizontal angle and a vertical angle of the second imaging direction CAX2 of the calibration omnidirectional camera apparatus C1 are the same as a horizontal angle and a vertical angle of a sound collection direction of the omnidirectional microphone array apparatus 2, if the second front direction is determined by computing at least the second horizontal deviation angle .sub.Kh, sound collection direction coordinates in the omnidirectional microphone array apparatus 2 can be appropriately computed.
[0492] After the omnidirectional camera apparatus 11z, the calibration omnidirectional camera apparatus C1, and the omnidirectional microphone array apparatus 2 are initially installed, there is the measurement of an input parameter (for example, the distance L.sub.CK) which is required for the horizontal deviation angle computation section 34w to compute the first horizontal deviation angle .sub.Ch and the second horizontal deviation angle .sub.Kh (step ST12). The distance L.sub.CK indicates a distance between the omnidirectional camera apparatus 11z and the calibration omnidirectional camera apparatus C1.
[0493] The process in step ST12 includes a case where the user measures the distance L.sub.CK by using a measuring device (for example, a laser range finder), or a case where the omnidirectional camera apparatus 11z measures and acquires the distance L.sub.CK by using functions of well-known techniques of the omnidirectional camera apparatus 11z. Hereinafter, for simplification of description, it is assumed that the user measures the distance L.sub.CK by using a measuring device (for example, a laser range finder) in step S12. The signal processing unit 33 acquires data regarding the distance L.sub.CK which is an input parameter output from the operation unit 32 in response to a user's input operation.
[0494] After step ST12, the directionality control apparatus 3 receives designation of any designated position A in image data which is being displayed on a screen of the display device 36, via the operation unit 32 (step ST13). The directionality control apparatus 3 transmits a notification indicating that the designation of the designated position A in the image data which is being displayed on the screen of the display device 36 has been received, to the omnidirectional camera apparatus 11z and the calibration omnidirectional camera apparatus C1.
[0495] After step ST13, in a case where the omnidirectional camera apparatus 11z receives the notification indicating that the designation of the designated position A has been received from the directionality control apparatus 3, the omnidirectional camera apparatus 11z measures and acquires coordinate data including a horizontal angle and a vertical angle (.sub.CAh,.sub.CAv) of the first imaging direction CAX1 which is directed toward the sound collection position A corresponding to the designated position A on the screen designated in step ST13, with the installation position of the omnidirectional camera apparatus 11z as a start point (step ST14). However, in the coordinate data including the horizontal angle and the vertical angle (.sub.CAh,.sub.CAv) obtained in step ST14, the horizontal angle .sub.CAh is data in a state in which it is not determined which direction is the first front direction.
[0496] Further, in a case where the calibration omnidirectional camera apparatus C1 receives the notification indicating that the designation of the designated position A has been received from the directionality control apparatus 3, the calibration omnidirectional camera apparatus C1 measures and acquires coordinate data including a horizontal angle and a vertical angle (.sub.KAh,.sub.KAv) of the second imaging direction CAX2 which is directed toward the sound collection position A corresponding to the designated position A on the image data designated in step ST13, with the installation position of the calibration omnidirectional camera apparatus C1 as a start point (step ST14). Similarly, in the coordinate data including the horizontal angle and the vertical angle (.sub.KAh,.sub.KAv) obtained in step ST14, the horizontal angle .sub.KAh is data in a state in which it is not determined which direction is the second front direction.
[0497] The omnidirectional camera apparatus 11z transmits the coordinate data including the horizontal angle and the vertical angle (.sub.CAh,.sub.CAv) of the first imaging direction CAX1 to the directionality control apparatus 3. The calibration omnidirectional camera apparatus C1 transmits the coordinate data including the horizontal angle and the vertical angle (.sub.KAh,.sub.KAv) of the second imaging direction CAX2 to the directionality control apparatus 3.
[0498] The horizontal deviation angle computation section 34w of the directionality control apparatus 3 computes the first horizontal deviation angle .sub.Ch and the second horizontal deviation angle .sub.Kh based on the input parameter L.sub.CK measured in step S12, and the coordinate data including the horizontal angle and the vertical angle (.sub.CAh,.sub.CAv) of the first imaging direction CAX1 and the coordinate data including the horizontal angle and the vertical angle (.sub.KAh,.sub.KAv) of the second imaging direction CAX2 measured in step S14 (step ST15). Details of the operation in step S15 will be described later with reference to
[0499] Consequently, the directionality control apparatus 3 can determine which direction is the first front direction (horizontal angle of 0) of a horizontal angle of the omnidirectional camera apparatus 11z and can further determine which direction is the second front direction (horizontal angle of 0) of a horizontal angle of the calibration omnidirectional camera apparatus C1 before computing sound collection direction coordinates of the omnidirectional microphone array apparatus 2.
[0500] Here, in the initial installation in step ST11, in a case where the omnidirectional microphone array apparatus 2 is installed so that a front direction (for example, a 0 direction) of a horizontal angle of a sound collection direction of the omnidirectional microphone array apparatus 2 matches a front direction (for example, a 0 direction) of a horizontal angle of the calibration omnidirectional camera apparatus C1 (YES in step ST16), the calibration is completed, and the operation procedure proceeds to step ST21.
[0501] On the other hand, in a case where the omnidirectional microphone array apparatus 2 is not installed so that a reference direction (for example, a 0 direction) of a horizontal angle of a sound collection direction of the omnidirectional microphone array apparatus 2 matches a front direction (for example, a 0 direction) of a horizontal angle of the calibration omnidirectional camera apparatus C1 in the initial installation in step ST11 (NO in step ST16), the operation procedure of the directionality control system 10 proceeds to step ST21.
[0502] In other words, in step ST17, the omnidirectional microphone array apparatus 2 is installed so that a front direction (for example, a 0 direction) of a horizontal angle of a sound collection direction of the omnidirectional microphone array apparatus 2 matches a front direction (for example, a 0 direction) of a horizontal angle of the calibration omnidirectional camera apparatus C1 (step ST17).
[0503] In step ST17, the omnidirectional microphone array apparatus 2 is installed as follows, for example, so that a reference direction (for example, a 0 direction) of a horizontal angle of a sound collection direction of the omnidirectional microphone array apparatus 2 matches a reference direction of a horizontal angle of the second imaging direction CAX2 of the calibration omnidirectional camera apparatus C1.
[0504] Specifically, for example, a triangular or triangular marker (not illustrated) is added to an outer circumference of the casing of the calibration omnidirectional camera apparatus C1. The marker is added in a direction indicating the front direction (0 direction) of a horizontal angle of the second imaging direction of the calibration omnidirectional camera apparatus C1. In addition, for example, a marker with the same shape (not illustrated) is added at a position facing the marker of calibration omnidirectional camera apparatus C1 on a circumferential edge of the opening 21a formed at the center of the casing of the omnidirectional microphone array apparatus 2.
[0505] Therefore, if the omnidirectional microphone array apparatus 2 is installed so that the marker of the calibration omnidirectional camera apparatus C1 faces the marker of the omnidirectional microphone array apparatus 2, the omnidirectional microphone array apparatus 2 is installed so that a front direction (for example, a 0 direction) of a horizontal angle of a sound collection direction of the omnidirectional microphone array apparatus 2 matches a front direction of a horizontal angle of the second imaging direction CAX2 of the calibration omnidirectional camera apparatus C1.
Method of Computing Coordinates (.SUB.MAh.,.SUB.MAv.) Indicating Sound Collection Direction of Omnidirectional Microphone Array Apparatus 2
[0506] Next, with reference to
[0507]
[0508]
[0509] The horizontal deviation angle computation section 34w computes the first horizontal deviation angle .sub.Ch and the second horizontal deviation angle .sub.Kh in response to designation of any designated position A from the user in image data displayed on the display device 36 based on:
[0510] (1) the distance L.sub.CK between the omnidirectional camera apparatus 11z and the calibration omnidirectional camera apparatus C1 measured in step ST12;
[0511] (2) the coordinates including the horizontal angle and the vertical angle (.sub.CAh,.sub.CAv) of the first imaging direction CAX1 measured in step ST14;
[0512] (3) the coordinates including the horizontal angle and the vertical angle (.sub.KAh,.sub.KAv) of the second imaging direction CAX2 measured in the initial installation of step ST14;
[0513] (4) the respective heights H.sub.C and H.sub.K of the omnidirectional camera apparatus 11z and the calibration omnidirectional camera apparatus C1 from the horizontal surface measured in the initial installation of step ST11; and
[0514] (5) the height H.sub.A of the sound collection position A from the horizontal surface.
[0515] (4) The respective heights H.sub.C, H.sub.K and H.sub.M of the omnidirectional camera apparatus 11z, the calibration omnidirectional camera apparatus C1, and the omnidirectional microphone array apparatus 2 from the horizontal surface are fixed values defined when the omnidirectional camera apparatus 11z, the calibration omnidirectional camera apparatus C1, and the omnidirectional microphone array apparatus 2 are initially installed. For example, the respective heights H.sub.C, H.sub.K and H.sub.M of the omnidirectional camera apparatus 11z, the calibration omnidirectional camera apparatus C1, and the omnidirectional microphone array apparatus 2 from the horizontal surface are the same as each other.
[0516] (5) The height H.sub.A of the sound collection position A from the horizontal surface is a predefined fixed value, and, for example, in a case where there is a person around the sound collection position A when the designated position A is designated with the finger FG of the user, the height H.sub.A thereof is a selected value or an input value corresponding to a size of the person. Alternatively, when the position A is designated with the finger FG of the user, a default value (for example, 1.5 m or 0.8 m) may be used in a case where the directionality control apparatus 3 determines that there is a person (for example, an adult or a child) at the designated position.
[0517] Hereinafter, a detailed description will be made of a method of computing the first horizontal deviation angle .sub.Ch and the second horizontal deviation angle .sub.Kh in the horizontal deviation angle computation section 34w. First, as the premise for describing the present computation method, the first front direction indicating a front direction (0 direction) of a horizontal angle which is required to define a horizontal angle of the first imaging direction CAX1 of the omnidirectional camera apparatus 11z is not known, that is, unknown (refer to
[0518] The horizontal deviation angle computation section 34w computes a horizontal component distance L.sub.CAh of the distance from the omnidirectional camera apparatus 11z to the sound collection position A according to Equation (64) by using the height H.sub.C of the omnidirectional camera apparatus 11z from the horizontal surface, the height H.sub.A of the sound collection position A from the horizontal surface, and the vertical angle .sub.CAv of the first imaging direction CAX1 in the triangle CAS illustrated in
[0519] The horizontal deviation angle computation section 34w computes a horizontal component distance L.sub.KAh of the distance from the calibration omnidirectional camera apparatus C1 to the sound collection position A according to Equation (65) by using the height H.sub.K of the calibration omnidirectional camera apparatus C1 from the horizontal surface, the height H.sub.A of the sound collection position A from the horizontal surface, and the vertical angle .sub.KAv of the second imaging direction CAX2 in the triangle KAS illustrated in
[0520] The horizontal deviation angle computation section 34w computes a cosine value cos .sub.KCAh of a horizontal angle .sub.KCAh formed between a straight line connecting the omnidirectional camera apparatus 11z and the calibration omnidirectional camera apparatus C1 to each other and a straight line K-K from the omnidirectional camera apparatus 11z to the sound collection position A according to Equation (66) based on the cosine theorem for the triangle KCA illustrated in
[0521] The horizontal deviation angle computation section 34w computes the horizontal angle .sub.KCAh according to Equation (67) by using the computation result of Equation (66).
[Equation 67]
.sub.KCAh=arc cos(L.sub.CK.sup.2+L.sub.CAh.sup.2L.sub.KAh.sup.2/2L.sub.CKL.sub.CAh) (67)
[0522] The horizontal deviation angle computation section 34w computes a cosine value cos .sub.CKAh of a horizontal angle .sub.CKAh formed between the straight line connecting the omnidirectional camera apparatus 11z and the calibration omnidirectional camera apparatus C1 to each other and a straight line K-K from the calibration omnidirectional camera apparatus C1 to the sound collection position A according to Equation (68) based on the cosine theorem for the triangle KCA illustrated in
[0523] The horizontal deviation angle computation section 34w computes the horizontal angle .sub.KCAh according to Equation (69) by using the computation result of Equation (68).
[0524] The horizontal deviation angle computation section 34w computes the first horizontal deviation angle .sub.Ch according to Equation (70) by using the horizontal angle .sub.CAh of the first imaging direction CAX1, and the horizontal angle .sub.KCAh computed according to Equation (67).
[Equation 70]
.sub.Ch=.sub.CAh.sub.KCAh (70)
[0525] The horizontal deviation angle computation section 34w computes the second horizontal deviation angle cm, according to Equation (71) by using the horizontal angle .sub.KAh of the second imaging direction CAX2, and the horizontal angle .sub.CKAh computed according to Equation (68).
[Equation 71]
.sub.Kh=.sub.KAh.sub.CKAh (71)
[0526] In the above-described way, in the directionality control system 10 or the directionality control system 10A of the present embodiment, the omnidirectional camera apparatus 11z and the calibration omnidirectional camera apparatus C1 capture images of the same subject, and the omnidirectional microphone array apparatus 2 collects voice of the subject whose image is captured. If any designated position A is designated in image data which is obtained by the omnidirectional camera apparatus 11z and is displayed on the display device 36, the horizontal deviation angle computation section 34w of the directionality control apparatus 3 computes the first horizontal deviation angle .sub.Ch which is a deviation angle of a front direction of the omnidirectional camera apparatus 11z relative to mutual reference directions connecting the omnidirectional camera apparatus 11z and the calibration omnidirectional camera apparatus C1 to each other, and the second horizontal deviation angle .sub.Kh which is a deviation angle formed between the second front direction of a horizontal angle of the calibration omnidirectional camera apparatus C1 and the reference direction of the calibration omnidirectional camera apparatus C1.
[0527] Consequently, in the directionality control system 10, the directionality control apparatus 3 can compute sound collection direction coordinates of the omnidirectional microphone array apparatus 2 which is attached so as to surround the casing of the calibration omnidirectional camera apparatus C1 or the omnidirectional microphone array apparatus 2 which is attached at a position of the calibration omnidirectional camera apparatus C1 so as to match a reference direction of a horizontal angle of the calibration omnidirectional camera apparatus C1, based on imaging direction coordinates which are directed from the omnidirectional camera apparatus 11z to the sound collection position A.
[0528] In other words, the directionality control apparatus 3 can determine to what extent a deviation of the first front direction of a horizontal angle of the omnidirectional camera apparatus 11z occurs relative to a reference direction of the omnidirectional camera apparatus 11z, and can further determine to what extent a deviation of the second front direction of a horizontal angle of the calibration omnidirectional camera apparatus C1 occurs relative to a reference direction of the calibration omnidirectional camera apparatus C1, before computing sound collection direction coordinates of the omnidirectional microphone array apparatus 2. Therefore, in the directionality control system 10, a horizontal angle and a vertical angle of the second imaging direction CAX2 of the calibration omnidirectional camera apparatus C1 are respectively the same as a horizontal angle and a vertical angle of a sound collection direction of the omnidirectional microphone array apparatus 2. Thus, if the first horizontal deviation angle .sub.Ch and the second horizontal deviation angle .sub.Kh are computed, the sound collection direction coordinates of the omnidirectional microphone array apparatus 2 can be appropriately computed based on the computed distance L.sub.ck.
[0529] In other words, the directionality control apparatus 3 can appropriately compute sound collection direction coordinates which are directed from omnidirectional microphone array apparatus 2 to a certain sound collection position in image data captured by the omnidirectional camera apparatus 11z.
[0530] Hereinafter, configurations, operations, and effects of the above-described directionality control system and horizontal deviation angle calibration method related of the present invention will be described.
[0531] According to an embodiment of the present invention, there is provided a directionality control system including a first imaging part that captures an image of a subject; a second imaging part that captures an image of the subject; a sound collection part that collects voice of the subject; a display part that displays image data captured by the first imaging part; and a deviation amount computation part that computes a first horizontal deviation angle of a horizontal angle of a first imaging direction which is directed from the first imaging part toward a sound collection position corresponding to a designated position in the image data relative to a first reference horizontal angle, and a second horizontal deviation angle of a horizontal angle of a second imaging direction which is directed from the second imaging part toward the sound collection position relative to a second reference direction, in response to designation of any position in the displayed image data.
[0532] In the above-described configuration, the omnidirectional camera apparatus 11z and the calibration omnidirectional camera apparatus C1 capture images of the same subject, and the omnidirectional microphone array apparatus 2 collects voice of the subject whose image is captured. If any designated position A is designated in image data which is obtained by the omnidirectional camera apparatus 11z and is displayed on the display device 36, the horizontal deviation angle computation section 34w of the directionality control apparatus 3 computes the first horizontal deviation angle .sub.Ch which is a deviation angle of the first front direction (for example, a 0 direction) of a horizontal angle of the omnidirectional camera apparatus 11z relative to mutual reference directions connecting the omnidirectional camera apparatus 11z and the calibration omnidirectional camera apparatus C1 to each other, and the second horizontal deviation angle .sub.Kh which is a deviation angle formed between the second front direction of a horizontal angle of the calibration omnidirectional camera apparatus C1 and the reference direction of the calibration omnidirectional camera apparatus C1.
[0533] Consequently, in the directionality control system 10, the directionality control apparatus 3 can compute sound collection direction coordinates of the omnidirectional microphone array apparatus 2 which is attached so as to surround the casing of the calibration omnidirectional camera apparatus C1 or the omnidirectional microphone array apparatus 2 which is attached at a position of the calibration omnidirectional camera apparatus C1 so as to match a reference direction of a horizontal angle of the calibration omnidirectional camera apparatus C1, based on imaging direction coordinates which are directed from the omnidirectional camera apparatus 11z to the sound collection position A, and can thus compute each deviation amount between a front direction (for example, a 0 direction) of a horizontal angle of the sound collection direction coordinates and the mutual reference directions. Therefore, in the directionality control system 10, the directionality control apparatus 3 can appropriately compute sound collection direction coordinates which are directed from omnidirectional microphone array apparatus 2 to a certain sound collection position in image data captured by the omnidirectional camera apparatus 11z.
[0534] In addition, in the directionality control system according to the embodiment of the present invention, the deviation amount computation part computes the first horizontal deviation angle and the second horizontal deviation angle based on a distance from the first imaging part to the second imaging part, a horizontal angle and a vertical angle which are directed from the first imaging part toward the sound collection position, a horizontal angle and a vertical angle which are directed from the second imaging part toward the sound collection position, a height of the first imaging part from a horizontal surface, a height of the second imaging part from the horizontal surface, and a height of the sound collection position from the horizontal surface.
[0535] In the above-described configuration, the horizontal deviation angle computation section 34w of the directionality control apparatus 3 computes the first horizontal deviation angle .sub.Ch and the second horizontal deviation angle .sub.Kh based on the distance L.sub.CK between the omnidirectional camera apparatus 11z and the calibration omnidirectional camera apparatus C1, the coordinates including the horizontal angle .sub.CAh and the vertical angle .sub.CAv which are directed from the omnidirectional camera apparatus 11z toward the sound collection position A, the coordinates including the horizontal angle .sub.KAh and the vertical angle .sub.KAv which are directed from the calibration omnidirectional camera apparatus C1 toward the sound collection position A, the height H.sub.C of the omnidirectional camera apparatus 11z from the horizontal surface, the height H.sub.K of the calibration omnidirectional camera apparatus C1 from the horizontal surface, and the height H.sub.A of the sound collection position A from the horizontal surface. Consequently, the directionality control apparatus 3 can easily compute the first horizontal deviation angle .sub.Ch and the second horizontal deviation angle .sub.Kh.
[0536] Further, in the directionality control system according to the embodiment of the present invention, the directionality control system further includes a coordinate computation part that computes a horizontal angle and a vertical angle of a direction which is directed from the sound collection part to the sound collection position as coordinates indicating a sound collection direction in which the sound collection part collects the voice of the subject based on the first horizontal deviation angle and the second horizontal deviation angle.
[0537] In this above-described configuration, the coordinate computation section 34x of the directionality control apparatus 3 can compute a horizontal angle and a vertical angle of sound collection direction coordinates of the omnidirectional microphone array apparatus 2 which is attached so as to surround the casing of the calibration omnidirectional camera apparatus C1 or the omnidirectional microphone array apparatus 2 which is attached at a position of the calibration omnidirectional camera apparatus C1 so as to match a reference direction of a horizontal angle of the calibration omnidirectional camera apparatus C1, based on the first horizontal deviation angle .sub.Ch and the second horizontal deviation angle .sub.Kh computed by the horizontal deviation angle computation section 34w.
[0538] In addition, in the directionality control system according to the embodiment of the present invention, the directionality control system further includes an output control part that causes the sound collection part to form sound collection directionality of audio data in the sound collection direction corresponding to the computed coordinates indicating the sound collection direction.
[0539] In the above-described configuration, the output control section 34c of the directionality control apparatus 3 causes the omnidirectional microphone array apparatus 2 to form the sound collection directionality of audio data in the sound collection direction MIX corresponding to the sound collection direction coordinates of the omnidirectional microphone array apparatus 2 computed by the coordinate computation section 34x. Consequently, the directionality control apparatus 3 can cause the omnidirectional microphone array apparatus 2 to appropriately collect conversation voice of a subject who is present in an imaging direction of the omnidirectional camera apparatus 11z.
[0540] Further, in the directionality control system according to the embodiment of the present invention, a columnar opening is formed at a center of a casing of the sound collection part, and the sound collection part and the second imaging part are integrally formed with each other as a result of the second imaging part being fitted into an inner circumferential space of the opening.
[0541] In the above-described configuration, the columnar opening is formed at the center of the casing of the omnidirectional microphone array apparatus 2, and the omnidirectional microphone array apparatus 2 and the calibration omnidirectional camera apparatus C1 are integrally formed with each other as a result of the calibration omnidirectional camera apparatus C1 being fitted into the inner circumferential space of the opening 21a. Consequently, the omnidirectional microphone array apparatus 2 can use a horizontal angle and a vertical angle of the second imaging direction CAX2 which is directed from the calibration omnidirectional camera apparatus C1 toward the sound collection position A in common as a horizontal angle and a vertical angle of the sound collection direction MIX of the omnidirectional microphone array apparatus 2.
[0542] Further, according to another embodiment of the present invention, there is provided a horizontal deviation angle computation method for a directionality control system including a first imaging part, a second imaging part, and a sound collection part, the method including a step of causing the first imaging part to capture an image of a subject; a step of causing the second imaging part to capture an image of the subject; a step of causing the sound collection part to collect voice of the subject; a step of displaying image data captured by the first imaging part on a display part; and a step of computing a first horizontal deviation angle of a horizontal angle of a first imaging direction which is directed from the first imaging part toward a sound collection position corresponding to a designated position in the image data relative to a first reference angle, and a second horizontal deviation angle of a horizontal angle of a second imaging direction which is directed from the second imaging part toward the sound collection position relative to a second reference direction, in response to designation of any position in the image data displayed on the display part.
[0543] In the above-described method, the omnidirectional camera apparatus 11z and the calibration omnidirectional camera apparatus C1 capture images of the same subject, and the omnidirectional microphone array apparatus 2 collects voice of the subject whose image is captured. If any designated position A is designated in image data which is obtained by the omnidirectional camera apparatus 11z and is displayed on the display device 36, the horizontal deviation angle computation section 34w of the directionality control apparatus 3 computes the first horizontal deviation angle .sub.Ch which is a deviation angle of the first front direction (for example, a 0 direction) of a horizontal angle of the omnidirectional camera apparatus 11z relative to mutual reference directions connecting the omnidirectional camera apparatus 11z and the calibration omnidirectional camera apparatus C1 to each other, and the second horizontal deviation angle .sub.Kh which is a deviation angle formed between the second front direction of a horizontal angle of the calibration omnidirectional camera apparatus C1 and the reference direction of the calibration omnidirectional camera apparatus C1.
[0544] Consequently, in the directionality control system 10, the directionality control apparatus 3 can compute sound collection direction coordinates of the omnidirectional microphone array apparatus 2 which is attached so as to surround the casing of the calibration omnidirectional camera apparatus C1 or the omnidirectional microphone array apparatus 2 which is attached at a position of the calibration omnidirectional camera apparatus C1 so as to match a reference direction of a horizontal angle of the calibration omnidirectional camera apparatus C1, based on imaging direction coordinates which are directed from the omnidirectional camera apparatus 11z to the sound collection position A, and can thus compute each deviation amount between a front direction (for example, a 0 direction) of a horizontal angle of the sound collection direction coordinates and the mutual reference directions. Therefore, in the directionality control system 10, the directionality control apparatus 3 can appropriately compute sound collection direction coordinates which are directed from omnidirectional microphone array apparatus 2 to a certain sound collection position in image data captured by the omnidirectional camera apparatus 11z.
[0545] Each of seventh to tenth embodiments described below relates to a directionality control system and a directionality control method, in which a height of a target object present in a sound collection space as a sound source from a reference surface is determined, and sound collection directionality of sound collected by a microphone array apparatus is formed by using the height of the target object from the reference surface.
[0546] Patent Literature 1 is based on, for example, a camera apparatus, a microphone array apparatus, and a target object (for example, a person) being present on the same plane assuming usage forms in a television conference system. However, in the above-described monitoring system, the camera apparatus, the microphone array apparatus, and the target object (for example, a person) are all seldom present on the same plane in practice.
[0547] For example, as illustrated in
[0548] Therefore, in the monitoring system illustrated in
[0549] In addition, in the monitoring system illustrated in
[0550] For example, in
[0551] Therefore, in each of the seventh to tenth embodiments of the present invention, in order to solve the above-described problems, a description will be made of examples of a directionality control system and a directionality control method, in which a height of a target sound source position present in a sound collection space from a reference surface is determined, and sound collection directionality is formed in a sound collection directional direction which is directed from a microphone array apparatus toward the target sound source position based on the height of the target sound source position from the reference surface.
[0552] Hereinafter, with reference to the drawings, description will be made of each of the seventh to tenth embodiments of a directionality control system and a directionality control method related to the present invention. The directionality control system of each embodiment is used as a monitoring system (including a manned monitoring system and an unmanned monitoring system) provided in, for example, a factory, a company, a public facility (for example, an event hall), or a store (for example, a retail store), but an installation location is not particularly limited. In the following respective embodiments, description will be made assuming that the directionality control system of each embodiment is installed in, for example, a store.
[0553] In addition, the present invention can be expressed as respective apparatuses (for example, a directionality control apparatus to be described later) constituting the directionality control system, or a directionality control method including respective operations (steps) performed by each apparatus constituting the directionality control system.
Seventh Embodiment
Configuration of Directionality Control System
[0554]
[0555] The camera apparatuses 11 to 1n as at least one imaging part includes a casing into which an optical system (for example, a wide angle lens) and an imaging system (for example, an image sensor) (not illustrated) are built, and is fixed to and installed on, for example, a ceiling surface of the store or a stand (refer to
[0556] In addition, the camera apparatuses 11 to 1n capture a video (including a still image and a moving image; this is also the same for the following description) of a target object which is present in a predefined angle of view centering on an optical axis. The camera apparatuses 11 to 1n transmit the captured video data, and input parameters for computing a horizontal angle .sub.MAh and a vertical angle .sub.MAv of a sound collection directional direction which will be described later to the directionality control apparatus 3 or the recorder apparatus 4 via the network NW.
[0557] The omnidirectional microphone array apparatus 2 as a sound collection part includes, for example, a doughnut-shaped or ring-shaped (annular) casing 21C (refer to
[0558] In the omnidirectional microphone array apparatus 2, a plurality of microphone units 22 are disposed in a concentric shape around the opening 21a in a circumferential direction of the casing 21C. The microphone unit 22 employs, for example, a high-quality small-sized electret condenser microphone (ECM), and this is also the same for the following respective embodiments.
[0559] The omnidirectional microphone array apparatus 2 is connected to the network NW and includes at least microphone units 22 and 23 in which microphones are provided at equal intervals (refer to
[0560] The omnidirectional microphone array apparatus 2 collects sound in a sound collection directional direction in which a target object (sound source) as a sound collection target is present by using each of the microphone units 22 and 23, performs predetermined sound processing on audio data collected by each of the microphone units 22 and 23, and transmits a processed result to the directionality control apparatus 3 or the recorder apparatus 4 via the network NW.
[0561] The omnidirectional microphone array apparatus 2 forms sound collection directionality of each of the microphone units 22 and 23 in sound collection directional coordinates (.sub.MAh,.sub.MAv) which are computed by a sound collection directional direction computation section 34b of a signal processing unit 33 of the directionality control apparatus 3 in response to a directionality formation instruction from the directionality control apparatus 3 which will be described later.
[0562] Consequently, the omnidirectional microphone array apparatus 2 can relatively increase a volume level of sound collected from the sound collection directional coordinates (.sub.MAh,.sub.MAv) in which the sound collection directionality is formed, and can relatively reduce a volume level of sound collected from a direction in which the sound collection directionality is not formed. In addition, a method of computing the sound collection directional coordinates (.sub.MAh,.sub.MAv) will be described later.
[0563] The directionality control apparatus 3 is connected to the network NW, and may be, for example, a stationery personal computer (PC) installed in a central control room (not illustrated) of a company, and may be a mobile phone, a tablet terminal, or a smart phone, which can be carried by a user.
[0564] The directionality control apparatus 3 includes at least a communication unit 31, an operation unit 32, a signal processing unit 33, a display device 36, a speaker device 37, and a memory 38. The signal processing unit 33 includes at least a sound source height determination section 34a, a sound collection directional direction computation section 34b, and an output control section 34c.
[0565] The communication unit 31 outputs video data or audio data which is transmitted from the camera apparatuses 11 to 1n or the omnidirectional microphone array apparatus 2, to the signal processing unit 33 via the network NW.
[0566] The operation unit 32 is a user interface (UI) for notifying the signal processing unit 33 of the content of a user's input operation, and is, for example, a pointing device such as a mouse or a keyboard. In addition, the operation unit 32 may be configured by using a touch panel or a touch pad which is disposed so as to correspond to, for example, a screen of the display device 36 and allows an input operation to be performed with the finger FG of the user or a stylus pen.
[0567] The operation unit 32 acquires coordinate data indicating a location where the user desires to increase or decrease a volume level, that is, a designated position A illustrated in
[0568] The signal processing unit 33 is configured by using, for example, a central processing unit (CPU), a micro processing unit (MPU), or a digital signal processor (DSP), and performs a control process for collectively controlling operations of the respective units of the directionality control apparatus 3, data input and output processes with other respective units, a data computation (calculation) process, and a data storage process.
[0569] The sound source height determination section 34a as a height determination part determines a height H.sub.A of the target sound source position A from a floor surface BL, corresponding to the designated position A if the designated position A is designated with the finger FG of the user or a stylus pen in video data captured by, for example, the camera apparatus 11 as video data displayed on the display device 36. In the following respective embodiments, as long as there is no particular description, for example, a reference surface is the floor surface BL in the store.
[0570] Specifically, if the designated position A in the video data displayed on the display device 36 is designated with the finger FG of the user, the sound source height determination section 34a reads data regarding the height of the target sound source position A from the floor surface BL, corresponding to coordinate data regarding the designated position A, from a configuration file CF1. The sound source height determination section 34a determines the read data regarding the height of the target sound source position A from the floor surface BL, corresponding to the coordinate data regarding the designated position A as a height H.sub.A of the target sound source position A corresponding to the designated position A from the floor surface BL.
[0571] The sound collection directional direction computation section 34b computes coordinates (.sub.MAh,.sub.MAv) (hereinafter, simply referred to as sound collection directional coordinates) indicating a sound collection directional direction which is directed from the installation position M of the omnidirectional microphone array apparatus 2 toward the target sound source position A corresponding to the designated position A in response to the designation of the designated position A in the video data displayed on the display device 36.
[0572] In the sound collection directional coordinates (.sub.MAh,.sub.MAv), .sub.MAh indicates a horizontal angle of the sound collection directional direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A, and .sub.MAv indicates a vertical angle of the sound collection directional direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A. In addition, the target sound source position A is a field position which corresponds to the designated position A which is designated with the finger FG of the user or a stylus pen in the video data displayed on the display device 36 via the operation unit 32, and is an actual monitoring target.
[0573] The output control section 34c as a control part controls operations of the camera apparatuses 11 to 1n, the omnidirectional microphone array apparatus 2, the display device 36, and the speaker device 37, so as to reproduce and output video data transmitted from the camera apparatuses 11 to 1n on the display device 36 and to output audio data transmitted from the omnidirectional microphone array apparatus 2 from the speaker device 37 as sound. The output control section 34c causes the omnidirectional microphone array apparatus 2 to form sound collection directionality of audio data in a sound collection directional direction corresponding to sound collection direction coordinates (.sub.MAh,.sub.MAv) computed by the sound collection directional direction computation section 34b.
[0574] The display device 36 as a display part displays video data captured by the camera apparatuses 11 to 1n on a screen thereof.
[0575] The speaker device 37 as a sound output part outputs, as sound, audio data collected by the omnidirectional microphone array apparatus 2 or audio data which is collected by the omnidirectional microphone array apparatus 2 after the sound collection directionality is formed in the sound collection directional coordinates (.sub.MAh,.sub.MAv) computed by the sound collection directional direction computation section 34b. In addition, the display device 36 and the speaker device 27 may be configured separately from the directionality control apparatus 3.
[0576] The memory 38 as a storage part is configured by using, for example, a random access memory (RAM), and functions as a program memory, a data memory, and a work memory when the respective units of the directionality control apparatus 3 operate. In addition, the memory 38 stores the configuration file CF1 illustrated in
[0577] The recorder apparatus 4 records the video data captured by the camera apparatuses 11 to 1n and the audio data collected by the omnidirectional microphone array apparatus 2 in correlation with each other.
[0578] Next, a summary of an operation of the directionality control system 10 of the present embodiment will be described with reference to
[0579] ) output from the speaker device SP.
[0580] In the directionality control system 10 illustrated in
[0581] In
[0582] Next, an operation procedure of initial setting performed in the directionality control system 10 of the present embodiment will be described with reference to
[0583] In
[0584] After the camera apparatus 11 and the omnidirectional microphone array apparatus 2 are initially installed, the sound collection directional direction computation section 34b measures each input parameter which is required to compute sound collection directional coordinates (.sub.MAh,.sub.MAv) (step ST2). The process in step ST2 includes a case where the user measures the input parameter by using a measuring device (for example, a laser range finder), and a case where the camera apparatus 11 measures and acquires the input parameter by using functions of well-known techniques of the camera apparatus 11. Each input parameter in step ST2 differs in each method of computing sound collection directional coordinates, and thus detailed content thereof will be described with reference to
[0585] After step ST2, each input parameter measured in step ST2 is input to the signal processing unit 33 of the directionality control apparatus 3 from the camera apparatus 11 or is input to the signal processing unit 33 from the operation unit 32 (step ST3). For example, the camera apparatus 11 transmits an input parameter acquired by using functions of well-known techniques of the camera apparatus 11, to the communication unit 31 of the directionality control apparatus 3. The communication unit 31 outputs the input parameter transmitted by the camera apparatus 11, to the signal processing unit 33. In addition, the operation unit 32 outputs data regarding the height H.sub.A of the target sound source position A from the floor surface BL as an example of the input parameter to the signal processing unit 33 in response to a user's input operation.
[0586] The signal processing unit 33 generates the configuration file CF1 including the respective input parameters acquired in step ST3 and preserves the configuration file CF1 in the memory 38 (step ST4). In the above-described way, the operation of the initial setting in the directionality control system 10 is finished.
[0587] Next, an operation procedure following the initial setting in the directionality control system 10 of the present embodiment will be described with reference to
[0588] In
[0589] After step ST11, in a case where the camera apparatus 11 receives the notification indicating that the designation of the designated position A has been received from the directionality control apparatus 3, the camera apparatus 11 acquires coordinate data including a horizontal angle and a vertical angle (.sub.CAh,.sub.CAv) to the target sound source position A corresponding to the position A designated in step ST11, with an installation position C of the camera apparatus 11 as a start point (step ST12).
[0590] The camera apparatus 11 transmits the coordinate data including the horizontal angle and the vertical angle (.sub.CAh,.sub.CAv) to the target sound source position A corresponding to the position A designated in step ST11 with the installation position C of the camera apparatus 11 as a start point, to the directionality control apparatus 3.
[0591] The sound source height determination section 34a of the signal processing unit 33 of the directionality control apparatus 3 reads data regarding the height of the target sound source position A from the floor surface BL, corresponding to the coordinate data regarding the designated position A, from the configuration file CF1 stored in the memory 38 in step ST4. The sound source height determination section 34a determines the read data regarding the height of the target sound source position A from the floor surface BL, corresponding to the coordinate data regarding the designated position A as a height H.sub.A of the target sound source position A corresponding to the designated position A from the floor surface BL.
[0592] Further, the sound collection directional direction computation section 34b of the signal processing unit 33 of the directionality control apparatus 3 computes the sound collection directional coordinates (.sub.MAh,.sub.MAv) which are directed from the installation position M of the omnidirectional microphone array apparatus 2 toward the target sound source position A by using the coordinate data including the horizontal angle and the vertical angle (.sub.CAh,.sub.CAv) from the camera apparatus 11 to the target sound source position A, and the respective input parameters (including the height H.sub.A of the target sound source position A corresponding to the designated position A from the floor surface BL) read from the configuration file CF1 (step ST13). A process of computing the sound collection directional coordinates (.sub.MAh,.sub.MAv) will be described in detail with reference to
[0593] The directionality control apparatus 3 transmits a directionality formation instruction including the sound collection directional coordinates (.sub.MAh,.sub.MAv) computed in step ST13 to the omnidirectional microphone array apparatus 2. The omnidirectional microphone array apparatus 2 forms the sound collection directionality of each of the microphones 22 and 23 in a sound collection directional direction indicated by the sound collection directional coordinates (.sub.MAh,.sub.MAv) computed by the directionality control apparatus 3 in response to the directionality formation instruction from the directionality control apparatus 3 (step ST14).
[0594] Consequently, the microphone array apparatus 2 can increase a volume level of audio data which is collected from the sound collection directional direction indicated by the sound collection directional coordinates (.sub.MAh,.sub.MAv) in which the sound collection directionality is formed, and can reduce a volume level of audio data which is collected from a direction in which the sound collection directionality is not formed. In the above-described way, the operation following the initial setting in the directionality control system 10 is finished.
[0595] In addition, in the directionality control system 10 of the present embodiment, a timing at which the omnidirectional microphone array apparatus 2 collects sound is not limited to the time right after step ST14, and may be, for example, the time after power is supplied to the omnidirectional microphone array apparatus 2.
Method of Computing Coordinates (.SUB.MAh.,.SUB.MAv.) Indicating Sound Collection Directional Direction of Omnidirectional Microphone Array Apparatus 2
[0596] Here, with reference to
[0597]
[0598]
[0599] The sound collection directional direction computation section 34b computes the sound collection directional coordinates (.sub.MAh,.sub.MAv) of the omnidirectional microphone array apparatus 2 based on:
[0600] (1) a distance L.sub.CM between the camera apparatus 11 and the omnidirectional microphone array apparatus 2;
[0601] (2) a horizontal angle .sub.CAh and a vertical angle .sub.CAv from the camera apparatus 11 to the target sound source position A;
[0602] (3) respective heights H.sub.C and H.sub.M (H.sub.C=H.sub.M) of the camera apparatus 11 and the omnidirectional microphone array apparatus 2 from the floor surface BL; and
[0603] (4) a height H.sub.A of the target sound source position A from the floor surface BL.
[0604] In the present computation method, the input parameters in step ST2 illustrated in
[0605] (1) the distance L.sub.CM between the camera apparatus 11 and the omnidirectional microphone array apparatus 2;
[0606] (3) the respective heights H.sub.C and H.sub.M (H.sub.C=H.sub.M) of the camera apparatus 11 and the omnidirectional microphone array apparatus 2 from the floor surface BL.
[0607] In addition, among the respective input parameters in the present computation method,
[0608] (1) the distance L.sub.CM between the camera apparatus 11 and the omnidirectional microphone array apparatus 2 is a fixed value measured, for example, in the initial setting in step ST1 illustrated in
[0609] (3) The respective heights H.sub.C and H.sub.M (H.sub.C=H.sub.M) of the camera apparatus 11 and the omnidirectional microphone array apparatus 2 from the floor surface BL are fixed values measured, for example, in the initial setting in step ST1 illustrated in
[0610] In addition, among the respective input parameters in the present computation method,
[0611] (2) the horizontal angle .sub.CAh and the vertical angle .sub.CAv from the camera apparatus 11 to the target sound source position A are acquired by using a function of a well-known technique of the camera apparatus 11 in step ST12 illustrated in
[0612] Further, in the present computation method,
[0613] (4) the height H.sub.A of the target sound source position A from the floor surface BL is a fixed value determined by the sound source height determination section 34a of the signal processing unit 33 of the directionality control apparatus 3, that is, a fixed value (predetermined value) written in the configuration file CF1 in step ST4 illustrated in
[0614] Hereinafter, a detailed description will be made of a method of computing the sound collection directional coordinates (.sub.MAh,.sub.MAv) of the omnidirectional microphone array apparatus 2 in the sound collection directional direction computation section 34b.
[0615] The sound collection directional direction computation section 34b computes a horizontal component distance L.sub.CAh of the distance L.sub.CA from the camera apparatus 11 to the target sound source position A according to Equation (58) by using the respective heights H.sub.C and H.sub.M of the camera apparatus 11 and the omnidirectional microphone array apparatus 2 from the floor surface BL, and the vertical angle .sub.CAv from the camera apparatus 11 to the target sound source position A, in the triangle CAP illustrated in
[0616] The sound collection directional direction computation section 34b computes a horizontal component distance L.sub.MAh of the distance from the omnidirectional microphone array apparatus 2 to the target sound source position A according to Equation (72) based on the cosine theorem for the triangle CAM illustrated in
[Equation 72]
L.sub.MAh={square root over ((L.sub.CAh.sup.2+L.sub.CM.sup.22L.sub.CAhL.sub.CMcos .sub.CAh)}(72)
[0617] The sound collection directional direction computation section 34b computes a cosine value cos .sub.MAh of the horizontal angle .sub.MAh of the depression angle .sub.MA from the omnidirectional microphone array apparatus 2 to the target sound source position A according to Equation (73) based on the cosine theorem for the triangle CAM illustrated in
[0618] Consequently, the sound collection directional direction computation section 34b can compute the horizontal angle .sub.MAh of the depression angle .sub.MA from the omnidirectional microphone array apparatus 2 to the target sound source position A according to Equation (74).
[0619] In addition, the sound collection directional direction computation section 34b computes a tangent value tan .sub.MAv of the vertical angle .sub.MAv of the depression angle .sub.MA which is directed from from the omnidirectional microphone array apparatus 2 toward the target sound source position A according to Equation (75) based on a tangent for the triangle MAS illustrated in
[0620] Consequently, the sound collection directional direction computation section 34b can compute the vertical angle .sub.MAv of the the depression angle .sub.MA from the omnidirectional microphone array apparatus 2 to the target sound source position A according to Equation (76).
[0621] In the above-described way, in the directionality control system 10 of the present embodiment, the sound source height determination section 34a of the directionality control apparatus 3 determines the height H.sub.A of the target sound source position A corresponding to the designated position A in designated video data from the floor surface BL in response to the designation of the designated position A performed by the user on a display screen of the video data displayed on the display device 36. The sound collection directional direction computation section 34b of the directionality control apparatus 3 computes sound collection directional coordinates indicating a sound collection directional direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A corresponding to the designated position A based on the height H.sub.A of the target sound source position A from the floor surface BL. The output control section 34c of the directionality control apparatus 3 causes the omnidirectional microphone array apparatus 2 form the sound collection directionality in the sound collection directional direction indicated by the computed sound collection directional coordinates.
[0622] Consequently, in the directionality control system 10, the directionality control apparatus 3 can uniquely determine the height H.sub.A of the target sound source position A of a target object from the floor surface BL, present in an imaging direction in which the camera apparatus 11 performs imaging in the sound collection space K, and can thus accurately compute a sound collection directional direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A based on the height H.sub.A of the target sound source position A from the floor surface BL. In addition, the directionality control apparatus 3 can cause the omnidirectional microphone array apparatus 2 to form the sound collection directionality in the computed sound collection directional direction.
[0623] Further, in the directionality control system 10 of the present embodiment, the memory 38 of the directionality control apparatus 3 stores the configuration file CF1 including, for example, the first configuration data in which the target sound source position A corresponding to the designated position A in video data designated in response to a user's input operation is correlated with a height (fixed value) of the target sound source position A from the floor surface BL in advance.
[0624] Therefore, the sound source height determination section 34a can easily determine, for example, the height H.sub.A of the target sound source position A from the floor surface BL, corresponding to the designated position A which is designated in response to a user's input operation based on the first configuration data of the configuration file CF1.
Eighth Embodiment
[0625] In the eighth embodiment, a directionality control apparatus 3A displays options of a height of the target sound source position A from the floor surface BL on the display device 36, and, for example, prompts the user to select any one of the options. The directionality control apparatus 3A determines a height of the target sound source position A from the floor surface BL in response to the selection of any one of the options.
[0626]
[0627] The directionality control apparatus 3A includes a communication unit 31, an operation unit 32, a signal processing unit 33A, a display device 36, a speaker device 37, and a memory 38A. The signal processing unit 33A includes at least a sound source height determination section 34aA, a sound collection directional direction computation section 34b, and an output control section 34cA. Each unit which constitutes the directionality control system 10A illustrated in
[0628] In the present embodiment, the memory 38A stores a configuration file CF2 illustrated in
[0629] For example, as illustrated in
[0630] The sound source height determination section 34aA of the signal processing unit 33A determines a target sound source height (for example, the target sound source height H.sub.A-A) corresponding to the option selected on the selection screen WD2 with the finger FG of the user as the height H.sub.A of the target sound source position A from the floor surface BL, corresponding to the designated position A which is designated with the finger FG of the user on the display screen WD1 of the video data illustrated in
[0631] Next, an operation procedure of initial setting performed in the directionality control system 10A of the present embodiment will be described with reference to
[0632] In
[0633] The signal processing unit 33A generates the configuration file CF2 including the respective input parameters acquired in step ST3 and preserves the configuration file CF2 in the memory 38A (step ST4). In the above-described way, the operation of the initial setting in the directionality control system 10A is finished.
[0634] Next, an operation procedure following the initial setting in the directionality control system 10A of the present embodiment will be described with reference to
[0635] In
[0636] Here, it is assumed that any one of a plurality of options for selecting the height H.sub.A of the target sound source position A from the floor surface BL is selected with the finger FG of the user on the selection screen WD2 (step ST15). The sound source height determination section 34aA determines a target sound source height (for example, the target sound source height H.sub.A-A) corresponding to the option selected on the selection screen WD2 with the finger FG of the user as the height H.sub.A of the target sound source position A from the floor surface BL, corresponding to the designated position A which is designated with the finger FG of the user on the display screen WD1 of the video data illustrated in
[0637] The sound collection directional direction computation section 34b of the signal processing unit 33A of the directionality control apparatus 3A computes the sound collection directional coordinates (.sub.MAh,.sub.MAv) which are directed from the installation position M of the omnidirectional microphone array apparatus 2 toward the target sound source position A by using the coordinate data including the horizontal angle and the vertical angle (.sub.CAh,.sub.CAv) from the camera apparatus 11 to the target sound source position A, and the height H.sub.A of the target sound source position A from the floor surface BL determined according to the option selected in step ST15 (step ST13A). The computation content in step ST13A and the operation content in step ST14 are the same as the computation content and the operation content in the seventh embodiment, and thus description thereof will be omitted. In the above-described way, the operation following the initial setting in the directionality control system 10A is finished.
[0638] As mentioned above, in the directionality control system 10A of the present embodiment, the memory 38A of the directionality control apparatus 3A stores the configuration file CF2 including height data (second configuration data) regarding a plurality of types of heights H.sub.A-A, H.sub.A-B, H.sub.A-D, . . . , as heights of the target sound source position A from the floor surface BL. The output control section 34cA of the directionality control apparatus 3A displays options of the plurality of types of height data of the second configuration data included in the configuration file CF2 as heights of the target sound source position A from the floor surface BL on the display device 36.
[0639] Therefore, the sound source height determination section 34aA of the directionality control apparatus 3A can easily determine the height H.sub.A of the target sound source position A from the floor surface BL, corresponding to the designated position A which is designated, for example, in response to a user's input operation, in accordance with selection of any one of displayed options of a plurality of types of height data.
[0640] In addition, also in the directionality control system 10A of the present embodiment, in the same manner as in the directionality control system 10 of the seventh embodiment, the directionality control apparatus 3A can uniquely determine the height H.sub.A of the target sound source position A of a target object from the floor surface BL, present in an imaging direction in which the camera apparatus 11 performs imaging in the sound collection space K, and can thus accurately compute a sound collection directional direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A based on the height H.sub.A of the target sound source position A from the floor surface BL. In addition, the directionality control apparatus 3 can cause the omnidirectional microphone array apparatus 2 to form the sound collection directionality in the computed sound collection directional direction.
Ninth Embodiment
[0641] In the ninth embodiment, a directionality control apparatus 3B displays an entry form screen WD3 which allows the user to input a height of the target sound source position A from the floor surface BL on the display device 36 by himself or herself, and, for example, prompts the user to input a height. The directionality control apparatus 3B determines data which is input to the entry form screen WD3 as the height H.sub.A of the target sound source position A from the floor surface BL.
[0642]
[0643] The directionality control apparatus 3B includes a communication unit 31, an operation unit 32, a signal processing unit 33B, a display device 36, a speaker device 37, and a memory 38B. The signal processing unit 33B includes at least a sound source height determination section 34aB, a sound collection directional direction computation section 34b, and an output control section 34cB. Each unit which constitutes the directionality control system 10B illustrated in
[0644] In the present embodiment, the memory 38B stores a configuration file CF3 illustrated in
[0645] The output control section 34cB of the signal processing unit 33B displays the entry form screen WD3 which allows the user by himself or herself to input a height of the target sound source position A from the floor surface BL, corresponding to the designated position A which is designated with the finger FG of the user, on the display device 36.
[0646] The sound source height determination section 34aB of the signal processing unit 33B determines a numerical value of the height of the target sound source position A from the floor surface BL, input on the entry form screen WD3, as the height H.sub.A of the target sound source position A from the floor surface BL, desired by the user.
[0647] Next, an operation procedure of initial setting performed in the directionality control system 10B of the present embodiment will be described with reference to
[0648] In
[0649] The signal processing unit 33B generates the configuration file CF3 including the respective input parameters acquired in step ST3B and preserves the configuration file CF3 in the memory 38A (step ST4). In the above-described way, the operation of the initial setting in the directionality control system 10B is finished.
[0650] Next, an operation procedure following the initial setting in the directionality control system 10B of the present embodiment will be described with reference to
[0651] In
[0652] Here, it is assumed that a numerical value corresponding to a height of the target sound source position A from the floor surface BL is input by the user through the user's input operation (for example, inputting using the finger FG or inputting using a keyboard (not illustrated)) on the entry form screen WD3 (step ST16). The sound source height determination section 34aB determines the numerical value corresponding to the height of the target sound source position A from the floor surface BL, input on the entry form screen WD3, as the height H.sub.A of the target sound source position A from the floor surface BL.
[0653] The sound collection directional direction computation section 34b of the signal processing unit 33B of the directionality control apparatus 3B computes the sound collection directional coordinates (.sub.MAh,.sub.MAv) which are directed from the installation position M of the omnidirectional microphone array apparatus 2 toward the target sound source position A by using the coordinate data including the horizontal angle and the vertical angle (.sub.CAh,.sub.CAv) from the camera apparatus 11 to the target sound source position A, and the height H.sub.A of the target sound source position A from the floor surface BL determined according to the input value on the entry form screen WD3 in step ST16 (step ST13B). The computation content in step ST13B and the operation content in step ST14 are the same as the computation content and the operation content in the seventh embodiment, and thus description thereof will be omitted. In the above-described way, the operation following the initial setting in the directionality control system 10B is finished.
[0654] As mentioned above, in the directionality control system 10B of the present embodiment, the output control section 34cB of the directionality control apparatus 3B displays the entry form screen WD3 which allows the user to input a height of the target sound source position A from the floor surface BL by himself or herself, on the display device 36.
[0655] Therefore, the sound source height determination section 34aB of the directionality control apparatus 3B can accurately determine a height of the target sound source position A from the floor surface BL, desired by the user, in response to inputting of a height of the target sound source position A from the floor surface BL on the displayed entry form screen WD3. Consequently, the sound collection directional direction computation section 34b of the directionality control apparatus 3B can accurately compute sound collection directional coordinates of a sound collection directional direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A corresponding to the designated position A.
[0656] In addition, also in the directionality control system 10B of the present embodiment, in the same manner as in the directionality control system 10 of the seventh embodiment, the directionality control apparatus 3B can uniquely determine the height H.sub.A of the target sound source position A of a target object from the floor surface BL, present in an imaging direction in which the camera apparatus 11 performs imaging in the sound collection space K, and can thus accurately compute a sound collection directional direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position Abased on the height H.sub.A of the target sound source position A from the floor surface BL. In addition, the directionality control apparatus 3B can cause the omnidirectional microphone array apparatus 2 to form the sound collection directionality in the computed sound collection directional direction.
Tenth Embodiment
[0657] In the tenth embodiment, if a first designated position A1 around a target object (for example, a person), and a second designated position A2 on the floor surface BL located in a vertically lower direction of (directly under) the second designated position A1 are designated on a display screen WD4 of video data displayed on the display device 36, for example, with the finger FG of the user, a directionality control apparatus 3C computes a height of a target sound source position A1 from the floor surface BL, corresponding to the first designated position A1. The directionality control apparatus 3C determines the computed height as a height H.sub.A1 of the target sound source position A1 from the floor surface BL.
[0658]
[0659] The directionality control apparatus 3C includes a communication unit 31, an operation unit 32, a signal processing unit 33C, a display device 36, a speaker device 37, and a memory 38B. The signal processing unit 33C includes at least a sound source height determination section 34aC, a sound collection directional direction computation section 34b, and an output control section 34c. Each unit which constitutes the directionality control system 10C illustrated in
[0660] If the first designated position A1 and the second designated position A2 on the floor surface BL corresponding to the first designated position A1 are designated on the display screen WD4 of video data displayed on the display device 36, for example, with the finger FG of the user, the height determination section 34aC of the signal processing unit 33C computes a height of the target sound source position A1 from the floor surface BL, corresponding to the first designated position A1.
[0661] In addition, an operation procedure (refer to
[0662] Next, an operation procedure following the initial setting in the directionality control system 10C of the present embodiment will be described with reference to
[0663] In
[0664] After step ST11C, if the notification indicating that the designation of the first designated position A1 and the second designated position A2 has been received is received from the directionality control apparatus 3C, the camera apparatus 11 acquires coordinate data including a horizontal angle and a vertical angle (.sub.CA1h,.sub.CA1v) to the target sound source position A1 corresponding to the first designated position A1 designated in step ST11C, and coordinate data including a horizontal angle and a vertical angle (.sub.CA2h,.sub.CA2v) to a position A2 corresponding to the second designated position A2 designated in step ST11C, with an installation position C of the camera apparatus 11 as a start point (step ST12C).
[0665] The camera apparatus 11 transmits, to the directionality control apparatus 3C, the coordinate data including the horizontal angle and the vertical angle (.sub.CA1h,.sub.CA1v) to the target sound source position A1 corresponding to the first designated position A1 and the coordinate data including the horizontal angle and the vertical angle (.sub.CA2h,.sub.CA2v) to the position A2 corresponding to the second designated position A2, designated in step ST11C with the installation position C of the camera apparatus 11 as a start point.
[0666] The height determination section 34aC of the signal processing unit 33C of the directionality control apparatus 3C computes the height H.sub.A1 of the target sound source position A1 from the floor surface BL, corresponding to the first designated position A1 which is designated with the finger FG of the user, by using the coordinate data including (.sub.CA1h,.sub.CA1v) and the coordinate data including (.sub.CA2h,.sub.CA2v), transmitted from the camera apparatus 11 (step ST17). Details of step ST17 will be described with reference to
[0667] The sound collection directional direction computation section 34b of the signal processing unit 33C of the directionality control apparatus 3C computes the sound collection directional coordinates (.sub.MAh,.sub.MAv) which are directed from the installation position M of the omnidirectional microphone array apparatus 2 toward the target sound source position A by using the coordinate data including the horizontal angle and the vertical angle (.sub.CAh,.sub.CAv) from the camera apparatus 11 to the target sound source position A1, and the height H.sub.A1 of the target sound source position A1 from the floor surface BL determined computed in step ST17 (step ST13C). The computation content in step ST13C and the operation content in step ST14 are the same as the computation content and the operation content in the seventh embodiment, and thus description thereof will be omitted. In the above-described way, the operation following the initial setting in the directionality control system 10C is finished.
Method of Computing Height H.SUB.A1 .of Target Sound Source Position A1 From Floor Surface BL
[0668] Here, with reference to
[0669]
[0670]
[0671] The sound source height determination section 34aC computes the height H.sub.A1 of the target sound source position A1 from the floor surface BL, corresponding to the first designated position A1 based on:
[0672] (1) a height H.sub.C of the camera apparatus 11 from the floor surface BL;
[0673] (2) a distance (horizontal component) L.sub.CA1h and an angle (vertical angle) .sub.CA1v from the camera apparatus 11 to the target sound source position A1; and
[0674] (3) a distance (horizontal component) L.sub.CA2h and an angle (vertical angle) .sub.CA2v from the camera apparatus 11 to the position A2 on the floor surface BL.
[0675] In the present height computation method, the input parameter in step ST2 illustrated in
[0676] Hereinafter, detailed description will be made of a method (the present height computation method) in which the sound source height determination section 34aC computes the height H.sub.A1 of the target sound source position A1 from the floor surface BL.
[0677] The sound source height determination section 34aC computes a tangent value tan .sub.CA1v of a vertical angle .sub.CA1v of an imaging direction which is directed from the camera apparatus 11 toward the target sound source position A1 according to Equation (77) based on a tangent for the triangle CA1Q illustrated in
[0678] Similarly, the sound source height determination section 34aC computes a tangent value tan .sub.CA2v of a vertical angle .sub.CA2v of an imaging direction which is directed from the camera apparatus 11 toward the position A2 on the floor surface BL according to Equation (78) based on a tangent for the triangle CA2Q illustrated in
[0679] Here, the target sound source position A1 and the position A2 on the floor surface BL have different coordinates in the vertically upper direction or the vertically lower direction illustrated in
[Equation 79]
L.sub.CA1h=L.sub.CA2h (79)
[0680] Therefore, the sound source height determination section 34aC computes the height H.sub.A1 of the target sound source position A1 from the floor surface BL according to Equation (80) by using the respective computation results of Equations (77) to (79).
[0681] In addition, a description will also be made of a method of computing the height H.sub.A1 of the target sound source position A1 from the floor surface BL in a case where a target object (person) is not present on the floor surface BL but is present on the stand RC with a predetermined height HD (refer to
[0682] The sound source height determination section 34aC computes a tangent value tan .sub.CA1v of a vertical angle .sub.CA1v of an imaging direction which is directed from the camera apparatus 11 toward the target sound source position A1 according to Equation (81) based on a tangent for the triangle CA1Q illustrated in
[0683] Similarly, the sound source height determination section 34aC computes a tangent value tan .sub.CA2v of a vertical angle .sub.CA2v of an imaging direction which is directed from the camera apparatus 11 toward the position A2 on the stand RC according to Equation (82) based on a tangent for the triangle CA2Q illustrated in
[0684] Here, the target sound source position A1 and the position A2 on the stand RC have different coordinates in the vertically upper direction or the vertically lower direction illustrated in
[0685] Therefore, the sound source height determination section 34aC computes the height H.sub.A1 of the target sound source position A1 from the floor surface BL according to Equation (83) by using the respective computation results of Equations (79), (81) and (82).
[0686] In the above-described way, in the directionality control system 10C of the present embodiment, the sound source height determination section 34aC of the directionality control apparatus 3C can accurately compute a height of the target sound source position A1 from the floor surface BL, corresponding to the first designated position A1, in response to designation of the first designated position A1 and the second designated position A2 on the floor surface BL corresponding to the first designated position A1 in video data displayed on the display device 36.
[0687] Therefore, in the directionality control apparatus 3C, a configuration file in which a target sound source position corresponding to a position designated in a video data is correlated with a height of the target sound source position from the floor surface BL in advance is not required to be created, and a height of the target sound source position A1 from the floor surface BL, desired by the user, can be accurately computed in response to a simple designation operation such as designation of two positions without displaying options of a height of a target sound source position A from the floor surface BL or an entry form on the display device 36.
[0688] In addition, also in the directionality control system 10C of the present embodiment, in the same manner as in the directionality control system 10 of the seventh embodiment, the directionality control apparatus 3C can uniquely determine the height H.sub.A of the target sound source position A1 of a target object from the floor surface BL, present in an imaging direction in which the camera apparatus 11 performs imaging in the sound collection space K, and can thus accurately compute a sound collection directional direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A1 based on the height H.sub.A1 of the target sound source position A1 from the floor surface BL. In addition, the directionality control apparatus 3C can cause the omnidirectional microphone array apparatus 2 to form the sound collection directionality in the computed sound collection directional direction.
[0689] Hereinafter, configurations, operations, and effects of the above-described directionality control system and directionality control method related to the present invention will be described.
[0690] According to an embodiment of the present invention, there is provided a directionality control system including at least one imaging part that captures a video; a sound collection part that collects sound; a display part that displays video data captured by the at least one imaging part; a height determination part that determines a height of a target sound source position from a reference surface, corresponding to a position designated in the video data in response to designation of the position in the video data; a sound collection directional direction computation part that computes a sound collection directional direction which is directed from the sound collection part toward the target sound source position based on the height of the target sound source position from the reference surface; and a control part that causes the sound collection part to form sound collection directionality of the sound in the computed sound collection directional direction.
[0691] In the above-described configuration, the sound source height determination section 34a of the directionality control apparatus 3 determines the height H.sub.A of the target sound source position A from the floor surface BL, corresponding to the designated position A in video data in response to a user's designation of the designated position A in the video data displayed on the display device 36. The sound collection directional direction computation section 34b of the directionality control apparatus 3 computes a sound collection directional direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A corresponding to the designated position A based on the height H.sub.A of the target sound source position A from the floor surface BL. The output control section 34c of the directionality control apparatus 3 causes the omnidirectional microphone array apparatus 2 to form the sound collection directionality in the computed sound collection directional direction.
[0692] Consequently, in the directionality control system 10, the directionality control apparatus 3 can uniquely determine the height H.sub.A of the target sound source position A of a target object from the floor surface BL, present in an imaging direction in which the camera apparatus 11 performs imaging in the sound collection space K, and can thus accurately compute a sound collection directional direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A based on the height H.sub.A of the target sound source position A from the floor surface BL. In addition, the directionality control apparatus 3 can cause the omnidirectional microphone array apparatus 2 to form the sound collection directionality in the computed sound collection directional direction.
[0693] In addition, in the directionality control system according to the embodiment of the present invention, the directionality control system further includes a storage part that stores first configuration data in which a target sound source position corresponding to a position designated in the video data is correlated with a height of the target sound source position from a reference surface in advance, and the height determination part determines a height of the target sound source position from the reference surface, corresponding to the designated position based on the first configuration data.
[0694] In the above-described configuration, the memory 38 of the directionality control apparatus 3 stores the configuration file CF1 including, for example, the first configuration data in which the target sound source position A corresponding to the designated position A in video data designated in response to a user's input operation is correlated with a height of the target sound source position A from the floor surface BL in advance.
[0695] Therefore, the sound source height determination section 34a can easily determine, for example, the height H.sub.A of the target sound source position A from the floor surface BL, corresponding to the designated position A which is designated in response to a user's input operation based on the first configuration data of the configuration file CF1.
[0696] In addition, in the directionality control system according to the embodiment of the present invention, the directionality control system further includes a storage part that stores second configuration data including a plurality of types of height data as heights of the target sound source position from the reference surface, the control part displays options of the plurality of types of height data of the second configuration data as a height of the target sound source position from the reference surface, and the height determination part determines a height of the target sound source position from the reference surface, corresponding to the designated position in response to selection of any one of the options of the plurality of types of height data.
[0697] In the above-described configuration, the memory 38A of the directionality control apparatus 3A stores the configuration file CF2 including height data (second configuration data) regarding a plurality of types of heights H .sub.A-A, H.sub.A-B, H.sub.A-D, . . . , as heights of the target sound source position A from the floor surface BL. The output control section 34cA of the directionality control apparatus 3A displays options of the plurality of types of height data of the second configuration data included in the configuration file CF2 as heights of the target sound source position A from the floor surface BL on the display device 36.
[0698] Therefore, the sound source height determination section 34aA of the directionality control apparatus 3A can easily determine the height H.sub.A of the target sound source position A from the floor surface BL, corresponding to the designated position A which is designated, for example, in response to a user's input operation, in accordance with selection of any one of displayed options of a plurality of types of height data.
[0699] Further, in the directionality control system according to the embodiment of the present invention, the control part displays an entry form of a height of the target sound source position from the reference surface, and the height determination part determines a height of the target sound source position from the reference surface, corresponding to the designated position in response to inputting of the height of the target sound source position from the reference surface on the displayed entry form.
[0700] The output control section 34cB of the directionality control apparatus 3B displays the entry form screen WD3 which allows the user to input a height of the target sound source position A from the floor surface BL, on the display device 36.
[0701] Therefore, the sound source height determination section 34aB of the directionality control apparatus 3B can accurately determine a height of the target sound source position A from the floor surface BL, desired by the user, in response to inputting of a height of the target sound source position A from the floor surface BL on the displayed entry form screen WD3. Consequently, the sound collection directional direction computation section 34b of the directionality control apparatus 3B can accurately compute a sound collection directional direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A corresponding to the designated position A.
[0702] In addition, in the directionality control system according to the embodiment of the present invention, in response to designation of a first designated position and a second designated position on the reference surface corresponding to the first designated position in the displayed video data, the height determination part computes a height of the target sound source position from the reference surface, corresponding to the first designated position.
[0703] In the above-described configuration, the sound source height determination section 34aC of the directionality control apparatus 3C can accurately compute a height of the target sound source position A1 from the floor surface BL, corresponding to the first designated position A1, in response to designation of the first designated position A1 and the second designated position A2 on the floor surface BL corresponding to the first designated position A1 in video data displayed on the display device 36.
[0704] Therefore, in the directionality control apparatus 3C, a configuration file in which a target sound source position corresponding to a position designated in a video data is correlated with a height of the target sound source position from the floor surface BL in advance is not required to be created, and a height of the target sound source position A1 from the floor surface BL, desired by the user, can be accurately computed in response to a simple designation operation such as designation of two positions without displaying options of a height of a target sound source position A from the floor surface BL or an entry form on the display device 36.
[0705] In addition, in the directionality control system according to the embodiment of the present invention, the height determination part computes a height of the target sound source position from the reference surface, corresponding to the first designated position, based on a height of the at least one imaging part from the reference surface, a distance and an angle from the at least one imaging part to the target sound source position corresponding to the first designated position, and a distance and an angle from the at least one imaging part to a position of the reference surface corresponding to the second designated position.
[0706] In the above-described configuration, the sound source height determination section 34aC of the directionality control apparatus 3C can accurately compute a height of the target sound source position A1 from the floor surface BL, corresponding to the first designated position A1, based on the height H.sub.C of the camera apparatus 11 from the floor surface BL, the distance L.sub.CA1h and the angle .sub.CA1v from the camera apparatus 11 to the target sound source position A1 corresponding to the first designated position A1, and the distance L.sub.CA2h and the angle .sub.CA2v from the camera apparatus 11 to the position A2 of the floor surface BL corresponding to the second designated position A2.
[0707] According to another embodiment of the present invention, there is provided a directionality control method for a directionality control system including at least one imaging part that captures a video and a sound collection part that collects sound, the method including a step of displaying video data captured by the at least one imaging part; a step of determining a height of a target sound source position from a reference surface, corresponding to a position designated in the video data in response to designation of the position in the video data; a step of computing a sound collection directional direction which is directed from the sound collection part toward the target sound source position based on the height of the target sound source position from the reference surface; and a step of causing the sound collection part to form sound collection directionality of the sound in the computed sound collection directional direction.
[0708] In the above-described method, the sound source height determination section 34a of the directionality control apparatus 3 determines the height H.sub.A of the target sound source position A from the floor surface BL, corresponding to the designated position A in video data in response to a user's designation of the designated position A in the video data displayed on the display device 36. The sound collection directional direction computation section 34b of the directionality control apparatus 3 computes a sound collection directional direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A corresponding to the designated position A based on the height H.sub.A of the target sound source position A from the floor surface BL. The output control section 34c of the directionality control apparatus 3 causes the omnidirectional microphone array apparatus 2 to form the sound collection directionality in the computed sound collection directional direction.
[0709] Consequently, in the directionality control system 10, the directionality control apparatus 3 can uniquely determine the height H.sub.A of the target sound source position A of a target object from the floor surface BL, present in an imaging direction in which the camera apparatus 11 performs imaging in the sound collection space K, and can thus accurately compute a sound collection directional direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A based on the height H.sub.A of the target sound source position A from the floor surface BL. In addition, the directionality control apparatus 3 can cause the omnidirectional microphone array apparatus 2 to form the sound collection directionality in the computed sound collection directional direction.
[0710] An eleventh embodiment described below relates to a directionality control system and a directionality control method, in which sound collection directionality of a microphone array apparatus which collects sound is controlled.
[0711] Patent Literature 1 is based on, for example, a camera apparatus, a microphone array apparatus, and a subject (for example, a speaker) being present on the same plane assuming usage forms in a television conference system. However, in the above-described monitoring system, the camera apparatus, the microphone array apparatus, and the subject (for example, a staff of a store, or a visitor) are all seldom present on the same plane in practice.
[0712] For example, since the camera apparatus and the microphone array apparatus are often installed on an upper side (for example, a ceiling surface of a store) of a subject, the camera apparatus, the microphone array apparatus, and the subject are not present on a planar two-dimensional coordinate but are present on a stereoscopic three-dimensional coordinate in most cases.
[0713] Therefore, in the above-described surveillance system, in a case where the microphone array apparatus collects sounds of conversations of subjects in an image which is currently captured by the camera apparatus, there is a problem in that it is hard to use coordinates (horizontal angle, vertical angle) computed according to a method disclosed in Patent Literature 1 and indicating a direction in which the microphone array apparatus collects sound, without changing the coordinates.
[0714] For this reason, if a control system disclosed in Patent Literature 1 in which only a panning direction of the camera, that is, only a horizontal angle is used to change a sound collection region of the microphone array apparatus is applied to the above-described monitoring system, the microphone array apparatus is unlikely to collect sound in an imaging direction of the camera apparatus.
[0715] In addition, in the above-described monitoring system, in a case where the camera apparatus and the microphone array apparatus are integrally assembled with each other, an optical axis of the camera apparatus and a physical central axis of the microphone array apparatus are common to each other. Thus, in a case where the microphone array apparatus collects conversations of a subject present in an imaging direction of the camera apparatus, coordinates (horizontal angle, vertical angle) indicating a direction (hereinafter, referred to as a sound collection direction) in which the microphone array collects sound are the same as coordinates (horizontal angle, vertical angle) indicating an imaging direction of the camera apparatus.
[0716] However, in a case where the camera apparatus and the microphone array apparatus are disposed at different positions separately from each other, the optical axis of the camera is different from the physical central axis of the microphone array. For this reason, in a case where the microphone array apparatus collects conversations of a subject in videos which are currently being captured by the camera, there is a problem in that coordinates (horizontal angle, vertical angle) indicating of a sound collection direction of the microphone array apparatus are not the same as coordinates (horizontal angle, vertical angle) indicating an imaging direction of the camera apparatus.
[0717] Therefore, in the eleventh embodiment related to the present invention, in order to solve the above-described problems of the related art, a description will be made of examples of a directionality control system and a directionality control method, in which sound collection directionality is formed in a sound collection direction which is directed toward a target sound source position (or a sound position) corresponding to a designated position in a captured image obtained by a camera apparatus with a microphone array apparatus as a reference, and sound in the sound collection direction is collected with high accuracy.
Eleventh Embodiment
[0718] Hereinafter, as the eleventh embodiment of a directionality control system and a directionality control method related to the present invention, for example, an embodiment (hereinafter, referred to as the present embodiment) of a sound collection system and a sound collection control method will be described with reference to the drawings. The sound collection system of the present embodiment is used as a monitoring system (including a manned monitoring system and an unmanned monitoring system) provided in, for example, a factory, a public facility (for example, a library or an event hall), or a store (for example, a retail store or a bank).
[0719] In addition, the present invention can be expressed as respective apparatuses (for example, a directionality control apparatus to be described later) constituting the sound collection system, or a sound collection control method including respective operations (steps) performed by each apparatus constituting the sound collection system.
Configuration of Sound Collection System
[0720]
[0721] Hereinafter, the respective apparatuses constituting the sound collection system 10 will be described.
[0722] The PTZ camera apparatus 1 as an example of an imaging part is fixed to and installed on, for example, a ceiling surface of the store or a stand (refer to
[0723] If at least one of calibration markers MAK and MAK3, a calibration floor marker MAK2 (which will be described later), and the omnidirectional microphone array apparatus 2 is designated in captured image data displayed on the display device 36 with the finger FG of the user, the PTZ camera apparatus 1 receives coordinate data regarding the position designated in the image capturing device from the directionality control apparatus 3. The PTZ camera apparatus 1 computes parameters of a distance and a direction from an installation position of the PTZ camera apparatus 1 to at least one of the calibration markers MAK and MAK3, the calibration floor marker MAK2, and the omnidirectional microphone array apparatus 2. A process of computing the parameters of a distance and a direction in the PTZ camera apparatus 1 is a well-known technique, and thus description thereof will be omitted.
[0724] In addition, if any position (for example, a designated position A) is designated with the finger FG of the user in the captured image data displayed on the display device 36, the PTZ camera apparatus 1 receives coordinate data regarding the designated position A in the captured image data from the directionality control apparatus 3. The PTZ camera apparatus 1 computes parameters of a distance and a direction from the installation position of the PTZ camera apparatus 1 to an actual field position (target sound source position A) corresponding to the designated position A. The PTZ camera apparatus 1 transmits parameters of a distance and a direction from the installation position of the PTZ camera apparatus 1 to at least one of the calibration markers MAK and MAK3, the calibration floor marker MAK2, the microphone array apparatus 2, the omnidirectional microphone array apparatus 2, and the target sound source position A, to the directionality control apparatus 3.
[0725] The omnidirectional microphone array apparatus 2 as an example of a sound collection part is fixed to and installed on, for example, a ceiling surface of a store or a stand (refer to
[0726] The omnidirectional microphone array apparatus 2 forms sound collection directionality of each of the microphone units 22 and 23 in a sound collection direction indicted by sound collection direction coordinates (.sub.MAh,.sub.MAv) included in a directionality formation instruction in response to the directionality formation instruction (which will be described later) transmitted from the directionality control apparatus 3. The omnidirectional microphone array apparatus 2 performs predetermined sound processing on audio data collected by each of the microphone units 22 and 23, and transmits audio data obtained through the predetermined sound processing to the directionality control apparatus 3 or the recorder apparatus 4 via the network NW.
[0727] Consequently, the omnidirectional microphone array apparatus 2 can relatively increase a volume level of audio data in a sound collection direction in which the sound collection directionality is formed, and can relatively reduce a volume level of audio data in a direction in which the sound collection directionality is not formed. In addition, a method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv) will be described later. In addition, exteriors of the microphone array apparatus 2 will be described later with reference to
[0728] The directionality control apparatus 3 may be, for example, a stationery personal computer (PC) installed in a monitoring control room (not illustrated), and may be a data communication terminal such as a mobile phone, a personal digital assistant (PDA), a tablet terminal, or a smart phone, which can be carried by a user.
[0729] The directionality control apparatus 3 includes at least a communication unit 31, an operation unit 32, a signal processing unit 33, a display device 36, a speaker device 37, and a memory 38. The signal processing unit 33 includes at least a sound collection direction computation section 34 and an output control section 35.
[0730] The communication unit 31 receives captured image data transmitted from the PTZ camera apparatus 1 or audio data transmitted from the microphone array apparatus 2, and outputs the data to the signal processing unit 33.
[0731] The operation unit 32 is a user interface (UI) for notifying the signal processing unit 33 of the content of a user's input operation, and is, for example, a pointing device such as a mouse or a keyboard. In addition, the operation unit 32 may be configured by using a touch panel or a touch pad which is disposed so as to correspond to, for example, a screen of the display device 36 and allows an input operation to be performed with the finger FG of the user or a stylus pen.
[0732] The operation unit 32 outputs coordinate data regarding a region where the user desires to increase a volume level, that is, the designated position A which is designated with the finger FG of the user in a sound collection in region B illustrated in
[0733] In addition, if at least one of the calibration markers MAK and MAK3, the calibration floor marker MAK2 (which will be described later), and the omnidirectional microphone array apparatus 2 is designated with the finger FG of the user in the captured image data obtained by the PTZ camera apparatus 1 and displayed on the display device 36, the operation unit 32 outputs coordinate data regarding the designated position to the signal processing unit 33.
[0734] The signal processing unit 33 is configured by using, for example, a central processing unit (CPU), a micro processing unit (MPU), or a digital signal processor (DSP), and performs a control process for collectively controlling operations of the respective units of the directionality control apparatus 3, data input and output processes with other respective units, a data computation (computation) process, and a data storage process.
[0735] If the coordinate data regarding at least one of the calibration markers MAK and MAK3, the calibration floor marker MAK2, and the omnidirectional microphone array apparatus 2 displayed on the display device 36 is acquired from the operation unit 32 during calibration (will be described later), the sound collection direction computation section 34 transmits the coordinate data to the PTZ camera apparatus 1 via the communication unit 31. The sound collection direction computation section 34 acquires parameters of a distance and a direction from the installation position of the PTZ camera apparatus 1 to at least one of the calibration markers MAK and MAK3, the calibration floor marker MAK2, and the omnidirectional microphone array apparatus 2 from the communication unit 31.
[0736] The sound collection direction computation section 34 computes a calibration parameter which differs for each calibration method (which will be described later) by using the parameters of the distance and the direction from the installation position of the PTZ camera apparatus 1 to at least one of the calibration markers MAK and MAK3, the calibration floor marker MAK2, and the omnidirectional microphone array apparatus 2.
[0737] In addition, if the coordinate data regarding the designated position A is acquired from the operation unit 32 when computing the sound collection direction coordinates (.sub.MAh,.sub.MAv) (which will be described later), the sound collection direction computation section 34 transmits the coordinate data to the PTZ camera apparatus 1 via the communication unit 31. The sound collection direction computation section 34 acquires parameters of a distance and a direction from the installation position of the PTZ camera apparatus 1 to the target sound source position A from the communication unit 31.
[0738] The sound collection direction computation section 34 computes the sound collection direction coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction which is directed from an installation position of the omnidirectional microphone array apparatus 2 toward the target sound source position A (which will be described later) by using the parameters (which will be described later) of the distance and the direction from the installation position of the PTZ camera apparatus 1 to the target sound source position A, and calibration parameters (which will be described later) computed during the calibration. In the sound collection direction coordinates (.sub.MAh,.sub.MAv), .sub.MAh indicates a horizontal angle of a sound collection direction which is directed from the installation position of the omnidirectional microphone array apparatus 2 toward the target sound source position A, and .sub.MAv indicates a vertical angle of the sound collection direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A.
[0739] In addition, in the following description, the target sound source position is defined as a field position which corresponds to the position A designated by the user (for example, with the finger FG of the user or a stylus pen) in captured image data displayed on the display device 36 and obtained by the PTZ camera apparatus 1, and is an actual monitoring target. Further, for simplification of description of the present embodiment, a single target sound source position will be described, but a plurality of target sound source positions corresponding to designated positions which are designated by the user may be employed.
[0740] The output control section 35 controls operations of the display device 36 and the speaker device 37, so as to display captured image data transmitted from the PTZ camera apparatus 1 on the display device 36, and to output audio data transmitted from the omnidirectional microphone array apparatus 2, from the speaker device 37 as sound. Further, the output control section 35 controls an operation of the omnidirectional microphone array apparatus 2, for example, so as to cause the omnidirectional microphone array apparatus 2 to form the sound collection directionality of sound collected by the omnidirectional microphone array apparatus 2 in a sound collection direction MIX indicated by the sound collection direction coordinates (.sub.MAh,.sub.MAv) computed by the sound collection direction computation section 34, or the output control section 35 forms the sound collection directionality of sound collected by the omnidirectional microphone array apparatus 2 (refer to
[0741] The display device 36 as an example of a display part is configured by using, for example, a liquid crystal display (LCD) or an organic electroluminescent (EL) element, and displays captured image data obtained by the PTZ camera apparatus 1 under the control of the output control section 35.
[0742] The speaker device 37 as an example of a sound output part outputs, as sound, audio data of sound collected by the omnidirectional microphone array apparatus 2 or audio data of sound which is collected after the sound collection directionality is formed in a sound collection direction indicated by the sound collection direction coordinates (.sub.MAh,.sub.MAv). In addition, the display device 36 and the speaker device 37 may be configured separately from the directionality control apparatus 3.
[0743] The memory 38 as an example of a storage part is configured by using, for example, a random access memory (RAM), and functions as a work memory when the respective units of the directionality control apparatus 3 operate.
[0744] The recorder apparatus 4 stores image data captured by the PTZ camera apparatus 1 and audio data of sound collected by the omnidirectional microphone array apparatus 2 in correlation with each other.
[0745] Next, a summary of an operation of the sound collection system 10 of the present embodiment will be described with reference to
[0746] In the sound collection system 10 illustrated in ) as sound. Image data captured by the PTZ camera apparatus 1 is displayed on the display device 36 of the directionality control apparatus 3 (refer to
[0747] Here, if the position A which is a substantially central position of the two people who are target objects is designated, for example, with the finger FG of the user in captured image data displayed on the display device 36, the directionality control apparatus 3 computes sound collection direction coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction which is directed from the installation position of the omnidirectional microphone array apparatus 2 toward the target sound source position A by using coordinate data indicating the position A. The omnidirectional microphone array apparatus 2 forms sound collection directionality (sound collection direction MIX) in the sound collection direction which is directed from the microphone array apparatus 2 toward the target sound source position A by using the sound collection direction coordinates (.sub.MAh,.sub.MAv) computed by the directionality control apparatus 3.
[0748] Therefore, the omnidirectional microphone array apparatus 2 can increase a volume level of a conversation (Hello) of the two people present in the direction in which the sound collection directionality (sound collection direction MIX) is formed higher than a volume level of the music () output from the speaker device SP which is not present in the direction in which the sound collection directionality (sound collection direction MIX) is formed.
[0749] Consequently, the directionality control apparatus 3 causes the speaker device 37 to output sound with a volume level of the conversation (Hello) of the two people present in the direction in which the sound collection directionality (sound collection direction MIX) is formed higher than a volume level of the music () output from the speaker device SP which is not present in the direction in which the sound collection directionality (sound collection direction MIX) is formed (refer to
[0750] Next, with reference to
[0751] In
[0752] After the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 are initially installed, the sound collection direction computation section 34 performs a process of computing or acquiring a calibration parameter which is required to compute the sound collection direction coordinates (.sub.MAh,.sub.MAv), that is, calibration (step ST12).
[0753] Here, processing content of the calibration will be described with reference to
[0754] In
[0755] After step ST12-1, if the calibration marker MAK displayed on the display device 36 is designated with the finger FG of the user, the operation unit 32 acquires coordinate data regarding the designated position and outputs the coordinate data to the signal processing unit 33 (step ST12-2). If the coordinate data regarding the calibration marker MAK displayed on the display device 36 is acquired from the operation unit 32, the sound collection direction computation section 34 transmits the coordinate data to the PTZ camera apparatus 1 via the communication unit 31. The PTZ camera apparatus 1 computes parameters of a distance and a direction from the installation position of the PTZ camera apparatus 1 to the calibration marker MAK, and transmits the parameters to the directionality control apparatus 3. The communication unit 31 receives the parameters of the distance and the direction from the installation position of the PTZ camera apparatus 1 to the calibration marker MAK from the PTZ camera apparatus 1, and outputs the parameters to the signal processing unit 33. The sound collection direction computation section 34 acquires the parameters of the distance and the direction from the installation position of the PTZ camera apparatus 1 to the calibration marker MAK from the communication unit 31.
[0756] The sound collection direction computation section 34 computes a calibration parameter which differs for each calibration method (which will be described later) by using the parameters of the distance and the direction from the installation position of the PTZ camera apparatus 1 to the calibration marker MAK (step ST12-3). After step ST12-3, the calibration marker MAK installed in step ST12-1 is detached (step ST12-4). In addition, the sound collection direction computation section 34 temporarily preserves the calibration parameter computed in step ST12-3 in the memory 38. In the above-described way, the calibration process shown in step ST12 is completed.
[0757] In
[0758] If the coordinate data regarding the designated position A is acquired from the operation unit 32, the sound collection direction computation section 34 transmits the coordinate data to the PTZ camera apparatus 1 via the communication unit 31. The PTZ camera apparatus 1 receives the coordinate data regarding the designated position A in the captured image data from the directionality control apparatus 3, and computes parameters of a distance and a direction from the installation position of the PTZ camera apparatus 1 to the target sound source position A corresponding to the designated position A. The PTZ camera apparatus 1 transmits the parameters of the distance and the direction from the installation position of the PTZ camera apparatus 1 to the target sound source position A corresponding to the designated position A, to the directionality control apparatus 3. The sound collection direction computation section 34 acquires the parameters of the distance and the direction from the installation position of the PTZ camera apparatus 1 to the target sound source position A from the communication unit 31.
[0759] The sound collection direction computation section 34 computes the sound collection direction coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction which is directed from the installation position of the omnidirectional microphone array apparatus 2 toward the target sound source position A by using the parameters of the distance and the direction from the installation position of the PTZ camera apparatus 1 to the target sound source position A and the calibration parameter computed during the calibration in step ST12 (step ST14).
[0760] The output control section 35 generates an directionality formation instruction for forming directionality of sound in the sound collection direction indicated by the coordinate data of the sound collection direction coordinates (.sub.MAh,.sub.MAv) computed in step ST14, and transmits the directionality formation instruction including the coordinate data of the sound collection direction coordinates (.sub.MAh,.sub.MAv) to the omnidirectional microphone array apparatus 2 via the communication unit 31. The omnidirectional microphone array apparatus 2 forms the sound collection directionality of each of the microphone units 22 and 23 in the sound collection direction MIX indicated by the sound collection direction coordinates (.sub.MAh,.sub.MAv) included in the directionality formation instruction in response to the directionality formation instruction transmitted from the directionality control apparatus 3 (step ST15). In the above-described way, the operation of the sound collection system 10 of the present embodiment is finished.
[0761] In addition, the omnidirectional microphone array apparatus 2 has been described as the omnidirectional microphone array apparatus 2 performing the process of forming the sound collection directionality shown in step ST15 in response to the directionality formation instruction transmitted from the directionality control apparatus 3, but the directionality control apparatus 3 may performing the process of forming the sound collection directionality shown in step ST15. Specifically, the output control section 35 performs the directionality forming process described with reference to
[0762] In addition, in the sound collection system 10 of the present embodiment, a timing at which the omnidirectional microphone array apparatus 2 collects sound is not limited to the time right after step ST14, and may be, for example, the time after power is supplied to the omnidirectional microphone array apparatus 2 after the initial setting is performed in step ST11.
(Method of Computing Sound Collection Direction Coordinates (.SUB.MAh.,.SUB.MAv.) according to Calibration Method
[0763] Next, with reference to
First Method of Computing Sound Collection Direction Coordinates according to First Calibration Method
[0764] First, with reference to
[0765]
[0766]
[0767] In the first calibration method, the PTZ camera apparatus 1 is driven in a panning direction and a tilting direction and then focuses and zooms the calibration marker MAK, and thus the calibration marker MAK is located at a central point of a screen of the display device 36 (refer to
[0768] In the first calibration method, respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from a horizontal surface (for example, a floor in the store; this is also the same for the following description, and the floor is an example of the horizontal surface) are the same as each other, and the calibration marker MAK which is a solid object is suspended vertically downward directly under a predetermined position (for example, the center of a casing of the omnidirectional microphone array apparatus 2; this is also the same for the following description) of the omnidirectional microphone array apparatus 2 by using a string STR or a cord (refer to
[0769] If a central point O (refer to
[0770] In description of the following first to fifth calibration methods, the respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are the same as each other (refer to
[0771] If the central point O (refer to
[0772] The sound collection direction computation section 34 computes the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 according to Equation (78) by using the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 to the calibration marker MAK, and the distance L.sub.MO from the omnidirectional microphone array apparatus 2 to the calibration marker MAK in the triangle COM illustrated in
[Equation 84]
L.sub.COh=L.sub.CMh=L.sub.MO/tan .sub.COv (84)
[0773] Next, in the first method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), respective heights of the calibration marker MAK and the target sound source position A from the floor are the same as each other (H.sub.O=H.sub.A), and the respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are the same as each other.
[0774] Specifically, the sound collection direction computation section 34 computes the sound collection direction coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A by using:
[0775] (1) the horizontal angle .sub.CMh which is directed from the PTZ camera apparatus 1 toward the omnidirectional microphone array apparatus 2;
[0776] (2) the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2;
[0777] (3) the horizontal angle .sub.MCh which is directed from the omnidirectional microphone array apparatus 2 toward the PTZ camera apparatus 1; and
[0778] (4) a horizontal angle .sub.CAh and a vertical angle .sub.CAv which are directed from the PTZ camera apparatus 1 toward the target sound source position A. With reference to
[0779] (1) As the horizontal angle .sub.CMh which is directed from the PTZ camera apparatus 1 toward the omnidirectional microphone array apparatus 2, a value computed by the PTZ camera apparatus 1 in the first calibration method is used (.sub.CMh=.sub.COh).
[0780] (2) The horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 is a calibration parameter in the first calibration method (refer to Equation (84)).
[0781] (3) The horizontal angle .sub.MCh which is directed from the omnidirectional microphone array apparatus 2 toward the PTZ camera apparatus 1 is a predefined value (zero) obtained based on a positional relationship between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 in the first calibration method.
[0782] (4) As the horizontal angle .sub.CAh and the vertical angle .sub.CAv from the PTZ camera apparatus 1 to the target sound source position A, values are used which are computed by the PTZ camera apparatus 1 in response to designation of the designated position A in captured image data displayed on the display device 36.
[0783] The sound collection direction computation section 34 computes a horizontal direction distance L.sub.CAh of the distance L.sub.CA between the PTZ camera apparatus 1 and the target sound source position A according to Equation (85) by using the distance L.sub.MO (=L.sub.MOv) corresponding to a difference between the heights of the omnidirectional microphone array apparatus 2 and the target sound source position A from the floor, and the vertical angle .sub.CAv which is directed from the PTZ camera apparatus 1 to the target sound source position A in the triangle CAS illustrated in
[Equation 85]
L.sub.CAh=L.sub.MO/tan .sub.CAv (85)
[0784] The sound collection direction computation section 34 computes a horizontal direction distance L.sub.MAh of the distance L.sub.MA from the omnidirectional microphone array apparatus 2 to the target sound source position A according to Equation (86) based on the cosine theorem for the triangle CAM illustrated in
[0785] The sound collection direction computation section 34 computes a cosine value cos .sub.MAh of the horizontal angle .sub.MAh which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A according to Equation (87) based on the cosine theorem for the triangle CAM illustrated in
[0786] In addition, the sound collection direction computation section 34 computes a tangent value tan .sub.MAv of the vertical angle .sub.MAv which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A according to Equation (89) based on a tangent for the triangle MAS illustrated in
[0787] As mentioned above, in the first calibration method and the first method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), if the central point O of the calibration marker MAK displayed on the display device 36 is designated with the finger FG of the user, the sound collection direction computation section 34 can easily compute the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 as a calibration parameter, for example, even in a case where the PTZ camera apparatus 1 cannot image a floor surface directly under the omnidirectional microphone array apparatus 2 due to the presence of an obstacle on the floor directly under the omnidirectional microphone array apparatus 2.
[0788] In addition, the sound collection direction computation section 34 can easily compute the sound collection direction coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A without a user inputting the height H.sub.A of the target sound source position A from a horizontal surface during computation of the sound collection direction coordinates.
Second Method of Computing Sound Collection Direction Coordinates according to Second Calibration Method
[0789] Next, with reference to
[0790]
[0791]
[0792] In the same manner as in the first calibration method, also in the second calibration method, the PTZ camera apparatus 1 is driven in a panning direction and a tilting direction and then focuses and zooms the calibration marker MAK, and thus the calibration marker MAK is located at a central point of a screen of the display device 36 (refer to
[0793] In the second calibration method, respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are the same as each other, and the calibration marker MAK which is a solid object is suspended vertically downward directly under a predetermined position of the omnidirectional microphone array apparatus 2 by using a string STR or a cord (refer to
[0794] If a central point O (refer to
[0795] If the central point O (refer to
[0796] The sound collection direction computation section 34 computes the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 according to Equation (91) by using the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 to the calibration marker MAK, and the distance L.sub.MO from the omnidirectional microphone array apparatus 2 to the calibration marker MAK. In addition, since the calibration marker MAK is suspended vertically downward directly under a predetermined position of the omnidirectional microphone array apparatus 2, the horizontal direction distance L.sub.COh between the PTZ camera apparatus 1 and the calibration marker MAK is the same as the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 (L.sub.COh=L.sub.CMh).
[Equation 91]
L.sub.COh=L.sub.CMh=L.sub.MO/tan .sub.COv (91)
[0797] Next, in the second method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), a height of the target sound source position A from the floor is an input value which is input by the user and is different from that in the first method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv). In addition, respective heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are the same as each other.
[0798] Specifically, the sound collection direction computation section 34 computes the sound collection direction coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A by using:
[0799] (1) the horizontal angle .sub.CMh which is directed from the PTZ camera apparatus 1 toward the omnidirectional microphone array apparatus 2;
[0800] (2) the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2;
[0801] (3) the horizontal angle .sub.MCh which is directed from the omnidirectional microphone array apparatus 2 toward the PTZ camera apparatus 1;
[0802] (4) the respective heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor;
[0803] (5) a horizontal angle .sub.CAh and a vertical angle .sub.CAv which are directed from the PTZ camera apparatus 1 toward the target sound source position A; and
[0804] (6) the height H.sub.A of the target sound source position A from the floor, input by the user.
[0805] With reference to
[0806] (1) As the horizontal angle .sub.CMh which is directed from the PTZ camera apparatus 1 toward the omnidirectional microphone array apparatus 2, a value computed by the PTZ camera apparatus 1 in the second calibration method is used (L.sub.CMh=L.sub.COh).
[0807] (2) The horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 is a calibration parameter in the second calibration method (refer to Equation (85)).
[0808] (3) The horizontal angle .sub.MCh which is directed from the omnidirectional microphone array apparatus 2 toward the PTZ camera apparatus 1 is a predefined value (zero) obtained based on a positional relationship between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 in the second calibration method.
[0809] (4) The respective heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are predefined values in the second calibration method of computing the sound collection direction coordinates (H.sub.C=H.sub.M).
[0810] (5) As the horizontal angle .sub.CAh and the vertical angle .sub.CAv from the PTZ camera apparatus 1 to the target sound source position A, values are used which are computed by the PTZ camera apparatus 1 in response to designation of the designated position A in captured image data displayed on the display device 36.
[0811] (6) The height H.sub.A of the target sound source position A from the floor is a value which is input through a user's input operation.
[0812] The sound collection direction computation section 34 computes a horizontal direction distance L.sub.CAh of the distance L.sub.CA between the PTZ camera apparatus 1 and the target sound source position A according to Equation (92) by using the distance (H.sub.CH.sub.A) corresponding to a difference between the heights of the PTZ camera apparatus 1 and the target sound source position A from the floor, and the vertical angle .sub.CAv which is directed from the PTZ camera apparatus 1 toward the target sound source position A in the triangle CAS' illustrated in
[Equation 92]
L.sub.CAh=(H.sub.CH.sub.A)/tan .sub.CAv (92)
[0813] The sound collection direction computation section 34 computes a horizontal direction distance L.sub.MAh of the distance L.sub.MA from the omnidirectional microphone array apparatus 2 to the target sound source position A according to Equation (93) based on the cosine theorem for the triangle CAM illustrated in
[0814] The sound collection direction computation section 34 computes a cosine value cos .sub.MAh of the horizontal angle .sub.MAh which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A according to Equation (94) based on the cosine theorem for the triangle CAM illustrated in
[0815] In addition, the sound collection direction computation section 34 computes a tangent value tan .sub.MAv of the vertical angle .sub.MAv which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A according to Equation (96) based on a tangent for the triangle MAS illustrated in
[0816] As mentioned above, in the second calibration method and the second method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), if the central point O of the calibration marker MAK displayed on the display device 36 is designated with the finger FG of the user, the sound collection direction computation section 34 can easily compute the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 as a calibration parameter, for example, even in a case where the PTZ camera apparatus 1 cannot image a floor surface directly under the omnidirectional microphone array apparatus 2 due to the presence of an obstacle on the floor directly under the omnidirectional microphone array apparatus 2.
[0817] In addition, the sound collection direction computation section 34 uses the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor, which is easily measured when the omnidirectional microphone array apparatus 2 is initially installed. Therefore, in a case where the calibration marker MAK is suspended from the omnidirectional microphone array apparatus 2, a suspension distance can be easily adjusted. Further, since a value desired by the user can be used as the height H.sub.A of the target sound source position A from the floor, a degree of freedom of selection of the height H.sub.A of the target sound source position A from the floor is improved.
Second Method of Computing Sound Collection Direction Coordinates according to Third Calibration Method
[0818] Next, with reference to
[0819]
[0820] In the same manner as in the first calibration method, also in the third calibration method, the PTZ camera apparatus 1 is driven in a panning direction and a tilting direction and then focuses and zooms the calibration floor marker MAK2, and thus the calibration floor marker MAK2 is located at a central point of a screen of the display device 36 (refer to
[0821] In the third calibration method, respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are the same as each other, and the circular calibration floor marker MAK2 is installed at a position on the floor located vertically downward directly under a predetermined position of the omnidirectional microphone array apparatus 2 (refer to
[0822] If a central point O (refer to
[0823] If the central point O (refer to
[0824] The sound collection direction computation section 34 computes the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 according to Equation (98) by using the distance L.sub.CO from the PTZ camera apparatus 1 to the calibration floor marker MAK2, and the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 toward the calibration floor marker MAK2 in the triangle COM illustrated in
[Equation 98]
L.sub.COh=L.sub.CMh=L.sub.CO/cos .sub.COv (98)
[0825] In addition, the sound collection direction computation section 34 computes the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor according to Equation (99) by using the distance L.sub.CO from the PTZ camera apparatus 1 to the calibration floor marker MAK2, and the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 toward the calibration floor marker MAK2 in the triangle COM illustrated in
[Equation 99]
H.sub.M=L.sub.CO/sin .sub.COv (99)
[0826] Further, a method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv) in the sound collection direction computation section 34 after computing the calibration parameters according to the third calibration method is the same as the above-described second method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), and thus description thereof will be omitted.
[0827] As mentioned above, in the third calibration method and the second method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), if the central point O (refer to
Second Method of Computing Sound Collection Direction Coordinates according to Fourth Calibration Method
[0828] Next, with reference to
[0829]
[0830] In the same manner as in the first calibration method, also in the fourth calibration method, the PTZ camera apparatus 1 is driven in a panning direction and a tilting direction and then focuses and zooms the calibration floor marker MAK2, and thus the calibration floor marker MAK2 is located at a central point of a screen of the display device 36 (refer to
[0831] In the fourth calibration method, respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are the same as each other, and the circular calibration floor marker MAK2 is installed at a position on the floor located vertically downward directly under a predetermined position of the omnidirectional microphone array apparatus 2 (refer to
[0832] If two locations (refer to
[0833] If two locations (refer to
[0834] Here, if the central point O of the calibration floor marker MAK2 displayed on the display device 36 is initially designated, the output control section 35 displays a guide line GUD for indicating a location which is to be designated next. Consequently, the output control section 35 can cause the user to easily recognize a position of the end point O which is to be designated following the central point of the calibration floor marker MAK2 displayed on the display device 36.
[0835] In addition, the end point O on the circumference is not any position on the circumference of the calibration floor marker MAK2 but is a position which provides the same horizontal angle as the horizontal angle .sub.COh which is directed from the PTZ camera apparatus 1 toward the central point of the calibration floor marker MAK2. Therefore, the horizontal angle .sub.COh which is directed from the PTZ camera apparatus 1 toward the central point of the calibration floor marker MAK2 is the same as the horizontal angle .sub.COh which is directed from the PTZ camera apparatus 1 toward the end point O of the calibration floor marker MAK2 (.sub.COh=.sub.COh).
[0836] The sound collection direction computation section 34 computes a relational expression of the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 toward the central point O of the calibration floor marker MAK2, the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor, and the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 according to Equation (100) in the triangle COM illustrated in
[Equation 100]
tan .sub.COv=H.sub.M/L.sub.CMh (100)
[0837] Similarly, the sound collection direction computation section 34 computes a relational expression of the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 toward the end point O of the calibration floor marker MAK2, the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor, the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2, and the radius R.sub.O of the calibration floor marker MAK2 according to Equation (101) in the triangle COT illustrated in
[Equation 101]
tan .sub.COv=H.sub.M/(L.sub.CMh+R.sub.O) (101)
[0838] The sound collection direction computation section 34 computes the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 according to Equation (102) by using the relational expressions shown in Equations (100) and (101).
[Equation 102]
L.sub.COh=L.sub.CMh=R.sub.Otan .sub.COv/(tan .sub.COvtan .sub.COv) (102)
[0839] In addition, the sound collection direction computation section 34 computes the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor according to Equation (103) by using the respective computation results of Equations (100) and (102). Further, the sound collection direction computation section 34 may also compute the height H.sub.C of the PTZ camera apparatus 1 from the floor according to Equation (103).
[Equation 103]
H.sub.M=H.sub.C=L.sub.CMhtan .sub.COv (103)
[0840] Furthermore, a method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv) in the sound collection direction computation section 34 after computing the calibration parameters according to the fourth calibration method is the same as the above-described second method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), and thus description thereof will be omitted.
[0841] As mentioned above, in the fourth calibration method and the second method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), if the two locations (refer to
First Method of Computing Sound Collection Direction Coordinates according to Fifth Calibration Method
[0842] Next, with reference to
[0843]
[0844] In the same manner as in the first calibration method, also in the fifth calibration method, the PTZ camera apparatus 1 is driven in a panning direction and a tilting direction and then focuses and zooms the calibration marker MAK3, and thus the calibration marker MAK3 is located at a central point of a screen of the display device 36 (refer to
[0845] In the fifth calibration method, respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are the same as each other, and the circular calibration marker MAK3 is installed at a position which is located vertically downward directly under a predetermined position of the omnidirectional microphone array apparatus 2 and has a certain height from the floor (refer to
[0846] If a central point O (refer to
[0847] If the central point O (refer to
[0848] The sound collection direction computation section 34 computes the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 according to Equation (104) by using the distance L.sub.CO (=L.sub.COv) from the PTZ camera apparatus 1 to the central point O of the calibration marker MAK3, and the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 toward the central point O of the calibration marker MAK3 in the triangle COM illustrated in
[Equation 104]
L.sub.COh=L.sub.CMh=L.sub.COcos .sub.COv (104)
[0849] In addition, the sound collection direction computation section 34 computes the distance L.sub.MOv between the omnidirectional microphone array apparatus 2 and the calibration marker MAK3 according to Equation (105) by using the distance L.sub.CO (=L.sub.COv) from the PTZ camera apparatus 1 to the calibration marker MAK3, and the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 toward the calibration marker MAK3 in the triangle COM illustrated in
[Equation 105]
L.sub.MOv=L.sub.COsin .sub.COv (105)
[0850] Further, a method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv) in the sound collection direction computation section 34 after computing the calibration parameters according to the fifth calibration method is the same as the above-described first method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), and thus description thereof will be omitted.
[0851] As mentioned above, in the fifth calibration method and the first method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), if the central point O (refer to
Third Method of Computing Sound Collection Direction Coordinates according to Sixth Calibration Method
[0852] Next, with reference to
[0853]
[0854]
[0855] In the sixth calibration method, respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are different from each other, and the omnidirectional microphone array apparatus 2 is installed so as to be located within an imaging viewing angle of the PTZ camera apparatus 1. For this reason, the PTZ camera apparatus 1 is driven in a panning direction and a tilting direction and then focuses and zooms both of the omnidirectional microphone array apparatus 2 and the calibration marker MAK, and thus the omnidirectional microphone array apparatus 2 and the calibration marker MAK are located at a central point of a screen of the display device 36 (refer to
[0856] In the sixth calibration method, the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor is greater than the height H.sub.C of the PTZ camera apparatus 1 from the floor (H.sub.C<H.sub.M), and the calibration marker MAK which is a solid object is suspended vertically downward directly under a predetermined position of the omnidirectional microphone array apparatus 2 by using a string STR or a cord (refer to
[0857] If two locations (refer to
[0858] In description of the following sixth to tenth calibration methods, the respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are different from each other (refer to
[0859] If the two locations (refer to
[0860] The sound collection direction computation section 34 computes a relational expression of the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 toward the central point O of the calibration marker MAK, the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2, and the vertical direction distance L.sub.MOv between the PTZ camera apparatus 1 and the calibration marker MAK according to Equation (106) in the triangle COP illustrated in
[Equation 106]
tan .sub.COv=L.sub.MOv/L.sub.CMh (106)
[0861] Similarly, the sound collection direction computation section 34 computes a relational expression of the vertical angle .sub.CMv which is directed from the PTZ camera apparatus 1 toward the omnidirectional microphone array apparatus 2, the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2, and the vertical direction distance L.sub.MOv between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 according to Equation (107) in the triangle CMP illustrated in
[Equation 107]
tan .sub.CMv=L.sub.MOv/L.sub.CMh (107)
[0862] In addition, the vertical direction distance L.sub.MOv between the omnidirectional microphone array apparatus 2 and the calibration marker MAK is a sum of the vertical direction distance L.sub.MOv between the PTZ camera apparatus 1 and the calibration marker MAK and the vertical direction distance L.sub.MOv between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2, and thus Equation (108) is established.
[Equation 108]
L.sub.MOv=L.sub.MOv+L.sub.MOv(108)
[0863] By using the relational expressions shown in Equations (106) to (108), the sound collection direction computation section 34 computes the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 according to Equation (109), computes the vertical direction distance L.sub.MOv between the PTZ camera apparatus 1 and the calibration marker MAK according to Equation (110), and computes the vertical direction distance L.sub.MOv between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 according to Equation (111).
[Equation 109]
L.sub.CMh=L.sub.MO/(tan .sub.COv+tan .sub.CMv) (109)
[Equation 110]
L.sub.MOv=L.sub.MOtan .sub.COv/(tan .sub.COv+tan .sub.CMv) (110)
[Equation 111]
L.sub.MOv=L.sub.MOtan .sub.CMv/(tan .sub.COv+tan .sub.CMv) (111)
[0864] Next, in the third method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), the respective heights H.sub.O and H.sub.A of the calibration marker MAK and the target sound source position A from the floor are the same as each other (H.sub.O=H.sub.A), and the respective heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are different from each other (H.sub.CH.sub.M).
[0865] Specifically, the sound collection direction computation section 34 computes the sound collection direction coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A by using:
[0866] (1) the horizontal angle .sub.CMh and the vertical angle .sub.CMv which are directed from the PTZ camera apparatus 1 toward the omnidirectional microphone array apparatus 2;
[0867] (2) the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2;
[0868] (3) the horizontal angle .sub.MCh which is directed from the omnidirectional microphone array apparatus 2 toward the PTZ camera apparatus 1;
[0869] (4) the height L.sub.MOv of the PTZ camera apparatus 1 from the target sound source position A, and the height L.sub.MO of the omnidirectional microphone array apparatus 2 from the target sound source position A; and
[0870] (5) a horizontal angle .sub.CAh and a vertical angle .sub.CAv which are directed from the PTZ camera apparatus 1 toward the target sound source position A. With reference to
[0871] (1) As the horizontal angle .sub.CMh and the vertical angle .sub.CMv which are directed from the PTZ camera apparatus 1 toward the omnidirectional microphone array apparatus 2, values computed by the PTZ camera apparatus 1 in the sixth calibration method are used (.sub.CMh=.sub.COh).
[0872] (2) The horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 is a calibration parameter in the sixth calibration method (refer to Equation (109)).
[0873] (3) The horizontal angle .sub.MCh which is directed from the omnidirectional microphone array apparatus 2 toward the PTZ camera apparatus 1 is a predefined value (zero) obtained based on a positional relationship between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 in the sixth calibration method.
[0874] (4) The height L.sub.MOv of the PTZ camera apparatus 1 from the target sound source position A is a calibration parameter in the sixth calibration parameter (refer to Equation (110)), and the height L.sub.MO of the omnidirectional microphone array apparatus 2 from the target sound source position A is a predefined value.
[0875] (5) As horizontal angle .sub.CAh and the vertical angle .sub.CAv which are directed from the PTZ camera apparatus 1 toward the target sound source position A, values are used which are computed by the PTZ camera apparatus 1 in response to designation of the designated position A in captured image data displayed on the display device 36.
[0876] The sound collection direction computation section 34 computes a horizontal direction distance L.sub.CAh of the distance L.sub.CA between the PTZ camera apparatus 1 and the target sound source position A according to Equation (112) by using the vertical angle .sub.CAv which is directed from the PTZ camera apparatus 1 to the target sound source position A, and the vertical direction distance L.sub.MOv between the PTZ camera apparatus 1 and the calibration marker MAK as a computation result of Equation (110) in the triangle CAS illustrated in
[Equation 112]
L.sub.CAh=L.sub.MOv/tan .sub.CAv (112)
[0877] The sound collection direction computation section 34 computes a horizontal direction distance L.sub.MAh of the distance L.sub.MA from the omnidirectional microphone array apparatus 2 to the target sound source position A according to Equation (113) based on the cosine theorem for the triangle CAM illustrated in
[0878] The sound collection direction computation section 34 computes a cosine value cos .sub.MAh of the horizontal angle .sub.MAh which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A according to Equation (114) based on the cosine theorem for the triangle CAM illustrated in
[0879] In addition, the sound collection direction computation section 34 computes a tangent value tan .sub.MAv of the vertical angle .sub.MAv which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A according to Equation (116) based on a tangent for the triangle MAS illustrated in
[0880] As mentioned above, in the sixth calibration method and the third method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), if the two location (refer to
[0881] In addition, the sound collection direction computation section 34 can easily compute the sound collection direction coordinates which are directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A without the user inputting the height H.sub.A of the target sound source position A from the floor even in a case where the heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are different from each other.
Fourth Method of Computing Sound Collection Direction Coordinates according to Seventh Calibration Method
[0882] Next, with reference to
[0883]
[0884]
[0885] In the seventh calibration method, respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are different from each other, and the omnidirectional microphone array apparatus 2 is installed so as to be located within an imaging viewing angle of the PTZ camera apparatus 1. For this reason, the PTZ camera apparatus 1 is driven in a panning direction and a tilting direction and then focuses and zooms both of the omnidirectional microphone array apparatus 2 and the calibration marker MAK, and thus the omnidirectional microphone array apparatus 2 and the calibration marker MAK are located at a central point of a screen of the display device 36 (refer to
[0886] In the seventh calibration method, the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor is greater than the height H.sub.C of the PTZ camera apparatus 1 from the floor (H.sub.C<H.sub.M), and the circular calibration floor marker MAK2 is installed at a position on the floor located vertically downward directly under a predetermined position of the omnidirectional microphone array apparatus 2 (refer to
[0887] If two locations (refer to
[0888] If the two locations (refer to
[0889] The sound collection direction computation section 34 computes the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 according to Equation (118) and computes the height H.sub.C of the PTZ camera apparatus 1 from the floor according to Equation (119), by using the distance L.sub.CO (=L.sub.COv) from the PTZ camera apparatus 1 to the calibration floor marker MAK2, and the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 toward the central point O of the calibration floor marker MAK2 in the triangle COP illustrated in
[Equation 118]
L.sub.COh=L.sub.CMh=L.sub.COvcos .sub.COv (118)
[Equation 119]
H.sub.C=L.sub.COvsin .sub.COv (119)
[0890] The sound collection direction computation section 34 computes a distance L.sub.MOv corresponding to a difference (H.sub.MH.sub.C) between the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor and the height H.sub.C of the PTZ camera apparatus 1 from the floor according to Equation (120) by using the computation result of Equation (118), and the vertical angle .sub.CMv which is directed from the PTZ camera apparatus 1 toward the omnidirectional microphone array apparatus 2 in the triangle CMP illustrated in
[Equation 120]
L.sub.MOv=H.sub.MH.sub.C=L.sub.CMhtan .sub.CMv=(L.sub.COvcos .sub.COv)tan .sub.CMv (120)
[Equation 121]
H.sub.M=H.sub.C+L.sub.MOv=L.sub.COvsin .sub.COv+(L.sub.COvcos .sub.COvtan .sub.CMv) (121)
[0891] Next, in the fourth method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), a height of the target sound source position A from the floor is an input value which is input by the user and is different from that in the third method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv). In addition, the respective heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are different from each other.
[0892] Specifically, the sound collection direction computation section 34 computes the sound collection direction coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A by using:
[0893] (1) the horizontal angle .sub.CMh which is directed from the PTZ camera apparatus 1 toward the omnidirectional microphone array apparatus 2;
[0894] (2) the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2;
[0895] (3) the horizontal angle .sub.MCh which is directed from the omnidirectional microphone array apparatus 2 toward the PTZ camera apparatus 1;
[0896] (4) the respective heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor;
[0897] (5) a horizontal angle .sub.CAh and a vertical angle .sub.CAv which are directed from the PTZ camera apparatus 1 toward the target sound source position A; and
[0898] (6) the height H.sub.A of the target sound source position A from the floor, input by the user.
[0899] With reference to
[0900] (1) As the horizontal angle .sub.CMh which is directed from the PTZ camera apparatus 1 toward the omnidirectional microphone array apparatus 2, a value computed by the PTZ camera apparatus 1 in the seventh calibration method is used (.sub.CMh=.sub.COh).
[0901] (2) The horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 is a calibration parameter in the seventh calibration method (refer to Equation (112)).
[0902] (3) The horizontal angle .sub.MCh which is directed from the omnidirectional microphone array apparatus 2 toward the PTZ camera apparatus 1 is a predefined value (zero) obtained based on a positional relationship between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 in the seventh calibration method.
[0903] (4) The respective heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are calibration parameters in the seventh calibration method (refer to Equations (119) and (121)).
[0904] (5) As the horizontal angle .sub.CAh and the vertical angle .sub.CAv from the PTZ camera apparatus 1 to the target sound source position A, values are used which are computed by the PTZ camera apparatus 1 in response to designation of the designated position A in captured image data displayed on the display device 36.
[0905] (6) The height H.sub.A of the target sound source position A from a horizontal surface (for example, the floor) is a value which is input through a user's input operation.
[0906] The sound collection direction computation section 34 computes a horizontal direction distance L.sub.CAh of the distance L.sub.CA between the PTZ camera apparatus 1 and the target sound source position A according to Equation (122) by using the distance corresponding to a difference (H.sub.CH.sub.A) between the heights of the PTZ camera apparatus 1 and the target sound source position A from the floor, and the vertical angle .sub.CAv which is directed from the PTZ camera apparatus 1 toward the target sound source position A in the triangle CAS illustrated in
[Equation 122]
L.sub.CAh=(H.sub.CH.sub.A)/tan .sub.CAv (122)
[0907] The sound collection direction computation section 34 computes a horizontal direction distance L.sub.MAh of the distance L.sub.MA from the omnidirectional microphone array apparatus 2 to the target sound source position A according to Equation (123) based on the cosine theorem for the triangle CAM illustrated in
[0908] The sound collection direction computation section 34 computes a cosine value cos .sub.MAh of the horizontal angle .sub.MAh which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A according to Equation (124) based on the cosine theorem for the triangle CAM illustrated in
[0909] In addition, the sound collection direction computation section 34 computes a tangent value tan .sub.MAv of the vertical angle .sub.MAv which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A according to Equation (126) based on a tangent for the triangle MAS illustrated in
[0910] As mentioned above, in the seventh calibration method and the fourth method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), if the two locations (refer to
[0911] In addition, the sound collection direction computation section 34 can easily compute the sound collection direction coordinates indicating a sound collection direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A by using the horizontal angle .sub.CMh, the horizontal direction distance L.sub.CMh, the respective heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the horizontal surface, which are computed during calibration, the predefined horizontal angle .sub.MCh, the horizontal angle .sub.CAh and the vertical angle .sub.CAv which are directed from the PTZ camera apparatus 1 toward the target sound source position A, and the input value of the height H.sub.A of the target sound source position A from the horizontal surface, input by the user. Further, since a value desired by the user can be used as the height H.sub.A of the target sound source position A from the floor, a degree of freedom of selection of the height H.sub.A of the target sound source position A from the floor is improved.
Fourth Method of Computing Sound Collection Direction Coordinates according to Eighth Calibration Method
[0912] Next, with reference to
[0913]
[0914] In the eighth calibration method, respective heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are different from each other, and the omnidirectional microphone array apparatus 2 is installed so as not to be located within an imaging viewing angle of the PTZ camera apparatus 1. For this reason, the PTZ camera apparatus 1 is driven in a panning direction and a tilting direction and then focuses and zooms both of the calibration marker MAK and the calibration floor marker MAK2, and thus the calibration marker MAK and the calibration floor marker MAK2 are located at a central point of a screen of the display device 36 (refer to
[0915] In the eighth calibration method, the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor is greater than the height H.sub.C of the PTZ camera apparatus 1 from the floor (H.sub.C<H.sub.M), the calibration marker MAK which is a solid object is suspended vertically downward directly under a predetermined position of the omnidirectional microphone array apparatus 2 by using, for example, a string STR or a cord, and, further, the circular calibration floor marker MAK2 is installed at a position on the floor located vertically downward directly under a predetermined position of the omnidirectional microphone array apparatus 2 (refer to
[0916] If two locations (refer to
[0917] If two locations (refer to
[0918] The sound collection direction computation section 34 computes the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 according to Equation (128) and computes the height H.sub.C of the PTZ camera apparatus 1 from the floor according to Equation (129), by using the distance L.sub.COv (=L.sub.CO) from the PTZ camera apparatus 1 to the calibration floor marker MAK2, and the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 toward the calibration floor marker MAK2 in the triangle COP illustrated in
[Equation 128]
L.sub.COh=L.sub.CMh=L.sub.COvcos .sub.COv (128)
[Equation 129]
H.sub.C=L.sub.COvsin .sub.COv (129)
[0919] A relational expression of Equation (130) is established based on the sine theorem for the triangle COO illustrated in
[Equation 130]
L.sub.COv/sin(/2+.sub.COv)=(H.sub.ML.sub.MOv)/sin(.sub.COv.sub.COv) (130)
[Equation 131]
H.sub.M=L.sub.MOv+L.sub.COvsin(.sub.COv.sub.COv)/sin(/2+.sub.COv) (131)
[0920] Further, a method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv) in the sound collection direction computation section 34 after computing the calibration parameters according to the eighth calibration method is the same as the above-described fourth method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), and thus description thereof will be omitted.
[0921] As mentioned above, in the eighth calibration method and the fourth method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), if the two locations (refer to
Fourth Method of Computing Sound Collection Direction Coordinates according to Ninth Calibration Method
[0922] Next, with reference to
[0923]
[0924] In the ninth calibration method, respective heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are different from each other, and the omnidirectional microphone array apparatus 2 is installed so as not to be located within an imaging viewing angle of the PTZ camera apparatus 1. For this reason, the PTZ camera apparatus 1 is driven in a panning direction and a tilting direction and then focuses and zooms the calibration floor marker MAK2, and thus the calibration floor marker MAK2 is located at a central point of a screen of the display device 36 (refer to
[0925] In the ninth calibration method, the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor is greater than the height He of the PTZ camera apparatus 1 from the floor (H.sub.C<H.sub.M), and the circular calibration floor marker MAK2 is installed at a position on the floor located vertically downward directly under a predetermined position of the omnidirectional microphone array apparatus 2 (refer to
[0926] If a central point O (refer to
[0927] If the central point O (refer to
[0928] The sound collection direction computation section 34 computes the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 according to Equation (132) and computes the height H.sub.C of the PTZ camera apparatus 1 from the floor according to Equation (133), by using the distance L.sub.COv (=L.sub.CO) from the PTZ camera apparatus 1 to the calibration floor marker MAK2, and the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 toward the calibration floor marker MAK2 in the triangle COP illustrated in
[Equation 132]
L.sub.COh=L.sub.CMh=L.sub.COvcos .sub.COv (132)
[Equation 133]
H.sub.C=L.sub.COvsin .sub.COv (133)
[0929] Further, a method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv) in the sound collection direction computation section 34 after computing the calibration parameters according to the ninth calibration method is the same as the above-described fourth method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), and thus description thereof will be omitted.
[0930] As mentioned above, in the ninth calibration method and the fourth method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), even in a case where the respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are different from each other, and the omnidirectional microphone array apparatus 2 is not present within the imaging viewing angle of the PTZ camera apparatus 1, if the central point O (refer to
Fourth Method of Computing Sound Collection Direction Coordinates according to Tenth Calibration Method
[0931] Last, with reference to
[0932]
[0933]
[0934] In the tenth calibration method, respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are different from each other, and the omnidirectional microphone array apparatus 2 is installed so as to be located within an imaging viewing angle of the PTZ camera apparatus 1. For this reason, the PTZ camera apparatus 1 is driven in a panning direction and a tilting direction and then focuses and zooms both of the omnidirectional microphone array apparatus 2 and the calibration marker MAK, and thus the omnidirectional microphone array apparatus 2 and the calibration marker MAK are located at a central point of a screen of the display device 36 (refer to
[0935] In the tenth calibration method, the height H.sub.C of the PTZ camera apparatus 1 from the floor is greater than the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor (H.sub.M<H.sub.C), and the circular calibration floor marker MAK2 is installed at a position on the floor located vertically downward directly under a predetermined position of the omnidirectional microphone array apparatus 2 (refer to
[0936] If two locations (refer to
[0937] If the two locations (refer to
[0938] The sound collection direction computation section 34 computes the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 according to Equation (134) and computes the height H.sub.C of the PTZ camera apparatus 1 from the floor according to Equation (135), by using the distance L.sub.COv (=L.sub.CO) from the PTZ camera apparatus 1 to the calibration floor marker MAK2, and the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 toward the central point O of the calibration floor marker MAK2 in the triangle COP illustrated in
[Equation 134]
L.sub.COh=L.sub.CMh=L.sub.COvcos .sub.COv (134)
[Equation 135]
H.sub.C=L.sub.COvsin .sub.COv (135)
[0939] The sound collection direction computation section 34 computes a distance corresponding to a difference (H.sub.CH.sub.M) between the height H.sub.C of the PTZ camera apparatus 1 from the floor and the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor according to Equation (136), and computes the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor according to Equation (137), by using the computation result of Equation (134), and a difference (.sub.COv.sub.CMv) between the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 toward the central point O of the calibration floor marker MAK2 and the vertical angle .sub.CMv which is directed from the PTZ camera apparatus 1 toward the omnidirectional microphone array apparatus 2 in the triangle CMP illustrated in
[Equation 136]
H.sub.C=L.sub.COvsin .sub.COv (136)
[Equation 137]
H.sub.M=H.sub.CL.sub.COvcos .sub.COvtan(.sub.COv.sub.CMv)=L.sub.COv(sin .sub.COvcos .sub.COvtan(.sub.COv.sub.CMv)) (137)
[0940] Next, in the fourth method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), a height of the target sound source position A from a horizontal surface (for example, the floor) is an input value which is input by the user and is different from that in the third method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv). In addition, the respective heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are different from each other.
[0941] Specifically, the sound collection direction computation section 34 computes the sound collection direction coordinates (.sub.MAh,.sub.MAv) indicating a sound collection direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A by using:
[0942] (1) the horizontal angle .sub.CMh which is directed from the PTZ camera apparatus 1 toward the omnidirectional microphone array apparatus 2;
[0943] (2) the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2;
[0944] (3) the horizontal angle .sub.MCh which is directed from the omnidirectional microphone array apparatus 2 toward the PTZ camera apparatus 1;
[0945] (4) the respective heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor;
[0946] (5) a horizontal angle .sub.CAh and a vertical angle .sub.CAv which are directed from the PTZ camera apparatus 1 toward the target sound source position A; and
[0947] (6) the height H.sub.A of the target sound source position A from the floor, input by the user.
[0948] With reference to
[0949] (1) As the horizontal angle .sub.CMh which is directed from the PTZ camera apparatus 1 toward the omnidirectional microphone array apparatus 2, a value computed by the PTZ camera apparatus 1 in the tenth calibration method is used (.sub.CMh=.sub.COh).
[0950] (2) The horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 is a calibration parameter in the tenth calibration method (refer to Equation (134)).
[0951] (3) The horizontal angle .sub.MCh which is directed from the omnidirectional microphone array apparatus 2 toward the PTZ camera apparatus 1 is a predefined value (zero) obtained based on a positional relationship between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 in the tenth calibration method.
[0952] (4) The respective heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are calibration parameters in the tenth calibration method (refer to Equations (135) and (137)).
[0953] (5) As the horizontal angle .sub.CAh and the vertical angle .sub.CAv from the PTZ camera apparatus 1 to the target sound source position A, values are used which are computed by the PTZ camera apparatus 1 in response to designation of the designated position A in captured image data displayed on the display device 36.
[0954] (6) The height H.sub.A of the target sound source position A from the floor is a value which is input through a user's input operation.
[0955] The sound collection direction computation section 34 computes a horizontal direction distance L.sub.CAh of the distance L.sub.CA between the PTZ camera apparatus 1 and the target sound source position A according to Equation (138) by using the distance corresponding to a difference (H.sub.CH.sub.A) between the heights of the PTZ camera apparatus 1 and the target sound source position A from the floor, and the vertical angle .sub.CAv which is directed from the PTZ camera apparatus 1 toward the target sound source position A in the triangle CAS illustrated in
[Equation 138]
L.sub.CAh=(H.sub.CH.sub.A)/tan .sub.CAv (138)
[0956] The sound collection direction computation section 34 computes a horizontal direction distance L.sub.MAh of the distance L.sub.MA from the omnidirectional microphone array apparatus 2 to the target sound source position A according to Equation (139) based on the cosine theorem for the triangle CAM illustrated in
[0957] The sound collection direction computation section 34 computes a cosine value cos .sub.MAh of the horizontal angle .sub.MAh which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A according to Equation (140) based on the cosine theorem for the triangle CAM illustrated in
[0958] In addition, the sound collection direction computation section 34 computes a tangent value tan .sub.MAv of the vertical angle .sub.MAv which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A according to Equation (142) based on a tangent for the triangle MAS illustrated in
[0959] As mentioned above, in the tenth calibration method and the fourth method of computing the sound collection direction coordinates (.sub.MAh,.sub.MAv), if the two locations (refer to
[0960] In addition, the sound collection direction computation section 34 can easily compute the sound collection direction coordinates indicating a sound collection direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A by using the horizontal angle .sub.CMh, the horizontal direction distance L.sub.CMh, the respective heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor, which are computed during calibration, the predefined horizontal angle .sub.MCh, the horizontal angle .sub.CAh and the vertical angle .sub.CAv which are directed from the PTZ camera apparatus 1 toward the target sound source position A, and the input value of the height H.sub.A of the target sound source position A from the floor, input by the user. Further, since a value desired by the user can be used as the height H.sub.A of the target sound source position A from the floor, a degree of freedom of selection of the height H.sub.A of the target sound source position A from the floor is improved.
[0961] In addition, the fourth method of computing the sound collection direction coordinates based on the tenth calibration method is the same as the fourth method of computing the sound collection direction coordinates based on the seventh calibration method corresponding to a case where the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor is greater than the height H.sub.C of the PTZ camera apparatus 1 from the floor. Further, in a case where the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor is greater than the height H.sub.C of the PTZ camera apparatus 1 from the floor, the same applies to the third method of computing the sound collection direction coordinates based on the sixth calibration method or the the fourth method of computing the sound collection direction coordinates based on each of the eighth and ninth calibration methods, and thus the sound collection direction computation section can compute the sound collection direction coordinates indicating a sound collection direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A.
[0962] Hereinafter, configurations, operations, and effects of the above-described directionality control system and directionality control method related on the present invention will be described.
[0963] According to an embodiment of the present invention, there is provided a directionality control system including an imaging part that images a target object in a predetermined sound collection region; a sound collection part that collects sound of the sound collection region; a display part that displays image data of the target object imaged by the imaging part; a sound collection direction computation part that computes a sound collection direction which is directed from the sound collection part toward a target sound source position corresponding to a designated position in the image data in response to designation of any position in the image data of the target object; and a sound collection control part that forms sound collection directionality of the sound collected by the sound collection part in the sound collection direction computed by the sound collection direction computation part.
[0964] In the above-described configuration, the PTZ camera apparatus 1 images a target object (for example, a person) present in a predetermined sound collection region by using a casing driving function in a panning direction or a tilting direction, and a zooming function which well-known techniques. The omnidirectional microphone array apparatus 2 collects sound of the target object present in an imaging direction of the PTZ camera apparatus 1. If any position is designated in image data captured by the PTZ camera apparatus 1 and displayed on the display device 36, the sound collection direction computation section 34 of the directionality control apparatus 3 computes sound collection direction coordinates (.sub.MAh,.sub.MAv) in a sound collection direction which is directed from the PTZ camera apparatus 1 toward a target sound source position A corresponding to a designated position A. The output control section 35 of the directionality control apparatus 3 forms sound collection directionality of sound collected by the omnidirectional microphone array apparatus 2 in the sound collection direction indicated by the computed sound collection direction coordinates (.sub.MAh,.sub.MAv).
[0965] Consequently, in the sound collection system 10, since the designated position A corresponding to the target sound source position A is designated on the display device 36 which displays the image data captured by the PTZ camera apparatus 1, the directionality control apparatus 3 can easily acquire an input parameter (for example, a distance or a direction) which is required to compute a predetermined calibration parameter during calibration which is required in advance prior to computation of sound collection direction coordinates, and can thus easily compute the calibration parameter. Therefore, the directionality control apparatus 3 can easily form sound collection directionality in a sound collection direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A corresponding to the designated position A in the image data captured by the PTZ camera apparatus 1 and can thus easily obtain audio data of sound which is collected in the sound collection direction with high accuracy.
[0966] In addition, in the directionality control system according to the embodiment of the present invention, respective heights of the imaging part and the sound collection part from a horizontal surface are the same as each other, and the sound collection direction computation part computes a horizontal angle and a vertical angle of a sound collection direction which is directed from the sound collection part toward the target sound source position by using a horizontal angle which is directed from the imaging part toward the sound collection part, a horizontal direction distance between the imaging part and the sound collection part, a horizontal angle which is directed from the sound collection part toward the imaging part, and a horizontal angle and a vertical angle which are directed from the imaging part toward the target sound source position.
[0967] In the above-described configuration, respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from a floor are the same as each other. Therefore, the sound collection direction computation section 34 of the directionality control apparatus 3 can easily compute sound collection direction coordinates which are directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A by using the horizontal angle .sub.CMh and the horizontal direction distance L.sub.CMh which are computed during calibration, the predefined horizontal angle .sub.MCh, and the horizontal angle .sub.CAh and the vertical angle .sub.CAv which are directed from the PTZ camera apparatus 1 toward the target sound source position A.
[0968] In addition, the sound collection direction computation section 34 of the directionality control apparatus 3 can easily compute the sound collection direction coordinates (.sub.MAh,.sub.MAv) which are directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A without the user inputting the height H.sub.A of the target sound source position A from the floor during computation of the sound collection direction coordinates.
[0969] In addition, in the directionality control system according to the embodiment of the present invention, a first marker having a constant distance to the sound collection part is suspended directly under the sound collection part, a height of the first marker from the horizontal surface is the same as a height of the target sound source position from the horizontal surface, and the sound collection direction computation part computes a horizontal direction distance between the imaging part and the sound collection part by using a distance between the sound collection part and the first marker in response to designation of the first marker displayed on the display part.
[0970] In the above-described configuration, the calibration marker MAK having a constant distance (L.sub.MOv) to the omnidirectional microphone array apparatus 2 is suspended directly under the omnidirectional microphone array apparatus 2, and respective heights H.sub.O and H.sub.A of the calibration marker MAK and the target sound source position A from the floor are the same as each other. Therefore, if the calibration marker MAK displayed on the display device 36 is designated with the finger FG of the user, the sound collection direction computation section 34 of the directionality control apparatus 3 can easily compute the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 as a calibration parameter, for example, even in a case where the PTZ camera apparatus 1 cannot image a floor surface directly under the omnidirectional microphone array apparatus 2 due to the presence of an obstacle on the floor directly under the omnidirectional microphone array apparatus 2.
[0971] Further, in the directionality control system according to the embodiment of the present invention, respective heights of the imaging part and the sound collection part from a horizontal surface are the same as each other, and the sound collection direction computation part computes a horizontal angle and a vertical angle of a sound collection direction which is directed from the sound collection part toward the target sound source position by using a horizontal angle which is directed from the imaging part toward the sound collection part, a horizontal direction distance between the imaging part and the sound collection part, a horizontal angle which is directed from the sound collection part toward the imaging part, the respective heights of the imaging part and the sound collection part from the horizontal surface, a horizontal angle and a vertical angle which are directed from the imaging part toward the target sound source position, and an input value of a height of the target sound source position from the horizontal surface.
[0972] In the above-described configuration, the respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 are the same as each other. Therefore, the sound collection direction computation section 34 of the directionality control apparatus 3 can easily compute sound collection direction coordinates indicating a sound collection direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A by using the horizontal angle .sub.CMh and the horizontal direction distance L.sub.CMh which are computed during calibration, the predefined horizontal angle .sub.MCh, the respective predefined heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the horizontal surface, the horizontal angle .sub.CAh and the vertical angle .sub.CAv which are directed from the PTZ camera apparatus 1 toward the target sound source position A, and the input value of the height H.sub.A of the target sound source position A from the floor, input by the user.
[0973] In addition, the sound collection direction computation section 34 of the directionality control apparatus 3 uses the height H.sub.M of the omnidirectional microphone array apparatus 2 from the horizontal surface, which is easily measured when the omnidirectional microphone array apparatus 2 is initially installed. Therefore, in a case where the calibration marker MAK is suspended from the omnidirectional microphone array apparatus 2, a suspension distance can be easily adjusted. Further, since a value desired by the user can be used as the height H.sub.A of the target sound source position A from the floor, a degree of freedom of selection of the height H.sub.A of the target sound source position A from the floor is improved.
[0974] In addition, in the directionality control system according to the embodiment of the present invention, a first marker which has a constant distance to the sound collection part is suspended directly under the sound collection part, a height of the sound collection part from the horizontal surface is a predetermined value, and the sound collection direction computation part computes a horizontal direction distance between the imaging part and the sound collection part by using a distance between the sound collection part and the first marker in response to designation of the first marker displayed on the display part.
[0975] In the above-described configuration, the calibration marker MAK having a constant distance (L.sub.MOv) to the omnidirectional microphone array apparatus 2 is suspended directly under the omnidirectional microphone array apparatus 2, and the height H.sub.M of the omnidirectional microphone array apparatus 2 from the horizontal surface is known. Therefore, if the calibration marker MAK displayed on the display device 36 is designated with the finger FG of the user, the sound collection direction computation section 34 of the directionality control apparatus 3 can easily compute the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 as a calibration parameter, for example, even in a case where the PTZ camera apparatus 1 cannot image a floor surface directly under the omnidirectional microphone array apparatus 2 due to the presence of an obstacle on the floor directly under the omnidirectional microphone array apparatus 2.
[0976] Further, in the directionality control system according to the embodiment of the present invention, a circular second marker is installed at a position on the horizontal surface directly under the sound collection part, and the sound collection direction computation part computes a horizontal direction distance between the imaging part and the sound collection part, and a height of the sound collection part from the horizontal surface, by using a distance from the imaging part to the second marker, and a vertical angle which is directed from the imaging part toward the second marker, in response to designation of a central point of the second marker displayed on the display part.
[0977] In the above-described configuration, the circular calibration floor marker MAK2 is installed at the position on the horizontal surface directly under the omnidirectional microphone array apparatus 2. Therefore, if the central point O of the calibration floor marker MAK2 displayed on the display device 36 is designated with the finger FG of the user, the sound collection direction computation section 34 of the directionality control apparatus 3 can easily compute the distance L.sub.CO from the PTZ camera apparatus 1 to the calibration floor marker MAK2 by using, for example, a focus function of the PTZ camera apparatus 1, and can also easily compute the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 as a calibration parameter.
[0978] Further, in the directionality control system according to the embodiment of the present invention, a circular second marker having a predetermined radius is installed at a position on the horizontal surface directly under the sound collection part, and the sound collection direction computation part computes a horizontal direction distance between the imaging part and the sound collection part, and a height of the sound collection part from the horizontal surface, by using a vertical angle which is directed from the imaging part toward a central point of the second marker, a vertical angle which is directed from the imaging part toward an end point on a circumference of the second marker, and the radius of the second marker, in response to designation of the central point and the end point of the second marker displayed on the display part.
[0979] In the above-described configuration, the circular calibration floor marker MAK2 having the predetermined radius is installed at the position on the floor directly under the omnidirectional microphone array apparatus 2. Therefore, for example, even when a distance from the omnidirectional microphone array apparatus 2 to the calibration floor marker MAK2 cannot be measured, if two locations including the central point O of the calibration floor marker MAK2 and the end point O on the circumference thereof displayed on the display device 36 are designated with the finger FG of the user, the sound collection direction computation section 34 of the directionality control apparatus 3 can easily compute the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor, for example, even in a case where it is hard to measure a distance from the omnidirectional microphone array apparatus 2 to the central point of the calibration floor marker MAK2, and can also easily compute the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 as a calibration parameter.
[0980] In addition, in the directionality control system according to the embodiment of the present invention, a third marker having a constant height from the horizontal surface is installed directly under the sound collection part, a height of the third marker from the horizontal surface is the same as a height of the target sound source position from the horizontal surface, and the sound collection direction computation part computes a horizontal direction distance between the imaging part and the sound collection part, and a height of the sound collection part from the third marker, by using a distance from the imaging part to the third marker, and a vertical angle which is directed from the imaging part toward the third marker, in response to designation of the third marker displayed on the display part.
[0981] In the above-described configuration, the calibration marker MAK3 is installed at the position where the height H.sub.O from the horizontal surface is constant directly under the omnidirectional microphone array apparatus 2, and the heights H.sub.O and H.sub.A of the calibration marker MAK3 and the target sound source position A from the horizontal surface are the same as each other. Therefore, if the central point O of the calibration marker MAK3 displayed on the display device 36 is designated with the finger FG of the user, the sound collection direction computation section 34 of the directionality control apparatus 3 can easily compute the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2, and the height L.sub.MOv of the omnidirectional microphone array apparatus 2 from the calibration marker MAK3 as calibration parameters, for example, even in a case where the PTZ camera apparatus 1 cannot image a floor surface directly under the omnidirectional microphone array apparatus 2 due to the presence of an obstacle on the floor directly under the omnidirectional microphone array apparatus 2.
[0982] In addition, in the directionality control system according to the embodiment of the present invention, respective heights of the imaging part and the sound collection part from a horizontal surface are different from each other, the sound collection part is installed so as to be located within an imaging viewing angle of the imaging part, and the sound collection direction computation part computes a horizontal angle and a vertical angle of a sound collection direction which is directed from the sound collection part to the target sound source position by using a horizontal angle and a vertical angle which are directed from the imaging part toward the sound collection part, a horizontal direction distance between the imaging part and the sound collection part, a horizontal angle which is directed from the sound collection part toward the imaging part, a height of the imaging part from the target sound source position, a height of the sound collection part from the target sound source position, and a horizontal angle and a vertical angle which are directed from the imaging part toward the target sound source position.
[0983] In the above-described configuration, the respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are different from each other, and the omnidirectional microphone array apparatus 2 is present within the imaging viewing angle of the PTZ camera apparatus 1, that is, the omnidirectional microphone array apparatus 2 is installed so as to be displayed on the display device 36. Therefore, the sound collection direction computation section 34 of the directionality control apparatus 3 can easily compute the sound collection direction coordinates indicating a sound collection direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A by using the horizontal angle .sub.CMh and the vertical angle .sub.CMv, the horizontal direction distance L.sub.CMh, the respective heights L.sub.MOv and L.sub.MO of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the target sound source position A, which are computed during calibration, the predefined horizontal angle .sub.MCh, and the horizontal angle .sub.CAh and the vertical angle .sub.CAv which are directed from the PTZ camera apparatus 1 toward the target sound source position A.
[0984] In addition, the sound collection direction computation section 34 of the directionality control apparatus 3 can easily compute the sound collection direction coordinates which are directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A without the user inputting the height H.sub.A of the target sound source position A from the floor even in a case where the heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are different from each other.
[0985] In addition, in the directionality control system according to the embodiment of the present invention, a first marker having a constant distance to the sound collection part is suspended directly under the sound collection part, a height of the first marker from the horizontal surface is the same as a height of the target sound source position from the horizontal surface, and the sound collection direction computation part computes a horizontal direction distance between the imaging part and the sound collection part, and a height of the imaging part from the target sound source position, by using a vertical angle which is directed from the imaging part toward the sound collection part, and a vertical angle which is directed from the imaging part toward the first marker, in response to designation of the sound collection part and the first marker displayed on the display part.
[0986] In the above-described configuration, the calibration marker MAK having a constant distance (L.sub.MOv) to the omnidirectional microphone array apparatus 2 is suspended directly under the omnidirectional microphone array apparatus 2, and respective heights H.sub.O and H.sub.A of the calibration marker MAK and the target sound source position A from the floor are the same as each other. Therefore, if two locations including the omnidirectional microphone array apparatus 2 and the calibration marker MAK displayed on the display device 36 are designated with the finger FG of the user, the sound collection direction computation section 34 of the directionality control apparatus 3 can easily compute the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 as a calibration parameter, for example, even in a case where the PTZ camera apparatus 1 cannot image a floor surface directly under the omnidirectional microphone array apparatus 2 due to the presence of an obstacle on the floor directly under the omnidirectional microphone array apparatus 2.
[0987] Further, in the directionality control system according to the embodiment of the present invention, respective heights of the imaging part and the sound collection part from a horizontal surface are different from each other, the sound collection part is installed so as to be located within an imaging viewing angle of the imaging part, and the sound collection direction computation part computes a horizontal angle and a vertical angle of a sound collection direction which is directed from the sound collection part toward the target sound source position by using a horizontal angle which is directed from the imaging part toward the sound collection part, a horizontal direction distance between the imaging part and the sound collection part, a horizontal angle which is directed from the sound collection part toward the imaging part, the height of the imaging part from the horizontal surface, the height of the sound collection part from the horizontal surface, a horizontal angle and a vertical angle which are directed from the imaging part toward the target sound source position, and an input value of a height of the target sound source position from the horizontal surface.
[0988] In the above-described configuration, the respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 are different from each other, and the omnidirectional microphone array apparatus 2 is installed so as to be displayed on the display device 36. Therefore, the sound collection direction computation section 34 of the directionality control apparatus 3 can easily compute sound collection direction coordinates indicating a sound collection direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A by using the horizontal angle .sub.CMh, the horizontal direction distance L.sub.CMh, the respective predefined heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor surface, which are computed during calibration, the predefined horizontal angle .sub.MCh, the horizontal angle .sub.CAh and the vertical angle .sub.CAv which are directed from the PTZ camera apparatus 1 toward the target sound source position A, and the input value of the height H.sub.A of the target sound source position A from the floor, input by the user.
[0989] Further, since a value desired by the user can be used as the height H.sub.A of the target sound source position A from the floor, a degree of freedom of selection of the height H.sub.A of the target sound source position A from the floor is improved.
[0990] In addition, in the directionality control system according to the embodiment of the present invention, a circular second marker is installed at a position on the horizontal surface directly under the sound collection part, and the sound collection direction computation part computes a horizontal direction distance between the imaging part and the sound collection part, a height of the imaging part from the horizontal surface, and a height of the sound collection part from the horizontal surface, by using a vertical angle which is directed from the imaging part toward the sound collection part, a vertical angle which is directed from the imaging part toward the second marker, and a distance from the imaging part to the second marker, in response to designation of a central point of the second marker displayed on the display part.
[0991] In the above-described configuration, the circular calibration floor marker MAK2 is installed at the position on the floor directly under the omnidirectional microphone array apparatus 2. Therefore, if two locations including the central point O of the calibration floor marker MAK2 and the omnidirectional microphone array apparatus 2 displayed on the display device 36 are designated with the finger FG of the user, the sound collection direction computation section 34 of the directionality control apparatus 3 can easily compute the heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor, and can also easily compute the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 as a calibration parameter.
[0992] In addition, in the directionality control system according to the embodiment of the present invention, respective heights of the imaging part and the sound collection part from a horizontal surface are different from each other, the sound collection part is installed so as not to be located within an imaging viewing angle of the imaging part, and the sound collection direction computation part computes a horizontal angle and a vertical angle of a sound collection direction which is directed from the sound collection part to the target sound source position by using a horizontal angle which is directed from the imaging part toward the sound collection part, a horizontal direction distance between the imaging part and the sound collection part, a horizontal angle which is directed from the sound collection part toward the imaging part, a height of the imaging part from the horizontal surface, a height of the sound collection part from the horizontal surface, a horizontal angle and a vertical angle which are directed from the imaging part toward the target sound source position, and an input value of a height of the target sound source position from the horizontal surface.
[0993] In the above-described configuration, the respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the floor are different from each other, and the omnidirectional microphone array apparatus 2 is not present within the imaging viewing angle of the PTZ camera apparatus 1, that is, the omnidirectional microphone array apparatus 2 is installed so as not to be displayed on the display device 36. Therefore, the sound collection direction computation section 34 of the directionality control apparatus 3 can easily compute sound collection direction coordinates indicating a sound collection direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A by using the horizontal angle .sub.CMh, the horizontal direction distance L.sub.CMh, the respective predefined heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the horizontal surface, which are computed during calibration, the predefined horizontal angle .sub.MCh, the horizontal angle .sub.CAh and the vertical angle .sub.CAv which are directed from the PTZ camera apparatus 1 toward the target sound source position A, and the input value of the height H.sub.A of the target sound source position A from the horizontal surface, input by the user.
[0994] Further, since a value desired by the user can be used as the height H.sub.A of the target sound source position A from the floor, a degree of freedom of selection of the height H.sub.A of the target sound source position A from the floor is improved.
[0995] In addition, in the directionality control system according to the embodiment of the present invention, a first marker having a constant distance to the sound collection part is suspended directly under the sound collection part, a circular second marker is installed at a position on the horizontal surface directly under the sound collection part, and the sound collection direction computation part computes a horizontal direction distance between the imaging part and the sound collection part, a height of the imaging part from the horizontal surface, and a height of the sound collection part from the horizontal surface, by using a vertical angle which is directed from the imaging part toward the first marker, a vertical angle which is directed from the imaging part toward the second marker, a distance from the imaging part to the second marker, in response to designation of central points of the first marker and the second marker displayed on the display part.
[0996] In the above-described configuration, the calibration marker MAK having a constant distance to the omnidirectional microphone array apparatus 2 is suspended directly under the omnidirectional microphone array apparatus 2, and the circular calibration floor marker MAK2 is installed at the position on the floor directly under the omnidirectional microphone array apparatus 2. Therefore, if two locations including the central points of the calibration marker MAK and the calibration floor marker MAK2 and the omnidirectional microphone array apparatus 2 displayed on the display device 36 are designated with the finger FG of the user, the sound collection direction computation section 34 of the directionality control apparatus 3 can easily compute the heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the horizontal surface, and can also easily compute the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 as a calibration parameter.
[0997] In addition, in the directionality control system according to the embodiment of the present invention, a circular second marker is installed at a position on the horizontal surface directly under the sound collection part, and the sound collection direction computation part computes a horizontal direction distance between the imaging part and the sound collection part, and a height of the imaging part from the horizontal surface, by using a vertical angle which is directed from the imaging part toward the second marker, and a distance from the imaging part to the second marker, in response to designation of a central point of the second marker displayed on the display part.
[0998] In the above-described configuration, the circular calibration floor marker MAK2 is installed at the position on the horizontal surface directly under the omnidirectional microphone array apparatus 2. Therefore, even in a case where the respective heights of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the horizontal surface are different from each other, and the omnidirectional microphone array apparatus 2 is not present within the imaging viewing angle of the PTZ camera apparatus 1, if the central point O of the calibration floor marker MAK2 displayed on the display device 36 is designated with the finger FG of the user, the sound collection direction computation section 34 of the directionality control apparatus 3 can easily compute the height H.sub.C of the PTZ camera apparatus 1 from the floor, and can also easily compute the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 as a calibration parameter.
[0999] According to another embodiment of the present invention, there is provided a directionality control method for a directionality control system including an imaging part that images a target object in a predetermined sound collection region and a sound collection part that collects sound of the sound collection region, the method including a step of displaying image data of the target object imaged by the imaging part; a step of receiving designation of any position in the displayed image data of the target object; a step of computing a sound collection direction which is directed from the sound collection part toward a target sound source position corresponding to the designated position in the image data in response to the designation of any position in the image data of the target object; and a step of forming sound collection directionality of the sound collected by the sound collection part in the computed sound collection direction.
[1000] In the above-described configuration, the PTZ camera apparatus 1 images a target object (for example, a person) present in a predetermined sound collection region by using a casing driving function in a panning direction or a tilting direction, and a zooming function which well-known techniques. The omnidirectional microphone array apparatus 2 collects sound of the target object present in an imaging direction of the PTZ camera apparatus 1. If any position is designated in image data captured by the PTZ camera apparatus 1 and displayed on the display device 36, the sound collection direction computation section 34 of the directionality control apparatus 3 computes sound collection direction coordinates (.sub.MAh,.sub.MAv) in a sound collection direction which is directed from the PTZ camera apparatus 1 toward a target sound source position A corresponding to a designated position A. The output control section 35 of the directionality control apparatus 3 forms sound collection directionality of sound collected by the omnidirectional microphone array apparatus 2 in the sound collection direction indicated by the computed sound collection direction coordinates (.sub.MAh,.sub.MAv).
[1001] Consequently, in the sound collection system 10, since the designated position A corresponding to the target sound source position A is designated on the display device 36 which displays the image data captured by the PTZ camera apparatus 1, the directionality control apparatus 3 can easily acquire an input parameter (for example, a distance or a direction) which is required to compute a predetermined calibration parameter during calibration which is required in advance prior to computation of sound collection direction coordinates, and can thus easily compute the calibration parameter. Therefore, the directionality control apparatus 3 can easily form sound collection directionality in a sound collection direction which is directed from the omnidirectional microphone array apparatus 2 toward the target sound source position A corresponding to the designated position A in the image data captured by the PTZ camera apparatus 1 and can thus easily obtain audio data of sound which is collected in the sound collection direction with high accuracy.
Modification Example of Calibration Method in Eleventh Embodiment
[1002] Next, with reference to
[1003] Specifically, in the above-described method of controlling sound collection direction coordinates, the horizontal angle .sub.MCh of a direction in which the PTZ camera apparatus 1 is viewed from the omnidirectional microphone array apparatus 2 is zero. In other words, the horizontal angle .sub.MCh indicating a deviation amount of the X axis direction (a horizontal angle 0 direction; refer to
Method of Computing Calibration Parameter according to Eleventh Calibration Method
[1004] In the eleventh calibration method, it is assumed that the heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the horizontal surface (floor surface) are the same as each other but are not known. Calibration parameters in the eleventh calibration method include the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2, the height H.sub.C of the PTZ camera apparatus 1 from the floor, the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor, and the horizontal angle .sub.MCh indicating a deviation amount of the X axis direction of the omnidirectional microphone array apparatus 2.
[1005]
[1006] The PTZ camera apparatus 1 focuses the point O of the calibration floor marker MAK4 according to the focus function of the PTZ camera apparatus 1 in response to a user's input operation (for example, a touch operation of the finger FG on the display device 36), so as to compute a distance L.sub.CO, a horizontal angle .sub.COh, and a vertical angle .sub.COv from the PTZ camera apparatus 1 to the point O. In addition, the PTZ camera apparatus 1 focuses the point X of the calibration floor marker MAK4 according to the focus function of the PTZ camera apparatus 1 in response to a user's input operation (for example, a touch operation of the finger FG on the display device 36), so as to compute a distance L.sub.CX, a horizontal angle .sub.CXh, and a vertical angle .sub.CXv from the PTZ camera apparatus 1 to the point X. The computation results in the PTZ camera apparatus 1 are transmitted to the directionality control apparatus 3.
[1007]
[1008] If two locations (refer to
[1009] The sound collection direction computation section 34 computes the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 according to Equation (144), and computes the height H.sub.C of the PTZ camera apparatus 1 from the floor according to Equation (145), by using the distance L.sub.CO from the PTZ camera apparatus 1 to the central point O of the calibration floor marker MAK4, and the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 toward the central point O of the calibration floor marker MAK4 in the triangle COM illustrated in
[Equation 144]
L.sub.COh=L.sub.CMh=L.sub.COcos .sub.COv (144)
[Equation 145]
H.sub.C=H.sub.M=L.sub.COsin .sub.COv (145)
[1010] In addition, the sound collection direction computation section 34 computes the horizontal direction distance L.sub.CXh between the PTZ camera apparatus 1 and the point X of the calibration floor marker MAK4 according to Equation (146) by using the distance L.sub.CX from the PTZ camera apparatus 1 to the calibration floor marker MAK4, and the vertical angle .sub.CXv which is directed from the PTZ camera apparatus 1 toward the point X of the calibration floor marker MAK4 in the triangle CXM illustrated in
[Equation 146]
L.sub.CXh=L.sub.CXcos .sub.CXv (146)
[1011] The sound collection direction computation section 34 computes the horizontal angle .sub.MCh indicating a deviation amount of the X axis direction of the omnidirectional microphone array apparatus 2 according to Equation (147) based on the cosine theorem for the triangle CXM illustrated in
[1012] As mentioned above, in the eleventh calibration method of the present modification example, if the two locations including the central point O and the point X of the calibration floor marker MAK4 displayed on the display device 36 are designated with the finger FG of the user, the sound collection direction computation section 34 can easily compute the heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the horizontal surface, and can also easily compute the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 and the horizontal angle .sub.MCh indicating a deviation amount of the X axis direction of the omnidirectional microphone array apparatus 2 as calibration parameters.
[1013] Consequently, since a front direction (0 direction) of a horizontal angle of the omnidirectional microphone array apparatus 2 can be specified, the directionality control apparatus 3 can also similarly compute a calibration parameter between each PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 in a case of a configuration of a sound collection system in which a plurality of PTZ camera apparatuses 1 are provided for a single omnidirectional microphone array apparatus 2. Thus, there is no limitation on installation of the omnidirectional microphone array apparatus 2, such as the omnidirectional microphone array apparatus 2 being installed so as to be directed toward the PTZ camera apparatus 1, and thus the omnidirectional microphone array apparatus 2 can be easily installed.
Method of Computing Calibration Parameter according to Twelfth Calibration Method
[1014] In the twelfth calibration method, it is assumed that the heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the horizontal surface (floor surface) are the same as each other and are known. Calibration parameters in the twelfth calibration method include the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2, and the horizontal angle .sub.MCh indicating a deviation amount of the X axis direction of the omnidirectional microphone array apparatus 2. In addition, the height H.sub.C of the PTZ camera apparatus 1 from the floor is known, but may be specifically computed according to the twelfth calibration method. Further, in description of the twelfth calibration method, description of the same content as that in the description of the eleventh calibration method will be omitted or made briefly, and different content will be described.
[1015]
[1016]
[1017] If two locations (refer to
[1018] The sound collection direction computation section 34 computes the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 according to Equation (148) by using the distance L.sub.CO from the PTZ camera apparatus 1 to the central point O of the calibration floor marker MAK4, and the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 toward the central point O of the calibration floor marker MAK4 in the triangle COM illustrated in
[Equation 148]
L.sub.COh=H.sub.C/tan .sub.COv=L.sub.CMh (148)
[1019] In addition, the sound collection direction computation section 34 computes the horizontal direction distance L.sub.CXh between the PTZ camera apparatus 1 and the point X of the calibration floor marker MAK4 according to Equation (149) by using the distance L.sub.CX from the PTZ camera apparatus 1 to the calibration floor marker MAK4, the height H.sub.C (a known value) of the PTZ camera apparatus 1 from the horizontal surface or the height H.sub.C of the PTZ camera apparatus 1 from the horizontal surface, computed according to Equation (148), and the vertical angle .sub.CXv which is directed from the PTZ camera apparatus 1 toward the point X of the calibration floor marker MAK4 in the triangle CXM illustrated in
[Equation 149]
L.sub.CXh=H.sub.C/tan .sub.CXv (149)
[1020] The sound collection direction computation section 34 computes the horizontal angle .sub.MCh indicating a deviation amount of the X axis direction of the omnidirectional microphone array apparatus 2 according to Equation (150) based on the cosine theorem for the triangle CXM illustrated in
[1021] As mentioned above, in the twelfth calibration method of the present modification example, if the two locations including the central point O and the point X of the calibration floor marker MAK4 displayed on the display device 36 are designated with the finger FG of the user, the sound collection direction computation section 34 can easily compute the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2, and the horizontal angle .sub.MCh indicating a deviation amount of the X axis direction of the omnidirectional microphone array apparatus 2 as calibration parameters. In addition, in the twelfth calibration method, the sound collection direction computation section 34 can easily compute the heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the horizontal surface even in a case where the heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the horizontal surface are different from each other.
[1022] Consequently, since a front direction (0 direction) of a horizontal angle of the omnidirectional microphone array apparatus 2 can be specified, the directionality control apparatus 3 can also similarly compute a calibration parameter between each PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 in a case of a configuration of a sound collection system in which a plurality of PTZ camera apparatuses 1 are provided for a single omnidirectional microphone array apparatus 2. Thus, there is no limitation on installation of the omnidirectional microphone array apparatus 2, such as the omnidirectional microphone array apparatus 2 being installed so as to be directed toward the PTZ camera apparatus 1, and thus the omnidirectional microphone array apparatus 2 can be easily installed.
Method of Computing Calibration Parameter according to Thirteenth Calibration Method
[1023] In the thirteenth calibration method, it is assumed that the heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the horizontal surface (floor surface) are the same as each other but are not known. Calibration parameters in the thirteenth calibration method include the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2, the height He of the PTZ camera apparatus 1 from the floor, the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor, and the horizontal angle .sub.MCh indicating a deviation amount of the X axis direction of the omnidirectional microphone array apparatus 2. Further, in description of the thirteenth calibration method, description of the same content as that in the description of the eleventh calibration method will be omitted or made briefly, and different content will be described.
[1024]
[1025] The PTZ camera apparatus 1 focuses the central point O of the calibration floor marker MAK4 according to the focus function of the PTZ camera apparatus 1 in response to a user's input operation (for example, a touch operation of the finger FG on the display device 36), so as to compute a distance L.sub.CO, a horizontal angle .sub.COh, and a vertical angle .sub.COv from the PTZ camera apparatus 1 to the point O. In addition, the PTZ camera apparatus 1 focuses the point X of the calibration floor marker MAK4 according to the focus function of the PTZ camera apparatus 1 in response to a user's input operation (for example, a touch operation of the finger FG on the display device 36), so as to compute a distance L.sub.CX, a horizontal angle .sub.CXh, and a vertical angle .sub.CXv from the PTZ camera apparatus 1 to the point X. Further, the PTZ camera apparatus 1 focuses the central point O of the calibration floor marker MAK4 according to the focus function of the PTZ camera apparatus 1 in response to a user's input operation (for example, a touch operation of the finger FG on the display device 36), so as to compute a distance L.sub.CO, a horizontal angle .sub.COh, and a vertical angle .sub.COv from the PTZ camera apparatus 1 to the point O. The computation results in the PTZ camera apparatus 1 are transmitted to the directionality control apparatus 3.
[1026]
[1027] If three locations including the central point O, the point X, and the point O of the calibration floor marker MAK4 displayed on the display device 36 are designated with the finger FG of the user, the sound collection direction computation section 34 computes the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2, the height H.sub.C of the PTZ camera apparatus 1 from the floor, the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor, and the horizontal angle .sub.MCh indicating a deviation amount of the X axis direction of the omnidirectional microphone array apparatus 2 by using the radius R.sub.0 of the calibration floor marker MAK4, the the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 to the central point O of the calibration floor marker MAK4, the vertical angle .sub.CXv which is directed from the PTZ camera apparatus 1 to the point X of the calibration floor marker MAK4, and the the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 to the point O of the calibration floor marker MAK4. With reference to
[1028] The sound collection direction computation section 34 obtains a relational expression of the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2, and the height H.sub.C of the PTZ camera apparatus 1 from the horizontal surface according to Equation (151) by using the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 toward the central point O of the calibration floor marker MAK4, and obtains a relational expression of the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2, and the height H.sub.C of the PTZ camera apparatus 1 from the horizontal surface according to Equation (152) by using the radius R.sub.0 of the calibration floor marker MAK4 and the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 toward the point O of the calibration floor marker MAK4, in the triangle COM illustrated in
[Equation 151]
tan .sub.COv=H.sub.C/L.sub.CMh (151)
[Equation 152]
tan .sub.COv=H.sub.C/(L.sub.CMh+R.sub.0) (152)
[1029] The sound collection direction computation section 34 computes the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 according to Equation (153), and computes the height H.sub.C of the PTZ camera apparatus 1 from the floor according to Equation (154), by using the respective computation results of Equations (151) and (152).
[1030] In addition, the sound collection direction computation section 34 computes the horizontal direction distance L.sub.CXh between the PTZ camera apparatus 1 and the point X of the calibration floor marker MAK4 according to Equation (155) by using the computation result of Equation (154) and the vertical angle .sub.CXv which is directed from the PTZ camera apparatus 1 toward the point X of the calibration floor marker MAK4 in the triangle CXM illustrated in
[Equation 155]
L.sub.CXh=H.sub.C/tan .sub.CXv (155)
[1031] The sound collection direction computation section 34 computes the horizontal angle .sub.MCh indicating a deviation amount of the X axis direction of the omnidirectional microphone array apparatus 2 according to Equation (156) based on the cosine theorem for the triangle CXM illustrated in
[1032] As mentioned above, in the thirteenth calibration method of the present modification example, if the three locations including the central point O, the point O, and the point X of the calibration floor marker MAK4 displayed on the display device 36 are designated with the finger FG of the user, the sound collection direction computation section 34 can easily compute the heights H.sub.C and H.sub.M of the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 from the horizontal surface, and can also easily compute the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 and the horizontal angle .sub.MCh indicating a deviation amount of the X axis direction of the omnidirectional microphone array apparatus 2 as calibration parameters.
[1033] Consequently, since a front direction (0 direction) of a horizontal angle of the omnidirectional microphone array apparatus 2 can be specified, the directionality control apparatus 3 can also similarly compute a calibration parameter between each PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2 in a case of a configuration of a sound collection system in which a plurality of PTZ camera apparatuses 1 are provided for a single omnidirectional microphone array apparatus 2. Thus, there is no limitation on installation of the omnidirectional microphone array apparatus 2, such as the omnidirectional microphone array apparatus 2 being installed so as to be directed toward the PTZ camera apparatus 1, and thus the omnidirectional microphone array apparatus 2 can be easily installed.
[1034] In addition, in the calibration floor marker MAK4 used in the thirteenth calibration method, the user is required to define the point X indicating a front direction (0 direction) of a horizontal angle of the omnidirectional microphone array apparatus 2 in advance. However, the user's work for defining the point X in advance may be omitted, and the user may read the horizontal angle .sub.MCh indicating a deviation amount of the X axis direction of the omnidirectional microphone array apparatus 2 from an image obtained by the PTZ camera apparatus 1 and may input the horizontal angle via the operation unit 32 by himself or herself.
[1035] In this case, the user uses a calibration floor marker MAK5 with an angle memory illustrated in
[1036] If two locations including the central point O and the point O of the calibration floor marker MAK5 displayed on the display device 36 are designated with the finger FG of the user, and angle information read from an image is also input via the operation unit 32 by the user, the sound collection direction computation section 34 computes the horizontal direction distance L.sub.CMh between the PTZ camera apparatus 1 and the omnidirectional microphone array apparatus 2, the height H.sub.C of the PTZ camera apparatus 1 from the floor, and the height H.sub.M of the omnidirectional microphone array apparatus 2 from the floor, by using the radius R.sub.0 of the calibration floor marker MAK5, the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 toward the central point O of the calibration floor marker MAK5, and the vertical angle .sub.COv which is directed from the PTZ camera apparatus 1 to the point O of the calibration floor marker MAK5. The horizontal angle .sub.MCh indicating a deviation amount of the X axis direction of the omnidirectional microphone array apparatus 2 is the value which is input by the user via the operation unit 32. The following method of computing a calibration parameter is the same as the calibration method of the fourth embodiment, the unknown horizontal angle .sub.MCh indicating a deviation amount of the X axis direction of the omnidirectional microphone array apparatus 2 is input by the user, and thus description thereof will be omitted.
[1037] This application is based on Japanese Patent Application No. 2013-028402, filed Feb. 15, 2013, Japanese Patent Application No. 2013-119850, filed Jun. 6, 2013, Japanese Patent Application No. 2013-129964, filed Jun. 20, 2013, Japanese Patent Application No. 2013-154867, filed Jul. 25, 2013, and Japanese Patent Application No. 2013-217707, filed Oct. 18, 2013; the entire contents of which are incorporated herein by reference.
INDUSTRIAL APPLICABILITY
[1038] The present invention is useful as a directionality control system and a directionality control method, in which sound collection directionality is formed toward a location or in a direction corresponding to a position designated in a video of a predetermined region imaged by a camera apparatus with a microphone array apparatus as a reference, and thus audio data is collected in the corresponding direction with high accuracy.
[1039] The present invention is useful as a calibration method of matching a reference direction of a horizontal angle of coordinates indicating an imaging direction of a camera apparatus with a reference direction of a horizontal angle of coordinates indicating a sound collection direction of a microphone array apparatus in a case where the camera apparatus and the microphone array apparatus are integrally used.
[1040] The present invention is useful as a directionality control system and a horizontal deviation angle computation method, in which a horizontal deviation angle indicating an angle between a 0 direction of each horizontal angle of imaging direction coordinates of a camera apparatus and sound collection direction coordinates of a microphone array apparatus and mutual reference directions connecting both the apparatuses to each other is computed, and thus the microphone array apparatus can appropriately collect conversation voice of a subject who is present in an imaging direction of the camera apparatus.
[1041] The present invention is useful as a directionality control system and a directionality control method, in which a height of a target sound source position present in a sound collection space from a reference surface is determined, and sound collection directionality is formed in a sound collection direction which is directed from a microphone array apparatus toward the target sound source position based on the height of the target sound source position from the reference surface.
[1042] The present invention is useful as a directionality control system and a directionality control method, in which sound collection directionality is formed in a sound collection direction which is directed toward a target sound source position corresponding to a position designated in an captured by a camera apparatus with a microphone array apparatus as a reference, and sound is collected in the sound collection direction with high accuracy.
REFERENCE SIGNS LIST
[1043] 1: PTZ CAMERA APPARATUS
[1044] 1Z, 1A, 1B, AND 1C: SOUND COLLECTION SYSTEM
[1045] 2, 2A, 2B, 2C, AND 2D: MICROPHONE ARRAY APPARATUS (OMNIDIRECTIONAL MICROPHONE ARRAY APPARATUS)
[1046] 3, 3A, 3B, 3C: DIRECTIONALITY CONTROL APPARATUS
[1047] 3z, 3A, 3C, AND 3D: OMNIDIRECTIONAL CAMERA APPARATUS
[1048] 4: RECORDER (RECORDER APPARATUS)
[1049] 5, 5A, 5B, AND 5C: OMNIDIRECTIONAL MICROPHONE ARRAY APPARATUS
[1050] 7: ATTACHMENT MEMBER
[1051] 7a AND 7b: ENGAGEMENT PIECE
[1052] 7c AND 7d: HOLE PORTION
[1053] 7e: SCREW HOLE
[1054] 8: CEILING SURFACE
[1055] 10 AND 10A: DIRECTIONALITY CONTROL SYSTEM (SOUND COLLECTION SYSTEM)
[1056] 11 AND 11n: CAMERA APPARATUS
[1057] 11y: KEY
[1058] 11z: OMNIDIRECTIONAL CAMERA APPARATUS
[1059] 13: OPENING
[1060] 15: KEY GROOVE
[1061] 17: CASING
[1062] 18: MICROPHONE UNIT
[1063] 21z AND 23z: MARKER
[1064] 26: ADDER
[1065] 31 AND 31A: COMMUNICATION UNIT
[1066] 32: OPERATION UNIT
[1067] 33, 33A, 33B, 33C: SIGNAL PROCESSING UNIT
[1068] 34: SOUND COLLECTION DIRECTION COMPUTATION SECTION
[1069] 34a: SOUND SOURCE HEIGHT DETERMINATION SECTION
[1070] 34b: SOUND COLLECTION DIRECTIONAL DIRECTION COMPUTATION SECTION
[1071] 34c: OUTPUT CONTROL SECTION
[1072] 34w: HORIZONTAL DEVIATION ANGLE COMPUTATION SECTION
[1073] 34x: COORDINATE COMPUTATION SECTION
[1074] 34z: COORDINATE TRANSFORM PROCESSING SECTION
[1075] 35: OUTPUT CONTROL SECTION
[1076] 36: DISPLAY DEVICE
[1077] 37: SPEAKER DEVICE
[1078] 38, 38A, AND 38B: MEMORY
[1079] 61: TOOL
[1080] 63: MARK
[1081] 61A: TOOL IMAGE
[1082] 63A: MARK IMAGE
[1083] 71: ENGAGEMENT HOLE
[1084] 221, 222, 223, 22(n1), AND 22n: MICROPHONE
[1085] 241, 242, 243, 24(n1), AND 24n: A/D CONVERTER
[1086] 251, 252, 253, 25(n1), AND 25n: DELAY DEVICE
[1087] C1: CALIBRATION OMNIDIRECTIONAL CAMERA APPARATUS
[1088] CF1, CF2, AND CF3: CONFIGURATION FILE