Pitch information generation device, pitch information generation method, and computer-readable recording medium therefor
10242697 ยท 2019-03-26
Assignee
Inventors
Cpc classification
G10H3/125
PHYSICS
International classification
G10H3/12
PHYSICS
Abstract
A pitch information generation device includes a first envelope generator configured to generate, with regard to a first sound range, a first envelope that attenuates at a first rate of change from a detected value corresponding to a peak in the sound signal, a second envelope generator configured to generate, with regard to a second sound range, which includes a sound range of higher frequency than the first sound range, a second envelope that attenuates from a detected value corresponding to a peak in the sound signal at a second rate of change. The second rate of change is greater than the first rate of change. A pitch information identifier is configured to identify the pitch information based on the first envelope and the second envelope.
Claims
1. A pitch information generation device for generating pitch information indicating a pitch of a sound signal, the pitch information generation device comprising: an input device configured to receive a sound signal; a first envelope generator configured to generate a first time-series envelope, for a first predetermined sound range, that attenuates at a first rate of change from a detected value corresponding to a peak in the received sound signal; a second envelope generator configured to generate a second time-series envelope, for a second predetermined sound range having a higher frequency than the first predetermined sound range, that attenuates from a detected value corresponding to a peak in the received sound signal at a second rate of change, wherein the second rate of change is greater than the first rate of change; a pitch information identifier configured to identify the pitch information based on the first time-series envelope and the second time-series envelope; and a display device configured to display the pitch information identified by the pitch information identifier.
2. The pitch information generation device according to claim 1, further comprising a frequency characteristics adjuster configured to apply to the received sound signal a processing that emphasizes frequency components corresponding to the first predetermined sound range, to supply the processed sound signal to the first envelope generator.
3. The pitch information generation device according to claim 2, wherein the frequency characteristics adjuster comprises a filter.
4. The pitch information generation device according to claim 1, wherein: the first envelope generator generates the detected value corresponding to the peak by multiplying the received sound signal by a first coefficient, the second envelope generator generates the detected value corresponding to the peak by multiplying the received sound signal by a second coefficient, and the second coefficient is smaller than the first coefficient.
5. The pitch information generation device according to claim 1, wherein the first predetermined sound range and the second predetermined sound range partially overlap with each other.
6. The pitch information generation device according to claim 1, wherein the pitch information identifier comprises: a first pitch information generator configured to generate first pitch information indicating the pitch of the received sound signal, in a case where the pitch is identifiable based on the first time-series envelope; a second pitch information generator configured to output second pitch information indicating the pitch of the sound signal, in a case where the pitch is identifiable based on the second time-series envelope; and a selector configured to output the second pitch information as the pitch information in a case where both the first pitch information and the second pitch information are generated.
7. The pitch information generation device according to claim 1, wherein the input device comprises a microphone that collects sound and outputs a sound signal.
8. A pitch information generation method of generating pitch information indicating a pitch of an input sound signal, the method comprising: a first envelope generating step of generating a first time-series envelope, for a first predetermined sound range, that attenuates at a first rate of change from a detected value corresponding to a peak in the received sound signal; a second envelope generating step of generating a second time-series envelope, for a second predetermined sound range having a higher frequency than the first predetermined sound range, that attenuates from a detected value corresponding to a peak in the received sound signal at a second rate of change, wherein the second rate of change is greater than the first rate of change; a pitch identifying step of identifying the pitch information based on the first time-series envelope and the second time-series envelope; and a displaying step of displaying, in a display device, the pitch information identified by the pitch information identifier.
9. A non-transitory computer-readable recording medium storing a program executable by a computer to execute a method of generating pitch information indicating a pitch of an input sound signal, the method comprising: a first envelope generating step of generating a first time-series envelope, for a first predetermined sound range, that attenuates at a first rate of change from a detected value corresponding to a peak in the input sound signal; a second envelope generating step of generating a second time-series envelope, for a second predetermined sound range having a higher frequency than the first predetermined sound range, that attenuates from a detected value corresponding to a peak in the input sound signal at a second rate of change, wherein the second rate of change is greater than the first rate of change; a pitch identifying step of identifying the pitch information based on the first time-series envelope and the second time-series envelope; and a displaying step of displaying, in a display device, the pitch information identified by the pitch information identifier.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
DETAILED DESCRIPTION
(15) The present invention relates to a technique for detecting information on sound pitches (fundamental frequencies) from sound signals.
(16)
(17)
(18) A storage device 14 stores the pitch information generation program for generating pitch information from the sound signal A and various types of data. Any commonly known storage medium, such as a semiconductor recording medium and magnetic recording medium, can be used as the storage device 14. The pitch information generation program can be provided to users in a format stored in a computer-readable recording medium and be installed in the pitch information generation device 100. The recording medium is a non-transitory recording medium, and examples thereof other than the above include an optical recording medium (optical disc), such as a CD-ROM, and a USB (Universal Serial Bus) memory. The pitch information generation program may instead be distributed via the communication network N for example and be installed on the pitch information generation device 100.
(19)
(20) An operation input device 133 is also displayed on the display screen F, the operation input device 133 including, for example, a group of button images that are used to input information, such as figures and notes (A to F), and an exit button image. The operator can carry out an input operation by, for example, touching the button images displayed on the screen. A parameter display device 134 displays setting and measurement information of the various parameters related to a frequency of the sound signal A. The parameters displayed on the parameter display device 134 include OCT-NOTE that indicates an octave and a note corresponding to a frequency of the sound signal A, KEY No. that indicates the key number thereof, CENT that indicates the degree of difference from the tuning curve, CURVE that indicates a measurement curve selected as the measurement standard and PITCH (reference frequency) that corresponds to the key number 49. A key number is a number unique to each key of a piano keyboard (88 keys) that is assigned from the lowest key to the highest key in the order of 1 to 88. A reference frequency corresponding to the key number 49 is a value predetermined by the operator from among 440 Hz, 441 Hz, 442 Hz, etc., and formal frequencies of the other key numbers are determined based on this reference frequency. A formal frequency is a value determined for each pitch, and it may be determined by table lookup or calculation.
(21) In the present embodiment, the pitch information generation device 100 generates pitch information of the sound signal A of a sound emitted when a keyboard key is pressed, and then displays, on the display screen F, a key number corresponding to the generated pitch information as KEY No. and an octave and a note corresponding to the key number as OCT-NOTE. The key number displayed here as KEY No. is identified based on the formal frequency that is the closest to the pitch information detected by the pitch information generation device 100, from among the formal frequencies corresponding to the different key numbers.
(22)
(23) By executing the pitch information generation program stored in the storage device 14, the CPU 12 functions as multiple elements (a frequency characteristics adjuster 20, a low sound range envelope generator 30-1, a middle sound range envelope generator 30-2, a high sound range envelope generator 30-3 and a pitch information identifier 40). A configuration where a hardware (circuitry) exclusively used for processing the sound signal A[a], such as a DSP, realizes the different elements of the CPU 12, and a configuration where the different elements of the CPU 12 are dispersedly mounted on multiple integrated circuits are also possible.
(24) The low sound range envelope generator 30-1 generates a first envelope from the sound signal A[a] for a low sound range between 20 Hz and 200 Hz inclusive. The middle sound range envelope generator 30-2 generates a second envelope from the sound signal A[a] for a middle sound range between 100 Hz and 1000 Hz inclusive. The high sound range envelope generator 30-3 generates a third envelope from the sound signal A[a] for a high sound range between 700 Hz and 5000 Hz inclusive. The low sound range and the middle sound range partially overlap each other, and the middle sound range and the high sound range partially overlap each other. In other words, the middle sound range includes a sound range with frequencies higher than those of the low sound range, and the high sound range includes a sound range with frequencies higher than those of the middle sound range.
(25) The sound signal A[a] supplied to the pitch information generation device 100 is supplied to each of the frequency characteristics adjuster 20, the middle sound range envelope generator 30-2 and the high sound range envelope generator 30-3. The frequency characteristics adjuster 20 applies to the sound signal A[a] a processing that emphasizes frequency components corresponding to a part or all of the low sound range (20 Hz to 200 Hz) and supplies the outcome to the low sound range envelope generator 30-1. The frequency characteristics adjuster 20 may be low-pass filters or high-cut filters, for example.
(26)
(27)
(28) As
(29) The reference value calculator 56 calculates the reference value x_p from the detected value e_p that is sequentially selected by the comparer 52 and a rate of change R3. More specifically, the reference value calculator 56 is a multiplier that sequentially calculates, as the reference value x_p, the multiplied value of the detected value e_p and the rate of change R3 (in this embodiment a coefficient in specific terms). The coefficient is set to a positive number less than 1. Accordingly, in the section Q2_p shown in
(30) Similar to the positive side envelope generator 32, the negative side envelope generator 34 is configured to include the gain imparter 50, the comparer 52, the delay unit 54, and the reference value calculator 56. But the relationships (small and large, positive and negative) between the different values become the opposite of those of the positive side envelope generator 32. More specifically, the reference value x_n that the reference value calculator 56 of the negative side processor 34 calculates is a negative number, and the comparer 52 sequentially selects as the detected value e_n the smaller of the reference value x_n and the intensity a of the sound signal A[a] (i.e., the one with a larger absolute value). In other words, as
(31) The middle sound range envelope generator 30-2 and the low sound range envelope generator envelope generator 30-1 have a similar configuration as the high sound range envelope generator 30-3 shown in
(32) The gain imparter 50 included in the low sound range envelope generator 30-1 uses a coefficient E1, and the gain imparter 50 included in the middle sound range envelope generator 30-2 uses a coefficient E2, with both the coefficient E1 and E2 differing from the coefficient E3 by which the intensity a of the sound signal A[a] is multiplied in the gain imparter 50 of the high sound range envelope generator 30-3. In the present embodiment, the coefficient E1, which the gain imparter 50 of the positive side envelope generator 32 (or the negative side envelope generator 34) in the low sound range envelope generator 30-1 uses, and the coefficient E2, which the gain imparter 50 of the positive side envelope generator 32 (or the negative side envelope generator 34) in the middle sound range envelope generator 30-2 uses, are set to 1, whereas the coefficient E3, which the gain imparter 50 of the positive side envelope generator 32 (or negative side envelope generator 34) in the high sound range envelope generator 30-3 uses, is set to a positive number less than 1 (E3<E1=E2=1). In a sound range with a high frequency, the peaks of the sound signal A[a] tend to be more erratic compared to those in a sound range with a low frequency. In this embodiment, with respect to a sound range with a high frequency, a detected value corresponding to a peak K_p is generated using a coefficient with a smaller absolute value (i.e., gain is reduced) compared to that with respect to a sound range with a low frequency. Therefore, it is possible to reduce the erratic nature of the peaks of the waveform of the sound signal A[a].
(33) In this way, the low sound range envelope generator 30-1, the middle sound range envelope generator 30-2, and the high sound range envelope generator 30-3 use different rates of change R1, R2, and R3, respectively, and different coefficients E1, E2, and E3, respectively. Consequently, a first envelope output from the low sound range envelope generator 30-1, a second envelope output from the middle sound range envelope generator 30-2, and a third envelope output from the high sound range envelope generator 30-3 are all different from one another, even when the same sound signal A[a] is input.
(34)
(35) As shown in
(36) As a further comparison,
(37) As shown in
(38) The sound signal A[a] that is of a sound range close to the lowest tone of pianos (in the case of 88-key pianos, 27.5 Hz) is characterized by having a weak fundamental tone and including many overtones. As a result, there are cases where it is difficult, because of the influences of the overtones, to generate an envelope that represents a pitch corresponding to a fundamental tone that is the primary target. In view of this, in the present invention, the frequency characteristics adjuster 20 is provided, and the sound signal A[a] is supplied to the low sound range envelope generator 30-1 after it has undergone a processing that emphasizes a part or all of frequency components that correspond to a low sound range in the sound signal A[a].
(39)
(40) Next, the pitch information identifier 40 will be described. As
(41) Next, a description will be given on a pitch information generation process. A pitch information generation process is a process carried out by each of the first to third pitch information generators 41-1 to 41-3 serving as functional elements of the CPU 12.
(42)
(43) Subsequently, the third pitch information generator 41-3 determines whether or not the identified pitch PA3 is in a predetermined sound range (S2). More specifically, the third pitch information generator 41-3 determines whether or not the identified pitch PA3 is in the high sound range between 700 Hz and 5000 Hz inclusive. When this determination requirement is met (S2: YES), the third pitch information generator 41-3 outputs the third pitch information D[PA3] that indicates the pitch PA3 (S3). On the other hand, when the determination requirement is not met (S2: NO), the process returns to step S1 and the subsequent processing is carried out again.
(44) As mentioned above, the high sound range envelope generator 30-3 is a functional element that can generate, with a high degree of accuracy, an envelope of the sound signal AH[a] of the high sound range. Accordingly, if the sound signal A[a] supplied to the high sound range envelope generator 30-3 is a sound signal AM[a] of the middle sound range, the pitch PA3 identified by the third pitch information generator 41-3 may be of low accuracy. For this reason, the third pitch information generator 41-3 supplies to the selector 42 the third pitch information D[PA3] that indicates the pitch PA3, only when the pitch PA3 is in the high sound range between 700 Hz and 5000 Hz inclusive. In other words, the third pitch information generator 41-3 generates the third pitch information D[PA3] that indicates the pitch PA3 of the sound signal A[a] provided that the pitch PA3 is identifiable based on the third envelope.
(45) The first pitch information generator 41-1 and the second pitch information generator 41-2 also generate a pitch PA1 and a pitch PA2, respectively, and determine whether or not the generated pitch is in a predetermined sound range (The first pitch information generator 41-1 determines whether or not the pitch PA1 is in the low sound range between 20 Hz and 200 Hz inclusive. The second pitch information generator 41-2 determines whether or not the pitch PA2 is in the middle sound range between 100 Hz and 1000 Hz inclusive.). Only when the pitch PA1 is in the predetermined sound range does the first pitch information generator 41-1 supply to the selector 42 the first pitch information D[PA1] that indicates the pitch PAL Only when the pitch PA2 is in the predetermined sound range does the second pitch information generator 41-2 supply to the selector 42 the second pitch information D[PA2] that indicates the pitch PA2. In other words, the first pitch information generator 41-1 generates the first pitch information D[PA1] that indicates the pitch of the sound signal A[a] provided that the pitch PA1 is identifiable based on the first envelope. The second pitch information generator 41-2 generates the second pitch information D[PA2] that indicates the pitch of the sound signal A[a] provided that the pitch PA2 is identifiable based on the second envelope.
(46)
(47) When the determination requirement of step S11 is not met (S11: NO), i.e., when the number of pieces of the supplied pitch information is 1, the selector 42 outputs the one piece of pitch information as a definite pitch information D[PA] (S13).
(48) On the other hand, when the determination requirement of step S11 is met (S11: YES), i.e., when the number of pieces of the supplied pitch information is 2, the selector 42 selects, out of the two pieces of pitch information, the pitch information D[PA] that has been output by the pitch information generator 41 that covers a higher sound range (S12). More specifically, the selector 42 selects the second pitch information D[PA2] when two pieces of pitch information, that is, the first pitch information D[PA1] generated by the first pitch information generator 41-1 and the second pitch information D[PA2] generated by the second pitch information generator 41-2, are supplied to the selector 42. The selector 42 selects the third pitch information D[PA3] when another set of two pieces of pitch information, that is, the second pitch information D[PA2] generated by the second pitch information generator 41-2 and the third pitch information D[PA3] generated by the third pitch information generator 41-3, are supplied to the selector 42.
(49) The greater the degree of change in waveform of an envelope is (i.e., the greater the rate of change R is), the faster the response speed is, and therefore, the subsequent peak K_p of a sound signal is easily captured. Consequently, the envelope generator 30 that uses a greater rate of change R can generate pitch information with higher accuracy, provided that the sound range is the same. Accordingly, in the present invention, when two pieces of pitch information D[PA] are identifiable in an overlapping sound range, the pitch information D[PA], the rate of change R of which has been used in generating the envelope that serves as the basis of the pitch information D[PA] is greater, is selected. If instead a sound range is set exclusively, in frequencies near the upper and lower limits of the sound range that the envelope generator 30 covers, the peaks cannot be outlined accurately, depending on the waveform, resulting in the pitch information generator 41 not being able to output pitch information. By allocating two adjacent sound ranges in an overlapping manner, it is possible to generate the pitch information D[PA] when one of the pitch information generators 41 can generate pitch information, even when the other pitch information generator 41 cannot generate pitch information.
(50) Next, the selector 42 returns to step S11 after outputting the selected pitch information as the definite pitch information D[PA] (S13) and executes the selection process again for a new piece of pitch information D[PA].
(51) After the execution of the abovementioned process, on the display screen F of the display device 13, a key number corresponding to the pitch PA indicated by the pitch information D[PA] output by the selector 42 is displayed as KEY No., and an octave and a note corresponding to the key number is displayed as OCT-NOTE. In tuning a piano, the pitch of a sound signal of a piano performance sound obtained by the pressing of a keyboard key by the tuner is off the formal frequency corresponding to the key. The difference, however, is within 1 percent, below or above, of the formal frequency, and never deviates as much as the formal frequency of neighboring keys. Accordingly, based on the detected pitch, it is possible to identify a target frequency that is a tuning target and to identify the key number that corresponds to the target frequency. The operator tunes a key that is the object of tuning, so that the pitch PA indicated by the pitch information D[PA] output every time said key is pressed and the target frequency that has been automatically set match each other (i.e., so that the indicator 132 on the display screen F stops). When the operator ends tuning the current tuning-object key and plays a new sound by pressing another tuning-object key, a new piece of pitch information D[PA] is generated with respect to this sound signal A[a] and a target frequency is identified. On the display screen F, the key number displayed as KEY No. and the octave and note displayed as OCT-NOTE switch to those corresponding to the newly identified target frequency. The operator plays the tuning-object key referring to the indicator 132 and tunes the tuning-object key so that the indicator 132 stops.
(52) As described above, according to the pitch information generating device 100 of the present invention, it is possible to generate pitch information for a wide range of sound with a small amount of calculation and a high degree of accuracy since the pitch information generating device 100 identifies pitch information by generating an envelope that attenuates at the rate of change R corresponding to a sound range, based on the detected value corresponding to the peaks K_p of the sound signal A[a].
(53) Moreover, since a key number, etc., that corresponds to a tuning-object key is automatically set, it is possible to set a tuning-object key with less burden, compared to when setting a tuning-object key by inputting the key number of the tuning-object key in the operation input device 133.
(54) The abovementioned embodiment may be modified in various ways. The following are examples of specific modifications. Two or more of the following examples may be freely combined.
(55) In a first modification, the method by which the reference value calculator 56 calculates the reference value x (x_p or x_n) from the rate of change R and the detected value e (e_p or e_n) may be changed as appropriate. For example, a configuration where the reference value x_p is calculated by subtracting the rate of change from the detected value e_p on the positive side and a configuration where the reference value x_n is calculated by adding the rate of change to the detected value e_n on the negative side may be adopted. In other words, as long as the reference value x is calculated so that it attenuates in a speed corresponding to a rate of change (the reference value x_p on the positive side decreasing, or the reference value x_n on the negative side increasing), the specific method by which the reference value x is calculated may be freely chosen. A preferable configuration is to set a rate of change with which the reference value x changes at a faster speed for the envelope generator 30 that covers a sound range with a higher frequency.
(56) The rate of change R described in the abovementioned embodiment is provided as a coefficient that is used to multiply the output of the delay unit 54 by. The rate of change R, however, is not limited to such coefficient, and it may be any index that indicates a change in envelope per unit time. For example, the rate of change may be a so-called time constant, or if it is desirable to have the envelope change in a straight line, it may be the angle of the line.
(57) In the abovementioned embodiment, each of the envelope generators 30 uses a single rate of change R, but in another embodiment, two or more different rates of change R may be used. For example, in a case where a value (absolute value) corresponding to the peak K_p or K_n is smaller than the intensity a of the sound signal A[a] as an effect of the gain imparter 50, it is preferable to switch from one rate of change R to another rate of change R where the change speed of an envelope becomes slower (i.e., attenuates more slowly), the switching being performed at a timing the envelope attenuating from a value corresponding to peak K_p or K_n crosses the waveform A[a] of the sound signal A (i.e., at a timing the detected value e_p or e_n (absolute value) of the envelope surpasses the intensity a of the sound signal A). According to this mode, it is possible to reduce the risk of erroneously detecting peaks (peaks appearing as a result of overtones and noises) other than peaks of a fundamental tone that is the primary target since the rate of change switches from a rate by which the envelope sharply attenuates to a rate by which the envelope slowly attenuates.
(58) In each of the above embodiments, each of the envelope generators 30 was configured to include the positive side envelope generator 32 and the negative side envelope generator 34. But in another embodiment, it is also preferable to configure each of the envelope generators 30 to include either one of the positive side envelope generator 32 or the negative side envelope generator 34. For example, according to a configuration where each envelope generator 30 includes the positive side envelope generator 32 only, the pitch PA of the sound signal A is identified from the intervals between the respective points I_p detected from the detected values e_p on the positive side.
(59) The pitch information D[PA] refers to information related to the pitch PA of the sound signal A, but in another embodiment, it is not limited to the pitch PA (frequency) of the sound signal A in terms of the above embodiments. For example, one preferable configuration is one where a cycle corresponding to the pitch PA (pitch cycle [i.e., time]) or a key number corresponding to the pitch PA is identified as the pitch information D.
(60) In the above embodiment, a sound range that is the object of pitch information generation is divided into three sound ranges, a low sound range between 20 Hz and 200 Hz inclusive, a middle sound range between 100 Hz and 1000 Hz inclusive, and a high sound range between 700 Hz and 5000 Hz inclusive. But in another embodiment, the object sound range can be divided into two sound ranges or into four or more sound ranges. Accordingly, in the modified embodiment(s), the number of envelope generators 30 and pitch information generators 41 can be 2 or 4 or more. The sound ranges do not necessarily have to partially overlap. In such a case, the selector 42 does not have to be included in the pitch information generation device 100.
(61) In other words, the pitch information generation device can include at least two envelope generators that respectively correspond to a first sound range and a second sound range that includes a sound range with a higher frequency than the first sound range.
(62) Additionally, it is not necessary that the first sound range and the second range be adjacent to each other (or consecutive). In other words, in a case where a sound range that is an object of pitch information generation is divided into three sound ranges (for example, into the low sound range, the middle sound range, and the high sound range), the first sound range may be the low sound range, and in this case the second sound range may be either the middle sound range or the high sound range. Alternatively, the first sound range may be the middle sound range, in which case the second sound range may be the high sound range. For example, when the middle sound range is assumed to be the first sound range and the high sound range the second sound range, the middle sound range envelope generator 30-2 of the embodiment, with respect to the first sound range, functions as a first envelope generator that generates a first envelope that attenuates at the first rate of change (R2) from detected values corresponding to the peaks of a sound signal, and the high sound range envelope generator 30-3 of the embodiment, with respect to the second sound range, functions as a second envelope generator that generates a second envelope that attenuates at the second rate of change (R3) from detected values corresponding to the peaks of the sound signal. Similarly, the second pitch information generator 41-2 of the embodiment functions as a first pitch information generator that generates first pitch information indicating the pitch of the sound signal when the pitch is identifiable based on the first envelope, and the third pitch information generator 41-3 functions as a second pitch information generator that generates second pitch information indicating the pitch of the sound signal when the pitch is identifiable based on the second envelope.
(63) Furthermore, when the low sound range is assumed to be the first sound range and the high sound range the third sound range for example, the first envelope generator 30-1 generates detected values corresponding to the peaks by multiplying a sound signal by the coefficient E1 (first coefficient), and the third envelope generator 30-3 generates detected values corresponding to the peaks by multiplying the sound signal by the coefficient E3 (second coefficient). In this case, the coefficient E3 (second coefficient) is smaller than the coefficient E1 (first coefficient). Furthermore, when the middle sound range is assumed to be the second sound range and the high sound range the third sound range for example, the second envelope generator 30-2 generates detected values corresponding to the peaks by multiplying a sound signal by the coefficient E2 (first coefficient), and the third envelope generator 30-3 generates detected values corresponding to the peaks by multiplying the sound signal by the coefficient E3 (second coefficient). In this case, the coefficient E3 (second coefficient) is smaller than the coefficient E2 (first coefficient).
(64) The upper and lower limits in frequencies in the different sound ranges are just one example, and they may be changed as appropriate as long as the effects of the present invention are maintained.
(65) The configuration where the gain imparter 50 is included in each of the low sound range envelope generator 30-1, the middle sound range envelope generator 30-2, and the high sound range envelope generator 30-3 can be changed as appropriate. For example, a preferable configuration can be one where the gain imparter 50 is included in only the high sound range envelope generator 30-3 (an envelope generator 30 covering the sound range with a higher frequency in a case where the entire sound range is divided into two sound ranges, and one or more envelope generators 30 including an envelope generator 30 covering the highest sound range in a case where the entire sound range is divided into four or more sound ranges). Alternatively, a configuration where none of the envelope generators 30 includes the gain imparter 50 can be adopted. Furthermore, a configuration where the frequency characteristics adjuster 20 is not included can be selected.
(66) A coefficient used in the gain imparter 50 included in each envelope generator 30 in the above embodiment is E3<E1=E2=1, but this coefficient can be changed as appropriate as long as the effects of the present invention are maintained.
(67) In the above embodiment, the pitch PA is identified based on the intervals between respective points I_p, I_n. But instead, the pitch PA can be identified based on the intervals between respective peak K_p. Each envelope generator 30 is understood as an element that identifies a sequence of the detected values e in a way that the detected values e attenuate from the respective peaks K of the sound signal A[a] at a speed corresponding to the rate of change R (i.e., in a way that the angle of an envelope of the sound signal A[a] is controlled according to the rate of change R). A comparison between the reference value x and the intensity a of the embodiment is not an absolute requirement.
(68) In the above embodiments, a key number corresponding to a key that is the object of tuning and other information are automatically set based on the definite pitch information D[PA] output from the selector 42. But an alternative configuration can have an operator input from the operation input device 133 to set a key number of a key that is the object of tuning. Carrying out tuning based on pitch information detected with a high degree of accuracy is possible since, in such a case also, the indicator 132 indicates a phase relation between the definite pitch information D[PA] output from the selector 42 and a target frequency corresponding to a set key number.
(69) The pitch information generation device of the present invention may be applied, not only in detecting a pitch of a musical sound of pianos, but also in detecting a pitch of a musical sound of other musical instruments or of a singing voice. The pitch information generation device 100 is not limited to a smartphone or other tablet terminals but can be a desktop personal computer, a notebook personal computer, a Ultra-Mobile Personal Computer (UMPC), or a portable game machine.
DESCRIPTION OF REFERENCE SIGNS
(70) 100 . . . pitch information generation device, 11 . . . communication device 12 . . . CPU, 13 . . . display device, 14 . . . storage device, 15 . . . audio interface, 16 . . . microphone, 20 . . . frequency characteristics adjuster, 30-1 . . . low sound range envelope generator, 30-2 . . . middle sound range envelope generator, 30-3 . . . high sound range envelope generator, 32 . . . positive side envelope generator, 34 . . . negative side envelope generator, 40 . . . pitch information identifier, 41-1 . . . first pitch information generator, 41-2 . . . second pitch information generator, 41-3 . . . third pitch information generator, 42 . . . selector, 50 . . . gain imparter, 52 . . . comparer, 54 . . . delay unit, 56 . . . reference value calculator.