Voice signal processing apparatus and voice signal processing method
09699570 ยท 2017-07-04
Assignee
Inventors
Cpc classification
International classification
G10L21/00
PHYSICS
Abstract
A voice signal processing apparatus and a voice signal processing method are provided. A last sampling point of an m.sup.th original frequency-lowered signal frame is determined according to a phase reference sampling point number of the m.sup.th original frequency-lowered signal frame. Here, the phase reference sampling point number corresponds to a middle sampling point of an m.sup.th renovating frequency-lowered signal frame, and the last sampling point is phase-matched with a sampling point corresponding to the phase reference sampling point number in the m.sup.th original frequency-lowered signal frame. P consecutive sampling points starting from the last sampling point are applied as sampling points of an (m+1).sup.th renovating frequency-lowered signal frame.
Claims
1. A voice signal processing apparatus comprising: a sampling circuit configured to sample an original voice signal to generate a sampling voice signal; a processor coupled to the sampling circuit and configured to: lower a frequency of the sampling voice signal to generate a frequency-lowered signal including a sequence of original frequency-lowered signal frames, wherein each of the original frequency-lowered signal frames comprises p sampling points; adjust the sampling points of the original frequency-lowered signal frames to obtain renovating frequency-lowered signal frames; and mix and stack adjacent renovating frequency-lowered signal frames of the renovating frequency-lowered signal frames to generate and output an overlapping voice signal, wherein the processor determines a last sampling point of an m.sup.th original frequency-lowered signal frame of the original frequency-lowered signal frames phase-matched with the sampling point corresponding to a phase reference sampling point number according to the phase reference sampling point number of the m.sup.th original frequency-lowered signal frame corresponding to a middle sampling point of an m.sup.th renovating frequency-lowered signal frame of the renovating frequency-lowered signal frames, wherein the processor sets p consecutive sampling points starting from the last sampling point phase-matched with the sampling point corresponding to the phase reference sampling point number as the sampling points of an (m+1).sup.th renovating frequency-lowered signal frame of the renovating frequency-lowered signal frames, and wherein the phase reference sampling point number is a serial number of the sampling point of the m.sup.th original frequency-lowered signal frame corresponding to the middle sampling point of the m.sup.th renovating frequency-lowered signal frame, p is a positive integer, and m is a positive integer greater than 1.
2. The voice signal processing apparatus of claim 1, wherein each of two adjacent renovating frequency-lowered signal frames of the renovating frequency-lowered signal frames comprises a 50% overlapping section.
3. The voice signal processing apparatus of claim 2, wherein the processor further adds up a first count value and a second count value according to sampling values of the sampling points of the m.sup.th original frequency-lowered signal frame, when the frequency-lowered signal in a positive half cycle is changed to a negative half cycle, the processor returns the first count value to 0, when the frequency-lowered signal in the negative half cycle is changed to the positive half cycle, the processor returns the second count value to 0, the processor sets the first count value or the second count value corresponding to the sampling point of the m.sup.th original frequency-lowered signal frame corresponding to the phase reference sampling point number as a reference value, and the processor determines the last sampling point of the m.sup.th original frequency-lowered signal frame phase-matched with the sampling point corresponding to the phase reference sampling point number according to the reference value.
4. The voice signal processing apparatus of claim 3, wherein the processor further determines whether the first count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number is less than or equal to the second count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number, if the first count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number is less than or equal to the second count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number, the processor sets the first count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number as the reference value and sets a last-sampled sampling point of the sampling points of the m.sup.th original frequency-lowered signal frame where the first count value is equal to the reference value as the last sampling point of the m.sup.th original frequency-lowered signal frame phase-matched with the sampling point corresponding to the phase reference sampling point number, if the first count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number is greater than the second count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number, the processor sets the second count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number as the reference value and sets the last-sampled sampling point of the sampling points of the m.sup.th original frequency-lowered signal frame where the second count value is equal to the reference value as the last sampling point of the m.sup.th original frequency-lowered signal frame phase-matched with the sampling point corresponding to the phase reference sampling point number.
5. The voice signal processing apparatus of claim 1, wherein the processor further multiplies the frequency-lowered signal by a Hamming window.
6. A voice signal processing method, applicable to a voice signal processing apparatus having a sampling circuit and a processor, comprising the following steps: sampling an original voice signal by the sampling circuit to generate a sampling voice signal; lowering a frequency of the sampling voice signal by the processor to generate a frequency-lowered signal including a sequence of original frequency-lowered signal frames, wherein each of the original frequency-lowered signal frames comprises p sampling points, and p is a positive integer; adjusting the sampling points of the original frequency-lowered signal frames by the processor to obtain renovating frequency-lowered signal frames comprising: determining a last sampling point of an m.sup.th original frequency-lowered signal frame of the original frequency-lowered signal frames by the processor according to a phase reference sampling point number of the m.sup.th original frequency-lowered signal frame, the phase reference sampling point number corresponding to a middle sampling point of an m.sup.th renovating frequency-lowered signal frame of the renovating frequency-lowered signal frames, the last sampling point being phase-matched with a sampling point corresponding to the phase reference sampling point number in the m.sup.th original frequency-lowered signal frame, wherein the phase reference sampling point number is a serial number of the sampling point of the m.sup.th original frequency-lowered signal frame corresponding to the middle sampling point of the m.sup.th renovating frequency-lowered signal frame, and m is a positive integer greater than 1; and setting p consecutive sampling points starting from the last sampling point phase-matched with the sampling point corresponding to the phase reference sampling point number as the sampling points of an (m+1).sup.th renovating frequency-lowered signal frame of the renovating frequency-lowered signal frames by the processor; and mixing and stacking adjacent renovating frequency-lowered signal frames of the renovating frequency-lowered signal frames by the processor to generate and output an overlapping voice signal.
7. The voice signal processing method of claim 6, wherein each of two adjacent renovating frequency-lowered signal frames comprises a 50% overlapping section.
8. The voice signal processing method of claim 7, wherein the step of determining the last sampling point of the m.sup.th original frequency-lowered signal frame phase-matched with the sampling point corresponding to the phase reference sampling point number by the processor according to the phase reference sampling point corresponding to the middle sampling point number of the m.sup.th renovating frequency-lowered signal frame comprises: adding up a first count value and a second count value by the processor according to sampling values of the sampling points of the m.sup.th original frequency-lowered signal frame, wherein when the frequency-lowered signal in a positive half cycle is changed to a negative half cycle, returning the first count value to 0, and when the frequency-lowered signal in the negative half cycle is changed to the positive half cycle, returning the second count value to 0; setting the first count value or the second count value corresponding to the sampling point of the m.sup.th original frequency-lowered signal frame corresponding to the phase reference sampling point number as a reference value by the processor; and determining the last sampling point of the m.sup.th original frequency-lowered signal frame phase-matched with the sampling point corresponding to the phase reference sampling point number by the processor according to the reference value.
9. The voice signal processing method of claim 8, wherein the step of setting the first count value or the second count value corresponding to the sampling point of the m.sup.th original frequency-lowered signal frame corresponding to the phase reference sampling point number as the reference value by the processor comprises: determining whether the first count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number is less than or equal to the second count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number by the processor; if the first count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number is less than or equal to the second count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number, setting the first count value corresponding to the sampling point of the m.sup.th original frequency-lowered signal frame corresponding to the phase reference sampling point number as a reference value by the processor; and if the first count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number is greater than the second count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number, setting the second count value corresponding to the sampling point of the m.sup.th original frequency-lowered signal frame corresponding to the phase reference sampling point number as a reference value by the processor.
10. The voice signal processing method of claim 9, wherein if the first count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number is less than or equal to the second count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number, the method comprises: setting a last-sampled sampling point of the sampling points of the m.sup.th original frequency-lowered signal frame where the first count value is equal to the reference value as the last sampling point of the m.sup.th original frequency-lowered signal frame phase-matched with the sampling point corresponding to the phase reference sampling point number by the processor.
11. The voice signal processing method of claim 9, wherein if the first count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number is greater than the second count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number, the method comprises: setting a last-sampled sampling point of the sampling points of the m.sup.th original frequency-lowered signal frame where the second count value is equal to the reference value as the last sampling point of the m.sup.th original frequency-lowered signal frame phase-matched with the sampling point corresponding to the phase reference sampling point number by the processor.
12. The voice signal processing method of claim 9, comprising: multiplying the frequency-lowered signal by the processor by using a Hamming window.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The accompanying drawings are included to provide a further understanding of the invention, and are incorporated in and constitute a part of this specification. The drawings illustrate embodiments of the invention and, together with the description, serve to explain the principles of the invention.
(2)
(3)
(4)
(5)
(6)
DETAILED DESCRIPTION OF DISCLOSED EMBODIMENTS
(7)
(8) The processing unit 102 is able to obtain renovating frequency-lowered signal frames (e.g., renovating frequency-lowered signal frames WL1-WL4 shown in
(9) Each of the original frequency-lowered signal frames includes p sampling points, and p is a positive integer. The processing unit 102 applies the sampling point number of an m.sup.th original frequency-lowered signal frame corresponding to a middle sampling point of the m.sup.th renovating frequency-lowered signal frame as a phase reference sampling point number, determines the last sampling point of the m.sup.th original frequency-lowered signal frame phase-matched with the sampling point corresponding to a phase reference sampling point number according to the phase reference sampling point number, and applies p consecutive sampling points starting from the last sampling point as the sampling points of the (m+1).sup.th renovating frequency-lowered signal frame, such that the middle sampling point of the m.sup.th renovating frequency-lowered signal frame is phase-matched with the initial sampling point of the (m+1).sup.th renovating frequency-lowered signal frame. Here, m is a positive integer greater than 1. Accordingly, when a 50% signal frame overlapping action is performed on the (m+1).sup.th renovating frequency-lowered signal frame and the m.sup.th renovating frequency-lowered signal frame (i.e., each of the (m+1).sup.th renovating frequency-lowered signal frame and the m.sup.th renovating frequency-lowered signal frame includes a 50% overlapping section), the phase mismatch problem may be significantly lessened, and the issue of signal distortion can be resolved to a great extent.
(10) Specifically, the processing unit 102 may add up a first count value and a second count value according to sampling values of the sampling points of the m.sup.th original frequency-lowered signal frame. When the frequency-lowered signal SL in a positive half cycle is changed to a negative half cycle, the first count value is returned to 0, and when the frequency-lowered signal SL in the negative half cycle is changed to the positive half cycle, the second count value is returned to 0. The method to add up the first and second count values can be represented by the following formulas (1) to (4).
(11)
(12) Here, m is a positive integer greater than 1, n=0, 1, 2, . . . , or 2N2, N is a positive integer greater than 1, s.sub.m(n) is the sampling value of the sampling point numbered as n in the m.sup.th original frequency-lowered signal frame, PN.sub.m(n) serves to convert the sampling value s.sub.m(n) into values represented by 1 or 0, wherein PN.sub.m(1)=PN.sub.m(0). Cot.sub.m.sup.+(n) is the first count value corresponding to the sampling point numbered as n in the m.sup.th original frequency-lowered signal frame, Cot.sub.m.sup.(n) is the second count value corresponding to the sampling point numbered as n in the m.sup.th original frequency-lowered signal frame, wherein Cot.sub.m.sup.+(1)=0, and Cot.sub.m.sup.(1)=0. It can be derived from (1) and (2) that Cot.sub.m.sup.+(n) is an accumulated count value corresponding to the frequency-lowered signal in a positive half cycle, whereas Cot.sub.m.sup.(n) is an accumulated count value corresponding to the frequency-lowered signal in a negative half cycle. As shown in formulas (1) to (4), in the present embodiment, the sampling value s.sub.m(n) greater than or equal to 0 and the sampling value s.sub.m(n) less than 0 are set to be 1 and 0, respectively; while the first count value Cot.sub.m.sup.+(n) is being counted, the first count value corresponding to PN.sub.m.sup.D(n) equal to 1 is returned to 0, and while the second count value Cot.sub.m.sup.(n) is being counted, the second count value corresponding to PN.sub.m.sup.D(n) equal to 1 is returned to 0.
(13) The processing unit 102 applies the first count value or the second count value corresponding to the sampling point of the m.sup.th original frequency-lowered signal frame corresponding to the phase reference sampling point number obtained from the m.sup.th renovating frequency-lowered signal frame as a reference value, and the processing unit 102 determines the last sampling point of the m.sup.th original frequency-lowered signal frame phase-matched with the sampling point corresponding to the phase reference sampling point number according to the reference value. For instance, the processing unit 102 determines whether the first count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number is less than or equal to the second count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number, which may be represented by the following formula (5):
Cot.sub.m.sup.+(n.sub.Cot.sub.
(14) Here, n.sub.Cot.sub.
(15) If the first count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number is less than or equal to the second count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number, the processing unit 102 applies the first count value corresponding to the sampling point of the m.sup.th original frequency-lowered signal frame corresponding to the phase reference sampling point number as a reference value and applies the last-sampled sampling point of the m.sup.th original frequency-lowered signal frame where the first count value is equal to the reference value as the last sampling point of the m.sup.th original frequency-lowered signal frame, which can be represented by the following formulas (6) and (7):
(16)
(17) It can be derived from the formulas (6) and (7) that if the first count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point with the serial number n is equal to the first count value of the m.sup.th original frequency-lowered signal frame corresponding to the phase reference sampling point number, n.sub.Cot.sub.
(18) By contrast, if the first count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number is greater than the second count value of the m.sup.th original frequency-lowered signal frame corresponding to the sampling point corresponding to the phase reference sampling point number, i.e., the formula (5) is not satisfied, the processing unit 102 applies the second count value corresponding to the sampling point of the m.sup.th original frequency-lowered signal frame corresponding to the phase reference sampling point number as the reference value and applies the last-sampled sampling point of the m.sup.th original frequency-lowered signal frame where the second count value is equal to the reference value as the last sampling point of the m.sup.th original frequency-lowered signal frame, which can be represented by the following formulas (8) and (9):
(19)
(20) It can be derived from the formulas (8) and (9) that if the second count value n.sub.Cot.sub.
(21) For instance, it is assumed that each of the original frequency-lowered signal frames WL1-WL4 shown in
(22) To obtain the initial sampling point of the renovating frequency-lowered signal frame WL3, the processing unit 102 can count the serial number of the corresponding sampling point of the original frequency-lowered signal frame WL2 while the first count value Cot.sub.2.sup.+(n) is equal to 18. Since the first count value of the original frequency-lowered signal frame WL2 corresponding to the sampling point numbered as 88 is less than the corresponding second count value Cot.sub.2.sup.(88), the first count value Cot.sub.2.sup.+(88) is applied as the reference value. As shown by the schematic diagram illustrating the original frequency-lowered signal frame WL2 in
(23) It should be mentioned that the original frequency-lowered signal frame WL1 is the first original frequency-lowered signal frame, and thus the sampling points of the renovating frequency-lowered signal frame WL1 are included in the original frequency-lowered signal frame WL1, and the phase reference sampling point number of the original frequency-lowered signal frame WL1 corresponding to the middle sampling point of the renovating frequency-lowered signal frame WL1 is 100. In the present embodiment, the serial number of the last sampling point of the original frequency-lowered signal frame WL1 phase-matched with the middle sampling point of the original frequency-lowered signal frame WL1 is 188, which should however not be construed as a limitation to the invention. The method for obtaining the last sampling point (with the serial number 188) is similar to that applied in the foregoing embodiments, and people having ordinary skill in the art should be able to derive the way to implement the invention from the teachings provided in the foregoing embodiment. Hence, no further description is provided hereinafter.
(24) After adjusting the sampling points of each of the original frequency-lowered signal frames and obtaining the corresponding renovating frequency-lowered signal frames, the processing unit 102 may perform a 50%-mixing and stacking action on the adjacent renovating frequency-lowered signal frames to generate an overlapping voice signal. Since the middle sampling point of each renovating frequency-lowered signal frame is phase-matched with the initial sampling point of the next renovating frequency-lowered signal frame, the issue of signal distortion caused by phase mismatch while the signal frames are overlapped can be resolved to a great extent. Besides, in some embodiments, after the renovating frequency-lowered signal frames corresponding to the original frequency-lowered signal frames are obtained, the frequency-lowered signal may be multiplied by a Hamming window to enhance continuity between the right-end and the left-end of the renovating frequency-lowered signal. As shown in
(25)
(26)
(27) To sum up, the last sampling point of the m.sup.th original frequency-lowered signal frame phase-matched with the sampling point corresponding to the phase reference sampling point number is determined according to the phase reference sampling point number of the m.sup.th original frequency-lowered signal frame corresponding to the middle sampling point of the m.sup.th renovating frequency-lowered signal frame. The P consecutive sampling points starting from the last sampling point phase-matched with the sampling point corresponding to the phase reference sampling point number is applied as the sampling points of the (m+1).sup.th renovating frequency-lowered signal frame, such that the issue of signal distortion caused by overlapped signal frames with phase mismatch can be effectively resolved.
(28) It will be apparent to those skilled in the art that various modifications and variations can be made to the structure of the invention without departing from the scope or spirit of the invention. In view of the foregoing, it is intended that the invention cover modifications and variations of this invention provided they fall within the scope of the following claims and their equivalents.