In-vehicle voice recognition apparatus and method of controlling the same
10861448 ยท 2020-12-08
Assignee
Inventors
- Hye Joo Lee (Pucheon-si, KR)
- Sung Chan PARK (Seoul, KR)
- Myoung Kyoung Ji (Yongin-si, KR)
- Jeong Suck Park (Yongin-si, KR)
Cpc classification
H04R2499/13
ELECTRICITY
H03G5/165
ELECTRICITY
G10K11/17885
PHYSICS
G06F3/167
PHYSICS
G10L15/22
PHYSICS
G10K11/178
PHYSICS
G10K11/17819
PHYSICS
H04R3/02
ELECTRICITY
H04R2410/01
ELECTRICITY
B60K35/00
PERFORMING OPERATIONS; TRANSPORTING
B60K35/28
PERFORMING OPERATIONS; TRANSPORTING
G10L15/20
PHYSICS
B60K35/10
PERFORMING OPERATIONS; TRANSPORTING
International classification
B60K35/00
PERFORMING OPERATIONS; TRANSPORTING
H04R3/02
ELECTRICITY
G10K11/178
PHYSICS
G10L15/22
PHYSICS
Abstract
An in-vehicle voice recognition apparatus capable of recognizing user voice more correctly by removing sound generated in a vehicle from sound input to a microphone, and a method of controlling the same are disclosed. The in-vehicle voice recognition apparatus according to an embodiment of the present disclosure includes a central processing unit configured to output a media signal, a digital signal processor configured to receive the media signal from the central processing unit and to convert the media signal into an analog signal, and an eco cancellation hardware connected to the digital signal processor and configured to receive the analog signal, to acquire a first final media output on the basis of the analog signal and to acquire a voice command from a sound input through a microphone on the basis of the first final media output.
Claims
1. An in-vehicle voice recognition apparatus comprising: a central processing unit configured to output a media signal; a digital signal processor configured to receive the media signal from the central processing unit and convert the media signal into an analog signal; and an eco cancellation hardware connected to the digital signal processor and configured to receive the analog signal, and configured to acquire a first final media output on the basis of the analog signal and acquire a voice command from a sound input through a microphone on the basis of the first final media output, wherein the eco cancellation hardware includes a plurality of equalizer settings corresponding to a plurality of modes and, upon reception of a mode value from the central processing unit, applies an equalizer setting corresponding to the mode value among the plurality of equalizer settings to the analog signal to acquire the first final media output.
2. The in-vehicle voice recognition apparatus according to claim 1, wherein the eco cancellation hardware determines the equalizer setting corresponding to the mode value with reference to a table in which a corresponding relationship between mode values corresponding to the plurality of modes and the plurality of equalizer settings is defined.
3. The in-vehicle voice recognition apparatus according to claim 1, further comprising an external amplifier connected to the digital signal processor and configured to receive the analog signal, apply an equalizer setting corresponding to a current mode to the analog signal to acquire a second final media output, and amplify the second final media output and transfer the amplified second final media output to a speaker.
4. The in-vehicle voice recognition apparatus according to claim 3, wherein the first final media output corresponds to the second final media output.
5. The in-vehicle voice recognition apparatus according to claim 3, wherein the sound input through the microphone includes at least sound obtained when the second final media output is output through the speaker.
6. The in-vehicle voice recognition apparatus according to claim 1, wherein the central processing unit performs mode management according to an input of a user command.
7. The in-vehicle voice recognition apparatus according to claim 3, further comprising: a communication module configured to receive a customized equalizer setting from an external server; and a personalization service module configured to transmit the received customized equalizer setting to the eco cancellation hardware and the external amplifier.
8. The in-vehicle voice recognition apparatus according to claim 1, wherein the voice command includes a wakeup command and a direct command.
9. A vehicle including the in-vehicle voice recognition apparatus according to claim 1.
10. A method of controlling an in-vehicle voice recognition apparatus, comprising: receiving, by a digital signal processor, a media signal from a central processing unit and converting the media signal into an analog signal; receiving, by an eco cancellation hardware, the analog signal converted by the digital signal processor; acquiring, by the eco cancellation hardware, a first final media output on the basis of the analog signal; and acquiring, by the eco cancellation hardware, a voice command from a sound input through a microphone on the basis of the first final media output, wherein the eco cancellation hardware includes a plurality of equalizer settings corresponding to a plurality of modes, and wherein the acquiring of the final media output comprises: receiving, by the eco cancellation hardware, a mode value from the central processing unit; and applying, by the eco cancellation hardware, an equalizer setting corresponding to the mode value among the plurality of equalizer settings to the analog signal to acquire the first final media output.
11. The method according to claim 10, wherein the applying of the equalizer setting comprises determining, by the eco cancellation hardware, the equalizer setting corresponding to the mode value with reference to a table in which a corresponding relationship between mode values corresponding to the plurality of modes and the plurality of equalizer settings is defined.
12. The method according to claim 10, further comprising: receiving, by an external amplifier, the analog signal from the digital signal processor; applying, by the external amplifier, an equalizer setting corresponding to a current mode to the analog signal to acquire a second final media output; and amplifying, by the external amplifier, the second final media output and transferring the amplified second final media output to a speaker.
13. The method according to claim 12, wherein the first final media output corresponds to the second final media output.
14. The method according to claim 12, wherein the sound input through the microphone includes at least sound obtained when the second final media output is output through the speaker.
15. The method according to claim 10, wherein the central processing unit performs mode management according to an input of a user command.
16. The method according to claim 12, further comprising: receiving, by a communication module, a customized equalizer setting from an external server; and transmitting, by a personalization service module, the received customized equalizer setting to the eco cancellation hardware and the external amplifier.
17. The method according to claim 10, wherein the voice command includes a wake command and a direct command.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the principle of the invention. In the drawings:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
DETAILED DESCRIPTION OF THE INVENTION
(9) Reference will now be made in detail to the preferred embodiments of the present disclosure, examples of which are illustrated in the accompanying drawings. The suffixes module and unit of elements herein are used for convenience of description and thus can be used interchangeably and do not have any distinguishable meanings or functions.
(10) Embodiments of the present disclosure will be described in detail with reference to the accompanying drawings so that the present disclosure can be easily realized by those skilled in the art. However, the present disclosure can be realized in various different forms and is not limited to the embodiments described herein. Parts that are not related to description will be omitted for clear description in the drawings, and the same reference numbers will be used throughout this specification to refer to the same or like parts.
(11) Throughout the specification, the term includes should be interpreted not to exclude other elements but to further include such other elements since the corresponding elements may be included unless mentioned otherwise. Further, the same reference numbers will be used throughout this specification to refer to the same or like parts.
(12) In an embodiment of the present disclosure, an analog signal input to an external amplifier is directly input to eco cancellation hardware and the eco cancellation hardware acquires a final media signal prior to a microphone by directly applying a suitable equalizer logic. To this end, it is desirable that the eco cancellation hardware acquire information for applying a suitable equalizer logic from a central processing unit.
(13) An equalizer logic may refer to a processing function per band for correcting frequency characteristics to adjust a tone in the following embodiments but is not limited thereto and may be referred to as equalizer setting.
(14) First, a configuration of an in-vehicle voice recognition apparatus to which embodiments of the present disclosure are applicable will be described with reference to
(15) Referring to
(16) The DSP 512 and the voice recognition module 513 each may be a hardware device and may be an electric circuitry that executes instructions of software which thereby performs various functions described hereinafter. The voice recognition module 513 may further include a controller, a processor, a decoder, and a memory.
(17) The CPU 511 may manage media modes according to user operations and transfer audio data input through the external input device 514 to the DSP 512 through two analog and digital channels. For example, audio data transferred in a digital manner may be AV sound, Bluetooth streaming sound, Android auto-sound, CarPlay sound and the like acquired through the external input device 514 and audio data transferred in an analog manner may be navigation guidance sound and the like, without being limited thereto.
(18) The DSP 512 tunes in/mixes audio data X received from the CPU 511 or received through the radio tuner 520, converts the audio data into an analog signal X through a digital-to-analog converter DAC and transfers the analog signal X to the external amplifier 530. Here, hard wiring is provided between the DSP 520 and the voice recognition module 513 distinguished from the configuration shown in
(19) An equalizer logic corresponding to a current media mode is applied to the analog signal X output from the DSP 520 to convert the analog signal X into a final media signal F(X) in the external amplifier 530 and the final media signal F(X) is output to the speaker 540. In addition, the equalizer logic corresponding to the current media mode is also applied to the analog signal X to convert the analog signal X into the final media signal F(X) in the E/C hardware 513_EC of the voice recognition module 513. To this end, the CPU 511 may notify the external amplifier 530 and the E/C hardware 513_EC of a signal corresponding to the current media mode.
(20) Consequently, the E/C hardware 513_EC can acquire the same signal as the final media output signal F(X) more rapidly compared to a method of receiving a final media output signal from the external amplifier 530 irrespective of a physical distance to the external amplifier 530 by performing an equalization process on the analog signal X in the same manner in which the external amplifier 530 processes the analog signal X although it does not directly receive the final media output signal F(X).
(21) As the E/C hardware 513_EC acquires the final media output signal F(X) in advance, the E/C hardware 513_EC can effectively separate a voice command VC of a user from sound A input through the microphone 550 even if the sound A input through the microphone 550 includes the final media output signal F(X) and the voice command VC of the user.
(22) Accordingly, the voice recognition module 513 can perform pre-processing on the separated voice command VC and compare the pre-processed voice command with values stored in the memory to determine the type of the command. Here, command types may include a wake command and a direct command. The wake command may refer to a command for voice recognition function initiation and the direct command may refer to a command belonging to a command pool which is executable immediately after command recognition even in modes other than a voice recognition mode. Particularly, the eco cancellation function is more important for the wake command because the wake command needs to be recognized all the time even in bad conditions.
(23) Hereinafter, a method of determining an equalizer logic corresponding to a current mode in the E/C hardware will be described with reference to
(24) In
(25) The E/C hardware 513_EC according to the present embodiment may include a mapping table in which equalizer logics are defined for respective modes. Since vehicle models have different optimum equalizer logics according to concepts, brands or sizes thereof, equalizer logics stored in the E/C hardware 513_EC are values optimized for spatial characteristics of vehicles and tuned for respective vehicle models. Equalizer logics may have different characteristics but are not limited thereto. Such equalizer logics are applied for respective modes for the reason below.
(26) Regarding navigation guidance voice, most sound sources are within a range of 200 Hz to 5 kHz as characteristics of human voice. Regarding music, rock music (1 kHz to 20 kHz) and classical music (500 Hz to 20 kHz) use frequencies different from guidance voice. Further, in the case of radio which is a combination of guidance voice and music, an EQ value different from music and navigation guidance voice is applied. In this manner, optimum EQ values are tuned and provided for respective media modes.
(27) Accordingly, an equalizer logic corresponding to a real-time media mode notified by the CPU 511 can be applied. Here, it is desirable that the mapping table included in the E/C hardware and equalizer logics defined therein be the same as a mapping table included in the external amplifier and equalizer logics defined therein.
(28) For example, a mapping table of the CPU and the mapping table of the E/C hardware may have forms as shown in Tables 1 and 2.
(29) TABLE-US-00001 TABLE 1 Mode Value FM 1 AM 2 Music 3 BT 4
(30) TABLE-US-00002 TABLE 2 Value EQ logic 1 F( ) 2 G( ) 3 H( ) 4 I( )
(31) Referring to
(32) Accordingly, the E/C hardware 513_EC checks an equalizer logic (for example, H( ) in Table 2) corresponding to a changed mode identifier and applies the equalizer logic to the analog signal X received from the DSP 512 (S630).
(33) Since the external amplifier 530 also applies the logic H( ) to the analog signal X, the E/C hardware 513_EC can acquire the same media output H(X) as the final media output of the external amplifier 530 in advance and compare the media output H(X) with microphone input to separate a voice command (S640).
(34) According to another embodiment of the present disclosure, the aforementioned eco cancellation function may be applied to customized equalizer logics. This will be described with reference to
(35)
(36) The configuration shown in
(37) The communication module 515 and the personalization service module 516 each may be a hardware device implemented with various electronic circuits to transmit and receive signals via wireless or landline connections.
(38) Referring to
(39) Such a customized equalizer logic is applicable to engine sound tuning as well as general media output.
(40) The above-described embodiments of the present disclosure have the following effects.
(41) It is possible to improve a recognition rate for the wake command which is essential for voice recognition function. Particularly, while the wake command needs to constantly operate even in bad conditions, eco cancellation can be performed on the wake command in real time by acquiring media sound which is a comparison group in advance. Accordingly, voice recognition can be performed using the external amplifier without regard to the physical distance between the external amplifier and the head unit.
(42) In actual vehicle implementation, improved voice recognition can be achieved simply by adding hardware wiring between the E/C hardware and the DSP and adding an equalizer application logic and a selection logic according to modes to the E/C hardware, requiring little additional expenses.
(43) The present disclosure may be implemented as code readable by a computer and stored in a computer-readable recording medium. The computer-readable recording medium includes all kinds of recording devices in which data readable by computer systems is stored. Examples of the computer-readable recording medium include an HDD (Hard Disk Drive), an SSD (Solid State Drive), an SDD (Silicon Disk Drive), a ROM, a RAM, a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, and the like.
(44) Accordingly, the above description needs to be construed in all aspects as illustrative and not restrictive. The scope of the present disclosure should be determined by the appended claims and their legal equivalents, not by the above description, and all changes coming within the meaning and equivalency range of the appended claims are intended to be embraced therein.