VOICE CONTROL METHOD AND VOICE CONTROL SYSTEM FOR IN-VEHICLE DEVICE
20200357393 ยท 2020-11-12
Inventors
Cpc classification
G10L15/22
PHYSICS
G06F3/167
PHYSICS
B60K35/00
PERFORMING OPERATIONS; TRANSPORTING
International classification
G10L15/22
PHYSICS
B60K35/00
PERFORMING OPERATIONS; TRANSPORTING
Abstract
A voice control method for an in-vehicle device includes receiving an audio signal by an information capturing device, transmitting the audio signal to a base of the information capturing device by the information capturing device, performing voice recognition on the audio signal by the base to generate at least one context instruction, transmitting the at least one context instruction to a host of an in-vehicle device by the base, and correspondingly controlling an operation of at least one function module of the in-vehicle device according to the at least one context instruction to perform at least one context operation by the host.
Claims
1. A voice control method for an in-vehicle device, comprising: receiving an audio signal by an information capturing device; transmitting the audio signal to a base of the information capturing device by the information capturing device; performing voice recognition on the audio signal by the base to generate at least one context instruction; transmitting the at least one context instruction to a host of the in-vehicle device by the base; and correspondingly controlling at least one function module of the in-vehicle device according to the at least one context instruction to perform at least one context operation by the host.
2. The voice control method for an in-vehicle device according to claim 1, wherein the information capturing device records an ambient sound by using an audio collecting unit to generate the audio signal.
3. The voice control method for an in-vehicle device according to claim 1, wherein the host correspondingly controls an operation of an audio recording module of the in-vehicle device according to an audio recording instruction in the at least one context instruction to activate or deactivate audio recording.
4. The voice control method for an in-vehicle device according to claim 1, wherein the host correspondingly controls an operation of an image capturing unit of the in-vehicle device according to a video recording instruction in the at least one context instruction to activate or deactivate video recording.
5. The voice control method for an in-vehicle device according to claim 1, wherein the host correspondingly controls an operation of an alert light of the in-vehicle device according to an alert instruction in the at least one context instruction to turn on or turn off the alert light.
6. The voice control method for an in-vehicle device according to claim 1, wherein the host correspondingly controls an operation of a network module of the in-vehicle device according to a help instruction in the at least one context instruction to output an emergency event alert.
7. The voice control method for an in-vehicle device according to claim 1, further comprising: translating the audio signal; duplicating the translated audio signal to obtain a plurality of input audios; and outputting one of the input audios through an audio input/output router to the host.
8. The voice control method for an in-vehicle device according to claim 7, wherein the step of performing voice recognition on the audio signal comprises voice recognition on another of the input audios.
9. The voice control method for an in-vehicle device according to claim 1, wherein the step of transmitting the at least one context instruction to the host comprises outputting the at least one context instruction through a USB input/output interface to the host.
10. A voice control method for an in-vehicle device, suitable for a base of an information capturing device, the voice control method for an in-vehicle device comprising: receiving an audio signal from the information capturing device; performing voice recognition on the audio signal to generate at least one context instruction; and transmitting the at least one context instruction to a host of the in-vehicle device to enable the host to correspondingly control an operation of at least one function module of the in-vehicle device according to the at least one context instruction to perform at least one context operation.
11. A voice control system for an in-vehicle device, comprising: an information capturing device, detecting and generating an audio signal; a base, for mounting and charging the information capturing device, the base comprising: a first connecting unit, electrically and signally coupled to the information capturing device, configured to receive the audio signal; a voice recognition engine, electrically and signal coupled to the first connecting unit, configured to perform voice recognition on the audio signal to generate at least one context instruction; and a second connecting unit, electrically and signally coupled to the voice recognition engine, configured to output the at least one context instruction; at least one function module, individually performing a context operation; and a host, electrically and signally coupled to the second connecting unit and the at least one function module, configured to receive the at least one context instruction outputted from the second connecting unit, and to correspondingly control an operation of the at least one function module according to the at least one context instruction.
12. The voice control system for an in-vehicle device according to claim 11, wherein the at least one context instruction comprises an audio recording activating instruction, an audio recording deactivating instruction, a video recording activating instruction, a video recording deactivating instruction, an alert turning-on instruction, an alert turning-off instruction, a help instruction, or any combination thereof.
13. The voice control system for an in-vehicle device according to claim 11, wherein the at least one function module is an audio recording module, an image capturing unit, an alert light, a network module, or any combination thereof.
14. The voice control system for an in-vehicle device according to claim 11, wherein the voice recognition engine comprises: a translation circuit, electrically and signally coupled to the first connecting unit, configured to translate the audio signal; and a recognition circuit, electrically and signally coupled to the translation circuit, configured to perform voice recognition on the translated audio signal.
15. The voice control system for an in-vehicle device according to claim 14, wherein the voice recognition engine further comprises: a one-to-many router circuit, electrically and signally coupled between the translation circuit and the recognition circuit and between the first connecting unit and the second connecting unit, configured to duplicate the audio signal to obtain a plurality of input audios, and to output the input audios to the recognition circuit and the host.
16. The voice control system for an in-vehicle device according to claim 15, wherein the second connecting unit is an audio input/output router, and the audio input/output router is configured to output one of the input audios to the host.
17. The voice control system for an in-vehicle device according to claim 11, wherein the second connecting unit is a USB input/output interface, and the voice recognition engine outputs the at least one context instruction to the host through the USB input/output interface.
18. The voice control system for an in-vehicle device according to claim 11, wherein the base further comprises: a charging circuit, electrically and signally coupled to the first connecting unit, configured to charge the information capturing device when the information capturing device is electrically and signally coupled to the first connecting unit.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0008]
[0009]
[0010]
[0011]
[0012]
[0013]
[0014]
[0015]
DETAILED DESCRIPTION OF THE EMBODIMENTS
[0016] Referring to
[0017] Referring to
[0018] In some embodiments, the information capturing device 10 is coupled to a connecting unit (to be referred to as the first connecting unit 220 below) of the base 20 through a connecting unit thereof 120, so as to electrically and signally couple the information capturing device 10 to the base 20. The base 20 can be electrically and signally coupled by another connecting unit thereof (to be referred to as a second connecting unit 230) to the connecting unit (not shown) of the host 30 through a USB connecting line 51 and an audio line 53, so as to electrically and signally couple the base 20 to the host 30. The host 30 is further electrically and signally coupled to the function modules 40. In an example, the information capturing device 10 can be a wireless microphone, a wireless walkie talkie or a portable camera device. Wherein, the portable camera device can be a video recorder, a body video camera, a portable search video camera or a miniature video camera mounted on a hat or a garment. The host 30 can be, for example, a digital video recorder in car (car DVR)).
[0019] In some embodiments, the host 30 and the function modules 40 coupled thereto can be built in an in-vehicle device and/or be externally connected to one or more in-vehicle devices (including the function modules 40). The base 20, the host 30 and the in-vehicle device corresponding to the host 30 can be mounted on a mobile carrier (e.g., a vehicle), and the information capturing device 10 can be mounted in a pluggable form on the base 20.
[0020] When the information capturing device 10 is a wireless microphone or a wireless walkie talkie, the information capturing device 10 includes a processing unit 110, a connecting unit 120 and an audio collecting unit 150.
[0021] The processing unit 110 is electrically and signally coupled to the connecting unit 120 and the audio collecting unit 150.
[0022] In some embodiments, when the information capturing device 10 is a portable camera device, the information capturing device 10 further includes an image capturing unit 140. In a general operation, the processing unit 110 can perform image capturing on an ambient environment by using the image capturing unit 140 and corresponding generate an image signal. In some embodiments, the image capturing unit 140 can be implemented by a camera lens, a light sensing unit or an image processing unit, wherein the image processing unit can be an image signal processor (ISP).
[0023] Referring to
[0024] In some embodiments, the base 20 includes a voice recognition engine 210 and two connecting units (to be referred to as a first connecting unit 220 and a second connecting unit 230 below). The voice recognition engine 210 is electrically and signally coupled to the first connecting unit 220 and the second connecting unit 230.
[0025] At this point, the base 20 is electrically and signally coupled to the information capturing device 10 through the first connecting unit 220, so as to receive the audio signal from the information capturing device 10. The voice recognition engine 210 receives the audio signal outputted from the connecting unit 120 through the first connecting unit 220, and performs voice recognition on the audio signal to generate at least one context instruction (step S05). The at least one context instruction generated by the voice recognition engine 210 of the base 20 is transmitted to the host 30 through the second connecting unit 230 (step S07). In one example, the second connecting unit 230 can be a USB input/output (I/O) interface. At this point, the base 20 outputs the at least one context instruction to the host 30 through the USB I/O interface.
[0026] The host 30 receives the at least one context instruction outputted from the second connecting unit 230, and correspondingly controls an operation of the function module 40 according to the at least one context instruction received to perform at least one context operation (step S09). The number of the context instruction outputted each time from the second connecting unit 230 can be one, or be two, three or more according to requirements.
[0027] In some embodiments, the context instruction can be an audio recording instruction, and the function module 40 can include an audio recording module. At this point, the host 30 correspondingly controls an operation of the audio recording module of the in-vehicle device to activate or deactivate audio recording. The audio recording instruction can be an audio recording activating instruction or an audio recording deactivating instruction. In one example, the host 30 correspondingly controls and activates the audio recording module according to the audio recording activating instruction to perform audio recording. In another example, the host 30 correspondingly controls and deactivates the audio recording module according to the audio recording deactivating instruction to stop audio recording.
[0028] In some embodiments, the context instruction can include a video recording instruction, and the function module 40 can include an image capturing unit. At this point, the host 30 correspondingly controls an operation of the image capturing unit of the in-vehicle device according to the video recording instruction to activate or deactivate video recording. The video recording instruction can be a video recording activating instruction or a video recording deactivating instruction. In one example, the host 30 correspondingly controls and activates the image capturing unit according to the video recording activating instruction to perform video recording. In another example, the host 30 correspondingly controls and deactivates the image capturing unit according to the video recording deactivating instruction to stop video recording.
[0029] In some embodiments, the context instruction can include an alert instruction, and the function module 40 can include an alert light. At this point, the host 30 correspondingly controls an operation of the alert light of the in-vehicle device to turn on or turn off according to the alert instruction, wherein the alert instruction can be an alert turning-on instruction or an alert turning-off instruction. In one example, the host 30 correspondingly turns on the alert light according to the alert turning-on instruction. In another example, the host 30 correspondingly turns off the alert light according to the alert turning-off instruction.
[0030] In some embodiment, the context instruction can include a help instruction, and the function module 40 can include a network module. At this point, the host 30 correspondingly controls an operation of the network module of the in-vehicle device according to the help instruction to output an emergency event alert to a remote monitoring center.
[0031] In some embodiments, the base 20 further includes a charging circuit 240, of which one end is coupled to a power supply of the vehicle and the other end is coupled to the first connecting unit 220. When the information capturing device 10 is mounted on the base 20 (i.e., when the information capturing device 10 is electrically and signally coupled to the first connecting unit 220), the charging circuit 240 of the base 20 further charges the information capturing device 10 through the first connecting unit 220. In one example, the charging circuit 240 receives power from the power supply, and provides an appropriately adjusted voltage to the information capturing device 10 through the first connecting unit 220, as power needed for operating the information capturing device 10.
[0032] In some embodiments, referring to
[0033] In some embodiments, referring to
[0034] For example, the control signal can be an activation signal, and the processing unit 110 receives the activation signal and enables the image capturing unit 140 in response to the activation signal. At this point, the indication light group 160 provides an indication signal to thereby notify the user that the image capturing unit 140 is currently performing video recording. In another example, the control signal can be a stop signal, and the processing unit 110 receives the stop signal and disables the image capturing unit 140 in response to the stop signal. At this point, the indication light group 160 provides an indication signal or stops providing the originally provided indication signal to thereby notify the user that the image capturing unit 140 has stopped video recording. In yet another example, the control signal can be a selection signal, and the processing unit 110 receives the selection signal and switches a video recording mode in response to the selection signal. At this point, the indication light group 160 provides an indication signal to thereby indicate the current video recording mode (the video recording mode after switching).
[0035] In some embodiments, referring to
[0036] In some embodiment, referring to
[0037] In some embodiments, referring to
[0038] In some embodiments, referring to
[0039] The translation circuit 212 receives through the first connecting unit 220 the audio signal from the information capturing device 10, and translates the audio signal. The one-to-many router circuit 216 duplicates and routes the audio signal to obtain multiple input audios, outputs one of the input audios to the recognition circuit 214 for the recognition circuit 214 to perform recognition on the input audio, and further outputs another of the input audios through the second connecting unit 230 to the host 30. In one example, when an audio recording activating instruction is generated from the voice recognition performed on the translated audio signal by the recognition circuit 214, the host 30 can correspondingly control and activate the audio recording module according to the audio recording activating instruction to record the input audio received; meanwhile, the recognition circuit 214 can still continue recognition on the input audio and perform processing corresponding to the instruction.
[0040] In some embodiments, referring to
[0041] In some embodiments, the base 20 can further include an antenna 270 and a wireless communication module 272. The wireless communication module 272 is electrically and signally coupled between the antenna 270 and the translation circuit 212, and/or electrically and signally coupled between the antenna 270 and the recognition circuit 214. The wireless communication module 272 establishes wireless communication by using the antennas 270 and 170 to the information capturing device 10 to transmit wireless RF signals (e.g., the foregoing audio signal and/or operation instruction). In other words, the translation circuit 212 (and/or the recognition circuit 214) and the wireless communication module 272 wirelessly communicate with the processing unit 110 through the antennas 270 and 170. The wireless communication module 272 can correspond to the information capturing device 10 supporting wireless communication to thereby support a long-distance wireless transmission technology or a short-distance wireless transmission technology.
[0042] In some embodiments, referring to
[0043] At this point, the audio collecting unit 250 is electrically and signally coupled to the translation circuit 212. The translation circuit 212 can record an ambient sound by using the audio collecting unit 250 to correspondingly generate an audio signal, and translate the audio signal. The one-to-many router circuit 216 duplicates and routes the audio signal to obtain multiple input audios, outputs one of the input audios to the recognition circuit 214 for the recognition circuit 214 to perform recognition on the input audio, and further outputs another of the input audios through the second connecting unit 230 to the host 30. In an example, when the information capturing device 10 is not plugged on the base 20, and an audio recording activating instruction and an alert activating instruction are generated from the voice recognition performed on the translated input audio by the recognition circuit 214, the host 30 can correspondingly control and activate/turn on the audio recording module and the alert light according to the audio recording activating instruction and the alert light turning-on instruction, so as to record the input audio received and to turn on the alert light.
[0044] In some embodiment, the voice recognition engine 210 can be implemented by one or more processing units.
[0045] In some embodiments, each of the processing units can be a microprocessor, a microcontroller, a digital signal processor (DSP), a microcomputer, a central processing unit (CPU), a field-programmable gate array (FPGA), programmable logical device, a state machine, a logical circuit, an analog circuit, a digital circuit, and/or any device operating a signal (analog or digital) based on an operation instruction.
[0046] In some embodiments, one of the connecting unit 120 and the first connecting unit 220 can be, for example, a POGO connector, and the other can be a connecting pad group matching the POGO connector.
[0047] In some embodiments, each of the audio recording units can be a built-in microphone or microphone array.
[0048] In some embodiments, the long-distance wireless transmission technology can be, for example but not limited to, transmission technologies such as a Wi-Fi (a Wi-Fi hotspot) and Long-Term Evolution (LTE). The short-distance wireless transmission or broadcasting technology can be, for example but not limited to, transmission technologies such as infrared, Bluetooth, UWB, ZigBee, ANT, Near-Field Communication (NFC).
[0049] In conclusion, the voice control method and the voice control system for an in-vehicle device provided according to the embodiments of the present invention are capable of obtaining actual voice contents by performing voice recognition on an audio signal using a base to further output a corresponding context instruction, thereby enabling a host to perform an operation corresponding to the context instruction in response to the context instruction.