VOICE-CONTROLLED DISPLAY DEVICE AND METHOD FOR EXTRACTING VOICE SIGNALS
20190369955 ยท 2019-12-05
Assignee
Inventors
- Cheng-Lung LIN (New Taipei City, TW)
- Yen-Yun Chang (New Taipei City, TW)
- Chic-Chen HUANG (New Taipei City, TW)
- Shih-Pin CHANG (New Taipei City, TW)
Cpc classification
G10L15/22
PHYSICS
A63F13/424
HUMAN NECESSITIES
G06F3/167
PHYSICS
H04N21/42204
ELECTRICITY
G10L2021/02161
PHYSICS
International classification
Abstract
A voice-controlled display device comprises a display panel, a signal input port, two microphones, a microprocessor and a display controller. The signal input port is configured to receive a first video signal from a host. Each of the microphone comprises a sound-receiving terminal for receiving an external audio, wherein the sound-receiving terminal is disposed adjacent to the display panel and the sound-receiving terminal and the display panel are located on the same side of the voice-controlled display device. The microprocessor electrically connects to the microphones and the microprocessor performs a voice recognition procedure to obtain an instruction according to the external audio. The display controller electrically connects to the signal input port, the display panel and the microprocessor, wherein the display controller transforms the first video signal to a second video signal and the display panel display one of the first video signal and the second video signal.
Claims
1. A voice-controlled display device comprising: a display panel; a signal input port configured to receive a first video signal from a host, a first microphone comprising a first sound-receiving terminal for receiving an external audio, wherein the first sound-receiving terminal is disposed adjacent to the display panel, and the first sound-receiving terminal and the display panel are located on the same side of the voice-controlled display device; a second microphone comprising a second sound-receiving terminal for receiving the external audio, wherein the second sound-receiving terminal is disposed adjacent to the display panel and the first sound-receiving terminal, and the second sound-receiving terminal and the display panel are located on the same side of the voice-controlled display device; a microprocessor electrically connecting to the first microphone and the second microphone, wherein the microprocessor performs a voice recognition procedure to obtain an instruction according to the external audio; and a display controller electrically connecting to the signal input port, the display panel and the microprocessor, wherein the display controller transforms the first video signal to a second video signal according to the instruction, and the display panel displays an image corresponding to one of the first video signal and the second video signal.
2. The voice-controlled display device of claim 1, wherein a distance between the first sound-receiving terminal and the second sound-receiving terminal is 2-4 centimeters.
3. The voice-controlled display device of claim 1, wherein the first microphone and the second microphone are directional microphones.
4. The voice-controlled display device of claim 3, wherein a coverage angle of each of the directional microphones is 15-60 degrees, and a coverage angular range of the first microphone and a coverage angular range of the second microphone overlap with each other to define an intersectional area.
5. The voice-controlled display device of claim 1, wherein an image corresponding to the first video signal comprises a default display area, and according to the instruction, an image corresponding to the second video signal generated by the display controller and transformed from the first video signal has an enlarged image of the default display area.
6. The voice-controlled display device of claim 1 further comprising a light module electrically connecting to the display controller, wherein the light module is configured to emit a light with a specified color according to the instruction.
7. A method for extracting voice signals comprising: receiving two external audio signals by a first microphone and a second microphone respectively, wherein a first receiving terminal of the first microphone and a second receiving terminal of the second microphone are located on the same side of a voice-controlled display device; calculating two waveforms of said two external audio signals by a microprocessor; calculating a difference between said two waveforms by the microprocessor; performing a voice recognition procedure to obtain an instruction according to the external audio by the microprocessor when the difference is smaller than a threshold, or dropping said two waveforms by the microprocessor when the difference is larger than or equals to the threshold.
8. The method for extracting voice signals of claim 7, wherein the difference is a time difference or an intensity difference.
9. The method for extracting voice signals of claim 7, wherein a distance between the first sound-receiving terminal and the second sound-receiving terminal is 2-4 centimeters.
10. The method for extracting voice signals of claim 7, wherein the first microphone and the second microphone are directional microphones.
11. The method for extracting voice signals of claim 10, wherein a coverage angle of each of the directional microphones is 15-60 degrees, and a coverage angular range of the first microphone and a coverage angular range of the second microphone overlap with each other to define an intersectional area.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0006] The present disclosure will become more fully understood from the detailed description given hereinbelow and the accompanying drawings which are given by way of illustration only and thus are not limitative of the present disclosure and wherein:
[0007]
[0008]
[0009]
[0010]
[0011]
[0012]
DETAILED DESCRIPTION
[0013] In the following detailed description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the disclosed embodiments. It will be apparent, however, that one or more embodiments may be practiced without these specific details. In other instances, well-known structures and devices are schematically shown in order to simplify the drawings.
[0014] Please refer to
[0015] The display panel 1 is an element for showing an image, and the user is able to view the image via the display panel 1. In practice, the display panel 1 may be the twisted nematic (TN) panel, the in-plane-switching (IPS) panel or the vertical alignment (VA) panel. However, the hardware structure of the display panel 1 is not limited by aforementioned examples.
[0016] The signal input port 3 is adapted for receiving the first video signal from a host, wherein the host may be such as a personal computer (PC), a server, a smart phone or a tablet having the central processing unit (CPU). However, the host is not limited by aforementioned examples. In practice, the signal input port 3 may be the interface such as the D-SUB (subminiature), the digital video interface (DVI), the high definition multimedia interface (HDMI) or the DisplayPort (DP).
[0017] The microphone 5 is adapted for receiving the external audio. In practice, the microphone 5 may be a microelectromechanical systems (MEMS) microphone. It is worth to emphasizing that, the configuration of two microphones as the first microphone 52 and the second microphone 54 shown in
[0018] Please refer to
[0019] Please refer to
[0020] In an embodiment of this disclosure, the voice recognition procedure is mainly associated to an algorithm. Specifically, after the microprocessor 7 obtains the external audio, the voice recognition procedure calculates a time difference between two microphones receiving the same voice. When the time difference is smaller a threshold, the voice recognition procedure uses the external audio to perform the voice recognition for obtaining the voice instruction included in the external audio. When the time difference is larger or equals to than the threshold, the voice recognition procedure drops the external audio. The setting of the threshold is associated with the distance between the first sound-receiving terminal 52 and the second sound-receiving terminal 54. In another aspect, when the external audio is generated at the place out of the intersectional area P and is received by the microphone 5, the voice recognition procedure is able to exclude the voice signal such as aforementioned example. Hence, it could make the voice-controlled display device avoid to mistake the environmental noise as the voice instruction. Base on aforementioned mechanics, the microprocessor 7 is able to perform the voice recognition for the voice signal in the range of the intersectional area P in an embodiment of this disclosure. On the other hand, in addition to the time difference, the intensity difference or other measurements which are able to show the distance of the voice transmission could also be used as the criterion, and this disclosure is not limited by aforementioned measurements.
[0021] Please refer to
[0022] From another aspect, the second video signal may be an enlarging signal, so that the image corresponding to the second video signal includes an enlarged image of the default display area. For example, the player often needs to enlarge a part of the image for viewing more clearly and operating more preciously during the video game. Please refer to
[0023] In another embodiment of this disclosure, the voice-controlled display device further comprises a light module electrically connected to the display controller 9. Also, the light module is adapted for emitting a light with a specified color according to the instruction. In practice, the light module may be a light emitting diode (LED) disposed at the back of the display panel 1 in the voice-controlled display device. The emitting time and the color of the light are able to be controlled via the instruction, wherein the instruction is the voice instruction received by the first microphone 52 and the second microphone 54 on the front of the display panel 1. Compared to the conventional display device which is only adapted for outputting an image, the voice-controlled display device disclosed by this disclosure is further used as an inputting device adapted for controlling the peripheral light. Hence, the visual experience may be improved when the user watches the screen. In addition, in comparison with the light module provided by the conventional game host whose setting is only able to be edited through the operation interface of the manufacture, the control method of the voice instruction used by the voice-controlled display device in an embodiment of this disclosure provides a simpler and more intuitive way to control or set the parameter. As a result, the user does not need to spend extra time to learn how to control or set the parameter.
[0024] Please refer to
[0025] As a result, the voice-controlled display device disclosed by this disclosure uses two directional microphones disposed at the same side of the display panel to receive the same external audio. Furthermore, the external audio recorded from the outside of the best sensitive angular range is considered as the ambient noise and is filtered out. Since the method for extracting the voice signal disclosed by this disclosure does not use the conventional way which the ambient noise is deducted from the external audio by the hardware circuit, the reorganization of the ambient noise may be improved through the algorithm which is able to be adjusted continuously and preciously. Hence, the voice recognition procedure performed by the microprocessor is able to recognize the voice sent from the user and output the corresponding voice instruction, and the display controller further uses the voice instruction to transform a first image to a second image. Also, the display controller shows the first image and the second image via the display panel. Therefore, the common user is able to change the display mode of the screen easily for achieving the best screen viewing experience. On the other hand, for the professional video game player, the scene and the display are able to be switched currently during the game, so the player does not need to spend extra time for switching the scene or the display manually during the game. For these reasons, the voice-controlled display device and the method for extracting the voice signal disclosed by this disclosure provides a friendlier way to control the screen, and the operation experience during the game is able to be improved.