Finger reading method and device based on visual gestures
11087637 · 2021-08-10
CPC classification (PHYSICS section): G06F3/017 · G06F3/167 · G06V40/28 · G10L15/22 · G09B5/062 · G10L13/04
International classification (PHYSICS section): G10L15/22 · G10L13/04 · G09B5/06
Abstract
A finger reading method and device based on visual gestures. A user circles a required finger reading region on a book with a finger; a camera captures the circling action, and an image processing module acquires the position of the fingertip according to a profile analysis algorithm, acquires the endpoints of the track edge in the upper, lower, left and right directions, and fits a rectangle to these endpoints in order to identify the content, such as characters or graphs, of the rectangular region. Voice synthesis is then performed, and the voice information is fed back to the user to realize the finger reading function. The device can be worn on the user's head, and includes a camera and bone conduction earphones arranged on both sides.
Claims
1. A finger reading method based on visual gestures, comprising the following steps: 1) capturing, by a camera, a circle making action of a finger, the circle making action referring to a user making a circle on a required finger reading region on a book by using a finger; 2) determining, by an image processing module, a rectangular region according to the captured circle making action; 3) identifying, by the image processing module, characters or graphs of the determined rectangular region; 4) performing voice synthesis, by a voice processing module, according to the result of identifying the characters or graphs of the rectangular region, or according to an internet search result based on the identified result, generating synthesized voice data from the voice synthesis, and playing the synthesized voice data on a playing device; wherein in step 2), the image processing module acquires the fingertip position by a profile analysis algorithm, acquires edge end points in the upper, lower, left and right directions on the captured circle making action of the finger, and then fits the rectangular region according to the edge end points.
2. The finger reading method based on visual gestures according to claim 1, wherein step 4) further comprises the user web-searching designated vocabularies or content by voice command.
3. The finger reading method based on visual gestures according to claim 1, wherein the voice processing module further identifies fixed clauses of the user for issuing commands.
4. The finger reading method based on visual gestures according to claim 1, wherein in step 2), the image processing module first analyzes a camera image by a skin color segmentation algorithm and detects whether a human hand appears in the camera view; if not, it continues to analyze the camera image by the skin color segmentation algorithm; if yes, the camera captures the circle making action of the finger.
5. A device for detecting visual gestures, comprising a main housing, a camera and a bone voice conduction module, the main housing having an image processing module, a voice processing module, a WiFi network module and an embedded microprocessor module therein; the camera being mounted on or embedded in the main housing; the bone voice conduction module being located on both sides of the main housing for attaching on the cheekbones above a user's ears; a scope of the camera covering a required reading field for capturing images to acquire the circle making action of the user's finger and the content image to be identified; the image processing module acquiring the user's finger moving track in the images within the camera scope, acquiring edge end points in the upper, lower, left and right directions on the finger moving track, fitting a rectangle around the finger moving track according to the edge end points, and intelligently identifying the content in the fitted rectangular region; the voice processing module performing voice synthesis according to the identified result of the image processing module or a network search result, and identifying fixed clauses of the user for issuing commands; the bone voice conduction module providing learning instruction and voice prompts by bone conduction according to the output of the voice processing module; the WiFi network module being used in such a way that, after accessing a LAN or the internet, the voice processing module web-searches designated vocabularies or content by identifying the user's voice commands, and transmits designated content to a LAN or internet database server after voice instruction recognition for expanded content searching; the embedded microprocessor module having an embedded processor therein for controlling the communication and working order of the modules.
6. The device according to claim 5, wherein the bone voice conduction module is implemented by bone conduction earphones.
7. The device according to claim 5, wherein the main housing has the shape of a head band for being worn on the forehead and the back of the head, and the camera is arranged in the middle of the head band.
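The geometric core of step 2) in claim 1 — deriving a rectangular region from the extreme edge points of the fingertip track — can be sketched in a few lines of Python. This is an illustrative reconstruction, not the patented implementation: it assumes the track has already been reduced by the profile-analysis step to a list of (x, y) fingertip positions in image coordinates.

```python
def fit_rectangle(track):
    """Fit an axis-aligned rectangle to a closed fingertip track.

    `track` is a list of (x, y) fingertip positions captured while the
    user circles the region to be read.  Per step 2) of claim 1, the
    rectangle is defined by the extreme edge points in the upper, lower,
    left and right directions.
    """
    if not track:
        raise ValueError("empty fingertip track")
    xs = [p[0] for p in track]
    ys = [p[1] for p in track]
    left, right = min(xs), max(xs)
    top, bottom = min(ys), max(ys)  # image coordinates: y grows downward
    return (left, top, right, bottom)

# A rough circular track around a word on the page:
track = [(40, 10), (60, 12), (75, 30), (70, 55), (45, 60), (25, 40), (30, 18)]
print(fit_rectangle(track))  # → (25, 10, 75, 60)
```

Because the fit uses only the four extreme points, the rectangle is robust to an irregular, hand-drawn circle: any closed loop around the target region yields the same axis-aligned bounding box.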
Description
BRIEF DESCRIPTION OF THE DRAWINGS
DETAILED DESCRIPTION OF THE INVENTION
(6) The present invention will be further described in combination with the appended drawings and detailed embodiments.
(7) As shown in
(8) The finger reading device based on visual gesture, as shown in
(9) The finger reading method based on visual gesture, as shown in
(10) As can be seen, the device of the present invention is a wearable device which can be worn on the user's head. When the user needs finger reading, the camera on the device identifies the user's finger track and fits it to a rectangle, the content in the rectangular region is intelligently identified, and the identified character content is processed by TTS voice synthesis and fed back to the user by bone conduction technology. Moreover, when the user wants to know more information related to the identified content, a voice command may direct the device to access the network and search for related content, which is likewise fed back to the user by bone conduction. The device applies wearable design principles, frees the user from holding an electronic device while learning, and provides intelligent finger reading learning instruction for ordinary printed material.
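The overall flow described above — skin-color gating (claim 4), fingertip tracking, rectangle fitting, content identification, and voice feedback — can be sketched as one control loop. Everything here is a hypothetical illustration: `detect_hand`, `fingertip`, `identify` and `speak` are stand-in callables for the device's modules, not the actual implementation.

```python
def bounding_rect(track):
    """Axis-aligned rectangle from the extreme points of the track."""
    xs = [p[0] for p in track]
    ys = [p[1] for p in track]
    return (min(xs), min(ys), max(xs), max(ys))

def finger_reading_cycle(frames, detect_hand, fingertip, identify, speak):
    """One finger-reading cycle, following the described flow: keep
    analyzing frames until skin-color segmentation reports a hand, then
    collect fingertip positions, fit the rectangle, identify its content
    and hand the result to speech synthesis."""
    track = []
    for frame in frames:
        if not track and not detect_hand(frame):
            continue                    # no hand yet: keep analyzing (claim 4)
        track.append(fingertip(frame))  # profile-analysis fingertip position
    if not track:
        return None
    rect = bounding_rect(track)
    text = identify(rect)               # OCR / graph identification stub
    speak(text)                         # TTS + bone-conduction playback stub
    return rect

# Toy run with stand-in callables:
frames = ["no-hand", (40, 10), (75, 30), (45, 60), (25, 40)]
spoken = []
rect = finger_reading_cycle(
    frames,
    detect_hand=lambda f: isinstance(f, tuple),
    fingertip=lambda f: f,
    identify=lambda r: f"text in {r}",
    speak=spoken.append,
)
print(rect, spoken)  # → (25, 10, 75, 60) ['text in (25, 10, 75, 60)']
```

In the real device, the frames would come from the head-mounted camera, the identification step would run character/graph recognition (optionally followed by a network search), and `speak` would drive the bone conduction earphones.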