METHOD AND APPARATUS FOR IMPROVING RECOGNITION ACCURACY FOR THE HANDWRITTEN INPUT OF ALPHANUMERIC CHARACTERS AND GESTURES
20180225507 · 2018-08-09
Inventors
CPC classification
G10L15/22
PHYSICS
H04L67/10
ELECTRICITY
International classification
G10L13/04
PHYSICS
G06F3/0488
PHYSICS
Abstract
A method for automatically selecting one of a plurality of recognition algorithms for a handwritten input of alphanumeric characters and/or gestures into a selected input field displayed on a screen using a touch-sensitive input apparatus comprises carrying out optical character recognition in a region of the screen which comprises at least the input field and the immediate environment of the input field, or carrying out voice recognition for a voice instruction acoustically output after the selected input field has been displayed. Terms describing field types are searched for in the result of the optical character recognition or the voice recognition, and a recognition algorithm which is adapted to a field type found in the result of the optical character recognition or the voice recognition is selected.
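The selection step summarized above can be illustrated with a minimal sketch. It assumes the optical character recognition or voice recognition has already produced a plain text string. The term table, the field types and the recognizer names are hypothetical and are not taken from the application; they merely show how terms found in the recognition result could be mapped to a field type and then to an adapted recognition algorithm.

```python
import re
from typing import Optional

# Hypothetical table of terms that might appear in or near an input field,
# each mapped to an assumed field type. Not taken from the application.
FIELD_TERMS = {
    "phone": "number",
    "zip": "number",
    "house number": "number",
    "street": "text",
    "city": "text",
    "name": "text",
}

# Hypothetical recognizer identifiers, one per assumed field type.
RECOGNIZERS = {
    "number": "digits_only_recognizer",
    "text": "letters_recognizer",
}
DEFAULT_RECOGNIZER = "general_recognizer"


def find_field_type(recognized_text: str) -> Optional[str]:
    """Return the field type of the first known term found as a whole word."""
    lowered = recognized_text.lower()
    for term, field_type in FIELD_TERMS.items():
        if re.search(r"\b" + re.escape(term) + r"\b", lowered):
            return field_type
    return None


def select_recognizer(recognized_text: str) -> str:
    """Pick the recognizer adapted to the field type, or fall back to a default."""
    field_type = find_field_type(recognized_text)
    return RECOGNIZERS.get(field_type, DEFAULT_RECOGNIZER)
```

In this sketch, a recognition result such as "Phone number:" would lead to the digits-only recognizer, while a result containing none of the listed terms falls back to the general recognizer. Selecting a parameter set for a single recognition algorithm instead of a separate algorithm could follow the same pattern, with the second table holding parameter sets.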
Claims
1. A method for automatically selecting one of a plurality of recognition algorithms or one of a plurality of parameter sets for a recognition algorithm for a handwritten input of alphanumeric characters and/or gestures into a selected input field displayed on a screen using a touch-sensitive input apparatus, comprising: carrying out optical character recognition in a region of the screen which includes at least the input field and the immediate environment of the input field, or carrying out voice recognition for a voice instruction acoustically output after the selected input field has been displayed, searching for terms in the result of the optical character recognition or the voice recognition, on the basis of which terms the field type of the input field can be determined, determining the field type on the basis of the terms found, and selecting a recognition algorithm which is adapted to a field type found in the result of the optical character recognition or the voice recognition or a parameter set for the recognition algorithm.
2. The method as claimed in claim 1, wherein the optical character recognition comprises: transmitting an image of what is displayed in at least one region of the screen which includes at least the input field and the immediate environment of the input field to an apparatus or to a computer program for optical character recognition, and receiving the result of the optical character recognition.
3. The method as claimed in claim 1, wherein the voice recognition comprises: recording the acoustically output voice instruction or receiving a signal representing the acoustic voice instruction, and receiving the result of the voice recognition.
4. The method as claimed in claim 3, wherein the signal representing the acoustic voice instruction is a digital or analog representation of electrical signals output via one or more loudspeakers or a control signal for a text-to-speech output unit.
5. The method as claimed in claim 1, wherein the optical character recognition or the voice recognition is carried out after one of a plurality of input fields has been selected on the screen.
6. An apparatus for automatically selecting one of a plurality of recognition algorithms for a handwritten input of alphanumeric characters and/or gestures into a selected input field displayed on a screen using a touch-sensitive input apparatus, comprising: first means which are set up to carry out optical character recognition in a region of the screen which includes at least the input field and the immediate environment of the input field, or are set up to carry out voice recognition for a voice instruction acoustically output after the selected input field has been displayed, second means which are set up to search for terms describing field types in the result of the optical character recognition or the voice recognition, and third means which are set up to select a recognition algorithm which is adapted to a field type found in the result of the optical character recognition or the voice recognition or a parameter set for the recognition algorithm.
7. The apparatus as claimed in claim 6, also comprising a fourth means which is set up to select a recognition algorithm or a parameter set for the recognition algorithm on the basis of possible characters or gestures to be input, wherein the possible characters or gestures to be input are determined from preceding inputs, and wherein the determination is carried out according to linguistic rules or by comparing words stored in a database.
8. The apparatus as claimed in claim 7, wherein one or more of the first, second, third and/or fourth means have one or more microprocessors and main memories and non-volatile memories communicatively connected to the one or more microprocessors, wherein respective non-volatile memories store computer program instructions which, when loaded into the respective main memory from the one or more microprocessors and executed, cause the performance of steps comprising: carrying out optical character recognition in a region of the screen which includes at least the input field and the immediate environment of the input field, or carrying out voice recognition for a voice instruction acoustically output after the selected input field has been displayed, searching for terms in the result of the optical character recognition or the voice recognition, on the basis of which terms the field type of the input field can be determined, determining the field type on the basis of the terms found, and selecting a recognition algorithm which is adapted to a field type found in the result of the optical character recognition or the voice recognition or a parameter set for the recognition algorithm, wherein the first, second and third means together perform all steps of the method.
9. The apparatus as claimed in claim 6, wherein the screen, the first, second, third and/or fourth means are arranged in a manner spatially separated from one another and are connected to one another by means of one or more communication networks.
10. A motor vehicle having an apparatus for automatically selecting one of a plurality of recognition algorithms for a handwritten input of alphanumeric characters and/or gestures into a selected input field displayed on a screen using a touch-sensitive input apparatus, comprising: first means which are set up to carry out optical character recognition in a region of the screen which includes at least the input field and the immediate environment of the input field, or are set up to carry out voice recognition for a voice instruction acoustically output after the selected input field has been displayed, second means which are set up to search for terms describing field types in the result of the optical character recognition or the voice recognition, and third means which are set up to select a recognition algorithm which is adapted to a field type found in the result of the optical character recognition or the voice recognition or a parameter set for the recognition algorithm.
11. The apparatus as claimed in claim 8, wherein the optical character recognition comprises: transmitting an image of what is displayed in at least one region of the screen which includes at least the input field and the immediate environment of the input field to an apparatus or to a computer program for optical character recognition, and receiving the result of the optical character recognition.
12. The apparatus as claimed in claim 8, wherein the voice recognition comprises: recording the acoustically output voice instruction or receiving a signal representing the acoustic voice instruction, and receiving the result of the voice recognition.
13. The method as claimed in claim 12, wherein the signal representing the acoustic voice instruction is a digital or analog representation of electrical signals output via one or more loudspeakers or a control signal for a text-to-speech output unit.
14. The method as claimed in claim 8, wherein the optical character recognition or the voice recognition is carried out after one of a plurality of input fields has been selected on the screen.
15. The apparatus as claimed in claim 7, wherein the screen, the first, second, third and/or fourth means are arranged in a manner spatially separated from one another and are connected to one another by means of one or more communication networks.
16. The apparatus as claimed in claim 8, wherein the screen, the first, second, third and/or fourth means are arranged in a manner spatially separated from one another and are connected to one another by means of one or more communication networks.
17. The apparatus as claimed in claim 10, also comprising a fourth means which is set up to select a recognition algorithm or a parameter set for the recognition algorithm on the basis of possible characters or gestures to be input, wherein the possible characters or gestures to be input are determined from preceding inputs, and wherein the determination is carried out according to linguistic rules or by comparing words stored in a database.
18. The apparatus as claimed in claim 17, wherein one or more of the first, second, third and/or fourth means have one or more microprocessors and main memories and non-volatile memories communicatively connected to the one or more microprocessors, wherein respective non-volatile memories store computer program instructions which, when loaded into the respective main memory from the one or more microprocessors and executed, cause the performance of steps comprising: carrying out optical character recognition in a region of the screen which includes at least the input field and the immediate environment of the input field, or carrying out voice recognition for a voice instruction acoustically output after the selected input field has been displayed, searching for terms in the result of the optical character recognition or the voice recognition, on the basis of which terms the field type of the input field can be determined, determining the field type on the basis of the terms found, and selecting a recognition algorithm which is adapted to a field type found in the result of the optical character recognition or the voice recognition or a parameter set for the recognition algorithm, wherein the first, second and third means together perform all steps of the method.
19. The apparatus as claimed in claim 10, wherein the screen, the first, second, third and/or fourth means are arranged in a manner spatially separated from one another and are connected to one another by means of one or more communication networks.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0036] The invention is described below on the basis of the figures in the drawing. In the drawing:
[0037]
[0038]
[0039]
[0040]
DETAILED DESCRIPTION
[0041] In the figures, the same or similar elements are provided with the same reference symbols.
[0042]
[0043]
[0044]
[0045]
[0046] Optionally, means 412 may also be provided which dynamically constrain the input against a database: taking into account the content already entered, they restrict the inputs that are still expected. This can be carried out locally or remotely, for example using speller functionality already provided by the corresponding application, or alternatively by accessing the corresponding database directly. Structure rules for the input may likewise be retrieved locally or remotely; these structure rules may be country-dependent, for example. This step is used to determine an extended parameter set for the recognition algorithm.
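The restriction of the inputs still expected can be pictured with a short sketch. It assumes that the database is available as a simple in-memory list of known entries and that the restriction is expressed as the set of characters that can validly follow what has been entered so far. The function name and the sample data are hypothetical; the application does not prescribe this representation.

```python
from typing import Iterable, Set


def allowed_next_characters(prefix: str, known_entries: Iterable[str]) -> Set[str]:
    """Return the characters that can follow `prefix` in at least one known entry.

    The comparison is case-insensitive; the returned characters are lowercase.
    """
    prefix_lower = prefix.lower()
    allowed: Set[str] = set()
    for entry in known_entries:
        entry_lower = entry.lower()
        # Only entries that extend the prefix contribute a next character.
        if entry_lower.startswith(prefix_lower) and len(entry_lower) > len(prefix_lower):
            allowed.add(entry_lower[len(prefix_lower)])
    return allowed


# Hypothetical database content standing in for, e.g., a list of city names.
CITIES = ["Berlin", "Bern", "Bonn", "Bremen"]
```

With this sample list, the prefix "Be" leaves only "r" as a possible next character, and "Ber" leaves "l" and "n". Such a set could be passed to the recognition algorithm as part of an extended parameter set so that only the remaining candidates are considered.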