G06F40/53

DISPLAY OF TEXTS
20230214606 · 2023-07-06 ·

The present disclosure relates to a method of displaying two sets of characters, the method being implemented by a computing device, said method comprising: receiving, by the computing device, a first set of characters; receiving, by the computing device, a second set of characters; modifying, by the computing device, an appearance of one or more of the second set of characters to receive, without overlap, one or more of the first set of characters; and displaying, on a device screen, the one or more of the first set of characters and the modified second set of characters, wherein the one or more of the first set of characters are embedded in the modified second set of characters.

DISPLAY OF TEXTS
20230214606 · 2023-07-06 ·

The present disclosure relates to a method of displaying two sets of characters, the method being implemented by a computing device, said method comprising: receiving, by the computing device, a first set of characters; receiving, by the computing device, a second set of characters; modifying, by the computing device, an appearance of one or more of the second set of characters to receive, without overlap, one or more of the first set of characters; and displaying, on a device screen, the one or more of the first set of characters and the modified second set of characters, wherein the one or more of the first set of characters are embedded in the modified second set of characters.

Method and device for sorting Chinese characters, searching Chinese characters and constructing dictionary
20230004707 · 2023-01-05 ·

The invention discloses a method and a device for sorting Chinese characters, searching for Chinese characters and constructing a dictionary, and relates to the technical field of computers. A specific implementation of the method includes: obtaining the first basic character-forming component of a Chinese character according to the stroke order as the First Character, and encoding the First Character to obtain the First Character code, where the First Character includes the first character-forming component and the first main stroke component of a Chinese character; obtaining the number of strokes included in each Chinese character, and obtaining the corresponding stroke string of each Chinese character; using the First Character code as the first and highest priority sorting field, the number of strokes as the second sorting field, and the stroke string as the third and the lowest priority sorting field to sort Chinese characters. This embodiment can solve the problem of difficulty in sorting and searching of Chinese characters caused by the unfixed definition and position of radicals.

Chinese Character Input Method, System and Keyboard
20230004730 · 2023-01-05 ·

The invention discloses a Chinese character input method, system and a keyboard, and relates to the technical field of computers. A specific implementation of the method includes: recognizing the received key signal; in the case where the recognition result of the received key signal indicates a Chinese character Category Code and/or phrase Category Code, determining the recognized Chinese character and/or phrase represented by the Chinese character Category Code and/or the phrase Category Code; where the Chinese character Category Code is a combination of component Category Codes or a combination of component Category Codes and stroke Category Codes, used to represent Chinese characters; phrase Category Codes are combinations of component Category Codes, used to indicate phrases; display the determined Chinese characters and/or phrases. This implementation method solves the problem of messy character splitting, conforms to the character theory, is easy to remember and easy to use, does not require special learning. The entire input process is very natural, there are not many rules, the learning difficulty is reduced, and there are no special requirements for equipment conditions.

Chinese Character Input Method, System and Keyboard
20230004730 · 2023-01-05 ·

The invention discloses a Chinese character input method, system and a keyboard, and relates to the technical field of computers. A specific implementation of the method includes: recognizing the received key signal; in the case where the recognition result of the received key signal indicates a Chinese character Category Code and/or phrase Category Code, determining the recognized Chinese character and/or phrase represented by the Chinese character Category Code and/or the phrase Category Code; where the Chinese character Category Code is a combination of component Category Codes or a combination of component Category Codes and stroke Category Codes, used to represent Chinese characters; phrase Category Codes are combinations of component Category Codes, used to indicate phrases; display the determined Chinese characters and/or phrases. This implementation method solves the problem of messy character splitting, conforms to the character theory, is easy to remember and easy to use, does not require special learning. The entire input process is very natural, there are not many rules, the learning difficulty is reduced, and there are no special requirements for equipment conditions.

METHOD AND SYSTEM FOR AUTOMATIC AUGMENTATION OF SIGN LANGUAGE TRANSLATION IN GLOSS UNITS
20220414350 · 2022-12-29 ·

There are provided a method and system for automatic augmentation of gloss-based sign language translation data. A system for automatic augmentation of sign language translation training data according to an embodiment includes: a database configured to store a sequence of sign language glosses and a sequence of spoken-language words in pairs; and an augmentation module configured to augment the pairs stored in the database. Accordingly, gloss-based training data of high quality may be acquired by performing automatic augmentation for gloss-based training data for sign language translation in an efficient method in terms of time and economic aspects, and eventually, accuracy of translation between sign language glosses and sentences may be enhanced.

METHOD AND SYSTEM FOR AUTOMATIC AUGMENTATION OF SIGN LANGUAGE TRANSLATION IN GLOSS UNITS
20220414350 · 2022-12-29 ·

There are provided a method and system for automatic augmentation of gloss-based sign language translation data. A system for automatic augmentation of sign language translation training data according to an embodiment includes: a database configured to store a sequence of sign language glosses and a sequence of spoken-language words in pairs; and an augmentation module configured to augment the pairs stored in the database. Accordingly, gloss-based training data of high quality may be acquired by performing automatic augmentation for gloss-based training data for sign language translation in an efficient method in terms of time and economic aspects, and eventually, accuracy of translation between sign language glosses and sentences may be enhanced.

Parallel unicode tokenization in a distributed network environment

Unicode data can be protected in a distributed tokenization environment. Data to be tokenized can be accessed or received by a security server, which instantiates a number of tokenization pipelines for parallel tokenization of the data. Unicode token tables are accessed by the security server, and each tokenization pipeline uses the accessed token tables to tokenization a portion of the data. Each tokenization pipeline performs a set of encoding or tokenization operations in parallel and based at least in part on a value received from another tokenization pipeline. The outputs of the tokenization pipelines are combined, producing tokenized data, which can be provided to a remote computing system for storage or processing.

Parallel unicode tokenization in a distributed network environment

Unicode data can be protected in a distributed tokenization environment. Data to be tokenized can be accessed or received by a security server, which instantiates a number of tokenization pipelines for parallel tokenization of the data. Unicode token tables are accessed by the security server, and each tokenization pipeline uses the accessed token tables to tokenization a portion of the data. Each tokenization pipeline performs a set of encoding or tokenization operations in parallel and based at least in part on a value received from another tokenization pipeline. The outputs of the tokenization pipelines are combined, producing tokenized data, which can be provided to a remote computing system for storage or processing.

TEXT PROCESSING METHOD
20230101401 · 2023-03-30 ·

A text processing method is provided. The method includes: a first probability value of each candidate character of a plurality of candidate characters corresponding to a target position is determined based on character feature information corresponding to the target position in a text fragment to be processed, wherein the character feature information is determined based on a context at the target position in the text fragment to be processed; a second probability value of each candidate character of the plurality of candidate characters is determined based on a character string including the candidate character and at least one character in at least one position in the text fragment to be processed adjacent to the target position; and a correction character at the target position is determined based on the first probability value and the second probability value of each candidate character of the plurality of candidate characters.