G06V30/246

AUDIENCE-BASED OPTIMIZATION OF COMMUNICATION MEDIA
20230080271 · 2023-03-16 ·

Introduced here are communication optimization platforms configured to improve comprehension, persuasion, or clarity of communications. Initially, a communication optimization platform can acquire input sample(s) that are associated with a source audience. The communication optimization platform can then create a linguistic profile for the source audience by examining the content of the input sample(s). Additionally or alternatively, the communication optimization platform may produce a psychographic profile that specifies various characteristics of the source audience, such as personality, opinions, attitudes, interests, etc. The communication optimization platform can then generate, based on the linguistic profile and/or the psychographic profile, affinity language for communicating with a target audience. By incorporating the affinity language into communications, the communication optimization platform can increase appeal to the target audience.

AUDIENCE-BASED OPTIMIZATION OF COMMUNICATION MEDIA
20230080271 · 2023-03-16 ·

Introduced here are communication optimization platforms configured to improve comprehension, persuasion, or clarity of communications. Initially, a communication optimization platform can acquire input sample(s) that are associated with a source audience. The communication optimization platform can then create a linguistic profile for the source audience by examining the content of the input sample(s). Additionally or alternatively, the communication optimization platform may produce a psychographic profile that specifies various characteristics of the source audience, such as personality, opinions, attitudes, interests, etc. The communication optimization platform can then generate, based on the linguistic profile and/or the psychographic profile, affinity language for communicating with a target audience. By incorporating the affinity language into communications, the communication optimization platform can increase appeal to the target audience.

AUTOMATIC LANGUAGE IDENTIFICATION IN IMAGE-BASED DOCUMENTS

The present embodiments relate to identifying a native language of text included in an image-based document. A cloud infrastructure node (e.g., one or more interconnected computing devices implementing a cloud infrastructure) can utilize one or more deep learning models to identify a language of an image-based document (e.g., a scanned document) that is formed of pixels. The cloud infrastructure node can detect text lines that are bounded by bounding boxes in the document, determine a primary script classification of the text in the document, and derive a primary language for the document. Various document management tasks can be performed responsive to determining the language, such as perform optical character recognition (OCR) or derive insights into the text.

AUTOMATIC LANGUAGE IDENTIFICATION IN IMAGE-BASED DOCUMENTS

The present embodiments relate to identifying a native language of text included in an image-based document. A cloud infrastructure node (e.g., one or more interconnected computing devices implementing a cloud infrastructure) can utilize one or more deep learning models to identify a language of an image-based document (e.g., a scanned document) that is formed of pixels. The cloud infrastructure node can detect text lines that are bounded by bounding boxes in the document, determine a primary script classification of the text in the document, and derive a primary language for the document. Various document management tasks can be performed responsive to determining the language, such as perform optical character recognition (OCR) or derive insights into the text.

VISION-BASED DOCUMENT LANGUAGE IDENTIFICATION BY JOINT SUPERVISION

The present embodiments relate to a language identification system for predicting a language and text content of text lines in an image-based document. The language identification system uses a trainable neural network model that integrates multiple neural network models in a single unified end-to-end trainable architecture. A CNN and an RNN of the model can process text lines and derive visual and contextual features of the text lines. The derived features can be used to predict a language and text content for the text line. The CNN and the RNN can be jointly trained by determining losses based on the predicted language and content and corresponding language labels and text labels for each text line.

VISION-BASED DOCUMENT LANGUAGE IDENTIFICATION BY JOINT SUPERVISION

The present embodiments relate to a language identification system for predicting a language and text content of text lines in an image-based document. The language identification system uses a trainable neural network model that integrates multiple neural network models in a single unified end-to-end trainable architecture. A CNN and an RNN of the model can process text lines and derive visual and contextual features of the text lines. The derived features can be used to predict a language and text content for the text line. The CNN and the RNN can be jointly trained by determining losses based on the predicted language and content and corresponding language labels and text labels for each text line.

Audience-based optimization of communication media
11631110 · 2023-04-18 ·

Introduced here are communication optimization platforms configured to improve comprehension, persuasion, or clarity of communications. Initially, a communication optimization platform can acquire input sample(s) that are associated with a source audience. The communication optimization platform can then create a linguistic profile for the source audience by examining the content of the input sample(s). Additionally or alternatively, the communication optimization platform may produce a psychographic profile that specifies various characteristics of the source audience, such as personality, opinions, attitudes, interests, etc. The communication optimization platform can then generate, based on the linguistic profile and/or the psychographic profile, affinity language for communicating with a target audience. By incorporating the affinity language into communications, the communication optimization platform can increase appeal to the target audience.

Audience-based optimization of communication media
11631110 · 2023-04-18 ·

Introduced here are communication optimization platforms configured to improve comprehension, persuasion, or clarity of communications. Initially, a communication optimization platform can acquire input sample(s) that are associated with a source audience. The communication optimization platform can then create a linguistic profile for the source audience by examining the content of the input sample(s). Additionally or alternatively, the communication optimization platform may produce a psychographic profile that specifies various characteristics of the source audience, such as personality, opinions, attitudes, interests, etc. The communication optimization platform can then generate, based on the linguistic profile and/or the psychographic profile, affinity language for communicating with a target audience. By incorporating the affinity language into communications, the communication optimization platform can increase appeal to the target audience.

METHOD, APPARATUS, AND SYSTEM FOR AUTO-REGISTRATION OF NESTED TABLES FROM UNSTRUCTURED CELL ASSOCIATION FOR TABLE-BASED DOCUMENTATION
20220318235 · 2022-10-06 ·

In some forms containing keywords and content, there may be nested levels of keywords, also referred to as a hierarchy. Content in the forms may be associated with one or more keywords in one or more of the nested levels, or in the hierarchy. Identifying keywords in adjacent cells in a table (with a nested keyword being either to the right of or below another keyword) enables distinguishing between keywords and content in filled forms, and enables correct association of content with respective keywords.

Optical character recognition method

The optical character recognition method applies a first OCR engine to provide an identification of characters of at least a first type of characters and zones of at least a second type of characters in the character string image. A second OCR engine is applied on the zones of the at least second type of characters to provide an identification of characters of a second type of characters. The characters identified by the first OCR engine and by the second OCR engine are in a further step combined to obtain the identification of the characters of the character string image.