G06F40/191

AUTOMATIC ENHANCEMENT OF PARAGRAPH JUSTIFICATION
20220405469 · 2022-12-22 ·

Embodiments are disclosed for automatic enhancement of paragraph justification. A method includes receiving a selection of at least one paragraph, determining a plurality of penalty values for at least one typographic feature by varying a typographic feature value, the penalty values indicating a deviation from an optimal layout of the at least one paragraph, determining at least one optimal penalty value for the at least one typographic feature, the at least one optimal penalty value corresponding to at least one optimal typographic feature value of the at least one typographic feature, determining a priority for each of the at least one typographic feature based on a plurality of justification rules and the at least one optimal penalty value, and updating the at least one typographic feature of the at least one paragraph based on the priority and the at least one optimal typographic feature value.

AUTOMATIC ENHANCEMENT OF PARAGRAPH JUSTIFICATION
20220405469 · 2022-12-22 ·

Embodiments are disclosed for automatic enhancement of paragraph justification. A method includes receiving a selection of at least one paragraph, determining a plurality of penalty values for at least one typographic feature by varying a typographic feature value, the penalty values indicating a deviation from an optimal layout of the at least one paragraph, determining at least one optimal penalty value for the at least one typographic feature, the at least one optimal penalty value corresponding to at least one optimal typographic feature value of the at least one typographic feature, determining a priority for each of the at least one typographic feature based on a plurality of justification rules and the at least one optimal penalty value, and updating the at least one typographic feature of the at least one paragraph based on the priority and the at least one optimal typographic feature value.

ARRANGING AND/OR CLEARING SPEECH-TO-TEXT CONTENT WITHOUT A USER PROVIDING EXPRESS INSTRUCTIONS

Implementations described herein relate to an application and/or automated assistant that can identify arrangement operations to perform for arranging text during speech-to-text operations—without a user having to expressly identify the arrangement operations. In some instances, a user that is dictating a document (e.g., an email, a text message, etc.) can provide a spoken utterance to an application in order to incorporate textual content. However, in some of these instances, certain corresponding arrangements are needed for the textual content in the document. The textual content that is derived from the spoken utterance can be arranged by the application based on an intent, vocalization features, and/or contextual features associated with the spoken utterance and/or a type of the application associated with the document, without the user expressly identifying the corresponding arrangements. In this way, the application can infer content arrangement operations from a spoken utterance that only specifies the textual content.

Document analysis apparatus, document analysis method, and computer-readable recording medium
11645448 · 2023-05-09 · ·

A document analysis apparatus 10 includes: a candidate generation unit 11 that, for each line included in a document that is a target of structural analysis, specifies another line in a parallel relationship with the line by performing extraction of a marker indicating a hierarchy, and generates a candidate for a hierarchical structure of the document that is the target based on the result of the specification of each line; and a candidate evaluation unit 12 that, if two or more candidates have been generated, performs evaluation on each candidate for the hierarchical structure, and selects one candidate for the hierarchical structure as the hierarchical structure of the document that is the target based on the evaluation result.

Transformation Of Marked-Up Content Into A File Format That Enables Automated Browser Based Pagination

A method and a file format transformation system (FFTS) for transforming marked-up content in a first file format (FFF) to a second file format (SFF) that enables automated browser based pagination are provided. The FFTS reflows marked-up content of the FFF into a continuous page. The FFTS generates and appends tags to spaces and block elements identified in the reflown marked-up content of the FFF. For each space and block element, the FFTS determines and tags line breaks in the reflown marked-up content. For each line break, the FFTS identifies, tags, and positions anchored floats and footnotes on a current page based on space availability. The FFTS positions page breaks in the continuous page based on a configurable page height and the line breaks. The FFTS groups the marked-up content, inserts pagination elements and renders the grouped marked-up content in the SFF based on a selected level of reversibility.

DOCUMENT ANALYSIS APPARATUS, DOCUMENT ANALYSIS METHOD, AND COMPUTER-READABLE RECORDING MEDIUM
20220164522 · 2022-05-26 · ·

A document analysis apparatus 10 includes: a candidate generation unit 11 that, for each line included in a document that is a target of structural analysis, specifies another line in a parallel relationship with the line by performing extraction of a marker indicating a hierarchy, and generates a candidate for a hierarchical structure of the document that is the target based on the result of the specification of each line; and a candidate evaluation unit 12 that, if two or more candidates have been generated, performs evaluation on each candidate for the hierarchical structure, and selects one candidate for the hierarchical structure as the hierarchical structure of the document that is the target based on the evaluation result.

Text Wrap Modification Using Variable Inset
20230259690 · 2023-08-17 · ·

In implementations of text wrap modification using variable inset, a display screen of a device displays lines of text wrapped to an inset space maintained between an object boundary and the lines of text. The device implements a text wrap modification module to determine that a penalty value associated with a line of text is reduced if the line of text is extended to include one or more words from a subsequent line of text, determine that the one or more words fit within an additional space for the line of text based on a variable overlap of the line of text into the inset space, and display the one or more words from the line of text as extended to include the one or more words from the subsequent line of text.

Machine learning based document editing engine

A method for machine learning based document editing is provided. The method may include receiving, from a client, one or more inputs associated with a document. A recommendation to include and/or exclude a clause, a term, and/or a line item from the document may be generated by at least processing the one or more inputs with a machine learning model. The recommendation to include and/or exclude the clause, the term, and/or the line item from the document may be provided to the client. Related systems and articles of manufacture, including computer program products, are also provided.

AUTOMATIC DETECTION AND REMOVAL OF TYPESETTING ERRORS IN ELECTRONIC DOCUMENTS
20230281379 · 2023-09-07 · ·

Embodiments are disclosed for eliminating typographical errors from an electronic document. The method may include obtaining an electronic document comprising a plurality of text paragraphs. The method may further include detecting a plurality of typographical errors in the plurality of text paragraphs. The method may further include indexing a set of error paragraphs, wherein each paragraph in the set of error paragraphs includes at least one typographical error. The method may further include determining a priority for each typographical error based on a magnitude of the typographical error. The method may further include adjusting one or more attributes of each paragraph in the set of error paragraphs based on the priority for each typographical error.

GUIDED FORM GENERATION IN A DOCUMENT MANAGEMENT SYSTEM

Systems and methods are disclosed herein for generating and modifying a workflow comprising a series of webpages based on an online document. A document management system accesses an online document selected by a user and classifies each field of the online document into one of a set of categories. For each category, the system generates a form webpage comprising questions corresponding to each field classified as the category and combines the generated webpages to create a workflow. The system may modify the workflow by generating and adding one or more additional form webpages based on one or more answers provided by an entity completing the webform page. In response to the entity completing the modified generated workflow, the system generates a completed document based on the online document and the answers provided by the entity.