Patent classifications
G06F40/154
Controlling mark positions in documents
A document is represented as a node tree in a document processing system. Edits to a node are represented in a change record that has a one-way link to the node. A text mark has a one-way link to the change record. It deletes that link when the changes represented by the change record are reflected in the text mark. A memory management system releases the memory allocated to the change record when no other object links to it.
Controlling mark positions in documents
A document is represented as a node tree in a document processing system. Edits to a node are represented in a change record that has a one-way link to the node. A text mark has a one-way link to the change record. It deletes that link when the changes represented by the change record are reflected in the text mark. A memory management system releases the memory allocated to the change record when no other object links to it.
Configuration of content site user interaction monitoring in data networks
Style instructions, which may be coded in a style language and not include application logic, can be used to both control presentation of content of a web page by a web browser and control monitoring of interactions with the web page. Selectors in the style instructions, for example, can be used to identify elements of the web page and define styles for the elements. As a result of implementing such style instructions, individuals tasked with design of a web page can more easily and quickly configure monitoring of interactions with the web page.
Configuration of content site user interaction monitoring in data networks
Style instructions, which may be coded in a style language and not include application logic, can be used to both control presentation of content of a web page by a web browser and control monitoring of interactions with the web page. Selectors in the style instructions, for example, can be used to identify elements of the web page and define styles for the elements. As a result of implementing such style instructions, individuals tasked with design of a web page can more easily and quickly configure monitoring of interactions with the web page.
Hierarchical entity recognition and semantic modeling framework for information extraction
Extracting entities from a document with a hierarchical entity graph of entities. Entity definitions and entity recognition definitions are customized by a user and provided. The configuration information is utilized to generate (905) an entity graph, which is then utilized to parse one or more documents. In some implementations, the resulting parse tree may be utilized, in conjunction with user feedback, to generate one or more training instances for a machine learning model assigned to one or more of the custom nodes as an entity recognition definition. Parsing of the resulting tree may be performed with a lazy parsing methodology, with only the portions of interest to the user being identified in the document.
Hierarchical entity recognition and semantic modeling framework for information extraction
Extracting entities from a document with a hierarchical entity graph of entities. Entity definitions and entity recognition definitions are customized by a user and provided. The configuration information is utilized to generate (905) an entity graph, which is then utilized to parse one or more documents. In some implementations, the resulting parse tree may be utilized, in conjunction with user feedback, to generate one or more training instances for a machine learning model assigned to one or more of the custom nodes as an entity recognition definition. Parsing of the resulting tree may be performed with a lazy parsing methodology, with only the portions of interest to the user being identified in the document.
Structured text translation
Approaches for the translation of structured text include an embedding module for encoding and embedding source text in a first language, an encoder for encoding output of the embedding module, a decoder for iteratively decoding output of the encoder based on generated tokens in translated text from previous iterations, a beam module for constraining output of the decoder with respect to possible embedded tags to include in the translated text for a current iteration using a beam search, and a layer for selecting a token to be included in the translated text for the current iteration. The translated text is in a second language different from the first language. In some embodiments, the approach further includes scoring and pointer modules for selecting the token based on the output of the beam module or copied from the source text or reference text from a training pair best matching the source text.
Method, apparatus, and computer-readable medium for transforming a hierarchical document object model to filter non-rendered elements
An apparatus, computer-readable medium, and computer-implemented method for transforming a hierarchical document object model (DOM) to filter non-rendered elements, including parsing elements in a hierarchical DOM to identify one or more tags, any properties, and any values of the elements, removing invisible elements determined based on properties of each invisible element, each invisible element comprising an element of the DOM that is hidden from a user when the DOM is rendered, removing empty elements based on the tags of each element, each empty element comprising a tag without any associated values, identifying remaining elements of the hierarchical DOM that have parent elements that have been removed from the hierarchical DOM, and re-parenting the remaining elements to new parent elements remaining in the hierarchical DOM based on traversing the hierarchical DOM from each of the remaining elements.
Layout generation for an electronic document that includes a web object
A method of determining a layout for an electronic document comprising a web object that includes a set of components is provided. The method, for each component of the web object, injects code into a web frame used to retrieve the component of the web object. The program sends a request to a web content retriever to retrieve the web object. The program, for each component of the web object, receives a message at the layout generation application from the injected code in the corresponding web frame indicating the completion of the retrieving of the component. Each message includes the height of the corresponding web object component. The method determines a layout for the web object by using the height of the web object components after the messages received from the injected code indicate that every component of the web object is retrieved.
Layout generation for an electronic document that includes a web object
A method of determining a layout for an electronic document comprising a web object that includes a set of components is provided. The method, for each component of the web object, injects code into a web frame used to retrieve the component of the web object. The program sends a request to a web content retriever to retrieve the web object. The program, for each component of the web object, receives a message at the layout generation application from the injected code in the corresponding web frame indicating the completion of the retrieving of the component. Each message includes the height of the corresponding web object component. The method determines a layout for the web object by using the height of the web object components after the messages received from the injected code indicate that every component of the web object is retrieved.