G06F16/86

Generation of mapping definitions for content management system

The method includes automatic creation of mapping definitions. The method further includes analyzing an external data structure, wherein the external data structure has one or more elements. The method further includes determining a path length and a number of occurrences for each element of a first set of elements of the one or more elements. The method further includes generating at least one query statement for the first set of elements. The method further includes generating mapping definitions based, at least in part on the at least one generated query statement and the path length of each element of the first set of elements.

Converting portions of documents between structured and unstructured data formats to improve computing efficiency and schema flexibility

According to embodiments of the present invention, machines, systems, computer-implemented methods and computer program products are provided for converting part or all of a document from an unstructured format into a structured format, and combining structured and unstructured data into an unstructured format. In some embodiments, documents are received in an unstructured format, and stored in a repository for querying (e.g., including updating and other operations (e.g., SQL SELECT, UPDATE, INSERT, DELETE, MERGE, etc.)). Responsive to detecting a predetermined number of the documents in an unstructured format stored in a repository for querying, an optimization criteria may be applied to identify one or more components of the stored documents for conversion, and the one or more components of the stored documents are converted from the unstructured format to a structured format.

Automated document modification
11256742 · 2022-02-22 · ·

Method, computer program and apparatus, with which receiving from a user a subject selection of a document and a selection of document components from a set of available document components, and automatically: including the selected document components in the document; recognising a set of tags in the selected document components; maintaining the mapping of available tag types and database fields; for each recognised tag acquiring from a database dynamic data based on the subject, the tag, and mapping of the database fields, and combining at least a portion of the acquired dynamic data to the document at the tag; and indicating to the user modifiable parts of the document. Further, modification data is received from the user and saved to the user's modifiable parts.

System and method for efficient data removal in a deduplicated storage system
09792316 · 2017-10-17 · ·

A system, method, and medium for detecting expired data in a data deduplicated storage system are disclosed. The system receives a request to insert a file into the deduplicated storage system. In response, the system determines a unique identifier for the file and stores the unique identifier in a list of unique identifiers associated with a data object that stores the file content. The system determines that the data object is not expired based at least in part on the identifier being present in the list of unique identifiers.

Building and managing data-processing attributes for modeled data sources

In some aspects, a model of a remote data source is defined. The model includes an entity representing a data segment from the remote data source. The entity has properties, which represent fields of a data structure in the remote data source and have data type descriptors indicating data types used by the fields. The model also includes an attribute specifying a function, which uses the properties as inputs. The entity is matched to exported data from the remote data source, and the exported data is used as input data to the attribute's function.

DYNAMIC SEMANTIC MODELS HAVING MULTIPLE INDICES

Embodiments are directed towards dynamic semantic models having multiple indices. Source data may be provided to a network computer from at least one separate data source. A raw data graph may be generated from the source data such that the structure of the raw data graph may be based on the structure of the source data. Elements of the raw data graph may be mapped to a concept graph. Concept instances may be generated based on the concept graph, the raw data graph, and the source data. Model-identifiers (MIDs) that correspond to the concept instances may be generated to include at least a path in the concept graph. The MID values may be indexed into a plurality of indices based on a content-type of the data associated with the MIDs. In response to a query, a result set may be generated that includes result MIDs.

METHOD AND SYSTEM FOR PROVIDING TARGET INFORMATION USING APPLICATION LIST
20170293624 · 2017-10-12 ·

A method and system for providing target information through an application list includes collecting user identifiers to identify each users and application lists of applications installed on terminals of the users; extracting text information using the application list for the user identifiers; and selecting a user identifier having text information corresponding to a keyword from among the user identifiers.

Data migration and integration system

A data migration and integration system is disclosed. In various embodiments, the system includes a memory configured to store a mapping from a source schema to a target schema; and a processor coupled to the memory and configured to migrate to a target schema an instance of source data organized according to the source schema, including by using a chase engine to perform an ordered sequence of steps comprising adding a bounded layer of new elements to a current canonical chase state associated with migrating the source data to the target schema; adding coincidences associated with one or more of the target schema data integrity constraints and a mapping from the source schema to the target schema; and merging equal elements based on the coincidences; and repeat the preceding ordered sequence of steps iteratively until an end condition is met.

Storing semi-structured data
09754048 · 2017-09-05 · ·

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for storing semi-structured data. One of the methods includes maintaining a plurality of schemas; receiving a first semi-structured data item; determining that the first semi-structured data item does not match any of the schemas in the plurality of schemas; and in response to determining that the first semi-structured data item does not match any of the schemas in the plurality of schemas: generating a new schema, encoding the first semi-structured data item in the first data format to generate the first new encoded data item in accordance with the new schema, storing the first new encoded data item in the data item repository, and associating the first new encoded data item with the new schema.

Information technology asset type identification using a mobile vision-enabled robot

Mechanisms are provided for classifying an obstacle as an asset type. The mechanisms receive a digital image of an obstacle from an image capture device of an automated robot. The mechanisms perform a classification operation on the digital image of the obstacle to identify a proposed asset type classification for the obstacle. The mechanisms determine a final asset type for the obstacle based on the proposed asset type classification for the obstacle. The mechanisms update a map data structure for a physical premises in which the obstacle is present based on the final asset type.