Patent classifications
G06F16/81
Systems and methods for generating an edit script
A method for generating a hierarchical edit script comprises generating, using a first linear sequence of nodes of a first hierarchical dataset and a second linear sequence of nodes of a second hierarchical dataset, a linear edit script defining a linear sequence of operations that can be applied to the first linear sequence to generate the second linear sequence, generating a first delta by matching one or more operations of the linear sequence of operations to the first hierarchical dataset, generating a second delta by matching one or more operations of the linear sequence of operations to the second hierarchical dataset, and generating the hierarchical edit script by merging the first delta and the second delta.
LARGE OBJECT DATA TYPE SUPPORT FOR COLUMN-BASED DATABASE SYSTEM
A method for processing an unsupported data type in a database is disclosed. The method for processing an unsupported data type in a database comprises detecting, while copying data from a primary to a secondary database, a table for data comprising a data type that is unsupported by the secondary database. Furthermore, the method comprises generating a base table in the secondary database as a copy of the detected table of the primary database without the data of the unsupported data type, generating an additional table in the secondary database for data of the unsupported data type, and linking the additional table to the base table in the secondary database via a row-identifier relationship. Additionally, the method comprises accessing data of the unsupported data type via the additional table while performing queries against the secondary database.
METHODS AND SYSTEMS FOR EXPANDING GPU MEMORY FOOTPRINT BASED ON HYBRID-MEMORY
In one aspect, a computerized method for expanding a graphics processing unit (GPU) memory footprint based on a hybrid-memory of a distributed database system (DDBS) includes the step of providing the DDBS. The DDBS is modified to include a plurality of GPUs; providing a local memory of a GPU of the plurality of GPUs. The method includes the step of filling the local memory of the GPU with one or more digests from the DDBS. The method includes the step of running a distributed general-purpose cluster-computing framework instance on the local memory of the GPU. The method includes the step of fetching data from the local memory of the GPU using the distributed general-purpose cluster-computing framework instance. The method includes the step of storing a result of the fetch operation in the DDBS to extend the local memory of the GPU to handle more data than what is fitted into the local memory of the GPU.
Method and system for optimization of faceted search
In general, embodiments of the technology relate to a method for serv1cmg requests. The method includes receiving a search request from a client, determining a main path and a conditional subpath associated with the search request, determining a subpath index associated with the main path and the conditional subpath, obtaining, using at least a portion of the search request, a set of subpath index entries from the subpath index, wherein each of the subpath index entries specifies a facet subpath and content associated with the facet subpath, generating a final result using at least a portion of the contents in the set of subpath index entries, and providing the final result to the client.
Method and system for optimization of faceted search
In general, embodiments of the technology relate to a method for serv1cmg requests. The method includes receiving a search request from a client, determining a main path and a conditional subpath associated with the search request, determining a subpath index associated with the main path and the conditional subpath, obtaining, using at least a portion of the search request, a set of subpath index entries from the subpath index, wherein each of the subpath index entries specifies a facet subpath and content associated with the facet subpath, generating a final result using at least a portion of the contents in the set of subpath index entries, and providing the final result to the client.
System and method for clustering documents
A method and system are provided for classifying semi-structured documents. According to one embodiment, the method includes accessing a number of documents, and identifying in each of the documents pairs of character segments. The method also includes generating a map of unique identified segment pairs across all documents in the several documents. The method further includes generating for each of the several documents a respective feature vector based on one or more unique pair of segments in the map that are also identified in the document, and clustering the plurality of documents into a plurality of clusters, using the feature vectors.
System and method for clustering documents
A method and system are provided for classifying semi-structured documents. According to one embodiment, the method includes accessing a number of documents, and identifying in each of the documents pairs of character segments. The method also includes generating a map of unique identified segment pairs across all documents in the several documents. The method further includes generating for each of the several documents a respective feature vector based on one or more unique pair of segments in the map that are also identified in the document, and clustering the plurality of documents into a plurality of clusters, using the feature vectors.
Annotation control features for systems and platforms that support information analysis, editing, and storage in web browsers
The present disclosure relates to systems for creating, organizing, viewing, and connecting annotations of web documents within web browsers that are made to be annotation capable. The rationale for designing systems using the principles disclosed herein is explained in detail. This rationale is illustrated using sequence diagrams and use case diagrams to elucidate the need for the design principles and data structures employed. The disclosure also provides exemplary hardware specifications and describes design tradeoffs.
Annotation control features for systems and platforms that support information analysis, editing, and storage in web browsers
The present disclosure relates to systems for creating, organizing, viewing, and connecting annotations of web documents within web browsers that are made to be annotation capable. The rationale for designing systems using the principles disclosed herein is explained in detail. This rationale is illustrated using sequence diagrams and use case diagrams to elucidate the need for the design principles and data structures employed. The disclosure also provides exemplary hardware specifications and describes design tradeoffs.
METADATA DRIVEN DATASET MANAGEMENT
A method for configuring the operation of the software of a data as a service (DAAS) system during run time is described. The configuring includes at least one of configuring ingestion of a vendor dataset to produce an ingested dataset and which analysis operations to perform on the vendor dataset to produce an analyzed dataset, and the configuring also includes at least one of how to search the vendor dataset based on a search query from a customer to allow the customer to locate a new record from the vendor dataset and how to match records in the vendor dataset with a match query from the customer to provide an updated record to the customer.