G06F16/835

Storing semi-structured data
09754048 · 2017-09-05

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for storing semi-structured data. One of the methods includes maintaining a plurality of schemas; receiving a first semi-structured data item; determining that the first semi-structured data item does not match any of the schemas in the plurality of schemas; and, in response to that determination: generating a new schema, encoding the first semi-structured data item in a first data format to generate a first new encoded data item in accordance with the new schema, storing the first new encoded data item in a data item repository, and associating the first new encoded data item with the new schema.
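The match-or-create flow described in this abstract can be sketched roughly as follows. This is an illustration only, not the patented method: the names `schema_of` and `SchemaStore` are hypothetical, a schema is assumed to be the sorted set of (field name, type name) pairs, and the "first data format" is assumed to be a JSON array of values ordered by the schema's fields.

```python
import json
from typing import Any


def schema_of(item: dict[str, Any]) -> tuple[tuple[str, str], ...]:
    """Assumed schema definition: sorted (field name, type name) pairs."""
    return tuple(sorted((key, type(value).__name__) for key, value in item.items()))


class SchemaStore:
    """Hypothetical store that keeps schemas and encoded items side by side."""

    def __init__(self) -> None:
        self.schemas: dict[tuple, int] = {}             # schema -> schema id
        self.repository: list[tuple[int, bytes]] = []   # (schema id, encoded item)

    def store(self, item: dict[str, Any]) -> int:
        schema = schema_of(item)
        schema_id = self.schemas.get(schema)
        if schema_id is None:
            # No maintained schema matches the item: generate a new one.
            schema_id = len(self.schemas)
            self.schemas[schema] = schema_id
        # Encode only the values; field names are recoverable from the schema.
        values = [item[name] for name, _ in schema]
        encoded = json.dumps(values).encode("utf-8")
        # Storing the schema id with the bytes associates the item with its schema.
        self.repository.append((schema_id, encoded))
        return schema_id


store = SchemaStore()
first_id = store.store({"name": "a", "age": 1})
second_id = store.store({"age": 2, "name": "b"})   # same fields and types
third_id = store.store({"name": "c"})              # no match, new schema
```

Under these assumptions the first two items share one schema and the third triggers creation of a second schema, so field names are stored once per schema rather than once per item.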

Rules-based targeted content message serving systems and methods

A method of serving targeted content messages for display in a website, accessed in a browser program of a networked computer communicatively connected to a network at a network address, delivers uniquely targeted content messages displayed in websites viewed in web browsers. The method includes placing a script device in a website file and processing the website file, together with the script device, by a particular web browser on download of the website file, including by determining the network address of the networked computer, determining an identifier of the website file, and sending an artifact representing the network address and the identifier over the network to a server computer. The method also includes detecting the network address and the identifier by the server computer, querying a database for a database article related to the network address and the identifier, constructing a script program stored in memory of the server computer for the particular browser and website file, and constructing an ad device stored in memory of the web browser device from the script program. The method further includes calling the server computer by the ad device by communication of an identifier representing an action of the web browser device, receiving the identifier by the server computer, querying the database for a select message artifact related to the script program, the identifier, the website file, and the web browser, and responding by the server computer to the web browser with the select message artifact. A message represented by the select message artifact is then displayed in the website viewed in a browser window of the web browser. Messages can be prioritized and are uniquely targeted in content, based on real-time activities of the web browser.

System and method for blockchain-based data synchronization

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for blockchain-based data synchronization, are provided. One of the methods includes: obtaining, from one or more blockchain nodes associated with a blockchain, data associated with a plurality of blockchain transactions recorded in one or more blocks of the blockchain; storing the obtained data in one or more data stores, wherein the storing comprises organizing the obtained data according to one or more schemas, at least one of the one or more schemas being different from a data structure of the blockchain; receiving, from a client device, a data query based on one of the one or more schemas; executing the data query on the data in the one or more data stores to obtain a result; and sending, to the client device, a response comprising the obtained result.
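As a rough illustration of the obtain, store, and query steps, the sketch below copies transaction data out of block-shaped records into a relational table and answers a client query against that table. Everything here is assumed for the example: the `BLOCKS` sample data, the `transfers` table layout, and the `sync` and `total_sent` helpers are hypothetical, and an in-memory SQLite database stands in for the data store.

```python
import sqlite3

# Hypothetical data as it might be obtained from blockchain nodes:
# transactions nested inside blocks.
BLOCKS = [
    {"height": 1, "transactions": [
        {"hash": "0xa1", "sender": "alice", "recipient": "bob", "amount": 5},
    ]},
    {"height": 2, "transactions": [
        {"hash": "0xb1", "sender": "bob", "recipient": "carol", "amount": 2},
        {"hash": "0xb2", "sender": "alice", "recipient": "carol", "amount": 7},
    ]},
]


def sync(blocks: list[dict], conn: sqlite3.Connection) -> None:
    """Flatten block data into a table whose schema differs from the chain layout."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS transfers ("
        "tx_hash TEXT PRIMARY KEY, block_height INTEGER, "
        "sender TEXT, recipient TEXT, amount INTEGER)"
    )
    rows = [
        (tx["hash"], block["height"], tx["sender"], tx["recipient"], tx["amount"])
        for block in blocks
        for tx in block["transactions"]
    ]
    # INSERT OR IGNORE makes repeated synchronization idempotent.
    conn.executemany("INSERT OR IGNORE INTO transfers VALUES (?, ?, ?, ?, ?)", rows)
    conn.commit()


def total_sent(conn: sqlite3.Connection, sender: str) -> int:
    """Example client query expressed against the relational schema."""
    row = conn.execute(
        "SELECT COALESCE(SUM(amount), 0) FROM transfers WHERE sender = ?",
        (sender,),
    ).fetchone()
    return row[0]


conn = sqlite3.connect(":memory:")
sync(BLOCKS, conn)
alice_total = total_sent(conn, "alice")
```

The point of the reorganization is that an aggregate such as a per-sender total becomes a single indexed query instead of a scan over every block.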

Generating structured metrics from log data

The disclosed technique can be performed by a data intake and query system. The technique includes ingesting data including log data obtained over a network from systems, and receiving user input indicating a scope for retrieving data and a criterion expressed in a structured language. The technique further includes retrieving data based on the scope indicated by the user input and extracting a first field value and a second field value from the retrieved data based on the criterion and the scope. The first field value includes a first numerical value indicative of a measured characteristic of a computing device and the second field value includes a first dimension. The technique further includes storing a first structured metric and the first dimension in a time-series metrics store. The first structured metric includes the first numerical value. The first dimension is associated with the first numerical value.
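The extraction step can be illustrated with a small sketch that pulls a numerical value and a dimension out of log lines and stores them as structured metrics. The log format, the `cpu_pct` metric name, the `host` dimension, and the `Metric` and `extract_metrics` names are all assumptions made for this example; a regular expression stands in for the criterion expressed in a structured language, and a plain list stands in for the time-series metrics store.

```python
import re
from dataclasses import dataclass

# Hypothetical ingested log data.
LOG_LINES = [
    "2024-01-01T00:00:00Z host=web-1 cpu_pct=41.5",
    "2024-01-01T00:01:00Z host=web-2 cpu_pct=73.0",
    "2024-01-01T00:02:00Z host=web-1 msg=restart",
]


@dataclass(frozen=True)
class Metric:
    timestamp: str
    name: str
    value: float        # the numerical value (measured characteristic)
    dimensions: tuple   # ((dimension name, dimension value), ...)


# Assumed criterion: a timestamp, a host field, and a numeric cpu_pct field.
PATTERN = re.compile(
    r"^(?P<ts>\S+) host=(?P<host>\S+) cpu_pct=(?P<cpu>[0-9]+(?:\.[0-9]+)?)$"
)


def extract_metrics(lines: list[str]) -> list[Metric]:
    """Turn matching log lines into structured metrics; skip the rest."""
    metrics = []
    for line in lines:
        match = PATTERN.match(line)
        if match is None:
            continue  # line does not satisfy the criterion
        metrics.append(
            Metric(
                timestamp=match["ts"],
                name="cpu_pct",
                value=float(match["cpu"]),
                dimensions=(("host", match["host"]),),
            )
        )
    return metrics


metrics_store = extract_metrics(LOG_LINES)
```

Here the third log line carries no numeric field and is skipped, so only two metrics reach the store, each keeping its `host` dimension alongside the value.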

Systems and methods for generating schemas that represent multiple data sources

Systems and methods for generating schemas that represent multiple data sources are provided herein. According to some embodiments, methods may include determining a schema for each of the multiple data sources via a computing device communicatively couplable with each of the multiple data sources, each of the multiple data sources including one or more data structures that define how data is stored in the data source; generating a negotiated schema by comparing the schemas of the multiple data sources to one another and interrelating data points of the multiple data sources based upon the schemas; interrelating the negotiated schema with the schema for each of the multiple data sources based upon the interrelation of the data points; and storing the negotiated schema in a storage medium by way of the computing device.
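One simple way to picture a negotiated schema is a merged field catalog that records, for each shared data point, which field in each source it corresponds to. The sketch below is only an assumption about how such a comparison could work: it interrelates fields by normalized name, which the abstract does not specify, and the `normalize` and `negotiate` names and the sample source schemas are hypothetical.

```python
def normalize(field_name: str) -> str:
    """Assumed matching rule: ignore case and underscores in field names."""
    return field_name.lower().replace("_", "")


def negotiate(schemas: dict[str, dict[str, str]]) -> dict[str, dict]:
    """Merge per-source schemas into one schema that links back to each source.

    `schemas` maps source name -> {field name: type name}.
    """
    negotiated: dict[str, dict] = {}
    for source, schema in schemas.items():
        for field_name, type_name in schema.items():
            entry = negotiated.setdefault(
                normalize(field_name), {"types": set(), "sources": {}}
            )
            entry["types"].add(type_name)
            # Interrelate the negotiated field with the source's own field.
            entry["sources"][source] = field_name
    return negotiated


source_schemas = {
    "crm": {"Customer_ID": "int", "email": "str"},
    "billing": {"customerid": "int", "amount": "float"},
}
negotiated = negotiate(source_schemas)
```

With these inputs the negotiated `customerid` entry points at `Customer_ID` in the CRM source and `customerid` in the billing source, which is the kind of cross-source link the abstract describes.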

Technique and framework to provide diagnosability for XML query/DML rewrite and XML index selection

A method and apparatus are provided for automatically analyzing and providing feedback regarding the optimizability of a relational database query. A query developer's primary goal is to ensure that queries and DML operations are rewritten for the most efficient execution. Rewrite diagnosability captures metadata for each attempted query optimization, including success or failure and the reasons for failure. The metadata is stored in association with the operators that were not removed through rewriting. Once all optimizations have been attempted and rewriting is complete, the metadata is selectively displayed based on the cost to perform the associated operation. The context of performing the operation may affect the cost. The cost may be based at least on the type of operation and where within the query tree the operation is located. A query developer may configure the database system not to execute the resulting query plan based on one or more criteria.

System and method for performing object-modifying commands in an unstructured storage service

Embodiments of a system and method for performing object-modifying commands in an unstructured storage service are described. Embodiments may include utilizing one or more nodes to implement an unstructured storage service including a service interface configured to service data storage requests and data retrieval requests from remote clients. The unstructured storage service is configured to isolate access to each of multiple storage domains of the service to different clients. Embodiments may also include receiving, from a given remote client, a request to store a data object to a given storage domain of the service. The request further specifies a command to modify source data for the object prior to storing the object. Embodiments may also include using the unstructured storage service to perform the command without exporting the source data from the unstructured storage service to the given remote client prior to completion of the command.

Method for preparing documents written in markup languages while implementing a user interface for dealing with data of an information system

The present technical solution relates to user interfaces in general and more specifically to user interfaces for dealing with data in computer information systems. A method of preparing documents written in markup languages while implementing a user interface for dealing with data of an information system comprises: forming metadata for at least one operation; forming a template for representing said operation, the template including at least one document written in at least one markup language; forming and storing a link between the template and at least one operation; displaying the template; and forming and storing links between the template elements and operations by selecting a template element, displaying information about the operations available for linking, selecting an operation, and forming and storing a link comprising the identifier of the previously selected template element and the identifier of the previously selected operation. The technical result is increased efficiency in preparing documents written in markup languages while implementing a user interface for dealing with data of an information system, reduced time for preparing the documents, and simpler subsequent maintenance.

Weighted behavioral signal association graphing for search engines
11210341 · 2021-12-28

Systems and methods are disclosed for optimizing responses to queries. Analyses of user interactions and other behaviors can lead to association of queries with signals, including ASINs and other product descriptions. The associations can be algorithmically graphed and analyzed on a disaggregated basis and individually weighted to improve search recall and reduce the risk of returning defective search results. Machine learning techniques can further optimize the associations and/or the search results.
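A weighted association graph of this kind can be sketched as a mapping from queries to product identifiers, with edge weights accumulated from individual behavioral events. The behavior types, their weights, the sample events, and the `build_graph` and `top_products` names are all assumptions chosen for illustration; the abstract does not state how signals are weighted, and the machine learning step is omitted.

```python
from collections import defaultdict

# Assumed per-behavior weights; the real weighting scheme is not given.
WEIGHTS = {"click": 1.0, "add_to_cart": 2.0, "purchase": 3.0}


def build_graph(events):
    """Accumulate a weighted query -> product edge for each individual event."""
    graph = defaultdict(lambda: defaultdict(float))
    for query, product_id, behavior in events:
        graph[query][product_id] += WEIGHTS[behavior]
    return graph


def top_products(graph, query, k=3):
    """Return up to k product ids for a query, strongest association first."""
    edges = graph.get(query, {})
    # Sort by descending weight, breaking ties by product id for stability.
    return sorted(edges, key=lambda product_id: (-edges[product_id], product_id))[:k]


# Hypothetical (query, product id, behavior) events.
EVENTS = [
    ("usb cable", "B001", "click"),
    ("usb cable", "B002", "purchase"),
    ("usb cable", "B001", "click"),
    ("hdmi cable", "B003", "add_to_cart"),
]
graph = build_graph(EVENTS)
ranked = top_products(graph, "usb cable")
```

In this toy data a single purchase outweighs two clicks, so `B002` ranks ahead of `B001` for the query "usb cable".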

Hash bucketing of data

The disclosed embodiments include a method performed by a data intake and query system. The method includes ingesting metrics including respective key values and respective measured values, where the respective key values include a primary key value of a selected primary key. The method further includes generating a hash value for each metric by processing each primary key value with a hashing function, and indexing each metric in association with an existing hash bucket having a matching hash value.
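The hashing and indexing steps can be sketched as follows. This is a minimal illustration under stated assumptions: the metric layout (a `keys` dict plus a measured `value`), the choice of `host` as the primary key, SHA-256 as the hashing function, the fixed bucket count, and the `bucket_of` and `index_metrics` names are all hypothetical.

```python
import hashlib
from collections import defaultdict


def bucket_of(primary_key_value: str, num_buckets: int = 8) -> int:
    """Hash the primary key value and reduce it to a bucket number."""
    digest = hashlib.sha256(primary_key_value.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_buckets


def index_metrics(metrics, primary_key: str, num_buckets: int = 8):
    """Index each metric under the bucket matching its hash value."""
    buckets = defaultdict(list)
    for metric in metrics:
        key_value = metric["keys"][primary_key]
        buckets[bucket_of(key_value, num_buckets)].append(metric)
    return buckets


# Hypothetical ingested metrics: key values plus a measured value.
METRICS = [
    {"keys": {"host": "web-1", "region": "eu"}, "value": 41.5},
    {"keys": {"host": "web-2", "region": "us"}, "value": 73.0},
    {"keys": {"host": "web-1", "region": "eu"}, "value": 44.0},
]
buckets = index_metrics(METRICS, primary_key="host")
```

Because the bucket depends only on the primary key value, every metric for `web-1` lands in the same bucket, so a lookup by host needs to inspect one bucket rather than the whole index.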