IPIQ

G06F16/1734

TERMINAL AND METHOD FOR STORING AND PARSING LOG DATA

20230237019 · 2023-07-27 ·

S2W Inc.

Provided are a terminal and method for storing and parsing log data. The method includes collecting log data on the basis of a file path of the log data, storing metadata including the file path and log data paired with the metadata in a database (DB), classifying the log data on the basis of the metadata, acquiring type information of a parser related to the log data, and parsing the log data through the parser having the type information.

Incremental addition of data to partitions in database tables

11567957 · 2023-01-31 ·

salesforce.com, inc.

Gopi Krishnan Nambiar

A method and system for accessing updated data from a database in response to a user query has been developed. First, multiple transaction logs are generated for a database. Each transaction log contains a record of actions executed by a database management system and referenced according to the specified date of the actions. Data updates are received and stored with the database. An incremental database partition is created for each data update. Each incremental database partition is stored with reference to a corresponding transaction log for the date of the data update. The updated data is accessed through the incremental database partition in response to an outdated user query. The outdated user query contains a data access request for a date earlier than the receipt of data updates.

Managing dependent delete operations among data stores

11567899 · 2023-01-31 ·

Western Digital Technologies, Inc.

Example distributed storage systems, delete managers, and methods provide for managing dependent delete operations among data stores. Dependent data operation entries and corresponding dependency sets may be identified in an operations log. Dependent data operations may be identified in each shard and data operation entries. A delete process for the data objects in the dependency set may be delayed until the delete process for the dependent data object completes.

Generation and traversal of a hierarchical index structure for efficient data retrieval

11567906 · 2023-01-31 ·

Microsoft Technology Licensing, Llc

Methods, systems, apparatuses, and computer program products are described herein for the generation and traversal of a hierarchical index structure. The structure indexes search keys from data ingested from different data sources and enables efficient retrieval of the keys. As data is ingested, index nodes are generated at the lowest level of the structure. The nodes are analyzed to determine whether such nodes comprise duplicate keys. Responsive to doing so, a new index node is generated located at a higher level of the structure. This process results in a DAG comprising orphan nodes including different search keys. When processing a query for search keys, the orphan index nodes are initially analyzed for the keys. Upon finding a search key, its child nodes are recursively searched until location information specifying the location of ingested data in which the search key is located is found.

Scaling delta table optimize command

11567900 · 2023-01-31 ·

Databricks, Inc.

The interface is to receive an indication to execute an optimize command. The processor is to receive a file name; determine whether adding a file of the file name to a current bin causes the current bin to exceed a threshold; associate the file with the current bin in response to determining that adding the file does not cause the current bin to exceed the bin threshold; in response to determining that adding the file to the current bin causes the current bin to exceed the bin threshold: associate the file with a next bin, indicate that the current bin is closed, and add the current bin to a batch of bins; determine whether a measure of the batch of bins exceeds a batch threshold; and in response to determining that the measure exceeds the batch threshold, provide the batch of bins for processing.

DETERMINING A SHARING RELATIONSHIP OF A FILE IN NAMESPACE SNAPSHOTS

20230027230 · 2023-01-26 ·

The technology described herein efficiently determines whether a real inode is shared among views, or owned. In-memory data structures include a view snapshot generation counter that is increased as a snapshot that generates a view is created, and an inode total weight. An in-memory virtual inode cache dataset for a filesystem object associated with the view is instantiated with the value of snapshot generation counter, sharing-related data based on the inode mapping file entry for the object, and an inode access weight. To determine whether the inode is shared (and needs to be split), such as on a write to the object, the in-memory data is evaluated. The real inode is shared if the generation counters are unequal, if the sharing-related data indicates sharing at an intermediate indirect block level, or indicates sharing at the inode level and the inode access weight is less than the inode total weight.

Storage constrained synchronization engine

11562000 · 2023-01-24 ·

Dropbox, Inc.

A client application of a content management system provides instructions for synchronizing content items and placeholder items using a local file journal and updated file journal. The client application compares entries in the updated file journal to entries in the local file journal to determine modifications to make to content items or placeholder items stored in a shared content storage directory on the client device. Based on the result of the comparison, the client application replaces placeholder items with content items or replaces content items with placeholder items.

Fast migration of metadata

11561716 · 2023-01-24 ·

Cohesity, Inc.

One or more buckets of key-value pairs of a first node of a distributed storage system are selected to be migrated to a second node of the distributed storage system. One or more underlying database files corresponding to the one or more selected buckets are identified. The one or more identified underlying database files are directly copied from a storage of a first node to a storage of the second node. The copied underlying database files are linked in a database of the second node to implement the one or more selected buckets in the second node.

Filesystem view separation for data confidentiality and integrity using lattice-based security domains

11562086 · 2023-01-24 ·

International Business Machines Corporation

A stackable filesystem architecture that curtails data theft and ensures file integrity protection. In this architecture, processes are grouped into ranked filesystem views, or “security domains.” Preferably, an order theory algorithm is utilized to determine a proper domain in which an application is run. In particular, a root domain provides a single view of the filesystem enabling transparent filesystem operations. Each security domain transparently creates multiple levels of stacking to protect the base filesystem, and to monitor file accesses without incurring significant performance overhead. By combining its layered architecture with view separation via security domains, the filesystem maintains data integrity and confidentiality.

DISTRIBUTING STORAGE PERFORMANCE METRICS RECORDING WORK ACROSS STORAGE APPLIANCES OF A STORAGE CLUSTER

20230229579 · 2023-07-20 ·

A technique of processing storage cluster performance metrics involves obtaining access to performance metrics from storage appliances of a storage cluster, the performance metrics identifying performance for storage objects managed by the storage appliances. The technique further involves, after access to the performance metrics is obtained, disregarding a duplicate set of performance metrics for a storage object that migrates from a first storage appliance of the storage cluster to a second storage appliance of the storage cluster. The technique further involves, after the duplicate set of performance metrics is disregarded, archiving the performance metrics to an archive. After archiving the performance metrics to the archive, the technique may provide a performance analysis based on the performance metrics from the archive as well as adjust operation of the storage cluster according to the performance analysis.

Patent classifications

G06F16/1734