G06F16/2228

SYSTEMS, METHODS, AND APPARATUS FOR HIERARCHICAL AGGREGATION FOR COMPUTATIONAL STORAGE
20230049602 · 2023-02-16 ·

A method for computational storage may include storing, at a storage device, two or more portions of data, wherein a first one of the two or more portions of data comprises a first fragment of a record and a second one of the two or more portions of data comprises a second fragment of the record, and performing, by the storage device, an operation on the first and second fragments of the record. The method may further include performing, by the storage node, a second operation on first and second fragments of a second record. The operation may include a data selection operation, and the method may further include sending a result of the data selection operation to a server. The method may further include sending a result of a first data selection operation to a server.

SYSTEMS, METHODS, AND APPARATUS FOR DATA RESIZING FOR COMPUTATIONAL STORAGE
20230046030 · 2023-02-16 ·

A method for computational storage may include storing, at a storage device, a first portion of data, wherein the first portion of data may include a first fragment of a record, and a second portion of data may include a second fragment of the record, and appending the second fragment of the record to the first portion of data. The method may further include performing, at the storage device, an operation on the first and second fragments of the record. The method may further include determining that the first portion of data may include a first fragment of a record, and a second portion of data may include a second fragment of the record, wherein appending the second fragment of the record to the first portion of data may include appending, based on the determining, the second fragment of the record to the first portion of data.

SYSTEMS AND METHODS FOR MATCHING ELECTRONIC ACTIVITIES WITH RECORD OBJECTS BASED ON ENTITY RELATIONSHIPS

The present disclosure relates to systems and methods for matching electronic activities with record objects based on entity relationships. The method can include accessing a plurality of electronic activities, identifying an electronic activity, identifying a first participant associated with a first entity and a second participant associated with a second entity, determining whether a record object identifier is included in the electronic activity, identifying a first record object of the system of record that includes an instance of the record object identifier, and storing an association between the electronic activity and the first record object. The method can include determining a second record object corresponding to the second entity, identifying, using a matching policy, a third record object linked to the second record object and identifying a third entity, and storing, by the one or more processors, an association between the electronic activity and the third record object.

CONTEXTUAL GEOANALYTICS ENGINE IN A DATA ANALYTICS SYSTEM

Methods, systems, and computer storage media for providing a unified multilayer-based index for a contextual geoanalytics engine in a data analytics system. The contextual geoanalytics engine is configured to aggregate point-of-interest geographical data from multiple data sources into an aggregate or composite dataset. The contextual geoanalytics engine then transforms and maps the data into a homogenous dataset—i.e., a location embedding record that is homogenous representation of an aggregated dataset—comparable across global geographical regions. The homogenous dataset is accessible via the unified multilayer-based index that is a single geographical index, where the homogenous dataset is a composite of different datasets. The data includes different data types, where the data types are stored in different layers while sharing a common index (i.e., unified multilayer-based index). In this way, the unified multilayer-based index is a shared common index with a plurality of different layers associated with data used in geographically-based analytics.

Systems and methods for automatically determining utterances, entities, and intents based on natural language inputs

Systems and methods for processing natural language inputs to determine user intents using an insights repository are provided. An insights repository system is configured to build an insights repository as a data structure representing a plurality of entities and relationships among those various entities. The insights repository system may receive information from various sources via an event stream, and may process the information using event rules. Based on the application of the event rules, the system may configure an insights repository data structure representing various entities, relationships between various entities, and the strengths of relationships between various entities. After the insights repository is created, consumers may execute queries against the insights repository. Furthermore, the insights repository system may automatically query the insights repository to generate insight information to be published to an insight feed to which consumer systems may subscribe to receive automatic updates.

Virtual research platform

Systems and methods for automated data curation and presentation are disclosed herein. The system can include a memory including a structured database and a plurality of storage bins. The system can include at least one server that can receive a packetized data file generated from a data file. This packetized data file can include a first packet generated from a content file of the data file, a second packet generated from metadata of the data file, and a third packet generated from a payload of the data file. The at least one server can automatically generate at least one tag for the packetized data file, which at least one tag is automatically generated based at least one key phrase identified in at least the targeted portion of the content file. The at least one server can index the packetized data file according to the at least one tag into a predetermined taxonomy, receive a data request including a plurality of parameters identifying attributes of packetized data, and deliver curated data selected according to the at least some of the plurality of parameters of the data request.

High performance dictionary for managed environment

Systems and methods are provided for optimizing data structures to improve the data retrieval through the use of bucketing techniques. A number of objects within an environment is drastically reduced utilizing bucketing techniques. Within the buckets, items are sequentially organized such that location is quicker. Items, or keys, are aligned with the same hash value together in a bucket and a mapping of the hash value to the offset of the first key occurrence in that bucket. This guarantees each lookup operation is only two random read accesses. Systems and methods provided herein control the pressures on a system for garbage collection and minimize memory usage with minimal impacts on performance.

Bucket data distribution for exporting data to worker nodes

Systems and methods are described for exporting bucket data from one or more buckets to one or more worker nodes. The system can identify data from different bucket data from buckets stored in a data intake and query system that is to be processed by one or more worker nodes. The system can allocate one or more execution resources, such as a processing pipeline, to process and export the bucket data from the buckets. The system can assign bucket data corresponding to individual buckets to the execution resource based on a bucket distribution policy. The indexer can export the bucket data to the worker nodes for further processing based on the bucket data-execution resource assignment.

Method and apparatus for stress management in a searchable data service

Method and apparatus for stress management in a searchable data service. The searchable data service may provide a searchable index to a backend data store, and an interface to build and query the searchable index, that enables client applications to search for and retrieve locators for stored entities in the backend data store. Embodiments of the searchable data service may implement a distributed stress management mechanism that may provide functionality including, but not limited to, the automated monitoring of critical resources, analysis of resource usage, and decisions on and performance of actions to keep resource usage within comfort zones. In one embodiment, in response to usage of a particular resource being detected as out of the comfort zone on a node, an action may be performed to transfer at least part of the resource usage for the local resource to another node that provides a similar resource.

Interfacing with results of artificial intelligent models

The improved exercise of artificial intelligence by providing a systematic way for a computing system to interface with output from AI models. To do this, the computing system obtains results of an input data set being applied to an AI model. The results are then refined based upon characteristic(s) of the AI model and perhaps the input data set. Based upon characteristic(s) of the AI model and perhaps the input data set, interface element(s) are identified that can be used to interface with the refined results. The interface element(s) are then communicated to an interface element that interfaces with the refined results. The interface element(s) may include, for instance, operator(s) or term(s) that may be used to query against the refined results and/or an identification of visualization(s) that may be used to present to a user results of queries against the refined results.