G06F16/319

Determining the schema of a graph dataset

A schema for a dataset is determined by first identifying a dataset comprising data and relationships between data pairs. An original schema comprising an organizational structure is identified for the dataset. An initial fit between the dataset and the original schema is determined, the initial fit quantifying a conformity of the data in the dataset to the organizational structure of the original schema. A plurality of additional schemas are identified, each additional schema being a distinct organizational schema. The dataset is partitioned into a plurality of subsets. Each subset has a modified fit quantifying a modified conformity of the subset data in that subset to one of the original schema and the additional schemas. The modified fit is greater than the initial fit.
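
The abstract does not define how fit is measured or how the partition is chosen. As a minimal sketch only, assume a schema is a set of allowed (source type, relation, target type) triples, fit is the fraction of edges a schema allows, and each edge is assigned to the first schema that allows it. All names and data below are hypothetical.

```python
from collections import defaultdict

def fit(edges, schema):
    """Fraction of edges whose triple is allowed by the schema (assumed fit measure)."""
    if not edges:
        return 1.0
    return sum(1 for e in edges if e in schema) / len(edges)

def partition_by_schema(edges, schemas):
    """Assign each edge to the first schema that allows it; the rest go to 'unmatched'."""
    subsets = defaultdict(list)
    for e in edges:
        name = next((n for n, s in schemas.items() if e in s), "unmatched")
        subsets[name].append(e)
    return dict(subsets)

# Hypothetical graph edges as (source type, relation, target type) triples.
edges = [
    ("Person", "WORKS_AT", "Company"),
    ("Person", "KNOWS", "Person"),
    ("Order", "CONTAINS", "Product"),
    ("Customer", "PLACED", "Order"),
]
schemas = {
    "original": {("Person", "WORKS_AT", "Company"), ("Person", "KNOWS", "Person")},
    "commerce": {("Order", "CONTAINS", "Product"), ("Customer", "PLACED", "Order")},
}

initial_fit = fit(edges, schemas["original"])        # whole dataset vs. original schema
subsets = partition_by_schema(edges, schemas)
modified_fits = {name: fit(sub, schemas[name])
                 for name, sub in subsets.items() if name in schemas}
```

In this toy data the whole dataset conforms only partly to the original schema, while each subset conforms fully to the schema it was assigned to, which is the "modified fit greater than initial fit" relationship the abstract describes.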

MULTI-TENANT HOSTING OF INVERTED INDEXES FOR TEXT SEARCHES

Multi-tenant hosting of inverted indexes for text searches is implemented. Text search requests are routed to different index nodes that cache inverted indexes for different user accounts. Updates to inverted indexes are routed to index nodes that have acquired a lock on an inverted index. The index nodes have access to a common data store that persistently stores the inverted indexes.
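
The routing described above can be sketched as follows. This is an illustration under stated assumptions, not the disclosed implementation: the shared store is an in-memory dictionary, searches are routed by a hash of the account name, and the class names `SharedStore`, `IndexNode`, and `Router` are hypothetical.

```python
import zlib

class SharedStore:
    """Stands in for the common persistent data store (in-memory for this sketch)."""
    def __init__(self):
        self.indexes = {}   # account -> {term: set of document ids}
        self.locks = {}     # account -> name of the node holding the write lock

    def acquire_lock(self, account, node_name):
        # The first node to ask gets the lock; later callers succeed only if they hold it.
        return self.locks.setdefault(account, node_name) == node_name

class IndexNode:
    def __init__(self, name, store):
        self.name, self.store, self.cache = name, store, {}

    def _load(self, account):
        # Cache the account's index on first use. The cache entry aliases the
        # store's dict here; a real system would need cache invalidation.
        if account not in self.cache:
            self.cache[account] = self.store.indexes.setdefault(account, {})
        return self.cache[account]

    def search(self, account, term):
        return sorted(self._load(account).get(term, set()))

    def update(self, account, doc_id, text):
        if not self.store.acquire_lock(account, self.name):
            raise RuntimeError(f"{self.name} does not hold the lock for {account}")
        index = self._load(account)
        for term in text.lower().split():
            index.setdefault(term, set()).add(doc_id)

class Router:
    def __init__(self, nodes, store):
        self.nodes, self.store = nodes, store

    def _node_for(self, account):
        # Stable hash so an account's searches keep reaching the same cached node.
        return self.nodes[zlib.crc32(account.encode()) % len(self.nodes)]

    def search(self, account, term):
        return self._node_for(account).search(account, term)

    def update(self, account, doc_id, text):
        # Send updates to the lock holder if one exists, else to the account's usual node.
        holder = self.store.locks.get(account)
        node = next((n for n in self.nodes if n.name == holder), self._node_for(account))
        node.update(account, doc_id, text)

store = SharedStore()
router = Router([IndexNode("node-a", store), IndexNode("node-b", store)], store)
router.update("acct-1", 1, "inverted index hosting")
router.update("acct-2", 7, "index search")
```

Each account's postings stay separate, so a search for "index" under `acct-1` sees only that account's documents.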

METHOD FOR PROVIDING A PERSONALIZED RESPONSE FOR AN ELECTRONIC DEVICE
20230090023 · 2023-03-23

The disclosure provides a method for providing a personalized response for an electronic device. The method includes: receiving a query from a user of the electronic device, wherein the query is one of a voice query, a gesture query and a text query; determining an intermediate response for an augmented query; categorizing, by the electronic device, the intermediate response; selecting at least one other user communicating with the user of the electronic device for the determined category of the intermediate response; determining a perception of the at least one other user based on a profile of the at least one other user and a communication history with the at least one other user; and generating, by the electronic device, a final response for the user of the electronic device based on the perception of the at least one other user and the determined intermediate response.

Generation and utilization of vector indexes for data processing systems and methods

Example data processing systems and methods are described. In one implementation, a system accesses a corpus of data and analyzes the data contained in the corpus of data to identify multiple documents. The system generates vector indexes for the multiple documents such that the vector indexes allow a computing system to quickly access the plurality of documents and identify an answer to a question associated with the corpus of data.

Multiscale Quantization for Fast Similarity Search

The present disclosure provides systems and methods that include or otherwise leverage use of a multiscale quantization model that is configured to provide a quantized dataset. In particular, the multiscale quantization model can receive and perform vector quantization of a first dataset. The multiscale quantization model can generate a residual dataset based at least in part on a result of the vector quantization. The multiscale quantization model can apply a rotation matrix to the residual dataset to generate a rotated residual dataset that includes a plurality of rotated residuals. The multiscale quantization model can perform reparameterization of each rotated residual in the rotated residual dataset into a direction component and a scale component. The multiscale quantization model can perform product quantization of the direction components of the plurality of rotated residuals, and perform scalar quantization of the scale components of the plurality of rotated residuals.
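
The encoding steps listed above can be sketched in plain Python. This is a toy illustration, not the disclosed model: the coarse centroids, rotation matrix, direction codebooks, and scale levels are hand-picked here rather than learned, and the function names are hypothetical.

```python
import math

def nearest(v, codebook):
    """Index of the codeword closest to v in squared Euclidean distance."""
    return min(range(len(codebook)),
               key=lambda i: sum((a - b) ** 2 for a, b in zip(v, codebook[i])))

def matvec(m, v):
    return [sum(r * x for r, x in zip(row, v)) for row in m]

def encode(x, coarse, rotation, sub_codebooks, scale_levels):
    c = nearest(x, coarse)                               # 1. vector quantization
    residual = [a - b for a, b in zip(x, coarse[c])]     # 2. residual
    rotated = matvec(rotation, residual)                 # 3. rotate the residual
    scale = math.sqrt(sum(r * r for r in rotated))       # 4. scale (norm) ...
    direction = [r / scale for r in rotated] if scale else rotated  # ... and direction
    d = len(direction) // len(sub_codebooks)
    pq = [nearest(direction[i * d:(i + 1) * d], cb)      # 5. product quantization
          for i, cb in enumerate(sub_codebooks)]
    s = min(range(len(scale_levels)),                    # 6. scalar quantization
            key=lambda i: abs(scale - scale_levels[i]))
    return c, pq, s

def decode(code, coarse, rotation, sub_codebooks, scale_levels):
    c, pq, s = code
    direction = [v for i, cb in zip(pq, sub_codebooks) for v in cb[i]]
    rotated = [scale_levels[s] * v for v in direction]
    inverse = [list(col) for col in zip(*rotation)]      # orthogonal: transpose inverts
    residual = matvec(inverse, rotated)
    return [a + b for a, b in zip(coarse[c], residual)]

# Hand-picked parameters (assumptions for the example).
coarse = [[0, 0, 0, 0], [10, 10, 10, 10]]
rotation = [[0, -1, 0, 0], [1, 0, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]
cb = [[0.0, 0.0], [1.0, 0.0], [-1.0, 0.0], [0.0, 1.0], [0.0, -1.0]]
sub_codebooks = [cb, cb]
scale_levels = [0.5, 1.0, 2.0, 4.0]

x = [10, 12, 10, 10]
code = encode(x, coarse, rotation, sub_codebooks, scale_levels)
x_hat = decode(code, coarse, rotation, sub_codebooks, scale_levels)
```

Separating scale from direction lets one set of direction codebooks serve residuals of very different magnitudes, since only the scalar code changes with the norm.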

GENERATING SIMILARITY SCORES BETWEEN DIFFERENT DOCUMENT SCHEMAS

A document may be received as part of a request to identify similar documents in a collection of documents. However, the received document and the documents in the collection may have different schemas or formats. To provide semantic context to the search and allow similarity scores to be generated between different document types, a configuration may be accessed that defines how to generate queries from one schema into another schema. The configuration may map queries between different fields in both schemas. Results of the multiple queries can be combined to generate a weighted combination for each document that can be used as a similarity score between different document types.
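
A minimal sketch of the weighted combination, assuming the configuration is a list of field mappings with weights and that each per-field query is scored by token overlap. The field names, weights, and the Jaccard scoring are illustrative assumptions; the abstract does not specify them.

```python
def field_score(query_text, field_text):
    """Jaccard overlap of token sets; stands in for a search engine's relevance score."""
    q, f = set(query_text.lower().split()), set(field_text.lower().split())
    return len(q & f) / len(q | f) if q | f else 0.0

# Hypothetical configuration: which source field queries which target field, and its weight.
CONFIG = [
    {"source": "title", "target": "name", "weight": 0.75},
    {"source": "summary", "target": "description", "weight": 0.25},
]

def similarity_scores(document, collection, config=CONFIG):
    """Run one query per mapping and combine the results into a weighted score per document."""
    scores = {}
    for doc_id, candidate in collection.items():
        scores[doc_id] = sum(
            m["weight"] * field_score(document.get(m["source"], ""),
                                      candidate.get(m["target"], ""))
            for m in config)
    return scores

document = {"title": "vector index", "summary": "fast search"}
collection = {
    "a": {"name": "vector index", "description": "fast search"},
    "b": {"name": "graph schema", "description": "fast lookup"},
}
scores = similarity_scores(document, collection)
```

Document `a` matches on both mapped fields and scores highest; `b` shares only one token in its description and scores much lower.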

Algorithm downloading method, device, and related product

An algorithm download method, a device, and a related product are provided. The method includes: obtaining an algorithm identifier of an algorithm and a capability description of a client; sending the algorithm identifier and the capability description to a cloud; and receiving a version code that is of the algorithm and that is returned by the cloud, where the version code is obtained by the cloud by searching based on the algorithm identifier and the capability description. According to the foregoing solution, an algorithm can be easily downloaded.
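
The exchange can be sketched as a lookup keyed on both values. The catalog contents, the identifier and capability strings, and the function names are hypothetical, and the network call is replaced by a local function call.

```python
# Hypothetical cloud-side catalog: (algorithm identifier, capability) -> version code.
CATALOG = {
    ("face-detect", "npu"): "face-detect-2.1-npu",
    ("face-detect", "cpu"): "face-detect-2.1-cpu",
}

def find_version_code(algorithm_id, capability, catalog=CATALOG):
    """Cloud side: search by identifier and capability; None if nothing matches."""
    return catalog.get((algorithm_id, capability))

def request_version_code(algorithm_id, capability):
    """Client side: send both values to the cloud and return its answer (local call here)."""
    return find_version_code(algorithm_id, capability)

version_code = request_version_code("face-detect", "cpu")
```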

Detecting duplicated code patterns in visual programming language code instances

In various embodiments, a process for detecting duplicated code patterns in visual programming language code instances includes analyzing a repository of graph based visual programming language code instances and detecting a similar code portion pattern duplicated among a group of graph based visual programming language code instances included in the repository. The detection includes using an index and tokenizing a flow corresponding to at least one graph based visual programming language code instance in the group. The process includes visually indicating elements belonging to the detected similar code portion pattern within a visual representation of at least one of the group of graph based visual programming language code instances.
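
One way to picture the tokenize-and-index step is sketched below. It assumes each flow has already been linearized into an ordered list of nodes, that a token is the node's kind, and that a pattern is a run of `n` consecutive tokens; none of these choices comes from the abstract, and the instance names are hypothetical.

```python
from collections import defaultdict

def tokenize(flow):
    """Reduce a flow to a sequence of node kinds, ignoring labels (assumed tokenization)."""
    return [node["kind"] for node in flow]

def find_duplicates(instances, n=3):
    """Index every n-token run and return those that occur in more than one instance."""
    index = defaultdict(set)  # token n-gram -> names of instances containing it
    for name, flow in instances.items():
        tokens = tokenize(flow)
        for i in range(len(tokens) - n + 1):
            index[tuple(tokens[i:i + n])].add(name)
    return {gram: sorted(names) for gram, names in index.items() if len(names) > 1}

instances = {
    "CreateUser": [{"kind": k} for k in ["Start", "Aggregate", "If", "Assign", "End"]],
    "CreateOrder": [{"kind": k} for k in ["Start", "Aggregate", "If", "ServerAction", "End"]],
}
duplicates = find_duplicates(instances)
```

The nodes covered by a reported run are the elements a visual editor would highlight in each instance.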

Recommender and remediation system for enterprise service management

A system and method for automatically and algorithmically resolving service tickets through the utilization of historical solution data obtained from multiple sources. The system and method are optionally capable of providing and executing a BOT capable of implementing one or more solutions based on the automatic and algorithmic recommendation of a solution to resolve the service ticket(s).

IDENTIFYING EQUIVALENT TECHNICAL TERMS IN DIFFERENT DOCUMENTS
20220335090 · 2022-10-20

A computer-implemented method, system and computer program product for identifying equivalent technical terms. A deep learning model is trained to identify equivalent technical terms. The deep learning model is then applied to a new document. The sentences of the document are analyzed to identify technical terms. Text surrounding a technical term identified in the document is then analyzed to determine the meaning of such text. A glossary list is then reviewed to determine if the identified meaning of the analyzed text matches a meaning/concept in the glossary list linked to a technical term. In response to determining that the meaning of the analyzed text matches a meaning/concept in the glossary list linked to an equivalent technical term, the technical term in the document is annotated with the equivalent technical term. In this manner, non-standard equivalent technical terms in different documents with the same meaning/concept are able to be identified.