Patent classifications
G06F16/316
SYSTEM FOR OPTIMIZING CONTENT QUERIES
An indexing scheme generates a token index associating token index values with keywords in queries and generates expression trees for the queries that use the token index values to represent the keywords. The indexing scheme generates a document index assigning document index values to uploaded documents. The indexing scheme generates a document-token index that associates the token index values with the document index values for the documents containing the keywords associated with the token index values. The indexing scheme applies the expression trees to the document-token index to quickly identify the documents satisfying the queries. For example, the indexing scheme may generate bit arrays for each of the token index values identifying the documents containing the keywords and apply logical operators from the queries to the bit arrays. The resulting data structure provides a list of documents satisfying the queries.
Searching data files using a key map
Approaches for searching for key terms in a plurality of files include associating a respective key map with each file of the plurality of files in memory of a server. Each key map includes a plurality of bit values and each bit value indicates for a key term whether or not the key term is present in the associated file. The server inputs a search map, and the search map includes a plurality of bit values. Each bit value in the search map indicates for a key term whether or not the key term is a key term to search. The server determines for each key map, whether or not the key map satisfies the search map. Data indicating each file of the plurality of files having an associated key map that satisfies the search map is output by the server.
Index structure navigation using page versions for read-only nodes
Read-only nodes of a distributed database system may implement index structure navigation using page versions. A read request may be received at a read only node of a distributed database for select data. Data pages linked together to form an index structure for data stored for the distributed database may be navigated according to versions maintained for the data pages in order to identify one or more locations to access for the select data. One or more prior versions of data pages may be selected as part navigating the index structure according to a consistent view of the distributed database associated with the read request. Change notifications may also be received at the read-only node modifying the data pages of the index structure. The index structure modifications may be applied without blocking the index structure navigation for servicing the read request.
Method and system for document indexing and data querying
Generating a document index comprises: obtaining a document to be indexed; determining whether each monadic partition obtained from the document is a filter character and if so, forming a polynary partition with the monadic partition and at least one adjacent monadic partition and indexing the polynary partition, otherwise, indexing the monadic partition. Querying data comprising: receiving a data query, determining whether each monadic partition obtained from the data query is a filter character and if so, forming a polynary partition with the monadic partition and at least one adjacent monadic partition and using the polynary partition to obtain search results, otherwise, using the monadic partition to obtain search results; and combining search results to form a final query search result.
INFORMATION PROCESSING APPARATUS, DOCUMENT ENCODING METHOD, AND COMPUTER-READABLE RECORDING MEDIUM
A non-transitory computer-readable recording medium stores a document encoding program that causes a computer to execute a process including: first generating index information in which an appearance position is associated with each word appearing on document data of a target as bit map data at the time of encoding the document data of the target in word unit; second generating document structure information in which a relationship with respect to the appearance position included in the index information is associated with each specific sub structure included in the document data as bit map data; and retaining the index information and the document structure information in a storage in association with each other.
COMPUTER-READABLE RECORDING MEDIUM, INDEX CREATION DEVICE, INDEX CREATION METHOD, COMPUTER-READABLE RECORDING MEDIUM, SEARCH DEVICE, AND SEARCH METHOD
An index creation device reads target text data therein and creates a bitmap index in which, with regard to each of a character or a word and a tag that appear in the target text data, an appearance position of each of the character or the word and the tag in text data is represented as bitmap data.
Dynamic threshold gates for indexing queues
Electronic files are selectively assigned to a plurality of different indexing queues by one or more dynamic throughput threshold gates based on characteristics of the different indexing queues as well as the static file characteristics associated with each of the files. The files are then indexed. Upon detecting a change in a dynamic characteristic of one or more indexed files, the throughput threshold gate(s) are then modified to obtain, maintain or modify a desired throughput for one or more of the indexing queues.
COLD-START FORECASTING VIA BACKCASTING AND COMPOSITE EMBEDDING
Techniques are described herein for cold-start forecasting datasets using backcasting and composite embedding. An example method can include a system receiving a set of time series and metadata text comprising a first subset of metadata text and a second subset of metadata text. The system can generate a plurality of embeddings, each embedding comprising a numerical representation of a metadata text of the set of metadata text. The system can generate a plurality of vectors, each vector comprising a time series of the set of time series each time series associated with a metadata text of the first subset of metadata text. The system can generate a plurality of composite embeddings based at least in part on combining each embedding with a respective vector of the plurality of vectors. The system can determine a forecasted value associated with the second subset of metadata text based on the composite embeddings.
System and Method for Modification, Personalization and Customization of Search Results and Search Result Ranking in an Internet-Based Search Engine
A computer server system and method are disclosed for personalization and customization of network search results and rankings, such as for Internet searching. A representative server system comprises: a network interface to receive a query from a user and transmit return queries and search results; a data storage device having a first, lexical database having one or more compilations and templates; and one or more processors configured to access the first database and search a selected compilation using the query to generate initial search results; to comparatively score each selected parsed phrase of the initial search results, for each classification of a selected template and a selected compilation, and to output initial and final search results arranged according to the classifications and the predetermined order of the template. A representative embodiment may also include use of a second, semantic database having multi-dimensional vectors corresponding to parsed phrases, paragraphs, or clauses.
SEMANTIC SEARCH AND SUMMARIZATION FOR ELECTRONIC DOCUMENTS
Techniques for an artificial intelligence (AI) platform to search a document collection are described. Embodiments may use AI and machine learning techniques within a framework of an electronic document management system to perform semantic searching of an electronic document or a collection of electronic documents for certain types of information. The AI platform may summarize the information in a natural language representation of a human language. Other embodiments are described and claimed.