G06F16/316

SYSTEM AND METHOD FOR PRE-INDEXING FILTERING AND CORRECTION OF DOCUMENTS IN SEARCH SYSTEMS
20240004930 · 2024-01-04 ·

Embodiments as disclosed herein provide a search system with an pre-indexing filter that provides both a sophisticated and contextually tailored approach to filtering documents and a corrector that is adapted to alter a document that has been designated to be filtered out from the indexing process and determine if the altered document should be indexed. The alteration of the document may be tied to the attributes, rules or thresholds used to initially filter the document from the indexing process. The filtering criteria can thus be tailored to a specific context such that both the initial filtering and the alteration process may be better suited for application in that context.

System to organize search and display unstructured data
10885085 · 2021-01-05 · ·

A system to organize, search and display unstructured data comprising a token retrieval module, a document indexing engine, a subspace search module and a user interface module has been devised. The system retrieves a plurality of tokens and associates them with coordinates in subspace. It also retrieves documents and creates a multidimensional matrix of documents and tokens where each cell contains the number of times the token occurs in each document. That matrix is employed in a search using user specified search terms. The search results are displayed such that the search tokens occupy specific spatial coordinates and documents spatial coordinates are dictated by the relative preponderance of each search term in each document.

SYSTEM AND METHOD FOR DOCUMENT DATA EXTRACTION, DATA INDEXING, DATA SEARCHING AND DATA FILTERING
20200394243 · 2020-12-17 ·

Systems and methods are described for extracting data from digital documents, indexing the data, and providing a user interface for filtering the data and generating a document based on the filtered data. In one implementation, a method includes extracting data from one or more digital documents, the extracted data including elements of a first type, the elements of the first type including key-value pairs; indexing the extracted data; hosting a web-based application instance, the web-based application instance including a user interface for searching the indexed data and filtering elements of the first type based on rules defined by a user of the user interface; receiving rules for filtering the elements of the first type; and filtering the elements of the first type based on the received rules.

Systems, methods, and apparatuses for implementing change value indication and historical value comparison

Disclosed herein are systems and methods for implementing change value indication and historical value comparison at a user interface including means for storing records in a database, wherein updates to the records are recorded into a historical trending data object to maintain historical values for the records when the records are updated in the database; receiving input from a user device specifying data to be displayed at the user device; receiving historical filter input from the user device; querying the records stored in the database for the data to be displayed; querying the historical trending data object for the historical values of the data to be displayed; comparing the data to be displayed with the historical values of the data to be displayed to determine one or more changed values corresponding to the data to be displayed; and displaying a change value indication GUI to the user device displaying at least the data to be displayed and a changed value indication based on the one or more changed values determined via the comparing. Other related embodiments are further disclosed.

Method for semantic indexing of big data using a multidimensional, hierarchical scheme

A method for indexing semantic, non-transitory, computer-stored data comprising the following steps: storing the data in a database; representing the data in a structured framework having at least three elements derived from an ontology; expressing each element as a hierarchical-index value based on an ontology such that semantic information is embedded therein; combining the elements in a multi-dimensional index; and converting the multi-dimensional index into a one-dimensional index.

System and method for interactive searching of transcripts and associated audio/visual/textual/other data files
10860638 · 2020-12-08 ·

A system and method for processing digital multimedia files to provide searchable results includes the steps of converting a digital multimedia file to a plain text data format, annotating each word in the file with an indicator such as a time stamp to indicate where the word appears in the file, converting each indicator to an encoded indicator using characters that are not indexed by search software, indexing the converted, annotated file, storing the converted, annotated file and a file location of the converted, annotated file, receiving a query from a user's computer, and returning search results to the user's computer that include search snippets comprising unindexed portions of one or more files considered responsive to the query and the file location of those files.

Page compete
10860674 · 2020-12-08 · ·

Optimizations are provided for generating a list of search results. At a user interface, a query is received from a user who is using the user interface. This query includes a request to access digital content. In response to the request, a set of query results is obtained. This set of query results includes a first list of selectable links. Each of these links is associated with the digital content requested by the query and is prioritized according to a particular order. Then, an access performance rate is determined for at least some of the links included within the first list. A second list of links is then generated by evaluating the links of the first list against a set of rules. This set of rules prioritizes the links based at least partially on the determined access performance rates. Subsequently, the user interface is updated to reflect the second list.

Processing system using intelligent messaging flow markers based on language data

Some aspects disclosed herein are directed to, for example, a system and method comprising a client device receiving an input of at least a portion of a message. The client device may transmit, to a server device, the at least the portion of the message for display via a second client device. The client device may determine an identifier for the at least the portion of the message. The client device may determine, based on a lexicon, a marker name for the at least the portion of the message. The client device may generate an association between the marker name for the at least the portion of the message and the identifier for the at least the portion of the message. The client device may store, at a storage location, the marker name for the at least the portion of the message, the identifier for the at least the portion of the message, and the association between the marker name for the at least the portion of the message and the identifier for the at least the portion of the message.

Text analysis of morphemes by syntax dependency relationship with determination rules

A morpheme analysis unit sets beforehand a meaning-candidate tag and a sentimental theme tag for a morpheme required to be input as a text. A syntax analysis unit generates an index where a clause including a meaning-candidate tag and a sentimental theme tag and a type of each tag. A meaning attribute extraction unit recognizes a clause including a meaning-candidate and a type of tag with reference to the index, and then applies a meaning attribute rule, sets a meaning attribute tag for a necessary clause, and updates the index. A sentimental analysis unit also recognizes a clause including a sentimental theme tag and a clause including a meaning attribute tag with reference to the index, and then applies a sentimental analysis rule and sets a sentimental attribute tag for a necessary clause.

System and method for improving data compression of a storage system using coarse and fine grained similarity

Techniques for improving data compression of a storage system using coarse and fine grained similarity are described herein. According to one embodiment, region sketches for a plurality of regions of the set of data are generated, each region storing a plurality of data chunks. A region sketch index having a plurality of entries is maintained, each corresponding to one of the region sketches of the regions. The entries of the region sketch index are sorted based on the sketches of the regions, such that regions with an identical region sketch are positioned adjacent to each other within the region sketch index, representing similar regions. The data chunks of the similar regions that are identified based on the sorted entries of the region sketch index are reorganized to improve data compression of the data chunks of the similar regions.