Patent classifications
H03M7/707
Content-aware compression of data using multiple parallel prediction functions
Multiple parallel prediction functions are employed for content-aware data compression. An exemplary method comprises obtaining a floating point number comprising a sign, an exponent at a given base and a significand having a length; applying a plurality of distinct prediction algorithms to the floating point number to generate a corresponding plurality of predictions; selecting a given one of the plurality of distinct prediction algorithms for the floating point number by evaluating a compression metric applied to the plurality of predictions; and encoding the floating point number by encoding the exponent and the length as a single code using a residual generated by the selected prediction algorithm. A disambiguation index optionally identifies the selected prediction algorithm among a set of prediction algorithms that potentially generated the selected prediction. A set of one or more predictors out of a larger set of predictors can be determined for a specific data set based on a performance-based ranking.
DEVICE WITH COMMUNICATION INTERFACE AND METHOD FOR CONTROLLING DATABASE ACCESS
The invention relates to a device that is set up for device-internal handling of data in a text format, that has a communication interface set up to interchange binary data, particularly EXI data, and that provides a coder for reciprocally converting the binary data into text format data, wherein the coder is set up to associate resource descriptions with a respective identification number. This association is managed via a resource table, which is managed by the coder and in which a respective resource description in text form has an associated respective identification number. In addition, the device has, in a manner that is standard in the art, at least one device-internal database for storing resources and the relationships between said resources. According to the invention, a database access control unit is provided that has access both to the database and to the resource table. Centralisation of the resource table advantageously avoids redundant storage of resource descriptions. A further measure according to the invention provides for the database access control unit to be designed such that a resource to be stored is referenced on the basis of the associated identification number rather than the resource description. In comparison with the measures known in the prior art, the invention leads to faster reaction times for production of the EXI data stream.
Compression of javascript object notation data using structure information
A method for encoding and decoding a javascript object notation (JSON) document utilizing a statistical tree representing a JSON Schema. The encoded statistical tree may be optimized.
XML FILE CONDENSING
Examples associated with extensible markup language (XML) file condensing are described. On example includes generating a set of indexes from an XML file. The set of indexes may include a tag index that indexes tag terms used in the XML file, an attribute indexes attribute terms in the XML file, and a value index that indexes value terms in the XML file. The example also includes condensing the XML file into a condensed version of the XML file by translating the XML file according to the indexes. The example also includes generating a translated file. The translated file may be made from the set of indexes and the condensed version of the XML file.
ENCODING METHOD AND ENCODING APPARATUS
An encoding apparatus includes a memory and a processor configured to acquire text data, specify a first dynamic dictionary among a plurality of dynamic dictionaries based on attribute information of a first word included in the text data, register the first word in association with a first dynamic code in the first dynamic dictionary, and encode the first word into the first dynamic code.
SYSTEMS, METHODS, AND MEDIA FOR MANAGING AN IN-MEMORY NOSQL DATABASE
Systems, methods, and media for managing an in-memory NoSQL database are provided. In some embodiments, systems for managing an in-memory NoSQL database are provided, the systems comprising a hardware processor that is configured to: identify a group of keys capable of being included in a complex multi-key database command, the group including a first key and a second key, the first key being associated with a first value and the second key being associated with a second value, the first value and the second value being stored on different servers that are part of the database; and cause the first value and the second value to be both stored on a given server.
Hybrid bit-sliced dictionary encoding for fast index-based operations
Techniques are described herein for storing and processing codes included in dictionary-encoded data. In an embodiment, for each respective code of a plurality of codes in the dictionary-encoded data: a plurality of bits from a first portion of the respective code is contiguously stored. One or more bits from a second portion of the respective code is stored in one or more slices. Each respective slice of the one or more slices stores a bit from the one or more bits with a corresponding bit position in the respective code. In another embodiment, a bit-vector is generated based on at least one slice by loading each respective bit of the plurality of bits into different respective partitions in a register at a bit position corresponding to the at least one slice. A plurality of codes may be reconstructed by combining the bit-vector with one or more other bit-vectors.
System, Methods, and Media for Compressing Non-Relational Database Objects
Method, media, and systems for compressing objects, comprising: receiving a request to write a first object including a first key and a first value, wherein the first object is of a given type; receiving a request to write a second object including a second key and a second value, wherein the second object is of the given type; classifying the first object to a compression dictionary according to at least one rule based on a value of the first object and/or the key of the first object; classifying the second object to the compression dictionary according to at least one rule based on a value of the second object and/or the key of the second object; and compressing the first object and the second object based on the compression dictionary.
Systems, methods, and media for managing an in-memory NOSQL database
Systems, methods, and media for managing an in-memory NoSQL database are provided. In some embodiments, systems for managing an in-memory NoSQL database are provided, the systems comprising a hardware processor that is configured to: identify a group of keys capable of being included in a complex multi-key database command, the group including a first key and a second key, the first key being associated with a first value and the second key being associated with a second value, the first value and the second value being stored on different servers that are part of the database; and cause the first value and the second value to be both stored on a given server.
Document Store Utilizing Partial Object Compression
A query is received by a document store from a client. The query specifies at least one database operation implicating documents within the document store. At least one of the implicated documents has partially compressed objects in which only a portion of the document is compressed. Thereafter, the documents are scanned to identify the compressed objects and to identify decompression keys. The scanned compressed objects are then decompressed using the corresponding decompression keys. Data responsive to the query can then be provided to the client. lated apparatus, systems, techniques and articles are also described.