Patent classifications
H03M7/4043
System and method for data storage, transfer, synchronization, and security using automated model monitoring and training
A system and method for lossy precompression for data compaction using automated model monitoring and training, wherein statistical analyses of test datasets are used to determine if the probability distribution of two datasets are within a pre-determined range, and responsive to that determination new encoding and decoding algorithms may be retrained in order to produce new data sourceblocks, and pre-compression of data prior to processing and statistical analysis allows for the compaction of already compressed data into highly dense formats. The new data sourceblocks may then be processed and assigned new codewords which are compiled into an updated codebook which may be distributed back to encoding and decoding systems and devices.
SYSTEMS AND METHODS FOR IMPROVED ENTROPY CODING EFFICIENCY
Systems and methods for an entropy coding system are described. The entropy coding systems include an encoding apparatus and a decoding apparatus. The encoding apparatus is configured to receive an original input stream comprising a plurality of symbols having a known entropy characteristic according to a probability distribution of each of the symbols appearing in the original input stream, determine an input and respective state for each symbol read from the original input stream, append the determined input to the encoded output stream, and provided the encoded output stream to the decoding apparatus. The decoding apparatus is configured to receive the encoded output stream, process the encoded output stream, and for each read input: determine an output symbol and a respective output, persist the respective output state to the encoded output stream, and append the determined output symbol to the results output stream.
Computer data compression utilizing multiple symbol alphabets and dynamic binding of symbol alphabets
The generation of symbol-encoded data from digital data, as part of the compression of the digital data into a compressed digital data, can be performed with reference to multiple alternative alphabets. A selection of a specific alphabet is made based on the digital data being compressed, the compression parameters, or combinations thereof. Information indicative of the selected alphabet is encoded into one or more headers of the resulting compressed digital data. A single alphabet can be selected for all of a set of digital data being compressed, or multiple different alphabets can be selected, with different ones of the multiple different alphabets being utilized to compress different portions of the digital data. Additionally, rather than explicitly specifying a specific selected alphabet, the header information can comprise information from which a same alphabet can be independently selected heuristically by both the compressor and the corresponding decompressor.
Codebook management based on data source grouping
A system and method for codebook management is disclosed. Training datasets are obtained from various data sources. A similarity score is generated for each training dataset with reference to the other training datasets. In response to detecting a similarity score above a predetermined threshold for one or more of the other training datasets, a combined codebook is created based on training datasets that have a similarity score above a predetermined threshold. Based on the similarity score, multiple data sources are combined into a group, and the combined codebook is used for the data sources within the group. A mismatch performance metric can be computed for the combined codebook, and a revised combined codebook can be regenerated in response to the mismatch performance metric being above a predetermined threshold.
Adaptive Data Compression and Encryption System Using Reinforcement Learning for Pipeline Configuration
A system and method for optimizing data compression and encryption using reinforcement learning. The system analyzes incoming data streams to extract statistical features and data characteristics, which are processed by a reinforcement learning engine to automatically configure a multi-stage compression pipeline. Each compression stage transforms data into optimized distributions, applies Huffman coding, and maintains full encryption using homomorphic operations. A performance monitor tracks compression efficiency, processing speed, and output quality in real-time, providing feedback to continuously improve the reinforcement learning model's decisions. The system can dynamically adjust between one to five compression stages and select appropriate compression methods, including traditional algorithms or neural network-based approaches, based on data characteristics and performance requirements. All processing occurs on encrypted data without requiring decryption, ensuring complete data security throughout the pipeline. The adaptive nature of the system enables optimal compression performance across diverse data types while maintaining encryption integrity.
Systems and methods for improved entropy coding efficiency
Systems and methods for an entropy coding system are described. The entropy coding systems include an encoding apparatus and a decoding apparatus. The encoding apparatus is configured to receive an original input stream comprising a plurality of symbols having a known entropy characteristic according to a probability distribution of each of the symbols appearing in the original input stream, determine an input and respective state for each symbol read from the original input stream, append the determined input to the encoded output stream, and provided the encoded output stream to the decoding apparatus. The decoding apparatus is configured to receive the encoded output stream, process the encoded output stream, and for each read input: determine an output symbol and a respective output, persist the respective output state to the encoded output stream, and append the determined output symbol to the results output stream.