G06F3/0608

QUALITY-PERFORMANCE OPTIMIZED IDENTIFICATION OF DUPLICATE DATA

An approach is provided for optimized identification of duplicate data in a networked computing environment. An aggregate feature vector is created that is specific to an attribute of the data (e.g., a field that holds specific informational content). The aggregate feature vector has a set of dimensions, each of which defines a specific comparison function used to test for similarity between data entries in the attribute. Each dimension in the aggregate feature vector is assigned an effectiveness, and a cost is computed for each dimension. Based on the effectiveness and cost, a subset of dimensions is selected to form an optimized feature vector. This optimized feature vector can then be used to analyze a dataset to find matching data.
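
The selection step can be illustrated with a minimal Python sketch. The abstract does not say how effectiveness and cost are combined, so the greedy effectiveness-per-cost ranking, the `budget` parameter, the three comparison functions, and the rule that any one firing dimension counts as a match are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Dimension:
    name: str
    compare: Callable[[str, str], bool]  # comparison function for this dimension
    effectiveness: float                 # assumed score in [0, 1]
    cost: float                          # assumed relative compute cost, > 0


def select_dimensions(dims: List[Dimension], budget: float) -> List[Dimension]:
    """Greedily pick dimensions by effectiveness per unit cost within a cost budget."""
    chosen, spent = [], 0.0
    for d in sorted(dims, key=lambda d: d.effectiveness / d.cost, reverse=True):
        if spent + d.cost <= budget:
            chosen.append(d)
            spent += d.cost
    return chosen


def is_match(a: str, b: str, dims: List[Dimension]) -> bool:
    """Assumption: two entries match if any selected comparison function fires."""
    return any(d.compare(a, b) for d in dims)


# Hypothetical aggregate feature vector for a "company name" attribute.
aggregate = [
    Dimension("case_insensitive", lambda a, b: a.lower() == b.lower(), 0.6, 1.0),
    Dimension("whitespace_normalized",
              lambda a, b: " ".join(a.split()) == " ".join(b.split()), 0.5, 1.0),
    Dimension("sorted_tokens",
              lambda a, b: sorted(a.lower().split()) == sorted(b.lower().split()),
              0.9, 6.0),
]

# The expensive "sorted_tokens" dimension does not fit in the budget.
optimized = select_dimensions(aggregate, budget=2.0)
```

With this budget the optimized vector keeps the two cheap dimensions, so `"IBM Corp"` and `"ibm corp"` match while `"Corp IBM"` and `"IBM Corp"` do not.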

Tiered Storage System with Defragmentation Based on Weighted Flash Fragmentation Factor
20220413729 · 2022-12-29

Systems and methods for providing tier selection for data based on a weighted flash fragmentation factor. A weighted flash fragmentation factor is determined indicating a severity of fragmentation in a non-volatile storage based on a logical block address range in a logical-to-physical mapping table for data from a host device to be stored in the tiered data storage system. The factor is shared with the host device to determine a tier selection. The data is stored according to the tier selection based on the factor.

Apparatus and method for erasing data programmed in a non-volatile memory block in a memory system
11537315 · 2022-12-27

A memory system includes a memory device having a plurality of memory blocks for storing data, and a controller configured to perform an erase operation including plural unit erase operations to erase data stored in at least one target memory block included in the plurality of memory blocks. The controller can be configured to perform at least some of the plural unit erase operations on the at least one target memory block before the at least one target memory block is allocated for storing data.

Determining content-dependent deltas between data sectors
11537563 · 2022-12-27

In one implementation, a method includes identifying a first content-dependent feature associated with a data sector. The method further includes determining a baseline data sector associated with the data sector. The method further includes determining, by a processing device, a content-dependent delta between the first content-dependent feature and a second content-dependent feature of the baseline data sector. The method further includes providing the content-dependent delta and an indicator to the baseline data sector for storage on a plurality of storage devices.

Deleting snapshot pages using sequence numbers and page lookups

The present disclosure is related to methods, systems, and machine-readable media for deleting snapshot pages using sequence numbers and page lookups. A monotonically-increasing sequence number (SN) can be assigned to each created page of a first snapshot of a storage volume. A first snapshot sequence number (snapSN) can be assigned to the first snapshot responsive to a creation of a second snapshot, wherein the first snapSN is equal to a largest SN of the first snapshot. An SN can be assigned to each created page of the second snapshot, wherein a first page of the second snapshot is assigned an SN monotonically increased from the first snapSN. A second snapSN can be assigned to the second snapshot responsive to a creation of a third snapshot, wherein the second snapSN is equal to a largest SN of the second snapshot. An SN can be assigned to each created page of the third snapshot, wherein a first page of the third snapshot is assigned an SN monotonically increased from the second snapSN. A deletion process can be performed in response to receiving a request to delete the second snapshot that includes deleting a particular page of the second snapshot responsive to determining that the particular page is not shared between the second snapshot and the first snapshot or between the second snapshot and the third snapshot.
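
The sequence-number bookkeeping can be sketched in Python as follows. The `Volume` class, its per-snapshot address maps, and the two sharing tests (comparing a page's SN against the previous snapSN, and looking up the same address in the next snapshot) are illustrative assumptions; the abstract only states that sequence numbers and page lookups are used.

```python
class Volume:
    """Toy model: each snapshot maps a logical address to the SN of its page."""

    def __init__(self):
        self.next_sn = 1
        self.snap_sns = []   # snapSN of each sealed snapshot, oldest first
        self.maps = [{}]     # maps[-1] belongs to the running snapshot

    def write(self, addr):
        """Create a new page in the running snapshot with the next SN."""
        sn = self.next_sn
        self.next_sn += 1
        self.maps[-1][addr] = sn
        return sn

    def create_snapshot(self):
        """Seal the running snapshot; its snapSN is the largest SN assigned so far."""
        self.snap_sns.append(self.next_sn - 1)
        # The new running snapshot initially shares every page with the sealed one.
        self.maps.append(dict(self.maps[-1]))

    def deletable_pages(self, idx):
        """Return SNs of pages owned only by snapshot idx (0-based)."""
        prev_snap_sn = self.snap_sns[idx - 1] if idx > 0 else 0
        next_map = self.maps[idx + 1] if idx + 1 < len(self.maps) else {}
        freed = []
        for addr, sn in self.maps[idx].items():
            shared_with_prev = sn <= prev_snap_sn      # page predates this snapshot
            shared_with_next = next_map.get(addr) == sn  # next snapshot still uses it
            if not shared_with_prev and not shared_with_next:
                freed.append(sn)
        return sorted(freed)


v = Volume()
v.write("A"); v.write("B"); v.write("C")   # SNs 1..3 in the first snapshot
v.create_snapshot()                        # first snapSN = 3
v.write("A"); v.write("B")                 # SNs 4, 5 in the second snapshot
v.create_snapshot()                        # second snapSN = 5
v.write("A")                               # SN 6 in the third snapshot
```

Here `v.deletable_pages(1)` returns `[4]`: page 3 is shared with the first snapshot, page 5 is still referenced by the third, and only page 4 belongs to the second snapshot alone.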

DATA ACCUMULATION SYSTEM AND DATA ACCUMULATION METHOD
20220404974 · 2022-12-22

A first controller included in a data recorder performs a first control (A3), which includes automatically starting transmission of data in response to contactless communication being enabled between a first communication module included in the data recorder and a second communication module included in a communication device.

STORAGE DEVICE AND OPERATING METHOD OF STORAGE DEVICE

A storage device according to the technical ideas of the inventive concepts includes: a memory device including a plurality of memory blocks, the plurality of memory blocks including a plurality of pages; a data controller configured to receive, from a host, data to be written to the plurality of pages and a stream identifier corresponding to the data, and determine whether to change the stream identifier based on an age of the data representing a frequency of update of the data on one of the plurality of memory blocks or one of the plurality of pages; and a garbage collection controller configured to classify the data based on the stream identifier, and control a garbage collection operation of the memory device based on a classification result.

Intelligent Block Allocation In A Heterogeneous Storage System
20220404997 · 2022-12-22

A storage system forms an allocation unit for writing into solid-state storage memory. The allocation unit is formed from at least a portion of a first erase block and a first sub block of a partitioned second erase block. The system forms multiple subsequent allocation units. Each subsequent allocation unit is formed from a remaining second sub block of a partitioned erase block and at least a portion of a next erase block. Forming the subsequent allocation units consumes each of multiple erase blocks in a cascading sequence. The system allocates the allocation unit and the subsequent allocation units for writing in the storage system.
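
The cascading consumption of erase blocks can be sketched in Python. This is a simplified model under stated assumptions: allocation units have a fixed size `au_size`, erase blocks are described only by their sizes, and each unit is recorded as a list of hypothetical `(block_index, offset, length)` extents. The abstract does not specify sizes or data layouts.

```python
def form_allocation_units(erase_block_sizes, au_size):
    """Carve fixed-size allocation units out of consecutive erase blocks.

    When a unit ends partway through an erase block, the remaining sub block
    of that erase block starts the next unit, so blocks are consumed in a
    cascading sequence. A trailing partial unit is dropped.
    """
    units = []
    current, need = [], au_size
    for idx, size in enumerate(erase_block_sizes):
        offset = 0
        while offset < size:
            take = min(need, size - offset)
            current.append((idx, offset, take))
            offset += take
            need -= take
            if need == 0:          # unit complete; start the next one
                units.append(current)
                current, need = [], au_size
    return units


# Five erase blocks of 4 pages each, allocation units of 5 pages.
units = form_allocation_units([4, 4, 4, 4, 4], au_size=5)
```

In this run the first unit is all of block 0 plus the first page of block 1, and the second unit is the remaining three pages of block 1 plus the first two pages of block 2, and so on through the blocks.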

Data Storage Device and Method for Progressive Fading for Video Surveillance Systems

A data storage device and method for progressive fading for video surveillance systems are provided. In one embodiment, a data storage device is provided comprising a memory and a controller. The controller is configured to store a plurality of digital video frames in the memory over time; and create free space in the memory by deleting some of the plurality of digital video frames across a plurality of subsets of digital video frames, wherein fewer digital video frames are deleted from a subset stored more recently in time than from a subset stored less recently in time. Other embodiments are provided.
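
A minimal Python sketch of the fading idea follows. The function name, the equal-sized subsets, and the per-subset `keep_every` stride are assumptions for illustration; the abstract only requires that fewer frames are deleted from more recently stored subsets.

```python
def progressive_fade(frames, num_subsets, keep_every):
    """Thin out frames, oldest subsets most aggressively.

    frames      -- frames ordered oldest first
    num_subsets -- number of age-based subsets to split the frames into
    keep_every  -- stride per subset (oldest subset first); 1 keeps every frame
    """
    size = -(-len(frames) // num_subsets)  # ceiling division
    kept = []
    for i in range(num_subsets):
        subset = frames[i * size:(i + 1) * size]
        kept.extend(subset[::keep_every[i]])  # keep every n-th frame of the subset
    return kept


frames = list(range(12))  # frame 0 is the oldest, frame 11 the newest
kept = progressive_fade(frames, num_subsets=3, keep_every=[4, 2, 1])
```

With these strides, three frames are deleted from the oldest subset, two from the middle subset, and none from the newest, leaving `[0, 4, 6, 8, 9, 10, 11]`.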

FAST GARBAGE COLLECTION IN ZONED NAMESPACES SSDS

A data storage device includes a memory device and a controller coupled to the memory device. The data storage device supports zoned namespaces. The controller is configured to create and maintain a zone timestamp table that includes a corresponding timestamp for each zone and to add a timestamp to each garbage collection block of the memory device. The controller is further configured to scan a garbage collection block from a last physical block address (PBA) entry to a first PBA entry, determine a zone timestamp for the scanned PBA entry, and compare the zone timestamp to a timestamp of the garbage collection block. The controller is further configured to create and maintain a zone-based defragmentation table.
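
The reverse scan and timestamp comparison can be sketched in Python. The abstract does not say what the comparison decides, so this sketch assumes that an entry is stale when its zone's timestamp is newer than the garbage collection block's timestamp (for example, because the zone was reset after the block was written). The function name and the `(pba, zone_id)` entry layout are likewise hypothetical.

```python
def scan_gc_block(entries, block_timestamp, zone_timestamps):
    """Scan a garbage collection block from its last PBA entry to its first.

    entries         -- list of (pba, zone_id) tuples in write order
    block_timestamp -- timestamp added to the garbage collection block
    zone_timestamps -- zone timestamp table: zone_id -> timestamp

    Assumption: an entry is stale if its zone timestamp is newer than the
    block timestamp; otherwise it is treated as still valid.
    """
    valid, stale = [], []
    for pba, zone_id in reversed(entries):
        if zone_timestamps[zone_id] > block_timestamp:
            stale.append(pba)
        else:
            valid.append(pba)
    return valid, stale


zone_timestamps = {0: 100, 1: 250}
entries = [(10, 0), (11, 1), (12, 0)]
valid, stale = scan_gc_block(entries, block_timestamp=200,
                             zone_timestamps=zone_timestamps)
```

Under this assumption, the entries belonging to zone 0 remain valid and the entry belonging to zone 1 is classified as stale.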