Delete stored data

How UltiHash provides rapid data deletion at scale

Efficient data deletion in a distributed storage system is often overlooked, yet it is inherently complex. In many systems, deleting an object requires coordination across multiple nodes to ensure all associated data fragments are removed safely. This coordination can be time-consuming, and in some architectures, the disk space freed by deletes is only reclaimed during deferred maintenance tasks such as compaction or garbage collection. Both issues can lead to slow deletions, delayed space reclamation, and unpredictable storage availability.

UltiHash avoids these drawbacks by keeping fine-grained reference counts directly on each storage service instance. When an object is deleted, each instance can immediately determine whether the data fragments it holds are still referenced by any other object in the cluster. Because this check is local to the instance, no cluster-wide coordination protocol is needed, which greatly reduces deletion latency.

As soon as a data fragment’s reference count reaches zero, the fragment is removed and the disk space it occupied is reclaimed instantly. This ensures that storage capacity is returned to the system without waiting for background compaction or garbage collection cycles.
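The reference-counting behaviour described above can be illustrated with a small sketch. This is not UltiHash's implementation: the `StorageInstance` class, its method names, and the fragment IDs are all hypothetical, and a single in-memory dictionary stands in for one instance's on-disk fragments. It only shows the idea that an instance can decide locally whether a fragment is still needed and free it the moment its count reaches zero.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class StorageInstance:
    """Hypothetical storage instance: fragments plus a local reference count for each."""

    fragments: dict[str, bytes] = field(default_factory=dict)  # fragment id -> payload
    refcounts: dict[str, int] = field(default_factory=dict)    # fragment id -> live references

    def add_reference(self, fragment_id: str, payload: bytes) -> None:
        # Store the payload only once; later objects just add a reference.
        if fragment_id not in self.fragments:
            self.fragments[fragment_id] = payload
            self.refcounts[fragment_id] = 0
        self.refcounts[fragment_id] += 1

    def release_reference(self, fragment_id: str) -> int:
        """Drop one reference and return the bytes reclaimed (0 if still referenced)."""
        self.refcounts[fragment_id] -= 1
        if self.refcounts[fragment_id] > 0:
            return 0
        # Count reached zero: remove the fragment right away, with no deferred GC pass.
        del self.refcounts[fragment_id]
        return len(self.fragments.pop(fragment_id))

    def used_bytes(self) -> int:
        return sum(len(payload) for payload in self.fragments.values())


node = StorageInstance()

# Two objects share fragment "f1"; only object A uses "f2".
object_a = {"f1": b"shared-data", "f2": b"only-in-a"}
object_b = {"f1": b"shared-data"}
for obj in (object_a, object_b):
    for fid, payload in obj.items():
        node.add_reference(fid, payload)

# Deleting object A frees "f2" at once but keeps "f1", which object B still references.
reclaimed = sum(node.release_reference(fid) for fid in object_a)
print(reclaimed, node.used_bytes())
```

In this sketch, `used_bytes()` always equals the size of the fragments that some object still references, which mirrors the claim that freed capacity is visible without waiting for a background cycle.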

This design provides predictable performance for delete-heavy workloads and ensures that available disk space reflects the true live data set at all times.

Last updated 4 months ago
