fs-c is an application to measure and analyze file systems in order to find the internal and temporal redundancy for file-based chunking and fingerprint-based data de-duplication. The fs-c tools analyze the internal and temporal redundancy of file system directories, as detected by content-defined chunking using Rabin's fingerprinting method and by static chunking with different chunk sizes. The goal is to give users a rough estimate of the redundancy that de-duplication systems would find for their concrete workload, and to provide a basis for further enhancements to the tools, e.g. application-specific chunking methods. The code provided in this project is based on the code used in "Dirk Meister, Andre Brinkmann: Multi-Level Comparison of Data Deduplication in a Backup Scenario, SYSTOR 2009".
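To illustrate the difference between the two chunking approaches named above, here is a minimal Python sketch. It is not the fs-c implementation. The function names (`static_chunks`, `cdc_chunks`, `redundancy`), the parameters (window size, mask, minimum and maximum chunk size) and the use of SHA-1 as the chunk fingerprint are assumptions made for illustration, and a simple polynomial rolling hash stands in for Rabin's fingerprinting method.

```python
import hashlib
import random


def static_chunks(data: bytes, size: int):
    """Split data into fixed-size chunks (the last one may be shorter)."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def cdc_chunks(data: bytes, window=16, mask=0x3F, min_size=32, max_size=1024):
    """Content-defined chunking with a polynomial rolling hash.

    Assumption: this rolling hash is a stand-in for Rabin fingerprints.
    A chunk ends where the hash of the last `window` bytes has all `mask`
    bits set, subject to the minimum and maximum chunk sizes.
    """
    base, mod = 257, (1 << 61) - 1
    top = pow(base, window - 1, mod)  # weight of the byte leaving the window
    chunks, start, h = [], 0, 0
    for i, byte in enumerate(data):
        if i - start >= window:
            # Drop the oldest byte so the hash covers only the last `window` bytes.
            h = (h - data[i - window] * top) % mod
        h = (h * base + byte) % mod
        length = i - start + 1
        if (length >= min_size and (h & mask) == mask) or length >= max_size:
            chunks.append(data[start:i + 1])
            start, h = i + 1, 0
    if start < len(data):
        chunks.append(data[start:])  # trailing chunk without a boundary
    return chunks


def redundancy(chunks):
    """Fraction of bytes in chunks whose fingerprint was already seen."""
    seen, total, dup = set(), 0, 0
    for chunk in chunks:
        fingerprint = hashlib.sha1(chunk).digest()
        total += len(chunk)
        if fingerprint in seen:
            dup += len(chunk)
        else:
            seen.add(fingerprint)
    return dup / total if total else 0.0


# Two "generations" of a hypothetical file: the second has 7 bytes prepended.
rng = random.Random(0)
gen1 = bytes(rng.getrandbits(8) for _ in range(20000))
gen2 = b"X" * 7 + gen1

print("static:", redundancy(static_chunks(gen1, 64) + static_chunks(gen2, 64)))
print("cdc:   ", redundancy(cdc_chunks(gen1) + cdc_chunks(gen2)))
```

In this sketch the small insertion shifts every fixed-size chunk boundary in the second generation, so static chunking finds essentially no duplicates, while the content-defined boundaries realign shortly after the insertion and most of the second generation is detected as redundant. Real workloads will behave differently, which is the kind of question the fs-c tools are meant to help answer.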