deduplicating backup program
git clone
Log | Files | Refs | README | LICENSE

commit c464b663ad1fde4840833877c100064aef076a9e
parent 63f22ba05c13f6070704aa494e3f54b602194e74
Author: sin <>
Date:   Mon, 18 Feb 2019 17:57:04 +0000


MREADME | 8++++----
1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/README b/README @@ -3,9 +3,9 @@ dedup is a simple data deduplication program. dedup only handles a single file at a time, so using tar is advised. For example, to dedup a tar file you can invoke dedup as follows: - tar cf - ~/bak | dedup -r ~/bak-dedup -m "$(date)" + tar -c ~/bak | dedup -r ~/bak -m "$(date)" -This will create .{cache,index,store} files in the ~/bak-dedup +This will create .{cache,index,store} files in the ~/bak directory. The store file contains all the unique blocks. The index file contains all the revisions of files that have been deduplicated. Each revision is identified by its SHA256 hash. The cache file is @@ -13,14 +13,14 @@ only used to speed up block comparison. To list all known revisions run: - dedup -r ~/bak-dedup -l + dedup -r ~/bak -l You will get a list of hashes. Each hash corresponds to a single file (in this case, a tar archive). To extract a file from the deduplicated store run: - dedup -r ~/bak-dedup -e <hash> > bak.tar + dedup -r ~/bak -e <hash> > bak.tar Cheers, sin