Genome Compression

This paper describes a tool, written in C and supported by several shell scripts that invoke command-line utilities (diff, bzip), that uses a reference genome to compress another genome. The source code can be found here:

This paper uses the Korean genome sequence; the analysis can be found here and the dataset can be found here.
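
To make the idea of reference-based compression concrete, the following is a minimal sketch in C. It is not the tool described in this paper. It assumes the reference and target sequences have the same length and differ only by single-base substitutions, whereas a diff-based approach would also handle insertions and deletions. The names Substitution, encode_against_reference, and decode_with_reference are hypothetical and chosen only for illustration.

```c
#include <stddef.h>
#include <string.h>

/* One substitution: the target holds `base` at index `pos`,
 * where the reference holds a different base. (Hypothetical type.) */
typedef struct {
    size_t pos;
    char base;
} Substitution;

/* Hypothetical helper: record every position where `target` differs
 * from `reference`. Both sequences are assumed to have length `len`.
 * At most `max_out` entries are written to `out`; the return value is
 * the total number of differences found, which may exceed `max_out`. */
size_t encode_against_reference(const char *reference, const char *target,
                                size_t len, Substitution *out, size_t max_out)
{
    size_t count = 0;
    for (size_t i = 0; i < len; i++) {
        if (reference[i] != target[i]) {
            if (count < max_out) {
                out[count].pos = i;
                out[count].base = target[i];
            }
            count++;
        }
    }
    return count;
}

/* Hypothetical helper: rebuild the target from the reference plus the
 * substitution list. `out` must have room for len + 1 bytes. */
void decode_with_reference(const char *reference, size_t len,
                           const Substitution *subs, size_t n_subs, char *out)
{
    memcpy(out, reference, len);
    out[len] = '\0';
    for (size_t i = 0; i < n_subs; i++) {
        if (subs[i].pos < len) {
            out[subs[i].pos] = subs[i].base;
        }
    }
}
```

When two genomes are closely related, the substitution list is far smaller than the target sequence itself, so only the list needs to be stored and then passed to a general-purpose compressor. The reference must be available again at decompression time.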

