Authors
Divon Lan, Ray Tobler, Yassine Souilmi, Bastien Llamas
Publication date
2021/8/15
Journal
Bioinformatics
Volume
37
Issue
16
Pages
2225-2230
Publisher
Oxford University Press
Description
We present Genozip, a universal and fully featured compression software for genomic data. Genozip is designed to be a general-purpose software and a development framework for genomic compression by providing five core capabilities—universality (support for all common genomic file formats), high compression ratios, speed, feature-richness and extensibility. Genozip delivers high-performance compression for widelyused genomic data formats in genomics research, namely FASTQ, SAM/BAM/CRAM, VCF, GVF, FASTA, PHYLIP and 23andMe formats. Our test results show that Genozip is fast and achieves greatly improved compression ratios, even when the files are already compressed. Further, Genozip is architected with a separation of the Genozip Framework from file-format-specific Segmenters and data-type-specific Codecs. With this, we intend for Genozip to be a general-purpose …
Total citations
202120222023202441159
Scholar articles