These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.
4. Nucleotide Archival Format (NAF) enables efficient lossless reference-free compression of DNA sequences. Kryukov K; Ueda MT; Nakagawa S; Imanishi T Bioinformatics; 2019 Oct; 35(19):3826-3828. PubMed ID: 30799504 [TBL] [Abstract][Full Text] [Related]
5. LCQS: an efficient lossless compression tool of quality scores with random access functionality. Fu J; Ke B; Dong S BMC Bioinformatics; 2020 Mar; 21(1):109. PubMed ID: 32183707 [TBL] [Abstract][Full Text] [Related]
6. RENANO: a REference-based compressor for NANOpore FASTQ files. Dufort Y Álvarez G; Seroussi G; Smircich P; Sotelo-Silveira J; Ochoa I; Martín Á Bioinformatics; 2021 Dec; 37(24):4862-4864. PubMed ID: 34128963 [TBL] [Abstract][Full Text] [Related]
7. Reference-free lossless compression of nanopore sequencing reads using an approximate assembly approach. Meng Q; Chandak S; Zhu Y; Weissman T Sci Rep; 2023 Feb; 13(1):2082. PubMed ID: 36747011 [TBL] [Abstract][Full Text] [Related]
8. FastqCLS: a FASTQ compressor for long-read sequencing via read reordering using a novel scoring model. Lee D; Song G Bioinformatics; 2022 Jan; 38(2):351-356. PubMed ID: 34623374 [TBL] [Abstract][Full Text] [Related]
9. LFQC: a lossless compression algorithm for FASTQ files. Nicolae M; Pathak S; Rajasekaran S Bioinformatics; 2015 Oct; 31(20):3276-81. PubMed ID: 26093148 [TBL] [Abstract][Full Text] [Related]
11. Compression of genomic sequencing reads via hash-based reordering: algorithm and analysis. Chandak S; Tatwawadi K; Weissman T Bioinformatics; 2018 Feb; 34(4):558-567. PubMed ID: 29444237 [TBL] [Abstract][Full Text] [Related]
12. PQSDC: a parallel lossless compressor for quality scores data via sequences partition and run-length prediction mapping. Sun H; Zheng Y; Xie H; Ma H; Zhong C; Yan M; Liu X; Wang G Bioinformatics; 2024 May; 40(5):. PubMed ID: 38759114 [TBL] [Abstract][Full Text] [Related]
13. genozip: a fast and efficient compression tool for VCF files. Lan D; Tobler R; Souilmi Y; Llamas B Bioinformatics; 2020 Jul; 36(13):4091-4092. PubMed ID: 32407471 [TBL] [Abstract][Full Text] [Related]
14. FaStore: a space-saving solution for raw sequencing data. Roguski L; Ochoa I; Hernaez M; Deorowicz S Bioinformatics; 2018 Aug; 34(16):2748-2756. PubMed ID: 29617939 [TBL] [Abstract][Full Text] [Related]
15. ScaleQC: a scalable lossy to lossless solution for NGS data compression. Yu R; Yang W Bioinformatics; 2020 Nov; 36(17):4551-4559. PubMed ID: 32458976 [TBL] [Abstract][Full Text] [Related]
16. CALQ: compression of quality values of aligned sequencing data. Voges J; Ostermann J; Hernaez M Bioinformatics; 2018 May; 34(10):1650-1658. PubMed ID: 29186284 [TBL] [Abstract][Full Text] [Related]
17. Efficient DNA sequence compression with neural networks. Silva M; Pratas D; Pinho AJ Gigascience; 2020 Nov; 9(11):. PubMed ID: 33179040 [TBL] [Abstract][Full Text] [Related]