22. GBZ file format for pangenome graphs. Sirén J; Paten B. Bioinformatics; 2022 Nov; 38(22):5012-5018. PubMed ID: 36179091
23. Data-dependent bucketing improves reference-free compression of sequencing reads. Patro R; Kingsford C. Bioinformatics; 2015 Sep; 31(17):2770-7. PubMed ID: 25910696
24. ChIPWig: a random access-enabling lossless and lossy compression method for ChIP-seq data. Ravanmehr V; Kim M; Wang Z; Milenkovic O. Bioinformatics; 2018 Mar; 34(6):911-919. PubMed ID: 29087447
26. FastqCLS: a FASTQ compressor for long-read sequencing via read reordering using a novel scoring model. Lee D; Song G. Bioinformatics; 2022 Jan; 38(2):351-356. PubMed ID: 34623374
27. Constructing small genome graphs via string compression. Qiu Y; Kingsford C. Bioinformatics; 2021 Jul; 37(Suppl_1):i205-i213. PubMed ID: 34252955
28. AQUa: an adaptive framework for compression of sequencing quality scores with random access functionality. Paridaens T; Van Wallendael G; De Neve W; Lambert P. Bioinformatics; 2018 Feb; 34(3):425-433. PubMed ID: 29028894
29. Building large updatable colored de Bruijn graphs via merging. Muggli MD; Alipanahi B; Boucher C. Bioinformatics; 2019 Jul; 35(14):i51-i60. PubMed ID: 31510647
30. Nucleotide Archival Format (NAF) enables efficient lossless reference-free compression of DNA sequences. Kryukov K; Ueda MT; Nakagawa S; Imanishi T. Bioinformatics; 2019 Oct; 35(19):3826-3828. PubMed ID: 30799504
31. RENANO: a REference-based compressor for NANOpore FASTQ files. Dufort Y Álvarez G; Seroussi G; Smircich P; Sotelo-Silveira J; Ochoa I; Martín Á. Bioinformatics; 2021 Dec; 37(24):4862-4864. PubMed ID: 34128963
32. High-speed and high-ratio referential genome compression. Liu Y; Peng H; Wong L; Li J. Bioinformatics; 2017 Nov; 33(21):3364-3372. PubMed ID: 28651329
34. QVZ: lossy compression of quality values. Malysa G; Hernaez M; Ochoa I; Rao M; Ganesan K; Weissman T. Bioinformatics; 2015 Oct; 31(19):3122-9. PubMed ID: 26026138
35. GABAC: an arithmetic coding solution for genomic data. Voges J; Paridaens T; Müntefering F; Mainzer LS; Bliss B; Yang M; Ochoa I; Fostier J; Ostermann J; Hernaez M. Bioinformatics; 2020 Apr; 36(7):2275-2277. PubMed ID: 31830243
36. Compression of next-generation sequencing reads aided by highly efficient de novo assembly. Jones DC; Ruzzo WL; Peng X; Katze MG. Nucleic Acids Res; 2012 Dec; 40(22):e171. PubMed ID: 22904078
37. Performance evaluation of lossy quality compression algorithms for RNA-seq data. Yu R; Yang W; Wang S. BMC Bioinformatics; 2020 Jul; 21(1):321. PubMed ID: 32689929
38. GMHCC: high-throughput analysis of biomolecular data using graph-based multiple hierarchical consensus clustering. Lu Y; Yu Z; Wang Y; Ma Z; Wong KC; Li X. Bioinformatics; 2022 May; 38(11):3020-3028. PubMed ID: 35451457
39. CURC: a CUDA-based reference-free read compressor. Xie S; He X; He S; Zhu Z. Bioinformatics; 2022 Jun; 38(12):3294-3296. PubMed ID: 35579371