151 related articles for article (PubMed ID: 29762754)
1. Optimized distributed systems achieve significant performance improvement on sorted merging of massive VCF files.
Sun X; Gao J; Jin P; Eng C; Burchard EG; Beaty TH; Ruczinski I; Mathias RA; Barnes K; Wang F; Qin ZS;
Gigascience; 2018 Jun; 7(6):. PubMed ID: 29762754
[TBL] [Abstract][Full Text] [Related]
2. The variant call format and VCFtools.
Danecek P; Auton A; Abecasis G; Albers CA; Banks E; DePristo MA; Handsaker RE; Lunter G; Marth GT; Sherry ST; McVean G; Durbin R;
Bioinformatics; 2011 Aug; 27(15):2156-8. PubMed ID: 21653522
[TBL] [Abstract][Full Text] [Related]
3. ILIAD: a suite of automated Snakemake workflows for processing genomic data for downstream applications.
Herrick N; Walsh S
BMC Bioinformatics; 2023 Nov; 24(1):424. PubMed ID: 37940870
[TBL] [Abstract][Full Text] [Related]
4. VC@Scale: Scalable and high-performance variant calling on cluster environments.
Ahmad T; Al Ars Z; Hofstee HP
Gigascience; 2021 Sep; 10(9):. PubMed ID: 34494101
[TBL] [Abstract][Full Text] [Related]
5. SeqHBase: a big data toolset for family based sequencing data analysis.
He M; Person TN; Hebbring SJ; Heinzen E; Ye Z; Schrodi SJ; McPherson EW; Lin SM; Peissig PL; Brilliant MH; O'Rawe J; Robison RJ; Lyon GJ; Wang K
J Med Genet; 2015 Apr; 52(4):282-8. PubMed ID: 25587064
[TBL] [Abstract][Full Text] [Related]
6. VCF-Explorer: filtering and analysing whole genome VCF files.
Akgün M; Demirci H
Bioinformatics; 2017 Nov; 33(21):3468-3470. PubMed ID: 29036499
[TBL] [Abstract][Full Text] [Related]
7. SeqArray-a storage-efficient high-performance data format for WGS variant calls.
Zheng X; Gogarten SM; Lawrence M; Stilp A; Conomos MP; Weir BS; Laurie C; Levine D
Bioinformatics; 2017 Aug; 33(15):2251-2257. PubMed ID: 28334390
[TBL] [Abstract][Full Text] [Related]
8. Rainbow: a tool for large-scale whole-genome sequencing data analysis using cloud computing.
Zhao S; Prenger K; Smith L; Messina T; Fan H; Jaeger E; Stephens S
BMC Genomics; 2013 Jun; 14():425. PubMed ID: 23802613
[TBL] [Abstract][Full Text] [Related]
9. Optimizing performance of GATK workflows using Apache Arrow In-Memory data framework.
Ahmad T; Ahmed N; Al-Ars Z; Hofstee HP
BMC Genomics; 2020 Nov; 21(Suppl 10):683. PubMed ID: 33208101
[TBL] [Abstract][Full Text] [Related]
10. VCF-kit: assorted utilities for the variant call format.
Cook DE; Andersen EC
Bioinformatics; 2017 May; 33(10):1581-1582. PubMed ID: 28093408
[TBL] [Abstract][Full Text] [Related]
11. Analysis-ready VCF at Biobank scale using Zarr.
Czech E; Millar TR; White T; Jeffery B; Miles A; Tallman S; Wojdyla R; Zabad S; Hammerbacher J; Kelleher J
bioRxiv; 2024 Jun; ():. PubMed ID: 38915693
[TBL] [Abstract][Full Text] [Related]
12. Big Data in metagenomics: Apache Spark vs MPI.
Abuín JM; Lopes N; Ferreira L; Pena TF; Schmidt B
PLoS One; 2020; 15(10):e0239741. PubMed ID: 33022000
[TBL] [Abstract][Full Text] [Related]
13. re-Searcher: GUI-based bioinformatics tool for simplified genomics data mining of VCF files.
Karabayev D; Molkenov A; Yerulanuly K; Kabimoldayev I; Daniyarov A; Sharip A; Seisenova A; Zhumadilov Z; Kairov U
PeerJ; 2021; 9():e11333. PubMed ID: 33987016
[TBL] [Abstract][Full Text] [Related]
14. MISS-D: A fast and scalable framework of medical image storage service based on distributed file system.
Li W; Feng C; Yu K; Zhao D
Comput Methods Programs Biomed; 2020 Apr; 186():105189. PubMed ID: 31759298
[TBL] [Abstract][Full Text] [Related]
15. CloudDOE: a user-friendly tool for deploying Hadoop clouds and analyzing high-throughput sequencing data with MapReduce.
Chung WC; Chen CC; Ho JM; Lin CY; Hsu WL; Wang YC; Lee DT; Lai F; Huang CW; Chang YJ
PLoS One; 2014; 9(6):e98146. PubMed ID: 24897343
[TBL] [Abstract][Full Text] [Related]
16. Analysis of Genetic Ancestry from NGS Data Using EthSEQ.
Dalfovo D; Romanel A
Curr Protoc; 2023 Feb; 3(2):e663. PubMed ID: 36779822
[TBL] [Abstract][Full Text] [Related]
17. VIVA (VIsualization of VAriants): A VCF File Visualization Tool.
Tollefson GA; Schuster J; Gelin F; Agudelo A; Ragavendran A; Restrepo I; Stey P; Padbury J; Uzun A
Sci Rep; 2019 Sep; 9(1):12648. PubMed ID: 31477778
[TBL] [Abstract][Full Text] [Related]
18. FASTA/Q data compressors for MapReduce-Hadoop genomics: space and time savings made easy.
Ferraro Petrillo U; Palini F; Cattaneo G; Giancarlo R
BMC Bioinformatics; 2021 Mar; 22(1):144. PubMed ID: 33752596
[TBL] [Abstract][Full Text] [Related]
19. Variant-Kudu: An Efficient Tool kit Leveraging Distributed Bitmap Index for Analysis of Massive Genetic Variation Datasets.
Fan J; Dong S; Wang B
J Comput Biol; 2020 Sep; 27(9):1350-1360. PubMed ID: 31904999
[TBL] [Abstract][Full Text] [Related]
20. Large-scale virtual screening on public cloud resources with Apache Spark.
Capuccini M; Ahmed L; Schaal W; Laure E; Spjuth O
J Cheminform; 2017; 9():15. PubMed ID: 28316653
[TBL] [Abstract][Full Text] [Related]
[Next] [New Search]