262 related articles for article (PubMed ID: 30999863)
1. Analyzing big datasets of genomic sequences: fast and scalable collection of k-mer statistics.
Ferraro Petrillo U; Sorella M; Cattaneo G; Giancarlo R; Rombo SE
BMC Bioinformatics; 2019 Apr; 20(Suppl 4):138. PubMed ID: 30999863
2. Informational and linguistic analysis of large genomic sequence collections via efficient Hadoop cluster algorithms.
Ferraro Petrillo U; Roscigno G; Cattaneo G; Giancarlo R
Bioinformatics; 2018 Jun; 34(11):1826-1833. PubMed ID: 29342232
3. CloudDOE: a user-friendly tool for deploying Hadoop clouds and analyzing high-throughput sequencing data with MapReduce.
Chung WC; Chen CC; Ho JM; Lin CY; Hsu WL; Wang YC; Lee DT; Lai F; Huang CW; Chang YJ
PLoS One; 2014; 9(6):e98146. PubMed ID: 24897343
4. ADS-HCSpark: A scalable HaplotypeCaller leveraging adaptive data segmentation to accelerate variant calling on Spark.
Xiao A; Wu Z; Dong S
BMC Bioinformatics; 2019 Feb; 20(1):76. PubMed ID: 30764760
5. Applications of the MapReduce programming framework to clinical big data analysis: current landscape and future trends.
Mohammed EA; Far BH; Naugler C
BioData Min; 2014; 7():22. PubMed ID: 25383096
6. Big Data in metagenomics: Apache Spark vs MPI.
Abuín JM; Lopes N; Ferreira L; Pena TF; Schmidt B
PLoS One; 2020; 15(10):e0239741. PubMed ID: 33022000
7. A distributed computing model for big data anonymization in the networks.
Ashkouti F; Khamforoosh K
PLoS One; 2023; 18(4):e0285212. PubMed ID: 37115783
8. SparkSeq: fast, scalable and cloud-ready tool for the interactive genomic data analysis with nucleotide precision.
Wiewiórka MS; Messina A; Pacholewska A; Maffioletti S; Gawrysiak P; Okoniewski MJ
Bioinformatics; 2014 Sep; 30(18):2652-3. PubMed ID: 24845651
9. PySpark and RDKit: Moving towards Big Data in Cheminformatics.
Lovrić M; Molero JM; Kern R
Mol Inform; 2019 Jun; 38(6):e1800082. PubMed ID: 30844132
10. Scalability and Validation of Big Data Bioinformatics Software.
Yang A; Troup M; Ho JWK
Comput Struct Biotechnol J; 2017; 15():379-386. PubMed ID: 28794828
11. HBLAST: Parallelised sequence similarity--A Hadoop MapReducable basic local alignment search tool.
O'Driscoll A; Belogrudov V; Carroll J; Kropp K; Walsh P; Ghazal P; Sleator RD
J Biomed Inform; 2015 Apr; 54():58-64. PubMed ID: 25625550
12. VC@Scale: Scalable and high-performance variant calling on cluster environments.
Ahmad T; Al Ars Z; Hofstee HP
Gigascience; 2021 Sep; 10(9):. PubMed ID: 34494101
13. Bioinformatics applications on Apache Spark.
Guo R; Zhao Y; Zou Q; Fang X; Peng S
Gigascience; 2018 Aug; 7(8):. PubMed ID: 30101283
14. PyGMQL: scalable data extraction and analysis for heterogeneous genomic datasets.
Nanni L; Pinoli P; Canakoglu A; Ceri S
BMC Bioinformatics; 2019 Nov; 20(1):560. PubMed ID: 31703553
15. MaRe: Processing Big Data with application containers on Apache Spark.
Capuccini M; Dahlö M; Toor S; Spjuth O
Gigascience; 2020 May; 9(5):. PubMed ID: 32369166
16. SeQual-Stream: approaching stream processing to quality control of NGS datasets.
Castellanos-Rodríguez Ó; Expósito RR; Touriño J
BMC Bioinformatics; 2023 Oct; 24(1):403. PubMed ID: 37891497
17. FASTA/Q data compressors for MapReduce-Hadoop genomics: space and time savings made easy.
Ferraro Petrillo U; Palini F; Cattaneo G; Giancarlo R
BMC Bioinformatics; 2021 Mar; 22(1):144. PubMed ID: 33752596
18. Distributed Fast Self-Organized Maps for Massive Spectrophotometric Data Analysis.
Dafonte C; Garabato D; Álvarez MA; Manteiga M
Sensors (Basel); 2018 May; 18(5):. PubMed ID: 29751580
19. A quantitative assessment of the Hadoop framework for analyzing massively parallel DNA sequencing data.
Siretskiy A; Sundqvist T; Voznesenskiy M; Spjuth O
Gigascience; 2015; 4():26. PubMed ID: 26045962
20. An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics.
Taylor RC
BMC Bioinformatics; 2010 Dec; 11(Suppl 12):S1. PubMed ID: 21210976