Huang, Liren: Cloud-based Bioinformatics Framework for Next-Generation Sequencing Data. 2019

Inhalt

2 Related Work

2.1 The Apache Hadoop and Spark frameworks

2.2 Sequence alignment and its cloud implementations

2.3 De novo assembly and its cloud implementations

2.4 Conclusion

3 Sparkhit: Distributed sequence alignment

3.1 The pipeline for sequence alignment

3.2 Distributed implementation

4 Reflexiv: Parallel De Novo genome assembly

5 Large scale genomic data analyses

6 Conclusion and outlook