
mSystems, Journal Year: 2024, Volume and Issue: 9(10)
Published: Oct. 1, 2024
ABSTRACT While numerous computational frameworks and workflows are available for recovering prokaryote eukaryote genomes from metagenome data, only a limited number of pipelines designed specifically viromics analysis. With many tools developed in the last few years alone, it can be challenging scientists with bioinformatics experience to easily recover, evaluate quality, annotate genes, dereplicate, assign taxonomy, calculate relative abundance coverage viral using state-of-the-art methods standards. Here, we describe Modular Viromics Pipeline (MVP) v.1.0, user-friendly pipeline written Python providing simple framework perform standard analyses. MVP combines multiple enable genome identification, characterization filtering, clustering, taxonomic functional annotation, binning, comprehensive summaries results that used downstream ecological Overall, provides standardized reproducible both extensive robust viruses large-scale sequencing data including metagenomes, metatranscriptomes, viromes, isolate genomes. As typical use case, show how entire applied set 20 metagenomes wetland sediments 10 modules executed via command lines, leading identification 11,656 contigs 8,145 operational units (vOTUs) displaying clear beta-diversity pattern. Further, acting as dynamic wrapper, is continuously incorporate updates integrate new tools, ensuring its ongoing relevance rapidly evolving field viromics. at https://gitlab.com/ccoclet/mvp versioned packages PyPi Conda. IMPORTANCE The significance our work lies development (MVP), an integrated tailored exclusively stands out due modular design, which ensures easy installation, execution, integration databases. By combining such geNomad CheckV, high-quality recovery taxonomy host assignment, addressing limitations existing pipelines. MVP’s ability handle diverse sample types, environmental, human microbiome, plant-associated samples, makes versatile tool broader microbiome research community. standardizing analysis process interpretable results, enables researchers studies communities, significantly advancing understanding ecology impact on various ecosystems.
Language: Английский