Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life

doi:10.1038/s41564-017-0012-7

Donovan H. Parks, +7 more

- 11 Sep 2017 -

Nature Microbiology

- Vol. 2, Iss: 11, pp 1533-1542

TL;DR

The recovery of 7,903 bacterial and archaeal metagenome-assembled genomes increases the phylogenetic diversity represented by public genome repositories and provides the first representatives from 20 candidate phyla.

Abstract:

Challenges in cultivating microorganisms have limited the phylogenetic diversity of currently available microbial genomes. This is being addressed by advances in sequencing throughput and computational techniques that allow for the cultivation-independent recovery of genomes from metagenomes. Here, we report the reconstruction of 7,903 bacterial and archaeal genomes from >1,500 public metagenomes. All genomes are estimated to be ≥50% complete and nearly half are ≥90% complete with ≤5% contamination. These genomes increase the phylogenetic diversity of bacterial and archaeal genome trees by >30% and provide the first representatives of 17 bacterial and three archaeal candidate phyla. We also recovered 245 genomes from the Patescibacteria superphylum (also known as the Candidate Phyla Radiation) and find that the relative diversity of this group varies substantially with different protein marker sets. The scale and quality of this data set demonstrate that recovering genomes from metagenomes provides an expedient path forward to exploring microbial dark matter.
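The quality thresholds quoted in the abstract can be read as a simple classification rule. The following is a minimal sketch of that rule, not the authors' pipeline: the class `GenomeEstimate`, the functions `quality_tier` and `summarize`, the tier names, and the example values are all hypothetical and chosen only for illustration. The abstract gives a contamination bound only for the ≥90% tier, so the lower tier here checks completeness alone.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GenomeEstimate:
    """Hypothetical record of quality estimates for one recovered genome."""
    name: str
    completeness: float   # estimated completeness, in percent
    contamination: float  # estimated contamination, in percent


def quality_tier(genome: GenomeEstimate) -> str:
    """Assign an illustrative tier using the thresholds from the abstract."""
    # >=90% complete with <=5% contamination (tier name is an assumption).
    if genome.completeness >= 90.0 and genome.contamination <= 5.0:
        return "near_complete"
    # >=50% complete; the abstract states no contamination bound for this tier.
    if genome.completeness >= 50.0:
        return "at_least_half_complete"
    return "below_threshold"


def summarize(genomes: list[GenomeEstimate]) -> dict[str, int]:
    """Count how many genomes fall into each tier."""
    counts: dict[str, int] = {}
    for genome in genomes:
        tier = quality_tier(genome)
        counts[tier] = counts.get(tier, 0) + 1
    return counts


# Made-up example values, not data from the paper.
EXAMPLES = [
    GenomeEstimate("mag_a", 96.4, 1.2),
    GenomeEstimate("mag_b", 91.0, 7.5),
    GenomeEstimate("mag_c", 62.3, 3.0),
    GenomeEstimate("mag_d", 41.8, 0.5),
]

if __name__ == "__main__":
    for genome in EXAMPLES:
        print(genome.name, quality_tier(genome))
    print(summarize(EXAMPLES))
```

Note that `mag_b` exceeds 90% completeness but falls into the lower tier because its contamination is above 5%, which is why both conditions are checked together.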

Citations

Journal Article

FastTree: Computing Large Minimum-Evolution Trees with Profiles instead of a Distance Matrix

Morgan N. Price, +2 more

- 18 Jun 2009 -

Lawrence Berkeley National Laboratory

TL;DR: FastTree, as mentioned in this paper, uses sequence profiles of internal nodes in the tree to implement neighbor-joining and heuristics to quickly identify candidate joins, then applies nearest-neighbor interchanges to reduce the length of the tree.

Journal Article

High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries.

Chirag Jain, +4 more

- 30 Nov 2018 -

Nature Communications

TL;DR: FastANI, a method to compute ANI using alignment-free approximate sequence mapping, is developed, and an analysis of about 90,000 prokaryotic genomes shows that 95% ANI is an accurate threshold for demarcating prokaryotic species.

Journal Article

A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life

Donovan H. Parks, +6 more

- 27 Aug 2018 -

Nature Biotechnology

TL;DR: This work used a concatenated protein phylogeny as the basis for a bacterial taxonomy that conservatively removes polyphyletic groups and normalizes taxonomic ranks on the basis of relative evolutionary divergence.

Journal Article

GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database

Pierre-Alain Chaumeil, +3 more

- 15 Nov 2019 -

Bioinformatics

TL;DR: The accuracy of the GTDB-Tk taxonomic assignments is demonstrated by evaluating its performance on a phylogenetically diverse set of 10,156 bacterial and archaeal metagenome-assembled genomes.

Journal Article

MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies.

Dongwan D. Kang, +8 more

- 26 Jul 2019 -

PeerJ

TL;DR: Comparing MetaBAT 2 to alternative software tools on over 100 real-world metagenome assemblies shows superior accuracy and computing speed, and the authors recommend that the community adopt MetaBAT 2 for their metagenome binning experiments.

References

Journal Article

Fast and accurate short read alignment with Burrows–Wheeler transform

Heng Li, +1 more

- 01 Jul 2009 -

Bioinformatics

TL;DR: The Burrows–Wheeler Alignment tool (BWA), a new read alignment package based on backward search with the Burrows–Wheeler Transform (BWT), is implemented to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.

Journal Article

Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities

Patrick D. Schloss, +16 more

- 01 Dec 2009 -

Applied and Environmental Microbiology

TL;DR: mothur is used as a case study to trim, screen, and align sequences; calculate distances; assign sequences to operational taxonomic units; and describe the α and β diversity of eight marine samples previously characterized by pyrosequencing of 16S rRNA gene fragments.

Journal Article

BLAST+: architecture and applications.

Christiam Camacho, +6 more

- 15 Dec 2009 -

BMC Bioinformatics

TL;DR: The new BLAST command-line applications, compared to the current BLAST tools, demonstrate substantial speed improvements for long queries as well as chromosome length database sequences.

Journal Article

tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Todd M. Lowe, +1 more

- 01 Mar 1997 -

Nucleic Acids Research

TL;DR: A program is described, tRNAscan-SE, which identifies 99-100% of transfer RNA genes in DNA sequence while giving less than one false positive per 15 gigabases.

Journal Article

Database resources of the National Center for Biotechnology Information

David L. Wheeler, +12 more

- 01 Jan 2004 -

Nucleic Acids Research

TL;DR: In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI’s website.
