Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads.
TLDR
Tests on both synthetic and real reads show Unicycler can assemble larger contigs with fewer misassemblies than other hybrid assemblers, even when long-read depth and accuracy are low.
Abstract:
The Illumina DNA sequencing platform generates accurate but short reads, which can be used to produce accurate but fragmented genome assemblies. Pacific Biosciences and Oxford Nanopore Technologies DNA sequencing platforms generate long reads that can produce complete genome assemblies, but the sequencing is more expensive and error-prone. There is significant interest in combining data from these complementary sequencing technologies to generate more accurate "hybrid" assemblies. However, few tools exist that truly leverage the benefits of both types of data, namely the accuracy of short reads and the structural resolving power of long reads. Here we present Unicycler, a new tool for assembling bacterial genomes from a combination of short and long reads, which produces assemblies that are accurate, complete and cost-effective. Unicycler builds an initial assembly graph from short reads using the de novo assembler SPAdes and then simplifies the graph using information from short and long reads. Unicycler uses a novel semi-global aligner to align long reads to the assembly graph. Tests on both synthetic and real reads show Unicycler can assemble larger contigs with fewer misassemblies than other hybrid assemblers, even when long-read depth and accuracy are low. Unicycler is open source (GPLv3) and available at github.com/rrwick/Unicycler.
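To make the term "semi-global" concrete, here is a minimal textbook sketch in Python. It is not Unicycler's aligner: it aligns a read against a single linear sequence instead of an assembly graph, and it implements just one common variant of semi-global alignment, in which the whole read must align but may start and end anywhere in the reference at no cost. The function name `semi_global_score` and the scoring values are assumptions chosen for illustration.

```python
def semi_global_score(read, ref, match=1, mismatch=-1, gap=-1):
    """Best score for aligning all of `read` to any substring of `ref`.

    Illustrative sketch only (hypothetical name and scores); not the
    aligner described in the abstract.
    Returns (score, end), where `end` is the ref position just past the
    last aligned ref base.
    """
    # Row 0 is all zeros: skipping a prefix of ref before the read starts is free.
    prev = [0] * (len(ref) + 1)
    for i in range(1, len(read) + 1):
        # Column 0: read bases placed before any ref base each cost a gap.
        curr = [prev[0] + gap]
        for j in range(1, len(ref) + 1):
            same = read[i - 1] == ref[j - 1]
            diag = prev[j - 1] + (match if same else mismatch)
            up = prev[j] + gap        # read base aligned to a gap
            left = curr[j - 1] + gap  # ref base aligned to a gap
            curr.append(max(diag, up, left))
        prev = curr
    # Skipping a suffix of ref is also free: take the best cell in the last row.
    best_end = max(range(len(prev)), key=prev.__getitem__)
    return prev[best_end], best_end


if __name__ == "__main__":
    print(semi_global_score("ACGT", "TTACGTTT"))
```

Unlike a local alignment, the read is never clipped in this variant: every read base contributes to the score, which is why a low-accuracy read still yields an end-to-end placement.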