scispace - formally typeset
Search or ask a question
Institution

Heidelberg Institute for Theoretical Studies

FacilityHeidelberg, Germany
About: Heidelberg Institute for Theoretical Studies is a facility organization based out in Heidelberg, Germany. It is known for research contribution in the topics: Galaxy & Galaxy formation and evolution. The organization has 344 authors who have published 1293 publications receiving 94075 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: This work presents some of the most notable new features and extensions of RAxML, such as a substantial extension of substitution models and supported data types, the introduction of SSE3, AVX and AVX2 vector intrinsics, techniques for reducing the memory requirements of the code and a plethora of operations for conducting post-analyses on sets of trees.
Abstract: Motivation: Phylogenies are increasingly used in all fields of medical and biological research. Moreover, because of the next-generation sequencing revolution, datasets used for conducting phylogenetic analyses grow at an unprecedented pace. RAxML (Randomized Axelerated Maximum Likelihood) is a popular program for phylogenetic analyses of large datasets under maximum likelihood. Since the last RAxML paper in 2006, it has been continuously maintained and extended to accommodate the increasingly growing input datasets and to serve the needs of the user community. Results: I present some of the most notable new features and extensions of RAxML, such as a substantial extension of substitution models and supported data types, the introduction of SSE3, AVX and AVX2 vector intrinsics, techniques for reducing the memory requirements of the code and a plethora of operations for conducting postanalyses on sets of trees. In addition, an up-to-date 50-page user manual covering all new RAxML options is available. Availability and implementation: The code is available under GNU

23,838 citations

Journal ArticleDOI
18 Oct 2016-PeerJ
TL;DR: VSEARCH is here shown to be more accurate than USEARCH when performing searching, clustering, chimera detection and subsampling, while on a par with US EARCH for paired-ends read merging and dereplication.
Abstract: Background: VSEARCH is an open source and free of charge multithreaded 64-bit tool for processing and preparing metagenomics, genomics and population genomics nucleotide sequence data. It is designed as an alternative to the widely used USEARCH tool (Edgar, 2010) for which the source code is not publicly available, algorithm details are only rudimentarily described, and only a memory-confined 32-bit version is freely available for academic use. Methods: When searching nucleotide sequences, VSEARCH uses a fast heuristic based on words shared by the query and target sequences in order to quickly identify similar sequences, a similar strategy is probably used in USEARCH. VSEARCH then performs optimal global sequence alignment of the query against potential target sequences, using full dynamic programming instead of the seed-and-extend heuristic used by USEARCH. Pairwise alignments are computed in parallel using vectorisation and multiple threads. Results: VSEARCH includes most commands for analysing nucleotide sequences available in USEARCH version 7 and several of those available in USEARCH version 8, including searching (exact or based on global alignment), clustering by similarity (using length pre-sorting, abundance pre-sorting or a user-defined order), chimera detection (reference-based or de novo), dereplication (full length or prefix), pairwise alignment, reverse complementation, sorting, and subsampling. VSEARCH also includes commands for FASTQ file processing, i.e., format detection, filtering, read quality statistics, and merging of paired reads. Furthermore, VSEARCH extends functionality with several new commands and improvements, including shuffling, rereplication, masking of low-complexity sequences with the well-known DUST algorithm, a choice among different similarity definitions, and FASTQ file format conversion. VSEARCH is here shown to be more accurate than USEARCH when performing searching, clustering, chimera detection and subsampling, while on a par with USEARCH for paired-ends read merging. VSEARCH is slower than USEARCH when performing clustering and chimera detection, but significantly faster when performing paired-end reads merging and dereplication. VSEARCH is available at https://github.com/torognes/vsearch under either the BSD 2-clause license or the GNU General Public License version 3.0. Discussion: VSEARCH has been shown to be a fast, accurate and full-fledged alternative to USEARCH. A free and open-source versatile tool for sequence analysis is now available to the metagenomics community.

5,850 citations

Journal ArticleDOI
TL;DR: The PEAR software for merging raw Illumina paired-end reads from target fragments of varying length evaluates all possible paired- end read overlaps and does not require the target fragment size as input, and implements a statistical test for minimizing false-positive results.
Abstract: Motivation The Illumina paired-end sequencing technology can generate reads from both ends of target DNA fragments, which can subsequently be merged to increase the overall read length. There already exist tools for merging these paired-end reads when the target fragments are equally long. However, when fragment lengths vary and, in particular, when either the fragment size is shorter than a single-end read, or longer than twice the size of a single-end read, most state-of-the-art mergers fail to generate reliable results. Therefore, a robust tool is needed to merge paired-end reads that exhibit varying overlap lengths because of varying target fragment lengths. Results We present the PEAR software for merging raw Illumina paired-end reads from target fragments of varying length. The program evaluates all possible paired-end read overlaps and does not require the target fragment size as input. It also implements a statistical test for minimizing false-positive results. Tests on simulated and empirical data show that PEAR consistently generates highly accurate merged paired-end reads. A highly optimized implementation allows for merging millions of paired-end reads within a few minutes on a standard desktop computer. On multi-core architectures, the parallel version of PEAR shows linear speedups compared with the sequential version of PEAR. Availability and implementation PEAR is implemented in C and uses POSIX threads. It is freely available at http://www.exelixis-lab.org/web/software/pear.

3,270 citations

Journal ArticleDOI
TL;DR: The Illustris Project as mentioned in this paper is a series of large-scale hydrodynamical simulations of galaxy formation, which includes primordial and metal-line cooling with self-shielding corrections, stellar evolution, stellar feedback, gas recycling, chemical enrichment, supermassive black hole growth, and feedback from active galactic nuclei.
Abstract: We introduce the Illustris Project, a series of large-scale hydrodynamical simulations of galaxy formation. The highest resolution simulation, Illustris-1, covers a volume of (106.5 Mpc)^3, has a dark mass resolution of 6.26 × 10^6 M_⊙, and an initial baryonic matter mass resolution of 1.26 × 10^6 M_⊙. At z = 0 gravitational forces are softened on scales of 710 pc, and the smallest hydrodynamical gas cells have an extent of 48 pc. We follow the dynamical evolution of 2 × 1820^3 resolution elements and in addition passively evolve 1820^3 Monte Carlo tracer particles reaching a total particle count of more than 18 billion. The galaxy formation model includes: primordial and metal-line cooling with self-shielding corrections, stellar evolution, stellar feedback, gas recycling, chemical enrichment, supermassive black hole growth, and feedback from active galactic nuclei. Here we describe the simulation suite, and contrast basic predictions of our model for the present-day galaxy population with observations of the local universe. At z = 0 our simulation volume contains about 40 000 well-resolved galaxies covering a diverse range of morphologies and colours including early-type, late-type and irregular galaxies. The simulation reproduces reasonably well the cosmic star formation rate density, the galaxy luminosity function, and baryon conversion efficiency at z = 0. It also qualitatively captures the impact of galaxy environment on the red fractions of galaxies. The internal velocity structure of selected well-resolved disc galaxies obeys the stellar and baryonic Tully–Fisher relation together with flat circular velocity curves. In the well-resolved regime, the simulation reproduces the observed mix of early-type and late-type galaxies. Our model predicts a halo mass dependent impact of baryonic effects on the halo mass function and the masses of haloes caused by feedback from supernova and active galactic nuclei.

2,012 citations

Journal ArticleDOI
Bernhard Misof, Shanlin Liu, Karen Meusemann1, Ralph S. Peters, Alexander Donath, Christoph Mayer, Paul B. Frandsen2, Jessica L. Ware2, Tomas Flouri3, Rolf G. Beutel4, Oliver Niehuis, Malte Petersen, Fernando Izquierdo-Carrasco3, Torsten Wappler5, Jes Rust5, Andre J. Aberer3, Ulrike Aspöck6, Ulrike Aspöck7, Horst Aspöck6, Daniela Bartel6, Alexander Blanke8, Simon Berger3, Alexander Böhm6, Thomas R. Buckley9, Brett Calcott10, Junqing Chen, Frank Friedrich11, Makiko Fukui12, Mari Fujita8, Carola Greve, Peter Grobe, Shengchang Gu, Ying Huang, Lars S. Jermiin1, Akito Y. Kawahara13, Lars Krogmann14, Martin Kubiak11, Robert Lanfear15, Robert Lanfear16, Robert Lanfear17, Harald Letsch6, Yiyuan Li, Zhenyu Li, Jiguang Li, Haorong Lu, Ryuichiro Machida8, Yuta Mashimo8, Pashalia Kapli18, Pashalia Kapli3, Duane D. McKenna19, Guanliang Meng, Yasutaka Nakagaki8, José Luis Navarrete-Heredia20, Michael Ott21, Yanxiang Ou, Günther Pass6, Lars Podsiadlowski5, Hans Pohl4, Björn M. von Reumont22, Kai Schütte11, Kaoru Sekiya8, Shota Shimizu8, Adam Slipinski1, Alexandros Stamatakis3, Alexandros Stamatakis23, Wenhui Song, Xu Su, Nikolaus U. Szucsich6, Meihua Tan, Xuemei Tan, Min Tang, Jingbo Tang, Gerald Timelthaler6, Shigekazu Tomizuka8, Michelle D. Trautwein24, Xiaoli Tong25, Toshiki Uchifune8, Manfred Walzl6, Brian M. Wiegmann26, Jeanne Wilbrandt, Benjamin Wipfler4, Thomas K. F. Wong1, Qiong Wu, Gengxiong Wu, Yinlong Xie, Shenzhou Yang, Qing Yang, David K. Yeates1, Kazunori Yoshizawa27, Qing Zhang, Rui Zhang, Wenwei Zhang, Yunhui Zhang, Jing Zhao, Chengran Zhou, Lili Zhou, Tanja Ziesmann, Shijie Zou, Yingrui Li, Xun Xu, Yong Zhang, Huanming Yang, Jian Wang, Jun Wang, Karl M. Kjer2, Xin Zhou 
07 Nov 2014-Science
TL;DR: The phylogeny of all major insect lineages reveals how and when insects diversified and provides a comprehensive reliable scaffold for future comparative analyses of evolutionary innovations among insects.
Abstract: Insects are the most speciose group of animals, but the phylogenetic relationships of many major lineages remain unresolved. We inferred the phylogeny of insects from 1478 protein-coding genes. Phylogenomic analyses of nucleotide and amino acid sequences, with site-specific nucleotide or domain-specific amino acid substitution models, produced statistically robust and congruent results resolving previously controversial phylogenetic relations hips. We dated the origin of insects to the Early Ordovician [~479 million years ago (Ma)], of insect flight to the Early Devonian (~406 Ma), of major extant lineages to the Mississippian (~345 Ma), and the major diversification of holometabolous insects to the Early Cretaceous. Our phylogenomic study provides a comprehensive reliable scaffold for future comparative analyses of evolutionary innovations among insects.

1,998 citations


Authors

Showing all 351 results

NameH-indexPapersCitations
Volker Springel165746123399
Douglas B. Kell11163450335
Alex Bateman85205104180
Rebecca C. Wade6829914258
Alexandros Stamatakis6225367832
Federico Marinacci5821712420
Christopher C. Hayward5822010211
J. Michael Cherry5712526442
Marcus A. Koch5720712133
Friedrich K. Röpke5516310602
Tilmann Gneiting5313817248
Rüdiger Pakmor5214910324
Stephen A. Smith5114816747
Markus Kromer5011810238
Nicolas Salamin501809178
Network Information
Related Institutions (5)
Max Planck Society
406.2K papers, 19.5M citations

87% related

Massachusetts Institute of Technology
268K papers, 18.2M citations

86% related

University of California, Berkeley
265.6K papers, 16.8M citations

85% related

Princeton University
146.7K papers, 9.1M citations

85% related

Centre national de la recherche scientifique
382.4K papers, 13.6M citations

84% related

Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
20235
202210
2021103
2020105
2019136
2018142