Novel methods improve prediction of species' distributions from occurrence data

doi:10.1111/J.2006.0906-7590.04596.X

Home
/
Papers
/
Novel methods improve prediction of species' distributions from occurrence data

Open AccessJournal ArticleDOI

Novel methods improve prediction of species' distributions from occurrence data

Jane Elith,Catherine H. Graham,Robert P. Anderson,Miroslav Dudík,Simon Ferrier,Antoine Guisan,Robert J. Hijmans,Falk Huettmann,John R. Leathwick,Anthony Lehmann,Jin Li,Lúcia G. Lohmann,Bette A. Loiselle,Glenn Manion,Craig Moritz,Miguel Nakamura,Yoshinori Nakazawa,Jacob C. M. Mc Overton,A. Townsend Peterson,Steven J. Phillips,Karen Richardson,Ricardo Scachetti-Pereira,Robert E. Schapire,Jorge Soberón,Stephen E. Williams,Mary S. Wisz,Niklaus E. Zimmermann +26 moreUniversity of Melbourne,Stony Brook University,City University of New York,Princeton University,University of Lausanne,University of California, Berkeley,University of Alaska Fairbanks,National Institute of Water and Atmospheric Research,Commonwealth Scientific and Industrial Research Organisation,University of São Paulo,University of Missouri,Consejo Nacional de Ciencia y Tecnología,University of Kansas,Landcare Research,AT&T,McGill University,James Cook University,Swiss Federal Institute for Forest, Snow and Landscape Research

- 01 Apr 2006 -

Ecography

- Vol. 29, Iss: 2, pp 129-151

Show Less

TLDR

This work compared 16 modelling methods over 226 species from 6 regions of the world, creating the most comprehensive set of model comparisons to date and found that presence-only data were effective for modelling species' distributions for many species and regions.

Abstract:

Prediction of species' distributions is central to diverse applications in ecology, evolution and conservation science. There is increasing electronic access to vast sets of occurrence records in museums and herbaria, yet little effective guidance on how best to use this information in the context of numerous approaches for modelling distributions. To meet this need, we compared 16 modelling methods over 226 species from 6 regions of the world, creating the most comprehensive set of model comparisons to date. We used presence-only data to fit models, and independent presence-absence data to evaluate the predictions. Along with well-established modelling methods such as generalised additive models and GARP and BIOCLIM, we explored methods that either have been developed recently or have rarely been applied to modelling species' distributions. These include machine-learning methods and community models, both of which have features that may make them particularly well suited to noisy or sparse information, as is typical of species' occurrence data. Presence-only data were effective for modelling species' distributions for many species and regions. The novel methods consistently outperformed more established methods. The results of our analysis are promising for the use of data from museums and herbaria, especially as methods suited to the noise inherent in such data improve.

Content maybe subject to copyright Report

Citations

PDF

Open Access

More filters

Journal ArticleDOI

The global distribution and burden of dengue

Samir Bhatt,Peter W. Gething,Oliver J. Brady,Jane P. Messina,Andrew Farlow,Catherine L. Moyes,John M. Drake,John M. Drake,John S. Brownstein,Anne G. Hoen,Osman Sankoh,Osman Sankoh,Monica F. Myers,Dylan B. George,Thomas Jaenisch,G. R. William Wint,Cameron P. Simmons,Thomas W. Scott,Thomas W. Scott,Jeremy Farrar,Jeremy Farrar,Simon I. Hay,Simon I. Hay +22 moreUniversity of Oxford,University of Georgia,Boston Children's Hospital,Dartmouth College,University of the Witwatersrand,Heidelberg University,National Institutes of Health,University of California, Davis,National University of Singapore

- 25 Apr 2013 -

Nature

Show Less

TL;DR: These new risk maps and infection estimates provide novel insights into the global, regional and national public health burden imposed by dengue and will help to guide improvements in disease control strategies using vaccine, drug and vector control methods, and in their economic evaluation.

...read moreread less

Journal ArticleDOI

Collinearity: a review of methods to deal with it and a simulation study evaluating their performance

Carsten F. Dormann,Jane Elith,Sven Bacher,Carsten M. Buchmann,Gudrun Carl,Gabriel Carré,Jaime Ricardo García Márquez,Bernd Gruber,Bruno Lafourcade,Pedro J. Leitão,Tamara Münkemüller,Colin J. McClean,Patrick E. Osborne,Björn Reineking,Boris Schröder,Andrew K. Skidmore,Damaris Zurell,Sven Lautenbach +17 moreHelmholtz Centre for Environmental Research - UFZ

- 01 Jan 2013 -

Ecography

Show Less

TL;DR: It was found that methods specifically designed for collinearity, such as latent variable methods and tree based models, did not outperform the traditional GLM and threshold-based pre-selection and the value of GLM in combination with penalised methods and thresholds when omitted variables are considered in the final interpretation.

...read moreread less

Journal ArticleDOI

Modeling of species distributions with Maxent: new extensions and a comprehensive evaluation

Steven J. Phillips,Miroslav Dudík +1 moreAT&T

- 01 Apr 2008 -

Ecography

Show Less

TL;DR: This paper presents a tuning method that uses presence-only data for parameter tuning, and introduces several concepts that improve the predictive accuracy and running time of Maxent and describes a new logistic output format that gives an estimate of probability of presence.

...read moreread less

Journal ArticleDOI

Species Distribution Models: Ecological Explanation and Prediction Across Space and Time

Jane Elith,John R. Leathwick +1 moreUniversity of Melbourne,National Institute of Water and Atmospheric Research

- 06 Feb 2009 -

Annual Review of Ecology, Evolution, and...

Show Less

TL;DR: Species distribution models (SDMs) as mentioned in this paper are numerical tools that combine observations of species occurrence or abundance with environmental estimates, and are used to gain ecological and evolutionary insights and to predict distributions across landscapes, sometimes requiring extrapolation in space and time.

...read moreread less

Journal ArticleDOI

A working guide to boosted regression trees

Jane Elith,John R. Leathwick,Trevor Hastie +2 moreUniversity of Melbourne,National Institute of Water and Atmospheric Research,Stanford University

- 01 Jul 2008 -

Journal of Animal Ecology

Show Less

TL;DR: This study provides a working guide to boosted regression trees (BRT), an ensemble method for fitting statistical models that differs fundamentally from conventional techniques that aim to fit a single parsimonious model.

...read moreread less

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Regression Shrinkage and Selection via the Lasso

Robert Tibshirani

- 01 Jan 1996 -

Journal of the royal statistical society...

Show Less

TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.

...read moreread less

Book

Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach

Kenneth P. Burnham,David E. Anderson +1 more

Show Less

TL;DR: The second edition of this book is unique in that it focuses on methods for making formal statistical inference from all the models in an a priori set (Multi-Model Inference).

...read moreread less

Journal ArticleDOI

A Coefficient of agreement for nominal Scales

Jacob CohenYork University

- 01 Apr 1960 -

Educational and Psychological Measuremen...

Show Less

TL;DR: In this article, the authors present a procedure for having two or more judges independently categorize a sample of units and determine the degree, significance, and significance of the units. But they do not discuss the extent to which these judgments are reproducible, i.e., reliable.

...read moreread less

Journal ArticleDOI

The meaning and use of the area under a receiver operating characteristic (ROC) curve.

James A. Hanley,Barbara J. McNeil +1 more

- 01 Apr 1982 -

Radiology

Show Less

TL;DR: A representation and interpretation of the area under a receiver operating characteristic (ROC) curve obtained by the "rating" method, or by mathematical predictions based on patient characteristics, is presented and it is shown that in such a setting the area represents the probability that a randomly chosen diseased subject is (correctly) rated or ranked with greater suspicion than a random chosen non-diseased subject.

...read moreread less

Book

The Elements of Statistical Learning: Data Mining, Inference, and Prediction

Trevor Hastie,Robert Tibshirani,Jerome H. Friedman +2 moreUniversity of New South Wales

Show Less

TL;DR: In this paper, the authors describe the important ideas in these areas in a common conceptual framework, and the emphasis is on concepts rather than mathematics, with a liberal use of color graphics.

...read moreread less