When using data.type ='binary' in BIOMOD_FormatingData,
biomod2 requires either presence / absence
data, or presence-only data supplemented with
pseudo-absences that can be generated within the same
function.
The general idea behind is to select points in the studied area that will be used to compare observed environment (represented by the presences) against what is available. Those points are NOT to be considered as absences, and rather represent the available environment. From a semantic point of view, several terms can be encountered in the literature for the same purpose : background data when it comes to MaxEnt mostly, pseudo-absences, or quadrature points when applying point-process model (PPM).
Note that it is NOT recommended to mix both absence and pseudo-absences data.
3 different methods are implemented within biomod2 to
select pseudo-absences (PA) through the BIOMOD_FormatingData
function :
The selection of one or the other method will depend on a more
important and underlying question :
how were obtained the
dataset presence points ?
The 3 methods proposed within biomod2 do not depend on
the same assumptions :
| random | disk | SRE | |
|---|---|---|---|
| Geographical assumption | no | yes | no |
| Environmental assumption | no | no | yes |
| Realized niche fully sampled | no | yes | yes |
The random method is the one with the least assumptions, and should be the default choice when no sufficient information is available about the species ecology and/or the sampling design. The disk and SRE methods assume that the realized niche of the species has been fully sampled, either geographically or environmentally speaking.
Note that it is also possible for the user to select by himself its own pseudo-absence points.
Barbet-Massin, M., Jiguet, F., Albert, C.H. and Thuiller, W. (2012), Selecting pseudo-absences for species distribution models: how, where and how many?. Methods in Ecology and Evolution, 3: 327-338. https://doi.org/10.1111/j.2041-210X.2011.00172.x
This paper tried to estimate the relative effect of method and number of PA on predictive accuracy of common modelling techniques, using :
Results were varying between modelling techniques :
advice from biomod2’s team: