This repository contains the framework and R code in development for risk modelling and mapping of alien species throughout Belgium and greater Europe at 1 km2 resolution as part of the TrIAS project.
├── README.md : Description of this repository
├── LICENSE : Repository license
├── risk-modelling-and-mapping.Rproj : RStudio project file
├── .gitignore : Files and directories to be ignored by git
│
├── data
│ ├── external : external files (e.g. climate rasters, occurrence data, GIS files) required to run the model. Download via Zenodo (see below).
│ └── processed : Risk maps and associated files needed for reproducibility GENERATED
└── src : R Code
Although theoretically possible, this workflow is not applied to all species listed in the published unified checklist and whose occurrences are found. We limit our analysis to a list of species labelled as emerging. The emerging status is object of another work package and it is a semi-automated process described in repository indicators: see webpage.
- Species scientific name.
- Climate and habitat raster data files downloaded from Zenodo (links will be provided when data is available)
- R studio installed in your computer.
- After cloning this repository, add folders to the existing folder structure shown above as shown below. This will allow you to use the relative path structure in the trias_sdm.R file.
├── data
├── external
├── bias_grids (Global taxonomic occurrence grids, downloaded from Zenodo here )
├── climate (put climate rasters downloaded from Zenodo here)
├── GIS (GIS data downloaded from Zenodo)
├── habitat (put habitat rasters downloaded from Zenodo here)
The automated workflow can be divided in three sections:
- Develop global scale climate-only species distribution models (SDMs)
- Generate European level SDMs
- Forecast species distributions under climate change scenarios
- Automatically generates IAS risk maps using machine learning. Our workflow requires only a species name and generates an ensemble of machine learning algorithms stacked together as a meta-model to produce the final risk map at 1 km2 resolution. Risk maps are generated automatically for standard IPCC greenhouse gas emission scenarios (RCP).
- Automatically generates confidence maps for each IAS risk map. These illustrate confidence of each individual prediction across your study extent.
- Addresses geographic sampling bias
- Incorporates best practices for the placement of pseudo-absences: pseudo absences are placed in the same ecoregions where presences occur. We use the global model to restrict pseudo absences to areas of low predicted suitability. We use the taxonomic occurrence grid (aka bias grid) to not place pseudoabsences in areas of low sampling effort. The taxonomic occurrence grid summarize the sampling effort of the higher taxon ,the modelled species belongs to.
- Flag highly correlated predictors. Highly correlated predictors can have undesirable effects and confuse the interpretation of variable importance
- Integrates multiple machine learning algorithms to predict risk. It has been consistently demonstrated that the choice of algorithm has the largest impact on predicted risk and area of predicted risk.
- Assesses spatial autocorrelation in the residuals to assess the impacts of clustering. If high, thinning can be employed.
Inputs required run the workflow:
- Species name and the GBIF taxon Key (which can be retrieved using the workflow). A list of species can be used to retrieve global occurrences for each species using the global_download.Rmd
- Predictor data (download using the links below )
Download CHELSA data: https://envicloud.wsl.ch/#/?prefix=chelsa%2Fchelsa_V1%2Fclimatologies
Download TrIAS EU Climate data from Zenodo: https://doi.org/10.5281/zenodo.3694065
Download habitat predictors from Zenodo: https://doi.org/10.5281/zenodo.7841324
Download taxonomic occurrence grids from Zenodo: https://doi.org/10.5281/zenodo.7556851