This repository contains the functionality to standardize the data of the List of Invasive Alien Species of Union concern to a Darwin Core Archive that can be harvested by GBIF.
source data → Darwin Core mapping script → generated Darwin Core files
The source data are manually created from:
- 2022 consolidated version pdf: scientific names with authors, synonym names (in parenthesis) with authors
- spreadsheets containing common names: first version (January 2018) and later additions (April 2020 and January 2022)
- Commission Implementing Regulation (EU) 2025/1422 updating the list of invasive alien species of Union concern
The repository structure is based on Cookiecutter Data Science and the Checklist recipe. Files and directories indicated with GENERATED should not be edited manually.
├── README.md : Description of this repository
├── LICENSE : Repository license
├── union-list.Rproj : RStudio project file
├── .gitignore : Files and directories to be ignored by git
│
├── data
│ ├── raw : Source data, input for mapping script
│ └── processed : Darwin Core output of mapping script GENERATED
│
└── src
└── dwc_mapping.Rmd : Darwin Core mapping script, core functionality of this repository
- Clone this repository to your computer
- Open the RStudio project file
- Open the
dwc_mapping.RmdR Markdown file in RStudio - Install any required packages
- Click
Run > Run Allto generate the processed data
MIT License for the code and documentation in this repository. The included data is released under another license.