sjevelazco / bdc

Check out the vignettes with detailed documentation on each step of the BDC workflow

Home Page:https://brunobrr.github.io/bdc

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

BDC

A comprehensive and straightforward workflow for standardizing, integrating, and cleaning biodiversity data

R-CMD-check

Overview

Handle biodiversity data from several different sources is not an easy task. Here we present the Biodiversity Data Cleaning (BDC) workflow, an automated workflow to address quality issues and improve datas’ fitness-for-use. The workflow harmonizes and integrates data from different sources following common standards and protocols and implements various tests and tools to flag, document, clean, and correct the taxonomic, spatial, and temporal information of biodiversity data.

The workflow is composed of five core steps:

  1. Standardization and integration of different datasets;
  2. Pre-filter: flagging and removal of invalid or non-interpretable information, followed by data amendments (e.g., correct transposed coordinates and standardize country names);
  3. Taxonomy: cleaning, parsing, and standardization of scientific names against multiple taxonomic references. The workflow corrects spelling errors and converts nomenclatural synonyms to currently accepted names;
  4. Space: flagging of erroneous, suspicious, and low-precision geographic coordinates;
  5. Time: flagging and, whenever possible, correction of inconsistent collection date.

Aim to facilitate the documentation, visualization, and interpretation of results of data quality tests, in all steps, several files documenting the workflow results are saved automatically in a folder named “Output”. These files include i) records needing further inspection, ii) databases containing the results of each step, iii) figures, and iv) data-quality reports documenting the results.

Installation

You can install the released version of “BDC” from github with:

if (!require("remotes")) install.packages("remotes")
if (!require("bdc")) remotes::install_github("brunobrr/bdc")

Package website

See BDC package website (https://brunobrr.github.io/bdc/) for detailed explanation on each step of the workflow.

Getting help

If you encounter a clear bug, please file an issue here. For questions or suggestion, please send us a email (ribeiro.brr@gmail.com).

About

Check out the vignettes with detailed documentation on each step of the BDC workflow

https://brunobrr.github.io/bdc

License:GNU General Public License v3.0


Languages

Language:R 99.0%Language:Makefile 1.0%