A semi-automated workflow for biodiversity data retrieval, cleaning, and quality control
Güntsch, A.; Mathew, C.; Obst, M.; Vicario, S.; Williams, A.; de Jong, Y.; Goble, C.; Haines, R. (2014). A semi-automated workflow for biodiversity data retrieval, cleaning, and quality control. Biodiversity Data Journal 2: e4221. https://dx.doi.org/10.3897/bdj.2.e4221
In: Biodiversity Data Journal. Pensoft Publishers: Sofia. ISSN 1314-2836; e-ISSN 1314-2828
Author keywords
biodiversity informatics, web services, workflows, service oriented architecture, data cleaning, e-Science
Authors
- Güntsch, A.
- Mathew, C.
- Obst, M.
- Vicario, S.
- Williams, A.
- de Jong, Y.
- Goble, C.
- Haines, R.
Abstract
Compiling and cleaning the data needed to analyse and predict species distributions is a time-consuming process that requires a solid understanding of the data formats and service APIs provided by biodiversity informatics infrastructures. We designed and implemented a Taverna-based Data Refinement Workflow that integrates taxonomic data retrieval, data cleaning, and data selection into a consistent, standards-based, and effective system, hiding the complexity of the underlying service infrastructures. The workflow can be used freely both locally and through a web portal that requires no additional software installation by users.