Ghent University Academic Bibliography

Fishing for data and sorting the catch: assessing the data quality, completeness and fitness for use of data in marine biogeographic databases

Leen Vandepitte, Samuel Bosch, Lennert Tyberghein, Filip Waumans, Bart Vanhoorne, Francisco Hernandez, Olivier De Clerck (UGent) and Jan Mees (UGent) (2015) DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION.
abstract
Being able to assess the quality and level of completeness of data has become indispensable in marine biodiversity research, especially when dealing with large databases that typically compile data from a variety of sources. Very few integrated databases offer quality flags at the level of the individual record, making it hard for users to extract the data that are fit for their specific purposes. This article describes the steps that were developed to analyse the quality and completeness of the distribution records within the European and international Ocean Biogeographic Information Systems (EurOBIS and OBIS). Records are checked on data format, completeness and validity of information, quality and detail of the taxonomy and geographic indications used, and whether or not the record is a putative outlier. The corresponding quality control (QC) flags will not only help users with their data selection; they will also help the data management team and the data custodians to identify possible gaps and errors in the submitted data, providing scope to improve data quality. The results of these quality control procedures are now available in both the EurOBIS and OBIS databases. Through the Biology portal of the European Marine Observation and Data Network (EMODnet Biology), a subset of EurOBIS records, those passing a specific combination of these QC steps, is offered to users. In the future, EMODnet Biology will offer a wide range of filter options through its portal, allowing users to make specific selections themselves. Through LifeWatch, users can already upload their own data and check them against a selection of the quality control procedures described here.
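The idea of record-level QC flags described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the flag names, the `Record` fields and the `passes` helper are hypothetical and are not the actual EurOBIS/OBIS QC steps or data model. The taxonomic match and the outlier status are taken as precomputed inputs rather than derived here.

```python
from dataclasses import dataclass
from enum import IntFlag
from typing import Optional


class QCFlag(IntFlag):
    """Hypothetical per-record flags (illustrative names, not the EurOBIS/OBIS scheme)."""
    HAS_REQUIRED_FIELDS = 1
    VALID_COORDINATES = 2
    TAXON_MATCHED = 4
    NOT_OUTLIER = 8


@dataclass
class Record:
    scientific_name: Optional[str]
    latitude: Optional[float]
    longitude: Optional[float]
    taxon_matched: bool = False  # assumed result of an external taxonomic lookup
    is_outlier: bool = False     # assumed result of an external outlier analysis


def qc_flags(rec: Record) -> QCFlag:
    """Return the set of QC checks that this record passes."""
    flags = QCFlag(0)
    has_coords = rec.latitude is not None and rec.longitude is not None
    # Completeness: a name and a position must both be present.
    if rec.scientific_name and has_coords:
        flags |= QCFlag.HAS_REQUIRED_FIELDS
    # Validity: coordinates must fall inside the geographic range.
    if has_coords and -90 <= rec.latitude <= 90 and -180 <= rec.longitude <= 180:
        flags |= QCFlag.VALID_COORDINATES
    if rec.taxon_matched:
        flags |= QCFlag.TAXON_MATCHED
    if not rec.is_outlier:
        flags |= QCFlag.NOT_OUTLIER
    return flags


def passes(rec: Record, required: QCFlag) -> bool:
    """True if the record passes every check in the required combination."""
    return (qc_flags(rec) & required) == required
```

A user could then select only the records that pass a chosen combination of checks, for example `passes(rec, QCFlag.HAS_REQUIRED_FIELDS | QCFlag.VALID_COORDINATES)`, which mirrors the idea of offering a subset of records that pass a specific combination of QC steps.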
Please use this url to cite or link to this publication: http://hdl.handle.net/1854/LU-5917175
type
journalArticle (original)
publication status
published
keyword
BIODIVERSITY, SPECIES-DIVERSITY, CHALLENGES, FACILITY, OUTLIERS, STANDARD
journal title
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION
Database
article number
bau125
pages
14 pages
Web of Science type
Article
Web of Science id
000348650000001
JCR category
MATHEMATICAL & COMPUTATIONAL BIOLOGY
JCR impact factor
2.627 (2015)
JCR rank
8/56 (2015)
JCR quartile
1 (2015)
ISSN
1758-0463
DOI
10.1093/database/bau125
language
English
UGent publication?
yes
classification
A1
copyright statement
I have retained and own the full copyright for this publication
id
5917175
handle
http://hdl.handle.net/1854/LU-5917175
date created
2015-03-30 14:01:02
date last changed
2016-12-21 15:41:16
@article{5917175,
  abstract     = {Being able to assess the quality and level of completeness of data has become indispensable in marine biodiversity research, especially when dealing with large databases that typically compile data from a variety of sources. Very few integrated databases offer quality flags at the level of the individual record, making it hard for users to extract the data that are fit for their specific purposes. This article describes the steps that were developed to analyse the quality and completeness of the distribution records within the European and international Ocean Biogeographic Information Systems (EurOBIS and OBIS). Records are checked on data format, completeness and validity of information, quality and detail of the taxonomy and geographic indications used, and whether or not the record is a putative outlier. The corresponding quality control (QC) flags will not only help users with their data selection; they will also help the data management team and the data custodians to identify possible gaps and errors in the submitted data, providing scope to improve data quality. The results of these quality control procedures are now available in both the EurOBIS and OBIS databases. Through the Biology portal of the European Marine Observation and Data Network (EMODnet Biology), a subset of EurOBIS records, those passing a specific combination of these QC steps, is offered to users. In the future, EMODnet Biology will offer a wide range of filter options through its portal, allowing users to make specific selections themselves. Through LifeWatch, users can already upload their own data and check them against a selection of the quality control procedures described here.},
  articleno    = {bau125},
  author       = {Vandepitte, Leen and Bosch, Samuel and Tyberghein, Lennert and Waumans, Filip and Vanhoorne, Bart and Hernandez, Francisco and De Clerck, Olivier and Mees, Jan},
  issn         = {1758-0463},
  journal      = {DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION},
  keyword      = {BIODIVERSITY,SPECIES-DIVERSITY,CHALLENGES,FACILITY,OUTLIERS,STANDARD},
  language     = {eng},
  numpages     = {14},
  title        = {Fishing for data and sorting the catch: assessing the data quality, completeness and fitness for use of data in marine biogeographic databases},
  url          = {http://dx.doi.org/10.1093/database/bau125},
  year         = {2015},
}

Chicago
Vandepitte, Leen, Samuel Bosch, Lennert Tyberghein, Filip Waumans, Bart Vanhoorne, Francisco Hernandez, Olivier De Clerck, and Jan Mees. 2015. “Fishing for Data and Sorting the Catch: Assessing the Data Quality, Completeness and Fitness for Use of Data in Marine Biogeographic Databases.” Database-the Journal of Biological Databases and Curation.
APA
Vandepitte, L., Bosch, S., Tyberghein, L., Waumans, F., Vanhoorne, B., Hernandez, F., De Clerck, O., & Mees, J. (2015). Fishing for data and sorting the catch: assessing the data quality, completeness and fitness for use of data in marine biogeographic databases. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION.
Vancouver
1. Vandepitte L, Bosch S, Tyberghein L, Waumans F, Vanhoorne B, Hernandez F, et al. Fishing for data and sorting the catch: assessing the data quality, completeness and fitness for use of data in marine biogeographic databases. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION. 2015;
MLA
Vandepitte, Leen, Samuel Bosch, Lennert Tyberghein, et al. “Fishing for Data and Sorting the Catch: Assessing the Data Quality, Completeness and Fitness for Use of Data in Marine Biogeographic Databases.” DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION (2015): n. pag. Print.