2.1 Relevance of research data management: FAIR and CARE
Collecting biodiversity data takes a lot of energy and resources. This makes biodiversity data not only valuable for the scientist who collected it, but also to support further research and to inform policy on conservation, natural resources, land use, agriculture and more [1][2]. It is important to create a maximum output of this valuable data to protect nature, wildlife and endangered species, to prevent overfishing and extinctions as a way to counteract the climate crisis and biodiversity crisis [3].
Thus, biodiversity data need to be carefully handled, preserved, and shared, by making them Findable, Accessible, Interoperable and Reusable (FAIR) [4], and considering CARE-Principles (Collective Benefit, Authority to Control, Responsibility, and Ethics) [5].
Figure 1: Global extinction risk in different species groups. Assessments of extinction risks are only possible for taxa with sufficient data. Here, there is a difference between * comprehensive, ** sampled and *** selected coverage of taxa. Large and important taxa like e.g. insects are almost completely missing in this analysis, due to insufficient data. Source: IPBES 2019, figure SPM3, p. 26, Copyright © IPBES 2019
To put good research practice [6] and the FAIR principles into practice effective research data management (RDM) is needed. This requires a comprehensive approach that encompasses the planning, collection, storage, analysis, and collaborative sharing of diverse biodiversity data through a well-structured and coordinated research data management strategy [7]. For research in the biodiversity field, research data management is even more important as non-repeatability and the need to rely on old data, e.g. for trend analysis, is of utmost relevance.
This video (published by GFBio, 2020) explains the challenges and solutions of dealing with heterogeneous data.
[1] Biodiversity Data Journal. (2023). Biodiversity Data Journal. Available at: https://bdj.pensoft.net/. Last accessed 24 November 2023.
[2] Haase, P., Tonkin, J.D., Stoll, S., Burkhard, B., Frenzel, M., Geijzendorffer, I.R., et al. (2018). The next generation of site-based long-term ecological monitoring: Linking essential biodiversity variables and ecosystem integrity. Science of The Total Environment, 613–614, 1376–1384. https://doi.org/10.1016/j.scitotenv.2017.08.111
[3] Brondízio, E.S., Settele, J., Díaz, S. & Ngo, H.T. (Eds.). (2019). The global assessment report of the intergovernmental science-policy platform on biodiversity and ecosystem services. Intergovernmental Science-Policy Platform on Biodiversity and Ecosystem Services (IPBES), Bonn. https://doi.org/10.5281/zenodo.3831673
[4] Wilkinson, M.D., Dumontier, M., Aalbersberg, Ij.J., Appleton, G., Axton, M., Baak, A., et al. (2016). The FAIR Guiding Principles for scientific data management and stewardship. Sci Data, 3, 160018. https://doi.org/10.1038/sdata.2016.18
[5] Carroll, S.R., Garba, I., Figueroa-Rodríguez, O.L., Holbrook, J., Lovett, R., Materechera, S., et al. (2020). The CARE Principles for Indigenous Data Governance. Data Science Journal, 19, 43. https://doi.org/10.5334/dsj-2020-043
[6] DFG. (2022). Guidelines for Safeguarding Good Research Practice. Code of Conduct. https://doi.org/10.5281/zenodo.6472827
[7] Members of the Working Group on Data Management of the DFG Senate Commission on Biodiversity Research. (n.d.). Guidelines on the Handling of Research Data in Biodiversity Research. https://www.dfg.de/download/pdf/foerderung/grundlagen_dfg_foerderung/forschungsdaten/guidelines_biodiversity_research.pdf