How reproducible and reliable is geophysical research? A review of the availability and accessibility of data and software for research published in journals

This is a Preprint and has not been peer reviewed. The published version of this Preprint is available: https://doi.org/10.26443/seismica.v2i1.278. This is version 2 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Supplementary Files
Authors

Mark Ireland, Guillermo Algarabel, Michael Steventon, Marcus Munafo

Abstract

Geophysical research frequently makes use of agreed methodologies, formally published software, and bespoke code to process and analyse data. The reliability and repeatability of these methods is vital in maintaining the integrity of research findings and thereby avoiding the dissemination of unreliable results. In recent years there has been increased attention on aspects of reproducibility, which includes data availability, across scientific disciplines. This review considers aspects of reproducibility of geophysical studies relating to their publication in peer reviewed journals. For 100 geophysics journals it considers the extent to which reproducibility in geophysics is the focus of published literature. For 20 geophysical journals it considers a) journal policies on the requirements for providing code, software, and data for submission and, b) the availability of data and software associated for 200 published journal articles. The findings show that: 1) between 1991 and 2021 there were 72 articles with reproducibility in the title and 417 with reliability, with an overall increase in the number of articles with reproducibility or reliability as the subject over the same period; 2) while 60% of journals have a definition of research data, only 20% of journals have a requirement for a data availability statement, and 3) despite ~86% of randomly sampled journal articles including a data availability statement, only 54% of articles have the original data accessible via data repositories or web servers, and only 49% of articles name software used. It is suggested that despite journals and authors working towards improving the availability of data and software, frequently they are not identified, or easily accessible, therefore limiting the possibility of reproducing studies.

DOI

https://doi.org/10.31223/X5W943

Subjects

Earth Sciences, Geophysics and Seismology, Oceanography and Atmospheric Sciences and Meteorology, Oil, Gas, and Energy, Other Earth Sciences, Physical Sciences and Mathematics, Sustainability

Keywords

FAIR, data, data availability, reproducibility

Dates

Published: 2022-11-17 18:07

Last Updated: 2023-04-18 01:02

Older Versions
License

CC BY Attribution 4.0 International

Additional Metadata

Data Availability (Reason not available):
10.25405/data.ncl.21564381