Recent efforts to standardize data sharing and archiving guidelines within research institutions, professional societies, and academic publishers make clear that the scientific community does not have consistent recommendations regarding data produced as output from computational simulations. The massive size of some simulation outputs, as well as the large computational cost to produce these outputs, present problems related to data storage, preservation, duplication, and replication. Large-scale and smaller-scale simulations present additional challenges related to usability, understandability, documentation, and citation.
The goal of the session is to engage data facilities and other ESIP participants in a discussion of the following questions:
How to determine the value of computational model outputs? Who makes this determination?
How are requests to archive data (including model outputs) currently funded?
How do archives maximize the utility of the model-based data?
How should model codes be made accessible?
How should citation work for model-based research, including all of its aspects (input data, model codes, processing codes, model output, metadata related to all of these)?
This session will feature multiple speakers who will present on different aspects of the challenges related to preserving and replicating simulation-based research. The first set of speakers, Gretchen Mullendore, Matt Mayernik, and Doug Schuster, will talk about the “What about Model Data?” project (
https://modeldatarcn.github.io/), funded by the NSF as an EarthCube Research Coordination Network, which is focused on determining best practices for preservation and replicability of simulation experiments. Peter Fox will discuss his experiences as Editor in Chief of the AGU Earth and Space Science journal in working with authors of computational model-based papers, especially those on intercomparisons and evaluations. Mark Parsons will discuss challenges that computational models present for citation and identification of data and software. Gary Strand will discuss how these questions are dealt with for data produced by the Community Earth System Model (CESM) community.
View Recording
View Notes