Produce RO-Crate archives for Autosubmit workflows
This is an idea for enhancement, about adding RO Crate to Autosubmit: https://www.researchobject.org/ro-crate/
RO-Crate is a community effort to establish a lightweight approach to packaging research data with their metadata. It is based on schema.org annotations in JSON-LD, and aims to make best-practice in formal metadata description accessible and practical for use in a wider variety of situations, from an individual researcher working with a folder of data, to large data-intensive computational research environments.
At the moment we archive and share experiment & workflow meta/data via pickle files, SQLite databases. I believe we should be able to easily add an option to generate RO-Crate artifacts. Here's a good introduction video about RO-Crates.
With that we would be using an open standard for archiving the digital artifacts produced by Autosubmit. Each asset could/would have a digital identifier, and could be packaged together as a RO-Crate. Besides users familiar with RO-Crate being able to quickly understand and utilize the artifacts produced by Autosubmit, we would be able to use all the tooling and integrations from RO-Crate.
Finally, because RO-Crate is built on years of research/academic/professional community experience, as well as several fronts of work on FAIRness, this would improve FAIRness in Autosubmit, and in the research produced with Autosubmit.
RO-Crate is used in CWL, Sapporo, StreamFlow, WfExS, COMPSs, ELIXIR, EOSC, and other projects in EU, Japan, Africa, and Oceania as a standard for metadata archiving and sharing.
The following web applications support Research Objects and/or RO-Crate:
- Common Workflow Language (CWL) Viewer
- ATAP Data Portal (Australia)
- WfExS-backend (BSC-led project
- Research Object Hub (EOSC service)
RO-Crate in the BSC
The main articles for citing RO-Crate include BSC members too, so I reckon we would be able to find other people to consult with about RO-Crate.