Archiving and publishing research data

At the end of your study, you must archive the data that underlie your publication for a minimum of 10 years. Proper archiving promotes scientific integrity and enables (internal) reuse. It adds to the findability and usability of your research, and as a result increases the impact of your scientific output. Radboud University's RDM policy also states that you must share your data as open as possible, and as closed as necessary. That means that in most cases you want to publish (a part of) your data. Both archiving in a closed environment for scientific integrity and publishing require similar preparations: 

Decide (open and/or closed archiving)

The Radboud University RDM policy states that “Research data should be archived as open as possible, as closed as necessary. The starting point is that research data should be made publicly available.”. This means that you need to decide what part of your data can be openly archived (published) and what part belongs in a closed archive. 

Find a repository

Once you have decided whether your data belongs in a closed and/or open archive, you need to find a suitable repository (or sometimes two if you need an open and a closed archive). First check your institute’s RDM policy or ask a (senior) colleague for advice. They might recommend specific repositories that are well known in your field of study. Also see if your research institute can make use of the Radboud Data Repository (RDR). If you use the RDR, you automatically comply with Radboud University’s standards and the requirements stated below.

Trustworthy and FAIR-enabling

The repository of your choice, whether it is closed or open, needs to be trustworthy and apply the FAIR principles (Findable, Accessible, Interoperable and Reusable). Using a trustworthy and FAIR-enabling repository can make your data more future-proof and it increases the impact of your scientific work. Additionally, the latter (FAIR-enabling) is required by Radboud University as defined in the RDM policy. 

Prepare your data

Once you have found a proper repository, you need to prepare your data for long-term storage, whether this is closed archiving or publishing. 

Add rich metadata to your dataset. Provide a proper title, add a comprehensive description of the research context and the content of the dataset, include keywords, and fill in as many of the other metadata fields that the repository of your choice provides. This step is key for the Findability and Reusability of your dataset. It is not only important for published datasets, but also for closed archived datasets, because those metadata might be the only visible and thus findable part of your dataset.

Add documentation. For Findability and Reusability, add rich documentation about the context, quality and condition, or characteristics of the data. Usually this is done in a readme file that is part of the metadata and openly accessible. 

Additional FAIR-ification steps:

  • For Accessibility, make sure all access rights are clear and implemented correctly. For closed archiving, this means making sure that those who are allowed access to the archive, have access and no-one else. For publishing, this means choosing a fitting access level and license or Data Use Agreement. Remember to publish your data “As open as possible, as closed as necessary”
  • For Interoperability and Reusability: If your field has a vocabulary for specific terms or concepts, use those in your dataset and add information about the vocabulary. Otherwise add documentation to clarify what your terms and concepts are (e.g. by adding a codebook)

If you are publishing parts of your data, make sure that they do not contain any information that you are not allowed to share under the selected access level, especially when it comes to personal data. 

More information on anonymisation and pseudonymisation

Archive

A lot of trustworthy repositories curate a dataset before it is published in order to safeguard the quality of the repository and the datasets in them. These checks can include an assessment on personal data, metadata, documentation and/or file formats. As a result, the repository could require you to make changes to your dataset. Be aware that this process can take several days, so make sure you include this in your planning.

Contact

Do you have a question? Get in touch with Research Data Management support:

+ 31 24 - 361 28 63