Store & Secure

Storage Strategies

The storage of research data poses challenges for an institution and its technical infrastructure, requiring the creation of long-term and secure storage solutions for various quantitative and qualitative data types. The requirements for storing complex experiments or video files differ from those for storing social science survey data, that is why potentially different storage technologies may be necessary to ensure fast access to different data types.

Within research data management, a distinction must be made between short-term, medium-term, and long-term data storage. Short-term storage occurs during the research process as part of data collection and further data processing.

Short-Term Storage Strategy

You can provide guidance to researchers on short-term data storage during the research process. This can help to ensure appropriate handling of research data during the research process.

Medium-Term Storage Strategy

Medium-term data preservation requires an institutional backup strategy. The following questions should be considered for its development:

  • Who is responsible for data backup at your institution?
  • Who has access to the backed-up data?
  • When and how often should data backup be performed?
  • How should the backed-up data be secured?
  • Which data should be backed up?
  • What storage media should be used?
  • How many backup copies should be created?
  • Where should the backups be stored, and how should they be protected?
  • How should the transport of backup copies be handled?
  • Over what timeframe should a backup be retained?

[Source: https://ianus-fdz.de/dateispeicherung]

Answering these questions can assist in developing a medium-term data backup strategy in order to protect your institution from potential data loss.

Long-Term Storage Strategy

A long-term storage strategy involves archiving research data. Archiving requires not only secure storage but also comprehensive data description with Metadata, which makes the reuse of the research data possible in the first place.

In terms of long-term storage of research data, some file formats are more suitable than others. The decision for or against a certain format depends largely on data compatibility with other data or software to be used, suitability for long-term archiving, and the ability to convert data if necessary.

A rough overview of the suitability of different data formats is shown in the table below:

SustainabilityMachine ReadabilityHuman ReadabilityLong-Term ReadabilityMetadata
very goodwith widely used open softwareyes, without specialized softwarestandardized standardfully included
goodwith well-known and documented softwarecompressed according to standard procedures but technically yeslong-established or widely establishedtechnical details are included
averageproprietary standard formatwith open software (reliable?) convertible to a higher classrelatively new formatsome important (e.g., units) details are included
poorself-developed reading softwarenojust inventedno details
[Source: https://forschungsdaten.info/themen/veroeffentlichen-und-archivieren/formate-erhalten/ (last accessed: 13.09.2022, 11:07 AM)]

For institutional storage of research data, in addition to the appropriate storage strategy, a backup must also be established. For further guidance on this, please refer to the article ‘FDM Consulting: Storage Guidance for Researchers.