Research data will typically be stored, managed and often shared throughout the course of a research project. After a project has ended, the data will need to be retained for a specified period. When considering longer term archiving, it can be tempting to keep all data for possible future use. However, not all data can or should be preserved. Evaluating your data will enable you to decide which data to deposit in a trusted archive or repository.
This set of guidelines are a starting point for determining how long data should be retained and selecting which data should be preserved in the long term. They may help you at the beginning of your project when you are composing your data management plan. Including information on the preservation of research data will help you to clarify what the requirements for your data are and enable advance planning for long-term storage of your research data. They may also be useful towards the end of your project, when you are sorting your data for deposit in a data archive.
You may also find the following sources of information useful:
Although it may be tempting to deposit all of your data into an archive or repository, just in case it proves useful in the future, this is not always possible, as archiving everything could prove to be costly and time-consuming and in some cases, unethical.
While it is true that the cost of data storage has declined over time, the cost of organising, describing and then maintaining data in a usable form has a considerable cost. In fact, the cost of data “curation” is often much greater than the cost of data storage.
If you are gathering data from human research participants, you will also need to ensure that you have gained their informed and valid consent for the specific archival, share and re-use management plan for the data, and you will need to have a plan to securely dispose of any data which cannot be shared.
The following questions may help you when considering which data to deposit:
The following policies and formal requirements will also impact on your decision:
The decision of what data to select for long-term preservation will never be entirely objective, as it is impossible to know exactly what information will be useful in the future. However, by thinking it through carefully, abiding by any relevant policies or requirements (e.g. from funders) and documenting the decisions made and the reasons for them, you can ensure that your selection process will be as fair and as accurate as possible.
It is helpful to consider the long-term preservation of your research data when writing your Data Management Plan; by planning which data to archive and incorporating a research data management strategy into the life-cycle of your project you will find the process of selecting and preparing data for long-term preservation easier and quicker.
The Open University’s Retention Schedule advises that all research records (which include raw and analysed research data) be kept for ten years from publication. Any research records or data which you decide not to deposit in an archive or repository should still be kept for ten years however, in these cases, the management and accessibility of those data and records remains your responsibility.
Open University requirements: The Open University’s Retention Schedule advises that research records (which include raw and analysed research data) be kept for ten years from publication. After ten years, if the data prove to still be pertinent, then a longer retention period may be considered. In the case of data retained in support of patent applications, the data must be retained for the lifetime of the patent (up to 20 years or until the patent is lapsed). Please discuss any queries on data related to patents with RES. For more information, consult the University’s Retention Schedule
Funder requirements: Different funders have different requirements about where, when and for how long data should be kept post-project. Ensure you are aware of your funder’s policy for data retention.
Retention “in perpetuity”: Certain types of data might have a recommendation that retention is ‘in perpetuity’. Usually this relates to data that cannot be recreated (e.g. observations of unique events). Although true for all retained data, it is important to ensure that additional care is taken to hold this data in a durable format.
Funders with differing retention periods: If the retention period differs between your funder and the University, or multiple funders on the same grant, data should be retained for the longer of the periods.
Data and publication: Increasingly, publishers are requiring the deposit of supporting data with the article, while others require the provision of a link to the data. You should consider this when deciding how long you will need to retain the data and it may influence your choice of storage location.
As part of the deposit process you will be asked to consider what should happen at the end of the retention period and who is responsible for carrying this out. At the end of the retention period you may need to re-evaluate your data in case it needs to be retained for longer than you had planned. You will need to consult the data centre or archive in which it is stored in order to arrange an extension. If you leave the OU before the end of the retention period, it is your responsibility to hand over custody of the research data to a colleague who will be able to make any future decisions about it.
For more information on securely disposing of your data consult the Media Disposal Policy within the Information Security Specific Policy Set