The need for data archives is on the rise now that research funders are increasingly demanding that research data is made available via open access. At the same time, the funding model for data archives is still far from clear. There is a tension between short-term funding, which is inherent to research projects, and the long-term effort involved in continually making research data from those projects available.
There are quite a few projects(2) that have identified the costs for long-term storage. “Unfortunately, we've all used different models, and different ways of describing the data”, so says Paul Wheatley in a blog post. The European 4C project(4) is trying to introduce change and has compared the various cost models(5). "The aim of this project is to help organisations across Europe to invest more effectively in digital curation and preservation", according to project partner DANS. Identifying the costs is one thing; identifying the benefits, such as added value for future users, is also part of the story."
The authors of the white paper(6) 'Sustaining Domain Repositories for Digital Data' feel that the best solution would be if research funders would include the costs for long-term storage (by making a percentage of the total budget available for this purpose). Since this model would lead to all data being equal: they all have the same chance of being archived.
UK Data Archive developed a costing tool(7). This provides insight into the cost items that you need to take into account during the entire lifecycle of the research data (so not only for long term preservation). The person completing the tool remains responsible for the estimate.
On the 4C platform Curation Costs Exchange(9) data archives and other stakeholders can compare cost information and discuss their expenses and underlying choices.