Research data represent a new currency. Traditionally, publishers did not publish or curate research data, they were interested only in publications. Now, research data and publications are at least equally important.
Data aren鈥檛 just good for science.听They are听also good for driving the kind of innovation needed to solve some of the biggest issues facing the world today. The outbreak of the coronavirus is a perfect example of how open data can help tackle a major global issue.
The spread of the virus sparked discussion in Paris last month where nine global university networks signed the vowing to develop appropriate reward systems for scholars who made their data open and accessible. It was a major milestone in advancing the cause of research data management in an open science/scholarship world.
But, without investment and culture change for managing and sharing data, the risk could be bigger than the return.听
探花视频
A recent European Commission found that sharing and better managing research data would save 鈧10.2 billion per year in Europe, with an additional potential of 鈧16 billion of added value by the innovation generated.
Research data that are听FAIR (findable, accessible, interoperable and reusable) and open for sharing and reuse mean that researchers have easy access to experimentation by others, thus avoiding the need for costly duplication.听
探花视频
They听also enable research groups to look at research methodology, to test the results produced by others, and to detect mistaken, even fraudulent, use of data. This is an important part of research integrity which, in the UK, is being promoted by the .
Not all research data can be open, because much 鈥 such as patient data 鈥 contains personal information听that would be inappropriate to share. In this sense, open data听are different from open access publications, where current debates centre on making the whole of a publisher鈥檚 output open, rather than a part of it.
These are the kinds of challenges for universities in embracing FAIR, open data.听
Universities have to create institutional research data repositories in which data can be curated and stored for the long term, as 鈥渙pen as possible, as closed as necessary鈥. In turn, there need to be data specialists to run and manage the service, and these salary costs need to be met.
So new is the concept of research data management that the first report on the stressed that Europe needed half a million research data stewards within a decade. It also said that well-budgeted data stewardship plans should be made mandatory, with the expectation that, on average, about 5 per cent of research expenditure should be spent on properly managing and stewarding data.
These are not trivial costs.What proportion of these costs should be funded by the university, and how much should be written into grants from research funders? That is an ongoing debate. And promotion and reward schemes need to value data, alongside publications, to encourage researchers to adopt open practices.
探花视频
For researchers in many disciplines, the concept of sharing data and making听them open is completely new. In other areas, such as high-energy physics or astronomy, it is virtually business as usual. The great challenges听that face society 鈥撎齭uch as poverty, global warming, ill health and injustice 鈥 are best tackled collaboratively. Research is quicker and, where open and transparent, available for close scrutiny.
How open is the research community now? As Dr Simon Hodson, executive director of CODATA, the committee on data of the International Science Council, showed at the launch of the Sorbonne Declaration, the response to the Ebola outbreak included many organisations and the resulting data were very scattered geographically 鈥 65 per cent of the data collected was not shared.听
探花视频
Most data cannot be accessed directly at the record level (eg, it鈥檚 summarised in studies and not shared). Meanwhile, most clinical records from the outbreak are simply PDF scans (and thus not universally accessible).听
There is a lack of both metadata to describe the data and also a common data dictionary (a set of definitions听that allows the variables in the data to be understood). It is also technically difficult to integrate all the different types of data that have been collected.听
There are lessons to be learned here 鈥 data capture for the current coronavirus must not make the same mistakes.
The benefits of open data can be seen in the project, which is regarded as a model for making research data openly available over the internet. Before the project, scientists shared their research findings in scientific journals. By the end of the project, scientists were willing to release their data to the world before publication.听
Former US president Bill Clinton called it 鈥渙ne of the most important, most wondrous maps ever produced by humankind鈥 and former听UK prime minister Tony Blair called it 鈥渁 revolution in medical science whose implications far surpass even the discovery of antibiotics, the first great technological triumph of the 21st century鈥.
The Sorbonne Declaration builds on the growing importance of research data in the research landscape. There is a mountain to climb, but the declaration marks the commitment of global university networks to expedite the journey.
探花视频
Paul Ayris is pro vice-provost of UCL library services and co-chair of the League of European Research Universities Info Community.
Register to continue
Why register?
- Registration is free and only takes a moment
- Once registered, you can read 3 articles a month
- Sign up for our newsletter
Subscribe
Or subscribe for unlimited access to:
- Unlimited access to news, views, insights & reviews
- Digital editions
- Digital access to 罢贬贰鈥檚 university and college rankings analysis
Already registered or a current subscriber?




