Sharing data – Where and how?

Planning data sharing
Deciding if the data can be shared and how requires planning. Researchers have a key role to play in deciding what data can be shared, but it is important to note that there are also other stakeholders (e.g. data sharing must be agreed within the research group, research participants must be informed, and organisational and funder requirements must be checked) involved in making this decision.
This “horror story” shows what can happen when data is not managed and shared adequately (4:41, CC BY NYU Health Sciences Library):
Always plan well and carry out data sharing carefully.
Here is a short checklist for data sharing:
- Data may only be published by permission from the data owner. Make written agreements on data ownership issues with all parties, preferably before the data collection.
- Think carefully about which data you can make open. Consult with relevant stakeholders when making decisions. Find out if there are any ethical, juridical or contractual limitations for sharing the data.
- Inform research participants about data sharing. In most cases, only anonymous data can be openly shared. So, anonymise your data if needed.
- Update your data management plan, whenever needed, as your project progresses.
- Provide good documentation and metadata about the data. Aim at making your data as FAIR as possible. As a part of data documentation and description, provide a contextual framework of your data. Data without metadata is impossible to make sense of or reuse, thus it will likely not get the reuse or citations you might hope for.
- Choose a suitable repository or journal for publishing your data and/or metadata. In Finland, it is also good idea Share your data via a data repository or other service, but in Finland, it is also good idea to describe your data with Qvain tool and publish metadata in the national research dataset finder Etsin.
- Assign a digital persistent identifier (e.g. DOI) for your data and metadata. These are usually provided by the repositories.
- License your data. Creative Commons licenses offer different license types that will meet most researchers’ needs. Also, watch this video (9:34): How to choose a license for open scientific data and code?
- Data repositories often provide a recommendation for data citation. If there is no recommendation, provide a recommended way to cite your data. Notice data citation guidelines introduced in Reusing and citing research data.
- Remember to link all the outputs to your ORCID identifier.
Ways to share your research data
There are different ways to share your data, e.g.:

Metadata sharing
In case you can’t publish your data (because of confidentiality, contracts or other issues limit it), you can usually publish its description, i.e. metadata, though. In Finland, Etsin is a national service for metadata of research data. Etsin enables you to find research datasets from all fields of science. It harvests information about the datasets and metadata in the national services, like Finnish Fairdata services, the Language Bank of Finland and the Finnish Social Science Data archive, but also metadata of datasets located in other repositories are recommended to be added.
In Etsin, the published metadata on the dataset is open to everyone. The data owner decides how the underlying research data can be accessed and by whom. Etsin works independently of actual data storage locations and contains no research datasets. Datasets can be described and metadata published through the Qvain service (Qvain -> Describe a dataset). When producing metadata, it is useful to use general subject headings (Qvain uses KOKO ontology maintained by Finto) for describing the research data with commonly agreed terms. The use of Etsin is recommended for all research projects in Finland. If you are outside of Finland, find out if there are similar metadata services available in your country.
Remember:
- There are many ways to share research data: subject-specific and general data repositories, institutional data repositories, data journals and supplements to a peer-reviewed article.
- Share at least the metadata of the datasets.
- Choose a repository that meets your needs and provides a persistent identifier for your data.
- Consult with data support of your research organisation if needed.
- Data that will be shared for reuse should be licensed. Licensing defines the data’s author and user rights.
< Module 4: Making it FAIR
Module 4: Reusing and citing research data >