
Open data
Read through our guide on making research data open and accessible.
What is research data?
The University of Surrey considers research data to be any material collected, observed, processed, or created for the purpose of analysis and on which research findings and outputs are based. This includes data and documentation that are commonly accepted in the scholarly community as necessary for validation or replication of research findings. Research data may be in digital or non-digital formats. This could include:
- Audio, video, and images or photographs
- Text documents and spreadsheets
- Code, scripts, algorithms, models, and software
- Protocols and methodologies
- Specimens and samples
- Collections of digital objects
- Lab notebooks, field notes, and diaries
- Questionnaires and codebooks
- Interview schedules and transcripts
- Test responses
- Slides and artefacts
- Databases.
Why share your data?
Sharing data that underpins conclusions is at the heart of academic inquiry. Data sharing for verification and reuse can catch errors earlier, foster innovative uses of data, and push research forward faster and more transparently to the benefit of the field. Beyond academia, data can be used to the benefit of policy makers, entrepreneurs, and the public. There’s also that , greater visibility of your work, and potential collaborations and opportunities. For more check out these .
Of course, not all data is suitable to share openly. Instead, data can be shared with a range of appropriate restrictions. Be sure you have consent or permission from your participants, collaborators, partners, or supervisor before sharing any data. Once you have identified which data is shareable, you should apply appropriate safeguards. If your data cannot be shared, but has long-term value, then it should be preserved.
Sharing your research data
Data sharing should strive to be “as open as possible, as closed as necessary.” Ask yourself: which data is necessary for verifying your findings, and which data could be reused? Creating data that is easily verifiable or reusable will require some planning and preparation. It’s best to plan for data sharing and build it into your research project before you start. (A data management plan is a good way to do this.)
Of course, you will want to make sure you have permission to share data from your project. Be sure to include data sharing in your consent forms. Check out the UK Data Service’s for data sharing. If you have an industry partner or other collaborators, you should agree on any data sharing before the project begins.
In most cases, the best place to share data is through a data repository. These are online platforms designed to hold and disseminate research data. Some are discipline specific and others take all types of data. Repositories provide several advantages over trying to share data yourself. They can:
- Rank highly in search engine results
- Provide a DOI for your data for publications and citations
- Track view/download counts
- Allow versioning
- Facilitate access requests
- Provide long-term storage of your data.
Option 1: Identify a suitable external repository
- Does your funder require or recommend a particular repository? Some funders have their own platforms or recommend certain repositories, like , , and ESRC’s
- Is there a repository typically used in your research discipline? Public platforms like and accept all types of data. Some publishers may .
Please note: When you share your data externally, you will need to create an official University record of where the data is held.
Option 2: Use Surrey’s Open Research repository
If an external repository is not recommended, use Surrey's , which accepts a wide variety of research outputs. Please use our Research data deposit guide (PDF).
Whether you are creating a University record indicating the external location of the data or uploading your datasets to the University repository, follow the steps below:
- Visit the
- In the top right corner, select Surrey Researchers sign in (use your university username and password)
- Once logged in, select the ‘add content’ button (top right corner)
- Select ‘asset type’. By ‘asset’ the system means the type of research output (for example, article, book, etc). Select ‘dataset’
- If you are creating a record of your dataset, go to ‘Add links to files’ to indicate where the dataset is
- If you are uploading your dataset directly, drop or select the file to upload
- Remember to add the DOI if your dataset already has one, or reserve one in the repository if it doesn’t
- Please create a record of your data even in cases where the datasets cannot be shared
- If you have specific requirements for your data or would like more guidance, contact openresearch@surrey.ac.uk.
Data can be shared anytime! Some disciplines share data almost immediately. Others tend to do it alongside a publication. Some funders suggest specific timelines for sharing data, usually tied to publications, project end dates, or norms within your discipline.
Your journal may stipulate a timeframe for data sharing as a condition for publication. Surrey’s own policy (PDF) requires sharing data that underpins a publication within 12 months (or sooner if required by funders).
If you don’t have a funder or your funder doesn’t specify a timeline, then follow Surrey’s policy. Exceptions to funder expectations and Surrey’s policy should be outlined and justified in the project’s data management plan.
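The timeline rule above can be sketched as a small date calculation. This is an illustration only: it assumes the 12-month window runs from the publication date (the text above does not state the start point), and the function names and the `funder_months` parameter are hypothetical. Check the policy and your funder's terms for the actual requirement.

```python
import calendar
from datetime import date


def add_months(start, months):
    """Return the date `months` after `start`, clamped to the month's last day."""
    month_index = start.month - 1 + months
    year = start.year + month_index // 12
    month = month_index % 12 + 1
    day = min(start.day, calendar.monthrange(year, month)[1])
    return date(year, month, day)


def sharing_deadline(publication_date, funder_months=None, policy_months=12):
    """Latest date to share data: the policy window, or sooner if a funder requires.

    Assumption: both windows are counted from the publication date.
    """
    if funder_months is None:
        months = policy_months
    else:
        months = min(policy_months, funder_months)
    return add_months(publication_date, months)


# No funder timeline: fall back to the 12-month policy window.
default_deadline = sharing_deadline(date(2024, 3, 31))

# Hypothetical funder asking for data within 6 months of publication.
funder_deadline = sharing_deadline(date(2024, 3, 31), funder_months=6)
```

With these inputs, `default_deadline` is 31 March 2025 and `funder_deadline` is 30 September 2024, because the shorter funder window takes precedence.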
We recommend the following best practices when sharing your data to make it easier to find and use. Of course, your data should be well organised, labelled, and accompanied by sufficient documentation. In addition:
- Create a for shared data
- Use an appropriate data repository or
- Get a DOI for your data (available from repositories as part of deposit)
- Include a data availability statement in your publications (and your data’s DOI); see the ‘Data access statements’ section below
- Apply a . Some funders recommend specific licences.
One way to gauge your data sharing practices is to ask whether your data is “FAIR”: findable, accessible, interoperable, and reusable. The FAIR principles outline best practices for how to share data. The provide a complementary set of people-focused best practices.
‘Open data’ encompasses a continuum of sharing practices, allowing researchers the flexibility to balance transparency and appropriate protections for their data. Data repositories have a range of access controls that can be applied to sensitive data. Some data repositories can even handle very sensitive data, like the , which accepts clinical trial data. Depending on the sensitivities of your data, your open data practices might include:
- Sharing a mix of openly available and restricted data
- Transforming the data to make it more shareable, e.g. de-identification or aggregation
- Restricting access and setting terms of access, e.g., only bona fide researchers
- with the same characteristics as your data
- Sharing for verification purposes only, subject to a non-disclosure agreement
- Creating a publicly discoverable metadata record outlining what data is held and why it is not accessible.
The Research Integrity and Governance Office and can provide guidance if you are unsure.
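As an illustration of the transformations listed above (de-identification and aggregation), here is a minimal Python sketch. The records, field names, list of direct identifiers, and ten-year age bands are all hypothetical. Removing direct identifiers does not by itself guarantee that individuals cannot be re-identified, so treat this as a starting point for discussion rather than a complete method.

```python
from collections import Counter

# Hypothetical participant records; every field name and value is invented.
records = [
    {"name": "A. Example", "email": "a@example.org", "age": 23, "score": 4},
    {"name": "B. Example", "email": "b@example.org", "age": 37, "score": 5},
    {"name": "C. Example", "email": "c@example.org", "age": 31, "score": 3},
]

# Assumed list of direct identifiers; adapt it to your own dataset.
DIRECT_IDENTIFIERS = {"name", "email"}


def age_band(age, width=10):
    """Coarsen an exact age into a band such as '30-39'."""
    lower = (age // width) * width
    return f"{lower}-{lower + width - 1}"


def deidentify(record):
    """Drop direct identifiers and replace the exact age with an age band."""
    kept = {k: v for k, v in record.items() if k not in DIRECT_IDENTIFIERS}
    kept["age_band"] = age_band(kept.pop("age"))
    return kept


def aggregate_by_band(rows):
    """Aggregate de-identified rows into participant counts per age band."""
    return dict(Counter(row["age_band"] for row in rows))


shareable = [deidentify(r) for r in records]
summary = aggregate_by_band(shareable)
```

Here `shareable` holds row-level data without names, emails, or exact ages, while `summary` holds only counts per band, which may suit data that is too sensitive to share at row level.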
Funders recognise that data may need to be restricted for commercial reasons. If commercial data can’t be transformed to make it more shareable, then consider making the data available only for verification purposes under a non-disclosure agreement. This meets open data expectations around transparency and verification of published findings.
If your data has commercial potential, please ensure that you have read and followed the University鈥檚 Intellectual Property Code, and contact the Technology Transfer Office (techtransferteam@surrey.ac.uk).
While not all data may be suitable for sharing immediately, any data with long-term value should be preserved. To increase the likelihood of survival and reusability, data preservation should address the following:
- Preparing your data for preservation, including:
  - Considering the cost of preserving data
  - Identifying what can be discarded
  - Good documentation and file organisation
  - If your data isn’t in widely used formats, consider transforming it into open formats.
- Finding a home for your data:
  - Is your data already in a data repository? It may have a preservation policy
  - Surrey can accept data for long term preservation through .
- Timelines for retention:
  - Your data may be subject to statutory or funder requirements for preservation
  - Surrey requires research data to be retained for a minimum of ten years.
Please note: USB sticks, external storage, personal laptops, project websites, and local hard drives are not suitable for long term preservation.
Physical data with long-term value should also be preserved. If you can’t make a digital surrogate of the physical data, then you can create a metadata record in indicating what physical objects are held and how they can be accessed.
The Digital Curation Centre has a useful guide for preservation, , and Jisc’s Research Data Management Toolkit includes a . The Software Sustainability Institute offers .
Data access statements
A data access statement (also referred to as a ‘data availability statement’) is a short statement added to a research paper to inform the reader:
- Whether there is research data associated with the paper
- Whether the research data associated with the paper is available, and if this is the case, where and under what terms it can be accessed
- Whether the research data associated with the paper is restricted, and if this is the case, the reasons why.
The University's Research Data Management policy expects you to include a data access statement in your publications. This is in line with requirements set by some research funders, including UKRI, which requires "in-scope research articles to include a Data Access Statement, even where there are no data associated with the article or the data are inaccessible".
Many journals support the inclusion of data access statements, and provide relevant guidance. See examples from Springer Nature, , and .
If a journal does not provide its own guidance, you can use the examples provided below.
Data availability statements should include:
1. Terms of access (if any).
2. Persistent identifier (e.g. DOI) linking to data in a repository; or where the data can be found (e.g. a third party).
3. If the data is restricted, a statement justifying why.
4. If there is no data or all the data required to verify the findings appears within the publication, then the statement can simply say that there is no data or that the data appears within the publication.
If the data is openly available:
- The data underlying this article are available in [repository name, e.g. the xxxx Repository], at [DOI], or give [URL]
- The data underlying this article were derived from sources in the public domain: [list sources, including URLs]
- This publication is supported by multiple datasets that are openly available at locations referenced in this paper.
If the data is already included in the paper:
- The data underlying this article are available in the article / in the online supplementary material.
If the data is restricted:
- The data underlying this article are subject to an embargo of [period of embargo of X months from the publication date of the article] to allow for commercialisation of the results. Once the embargo expires the data will be available [give details of availability, e.g. in a repository plus embargoed link; upon reasonable request, etc.]
- The data underlying this article cannot be shared publicly due to [briefly describe why the data cannot be shared, e.g. for the privacy of individuals that participated in the study]
- The data underlying this article were provided by [third party] under licence / by permission. Data will be shared on request to the corresponding author with permission of [third party].
If there is no data:
- No data were created, collected or analysed in this study.
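If you prepare many publications, the elements and example wordings above could be assembled with a small helper. The sketch below is hypothetical: the function name, its parameters, and the order in which cases are checked are assumptions, and the DOI shown is a placeholder, not a real dataset. Always check the result against your journal's own guidance.

```python
def data_access_statement(repository=None, identifier=None, terms=None,
                          restriction_reason=None, in_article=False):
    """Build a data access statement from the elements listed above.

    Hypothetical helper: restricted data takes priority, then repository
    deposits, then data contained in the article, then the no-data case.
    """
    if restriction_reason:
        return ("The data underlying this article cannot be shared publicly "
                f"due to {restriction_reason}.")
    if repository and identifier:
        text = ("The data underlying this article are available in "
                f"{repository}, at {identifier}.")
        if terms:
            text += f" Terms of access: {terms}."
        return text
    if in_article:
        return "The data underlying this article are available in the article."
    return "No data were created, collected or analysed in this study."


# Placeholder repository name and DOI, for illustration only.
open_statement = data_access_statement(
    repository="the Example Repository",
    identifier="https://doi.org/10.5555/example",
    terms="CC BY 4.0",
)

restricted_statement = data_access_statement(
    restriction_reason="the privacy of individuals that participated in the study",
)

no_data_statement = data_access_statement()
```

Each call returns one sentence or two that can be pasted into the manuscript and edited by hand where a case does not fit these templates, such as embargoed or third-party data.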
Resources
- Qualitative Data Archive’s module