Yale IT provides indepth guidance to assess your data risk level and approved services to keep your research safe.
For pros and cons of data storage options: Ruggiero, Paul and Matthew A. Heckathorn. Data Backup Options. Carnegie Mellon University for US-CERT, 2012.
Dryad is an open-source, research data curation and publication platform, making data publishing easy for the researcher. The Dryad platform accepts data from any discipline. As institutional member Yale researchers can deposit their data free of charge without limitation on the number of datasets deposited.
Highlights of the platform:
Dryad is committed to supporting the changing needs of research allowing for datasets to be submitted and published at any point in the research process, providing full support for versioning, and fields for notes, methods and vocabularies. While Dryad accepts all research data, the platform is intended for complete, re-usable, low risk and open research datasets. For information on Dryad’s guidelines for human subjects data, see https://datadryad.org/docs/HumanSubjectsData.pdf
Getting started with Dryad is easy. The platform uses ORCID as its primary login method, which is required to deposit data. You can connect your ORCID ID to the Yale Institutional membership using your NetID here. Don’t have an ORCID ID? Registration is simple and quick at https://orcid.org/register. Anyone can browse available datasets using the Explore Data link.
1. always keep original data
2. back up regularly (automate this if at all possible)
3. document your data thoroughly (metadata, data dictionary)
4. name and organize files according to a schema
5. use version control
6. secure the data appropriately
7. cite any secondary data you use
8. consider your long-term plan: reuse?, sharing? what to keep? where to store?
For help finding, using, managing, or archiving your research data, contact Research Data Support Services.
Research data is loosely defined as information collected, observed, or created for purposes of analysis to produce original research.
This includes observational variables like rainfall, wind speed, water quality, or survey data; simulated data from earthquake models; experimental data from lab instruments; and derived or compiled data for text mining or testing algorithms. Research data can take almost any digital file format (video, text, photographs, numbers), so managing it effectively can be a challenge.
Good data management: