A "README" file is a guide for your dataset. It is typically a plain text file to maximize ease of use and long-term preservation potential. The purpose of a README file is to help other researchers (or yourself in the future) in understanding your dataset, its content, origin, license, and how to interact with it. README files are commonly named README or LISEZ-MOI, readme.txt or lisezmoi.txt, or read-me.md.
The name "README" signifies that the file contains important information, and the file type, "TXT," can be opened by various software, making the content widely accessible.
README files are included as a component of a dataset.
When you deposit your data in repositories (e.g. Borealis or FRDR), you are asked to provide metadata. A README file complements, but does not replace, repository metadata.
The best practice is to record information in both the repository metadata and the README file. Repository metadata will facilitate searching within and between data repositories, while the README file follows the dataset and continues to describe it after it has been separated from its original context. In all cases, you should use the conventions appropriate to your discipline to record information about your dataset.
Access this data set by clicking on the link:
Clark, Luke, 2019, “Role Reversal: The Influence of Slot Machine Gambling on Subsequent Alcohol Consumption”, https://doi.org/10.5683/SP2/SLOY0N, Borealis, V1, UNF:6:zsehCAz4agntvPwDZF03OA== [fileUNF]
Select and download the data file "Gambling_Alcohol_Study 1_Archive.tab" in the original file format.
Looking at the data, try to answer the following questions:
"What do you observe?"
The recommended minimum content for data reuse is shown in bold (Cornell, 2023).
General information
Data and file overview
Sharing and accessing information
Methodological information
Data-specific information
*Repeat this section if necessary for each data set (or file, if applicable)*.
THE STYLE
The way you write your README is as important as the information you include. Remember to be as clear as possible. Here are some best practices for documenting data:
An example of README content:
To use the ÉTS README template, click here.
For more information, see the README section of this guide.
THE PROCESS
"Document your work as you go along, so you don't lose any details. If you wait until the end of your project, you may have already lost or forgotten valuable information."
You can create a README using any text editor (e.g. TextEdit, Notepad++, Atom.io, Sublime Text) or word processor (e.g. Word, LibreOffice).
However, save your README as UTF-8 encoded text. Using plain text preserves your information because it relies on sustainable, open standards rather than proprietary formats. If you're using GitHub, your README should be written using Markdown syntax (readme.md).
Store the README at the top level of the project folder on your computer, next to the project files.
Download the ÉTS README template and choose a data project you're currently working on. Spend 5 to 7 minutes filling it in.
Pay particular attention to the list of variables. A dataset without named variables is not useful. How would your peers know what a variable named "Data.VF.1", for example, means?
Now you're ready to write a good README file so that other researchers can understand your dataset without a hitch!
Photo by Vasily Koloda on Unsplash
This page is an adaptation of the guide Introduction to Research Data Management : File Naming by UBC Library Research Commons licensed under a Creative Commons Attribution 4.0 International licence (CC BY).. Research Data Management : File naming is licensed under a Creative Commons Attribution 4.0 International licence (CC BY) by ÉTS Library.
Kristin Briney (2023).Chapter 2- Documentation. Dans The Research Data Management Workbook. Caltech Library. https://doi.org/10.7907/z6czh-7zx60
Alert icon. Icône repéré sur Flaticon: <a href="https://www.flaticon.com/free-icons/alert" title="alert icons">Alert icons created by Pixel perfect - Flaticon</a>