Many definitions attempt to capture the notion of Research data. This challenge is all the more difficult because it is necessary to find a common denominator for many types of data (sociological, economic, medical, biological, constructed from sounds, images or even odours, etc.).
It is commonly accepted (OECD - 2007) that research data are factual records used as primary sources for scientific research. They are generally recognized by the scientific community as necessary to validate research results. They can take many forms (experimental data, observational data, operational data, third party data, public sector data, etc.).
At UNIL, according to Directive 4.5, research data are records used as primary sources for scientific research.
They include in particular, but not exclusively :
The following are not considered as research data :
In order to facilitate their publication and/or sharing in possible open access, research data are organized and managed according to international standards specific to each field in order to respect the FAIR principles (Findable, Accessible, Interoperable, Re-usable) supported in particular by the Swiss National Science Foundation (SNSF) .
Research data have a long lifespan, often longer than the period between their creation and the writing of the scientific publication for which they were created. The function and value of the data changes from one phase of the cycle to the next. The concept of research data life cycle is a tool that can be used to map different phases and see how they connect to each other. The use of a life cycle makes it possible to move from a short-term perspective to a long-term perspective in data management.
Developed by UK Data Archive, the Research Data Lifecycle Reference Model defines 6 main steps : Data creation ; Data processing ; Data analysis ; Preparing data for preservation ; Data access ; Data reuse.
Each of these steps consists of several actions to be carried out to ensure proper management of research data.
Uniris has developed a similar vision based also on 6 phases :
Taking these 6 phases into account allows the following aspects to be achieved :
A distinction is made between active research data, the preservation of part of this data (long-term preservation) and permanent archiving and data sharing.
See the diagram below.
Research data is numerous, varied and highly heterogeneous. They can be distinguished into five categories (André, 2014):
Depending on the context in which it is created (capture or production), how it is exploited, analysed and processed, research data can be of different kinds:
Contained in various media:
Of all types: