Difference between revisions of "FAIR - Reusable"

Line 4: Line 4:
 
== '''<span style="font-family:Arial,Helvetica,sans-serif;"><span style="font-size:large;">Data should be accompanied by enough information on how it was collected or processed, as to guarantee its quality and hence make it usable by other. It should have a license that allows and&nbsp;facilitates reuse</span></span>''' ==
 
== '''<span style="font-family:Arial,Helvetica,sans-serif;"><span style="font-size:large;">Data should be accompanied by enough information on how it was collected or processed, as to guarantee its quality and hence make it usable by other. It should have a license that allows and&nbsp;facilitates reuse</span></span>''' ==
  
=== '''<span style="font-family:Arial,Helvetica,sans-serif;"><span style="font-size:medium;">Data has a license</span></span>''' ===
+
&nbsp;
  
=== '''<span style="font-family:Arial,Helvetica,sans-serif;"><span style="font-size:medium;">Data has provenance</span></span>''' ===
+
=== '''<span style="font-family:Arial,Helvetica,sans-serif;"><span style="font-size:medium;">Data has a detailed provenance</span></span>''' ===
 +
 
 +
<span style="font-size:small;"><span style="font-family:Arial,Helvetica,sans-serif;">Provenance indicates the history of your data, so it should include as much as possible information on the&nbsp;code, datasets and processes used to produce the data. Provenance can be recorded as a separate document, but it is often composed by several elements. The metadata attached to&nbsp;the publication is part of the provenance, as is metadata available in the data files, any relevant technical report, links to&nbsp;source code and data documentation and other references.&nbsp;They can all contribute to your data provenance.&nbsp;</span></span>
 +
 
 +
Provenance is central to data reproducibility and hence to build trust in the data.
 +
 
 +
&nbsp;
 +
 
 +
'''<span style="font-family:Arial,Helvetica,sans-serif;"><span style="font-size:medium;">Data has a license</span></span>'''
 +
 
 +
A dataset without a license cannot be used. A potential user would have to contact the owner and ask for permission to use the data. A license tells immediately to a user what can be done with the data.
 +
 
 +
The license should be clear, it is always better to use an internationally recognised license, rather than a custom one. Widely used licenses are easily recognised by other users, so they know what the license cover without having to read it. These licenses are also more machine-readable as software to run queries on repositories will recognised them.
 +
 
 +
&nbsp;
  
 
=== '''<span style="font-family:Arial,Helvetica,sans-serif;"><span style="font-size:medium;">Data uses community standards</span></span>''' ===
 
=== '''<span style="font-family:Arial,Helvetica,sans-serif;"><span style="font-size:medium;">Data uses community standards</span></span>''' ===
 +
 +
Data that uses&nbsp;file formats, standards and conventions used by the community are more reusable. Applying discipline conventions makes the data more readable both by other reaserchers, and by software developed for the same community. Discipline specific software modules often adopt the same conventions, and make assumptions on how data might be structured or variables named.&nbsp;
 +
 +
Using accepted vocabularies, for example to name variables, reduce the&nbsp;risk of the data being mis-intepreted and mis-used.
 +
 +
&nbsp;
  
 
----
 
----
  
 
'''<span style="font-family:Arial,Helvetica,sans-serif;"><span style="font-size:medium;">Related pages</span></span>'''
 
'''<span style="font-family:Arial,Helvetica,sans-serif;"><span style="font-size:medium;">Related pages</span></span>'''
 +
 +
[[Open_access_licenses|<span style="font-size:small;"><span style="font-family:Arial,Helvetica,sans-serif;">Open Access licenses</span></span>]]
 +
 +
<span style="font-size:small;"><span style="font-family:Arial,Helvetica,sans-serif;">Standard Conventions</span></span>
 +
 +
<span style="font-size:small;"><span style="font-family:Arial,Helvetica,sans-serif;">Provenance</span></span>
 +
 +
&nbsp;
 +
 +
&nbsp;

Revision as of 00:59, 24 May 2021

Template:Working-on

Data should be accompanied by enough information on how it was collected or processed, as to guarantee its quality and hence make it usable by other. It should have a license that allows and facilitates reuse

 

Data has a detailed provenance

Provenance indicates the history of your data, so it should include as much as possible information on the code, datasets and processes used to produce the data. Provenance can be recorded as a separate document, but it is often composed by several elements. The metadata attached to the publication is part of the provenance, as is metadata available in the data files, any relevant technical report, links to source code and data documentation and other references. They can all contribute to your data provenance. 

Provenance is central to data reproducibility and hence to build trust in the data.

 

Data has a license

A dataset without a license cannot be used. A potential user would have to contact the owner and ask for permission to use the data. A license tells immediately to a user what can be done with the data.

The license should be clear, it is always better to use an internationally recognised license, rather than a custom one. Widely used licenses are easily recognised by other users, so they know what the license cover without having to read it. These licenses are also more machine-readable as software to run queries on repositories will recognised them.

 

Data uses community standards

Data that uses file formats, standards and conventions used by the community are more reusable. Applying discipline conventions makes the data more readable both by other reaserchers, and by software developed for the same community. Discipline specific software modules often adopt the same conventions, and make assumptions on how data might be structured or variables named. 

Using accepted vocabularies, for example to name variables, reduce the risk of the data being mis-intepreted and mis-used.

 


Related pages

Open Access licenses

Standard Conventions

Provenance