Difference between revisions of "Publishing options"

Line 1: Line 1:
[[Category: Data]]
 
  
 
Recently journal editors have updated their data policy and now require that data relating to the submitted paper should be made available by the authors. From the [http://publications.agu.org/author-resource-center/publication-policies/data-policy/ AGU data policy] : "..all data necessary to understand, evaluate, replicate, and build upon the reported research must be made available and accessible whenever possible ..."
 
Recently journal editors have updated their data policy and now require that data relating to the submitted paper should be made available by the authors. From the [http://publications.agu.org/author-resource-center/publication-policies/data-policy/ AGU data policy] : "..all data necessary to understand, evaluate, replicate, and build upon the reported research must be made available and accessible whenever possible ..."
Line 10: Line 9:
 
*<span style="font-family: Arial,Helvetica,sans-serif;">Derived data products reported or described in a paper. "</span>  
 
*<span style="font-family: Arial,Helvetica,sans-serif;">Derived data products reported or described in a paper. "</span>  
  
There can be practical and even copyright limitations to do this, but these can be taken into account and it should not be an impediment to publication if properly documented. The JGR-Space Physics editor-in-chief has listed some of these challenges and clarified the scope of the policy on his [https://liemohnjgrspace.wordpress.com/category/publication-policy/ blog]. This include references to model data which apply to lots of the CLEx data as well.
+
There can be practical and even copyright limitations to do this, but these can be taken into account and it should not be an impediment to publication if properly documented. The JGR-Space Physics editor-in-chief has listed some of these challenges and clarified the scope of the policy on his [https://liemohnjgrspace.wordpress.com/category/publication-policy/ blog]. This include references to model data which apply to lots of the CLEX&nbsp;data as well.
  
 
The [[Publisher_policies|publishing policies]] page offer a&nbsp;list of data policies and&nbsp;requirements by publisher.
 
The [[Publisher_policies|publishing policies]] page offer a&nbsp;list of data policies and&nbsp;requirements by publisher.
  
 +
&nbsp;
  
 +
=== <span style="color:#3498db;">'''Where should I publish my data?'''</span> ===
  
 +
You usually have at least three&nbsp;options and there's not a straight answer it depends on what you're publishing and why.
 +
 +
<u>CLEX&nbsp;Data Collection on NCI</u>: this is the best place if you have actually produced the data yourself. We'll help you document&nbsp;your data and make it user friendly, it will be part of a climate data collection and so it will be easier to discover. NCI also has more storage capacity than&nbsp;other repositories and services which are designed around the netcdf format.
 +
 +
<u>Institutional repository</u>: this can be adeguate if you have a small dataset and it is really specific to your study or only a subset/post processing of another dataset. While institutions offer some data curation they usually won't check the data&nbsp;is well described, consistent and user friendly, so you will get a DOI for your record&nbsp;but no added value.
 +
 +
<u>Zenodo, Figshare, Mendeley:&nbsp;</u>these services are free and you can create your own account, create a record for your data and get a DOI fairly easily and quickly. You can also publish here different kind of materials. This can be useful if you want to publish some very specific data, for example code and data to produce a specific figure required to publish&nbsp;a paper.&nbsp;If you are publishing data that can be actually useful to others, then you want to make sure is FAIR (see above for a full description).&nbsp;Findable, which is harder when your record is not part of a discipline repository or collection or you haven't used keywords in an effective manner. Accessible, Interoperable and Reusable which means the data should have enough metadata, use standards, be&nbsp;properly described. Finally the&nbsp;data size is limited to 50 GB and you won't get any additional data services apart from HTTP&nbsp;download.&nbsp;This should be your last resort, still if you go this way please make sure you document your data properly. we are happy to provide support and review your record. We are also looking into creating &nbsp;CLEX community in Zenodo to be able to collect in one place these data records.
 +
 +
If you're confused feel free to ask us we are always happy to provide advice and support and whatever you choose to do remember to report the title, citation and doi on Clever. Published datasets are part of the Centre&nbsp;KPIs and something we report to our funders. It will only take a couple of minutes.
 +
 +
=== <span style="color:#3498db;">'''Where should I publish my code?'''</span> ===
 +
 +
More and more frequently you are reuqired to publish also your code by journal editors. This is important to make your research reproducible, in particular for the paper reviewers. It&nbsp;can&nbsp;also be an opportunity to share a successful code with other researchers. If your code is properly published and someone else uses it, then it is easier for them to cite you as the author. Like data, code represents part of your work and funders are starting looking at all research products not only papers when reviewing applications. Putting your code on GitHub or another version control service helps to keep track of the code, expose it to others and manage pontetial issue sans enhancements. But GitHub is not ideal if you want to pinpoint the code you used for a paper or to create some data. Zenodo is a platform that integrates well with GitHub to allow you&nbsp;[https://www.google.com/url?sa=t&rct=j&q=&esrc=s&source=web&cd=&cad=rja&uact=8&ved=2ahUKEwiL1tLP8snvAhVL7XMBHUolAmIQFjAAegQIAhAD&url=https://guides.github.com/activities/citable-code/&usg=AOvVaw1VVBoyAZecm_pcXGudJRHP to publish automatically every release]. Zenodo will keep a snapshot in time of your code and assign a DOI to it. if you're not using GitHub you can simply upload your files directly in Zenodo. These are the reasons we started a Zenodo community for a&nbsp;[https://zenodo.org/communities/arc-coe-clex/?page=1&size=20 CLEX Code Collection (CCC)]in 2020. We published there some of our code and&nbsp;code used to produce papers as&nbsp;required by journal editors. We are now looking into broadening this and actively seek contributions of code, notebooks etc that might be useful to others. Zenodo has given our codes much more visibility than GitHub and some of the code has receveid lots of views and downloads.&nbsp;
  
 
=== '''How to publish''' ===
 
=== '''How to publish''' ===
  
NCI is now providing web services to publish data and metadata. This include a [https://geonetwork.nci.org.au geonetwork catalogue] to describe your dates (i.e. a metadata repository) and provide links to other description and to the dataset access point. Once you have a geonetwork record, NCI can mint a DOI for the dataset, as for papers a DOI makes the dataset easy to cite. The files can be made accessible to the public by using their [https://dapds00.nci.org.au/thredds/catalog.html TDS catalogue]&nbsp;(THREDDS)'''''.'''''
+
&nbsp;
 
 
Up to now this was our main option to publish both metadata and the actual files. We are working towards offering an laternative for datasets which are not suitable for this collection. More details on these options will be added in this page soon!
 
  
 
[[Publishing_with_NCI|Publishing with NCI]]
 
[[Publishing_with_NCI|Publishing with NCI]]
Line 30: Line 42:
  
 
You will need to record there only the main information: author, title, doi and citation.
 
You will need to record there only the main information: author, title, doi and citation.
 +
 +
[[Category:Data]]

Revision as of 18:18, 24 March 2021

Recently journal editors have updated their data policy and now require that data relating to the submitted paper should be made available by the authors. From the AGU data policy : "..all data necessary to understand, evaluate, replicate, and build upon the reported research must be made available and accessible whenever possible ..."

The aim of this change in the policy is to satisfy the principle that someone reading the paper should be able to reproduce your experiment. Again form the AGU policy : " For the purposes of this policy, data include, but are not limited to, the following:

  • Data used to generate, or be displayed in, figures, graphs, plots, videos, animations, or tables in a paper.
  • New protocols or methods used to generate the data in a paper.
  • New code/computer software used to generate results or analyses reported in the paper.
  • Derived data products reported or described in a paper. "

There can be practical and even copyright limitations to do this, but these can be taken into account and it should not be an impediment to publication if properly documented. The JGR-Space Physics editor-in-chief has listed some of these challenges and clarified the scope of the policy on his blog. This include references to model data which apply to lots of the CLEX data as well.

The publishing policies page offer a list of data policies and requirements by publisher.

 

Where should I publish my data?

You usually have at least three options and there's not a straight answer it depends on what you're publishing and why.

CLEX Data Collection on NCI: this is the best place if you have actually produced the data yourself. We'll help you document your data and make it user friendly, it will be part of a climate data collection and so it will be easier to discover. NCI also has more storage capacity than other repositories and services which are designed around the netcdf format.

Institutional repository: this can be adeguate if you have a small dataset and it is really specific to your study or only a subset/post processing of another dataset. While institutions offer some data curation they usually won't check the data is well described, consistent and user friendly, so you will get a DOI for your record but no added value.

Zenodo, Figshare, Mendeley: these services are free and you can create your own account, create a record for your data and get a DOI fairly easily and quickly. You can also publish here different kind of materials. This can be useful if you want to publish some very specific data, for example code and data to produce a specific figure required to publish a paper. If you are publishing data that can be actually useful to others, then you want to make sure is FAIR (see above for a full description). Findable, which is harder when your record is not part of a discipline repository or collection or you haven't used keywords in an effective manner. Accessible, Interoperable and Reusable which means the data should have enough metadata, use standards, be properly described. Finally the data size is limited to 50 GB and you won't get any additional data services apart from HTTP download. This should be your last resort, still if you go this way please make sure you document your data properly. we are happy to provide support and review your record. We are also looking into creating  CLEX community in Zenodo to be able to collect in one place these data records.

If you're confused feel free to ask us we are always happy to provide advice and support and whatever you choose to do remember to report the title, citation and doi on Clever. Published datasets are part of the Centre KPIs and something we report to our funders. It will only take a couple of minutes.

Where should I publish my code?

More and more frequently you are reuqired to publish also your code by journal editors. This is important to make your research reproducible, in particular for the paper reviewers. It can also be an opportunity to share a successful code with other researchers. If your code is properly published and someone else uses it, then it is easier for them to cite you as the author. Like data, code represents part of your work and funders are starting looking at all research products not only papers when reviewing applications. Putting your code on GitHub or another version control service helps to keep track of the code, expose it to others and manage pontetial issue sans enhancements. But GitHub is not ideal if you want to pinpoint the code you used for a paper or to create some data. Zenodo is a platform that integrates well with GitHub to allow you to publish automatically every release. Zenodo will keep a snapshot in time of your code and assign a DOI to it. if you're not using GitHub you can simply upload your files directly in Zenodo. These are the reasons we started a Zenodo community for a CLEX Code Collection (CCC)in 2020. We published there some of our code and code used to produce papers as required by journal editors. We are now looking into broadening this and actively seek contributions of code, notebooks etc that might be useful to others. Zenodo has given our codes much more visibility than GitHub and some of the code has receveid lots of views and downloads. 

How to publish

 

Publishing with NCI

Reporting to CLEVER

Whichever way you decide to publish your data, as part of the nCI collection or with a repository provided by your institution, remember to add your published record to CLEVER the CLEX reporting hub, in the "Publications and Datasets" section.

You will need to record there only the main information: author, title, doi and citation.