Difference between revisions of "Requesting data"

 
(10 intermediate revisions by the same user not shown)
Line 1: Line 1:
  
{{Working on}}
+
== '''<span style="font-size:large;"><span style="font-family:Arial,Helvetica,sans-serif;">How to request data</span></span>''' ==
  
=== <span style="font-family:Arial,Helvetica,sans-serif;"><span style="font-size:medium;">'''Check if data is already available'''</span></span> ===
+
=== '''<span style="font-size:medium;"><span style="font-family:Arial,Helvetica,sans-serif;">Check if data is already available</span></span>''' ===
  
<span style="font-family:Arial,Helvetica,sans-serif;">This might seem obvious but we often get request for data that is already available at NCI. Not all the data is well described or easy to find and we will always double check once you email us with a request before downloading. Still it saves a lot of yours and our time if you look for the data you want first through the available documentation. See the[[Data_Access|Data Access]] page on this wiki for suggestions on how to do that.</span>
+
<span style="font-size:medium;"><span style="font-family:Arial,Helvetica,sans-serif;">It saves a lot of yours and our time if you look for the data you want first through the available documentation. See the&nbsp;[[Finding_datasets|Finding datasets]]&nbsp;page on this wiki for suggestions on how to do that.</span></span>
  
<span style="font-family:Arial,Helvetica,sans-serif;">It is also worth pointing out that we do not download anymore CMIP and ERA5 data, these are managed by NCI and so requests have to go to them. Both this wiki and documentation on [[Opus.nci.org.au|NCI confluence website]] will have information on how to do that.</span>
+
<span style="font-size:medium;"><span style="font-family:Arial,Helvetica,sans-serif;">It is worth pointing out that we do not download anymore [https://opus.nci.org.au/display/CMIP/CMIP+Community+Home CMIP] and [https://opus.nci.org.au/display/ERA5/ERA5+Community+Home ERA5] data, these are managed by NCI and so requests have to go to them. Refer to the&nbsp;NCI confluence website&nbsp;(CMIP and ERA5 links) on how to do that.</span></span>
  
<span style="font-size:medium;">'''<span style="font-family:Arial,Helvetica,sans-serif;">Detailing your request</span>'''</span>
+
=== <span style="font-size:medium;">'''<span style="font-family:Arial,Helvetica,sans-serif;">Detailing your request</span>'''</span> ===
  
The more detailed is your request, the faster we can tell you if we can download the data for you and, in case the answer is positive, the actual data download. Wherever possiible try to include:
+
<span style="font-size:medium;">The more detailed is your request, the faster we can give you an answer and, in case this is positive, the actual data download. Wherever possible try to include:</span>
  
*the actual dataset official name and/or acronym, avoid alternative or obsolete names  
+
*<span style="font-size:medium;">the actual dataset official name and/or acronym, avoid alternative or obsolete names;</span>
*the&nbsp;url of the&nbsp;official metadata record for the dataset  
+
*<span style="font-size:medium;">the&nbsp;url of the&nbsp;official metadata record for the dataset;</span>
*the&nbsp;url for the data portal from which the data is available for download, or for a set of instructions, if it is not directly available.
+
*<span style="font-size:medium;">the&nbsp;url for the data portal from which the data is available for download, or for a set of instructions, if it is not directly available;</span>
*if more than one version/format is available for the same data &nbsp;be specific about the version/format you need  
+
*<span style="font-size:medium;">if more than one version/format is available for the same data &nbsp;be specific about the version/format you need;</span>
*if a different version/format for the same data is already available at NCI, please detail&nbsp;why this version/format is not suitable  
+
*<span style="font-size:medium;">if a different version/format for the same data is already available at NCI, please detail&nbsp;why this version/format is not suitable;</span>
*the variables, frequency, and any other relavnt information (i.e. experiments, models, where this is a collection of datasets) of the subset you need  
+
*<span style="font-size:medium;">the variables, frequency, and any other relavnt information (i.e. experiments, models, where this is a collection of datasets) of the subset you need;</span>
*an estimate of the data subset size  
+
*<span style="font-size:medium;">an estimate of the data subset size.</span>
  
&nbsp;
+
=== <span style="font-size:medium;"><span style="font-family:Arial,Helvetica,sans-serif;">'''Mail CWS helpdesk'''</span></span> ===
  
=== <span style="font-family:Arial,Helvetica,sans-serif;"><span style="font-size:medium;">'''Mail CWS helpdesk'''</span></span> ===
+
<span style="font-size:medium;">Once you have worked the details of your request&nbsp;you need to contact the CWS helpdesk cws_help{at}nci.org.au. You are more likely to receive a quick answer and it helps us keep a track of your&nbsp;request.</span>
  
Once you have worked the details of your request&nbsp;you need to contact the CWS helpdesk cws_help{at}nci.org.au. You are more likely to receive a quick answer and it helps us keep a track of your&nbsp;request.
+
=== <span style="font-size:medium;">'''<span style="font-family:Arial,Helvetica,sans-serif;">What happens next</span>'''</span> ===
  
&nbsp;
+
<span style="font-size:medium;"><span style="font-family:Arial,Helvetica,sans-serif;">First, we will check if the data is already available.&nbsp;If we confirm it is not already available at NCI, we will:</span></span>
  
=== '''<span style="font-size:medium;"><span style="font-family:Arial,Helvetica,sans-serif;">What happens next</span></span>''' ===
+
*<span style="font-size:medium;"><span style="font-family:Arial,Helvetica,sans-serif;">assess the storage and work involved in downloading the data based on the information you sent:</span></span>
 +
**<span style="font-size:medium;"><span style="font-family:Arial,Helvetica,sans-serif;">if we don't have enough information, we will contact you and ask to clarify your request until we can assess it</span></span>
 +
**<span style="font-size:medium;"><span style="font-family:Arial,Helvetica,sans-serif;">if the data requires a substantial amount of work and/or storage we have to contact the Infrastructure Committee to request permission to proceed. The Committee will decide if the data is worth the effort and resources necessary.</span></span>  
 +
*<span style="font-size:medium;">In&nbsp;any other case we will get back to you with a timeframe and a location for the data</span>
 +
*<span style="font-size:medium;">We usually store new data in the ua8 project, unless there is a dedicated project for the dataset already available.</span>
  
<span style="font-family:Arial,Helvetica,sans-serif;"><span style="font-size:small;">First, we will check if the data is already available.&nbsp;If we confirm it is not already available at NCI we will:</span></span>
+
[[Category:Data induction]]
 
 
*<span style="font-family:Arial,Helvetica,sans-serif;"><span style="font-size:small;">asses the storage and work involved in donwloading the data based on the information you sent:</span></span>
 
**<span style="font-family:Arial,Helvetica,sans-serif;"><span style="font-size:small;">if we don't have enough information we will contact you and ask to clarify your request until we can asses it</span></span>
 
**<span style="font-family:Arial,Helvetica,sans-serif;"><span style="font-size:small;">if the data requires a substantial amount of work and/or storage we have to contact the Infrastructure Committee to request permission to proceed. The Committee will decide if the data is worth the effort and resources necessary.</span></span> 
 
*If any other case we will get back to you with a timeframe and a location for the data
 
*We usually store new data in the ua8 project, unless there is a dedicated project for the dataset already available.
 

Latest revision as of 20:35, 2 May 2022

How to request data

Check if data is already available

It saves a lot of yours and our time if you look for the data you want first through the available documentation. See the Finding datasets page on this wiki for suggestions on how to do that.

It is worth pointing out that we do not download anymore CMIP and ERA5 data, these are managed by NCI and so requests have to go to them. Refer to the NCI confluence website (CMIP and ERA5 links) on how to do that.

Detailing your request

The more detailed is your request, the faster we can give you an answer and, in case this is positive, the actual data download. Wherever possible try to include:

  • the actual dataset official name and/or acronym, avoid alternative or obsolete names;
  • the url of the official metadata record for the dataset;
  • the url for the data portal from which the data is available for download, or for a set of instructions, if it is not directly available;
  • if more than one version/format is available for the same data  be specific about the version/format you need;
  • if a different version/format for the same data is already available at NCI, please detail why this version/format is not suitable;
  • the variables, frequency, and any other relavnt information (i.e. experiments, models, where this is a collection of datasets) of the subset you need;
  • an estimate of the data subset size.

Mail CWS helpdesk

Once you have worked the details of your request you need to contact the CWS helpdesk cws_help{at}nci.org.au. You are more likely to receive a quick answer and it helps us keep a track of your request.

What happens next

First, we will check if the data is already available. If we confirm it is not already available at NCI, we will:

  • assess the storage and work involved in downloading the data based on the information you sent:
    • if we don't have enough information, we will contact you and ask to clarify your request until we can assess it
    • if the data requires a substantial amount of work and/or storage we have to contact the Infrastructure Committee to request permission to proceed. The Committee will decide if the data is worth the effort and resources necessary.
  • In any other case we will get back to you with a timeframe and a location for the data
  • We usually store new data in the ua8 project, unless there is a dedicated project for the dataset already available.