Difference between revisions of "ERA INTERIM"

m
 
(One intermediate revision by the same user not shown)
Line 1: Line 1:
  
<span style="background-color: #ffffff;">A subset of the ERA Interim re-analysis data ranging from 1979 to date are now hosted on the RDSI fast disk storage at NCI. </span><span style="background-color: #ffffff; font-size: 1em; line-height: 1.5;">This subset is currently managed by the ARCCSS.</span> <span style="background-color: #ffffff; font-size: 1em; line-height: 1.5;">The original grib data is downloaded from the [http://apps.ecmwf.int/datasets/data/interim_full_daily ECMWF server] , it is regularly updated and specific variables are extracted and converted to monthly netcdf files.</span> For more information on the original ECMWF collection visit their website: [https://confluence.ecmwf.int/display/CKB/What+is+ERA-Interim ERA-Interim page]
+
<span style="background-color: #ffffff;">A subset of the ERA Interim re-analysis data ranging from 1979 to the 31st of August, when ERA Interim production ceased&nbsp;is replicated&nbsp;on NCI server. We downloaded t</span><span style="background-color: #ffffff; font-size: 1em; line-height: 1.5;">he original grib data from the [http://apps.ecmwf.int/datasets/data/interim_full_daily ECMWF server] ,&nbsp;specific variables were then extracted and converted to monthly netcdf files.</span> For more information on the original ECMWF collection visit their website: [https://confluence.ecmwf.int/display/CKB/What+is+ERA-Interim ERA-Interim page]
 
 
<span style="background-color:#f1c40f;">ERA-Interim production will cease on the 31st of August 2019,</span> so the complete span of ERA-Interim data will be from 1st January 1979 to 31st August 2019. Keeping in mind that ERA-Interim is published '''with an offset of about three months''' from the dataset's reference date, ERA-Interim August 2019 data will be made available towards the end of 2019.
 
  
 
=== '''Known issues''' ===
 
=== '''Known issues''' ===
Line 19: Line 17:
  
 
We found very high values for temperature (~330-350 K) near the lower troposphere in the following periods: 26-31 Dec 2002 and 21-25 Dec 2003. Other variables might be similarly affected.
 
We found very high values for temperature (~330-350 K) near the lower troposphere in the following periods: 26-31 Dec 2002 and 21-25 Dec 2003. Other variables might be similarly affected.
 +
  
 
=== '''License''' ===
 
=== '''License''' ===
  
<span style="background-color: #ffffff;">To comply with the ECMWF licensing you need to be registered with the ECMWF to use this data.</span> <span style="background-color: #ffffff;">To register with the ECMWF you have to </span><span style="background-color: #ffffff; line-height: 1.5;">sign the </span>[http://apps.ecmwf.int/datasets/data/interim_full_daily/licence/ ERA Interim license and agreement]<span style="background-color: #ffffff; line-height: 1.5;">.</span><br/> <br/> <span style="background-color: #ffffff;">'''NB''': Users of the ECMWF data sets are requested to reference the source of the data in any publication, e.g. "''ECMWF ERA-Interim data used in this study/project have been provided by ECMWF/have been obtained from the ECMWF MARS Data Server on <date>''".</span><br/> You also have to add the following paper to your references:<br/> [https://rmets.onlinelibrary.wiley.com/doi/abs/10.1002/qj.828 https://rmets.onlinelibrary.wiley.com/doi/abs/10.1002/qj.828]
+
<span style="background-color: #ffffff;">To comply with the ECMWF licensing you need to be registered with the ECMWF to use this data.</span> <span style="background-color: #ffffff;">To register with the ECMWF you have to </span><span style="background-color: #ffffff; line-height: 1.5;">sign the </span>[http://apps.ecmwf.int/datasets/data/interim_full_daily/licence/ ERA Interim license and agreement]<span style="background-color: #ffffff; line-height: 1.5;">.</span><br/> &nbsp;
 +
 
 +
To acknowledge the ERA-Interim data, please refer to the ERA-Interim&nbsp;[https://apps.ecmwf.int/datasets/data/interim-full-daily/licence/ licence]&nbsp;for details on the wording to use.
 +
 
 +
To cite the source of the data, you may use the following data citation (as part of the bibliography):
 +
 
 +
European Centre for Medium-range Weather Forecast (ECMWF) (2011): The ERA-Interim reanalysis dataset, Copernicus Climate Change Service (C3S) (accessed&nbsp;''<insert date of access here''>), available from&nbsp;[https://www.ecmwf.int/en/forecasts/datasets/archive-datasets/reanalysis-datasets/era-interim https://www.ecmwf.int/en/forecasts/datasets/archive-datasets/reanalysis-datasets/era-interim]&nbsp;
 +
 
 +
If no specific advice is given by the journals, it is usually recommended that the above data citation is put in the acknowledgements section.
 +
 
 +
<br/> You also have to add the following paper to your references:<br/> [https://rmets.onlinelibrary.wiley.com/doi/abs/10.1002/qj.828 https://rmets.onlinelibrary.wiley.com/doi/abs/10.1002/qj.828]
  
 
=== '''Data access''' ===
 
=== '''Data access''' ===
  
<span style="background-color: #ffffff; line-height: 1.5;">You can then request to join the ub4 project by using your NCI account to access </span>[https://my.nci.org.au/ | My NCI portal].<br/> Remember you have to register with ECMWF before. <span style="color: #ed0622;">Be aware that by requesting to be part of the ub4 project you are automatically agreeing to the ECMWF license agreement terms</span>'''<span style="background-color: #ffffff;">.</span>'''
+
<span style="background-color: #ffffff; line-height: 1.5;">You can then request to join the ub4 project by using your NCI account to access </span>[https://my.nci.org.au/ My NCI portal].<br/> Remember you have to register with ECMWF before. <span style="color: #ed0622;">Be aware that by requesting to be part of the ub4 project you are automatically agreeing to the ECMWF license agreement terms</span>'''<span style="background-color: #ffffff;">.</span>'''
  
 
&nbsp;
 
&nbsp;
  
<span style="background-color: #ffffff;">The data is available on raijin.nci.org.au under /g/data1/ub4/erai/grib for the grib version and /g/data1/ub4/erai/netcdf for the netcdf version.</span>
+
<span style="background-color: #ffffff;">The data is available on gadi under /g/data/ub4/erai/grib for the grib version and /g/data/ub4/erai/netcdf for the netcdf version.</span>
  
*<span style="color: #1134e8; font-size: 120%;">[https://docs.google.com/spreadsheets/d/1qnQC_Ki5IAwZPD9viV79tfenemPGoYWKfDa5vEwDl90/pubhtml Data Inventory]</span>''': what's available and what we are downloading, regularly updated.'''  
+
*<span style="color: #1134e8; font-size: 120%;">[https://docs.google.com/spreadsheets/d/1qnQC_Ki5IAwZPD9viV79tfenemPGoYWKfDa5vEwDl90/pubhtml Data Inventory]</span>''': what's available for each level'''  
  
NB this is a new version of the inventory, includes more specific information on available variables. Please note that we download all the available surface fields, both analysis and forecast, but we don't convert all of them to netcdf, unless they are specifically requested. Just recently we added: three surface forecast fields: r<span style="background-color: #ffffff; font-family: arial,sans,sans-serif;">unoff, surface thermal radiation downwards, convective available potential energy (CAPE);</span> <span style="background-color: #ffffff; font-family: arial,sans,sans-serif;">three surface analysis fields: total cloud cover, high cloud cover, low cloud cover</span>
+
NB &nbsp;that we download all the available surface fields, both analysis and forecast, but we don't convert all of them to netcdf, unless they are specifically requested.
  
<span style="background-color: #ffffff;">The netcdf version is organized in monthly field files while the grib files have all the fields in one file.</span>
+
<span style="background-color: #ffffff;">The netcdf version is organized in monthly field files, while the grib files have all the fields in one file.</span>
  
<span style="background-color: #ffffff;">The current netcdf version was released in April 2015, the previous version is still available in the ua8 project. If you request access before we moved the data to ub4, you are already part of ub4 and don't need to request access again to use the data. The older version is not updated anymore and we'll be kept there only for users who are at the ned of their project. In January 2016 we'll definitely delete the older version and anyone is encouraged to move to ub4, since this version is updated and is a better data product overall.</span>
+
<span style="background-color: #ffffff;">The current netcdf version was released in April 2015, the previous version hosted on ua8 is not anymore available. If you requested access before we moved the data to ub4, you are already part of ub4 and don't need to request access again to use the data.</span>
  
*<span style="background-color: #ffffff;">Monthly averaged fields</span>  
+
<span style="background-color: #ffffff;">Monthly averaged fields are available in ua8 as part of the CREATE-IP temporary copy hosted in</span>
  
We stopped updating the monthly fields since the BoM has a copy of the same in their rr7 project. You can request access as for the other NCI projects on <span style="line-height: 1.5;">[https://my.nci.org.au/ My NCI portal] . Anyone working with the ARCCSS should be granted access, if you are having trouble getting access or the data you are looking for is not available there, let us know by e-mailing climate_help@nci.org.au .</span>
+
/g/data/ua8/synda/CREATE-IP/reanalysis/ECMWF/ERA-Interim/
 +
 
 +
This is a temporary replica , eventually NCI should provide the CREATE-IP dataset in a different project, see this [[Data_projects_update|page]] for details.
 +
 
 +
<span style="line-height: 1.5;">If you are having trouble getting access or the data you are looking for is not available there, let us know by e-mailing cws_help<at>nci.org.au .</span>
  
 
=== '''<span style="background-color: #ffffff;">Current netcdf version: v1.0</span>''' ===
 
=== '''<span style="background-color: #ffffff;">Current netcdf version: v1.0</span>''' ===
Line 56: Line 69:
 
<span class="s1">3) some of the variable names have changed to the correspondent CMIP5 standard name, all names are now lower cases</span>
 
<span class="s1">3) some of the variable names have changed to the correspondent CMIP5 standard name, all names are now lower cases</span>
  
<span class="s1">4) new directory structure /g/data1/ub4/erai/netcdf/<frequency>/<realm>/<level>/<version>/<variable>/files…</span>
+
<span class="s1">4) new directory structure /g/data/ub4/erai/netcdf/<frequency>/<realm>/<level>/<version>/<variable>/files…</span>
  
<span class="s1">Ex.&nbsp;: /g/data1/ub4/erai/netcdf/6hr/atmos/oper_an_ml/v01/ta/</span>
+
<span class="s1">Ex.&nbsp;: /g/data/ub4/erai/netcdf/6hr/atmos/oper_an_ml/v01/ta/</span>
  
 
<span class="s1">5) there is more metadata information in the files, in particular we adopted CF conventions and added standard_names wherever possible. We also added an attribute MD5 checksum for each variable, this can be use to check for data corruption by using ncks or nco</span>
 
<span class="s1">5) there is more metadata information in the files, in particular we adopted CF conventions and added standard_names wherever possible. We also added an attribute MD5 checksum for each variable, this can be use to check for data corruption by using ncks or nco</span>
 
<span class="s1">The reason for the changes in filenames and directory structure is to make both as much as possible compliant to the CMIP5 filename and DRS standards, so the entire collection could be accessible from the CWSlab. This should be tested and implemented in the next few months.</span>
 
  
 
<span class="s1">Most of the data will be in the "atmos" realm, with the exception of ERAI land which is is "land" and few ocean fields. And most files have "6hr" frequency, with the exception of the forecast data which has "3hr" frequency.</span>
 
<span class="s1">Most of the data will be in the "atmos" realm, with the exception of ERAI land which is is "land" and few ocean fields. And most files have "6hr" frequency, with the exception of the forecast data which has "3hr" frequency.</span>
Line 70: Line 81:
 
&nbsp;
 
&nbsp;
  
[[Category:Dataset]][[Category:Clex-managed-data]]
+
[[Category:Dataset]] [[Category:Clex-managed-data]]

Latest revision as of 23:30, 25 August 2022

A subset of the ERA Interim re-analysis data ranging from 1979 to the 31st of August, when ERA Interim production ceased is replicated on NCI server. We downloaded the original grib data from the ECMWF server , specific variables were then extracted and converted to monthly netcdf files. For more information on the original ECMWF collection visit their website: ERA-Interim page

Known issues

There are several known quality issues for the ERA Interim data, here is a list of web resources:

ERA Interim quality issues from ECMWF website

Climate Data Guide page on ERAI

Overview and re-analysis comparison from reanalyses.org

Some of these analysis are described more in depth in the following article: The ERA-Interim reanalysis: configuration and performance of the data assimilation system

We are listing all the issues we are aware of in this ERAI known issues page, please feel free to contribute, we just provide the data as it is downloaded from the ECMWF server, we do checks to make sure that our data is the same as the original but we cannot run checks for the original data.

We found very high values for temperature (~330-350 K) near the lower troposphere in the following periods: 26-31 Dec 2002 and 21-25 Dec 2003. Other variables might be similarly affected.


License

To comply with the ECMWF licensing you need to be registered with the ECMWF to use this data. To register with the ECMWF you have to sign the ERA Interim license and agreement.
 

To acknowledge the ERA-Interim data, please refer to the ERA-Interim licence for details on the wording to use.

To cite the source of the data, you may use the following data citation (as part of the bibliography):

European Centre for Medium-range Weather Forecast (ECMWF) (2011): The ERA-Interim reanalysis dataset, Copernicus Climate Change Service (C3S) (accessed <insert date of access here>), available from https://www.ecmwf.int/en/forecasts/datasets/archive-datasets/reanalysis-datasets/era-interim 

If no specific advice is given by the journals, it is usually recommended that the above data citation is put in the acknowledgements section.


You also have to add the following paper to your references:
https://rmets.onlinelibrary.wiley.com/doi/abs/10.1002/qj.828

Data access

You can then request to join the ub4 project by using your NCI account to access My NCI portal.
Remember you have to register with ECMWF before. Be aware that by requesting to be part of the ub4 project you are automatically agreeing to the ECMWF license agreement terms.

 

The data is available on gadi under /g/data/ub4/erai/grib for the grib version and /g/data/ub4/erai/netcdf for the netcdf version.

NB  that we download all the available surface fields, both analysis and forecast, but we don't convert all of them to netcdf, unless they are specifically requested.

The netcdf version is organized in monthly field files, while the grib files have all the fields in one file.

The current netcdf version was released in April 2015, the previous version hosted on ua8 is not anymore available. If you requested access before we moved the data to ub4, you are already part of ub4 and don't need to request access again to use the data.

Monthly averaged fields are available in ua8 as part of the CREATE-IP temporary copy hosted in

/g/data/ua8/synda/CREATE-IP/reanalysis/ECMWF/ERA-Interim/

This is a temporary replica , eventually NCI should provide the CREATE-IP dataset in a different project, see this page for details.

If you are having trouble getting access or the data you are looking for is not available there, let us know by e-mailing cws_help<at>nci.org.au .

Current netcdf version: v1.0

The main differences with the previous version, which is not anymore available, are:

1) netcdf4 format with compression, which allowed us to get rid of scale and offset

2) new filenames of the form <varname>_<frequency>_ERAI_historical_<level>_<from-date>_<to-date>.nc

Ex. : ta_6hrs_ERAI_historical_an-ml_20110101_20110131.nc

3) some of the variable names have changed to the correspondent CMIP5 standard name, all names are now lower cases

4) new directory structure /g/data/ub4/erai/netcdf/<frequency>/<realm>/<level>/<version>/<variable>/files…

Ex. : /g/data/ub4/erai/netcdf/6hr/atmos/oper_an_ml/v01/ta/

5) there is more metadata information in the files, in particular we adopted CF conventions and added standard_names wherever possible. We also added an attribute MD5 checksum for each variable, this can be use to check for data corruption by using ncks or nco

Most of the data will be in the "atmos" realm, with the exception of ERAI land which is is "land" and few ocean fields. And most files have "6hr" frequency, with the exception of the forecast data which has "3hr" frequency.

The scripts used to convert the files from grib to netcdf are available in the checks github.
https://github.com/coecms/CollectionsScripts