Difference between revisions of "ERA INTERIM"

(Imported from Wikispaces)
 
(11 intermediate revisions by the same user not shown)
Line 1: Line 1:
=====<span style="background-color: #ffffff;">A subset of the ERA Interim re-analysis data ranging from 1979 to date are now hosted on the RDSI fast disk storage at NCI. </span><span style="background-color: #ffffff; font-size: 1em; line-height: 1.5;">This subset is currently managed by the ARCCSS.</span>=====
 
<span style="background-color: #ffffff; font-size: 1em; line-height: 1.5;">The original grib data is downloaded from the [http://apps.ecmwf.int/datasets/data/interim_full_daily | ECMWF server] , it is regularly updated and specific variables are extracted and converted to monthly netcdf files.</span>
 
For more information on the original ECMWF collection visit their website: [https://confluence.ecmwf.int/display/CKB/What+is+ERA-Interim | ERA-Interim page]
 
  
===Known issues===  
+
<span style="background-color: #ffffff;">A subset of the ERA Interim re-analysis data ranging from 1979 to the 31st of August, when ERA Interim production ceased&nbsp;is replicated&nbsp;on NCI server. We downloaded t</span><span style="background-color: #ffffff; font-size: 1em; line-height: 1.5;">he original grib data from the [http://apps.ecmwf.int/datasets/data/interim_full_daily ECMWF server] ,&nbsp;specific variables were then extracted and converted to monthly netcdf files.</span> For more information on the original ECMWF collection visit their website: [https://confluence.ecmwf.int/display/CKB/What+is+ERA-Interim ERA-Interim page]
 +
 
 +
=== '''Known issues''' ===
  
 
There are several known quality issues for the ERA Interim data, here is a list of web resources:
 
There are several known quality issues for the ERA Interim data, here is a list of web resources:
  
[http://old.ecmwf.int/research/era/do/get/index/QualityIssues | ERA Interim quality issues from ECMWF website]
+
[http://old.ecmwf.int/research/era/do/get/index/QualityIssues ERA Interim quality issues from ECMWF website]
[https://climatedataguide.ucar.edu/climate-data/era-interim | Climate Data Guide page on ERAI]
+
 
[http://reanalyses.org/atmosphere/overview-current-reanalyses | Overview and re-analysis comparison from reanalyses.org]
+
[https://climatedataguide.ucar.edu/climate-data/era-interim Climate Data Guide page on ERAI]
 +
 
 +
[http://reanalyses.org/atmosphere/overview-current-reanalyses Overview and re-analysis comparison from reanalyses.org]
  
Some of these analysis are described more in depth in the following article: [http://onlinelibrary.wiley.com/doi/10.1002/qj.828/full | The ERA-Interim reanalysis: configuration and performance of the data assimilation system]
+
Some of these analysis are described more in depth in the following article: [http://onlinelibrary.wiley.com/doi/10.1002/qj.828/full The ERA-Interim reanalysis: configuration and performance of the data assimilation system]
  
We are listing all the issues we are aware of in this[[ERAI-issues | ERAI known issues page]], please feel free to contribute, we just provide the data as it is downloaded from the ECMWF server, we do checks to make sure that our data is the same as the original but we cannot run checks for the original data.
+
We are listing all the issues we are aware of in this&nbsp;[[ERAI-issues|ERAI known issues page]], please feel free to contribute, we just provide the data as it is downloaded from the ECMWF server, we do checks to make sure that our data is the same as the original but we cannot run checks for the original data.
  
We found very high values for temperature (~330-350 K) near the lower troposphere in the following periods:
+
We found very high values for temperature (~330-350 K) near the lower troposphere in the following periods: 26-31 Dec 2002 and 21-25 Dec 2003. Other variables might be similarly affected.
26-31 Dec 2002 and 21-25 Dec 2003. Other variables might be similarly affected.
 
  
===Data access===  
+
=== '''License''' ===
  
<span style="background-color: #ffffff;">To comply with the ECMWF licensing you need to be registered with the ECMWF to use this data.</span>
+
<span style="background-color: #ffffff;">To comply with the ECMWF licensing you need to be registered with the ECMWF to use this data.</span> <span style="background-color: #ffffff;">To register with the ECMWF you have to </span><span style="background-color: #ffffff; line-height: 1.5;">sign the </span>[http://apps.ecmwf.int/datasets/data/interim_full_daily/licence/ ERA Interim license and agreement]<span style="background-color: #ffffff; line-height: 1.5;">.</span><br/> <br/> <span style="background-color: #ffffff;">'''NB''': Users of the ECMWF data sets are requested to reference the source of the data in any publication, e.g. "''ECMWF ERA-Interim data used in this study/project have been provided by ECMWF/have been obtained from the ECMWF MARS Data Server on <date>''".</span><br/> You also have to add the following paper to your references:<br/> [https://rmets.onlinelibrary.wiley.com/doi/abs/10.1002/qj.828 https://rmets.onlinelibrary.wiley.com/doi/abs/10.1002/qj.828]
<span style="background-color: #ffffff;">To register with the ECMWF you have to </span><span style="background-color: #ffffff; line-height: 1.5;">sign the </span>[http://apps.ecmwf.int/datasets/data/interim_full_daily/licence/ | ERA Interim license and agreement]<span style="background-color: #ffffff; line-height: 1.5;">.</span>
 
<span style="background-color: #ffffff; line-height: 1.5;">You can then request to join the ub4 project by using your NCI account to access </span>[https://my.nci.org.au/ | My NCI portal] , and you can send <span style="background-color: #ffffff; line-height: 1.5;">an e-mail to </span><span style="line-height: 1.5;">[[mailto:climate_help@nf.nci.org.au | climate_help@nf.nci.org.au]] to let us know you have registered. Remember you have to register with ECMWF before.</span>
 
<span style="color: #ed0622;">Be aware that by requesting to be part of the ub4 project you are automatically agreeing to the ECMWF license agreement terms</span>'''<span style="background-color: #ffffff;">.</span>'''
 
  
<span style="background-color: #ffffff;">The data is available on raijin.nci.org.au under /g/data1/ub4/erai/grib for the grib version and /g/data1/ub4/erai/netcdf for the netcdf version.</span>
+
=== '''Data access''' ===
  
**<span style="color: #1134e8; font-size: 120%;">[https://docs.google.com/spreadsheets/d/1qnQC_Ki5IAwZPD9viV79tfenemPGoYWKfDa5vEwDl90/pubhtml | Data Inventory]</span>''': what's available and what we are downloading, regularly updated.
+
<span style="background-color: #ffffff; line-height: 1.5;">You can then request to join the ub4 project by using your NCI account to access </span>[https://my.nci.org.au/ My NCI portal].<br/> Remember you have to register with ECMWF before. <span style="color: #ed0622;">Be aware that by requesting to be part of the ub4 project you are automatically agreeing to the ECMWF license agreement terms</span>'''<span style="background-color: #ffffff;">.</span>'''
NB this is a new version of the inventory, includes more specific information on available variables.
 
Please note that we download all the available surface fields, both analysis and forecast, but we don't convert all of them to netcdf, unless they are specifically requested.
 
Just recently we added:
 
three surface forecast fields: r<span style="background-color: #ffffff; font-family: arial,sans,sans-serif;">unoff, surface thermal radiation downwards, convective available potential energy (CAPE);</span>
 
<span style="background-color: #ffffff; font-family: arial,sans,sans-serif;">three surface analysis fields: total cloud cover, high cloud cover, low cloud cover</span>
 
  
<span style="background-color: #ffffff;">The netcdf version is organized in monthly field files while the grib files have all the fields in one file.</span>
+
&nbsp;
  
<span style="background-color: #ffffff;">The current netcdf version was released in April 2015, the previous version is still available in the ua8 project. If you request access before we moved the data to ub4, you are already part of ub4 and don't need to request access again to use the data. The older version is not updated anymore and we'll be kept there only for users who are at the ned of their project. In January 2016 we'll definitely delete the older version and anyone is encouraged to move to ub4, since this version is updated and is a better data product overall.</span>
+
<span style="background-color: #ffffff;">The data is available on gadi under /g/data/ub4/erai/grib for the grib version and /g/data/ub4/erai/netcdf for the netcdf version.</span>
  
**<span style="background-color: #ffffff;">Monthly averaged fields</span>'''
+
*<span style="color: #1134e8; font-size: 120%;">[https://docs.google.com/spreadsheets/d/1qnQC_Ki5IAwZPD9viV79tfenemPGoYWKfDa5vEwDl90/pubhtml Data Inventory]</span>''': what's available for each level'''
We stopped updating the monthly fields since the BoM has a copy of the same in their rr7 project. You can request access as for the other NCI projects on <span style="line-height: 1.5;">[https://my.nci.org.au/ | My NCI portal] . Anyone working with the ARCCSS should be granted access, if you are having trouble getting access or the data you are looking for is not available there, let us know by e-mailing climate_help@nci.org.au .</span>
 
  
===<span style="background-color: #ffffff;">ERAI netcdf v1.0</span>===
+
NB &nbsp;that we download all the available surface fields, both analysis and forecast, but we don't convert all of them to netcdf, unless they are specifically requested.
  
<span class="s1">The main differences with the previous version, which is still available in the ua8 project, are:</span>
+
<span style="background-color: #ffffff;">The netcdf version is organized in monthly field files, while the grib files have all the fields in one file.</span>
 +
 
 +
<span style="background-color: #ffffff;">The current netcdf version was released in April 2015, the previous version hosted on ua8 is not anymore available. If you requested access before we moved the data to ub4, you are already part of ub4 and don't need to request access again to use the data.</span>
 +
 
 +
<span style="background-color: #ffffff;">Monthly averaged fields are available in ua8 as part of the CREATE-IP temporary copy hosted in</span>
 +
 
 +
/g/data/ua8/synda/CREATE-IP/reanalysis/ECMWF/ERA-Interim/
 +
 
 +
This is a temporary replica , eventually NCI should provide the CREATE-IP dataset in a different project, see this [[Data_projects_update|page]] for details.
 +
 
 +
<span style="line-height: 1.5;">If you are having trouble getting access or the data you are looking for is not available there, let us know by e-mailing cws_help<at>nci.org.au .</span>
 +
 
 +
=== '''<span style="background-color: #ffffff;">Current netcdf version: v1.0</span>''' ===
 +
 
 +
<span class="s1">The main differences with the previous version, which is not anymore available, are:</span>
  
 
<span class="s1">1) netcdf4 format with compression, which allowed us to get rid of scale and offset</span>
 
<span class="s1">1) netcdf4 format with compression, which allowed us to get rid of scale and offset</span>
Line 49: Line 54:
 
<span class="s1">2) new filenames of the form <varname>_<frequency>_ERAI_historical_<level>_<from-date>_<to-date>.nc</span>
 
<span class="s1">2) new filenames of the form <varname>_<frequency>_ERAI_historical_<level>_<from-date>_<to-date>.nc</span>
  
<span class="s1"> Ex. : ta_6hrs_ERAI_historical_an-ml_20110101_20110131.nc</span>
+
<span class="s1">Ex.&nbsp;: ta_6hrs_ERAI_historical_an-ml_20110101_20110131.nc</span>
  
 
<span class="s1">3) some of the variable names have changed to the correspondent CMIP5 standard name, all names are now lower cases</span>
 
<span class="s1">3) some of the variable names have changed to the correspondent CMIP5 standard name, all names are now lower cases</span>
  
<span class="s1">4) new directory structure /g/data1/ub4/erai/netcdf/<frequency>/<realm>/<level>/<version>/<variable>/files…</span>
+
<span class="s1">4) new directory structure /g/data/ub4/erai/netcdf/<frequency>/<realm>/<level>/<version>/<variable>/files…</span>
  
<span class="s1"> Ex. : /g/data1/ub4/erai/netcdf/6hr/atmos/oper_an_ml/v01/ta/</span>
+
<span class="s1">Ex.&nbsp;: /g/data/ub4/erai/netcdf/6hr/atmos/oper_an_ml/v01/ta/</span>
  
 
<span class="s1">5) there is more metadata information in the files, in particular we adopted CF conventions and added standard_names wherever possible. We also added an attribute MD5 checksum for each variable, this can be use to check for data corruption by using ncks or nco</span>
 
<span class="s1">5) there is more metadata information in the files, in particular we adopted CF conventions and added standard_names wherever possible. We also added an attribute MD5 checksum for each variable, this can be use to check for data corruption by using ncks or nco</span>
  
<span class="s1">The reason for the changes in filenames and directory structure is to make both as much as possible compliant to the CMIP5 filename and DRS standards, so the entire collection could be accessible from the CWSlab. This should be tested and implemented in the next few months.</span>
+
<span class="s1">Most of the data will be in the "atmos" realm, with the exception of ERAI land which is is "land" and few ocean fields. And most files have "6hr" frequency, with the exception of the forecast data which has "3hr" frequency.</span>
 
 
<span class="s1">Most of the data will be in the "atmos" realm, with the exception of ERAI land which is is "land" and few ocean fields. And most files have "6hr" frequency, with the exception of the forecast data which has "3hr" frequency. </span>
 
 
 
<span class="s1">We will soon add documentation on the exact procedure we use to create the netcdf files from the original grib.</span>
 
 
 
===<span style="background-color: #ffffff;">Old version on ua8</span>===
 
  
<span style="background-color: #ffffff;">The two slides below from a presentation given at the 1st ARCCSS winter school show the filenames and directory structure for the old version:</span>
+
<span class="s1">The scripts used to convert the files from grib to netcdf are available in the checks github.<br/> [https://github.com/coecms/CollectionsScripts https://github.com/coecms/CollectionsScripts]</span>
  
<span style="background-color: #ffffff; line-height: 0px; overflow: hidden;">[[File:Slide1.jpg|648x486px]]</span>
+
&nbsp;
  
<span style="background-color: #ffffff;">'''NB''': Users of the ECMWF data sets are requested to reference the source of the data in any publication, e.g. "''ECMWF ERA-Interim data used in this study/project have been provided by ECMWF/have been obtained from the ECMWF Data Server''".</span>
+
[[Category:Dataset]] [[Category:Clex-managed-data]]

Revision as of 21:20, 3 June 2021

A subset of the ERA Interim re-analysis data ranging from 1979 to the 31st of August, when ERA Interim production ceased is replicated on NCI server. We downloaded the original grib data from the ECMWF server , specific variables were then extracted and converted to monthly netcdf files. For more information on the original ECMWF collection visit their website: ERA-Interim page

Known issues

There are several known quality issues for the ERA Interim data, here is a list of web resources:

ERA Interim quality issues from ECMWF website

Climate Data Guide page on ERAI

Overview and re-analysis comparison from reanalyses.org

Some of these analysis are described more in depth in the following article: The ERA-Interim reanalysis: configuration and performance of the data assimilation system

We are listing all the issues we are aware of in this ERAI known issues page, please feel free to contribute, we just provide the data as it is downloaded from the ECMWF server, we do checks to make sure that our data is the same as the original but we cannot run checks for the original data.

We found very high values for temperature (~330-350 K) near the lower troposphere in the following periods: 26-31 Dec 2002 and 21-25 Dec 2003. Other variables might be similarly affected.

License

To comply with the ECMWF licensing you need to be registered with the ECMWF to use this data. To register with the ECMWF you have to sign the ERA Interim license and agreement.

NB: Users of the ECMWF data sets are requested to reference the source of the data in any publication, e.g. "ECMWF ERA-Interim data used in this study/project have been provided by ECMWF/have been obtained from the ECMWF MARS Data Server on <date>".
You also have to add the following paper to your references:
https://rmets.onlinelibrary.wiley.com/doi/abs/10.1002/qj.828

Data access

You can then request to join the ub4 project by using your NCI account to access My NCI portal.
Remember you have to register with ECMWF before. Be aware that by requesting to be part of the ub4 project you are automatically agreeing to the ECMWF license agreement terms.

 

The data is available on gadi under /g/data/ub4/erai/grib for the grib version and /g/data/ub4/erai/netcdf for the netcdf version.

NB  that we download all the available surface fields, both analysis and forecast, but we don't convert all of them to netcdf, unless they are specifically requested.

The netcdf version is organized in monthly field files, while the grib files have all the fields in one file.

The current netcdf version was released in April 2015, the previous version hosted on ua8 is not anymore available. If you requested access before we moved the data to ub4, you are already part of ub4 and don't need to request access again to use the data.

Monthly averaged fields are available in ua8 as part of the CREATE-IP temporary copy hosted in

/g/data/ua8/synda/CREATE-IP/reanalysis/ECMWF/ERA-Interim/

This is a temporary replica , eventually NCI should provide the CREATE-IP dataset in a different project, see this page for details.

If you are having trouble getting access or the data you are looking for is not available there, let us know by e-mailing cws_help<at>nci.org.au .

Current netcdf version: v1.0

The main differences with the previous version, which is not anymore available, are:

1) netcdf4 format with compression, which allowed us to get rid of scale and offset

2) new filenames of the form <varname>_<frequency>_ERAI_historical_<level>_<from-date>_<to-date>.nc

Ex. : ta_6hrs_ERAI_historical_an-ml_20110101_20110131.nc

3) some of the variable names have changed to the correspondent CMIP5 standard name, all names are now lower cases

4) new directory structure /g/data/ub4/erai/netcdf/<frequency>/<realm>/<level>/<version>/<variable>/files…

Ex. : /g/data/ub4/erai/netcdf/6hr/atmos/oper_an_ml/v01/ta/

5) there is more metadata information in the files, in particular we adopted CF conventions and added standard_names wherever possible. We also added an attribute MD5 checksum for each variable, this can be use to check for data corruption by using ncks or nco

Most of the data will be in the "atmos" realm, with the exception of ERAI land which is is "land" and few ocean fields. And most files have "6hr" frequency, with the exception of the forecast data which has "3hr" frequency.

The scripts used to convert the files from grib to netcdf are available in the checks github.
https://github.com/coecms/CollectionsScripts