Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Example pipeline for ERA5 #22

Open
alxmrs opened this issue Mar 16, 2021 · 1 comment
Open

Example pipeline for ERA5 #22

alxmrs opened this issue Mar 16, 2021 · 1 comment

Comments

@alxmrs
Copy link

alxmrs commented Mar 16, 2021

Source Dataset

  • Link to the website / online documentation for the data
  • The file format (e.g. netCDF, csv)
    • netCDF, Grib2
  • How are the source files organized? (e.g. one file per day)
    • Data is accessible by querying the MARS API from ECMWF (docs here).
    • Data is accessible from the MARS API via a selection DSL (examples, syntax)
  • Any special steps required to access the data (e.g. password required)
    • ECMWF keys are required (docs).
    • Copernicus (CDS) provides fast access to ERA5.
    • If data is not listed in CDS, ECMWF provides slow access to the data, which is stored on tape drives. Care needs to be taken to access data in this archival format. Consider reviewing the retrieval documentation, especially the "retrieval efficiency" section.
    • ERA5 files can be quite large (~1GB in size per query). Downloading jobs should be partitioned (select smaller subsets of the overall data).

Transformation / Alignment / Merging

Output Dataset

Ideally, the datasets would be converted into a single Zarr with a 10-100MB chunk size.

@rabernat
Copy link
Contributor

rabernat commented Apr 6, 2021

Thanks for sharing this @alxmrs! ERA5 is definitely on our roadmap. We have a formal partnership with ECMWF around this. But it is a big job! We will keep you updated.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants