The objective of this training course is to increase scientists’ expertise on scientific data analysis at scale applied to climate and weather domains, using high-performance data analytics tools available from the open source market (i.e., Ophidia). The training covers from simple analytics tasks to workflows and applications (e.g., Python-based) and provides best practices and guidelines on dealing with massive scientific datasets on HPC architectures. The training foresees hands-on exercises carried out through Jupyter Notebooks.
Topics:
- Big data
- Introduction to scientific data management
- Scientific data analytics at scale
- Analytics workflows for eScience
- Big Data in HPC: High-performance data management
- Open-source High Performance Data Analytics (HPDA) tools
Audience:
The training addresses several vertical training levels (e.g., intermediate, expert) from different horizontal perspectives (e.g., end-user, developer, administrator).
Schedule:
This training was planned as three 8-hour online training courses in 2020, 2021 and 2022, over four consecutive weeks, with 2 hours of work per week required for the participants.
- 6 October to 3rd November 2020: Please visit our event page or take a look at the success story of this first training.
- 13 to 16 September 2021: Please visit our event page.
- 6 to 9 September 2022: Please visit our event page.
All three training editions have been completed.
Our Open Educational Resources (OER) training material for High Performance Data Analytics (HPDA) is available on the OER Commons page.
For a recording of the training sessions delivered in 2021, have a look at this playlist on YouTube.
Contact Persons: Donatello Elia, CMCC and Florian Ziemen, DKRZ