RSEHPC@ISC25 Workshop: Tools and Techniques for Continuous Integration and Benchmarking, 13th June 2025, ISC, Hamburg
Motivation
It has long been understood that there is a strong overlap between the fields of HPC and RSE. Although the two are not entirely congruent, the techniques used and the communities are closely interlinked. For the second time an RSEHPC workshop will be held at ISC.
When developing research software, it is often relevant to track its performance over time. It is even vital when targeting HPC architectures. Changes to the software itself, the used toolchains, or the system setup should not compromise how fast users obtain their results. Ideally, performance or scalability should only ever increase. Hence benchmarking should be an integral part of testing, in particular for HPC codes. At the same time, up-to-date benchmarks that are publicly available can advertise the code and the machines running them, informing users how to set-up the software in the most ideal way or whether they are achieving the expected performance. To limit the burden on developers, the aforementioned steps should be automated within continuous integration (CI) practices, introducing continuous benchmarking (CB) to it. For HPC, an added complexity is the requirement of more than the usual CI backends, with access to longer running steps, more resources than available on a single node, and a diverse range of architectures that the software needs to be tested on.
In this workshop, participants will be able to exchange ideas, approaches, good practices, but also obstacles on the way to continuous benchmarking on HPC systems. Driven by three keynote talks which will set the stage (see below), participants are encouraged to share their own experiences, both from a users and from an operations perspective. To foster this exchange, we foresee the second part of the workshop to provide a stage for participants to contribute their expertise, and experience with lightning talks. For the lightning talks a light-weight submission will be set up and reviewed by well known international RSE and HPC experts. In addition, we foresee a part where open questions to the audience will be posed, helping everyone to add their thoughts to this event.
Topics include, but are not limited to:
- User’s perspective: How do users experience the existing solutions? Is there something missing, from technological solutions to convenience?
- Operations’ perspective: What does it take for hosting sites to start and to maintain an CI/CB infrastructure? What are benefits, what are burdens?
- Success stories and scary tales: where did CI/CB help to identify problems? Where could it have helped? And what did hinder the adaptation of CI/CB?
- Past, present, future: what are lessons learned and what does a wishlist look like?
The RSEHPC@ISC25 will be held in-person on 13th June 2025, 2pm-6pm in Hamburg.
Timetable
Time | Topic | Speaker |
---|---|---|
2:00 pm | Welcome and introductions | René Caspart and Robert Speck |
2:05 pm | The past, the present and the future of Cx at NHR@KIT and KIT | Michele Mesiti, Scientific Computing Centre, Karlsruhe Institute of Technology, Germany |
2:25 pm | The Impact of CI/CD/CB during JUPITER’s Ascension | Jayesh Badwaik, Jülich Supercomputing Centre, Forschungszentrum Jülich GmbH, Germany |
2:45 pm | Utilization of Cx in Fugaku and FugakuNEXT | Hitoshi Murai, RIKEN Center for Computational Science, Japan |
3:05 pm - 4:00 pm | 5 Lightning Talks | |
UK Living Benchmarks | Andy Turner, EPCC - The University of Edinburgh, United Kingdom | |
Collaborative benchmarking | Olga Pearce, Lawrence Livermore National Laboratory, USA | |
A CI/CB Setup for a Quantum Transport Code | Christoph Conrads, Jülich Supercomputing Centre, Forschungszentrum Jülich GmbH, Germany | |
Continuous Transpilation with Coccinelle and AutoBench | Michele Martone, Leibniz Supercomputing Centre, Germany | |
Enhancing HPC Operations: Linking JUBE, LLview, and GitLab for Continuous Insight into System Health | Thomas Breuer, Jülich Supercomputing Centre, Forschungszentrum Jülich GmbH, Germany | |
4:00 pm | Coffee Break | |
4:30 pm - 5:25 pm | 5 Lightning talks | |
bwRSE4HPC - Regional RSE Support for HPC Users | Marcel Koch, Karlsruhe Institute of Technology, Germany | |
Development Containers for HPC | Tobias Ribizel, TU Munich, Germany | |
Continuous evaluation of Performance-Portabilty tradeoff in Containerized applications using Jacamar CI | Varun Sudharshnam, Helmholtz-Zentrum Dresden-Rossendorf, Germany | |
Smarter testing in the CI - How-to get great coverage but skip thousands of build configurations | Simeon Ehrig, Center for Advanced Systems Understanding, Germany | |
Experience from continuous validation for a open-source CFD software | Fabian Schlegel, Helmholtz-Zentrum Dresden-Rossendorf, Germany | |
5:25 pm - 6:00 pm | Open Floor and Discussion |
Keynotes
The three keynotes will set the stage for the workshop, showing was has been achieved so far and how Cx plays a vital role for running, procuring and planning HPC machines.
The past, the present and the future of Cx at NHR@KIT and KIT
Speaker: Michele Mesiti, Scientific Computing Centre, Karlsruhe Institute of Technology, Germany
Abstract: Since 2021 the German National High Performance Computing Center at Karlsruhe Institute of Technology (NHR@KIT) has been offering and advertising the possibility of running Cx workloads, including the possibility for containerised workflows, on our diverse HPC infrastructure, leveraging the customizability of the GitLab CI/CD ecosystem. In this talk we will discuss the evolution of our Cx service offer, both from the technical and user adaption point-of-view, its current status, including current and planned in-house developments to suite the demands for this service, and compare it with alternatives. We will also describe the challenges and opportunities observed by us in user engagements, which is being offered through courses, support tickets and ad-hoc collaborations and outline our plans for the future.
The Impact of CI/CD/CB during JUPITER’s Ascension
Speaker: Jayesh Badwaik, Jülich Supercomputing Centre, Forschungszentrum Jülich GmbH, Germany
Abstract: Ensuring performance consistency and early regression detection is critical in high-performance computing (HPC) operations. Traditional benchmarking methods rely on manual execution, leading to inconsistencies and delayed issue detection. During the JUREAP program, we integrated Continuous Benchmarking (CB) using exacb to standardize performance evaluation across 50 applications. This automation improved reproducibility, streamlined reporting, and enabled early detection of system anomalies, such as faulty Slurm updates and workflow execution issues on the JEDI machine. Even without a fully operational exascale supercomputer, exacb facilitated systematic performance comparisons, providing valuable insights into application scalability. Beyond JUREAP, CI/CD/CB enhances research software development and HPC system management. Our framework simplifies benchmarking, ensuring efficient performance tracking and optimization at scale—key for the upcoming JUPITER exascale supercomputer. Automating benchmarking reduces manual overhead, improves system stability, and aids in troubleshooting by providing structured performance insights. In this talk, we share our experience implementing CB in JUREAP, key findings from benchmarking 50 applications, and the broader impact of CI/CD/CB on research software, system administration, and future exascale computing.
Utilization of Cx in Fugaku and FugakuNEXT
Speaker: Hitoshi Murai, RIKEN Center for Computational Science, Japan
Abstract: RIKEN R-CCS starts the FugakuNEXT project from April, 2025. FugakuNEXT is a project for developing Japanese next-generation supercomputer. Unlike its predecessor, Supercomputer Fugaku, it is equipped with GPUs and targets both HPC and AI workloads; therefore it is crucial to develop and/or optimize applications for such a system. For this goal, we are planning to utilize Cx technologies. In this talk, we will explain briefly our plan of utilizing Cx in the FugakuNEXT project along with that in Fugaku.
Organising committee:
- René Caspart (Karlsruhe Institute of Technology)
- Robert Speck (Forschungszentrum Juelich)
- Stefanie Reuter (ECMWF)
- Matthew Archer (University of Cambridge)
- Andy Turner (EPCC)
- Daniel S. Katz (National Center for Supercomputing Applications, University of Illinois Urbana-Champaign)
- Jayesh Badwaik (Jülich Supercomputing Centre at Forschungszentrum Jülich)
- Weronika Filinger (EPCC, University of Edinburgh)
- Sarah Neuwirth (Johannes Gutenberg University of Mainz)
Code of conduct:
As an ISC co-located workshop we follow ISC’s code of conduct, which can be found here
Location: https://maps.app.goo.gl/9wCj7aRcMUeJ5PL87