Description
This short seminar provides an introduction to modern scientific workflows and hands-on practice with two workflow systems. Over two half-days (4 hours each), attendees will:
Day 1
Understand the role and purpose of scientific workflows, including how and why they are used to structure and automate complex data-processing pipelines. Modules include an introduction to workflow concepts and an extended practical session creating workflows with Common Workflow Language (CWL), including exercises in a Google Colab Jupyter Notebooks to install and run CWL workflows.
Day 2
Explore dispel4py, a Python-based streaming workflow framework. Participants will work through basic and advanced concepts, build simple examples, and complete hands-on exercises using dispel4py in Google Colab Jupyter Notebooks, including some real-use cases drawn from scientific applications.
Prerequisites
Attendees should be comfortable with basic Python programming concepts and familiar with using Jupyter notebooks or Google Colab; no prior experience in workflow languages is required.
Learning objectives
By the end of the course attendees will be able to describe workflow concepts and systems, write and run basic CWL workflows in a notebook environment, and develop and test simple dispel4py workflows for streaming data processing.
Requirements:
Participants must bring a laptop with a Mac, Linux, or Windows operating system (not a tablet, Chromebook, etc.) that they have administrative privileges on.
They are also required to abide by the ARCHER2 Code of Conduct.
Timetable:
Details to follow