Scientific Workflows: Tutorial
Ivan Kondov, Elnaz Azmi
STEINBUCH CENTRE FOR COMPUTING - SCC
KIT – The Research University in the Helmholtz Association www.kit.edu
What is a scientific workflow?
Many different definitions, depending on community
Coordinated execution of repeatable actions accounting for dependencies and
concurrency
Typical actions
Computing, also high performance computing
Data management and analysis
Pre- and post-processing
Visualization
Other common but imprecise names: “protocol”, “recipe”, “procedure”,
“job chain”, “task sequence”, …
Not to be confused with: business process workflows, pipelines and stream
processing
2 29.08.2017 Scientific workflows with FireWorks Steinbuch Centre for Computing
Basic example: image processing
[Figure: workflow diagram. Input pieces 1 and 2 pass through the steps "flip vertically", "rotate -90°" and "flip horizontally"; the results are combined with montage.]
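The image-processing example can be read as a small dependency graph. The following is a minimal sketch of that reading, not part of the original slides: the step names are hypothetical labels, and the dependencies are an assumption derived from the file names used in the bash script of this example. It uses the standard-library module graphlib (Python 3.9 or later) to group the steps into layers; steps within one layer do not depend on each other and could in principle run concurrently.

```python
from graphlib import TopologicalSorter

# Each step maps to the set of steps it depends on.
# Step names are illustrative labels, not identifiers from the slides.
dependencies = {
    "flip_vertically": set(),                  # piece-1.png -> bottom_right.png
    "rotate_minus_90": set(),                  # piece-2.png -> top_left.png
    "flop_top_left": {"rotate_minus_90"},      # top_left.png -> top_right.png
    "flop_bottom_right": {"flip_vertically"},  # bottom_right.png -> bottom_left.png
    "montage": {
        "flip_vertically", "rotate_minus_90",
        "flop_top_left", "flop_bottom_right",
    },
}


def execution_layers(graph):
    """Group steps into layers; steps in one layer have no mutual dependencies."""
    sorter = TopologicalSorter(graph)
    sorter.prepare()
    layers = []
    while sorter.is_active():
        # All steps whose dependencies have already been marked as done.
        ready = sorted(sorter.get_ready())
        layers.append(ready)
        sorter.done(*ready)
    return layers


layers = execution_layers(dependencies)
for number, layer in enumerate(layers, start=1):
    print(f"layer {number}: {', '.join(layer)}")
```

Under these assumptions the two input transformations form the first layer, the two flop steps the second, and montage the last, so at most two steps can run at the same time.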
Using a bash script
#!/bin/bash -e

# Flip vertically
convert -flip piece-1.png bottom_right.png
# Rotate 90 degrees anti-clockwise
convert -rotate -90 piece-2.png top_left.png
# Flop horizontally top_left.png
convert -flop top_left.png top_right.png
# Flop horizontally bottom_right.png
convert -flop bottom_right.png bottom_left.png

# Put the four pieces together
montage -mode concatenate -tile 2x2 \
    top_left.png top_right.png bottom_left.png \
    bottom_right.png montaged_image.png

Drawbacks
No clear interfaces between steps
No reuse of data and code
No dependencies
No state tracking
No concurrency: sequential execution
No capability for heterogeneous / distributed resources

Workarounds
Partitioning / Refactoring
Checkpointing
Chaining with dependencies: sub-blocking, multi-step jobs, multi-jobs or job chains
Scheduling
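As an illustration of the checkpointing workaround, here is a minimal sketch, assuming a JSON state file and a hypothetical helper run_with_checkpoints; neither comes from the slides. Each completed step is recorded, so a rerun after a failure skips the steps that already finished. The actions below are placeholders; in the image example they would call the convert and montage commands.

```python
import json
import tempfile
from pathlib import Path


def run_with_checkpoints(steps, state_file):
    """Run (name, action) pairs in order, skipping steps recorded as done."""
    state_path = Path(state_file)
    # Load the names of previously completed steps, if a state file exists.
    completed = json.loads(state_path.read_text()) if state_path.exists() else []
    executed = []
    for name, action in steps:
        if name in completed:
            continue  # checkpoint hit: this step finished in an earlier run
        action()  # an exception here leaves the state file at the last good step
        completed.append(name)
        state_path.write_text(json.dumps(completed))  # persist after every step
        executed.append(name)
    return executed


# Demonstration with placeholder actions in a temporary directory.
with tempfile.TemporaryDirectory() as workdir:
    state = Path(workdir) / "state.json"
    demo_steps = [(name, lambda: None) for name in ("flip", "rotate", "flop", "montage")]
    first_run = run_with_checkpoints(demo_steps, state)
    second_run = run_with_checkpoints(demo_steps, state)
    print("first run executed:", first_run)
    print("second run executed:", second_run)
```

The sketch covers state tracking only. It still runs sequentially and does not check whether output files are intact, which is part of what a workflow system is meant to handle.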