TimedText:Common Workflow Language explained in 64 seconds.webm.en.srt
1 00:00:00,000 --> 00:00:03,630 In many scientific fields such as bioinformatics, medical imaging, and
2 00:00:03,629 --> 00:00:06,299 astronomy large quantities of data need to be
3 00:00:06,299 --> 00:00:10,769 analyzed. This can involve large-scale and repetitive processes in long
4 00:00:10,769 --> 00:00:15,239 pipelines of different tools â referred to as workflows. It can be very
5 00:00:15,240 --> 00:00:18,930 time-consuming to run data for all these different tools by hand and convert
6 00:00:18,930 --> 00:00:21,950 outputs to various formats to make them compatible with the next step.
7 00:00:21,949 --> 00:00:26,129 Workflow management systems are designed to alleviate this problem by allowing
8 00:00:26,130 --> 00:00:30,090 these workflows to be expressed formally and providing infrastructure to set up.
9 00:00:30,090 --> 00:00:34,290 execute, and monitor them. This formal expression of workflows allows for
10 00:00:34,290 --> 00:00:38,400 scientists to easily share and reuse them. Crucially they can also be used to
11 00:00:38,399 --> 00:00:43,109 verify results of computation for published work. However there are many
12 00:00:43,110 --> 00:00:46,580 competing [ways] for describing workflows which is a barrier to this aim.
13 00:00:46,579 --> 00:00:50,399 Currently there are over a hundred different data analysis workflow systems
14 00:00:50,399 --> 00:00:55,019 with no interoperability between them. The need has arisen to have a single
15 00:00:55,020 --> 00:00:58,470 common standard and so the "Common Workflow Language" project was created: an
16 00:00:58,469 --> 00:01:02,429 open standard designed to express workflows and their tooling in groups of
17 00:01:02,430 --> 00:01:05,540 YAML structured text files.