diff options
-rw-r--r-- | reproducible-paper.tex | 32 |
1 files changed, 28 insertions, 4 deletions
diff --git a/reproducible-paper.tex b/reproducible-paper.tex index 7d9f4f6..a0a72f0 100644 --- a/reproducible-paper.tex +++ b/reproducible-paper.tex @@ -35,8 +35,7 @@ %% Set the title \title{\huge\textbf{BIG} Data, \textbf{BIG} responsibility - \\ {\normalsize (Data lineage management with template for - reproducible scientific papers)}} + \\ {\normalsize (Reproducible paper template to manage data analysis)}} %% Set the author \author{\vspace{1cm}\\ @@ -860,15 +859,40 @@ {First input is also used to build another file (using two configuration parameters).} \makedemoslide{img/make-demo-6-out3b.pdf} - {\texttt{out-bb.dat} is built after \texttt{out-2a.dat}.} + {\texttt{out-3b.dat} is automatically built after (depends on) \texttt{out-2a.dat}.} \makedemoslide{img/make-demo-7-input2.pdf} {We now need a second input file, which is downloaded.} \makedemoslide{img/make-demo-8-out1b.pdf} {\texttt{out-1b.dat} depends on \texttt{out-1a.dat}, \texttt{input2.dat} and \texttt{param-1.conf}.} \makedemoslide{img/make-demo-9-out2b.pdf} - {SUMMARY: The exact lineage of data is thus recorded in Makefiles.} + {Final hypothetical output file is also built.} + \begin{frame}{Benefits of using Make for storing data lineage} + \Large + \begin{itemize} + \setlength\itemsep{5mm} + \item Every output file's position in the analysis is formally defined.\\ + {\normalsize (edges between the nodes of the graph in the previous slide)} + \vspace{3mm} + \begin{itemize} + \Large + \setlength\itemsep{3mm} + \item Make can \alert{parallelize} the analysis: \\Make knows + which steps are indepenent and will run them at the same time.\\ + \item Make can \alert{automatically detect a change} and will + re-do \emph{only} the affected steps.\\ {\normalsize (for + example to change the multiple of sigma in a configuration + file to see its effect)} + \item Easily \alert{backtrace} any step (without needing to remember!).\\ + {\normalsize (very useful to find problems/improvements)} + \end{itemize} + \vspace{3mm} All will speed up your work, and encourage + experimentation for a robust result. + \item Make is \alert{available} on any system: many people are \alert{already familiar} with it. + \item And again: its \alert{all in plain text}!\\{\normalsize (doesn't take much space, easy to read, distribute, parse automatically, or archive)} + \end{itemize} + \end{frame} |