aboutsummaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorMohammad Akhlaghi <mohammad@akhlaghi.org>2019-12-07 18:58:04 +0000
committerMohammad Akhlaghi <mohammad@akhlaghi.org>2019-12-07 18:58:04 +0000
commit52ccebefe0a91636ec154e7a9339588d185ae36c (patch)
treee135dd596acfb5bd564a51fc1c5b4c314301178f
parent64385bc9f672484b72033a9416b6f1c367f8997c (diff)
New slide after the analysis graph, summarizing the benefits of Make
After showing the graph (and how everything is connected), it was useful to remind the viewers on the benefits of Make to create this structure.
-rw-r--r--reproducible-paper.tex32
1 files changed, 28 insertions, 4 deletions
diff --git a/reproducible-paper.tex b/reproducible-paper.tex
index 7d9f4f6..a0a72f0 100644
--- a/reproducible-paper.tex
+++ b/reproducible-paper.tex
@@ -35,8 +35,7 @@
%% Set the title
\title{\huge\textbf{BIG} Data, \textbf{BIG} responsibility
- \\ {\normalsize (Data lineage management with template for
- reproducible scientific papers)}}
+ \\ {\normalsize (Reproducible paper template to manage data analysis)}}
%% Set the author
\author{\vspace{1cm}\\
@@ -860,15 +859,40 @@
{First input is also used to build another file (using
two configuration parameters).}
\makedemoslide{img/make-demo-6-out3b.pdf}
- {\texttt{out-bb.dat} is built after \texttt{out-2a.dat}.}
+ {\texttt{out-3b.dat} is automatically built after (depends on) \texttt{out-2a.dat}.}
\makedemoslide{img/make-demo-7-input2.pdf}
{We now need a second input file, which is downloaded.}
\makedemoslide{img/make-demo-8-out1b.pdf}
{\texttt{out-1b.dat} depends on \texttt{out-1a.dat},
\texttt{input2.dat} and \texttt{param-1.conf}.}
\makedemoslide{img/make-demo-9-out2b.pdf}
- {SUMMARY: The exact lineage of data is thus recorded in Makefiles.}
+ {Final hypothetical output file is also built.}
+ \begin{frame}{Benefits of using Make for storing data lineage}
+ \Large
+ \begin{itemize}
+ \setlength\itemsep{5mm}
+ \item Every output file's position in the analysis is formally defined.\\
+ {\normalsize (edges between the nodes of the graph in the previous slide)}
+ \vspace{3mm}
+ \begin{itemize}
+ \Large
+ \setlength\itemsep{3mm}
+ \item Make can \alert{parallelize} the analysis: \\Make knows
+ which steps are indepenent and will run them at the same time.\\
+ \item Make can \alert{automatically detect a change} and will
+ re-do \emph{only} the affected steps.\\ {\normalsize (for
+ example to change the multiple of sigma in a configuration
+ file to see its effect)}
+ \item Easily \alert{backtrace} any step (without needing to remember!).\\
+ {\normalsize (very useful to find problems/improvements)}
+ \end{itemize}
+ \vspace{3mm} All will speed up your work, and encourage
+ experimentation for a robust result.
+ \item Make is \alert{available} on any system: many people are \alert{already familiar} with it.
+ \item And again: its \alert{all in plain text}!\\{\normalsize (doesn't take much space, easy to read, distribute, parse automatically, or archive)}
+ \end{itemize}
+ \end{frame}