aboutsummaryrefslogtreecommitdiff
path: root/paper.tex
diff options
context:
space:
mode:
authorMohammad Akhlaghi <mohammad@akhlaghi.org>2020-11-23 15:27:51 +0000
committerMohammad Akhlaghi <mohammad@akhlaghi.org>2020-11-23 15:27:51 +0000
commit1b513de78bb77d276ed337e1f03aa3a8168eb1d3 (patch)
treea5f2366ed0ca39c94d45dcdb76e6433579042d70 /paper.tex
parent49a6067514d48da65e5fcc8089d171d07c186311 (diff)
Minor edits and corrections
Raul's added point on the answer to the referee was very good, so I edited it a little to be more clear (and removed his name). Also, after looking in a few parts of the text, I fixed a few typos.
Diffstat (limited to 'paper.tex')
-rw-r--r--paper.tex12
1 files changed, 6 insertions, 6 deletions
diff --git a/paper.tex b/paper.tex
index 16658a4..3907c8e 100644
--- a/paper.tex
+++ b/paper.tex
@@ -329,16 +329,16 @@ Figure \ref{fig:datalineage} (right) is the data lineage graph that produced it
\vspace{-3mm}
\caption{\label{fig:datalineage}
Left: an enhanced replica of Figure 1C in \cite{menke20}, shown here for demonstrating Maneage.
- It shows the ratio of the number of papers mentioning software tools (green line, left vertical axis) to the total number of papers studied in that year (light red bars, right vertical axis on a log scale).
+ It shows the fraction of the number of papers mentioning software tools (green line, left vertical axis) in each year (red bars, right vertical axis on a log scale).
Right: Schematic representation of the data lineage, or workflow, to generate the plot on the left.
- Each colored box is a file in the project and \new{arrows show the operation of various software, showing what inputs it takes and what outputs it produces}.
+ Each colored box is a file in the project and \new{arrows show the operation of various software: linking input file(s) to output file(s)}.
Green files/boxes are plain-text files that are under version control and in the project source directory.
Blue files/boxes are output files in the build directory, shown within the Makefile (\inlinecode{*.mk}) where they are defined as a \emph{target}.
For example, \inlinecode{paper.pdf} \new{is created by running \LaTeX{} on} \inlinecode{project.tex} (in the build directory; generated automatically) and \inlinecode{paper.tex} (in the source directory; written manually).
\new{Other software are used in other steps.}
- The solid arrows and full-opacity built boxes correspond to this paper.
- The dotted arrows and built boxes show the scalability by adding hypothetical steps to the project.
- The underlying data of the top plot is available at
+ The solid arrows and full-opacity built boxes correspond to the lineage of this paper.
+ The dotted arrows and built boxes show the scalability of Maneage (ease of adding hypothetical steps to the project as it evolves).
+ The underlying data of the left plot is available at
\href{https://zenodo.org/record/\projectzenodoid/files/tools-per-year.txt}{zenodo.\projectzenodoid/tools-per-year.txt}.
}
\end{figure*}
@@ -741,7 +741,7 @@ For example \citeappendix{lofstead19} propose a ``data pallet'' concept to conta
In summary, containers or VMs are just a built product themselves.
If they are built properly (for example building a Maneage'd project inside a Docker container), they can be useful for immediate usage and fast moving of the project from one system to another.
With robust building, the container or VM can also be exactly reproduced later.
-However, attempting to archive the actual binary container or VM files as a black box (not knowing the precise versions of the software in them) is expensive, and will not be able to answer the most fundamental
+However, attempting to archive the actual binary container or VM files as a black box (not knowing the precise versions of the software in them, and \emph{how} they were built) is expensive, and will not be able to answer the most fundamental questions.
\subsubsection{Independent build in host's file system}
\label{appendix:independentbuild}