diff options
Diffstat (limited to 'paper.tex')
-rw-r--r-- | paper.tex | 12 |
1 files changed, 6 insertions, 6 deletions
@@ -329,16 +329,16 @@ Figure \ref{fig:datalineage} (right) is the data lineage graph that produced it \vspace{-3mm} \caption{\label{fig:datalineage} Left: an enhanced replica of Figure 1C in \cite{menke20}, shown here for demonstrating Maneage. - It shows the ratio of the number of papers mentioning software tools (green line, left vertical axis) to the total number of papers studied in that year (light red bars, right vertical axis on a log scale). + It shows the fraction of the number of papers mentioning software tools (green line, left vertical axis) in each year (red bars, right vertical axis on a log scale). Right: Schematic representation of the data lineage, or workflow, to generate the plot on the left. - Each colored box is a file in the project and \new{arrows show the operation of various software, showing what inputs it takes and what outputs it produces}. + Each colored box is a file in the project and \new{arrows show the operation of various software: linking input file(s) to output file(s)}. Green files/boxes are plain-text files that are under version control and in the project source directory. Blue files/boxes are output files in the build directory, shown within the Makefile (\inlinecode{*.mk}) where they are defined as a \emph{target}. For example, \inlinecode{paper.pdf} \new{is created by running \LaTeX{} on} \inlinecode{project.tex} (in the build directory; generated automatically) and \inlinecode{paper.tex} (in the source directory; written manually). \new{Other software are used in other steps.} - The solid arrows and full-opacity built boxes correspond to this paper. - The dotted arrows and built boxes show the scalability by adding hypothetical steps to the project. - The underlying data of the top plot is available at + The solid arrows and full-opacity built boxes correspond to the lineage of this paper. + The dotted arrows and built boxes show the scalability of Maneage (ease of adding hypothetical steps to the project as it evolves). + The underlying data of the left plot is available at \href{https://zenodo.org/record/\projectzenodoid/files/tools-per-year.txt}{zenodo.\projectzenodoid/tools-per-year.txt}. } \end{figure*} @@ -741,7 +741,7 @@ For example \citeappendix{lofstead19} propose a ``data pallet'' concept to conta In summary, containers or VMs are just a built product themselves. If they are built properly (for example building a Maneage'd project inside a Docker container), they can be useful for immediate usage and fast moving of the project from one system to another. With robust building, the container or VM can also be exactly reproduced later. -However, attempting to archive the actual binary container or VM files as a black box (not knowing the precise versions of the software in them) is expensive, and will not be able to answer the most fundamental +However, attempting to archive the actual binary container or VM files as a black box (not knowing the precise versions of the software in them, and \emph{how} they were built) is expensive, and will not be able to answer the most fundamental questions. \subsubsection{Independent build in host's file system} \label{appendix:independentbuild} |