author Boud Roukema <boud@cosmo.torun.pl> 2020-04-19 17:14:18 +0200
committer Boud Roukema <boud@cosmo.torun.pl> 2020-04-19 17:14:18 +0200
commit 13d0a688a100e973f05e2991334903b22f233a01
tree 3625751a1ced7d6eee772589ac767b911c2201d1
parent a1339189dae70488a52b128fa5bd9a61934e199c
principles - P2 modularity
Minor wording improvements; reduction by 10 words.
-rw-r--r-- paper.tex 12
1 files changed, 6 insertions, 6 deletions
diff --git a/paper.tex b/paper.tex
index aeb7404..d43aa8c 100644
--- a/paper.tex
+++ b/paper.tex
@@ -236,15 +236,15 @@ A detailed list of principles shows how Maneage is unique compared to these othe
A plain-text project, besides being extremely low volume ($\sim100$ kilobytes), is human-readable and parsable by any machine, even if it can't be executed.
\item \label{principle:modularity}\textbf{Modularity:}
-A project should be compartmentalized or partitioned into independent modules or components with well-defined inputs/outputs having no side-effects.
-In a modular project, communication between the independent modules is explicit, providing optimizations on multiple levels:
-1) Execution: independent modules can run in parallel, or modules that do not need to be run (because their dependencies have not changed) will not be re-done.
+A project should be compartmentalized into independent modules with well-defined inputs/outputs having no side effects.
+Communication between the independent modules should be explicit, providing several optimizations:
+1) Execution: independent modules can run in parallel. Modules that do not need to be run (because their dependencies have not changed) will not be re-run.
2) Data provenance extraction (recording any dataset's origins).
-3) Citation: allowing others to credit specific parts of a project.
+3) Citation: others can credit specific parts of a project.
4) Usage in other projects.
-\emph{Comparison with existing:} Visual workflow tools like Apache Taverna, GenePattern, Kepler or VisTrails do encourage this, but the more recent tools leave such design choices to the experience of project authors.
-However, designing a modular project needs to be encouraged and facilitated, otherwise scientists (who are not usually trained in data management) will not design their projects to be modular, leading to great inefficiencies in terms of project cost and/or scientific accuracy.
+\emph{Comparison with existing:} Visual workflow tools like Apache Taverna, GenePattern, Kepler or VisTrails encourage this, but the more recent tools leave this design choice as the responsibility of project authors.
+However, designing a modular project needs to be encouraged and facilitated. Otherwise, scientists, who are not usually trained in data management, will rarely design their projects to be modular, leading to great inefficiencies in terms of project cost and/or scientific accuracy.
\item \label{principle:complexity}\textbf{Minimal complexity:}
This principle is essentially Ockham's razor: ``\emph{Never posit pluralities without necessity}'' \citep{schaffer15}, but extrapolated to project management: