aboutsummaryrefslogtreecommitdiff
path: root/peer-review
diff options
context:
space:
mode:
Diffstat (limited to 'peer-review')
-rw-r--r--peer-review/1-answer.txt154
1 files changed, 85 insertions, 69 deletions
diff --git a/peer-review/1-answer.txt b/peer-review/1-answer.txt
index 5c27866..b837ce4 100644
--- a/peer-review/1-answer.txt
+++ b/peer-review/1-answer.txt
@@ -32,10 +32,9 @@ reader can easily access them.
2. [Associate Editor] There are general concerns about the paper
lacking focus
-ANSWER: We believe that by responding to the specific concerns raised
-by the reviewers, as detailed below, we have tightened the focus of
-the paper.
-
+ANSWER: With all the corrections/clarifications that have been done in this
+review the focus of the paper should be clear now. We are very grateful to
+the thorough listing of points by the referees.
------------------------------
@@ -46,9 +45,10 @@ the paper.
3. [Associate Editor] Some terminology is not well-defined
(e.g. longevity).
-ANSWER: Longevity has now been defined in the first paragraph of Section
-II. With this definition, the main argument of the paper is clearer,
-thank you (and thank you to the referees for highlighting this).
+ANSWER: Reproducibility, Longevity and Usage have now been explicitly
+defined in the first paragraph of Section II. With this definition, the
+main argument of the paper is clearer, thank you (and thank you to the
+referees for highlighting this).
------------------------------
@@ -133,14 +133,16 @@ future.
is on the article, it is important that readers not be confused
when they visit your site to use your tools.
-ANSWER: Improving the consistency between this research paper and
-the Maneage website is a useful recommendation. We have listed
-this together with point 29 below at
-https://savannah.nongnu.org/task/index.php?15823
-on the Maneage development task list. As indicated there, the
-website is developed on a public git repository, so any specific
-proposals for improvements can be handled efficiently and
-transparently.
+ANSWER: Thank you for raising this important point. We have broken down the
+very long "About" page into multiple pages to help in readability:
+
+https://maneage.org/about.html
+
+Generally, the webpage will soon undergo major improvements to be even more
+clear. The website is developed on a public git repository
+(https://git.maneage.org/webpage.git), so any specific proposals for
+improvements can be handled efficiently and transparently and we welcome
+any feedback in this aspect.
------------------------------
@@ -602,9 +604,9 @@ level of peer-review control.
Tutorial. A topic breakdown is interesting, as the markdown reading may
be too long to find information.
-ANSWER: Thank you for the very useful suggestion. We have listed this as
-a task at https://savannah.nongnu.org/task/index.php?15823 .
-
+ANSWER: Thank you very much for this good suggestion, it has been
+implemented: https://maneage.org/about.html . The webpage will continuously
+be improved and such feedback is always very welcome.
------------------------------
@@ -696,18 +698,17 @@ highly modular and flexible nature of Makefiles run via 'Make'.
which might occur because of the way the code is written, and the
hardware architecture (including if code is optimised / parallelised).
+ANSWER: Floating point errors and optimizations have been mentioned in the
+discussion (Section V). The issue with parallelization has also been
+discussed in Section IV, in the part on verification ("Where exact
+reproducibility is not possible (for example due to parallelization),
+values can be verified by any statistical means, specified by the project
+authors.").
-ANSWER: The authors of particular projects have to choose the level
-floating point reproducibility that they judge viable. In section IV,
-within the 6500-word limit, this is briefly described in the discussion
-of the "verify.mk" rule file. The main paragraph is "Just before reaching ...
-All project deliverables ... are verified ... with their checksums, to
-automatically ensure exact reproducibility. .... [or] by any statistical
-means, specified by the project authors."
-
-We have added a brief reference to zenodo.3951151, pointing out that
-it illustrates an approach for statistical verifiability of
-parallelised code using Maneage.
+#####################
+Find a good way to link to (Peper and Roukema:
+https://doi.org/10.5281/zenodo.4062460)
+#####################
------------------------------
@@ -719,26 +720,44 @@ parallelised code using Maneage.
[reproducibility] ... will come with a tradeoff agianst
performance, which is never mentioned.
-ANSWER: The criteria we propose and the proof-of-concept with
-Maneage do not force the choice of a tradeoff between exact bitwise
-floating point reproducibility versus performance (e.g. speed). The
-specific concepts of "verification" and "reproducibility" will vary
-between domains of scientific computation, but we expect that the
-criteria allow this wide range. We did not add text on this point.
+ANSWER: The criteria we propose and the proof-of-concept with Maneage do
+not force the choice of a tradeoff between exact bitwise floating point
+reproducibility versus performance (e.g. speed). The specific concepts of
+"verification" and "reproducibility" will vary between domains of
+scientific computation, but we expect that the criteria allow this wide
+range.
+Performance is indeed an important issue for _immediate_ reproducibility
+and we would have liked to discuss it. But due to the strict word-count, we
+feel that adding it to the discussion points, without having adequate space
+to elaborate, can confuse the readers away from the focus of this paper
+(which is focused on long term usability). It has therefore not been added.
------------------------------
+
+
+
+
38. [Reviewer 4] Tradeoff, which might affect Criterion 3 is time to result,
people use popular frameworks because it is easier to use them.
-ANSWER: Section IV includes some quantified examples of timing
-involved in the Maneage implementation of the criteria of our
-paper. It is true that the initial build time of a Maneage install
-may discourage some scientists; but a serious scientific research
-project is never started and completed on a time scale of a few
-hours.
+ANSWER: That is true. In section IV, we have given the time it takes to
+build Maneage (only once on each computer) to be around 1.5 hours on an
+8-core CPU (a typical machine that may be used for data analysis). We
+therefore conclude that when the analysis is complex (and thus taking many
+hours or days to complete), this time is negligible.
+But if the project's full analysis takes 10 minutes or less (like the
+extremely simple analysis done in this paper which takes a fraction of a
+second). Indeed, the 1.5 hour building time is significant. In those cases,
+as discussed in the main body, the project can be built once in a Docker
+image and easily moved to other computers.
+
+Generally, it is true that the initial configuration time (only once on
+each computer) of a Maneage install may discourage some scientists; but a
+serious scientific research project is never started and completed on a
+time scale of a few hours.
------------------------------
@@ -772,14 +791,16 @@ already written.
40. [Reviewer 4] Potentially an interesting sidebar to investigate how
LaTeX/TeX has ensured its longevity!
+ANSWER: That is indeed a very interesting subject to study (an obvious link
+is that LaTeX/TeX is very strongly based on plain text files). We have been
+in touch with Karl Berry (one of the core people behind TeX Live, who also
+plays a prominent role in GNU) and have whitnessed the TeX Live community's
+efforts to become more and more portable and longer-lived.
-ANSWER: We agree that this would be interesting; an obvious link is
-that LaTeX/TeX is very strongly based on plain text files, making user
-hacking easy, provided that the user is willing to experiment and
-search and read through the source files. However, as the reviewer states,
-this would be a sidebar, and we are constrained for space.
-
-
+However, as the reviewer states, this would be a sidebar, and we are
+constrained for space, so we couldn't find a place to highlight this. But
+it is indeed a subject worthy of a full paper (that can be very useful for
+many software projects0..
------------------------------
@@ -790,19 +811,17 @@ this would be a sidebar, and we are constrained for space.
41. [Reviewer 4] The title is not specific enough - it should refer to the
reproducibility of workflows/projects.
-ANSWER: A problem here is that "workflow" and "project" taken in
-isolation risk being vague for wider audiences. Also, we aim at
-covering a wider range of aspects of a project than just than the
-workflow alone; in the other direction, the word "project" could be
-seen as too broad, including the funding, principal investigator,
-and team coordination.
+ANSWER: A problem here is that "workflow" and "project" taken in isolation
+risk being vague for wider audiences. Also, we aim at covering a wider
+range of aspects of a project than just than the workflow alone; in the
+other direction, the word "project" could be seen as too broad, including
+the funding, principal investigator, and team coordination.
-A specific title that might be appropriate could be, for example,
-"Towards long-term and archivable reproducibility of scientific
-computational research projects". Using a term proposed by one of
-our reviewers, "Towards long-term and archivable end-to-end
-reproducibility of scientific computational research projects"
-might also be appropriate.
+A specific title that might be appropriate could be, for example, "Towards
+long-term and archivable reproducibility of scientific computational
+research projects". Using a term proposed by one of our reviewers, "Towards
+long-term and archivable end-to-end reproducibility of scientific
+computational research projects" might also be appropriate.
Nevertheless, we feel that in the context of an article published in CiSE,
our current short title is sufficient.
@@ -872,7 +891,6 @@ longevity; same as supported CPU architectures."
longevity of the workflows that can be produced using these tools?
What happens if you use a combination of all four categories of tools?
-
ANSWER: We have changed the section title to "Longevity of existing tools"
to clarify that we refer to longevity of the tools.
@@ -950,14 +968,12 @@ effort, without any major difficulties.
write a "paper", ease of depositing in a repository, and ease of
use by another researcher.
-
-ANSWER: This type of sociological survey will make sense once the
- number of projects run with Maneage is sufficiently high. The
- time taken to write a paper should be measurably automatically,
- from the git history. The other parameters suggested would
- require cooperation from the scientists in responding to
- the survey, or will have to be collected anecdotally in the
- short term.
+ANSWER: This type of sociological survey will make sense once the number of
+projects run with Maneage is sufficiently high. The time taken to write a
+paper should be measurable automatically: from the git history. The other
+parameters suggested would require cooperation from the scientists in
+responding to the survey, or will have to be collected anecdotally in the
+short term.
------------------------------