author | Pedram Ashofteh Ardakani <pedramardakani@gmail.com> | 2020-04-09 12:09:00 +0430 |
---|---|---|
committer | Mohammad Akhlaghi <mohammad@akhlaghi.org> | 2020-04-10 02:12:07 +0100 |
commit | 2d7e9a1ce0ad1ef75e7e6f2a59e4763a75ba4b07 | |
tree | b22865c453ac2474df62120fc5d65e036ea0b34a | |
parent | 720c154da56758594bca9242660bd837d7bf3576 |
Suggest minor changes in the abstract
to shorten some sentences, fix some spelling/typos, and further
simplify some parts.
I can see that there are some spelling errors in the rest of the paper.
They will be taken care of in the next commit.
-rw-r--r-- | paper.tex | 17 |
1 files changed, 9 insertions, 8 deletions
@@ -46,16 +46,17 @@
  %% Abstract
  {\noindent\mpregular
- The era of big data has also ushered an era of big responsability.
+ The era of big data has also ushered an era of big responsibility.
  Without it, the integrity of the results will be a subject of perpetual debate.
- In this paper, Maneage (management + lineage) is introduced as a low-level solution to this problem.
- It is designed considering the following principles: complete (e.g., not requiring any dependencies beyond a POSIX-compatible system, administrator previlages or a network connection), modular, fully in plain-text, minimal complexity in design, verifiable inputs and outputs, temporal lineage/provenance, and free software (in scientific applications).
+ In this paper, Maneage (management + lineage) is introduced as a low-level solution.
+ Maneage is a publishing and archival friendly data lineage management system (in machine-actionable plain-text) for projects in the sciences or industry.
+ Its core principles include: stand-alone (e.g., not requiring anything beyond a POSIX-compatible system, administrator privileges, or a network connection), modular, straight-forward design, traceable input and output, temporal lineage/provenance and free software (for scientific applications).
  A project that uses Maneage will be able to publish the complete data lineage, making it exactly reproducible (as a test on sufficiently conveying the data lineage).
- This control goes as far back as the automatic downloading of input data, and automatic building of necessary software (with fixed versions and build configurations) that are used in the analysis.
- It also contains the narrative description of the final project's report (built into a PDF), while providing automatic and direct links between the analysis and the part of the narrative description that it was used.
- Adopting Maneage on a wide scale will greatly improve scientific collaborations and building upon the work of other researchers, instead of the current technical frustrations that many researchers experience and can affect their scientific result and interpretations.
- It can also be used on more ambitious projects like automatic workflow creation through machine learning tools, or automating data management plans.
- As a demostration, this paper has itself been generated with Maneage (snapshot \projectversion).
+ The offered lineage/control isn't limited downloading the raw input data and processing them automatically, but also includes building the necessary data analyze software with fixed versions and build configurations.
+ Additionally, Maneage also includes the final PDF report of the project, establishing direct links between the data analysis and the narrative (with the precision of sentence).
+ Maneage enables incremental projects, where a new project can branch off an existing one making only moderate changes and experimentation on the methods.
+ It can also be used on more ambitious projects once a sufficiently large number of projects use it, for example automatic workflow creation through machine learning tools, or automating data management plans.
+ As a demonstration, this paper is written using Maneage (snapshot \projectversion).
  \horizontalline
  \noindent