Provenance Tools

This is a non-exhaustive list of tools for provenance (in a broad sense). Please contact us at tfjmp@seas.harvard.edu if you believe a tool should be added to this list.

Provenance Capture
CamFlow* An open-source project that brings observed provenance collection to the Linux operating system. Complies with the W3C PROV-DM standard. http://camflow.org
Jupyter A notebook-style IDE for "literate programming." http://jupyter.org/
Kepler A system for creating scientific workflows with explicit provenance capture tools. http://kepler-project.org
LPM An open-source project that brings observed provenance collection to the Linux operating system. Complies with the W3C PROV-DM standard. https://bitbucket.org/uf_sensei/redhat-linux-provenance-release
RDataTracker* An R library that captures fine-grained provenance from R scripts and R Markdown notebooks. Complies with the W3C PROV-DM standard. https://github.com/End-to-end-provenance/RDataTracker
recordr An R library that captures coarse-grained provenance from R scripts, intended to work in conjunction with the DataONE scientific database. https://github.com/NCEAS/recordr
Starflow* A fine-grained provenance library for the Python language. http://end-to-end-provenance.github.io/

Provenance Storage and Query
Core Provenance Library* A portable, multi-language library that application programmers can easily incorporate into a variety of tools to collect and integrate provenance. A W3C PROV-DM version is available. https://github.com/jacksonokuhn/prov-cpl https://github.com/End-to-end-provenance/core-provenance-library
Dataverse An open-source repository platform for archiving and sharing data, with provenance support currently under development. It will accept any provenance data following the W3C PROV standard. http://dataverse.org/
DataONE An open-source repository platform that accepts provenance in the recordr format. https://www.dataone.org/
Git A version control system that tracks changes made to code or other files, with provenance-like capabilities. https://git-scm.com/

Provenance Analysis and Visualization
CamFlow Visualiser* A web visualiser providing a graphical, network-based representation of provenance. Accepts any W3C PROV-DM compliant data. http://camflow.org
Provoviz A web visualiser providing a Sankey diagram representation of provenance. Accepts any PROV-DM compliant data. http://provoviz.org/
DDG-Explorer* Provenance visualiser for the RDataTracker library. http://end-to-end-provenance.github.io/
Orbiter* A visualisation tool developed for large provenance graphs. http://end-to-end-provenance.github.io/
Vistories Data visualization tools that use rich provenance information to communicate scientific findings efficiently. http://vistories.org/
Caleydo An open-source framework for visualizing dataset interdependencies. It served as a building block for Vistories. http://caleydo.org/

* Developed in collaboration with Provenance@Harvard