Using Git to improve reproducibility & transparency

Authors: Gemayel, Toni, Banyan

Track: Posters


Using Git to facilitate reproducibility and transparency can improve trust and credibility and help foster the development of new ideas. Git can help scientists standardize their approach to research projects. With Git, scientists no longer need to rely on informal and uninformative file names (for example, research-1.pdf and research-2.pdf) to keep multiple revisions of papers, code, and datasets.

A number of version control systems are available, but Git stands out because it is distributed: every author holds a full copy of the repository and can work asynchronously without being connected to a central server. This is especially useful when working from remote field sites. Git also keeps a complete history of all changes, including authorship, which anyone with access to the repository can view and search.

I will outline the advantages of using Git and describe how it can be used to manage common science outputs and workflows, including lab notebooks, LaTeX documents, lectures, datasets, and more.