Versioning & Containerization

By

Category

Computer science

Level

Beginner

This course outlines how versioning code, data, and analysis software is crucially important to rigorous and open neuroscience workflows that maximize reproducibility and minimize errors.Version control systems, code-capable notebooks, and virtualization containers such as Git, Jupyter, and Docker, respectively, have become essential tools in data science.

Course Features

Videos

Lectures

Slides

Tutorials

Lessons of this Course

1

1

Tools of the Trade: From Data to Results in Neuroimaging

Duration:

57:52

Speaker:

This talk highlights a set of platform technologies, software, and data collections that close and shorten the feedback cycle in research.

2

2

An Introduction to Version Control Using Git and GitHub

Duration:

51:55

Speaker:

Saskia Bollmann, Steffen Bollmann

This presentation by the OHBM OpenScienceSIG covers common scenarios where Git can be extremely valuable. The essentials covered include cloning a repository and keeping it up to date, how to create and use your own repository, and how to contribute to other projects via forking and pull requests.

3

3

Collaborating With Git and GitHub

Duration:

2:15:50

Speaker:

Elizabeth DuPre

This tutorial covers the fundamentals of collaborating with Git and GitHub.

4

4

DataLad: Decentralized Distribution and Management of Scientific Datasets

Duration:

59:34

Speaker:

Yaroslav O. Halchenko

This lesson describes how DataLad allows you to track and mange both your data and analysis code, thereby facilitating reliable, reproducible, and shareable research.

5

5

An Introduction to DataLad

Duration:

01:29:08

Speaker:

DataLad is a versatile data management and data publication multi-tool. In this session, you can learn the basic concepts and commands for version control and reproducible data analysis. You’ll get to see, create, and install DataLad datasets of many shapes and sizes, master local version workflows and provenance-captured analysis-execution, and you will get ideas for your next data analysis project.

6

6

Containers for Science

Duration:

01:21:59

Speaker:

Tom Shaw & Steffen Bollmann

Presented by the OHBM OpenScienceSIG, this lesson covers how containers can be useful for running the same software on different platforms and sharing analysis pipelines with other researchers.

7

6

The Brain Imaging Data Structure (BIDS)

Duration:

56:49

Speaker:

Chris Gorgolewski

This lecture provides an introduction to the Brain Imaging Data Structure (BIDS), a standard for organizing human neuroimaging datasets.

8

8

Containerization With Docker

Duration:

2:51:34

Speaker:

This lesson gives a tour of how popular virtualization tools like Docker and Singularity are playing a crucial role in improving reproducibility and enabling high-performance computing in neuroscience.

Recent courses

Simulating Brain Microcircuit Activity and Signals in Mental Health

Krembil Centre for Neuroinformatics

View the course

Neuroscience for Machine Learners (Neuro4ML)

Neural Reckoning Group

View the course

Linear Systems

Neuromatch Academy

View the course