
This lesson continues with the second workshop on reproducible science, focusing on additional open source tools for researchers and data scientists, such as the R programming language for data science and associated tools like RStudio and R Markdown. Users are also introduced to Python, IPython notebooks, and Google Colab, and are given hands-on tutorials on creating a Binder environment and building containers in Docker and Singularity.

Difficulty level: Beginner
Duration: 1:16:04

JupyterHub is a simple, highly extensible, multi-user system for managing per-user Jupyter Notebook servers, designed for research groups or classes. This lecture covers deploying JupyterHub on a single server, as well as deploying with Docker using GitHub for authentication.

Difficulty level: Beginner
Duration: 1:36:27
Speaker: Thomas Kluyver

The Mouse Phenome Database (MPD) provides access to primary experimental trait data, genotypic variation, protocols, and analysis tools for mouse genetic studies. Data are contributed by investigators worldwide and represent a broad scope of phenotyping endpoints and disease-related traits in naïve mice and those exposed to drugs, environmental agents, or other treatments. MPD ensures rigorous curation of phenotype data and supporting documentation using relevant ontologies and controlled vocabularies. As a repository of curated and integrated data, MPD provides a means to access and re-use baseline data, and allows users to identify sensitized backgrounds for making new mouse models with genome editing technologies, analyze trait co-inheritance, benchmark assays in their own laboratories, and pursue many other research applications. MPD’s primary source of funding is NIDA. For this reason, a majority of MPD data is neuro- and behavior-related.

Difficulty level: Beginner
Duration: 55:36
Speaker: Elissa Chesler

This lesson describes the Neuroscience Gateway, which facilitates access to and use of National Science Foundation High Performance Computing resources by neuroscientists.

Difficulty level: Beginner
Duration: 39:27
Speaker: Subha Sivagnanam

This demonstration walks through how to import your data into MATLAB.

Difficulty level: Beginner
Duration: 6:10
Speaker: MATLAB®

This lesson covers the factors to consider when preprocessing data to prepare it for statistical exploration and analysis.

Difficulty level: Beginner
Duration: 15:10
Speaker: MATLAB®

This tutorial outlines, step by step, how to perform analysis by group and how to do change-point detection.

Difficulty level: Beginner
Duration: 2:49
Speaker: MATLAB®

This tutorial walks through several common methods for visualizing your data in different ways depending on your data type.

Difficulty level: Beginner
Duration: 6:10
Speaker: MATLAB®

This tutorial illustrates several ways to approach predictive modeling and machine learning with MATLAB.

Difficulty level: Beginner
Duration: 6:27
Speaker: MATLAB®

This brief tutorial shows how to work with big data using the same workflow you would use for data of any size.

Difficulty level: Beginner
Duration: 3:55
Speaker: MATLAB®

In this tutorial, you will learn how to deploy your models outside of your local MATLAB environment, enabling wider sharing and collaboration.

Difficulty level: Beginner
Duration: 3:52
Speaker: MATLAB®

This lecture provides an overview of the core skills and practical solutions required to practice reproducible research.

Difficulty level: Beginner
Duration: 1:25:17
Speaker: Fernando Perez

This tutorial teaches users how to use Pandas objects to store and manipulate datasets in Python.

Difficulty level: Beginner
Duration: 1:21:40
Speaker: Tal Yarkoni

This lesson provides an overview of self-supervision as it relates to neural data tasks and the Mine Your Own vieW (MYOW) approach.

Difficulty level: Beginner
Duration: 25:50
Speaker: Eva Dyer

This video gives a short introduction to the EBRAINS data sharing platform, why it was developed, and how it contributes to open data sharing.

Difficulty level: Beginner
Duration: 17:32
Speaker: Ida Aasebø

This video introduces the key principles for data organization and explains how you could make your data FAIR for data sharing on EBRAINS.

Difficulty level: Beginner
Duration: 10:54

This lesson gives a quick walkthrough of the Tidyverse, an "opinionated" collection of R packages designed for data science, including the use of readr, dplyr, tidyr, and ggplot2.

Difficulty level: Beginner
Duration: 1:01:39
Speaker: Thomas Mock

This lesson provides a hands-on tutorial for generating simulated brain data within the EBRAINS ecosystem. 

Difficulty level: Beginner
Duration: 32:58
Speaker: Jil Meier