Skip to main content

This lesson continues with the second workshop on reproducible science, focusing on additional open source tools for researchers and data scientists, such as the R programming language for data science, as well as associated tools like RStudio and R Markdown. Additionally, users are introduced to Python and iPython notebooks, Google Colab, and are given hands-on tutorials on how to create a Binder environment, as well as various containers in Docker and Singularity.

Difficulty level: Beginner
Duration: 1:16:04

This is a tutorial introducing participants to the basics of RNA-sequencing data and how to analyze its features using Seurat. 

Difficulty level: Intermediate
Duration: 1:19:17
Speaker: : Sonny Chen

This demonstration walks through how to import your data into MATLAB.

Difficulty level: Beginner
Duration: 6:10
Speaker: : MATLAB®

This lesson provides instruction regarding the various factors one must consider when preprocessing data, preparing it for statistical exploration and analyses. 

Difficulty level: Beginner
Duration: 15:10
Speaker: : MATLAB®

This tutorial outlines, step by step, how to perform analysis by group and how to do change-point detection.

Difficulty level: Beginner
Duration: 2:49
Speaker: : MATLAB®

This tutorial walks through several common methods for visualizing your data in different ways depending on your data type.

Difficulty level: Beginner
Duration: 6:10
Speaker: : MATLAB®

This tutorial illustrates several ways to approach predictive modeling and machine learning with MATLAB.

Difficulty level: Beginner
Duration: 6:27
Speaker: : MATLAB®

This brief tutorial goes over how you can easily work with big data as you would with any size of data.

Difficulty level: Beginner
Duration: 3:55
Speaker: : MATLAB®

In this tutorial, you will learn how to deploy your models outside of your local MATLAB environment, enabling wider sharing and collaboration.

Difficulty level: Beginner
Duration: 3:52
Speaker: : MATLAB®

This lesson provides a brief overview of the Python programming language, with an emphasis on tools relevant to data scientists.

Difficulty level: Beginner
Duration: 1:16:36
Speaker: : Tal Yarkoni

This tutorial teaches users how to use Pandas objects to help store and manipulate various datasets in Python. 

Difficulty level: Beginner
Duration: 1:21:40
Speaker: : Tal Yarkoni
Course:

This lesson gives a quick walkthrough the Tidyverse, an "opinionated" collection of R packages designed for data science, including the use of readr, dplyr, tidyr, and ggplot2.

Difficulty level: Beginner
Duration: 1:01:39
Speaker: : Thomas Mock

This lecture covers FAIR atlases, including their background and construction, as well as how they can be created in line with the FAIR principles.

Difficulty level: Beginner
Duration: 14:24
Speaker: : Heidi Kleven

This lecture goes into detailed description of how to process workflows in the virtual research environment (VRE), including approaches for standardization, metadata, containerization, and constructing and maintaining scientific pipelines. 

Difficulty level: Intermediate
Duration: 1:03:55
Speaker: : Patrik Bey

This lesson provides an overview of how to conceptualize, design, implement, and maintain neuroscientific pipelines in via the cloud-based computational reproducibility platform Code Ocean. 

Difficulty level: Beginner
Duration: 17:01
Speaker: : David Feng

This lesson provides an overview of how to construct computational pipelines for neurophysiological data using DataJoint.

Difficulty level: Beginner
Duration: 17:37
Speaker: : Dimitri Yatsenko

This talk describes approaches to maintaining integrated workflows and data management schema, taking advantage of the many open source, collaborative platforms already existing.

Difficulty level: Beginner
Duration: 15:15
Speaker: : Erik C. Johnson

This hands-on tutorial walks you through DataJoint platform, highlighting features and schema which can be used to build robost neuroscientific pipelines. 

Difficulty level: Beginner
Duration: 26:06
Speaker: : Milagros Marin

In this third and final hands-on tutorial from the Research Workflows for Collaborative Neuroscience workshop, you will learn about workflow orchestration using open source tools like DataJoint and Flyte. 

Difficulty level: Intermediate
Duration: 22:36
Speaker: : Daniel Xenes

This lecture provides a detailed description of how to incorporate HED annotation into your neuroimaging data pipeline. 

Difficulty level: Beginner
Duration: 33:36
Speaker: : Dung Truong