Skip to main content

This lesson continues with the second workshop on reproducible science, focusing on additional open source tools for researchers and data scientists, such as the R programming language for data science, as well as associated tools like RStudio and R Markdown. Additionally, users are introduced to Python and iPython notebooks, Google Colab, and are given hands-on tutorials on how to create a Binder environment, as well as various containers in Docker and Singularity.

Difficulty level: Beginner
Duration: 1:16:04

This lesson provides an overview of how to conceptualize, design, implement, and maintain neuroscientific pipelines in via the cloud-based computational reproducibility platform Code Ocean. 

Difficulty level: Beginner
Duration: 17:01
Speaker: : David Feng

This lesson provides an overview of how to construct computational pipelines for neurophysiological data using DataJoint.

Difficulty level: Beginner
Duration: 17:37
Speaker: : Dimitri Yatsenko

This hands-on tutorial walks you through DataJoint platform, highlighting features and schema which can be used to build robost neuroscientific pipelines. 

Difficulty level: Beginner
Duration: 26:06
Speaker: : Milagros Marin

This lesson provides an introduction to the DataLad, a free and open source distributed data management system that keeps track of your data, creates structure, ensures reproducibility, supports collaboration, and integrates with widely used data infrastructure.

Difficulty level: Beginner
Duration: 22:56

This lesson introduces several open science tools like Docker and Apptainer which can be used to develop portable and reproducible software environments. 

Difficulty level: Beginner
Duration: 17:22
Speaker: : Joanes Grandjean

This lecture provides a detailed description of how to incorporate HED annotation into your neuroimaging data pipeline. 

Difficulty level: Beginner
Duration: 33:36
Speaker: : Dung Truong

This lecture covers a wide range of aspects regarding neuroinformatics and data governance, describing both their historical developments and current trajectories. Particular tools, platforms, and standards to make your research more FAIR are also discussed.

Difficulty level: Beginner
Duration: 54:58
Speaker: : Franco Pestilli

In this tutorial, you will learn the basic features of uploading and versioning your data within OpenNeuro.org.

Difficulty level: Beginner
Duration: 5:36
Speaker: : OpenNeuro

This tutorial shows how to share your data in OpenNeuro.org.

Difficulty level: Beginner
Duration: 1:22
Speaker: : OpenNeuro

Following the previous two tutorials on uploading and sharing data with OpenNeuro.org, this tutorial briefly covers how to run various analyses on your datasets.

Difficulty level: Beginner
Duration: 2:26
Speaker: : OpenNeuro
Course:

The Mouse Phenome Database (MPD) provides access to primary experimental trait data, genotypic variation, protocols and analysis tools for mouse genetic studies. Data are contributed by investigators worldwide and represent a broad scope of phenotyping endpoints and disease-related traits in naïve mice and those exposed to drugs, environmental agents or other treatments. MPD ensures rigorous curation of phenotype data and supporting documentation using relevant ontologies and controlled vocabularies. As a repository of curated and integrated data, MPD provides a means to access/re-use baseline data, as well as allows users to identify sensitized backgrounds for making new mouse models with genome editing technologies, analyze trait co-inheritance, benchmark assays in their own laboratories, and many other research applications. MPD’s primary source of funding is NIDA. For this reason, a majority of MPD data is neuro- and behavior-related.

Difficulty level: Beginner
Duration: 55:36
Speaker: : Elissa Chesler

This lesson provides an overview of GeneWeaver, a web application for the integrated cross-species analysis of functional genomics data to find convergent evidence from heterogeneous sources.

Difficulty level: Beginner
Duration: 1:03:26
Speaker: : Erich J. Baker

This lesson provides a demonstration of GeneWeaver, a system for the integration and analysis of heterogeneous functional genomics data.

Difficulty level: Beginner
Duration: 25:53
Speaker: :

This talk highlights a set of platform technologies, software, and data collections that close and shorten the feedback cycle in research. 

Difficulty level: Beginner
Duration: 57:52
Speaker: : Satrajit Ghosh

This lecture outlines GeneNetwork.org, a group of linked data sets and tools used to study complex networks of genes, molecules, and higher order gene function and phenotypes.

Difficulty level: Beginner
Duration: 1:00:43
Speaker: : Robert Williams

This talk covers the Neuroimaging Informatics Tools and Resources Clearinghouse (NITRC), a free one-stop-shop collaboratory for science researchers that need resources such as neuroimaging analysis software, publicly available data sets, or computing power.

Difficulty level: Beginner
Duration: 1:00:10
Speaker: : David Kennedy

This tutorial shows how to use the UCSC genome browser to find a list of genes in a given genomic region.

Difficulty level: Beginner
Duration: 4:32

This tutorial shows how to find all the single nucleotide polymorphisms (SNPs) upstream from genes using the UCSC Genome Browser.

Difficulty level: Beginner
Duration: 8:13

This tutorial demonstrates how to find all the single nucleotide polymorphisms (SNPs) in a gene using the UCSC Genome Browser.

Difficulty level: Beginner
Duration: 6:12