Lessons | INCF TrainingSpace

Course:

Reproducible Science (Including Git, Docker, and Binder)

This lesson continues with the second workshop on reproducible science, focusing on additional open source tools for researchers and data scientists, such as the R programming language for data science, as well as associated tools like RStudio and R Markdown. Additionally, users are introduced to Python and iPython notebooks, Google Colab, and are given hands-on tutorials on how to create a Binder environment, as well as various containers in Docker and Singularity.

Difficulty level: Beginner

Duration: 1:16:04

Speaker: : Erin Dickie and Sejal Patel

Managing Genotype Data Using PLINK

Course:

Fundamental Methods for Genomic Analysis

This is a hands-on tutorial on PLINK, the open source whole genome association analysis toolset. The aims of this tutorial are to teach users how to perform basic quality control on genetic datasets, as well as to identify and understand GWAS summary statistics.

Difficulty level: Intermediate

Duration: 1:27:18

Speaker: : Dan Felsky

Calculation of Polygenic Risk Scores in PRSice

Course:

Fundamental Methods for Genomic Analysis

This is a tutorial on using the open-source software PRSice to calculate a set of polygenic risk scores (PRS) for a study sample. Users will also learn how to read PRS into R, visualize distributions, and perform basic association analyses.

Difficulty level: Intermediate

Duration: 1:53:34

Speaker: : Dan Felsky

Introduction to Transcriptomic Data Types

Course:

Fundamental Methods for Single-Cell Transcriptome Analysis

This is a tutorial introducing participants to the basics of RNA-sequencing data and how to analyze its features using Seurat.

Difficulty level: Intermediate

Duration: 1:19:17

Speaker: : Sonny Chen

Cellular Changes in Major Depression Disorder (MDD)

Course:

Fundamental Methods for Single-Cell Transcriptome Analysis

This tutorial demonstrates how to perform cell-type deconvolution in order to estimate how proportions of cell-types in the brain change in response to various conditions. While these techniques may be useful in addressing a wide range of scientific questions, this tutorial will focus on the cellular changes associated with major depression (MDD).

Difficulty level: Intermediate

Duration: 1:15:14

Speaker: : Keon Arbabi

Processing Workflows in VRE

Course:

Session 5: Infrastructure for Sensitive Data

This lecture goes into detailed description of how to process workflows in the virtual research environment (VRE), including approaches for standardization, metadata, containerization, and constructing and maintaining scientific pipelines.

Difficulty level: Intermediate

Duration: 1:03:55

Speaker: : Patrik Bey

Cloud Neurodata Pipelines with Code Ocean

Course:

Session 6: Research Workflows for Collaborative Neuroscience

This lesson provides an overview of how to conceptualize, design, implement, and maintain neuroscientific pipelines in via the cloud-based computational reproducibility platform Code Ocean.

Difficulty level: Beginner

Duration: 17:01

Speaker: : David Feng

Building Data Pipelines With DataJoint Elements

Course:

Session 6: Research Workflows for Collaborative Neuroscience

This lesson provides an overview of how to construct computational pipelines for neurophysiological data using DataJoint.

Difficulty level: Beginner

Duration: 17:37

Speaker: : Dimitri Yatsenko

Neuroscience Workflows: Joint Management of Data and Computation

Course:

Session 6: Research Workflows for Collaborative Neuroscience

This talk describes approaches to maintaining integrated workflows and data management schema, taking advantage of the many open source, collaborative platforms already existing.

Difficulty level: Beginner

Duration: 15:15

Speaker: : Erik C. Johnson

Research Workflows for Collaborative Neuroscience - Hands-On Tutorial 2

Course:

Session 6: Research Workflows for Collaborative Neuroscience

This hands-on tutorial walks you through DataJoint platform, highlighting features and schema which can be used to build robost neuroscientific pipelines.

Difficulty level: Beginner

Duration: 26:06

Speaker: : Milagros Marin

Research Workflows for Collaborative Neuroscience - Hands-On Tutorial 3

Course:

Session 6: Research Workflows for Collaborative Neuroscience

In this third and final hands-on tutorial from the Research Workflows for Collaborative Neuroscience workshop, you will learn about workflow orchestration using open source tools like DataJoint and Flyte.

Difficulty level: Intermediate

Duration: 22:36

Speaker: : Daniel Xenes

DataLad Intro and Background

Course:

Session 7: Practical Guide to Overcome the Reproducibility Crisis in Small Animal Neuroimaging: Workflows, Tools, and Repositories

This lesson provides an introduction to the DataLad, a free and open source distributed data management system that keeps track of your data, creates structure, ensures reproducibility, supports collaboration, and integrates with widely used data infrastructure.

Difficulty level: Beginner

Duration: 22:56

Speaker: : Michał Szczepanik

Toolkits for Open Science: Data Structures and Containers

Course:

Session 7: Practical Guide to Overcome the Reproducibility Crisis in Small Animal Neuroimaging: Workflows, Tools, and Repositories

This lesson introduces several open science tools like Docker and Apptainer which can be used to develop portable and reproducible software environments.

Difficulty level: Beginner

Duration: 17:22

Speaker: : Joanes Grandjean

Adding Annotations to Neuroimaging Data: The Path From Experiment to Analysis - Tools and Pipelines

Course:

Session 9: Event Annotation in Neuroimaging Using HED: From Experiment to Analysis

This lecture provides a detailed description of how to incorporate HED annotation into your neuroimaging data pipeline.

Difficulty level: Beginner

Duration: 33:36

Speaker: : Dung Truong

Data Governance, FAIR Science, and Neuroinformatics

Course:

INCF Short Course: Introduction to Neuroinformatics

This lecture covers a wide range of aspects regarding neuroinformatics and data governance, describing both their historical developments and current trajectories. Particular tools, platforms, and standards to make your research more FAIR are also discussed.

Difficulty level: Beginner

Duration: 54:58

Speaker: : Franco Pestilli

Scientific Workflows

Course:

INCF Short Course: Introduction to Neuroinformatics

This lecture describes how to build research workflows, including a demonstrate using DataJoint Elements to build data pipelines.

Difficulty level: Intermediate

Duration: 47:00

Speaker: : Dimitri Yatsenko

Uploading and Versioning Your Data

Course:

OpenNeuro.org Tutorials

In this tutorial, you will learn the basic features of uploading and versioning your data within OpenNeuro.org.

Difficulty level: Beginner

Duration: 5:36

Speaker: : OpenNeuro

Sharing Your Data

Course:

OpenNeuro.org Tutorials

This tutorial shows how to share your data in OpenNeuro.org.

Difficulty level: Beginner

Duration: 1:22

Speaker: : OpenNeuro

Running Analysis

Course:

OpenNeuro.org Tutorials

Following the previous two tutorials on uploading and sharing data with OpenNeuro.org, this tutorial briefly covers how to run various analyses on your datasets.

Difficulty level: Beginner

Duration: 2:26

Speaker: : OpenNeuro

Mouse Phenome Database: An integrative database and analysis suite for curated empirical phenotype data

Course:

The Mouse Phenome Database (MPD) provides access to primary experimental trait data, genotypic variation, protocols and analysis tools for mouse genetic studies. Data are contributed by investigators worldwide and represent a broad scope of phenotyping endpoints and disease-related traits in naïve mice and those exposed to drugs, environmental agents or other treatments. MPD ensures rigorous curation of phenotype data and supporting documentation using relevant ontologies and controlled vocabularies. As a repository of curated and integrated data, MPD provides a means to access/re-use baseline data, as well as allows users to identify sensitized backgrounds for making new mouse models with genome editing technologies, analyze trait co-inheritance, benchmark assays in their own laboratories, and many other research applications. MPD’s primary source of funding is NIDA. For this reason, a majority of MPD data is neuro- and behavior-related.

Difficulty level: Beginner

Duration: 55:36

Speaker: : Elissa Chesler

Lesson type

Difficulty level

Topics

Contact info

Links

Related sites