Lessons | INCF TrainingSpace

Course:

Reproducible Science (Including Git, Docker, and Binder)

This lesson continues with the second workshop on reproducible science, focusing on additional open source tools for researchers and data scientists, such as the R programming language for data science, as well as associated tools like RStudio and R Markdown. Additionally, users are introduced to Python and iPython notebooks, Google Colab, and are given hands-on tutorials on how to create a Binder environment, as well as various containers in Docker and Singularity.

Difficulty level: Beginner

Duration: 1:16:04

Speaker: : Erin Dickie and Sejal Patel

Neurobagel - An Ecosystem for Distributed Dataset Harmonization and Search

Course:

Session 1: A FAIR Roadmap for Knowledge Graphs and Ontologies

This talk goes over Neurobagel, an open-source platform developed for improved dataset sharing and searching.

Difficulty level: Beginner

Duration: 13:37

Speaker: : Jean-Babtiste Poline

Federation of Data Sharing Approaches Across the BICAN

Course:

Session 3: Streamlining Cross-Platform Data Integration

In this lesson, you will learn about the BRAIN Initiative Cell Atlas Network (BICAN) and how this project adopts a federated approach to data sharing.

Difficulty level: Beginner

Duration: 11:23

Speaker: : Owen White

Data Science and Reproducibility: Part 2

Course:

INCF Short Course: Introduction to Neuroinformatics

In this second part of the lecture Data Science and Reproducibility, you will learn how to apply the awareness of the intersection between neuroscience and data science (discussed in part one) to an understanding of the current reproducibility crisis in biomedical science and neuroscience.

Difficulty level: Beginner

Duration: 31:31

Speaker: : Ashley Juavinett

Open for Research: Challenges and Opportunities in Re-Using Publicly Available Datasets

Course:

FAIR Approaches for Neuroimaging Research

This lecture covers the benefits and difficulties involved when re-using open datasets, and how metadata is important to the process.

Difficulty level: Beginner

Duration: 11:20

Speaker: : Elizabeth DuPre

Open Data Resources (and How to Download Them)

Course:

Data Management, Repositories, & Search Engines

This lesson provides a quick tour of some data repositories and how to download and manipulate data from them.

Difficulty level: Beginner

Duration: 00:49:06

Speaker: : Sebastian Urchs

KnowledgeSpace

Course:

KnowledgeSpace (KS) is a data discoverability portal and neuroscience encyclopedia that was developed to make it easier for the neuroscience community to find publicly available datasets that adhere to the FAIR Principles and to provide an integrated view of neuroscience concepts found in Wikipedia and NeuroLex linked with PubMed and 17 of the world's leading neuroscience repositories. In short, KS provides a single point of entry where reseaerchers can search for a neuroscience concept of interest and receive results that include: i. a description of the term found in Wikipedia/NeuroLex, ii. links to publicly available datasets related to the concept of interest, and iii. up-to-date references that support the concept of interests found in PubMed. APIs are available so that developers of other neuroscience research infrastructures can integrate KS components in their infrastructures. If your repository or your favorite repository is not indexed in KS, please contact us.

Difficulty level: Beginner

Duration: 6:14

Speaker: : Heather Topple

Introduction to Data Structure Standards (BIDS)

Course:

Enabling Multi-Scale Data Integration: Turning Data to Knowledge

In this lesson, attendees will learn about the data structure standards, specifically the Brain Imaging Data Structure (BIDS), an INCF-endorsed standard for organizing, annotating, and describing data collected during neuroimaging experiments.

Difficulty level: Beginner

Duration: 21:56

Speaker: : Michael Schirner

Curating Electrophysiology Data for Reuse in EBRAINS

Course:

FAIR Approaches for Electrophysiology

This lecture contains an overview of electrophysiology data reuse within the EBRAINS ecosystem.

Difficulty level: Beginner

Duration: 15:57

Speaker: : Andrew Davison

How Standards and Use Cases Shape Up the FAIR DANDI Archive

Course:

FAIR Approaches for Electrophysiology

This lecture contains an overview of the Distributed Archives for Neurophysiology Data Integration (DANDI) archive, its ties to FAIR and open-source, integrations with other programs, and upcoming features.

Difficulty level: Beginner

Duration: 13:34

Speaker: : Yaroslav O. Halchenko

What is federated analysis

Course:

The Future of Medical Data Sharing in Clinical Neurosciences

This lecture explains the concept of federated analysis in the context of medical data, associated challenges. The lecture also presents an example of hospital federations via the Medical Informatics Platform.

Difficulty level: Intermediate

Duration: 19:15

Speaker: : Yannis Ioannidis

An Intersectional Approach to Model Construction and Evaluation in Mental Healthcare

Course:

Applied Ethics in Machine Learning and Mental Health

This lesson contains both a lecture and a tutorial component. The lecture (0:00-20:03 of YouTube video) discusses both the need for intersectional approaches in healthcare as well as the impact of neglecting intersectionality in patient populations. The lecture is followed by a practical tutorial in both Python and R on how to assess intersectional bias in datasets. Links to relevant code and data are found below.

Difficulty level: Beginner

Duration: 52:26

Speaker: : Laura Sikstrom, Marta Maslej, Darla Reslan, and Yifan Wang

Managing Genotype Data Using PLINK

Course:

Fundamental Methods for Genomic Analysis

This is a hands-on tutorial on PLINK, the open source whole genome association analysis toolset. The aims of this tutorial are to teach users how to perform basic quality control on genetic datasets, as well as to identify and understand GWAS summary statistics.

Difficulty level: Intermediate

Duration: 1:27:18

Speaker: : Dan Felsky

Calculation of Polygenic Risk Scores in PRSice

Course:

Fundamental Methods for Genomic Analysis

This is a tutorial on using the open-source software PRSice to calculate a set of polygenic risk scores (PRS) for a study sample. Users will also learn how to read PRS into R, visualize distributions, and perform basic association analyses.

Difficulty level: Intermediate

Duration: 1:53:34

Speaker: : Dan Felsky

Transcriptomics at the Single-Cell and Bulk Level

Course:

Fundamental Methods for Single-Cell Transcriptome Analysis

This lesson is an overview of transcriptomics, from fundamental concepts of the central dogma and RNA sequencing at the single-cell level, to how genetic expression underlies diversity in cell phenotypes.

Difficulty level: Intermediate

Duration: 1:29:08

Speaker: : Shreejoy Tripathy

Introduction to Transcriptomic Data Types

Course:

Fundamental Methods for Single-Cell Transcriptome Analysis

This is a tutorial introducing participants to the basics of RNA-sequencing data and how to analyze its features using Seurat.

Difficulty level: Intermediate

Duration: 1:19:17

Speaker: : Sonny Chen

Cellular Changes in Major Depression Disorder (MDD)

Course:

Fundamental Methods for Single-Cell Transcriptome Analysis

This tutorial demonstrates how to perform cell-type deconvolution in order to estimate how proportions of cell-types in the brain change in response to various conditions. While these techniques may be useful in addressing a wide range of scientific questions, this tutorial will focus on the cellular changes associated with major depression (MDD).

Difficulty level: Intermediate

Duration: 1:15:14

Speaker: : Keon Arbabi

Simulating and Analyzing Spiking From Neurons and Microcircuits

Course:

Simulating Brain Microcircuit Activity and Signals in Mental Health

This is a tutorial on how to simulate neuronal spiking in brain microcircuit models, as well as how to analyze, plot, and visualize the corresponding data.

Difficulty level: Intermediate

Duration: 1:39:50

Speaker: : Frank Mazza

Maximize Your Research With Cloud Workspaces

Course:

Maximize Your Research With Cloud Workspaces is a talk aimed at researchers who are looking for innovative ways to set up and execute their life science data analyses in a collaborative, extensible, open-source cloud environment. This panel discussion is brought to you by MetaCell and scientists from leading universities who share their experiences of advanced analysis and collaborative learning through the Cloud.

Difficulty level: Beginner

Duration: 55:43

Speaker: : Stephanie Jones, Salvador Dura-Bernal, Padraig Gleeson, Andrew Hardaway, Stephen Larson

Brain Atlases as Tools for Data Integration