Lessons | INCF TrainingSpace

This lesson gives a quick walkthrough the Tidyverse, an "opinionated" collection of R packages designed for data science, including the use of readr, dplyr, tidyr, and ggplot2.

Difficulty level: Beginner

Duration: 1:01:39

Speaker: : Thomas Mock

Data science for psychology and neuroscience in Python

Course:

An introduction to data management, manipulation, visualization, and analysis for neuroscience. Students will learn scientific programming in Python, and use this to work with example data from areas such as cognitive-behavioral research, single-cell recording, EEG, and structural and functional MRI. Basic signal processing techniques including filtering are covered. The course includes a Jupyter Notebook and video tutorials.

Difficulty level: Beginner

Duration: 1:09:16

Speaker: : Aaron J. Newman

Neuroimaging and Data science

Course:

This book was written with the goal of introducing researchers and students in a variety of research fields to the intersection of data science and neuroimaging. This book reflects our own experience of doing research at the intersection of data science and neuroimaging and it is based on our experience working with students and collaborators who come from a variety of backgrounds and have a variety of reasons for wanting to use data science approaches in their work. The tools and ideas that we chose to write about are all tools and ideas that we have used in some way in our own research. Many of them are tools that we use on a daily basis in our work. This was important to us for a few reasons: the first is that we want to teach people things that we ourselves find useful. Second, it allowed us to write the book with a focus on solving specific analysis tasks. For example, in many of the chapters you will see that we walk you through ideas while implementing them in code, and with data. We believe that this is a good way to learn about data analysis, because it provides a connecting thread from scientific questions through the data and its representation to implementing specific answers to these questions. Finally, we find these ideas compelling and fruitful. That’s why we were drawn to them in the first place. We hope that our enthusiasm about the ideas and tools described in this book will be infectious enough to convince the readers of their value.

Difficulty level: Intermediate

Duration:

Speaker: :

Introducing state-of-art of methods for ensuring data privacy

Course:

The Future of Medical Data Sharing in Clinical Neurosciences

This talk presents state-of-the-art methods for ensuring data privacy with a particular focus on medical data sharing across multiple organizations.

Difficulty level: Intermediate

Duration: 22:49

Speaker: : Barbara Carminati

How to Build and Validate a Federated Algorithm in the Medical Informatics Platform (MIP)

Course:

The Future of Medical Data Sharing in Clinical Neurosciences

The Medical Informatics Platform (MIP) is a platform providing federated analytics for diagnosis and research in clinical neuroscience research. The federated analytics is possible thanks to a distributed engine that executes computations and transfers information between the members of the federation (hospital nodes). In this talk the speaker will describe the process of designing and implementing new analytical tools, i.e. statistical and machine learning algorithms. Mr. Sakellariou will further describe the environment in which these federated algorithms run, the challenges and the available tools, the principles that guide its design and the followed general methodology for each new algorithm. One of the most important challenges which are faced is to design these tools in a way that does not compromise the privacy of the clinical data involved. The speaker will show how to address the main questions when designing such algorithms: how to decompose and distribute the computations and what kind of information to exchange between nodes, in order to comply with the privacy constraint mentioned above. Finally, also the subject of validating these federated algorithms will be briefly touched.

Difficulty level: Intermediate

Duration: 20:26

Speaker: : Jason Skellariou

Risk-Based Anonymization for Medical Research

Course:

The Future of Medical Data Sharing in Clinical Neurosciences

This lecture discusses risk-based anonymization approaches for medical research.

Difficulty level: Intermediate

Duration: 15:43

Speaker: : Fabian Prasser

Neuroscience Data Integration Through Use of Digital Brain Atlases (Day 2)

Course:

INCF Assembly 2022 - Training Day 2

This lesson introduces concepts and practices surrounding reference atlases for the mouse and rat brains. Additionally, this lesson provides discussion around examples of data systems employed to organize neuroscience data collections in the context of reference atlases as well as analytical workflows applied to the data.

Difficulty level: Beginner

Duration: 03:04:29

Speaker: :

Introduction to Enabling Multi-Scale Data Integration

Course:

Enabling Multi-Scale Data Integration: Turning Data to Knowledge

This lesson is a brief introduction to the course, reiterating the goals of the NFDI-Neuro: to advance and disseminate a federated interoperable ecosystem for data and for reproducible research.

Difficulty level: Beginner

Duration: 2:44

Speaker: : Petra Ritter

Generating Simulated Data Using EBRAINS

Course:

Enabling Multi-Scale Data Integration: Turning Data to Knowledge

This lesson provides a hands-on tutorial for generating simulated brain data within the EBRAINS ecosystem.

Difficulty level: Beginner

Duration: 32:58

Speaker: : Jil Meier

Guiding Principles for FAIR and Open Science

Course:

Reproducible Science (Including Git, Docker, and Binder)

This is the first of two workshops on reproducibility in science, during which participants are introduced to concepts of FAIR and open science. After discussing the definition of and need for FAIR science, participants are walked through tutorials on installing and using Github and Docker, the powerful, open-source tools for versioning and publishing code and software, respectively.

Difficulty level: Intermediate

Duration: 1:20:58

Speaker: : Erin Dickie and Sejal Patel

An Intersectional Approach to Model Construction and Evaluation in Mental Healthcare

Course:

Applied Ethics in Machine Learning and Mental Health

This lesson contains both a lecture and a tutorial component. The lecture (0:00-20:03 of YouTube video) discusses both the need for intersectional approaches in healthcare as well as the impact of neglecting intersectionality in patient populations. The lecture is followed by a practical tutorial in both Python and R on how to assess intersectional bias in datasets. Links to relevant code and data are found below.

Difficulty level: Beginner

Duration: 52:26

Speaker: : Laura Sikstrom, Marta Maslej, Darla Reslan, and Yifan Wang

Managing Genotype Data Using PLINK

Course:

Fundamental Methods for Genomic Analysis

This is a hands-on tutorial on PLINK, the open source whole genome association analysis toolset. The aims of this tutorial are to teach users how to perform basic quality control on genetic datasets, as well as to identify and understand GWAS summary statistics.

Difficulty level: Intermediate

Duration: 1:27:18

Speaker: : Dan Felsky

Calculation of Polygenic Risk Scores in PRSice

Course:

Fundamental Methods for Genomic Analysis

This is a tutorial on using the open-source software PRSice to calculate a set of polygenic risk scores (PRS) for a study sample. Users will also learn how to read PRS into R, visualize distributions, and perform basic association analyses.

Difficulty level: Intermediate

Duration: 1:53:34

Speaker: : Dan Felsky

Maximize Your Research With Cloud Workspaces

Course:

Maximize Your Research With Cloud Workspaces is a talk aimed at researchers who are looking for innovative ways to set up and execute their life science data analyses in a collaborative, extensible, open-source cloud environment. This panel discussion is brought to you by MetaCell and scientists from leading universities who share their experiences of advanced analysis and collaborative learning through the Cloud.

Difficulty level: Beginner

Duration: 55:43

Speaker: : Stephanie Jones, Salvador Dura-Bernal, Padraig Gleeson, Andrew Hardaway, Stephen Larson

Live Papers

Course:

Session 2: FAIR Sharing, Integration, & Analysis of Neuroscience Data

This talk enumerates the challenges regarding data accessibility and reusability inherent in the current scientific publication system, and discusses novel approaches to these challenges, such as the EBRAINS Live Papers platform.

Difficulty level: Beginner

Duration: 18:08

Speaker: : Andrew Davison

Lesson type

Difficulty level

Topics

Contact info

Links

Related sites