Lessons | INCF TrainingSpace

Course:

Reproducible Science (Including Git, Docker, and Binder)

This lesson continues with the second workshop on reproducible science, focusing on additional open source tools for researchers and data scientists, such as the R programming language for data science, as well as associated tools like RStudio and R Markdown. Additionally, users are introduced to Python and iPython notebooks, Google Colab, and are given hands-on tutorials on how to create a Binder environment, as well as various containers in Docker and Singularity.

Difficulty level: Beginner

Duration: 1:16:04

Speaker: : Erin Dickie and Sejal Patel

Neurobagel - An Ecosystem for Distributed Dataset Harmonization and Search

Course:

Session 1: A FAIR Roadmap for Knowledge Graphs and Ontologies

This talk goes over Neurobagel, an open-source platform developed for improved dataset sharing and searching.

Difficulty level: Beginner

Duration: 13:37

Speaker: : Jean-Babtiste Poline

Federation of Data Sharing Approaches Across the BICAN

Course:

Session 3: Streamlining Cross-Platform Data Integration

In this lesson, you will learn about the BRAIN Initiative Cell Atlas Network (BICAN) and how this project adopts a federated approach to data sharing.

Difficulty level: Beginner

Duration: 11:23

Speaker: : Owen White

Data Science and Reproducibility: Part 2

Course:

INCF Short Course: Introduction to Neuroinformatics

In this second part of the lecture Data Science and Reproducibility, you will learn how to apply the awareness of the intersection between neuroscience and data science (discussed in part one) to an understanding of the current reproducibility crisis in biomedical science and neuroscience.

Difficulty level: Beginner

Duration: 31:31

Speaker: : Ashley Juavinett

Open for Research: Challenges and Opportunities in Re-Using Publicly Available Datasets

Course:

FAIR Approaches for Neuroimaging Research

This lecture covers the benefits and difficulties involved when re-using open datasets, and how metadata is important to the process.

Difficulty level: Beginner

Duration: 11:20

Speaker: : Elizabeth DuPre

Open Data Resources (and How to Download Them)

Course:

Data Management, Repositories, & Search Engines

This lesson provides a quick tour of some data repositories and how to download and manipulate data from them.

Difficulty level: Beginner

Duration: 00:49:06

Speaker: : Sebastian Urchs

KnowledgeSpace

Course:

KnowledgeSpace (KS) is a data discoverability portal and neuroscience encyclopedia that was developed to make it easier for the neuroscience community to find publicly available datasets that adhere to the FAIR Principles and to provide an integrated view of neuroscience concepts found in Wikipedia and NeuroLex linked with PubMed and 17 of the world's leading neuroscience repositories. In short, KS provides a single point of entry where reseaerchers can search for a neuroscience concept of interest and receive results that include: i. a description of the term found in Wikipedia/NeuroLex, ii. links to publicly available datasets related to the concept of interest, and iii. up-to-date references that support the concept of interests found in PubMed. APIs are available so that developers of other neuroscience research infrastructures can integrate KS components in their infrastructures. If your repository or your favorite repository is not indexed in KS, please contact us.

Difficulty level: Beginner

Duration: 6:14

Speaker: : Heather Topple

Introduction to Data Structure Standards (BIDS)

Course:

Enabling Multi-Scale Data Integration: Turning Data to Knowledge

In this lesson, attendees will learn about the data structure standards, specifically the Brain Imaging Data Structure (BIDS), an INCF-endorsed standard for organizing, annotating, and describing data collected during neuroimaging experiments.

Difficulty level: Beginner

Duration: 21:56

Speaker: : Michael Schirner

Mathematics for Data Science Practitioners

Course:

Foundations of Data Science

This lesson gives an introduction to the Mathematics chapter of Datalabcc's Foundations in Data Science series.

Difficulty level: Beginner

Duration: 2:53

Speaker: : Barton Poulson

Elementary Algebra

Course:

Foundations of Data Science

This lesson serves a primer on elementary algebra.

Difficulty level: Beginner

Duration: 3:03

Speaker: : Barton Poulson

Linear Algebra

Course:

Foundations of Data Science

This lesson provides a primer on linear algebra, aiming to demonstrate how such operations are fundamental to many data science.

Difficulty level: Beginner

Duration: 5:38

Speaker: : Barton Poulson

Systems of Linear Equations

Course:

Foundations of Data Science

In this lesson, users will learn about linear equation systems, as well as follow along some practical use cases.

Difficulty level: Beginner

Duration: 5:24

Speaker: : Barton Poulson

Calculus

Course:

Foundations of Data Science

This talk gives a primer on calculus, emphasizing its role in data science.

Difficulty level: Beginner

Duration: 4:17

Speaker: : Barton Poulson

Calculus and Optimization

Course:

Foundations of Data Science

This lesson clarifies how calculus relates to optimization in a data science context.

Difficulty level: Beginner

Duration: 8:43

Speaker: : Barton Poulson

Mathematics: Big O

Course:

Foundations of Data Science

This lesson covers Big O notation, a mathematical notation that describes the limiting behavior of a function as it tends towards a certain value or infinity, proving useful for data scientists who want to evaluate their algorithms' efficiency.

Difficulty level: Beginner

Duration: 5:19

Speaker: : Barton Poulson

Probability

Course:

Foundations of Data Science

This lesson serves as a primer on the fundamental concepts underlying probability.

Difficulty level: Beginner

Duration: 7:33

Speaker: : Barton Poulson

Maths for Programmers Tutorial - Full Course on Sets and Logic

Course:

Conceptual Background & Refreshers

Serving as good refresher, this lesson explains the maths and logic concepts that are important for programmers to understand, including sets, propositional logic, conditional statements, and more.

This compilation is courtesy of freeCodeCamp.

Difficulty level: Beginner

Duration: 1:00:07

Speaker: : Shawn Grooms

Linear Algebra for Machine Learning

Course:

Conceptual Background & Refreshers

This lesson provides a useful refresher which will facilitate the use of Matlab, Octave, and various matrix-manipulation and machine-learning software.

This lesson was created by RootMath.

Difficulty level: Beginner

Duration: 1:21:30

Speaker: :

Medical Informatics Platform (MIP) Federated Analytics

Course:

The Future of Medical Data Sharing in Clinical Neurosciences

In this session the Medical Informatics Platform (MIP) federated analytics is presented. The current and future analytical tools implemented in the MIP will be detailed along with the constructs, tools, processes, and restrictions that formulate the solution provided. MIP is a platform providing advanced federated analytics for diagnosis and research in clinical neuroscience research. It is targeting clinicians, clinical scientists and clinical data scientists. It is designed to help adopt advanced analytics, explore harmonized medical data of neuroimaging, neurophysiological and medical records as well as research cohort datasets, without transferring original clinical data. It can be perceived as a virtual database that seamlessly presents aggregated data from distributed sources, provides access and analyze imaging and clinical data, securely stored in hospitals, research archives and public databases. It leverages and re-uses decentralized patient data and research cohort datasets, without transferring original data. Integrated statistical analysis tools and machine learning algorithms are exposed over harmonized, federated medical data.

Difficulty level: Intermediate

Duration: 15:05

Speaker: : Giorgos Papanikos

How to Build and Validate a Federated Algorithm in the Medical Informatics Platform (MIP)

Course:

The Future of Medical Data Sharing in Clinical Neurosciences

The Medical Informatics Platform (MIP) is a platform providing federated analytics for diagnosis and research in clinical neuroscience research. The federated analytics is possible thanks to a distributed engine that executes computations and transfers information between the members of the federation (hospital nodes). In this talk the speaker will describe the process of designing and implementing new analytical tools, i.e. statistical and machine learning algorithms. Mr. Sakellariou will further describe the environment in which these federated algorithms run, the challenges and the available tools, the principles that guide its design and the followed general methodology for each new algorithm. One of the most important challenges which are faced is to design these tools in a way that does not compromise the privacy of the clinical data involved. The speaker will show how to address the main questions when designing such algorithms: how to decompose and distribute the computations and what kind of information to exchange between nodes, in order to comply with the privacy constraint mentioned above. Finally, also the subject of validating these federated algorithms will be briefly touched.

Difficulty level: Intermediate

Duration: 20:26

Speaker: : Jason Skellariou

Lesson type

Difficulty level

Topics

Contact info

Links

Related sites