Skip to main content

This lesson continues with the second workshop on reproducible science, focusing on additional open source tools for researchers and data scientists, such as the R programming language for data science, as well as associated tools like RStudio and R Markdown. Additionally, users are introduced to Python and iPython notebooks, Google Colab, and are given hands-on tutorials on how to create a Binder environment, as well as various containers in Docker and Singularity.

Difficulty level: Beginner
Duration: 1:16:04

This talk goes over Neurobagel, an open-source platform developed for improved dataset sharing and searching. 

Difficulty level: Beginner
Duration: 13:37

In this lesson, you will learn about the BRAIN Initiative Cell Atlas Network (BICAN) and how this project adopts a federated approach to data sharing. 

Difficulty level: Beginner
Duration: 11:23
Speaker: : Owen White

In this second part of the lecture Data Science and Reproducibility, you will learn how to apply the awareness of the intersection between neuroscience and data science (discussed in part one) to an understanding of the current reproducibility crisis in biomedical science and neuroscience. 

Difficulty level: Beginner
Duration: 31:31
Speaker: : Ashley Juavinett

This lecture covers the benefits and difficulties involved when re-using open datasets, and how metadata is important to the process.

Difficulty level: Beginner
Duration: 11:20
Speaker: : Elizabeth DuPre

This lesson provides a quick tour of some data repositories and how to download and manipulate data from them.

Difficulty level: Beginner
Duration: 00:49:06
Speaker: : Sebastian Urchs
Course:

KnowledgeSpace (KS) is a data discoverability portal and neuroscience encyclopedia that was developed to make it easier for the neuroscience community to find publicly available datasets that adhere to the FAIR Principles and to provide an integrated view of neuroscience concepts found in Wikipedia and NeuroLex linked with PubMed and 17 of the world's leading neuroscience repositories. In short, KS provides a single point of entry where reseaerchers can search for a neuroscience concept of interest and receive results that include: i. a description of the term found in Wikipedia/NeuroLex, ii. links to publicly available datasets related to the concept of interest, and iii. up-to-date references that support the concept of interests found in PubMed. APIs are available so that developers of other neuroscience research infrastructures can integrate KS components in their infrastructures. If your repository or your favorite repository is not indexed in KS, please contact us.

 

Difficulty level: Beginner
Duration: 6:14
Speaker: : Heather Topple

In this lesson, attendees will learn about the data structure standards, specifically the Brain Imaging Data Structure (BIDS), an INCF-endorsed standard for organizing, annotating, and describing data collected during neuroimaging experiments. 

Difficulty level: Beginner
Duration: 21:56
Speaker: : Michael Schirner

This presentation by the OHBM OpenScienceSIG covers common scenarios where Git can be extremely valuable. The essentials covered include cloning a repository and keeping it up to date, how to create and use your own repository, and how to contribute to other projects via forking and pull requests.

Difficulty level: Beginner
Duration: 51:55

DataLad is a versatile data management and data publication multi-tool. In this session, you can learn the basic concepts and commands for version control and reproducible data analysis. You’ll get to see, create, and install DataLad datasets of many shapes and sizes, master local version workflows and provenance-captured analysis-execution, and you will get ideas for your next data analysis project.

Difficulty level: Beginner
Duration: 01:29:08
Speaker: : Adina Wagner

This lecture discusses the the importance and need for data sharing in clinical neuroscience.

Difficulty level: Intermediate
Duration: 25:22
Speaker: : Thomas Berger

This lecture gives insights into the Medical Informatics Platform's current and future data privacy model.

Difficulty level: Intermediate
Duration: 17:29
Speaker: : Yannis Ioannidis

This lecture gives an overview on the European Health Dataspace. 

Difficulty level: Intermediate
Duration: 26:33

This is a tutorial on designing a Bayesian inference model to map belief trajectories, with emphasis on gaining familiarity with Hierarchical Gaussian Filters (HGFs).

 

This lesson corresponds to slides 65-90 of the PDF below. 

Difficulty level: Intermediate
Duration: 1:15:04
Speaker: : Daniel Hauke

This lesson contains the first part of the lecture Data Science and Reproducibility. You will learn about the development of data science and what the term currently encompasses, as well as how neuroscience and data science intersect. 

Difficulty level: Beginner
Duration: 32:18
Speaker: : Ariel Rokem

In this lesson, you will learn about the current challenges facing the integration of machine learning and neuroscience. 

Difficulty level: Beginner
Duration: 5:42
Speaker: : Dan Goodman

JupyterHub is a simple, highly extensible, multi-user system for managing per-user Jupyter Notebook servers, designed for research groups or classes. This lecture covers deploying JupyterHub on a single server, as well as deploying with Docker using GitHub for authentication.

Difficulty level: Beginner
Duration: 1:36:27
Speaker: : Thomas Kluyver

This demonstration walks through how to import your data into MATLAB.

Difficulty level: Beginner
Duration: 6:10
Speaker: : MATLAB®

This lesson provides instruction regarding the various factors one must consider when preprocessing data, preparing it for statistical exploration and analyses. 

Difficulty level: Beginner
Duration: 15:10
Speaker: : MATLAB®

This tutorial outlines, step by step, how to perform analysis by group and how to do change-point detection.

Difficulty level: Beginner
Duration: 2:49
Speaker: : MATLAB®