This tutorial covers how to simulate a Hidden Markov Model (HMM) and observe how changing the transition probability and observation noise impacts what the samples look like. Then we'll look at how uncertainty increases as we make future predictions without evidence (from observations) and how to gain information from the observations.
This tutorial covers how to infer a latent model when our states are continuous. Particular attention is paid to the Kalman filter and its mathematical foundation.
This lecture covers multiple topics on dynamical neural modeling and inference and their application to basic neuroscience and neurotechnology design.
This lecture provides an introduction to optimal control, different types of control, as well as potential applications.
In this tutorial, you will perform a Sequential Probability Ratio Test between two hypotheses HL and HR by running simulations of a Drift Diffusion Model (DDM).
In this tutorial, you will implement a continuous control task: you will design control inputs for a linear dynamical system to reach a target state.
This lecture covers the utility of action, including the vigor and neuroeconomics of movement and applications to foraging and the marginal value theorem.
Question and Answer Session for participants in the Americas.
Question and Answer Session for participants in Africa and Europe.
Question and Answer Session for participants in Asia and Australia.
Question and Answer session for participants in Africa and Europe.
Question and Answer session for participants in Asia and Australia.
Question and Answer session for participants in the Americas.
This lecture provides an introduction to a variety of topics in reinforcement learning.
This tutorial presents how to estimate state-value functions in a classical conditioning paradigm using Temporal Difference (TD) learning and examine TD-errors at the presentation of the conditioned and unconditioned stimulus (CS and US) under different CS-US contingencies. These exercises will provide you with an understanding of both how reward prediction errors (RPEs) behave in classical conditioning and what we should expect to see if dopamine represents a "canonical" model-free RPE.
In this tutorial, you will use 'bandits' to understand the fundamentals of how a policy interacts with the learning algorithm in reinforcement learning.
In this tutorial, you will learn how to act in the more realistic setting of sequential decisions, formalized by Markov Decision Processes (MDPs). In a sequential decision problem, the actions executed in one state not only may lead to immediate rewards (as in a bandit problem), but may also affect the states experienced next (unlike a bandit problem). Each individual action may therefore affect affect all future rewards. Thus, making decisions in this setting requires considering each action in terms of their expected cumulative future reward.
In this tutorial, you will implement one of the simplest model-based reinforcement learning algorithms, Dyna-Q. You will understand what a world model is, how it can improve the agent's policy, and the situations in which model-based algorithms are more advantageous than their model-free counterparts.
This lecture highlights up-and-coming issues in the neuroscience of reinforcement learning.
BioImage Suite is an integrated image analysis software suite developed at Yale University. BioImage Suite has been extensively used at different labs at Yale since about 2001.