Tutorial 2: Learning to Act: Multi-Armed Bandits

Neuromatch Academy

Difficulty level

Beginner

Speaker

Type

Duration

6:55

Topic

Computational neuroscience

In this tutorial, you will use 'bandits' to understand the fundamentals of how a policy interacts with the learning algorithm in reinforcement learning.

Topics covered in this lesson

The fundamental tradeoff between exploration and exploitation in a policy
How the learning rate interacts with exploration to find the best available action

External Links

Tutorial Exercises

Tutorial Slides

Neuromatch Academy

Prerequisites

Experience with Python Programming Language