Intro to Reinforcement Learning

Dates

August 2024 - Present

Organization

Hugging Face

Course Length

30 hours

Status

Ongoing

NumPy

Optuna

Gymnasium

Stable Baselines3

Motivation

My interest in Reinforcement Learning (RL) has been steadfast ever since watching the Hide & Go seek video by OpenAI. When DeepMind released AlphaGo and AlphaStar, I was exposed to, and excited by, the application of this new technology to games. After building my bbrbattle site, I speculated that the application of RL could optimize its functionality, and would potentially allow myself and other players to discover new optimal strategies.

Content

In this course I learned a variety of algorithms to apply to different toy RL environments. I learned how to construct my own custom environment, as well as many different on and off-policy algorithms to apply to them. I learned the difference between training a policy compared to a state-value function, how imitation learning works, as well as different training policies such as temporal learning and monte-carlo methods.

@Open AI Hide & Seek

@AlphaGo

@AlphaStar

Certificate coming soon...