VizuaraVizuara AI Pods

Basics of Reinforcement Learning

From the agent-environment loop to Bellman equations, rewards, and your first OpenAI Gymnasium agent -- all from first principles.

beginner~4 hours3 notebooksReinforcement Learning FundamentalsThe Four Elements of RLMarkov Decision ProcessesRewards, Returns, and DiscountingOpenAI GymnasiumQ-Learning

Curator of this Module

Dr. Rajat Dandekar

Dr. Rajat Dandekar

Course Instructor

Dr. Rajat Dandekar is a researcher and educator specializing in AI/ML, with a passion for making complex concepts accessible through intuitive explanations and hands-on learning.

Checking access…

Learning Path

Article
1
Notebook 1
2
Notebook 2
3
Notebook 3
Case Study
Certificate