Value Functions and Q-Learning
From Bellman's recursive insight to teaching machines to learn optimal behavior through trial and error.
intermediate~4 hours4 notebooksState Value FunctionsAction Value Functions (Q-Values)The Bellman EquationThe Bellman Optimality EquationQ-LearningExploration vs ExploitationDeep Q-Networks (DQN)
Curator of this Module
Dr. Rajat Dandekar
Course Instructor
Dr. Rajat Dandekar is a researcher and educator specializing in AI/ML, with a passion for making complex concepts accessible through intuitive explanations and hands-on learning.
Checking access…
Learning Path
Article
1
Notebook 12
Notebook 23
Notebook 34
Notebook 4Case Study
Certificate