Value Functions and Q-Learning

From Bellman's recursive insight to teaching machines to learn optimal behavior through trial and error.

Intermediate, ~4 hours, 4 notebooks

Topics: State Value Functions, Action Value Functions (Q-Values), The Bellman Equation, The Bellman Optimality Equation, Q-Learning, Exploration vs Exploitation, Deep Q-Networks (DQN)

Curator of this Module

Dr. Rajat Dandekar

Course Instructor

Dr. Rajat Dandekar is a researcher and educator specializing in AI/ML, with a passion for making complex concepts accessible through intuitive explanations and hands-on learning.

Learning Path

Article

Notebook 1

Notebook 2

Notebook 3

Notebook 4

Case Study

Certificate

Learning Path

Read the Article

Practice with Notebooks

Apply Your Knowledge

Get Your Certificate