Markov Decision Processes 1 - Value Iteration | Stanford CS221: AI (Autumn 2019)

Published On Jan 8, 2020

For more information about Stanford’s Artificial Intelligence professional and graduate programs, visit: https://stanford.io/3pUNqG7

Topics: MDP1, Search review, Project
Percy Liang, Associate Professor & Dorsa Sadigh, Assistant Professor - Stanford University
http://onlinehub.stanford.edu/

Associate Professor Percy Liang
Associate Professor of Computer Science and Statistics (courtesy)
https://profiles.stanford.edu/percy-l...

Assistant Professor Dorsa Sadigh
Assistant Professor in the Computer Science Department & Electrical Engineering Department
https://profiles.stanford.edu/dorsa-s...

To follow along with the course schedule and syllabus, visit:
https://stanford-cs221.github.io/autu...

Chapters:
0:00 Intro
2:12 Course Plan
3:45 Applications
10:48 Rewards
18:46 Markov Decision Process
19:33 Transitions
20:45 Transportation Example
29:28 What is a Solution?
30:58 Roadmap
36:36 Evaluating a policy: volcano crossing
37:38 Discounting
53:21 Policy evaluation computation
55:23 Complexity
57:10 Summary so far

#artificialintelligencecourse
