扫码访问手机版
0 人学习
扫码访问手机版
课程目录
会员 1. Introduction
00:00会员 2. Markov decision processes (MDPs), POMDPs
00:00会员 3. Solving known MDPs: Dynamic Programming
00:00会员 OpenAI Gym & TensorFlow
00:00会员 4. Monte Carlo learning: value function (VF) estimation and optimization
00:00会员 Temporal difference learning: VF estimation and optimization, Q learning, SARS
00:00会员 Lecture 06
00:00会员 Deep Learning
00:00