CS234 / Lecture 5 - Value Function Approximation

Resources:

Value Function Approximation

Review: Gradient Descent
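
As a reminder of the update this section reviews, here is a minimal sketch of plain gradient descent, w <- w - alpha * grad J(w). The quadratic objective, the function names, and the step size are assumptions chosen for illustration; none of them come from the lecture.

```python
def gradient_descent(grad, w0, alpha=0.1, steps=100):
    """Repeatedly step against the gradient: w <- w - alpha * grad(w)."""
    w = list(w0)
    for _ in range(steps):
        g = grad(w)
        w = [wi - alpha * gi for wi, gi in zip(w, g)]
    return w


# Hypothetical objective J(w) = (w[0] - 3)^2 + (w[1] + 1)^2,
# whose minimum is at w = [3, -1].
def grad_J(w):
    return [2.0 * (w[0] - 3.0), 2.0 * (w[1] + 1.0)]


w_star = gradient_descent(grad_J, [0.0, 0.0])
```

With this step size the distance to the minimizer shrinks by a constant factor each step, so w_star ends up very close to [3, -1].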

Value Function Approximation for Policy Evaluation

Model Free Value Function Approximation

Feature Vectors

Linear Value Function Approximation
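
For reference, the standard form of linear value function approximation can be written as below. The notation (feature vector x(s), weights w, step size alpha) is an assumption here, since the notes do not fix symbols, and the update assumes access to the true value V^pi(s).

```latex
\hat{V}(s; w) = x(s)^\top w = \sum_{j=1}^{n} x_j(s)\, w_j

J(w) = \mathbb{E}_{\pi}\!\left[ \left( V^{\pi}(s) - \hat{V}(s; w) \right)^2 \right]

\Delta w = -\tfrac{1}{2}\,\alpha\,\nabla_w J(w)
         = \alpha\, \mathbb{E}_{\pi}\!\left[ \left( V^{\pi}(s) - \hat{V}(s; w) \right) x(s) \right]
```

In the sampled (stochastic) version, each update is step size times prediction error times feature vector.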

Monte Carlo Value Function Approximation

Convergence Guarantees for Linear Value Function Approximation for Policy Evaluation

Batch Monte Carlo Value Function Approximation

Temporal Difference Learning with Value Function Approximation
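
A minimal sketch of semi-gradient TD(0) policy evaluation with a linear approximator, assuming episodes are given as lists of (state, reward, next_state) transitions with next_state set to None at termination. The function name, the episode format, and the toy chain below are all hypothetical and used only for illustration.

```python
def td0_linear(episodes, features, n_features, alpha=0.1, gamma=1.0):
    """Semi-gradient TD(0): w <- w + alpha * delta * x(s)."""
    w = [0.0] * n_features

    def v(s):
        # The value of a terminal state (None) is defined as 0.
        if s is None:
            return 0.0
        return sum(wi * xi for wi, xi in zip(w, features(s)))

    for episode in episodes:
        for s, r, s_next in episode:
            # TD error; the bootstrapped target is treated as a constant.
            delta = r + gamma * v(s_next) - v(s)
            for i, xi in enumerate(features(s)):
                w[i] += alpha * delta * xi
    return w


# Toy deterministic chain 0 -> 1 -> 2 -> terminal, reward 1 per step.
# With one-hot features the linear approximator reduces to a table.
def one_hot(s):
    return [1.0 if i == s else 0.0 for i in range(3)]


episode = [(0, 1.0, 1), (1, 1.0, 2), (2, 1.0, None)]
w = td0_linear([episode] * 500, one_hot, 3)
```

On this chain with gamma = 1 the true values are 3, 2, and 1 for states 0, 1, and 2, and the learned weights approach those values.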

Control Using Value Function Approximation

Action-Value Approximation with an Oracle

Linear State Action Value Approximation with an Oracle

Incremental Model-Free Control Approaches

Convergence of TD Methods with VFA

Convergence Guarantees:

Algorithm           | Tabular | Linear VFA         | Nonlinear VFA
Monte Carlo Control |         | ✅ (may oscillate) |
SARSA               |         | ✅ (may oscillate) |
Q-Learning          |         |                    |