RUOCHI.AI


  • Home

  • Tags

  • Categories

  • Archives

  • About

  • projects

  • Search

Artificial IntelligenceCategory

CS224W - Colab 1

01-22

Implement your agent

10-19

Average Reward Softmax Actor-Critic

10-16

Function Approximation and Control

10-15

Semi Gradient TD with a Neural Network

10-14

TD with State Aggregation

10-13

Dyna-Q and Dyna-Q+

09-30

Planning and learning with Tabular Methods

09-29

Q-Learning and Expected Sarsa

09-29

Policy Evaluation in Cliff Walking Environment

09-28
12…14
Ruochi Zhang

Ruochi Zhang

263 posts
45 categories
29 tags
RSS
GitHub E-Mail
Friend links
  • HILab
  • Rose
  • Chunxia
© 2019 — 2021 Ruochi Zhang