RUOCHI.AI


  • Home

  • Tags

  • Categories

  • Archives

  • About

  • projects

  • Search

Assignment 4: Chatbot

Posted on 2020-09-28 | In Artificial Intelligence , Deep Learning |
Assignment 4: Chatbot Welcome to the last assignment of Course 4. Before you get started, we want to congratulate you on getting here. It is your 16t ...
Read more »

Assignment 3: Question Answering

Posted on 2020-09-27 | In Artificial Intelligence , Deep Learning |
Assignment 3: Question AnsweringWelcome to this week’s assignment of course 4. In this you will explore question answering. You will implement the “Te ...
Read more »

Transformer Summarizer

Posted on 2020-09-27 | In Artificial Intelligence , Deep Learning |
Assignment 2: Transformer SummarizerWelcome to the second assignment of course 4. In this assignment you will explore summarization using the transfor ...
Read more »

Using RL to Solve Blackjack

Posted on 2020-09-17 | In Artificial Intelligence , Reinforcement Learning |
转载自: https://github.com/zht007/tensorflow-practice/blob/master/7_Renforcement_Learning_blackjack/强化学习——MC(蒙特卡洛)玩21点扑克游戏.md 1. 关于21点游戏1.1 规则简介21点的游戏规则详 ...
Read more »

Generalized Policy Iteration

Posted on 2020-09-10 | In Artificial Intelligence , Reinforcement Learning |
The key idea of DP, and of reinforcement learning generally, is the use of value functions to organize and structure the search for good policies. As ...
Read more »

Optimal Policies with Dynamic Programming

Posted on 2020-09-10 | In Artificial Intelligence , Reinforcement Learning |
Assignment 2: Optimal Policies with Dynamic ProgrammingWelcome to Assignment 2. This notebook will help you understand: Policy Evaluation and Policy ...
Read more »

Markov Decision Processes II

Posted on 2020-09-06 | In Artificial Intelligence , Reinforcement Learning |
Lesson 1: Policies and Value FunctionsRecognize that a policy is a distribution over actions for each possible state.a policy is a mapping from states ...
Read more »

Markov Decision Processes I

Posted on 2020-09-04 | In Artificial Intelligence , Reinforcement Learning |
Lesson 1: Introduction to Markov Decision ProcessesUnderstand Markov Decision Processes, or MDPsMDPs are a classical formalization of sequential decis ...
Read more »

The K-Armed Bandit Problem

Posted on 2020-09-03 | In Artificial Intelligence , Reinforcement Learning |
Lesson 1: The K-Armed Bandit ProblemDefine rewardIn the k-armed bandit problem, each of the k actions has an expected or mean reward given that that a ...
Read more »

Bandits and Exploration/Exploitation

Posted on 2020-09-03 | In Artificial Intelligence , Reinforcement Learning |
Assignment 1: Bandits and Exploration/ExploitationWelcome to Assignment 1. This notebook will: Help you create your first bandit algorithm Help you u ...
Read more »
123…27
Ruochi Zhang

Ruochi Zhang

263 posts
45 categories
29 tags
RSS
GitHub E-Mail
Friend links
  • HILab
  • Rose
  • Chunxia
© 2019 — 2021 Ruochi Zhang