超级超级小天才

IP属地：澳门

超全！深度强化学习领域值得一读的论文列表
参考自：https://spinningup.openai.com/en/latest/spinningup/keypapers.html[ht...

0.1 672 0 1
[TRPO] Trust Region Policy Optimization
论文链接：http://proceedings.mlr.press/v37/schulman15[http://proceedings.mlr....

0.1 1067 0 1

[DDPG] Continuous Control with Deep Reinforcement Learning
论文链接：https://arxiv.org/abs/1509.02971[https://arxiv.org/abs/1509.02971]引...

0.1 606 0 1
[DQN] Playing Atari with Deep Reinforcement Learning
论文链接：https://arxiv.org/abs/1312.5602[https://arxiv.org/abs/1312.5602]引用：...

0.1 728 0 1
[Chapter 6] Reinforcement Learning (4) Policy Search
In the previous sections, we try to learn the utility function, or more ...

0.7 305 0 2
[Chapter 5] Reinforcement Learning (3) Function Approximation and Going Deep
Function Approximation While we are learning the Q-functions, but how to...

0.4 230 0 2
[Chapter 4] Reinforcement Learning (2) Model-Free Method
Model-Free RL Method In model-based method, we need firstly model the en...

0.1 361 0 2

[Chapter 3] Reinforcement Learning (1) Model-Based Method
Reinforcement Learning Firstly, we assume that all the environments in t...

0.1 262 0 2
[Chapter 2] Value Iteration and Policy Iteration
We now know the most important thing for computing an optimal policy is ...

0.1 311 0 1