Knowledge Sharing for Reinforcement Learning: Writing a BOOK
Simyung Chang¹, YoungJoon Yoo², Jaeseok Choi¹, Nojun Kwak¹
Seoul National University
¹{timelighter, jaeseok.choi, nojunk}@snu.ac.kr, ²yjyoo3312@gmail.com
Abstract
This paper proposes a novel deep reinforcement learning (RL) method that
integrates neural-network-based RL with classical RL based on dynamic programming.
Compared to conventional deep RL methods, our method improves both convergence
speed and performance by exploiting two characteristics of conventional RL
training: (1) having many credible experiences is important for training RL
algorithms, and (2) input states can be semantically clustered into a relatively
small number of core clusters, and states belonging to the same cluster tend to
share similar Q-values for a given action. Based on these two observations, we
propose a dictionary-type memory that accumulates the Q-value for each cluster of
states, along with the corresponding action, according to priority. We then
iteratively update each Q-value in the memory using the Q-value obtained from the
network, which is itself trained on the experiences stored in the memory. We
demonstrate the effectiveness of our method by training RL algorithms on widely
used game environments from OpenAI.
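
The dictionary-type memory described above can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the names `DictionaryMemory`, `Entry`, and `grid_cluster`, the grid-based clustering, the use of a visit count as the priority, the blending rate `lr`, and the eviction rule. The sketch only shows the general idea of keeping one Q-value per (state cluster, action) pair and refreshing it from a network's Q-value.

```python
import math
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence, Tuple

Cluster = Tuple[int, ...]


@dataclass
class Entry:
    q_value: float  # running Q-value estimate for one (cluster, action) pair
    priority: int   # assumed priority: number of times the pair was updated


def grid_cluster(state: Sequence[float], cell: float = 0.5) -> Cluster:
    """Hypothetical clustering: snap each state dimension to a grid cell."""
    return tuple(math.floor(x / cell) for x in state)


class DictionaryMemory:
    """Illustrative memory keyed by (state cluster, action)."""

    def __init__(self, capacity: int, lr: float = 0.5,
                 cluster_fn: Callable[[Sequence[float]], Cluster] = grid_cluster):
        self.capacity = capacity
        self.lr = lr  # assumed blending rate toward the network's Q-value
        self.cluster_fn = cluster_fn
        self.entries: Dict[Tuple[Cluster, int], Entry] = {}

    def update(self, state: Sequence[float], action: int,
               q_from_network: float) -> None:
        """Move the stored Q-value toward the Q-value supplied by the network."""
        key = (self.cluster_fn(state), action)
        entry = self.entries.get(key)
        if entry is None:
            if len(self.entries) >= self.capacity:
                # Assumed eviction rule: drop the lowest-priority entry.
                victim = min(self.entries,
                             key=lambda k: self.entries[k].priority)
                del self.entries[victim]
            self.entries[key] = Entry(q_value=q_from_network, priority=1)
        else:
            entry.q_value += self.lr * (q_from_network - entry.q_value)
            entry.priority += 1

    def top(self, k: int) -> List[Tuple[Cluster, int, float]]:
        """Return the k highest-priority (cluster, action, q_value) triples."""
        ranked = sorted(self.entries.items(),
                        key=lambda kv: kv[1].priority, reverse=True)
        return [(cluster, action, e.q_value)
                for (cluster, action), e in ranked[:k]]
```

In this sketch, two nearby states such as `(0.1, 0.9)` and `(0.2, 0.8)` fall into the same grid cell and therefore share one entry, which mirrors the observation that states in the same cluster tend to have similar Q-values for a given action. A real system would replace `grid_cluster` with a learned or semantic clustering and would obtain `q_from_network` from the trained network.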