Bayesian Q learning method with Dyna architecture and prioritized sweeping
academic paper|更新时间:
|
Bayesian Q learning method with Dyna architecture and prioritized sweeping
Communication JournalVol. 34, Issue 11, Pages: 129-139(2013)
作者机构:
作者简介:
基金信息:
The National Natural Science Foundation of China(61070223);The National Natural Science Foundation of China(61103045);The National Natural Science Foundation of China(61070122);The National Natural Science Foundation of China(61272005);The Natural Science Foundation of Jiangsu Province(BK2012616);The High School Natural Foundation of Jiangsu Province(09KJA520002);The High School Natural Foundation of Jiangsu Province(09KJB520012);The Foundation of Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education,Jilin University(93K172012K04)