Gradient descent Sarsa(?)algorithm based on the adaptive potential function shaping reward mechanism
Papers|更新时间:2024-06-05
|
Gradient descent Sarsa(?)algorithm based on the adaptive potential function shaping reward mechanism
Journal on CommunicationsVol. 34, Issue 1, Pages: 77-89(2013)
作者机构:
1. 苏州大学 计算机科学与技术学院,江苏 苏州 215006
2. 吉林大学 符号计算与知识工程教育部重点实验室,吉林 长春 130012
作者简介:
基金信息:
The National Natural Science Foundation of China(61070223);The National Natural Science Foundation of China(61103045);The National Natural Science Foundation of China(61070122);The National Natural Science Foundation of China(61272005);The Natural Science Foundation of Jiangsu Province(BK2012616);The High School Natural Foundation of Jiangsu Province(09KJA520002);The High School Natural Foundation of Jiangsu Province(09KJB520012);The Foundation of Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education,Jilin University(93K172012K04)
Fei XIAO, Quan LIU, Qi-ming FU, et al. Gradient descent Sarsa(?)algorithm based on the adaptive potential function shaping reward mechanism[J]. Journal on Communications, 2013, 34(1): 77-89.
DOI:
Fei XIAO, Quan LIU, Qi-ming FU, et al. Gradient descent Sarsa(?)algorithm based on the adaptive potential function shaping reward mechanism[J]. Journal on Communications, 2013, 34(1): 77-89. DOI: 1000-436X(2013)01-0077-12.
Gradient descent Sarsa(?)algorithm based on the adaptive potential function shaping reward mechanism