Journal on CommunicationsVol. 36, Issue 2, Pages: 186-192(2015)
作者机构:
1. 苏州大学 计算机科学与技术学院,江苏 苏州 215006
2. 吉林大学 符号计算与知识工程教育部重点实验室,吉林 长春 130012
作者简介:
基金信息:
The National Natural Science Foundation of China(61272005);The National Natural Science Foundation of China(61472262);The Natural Science Foundation of Jiangsu Province(BK2012616)
mergence supported adaptive tile coding algorithm was presented which would eliminate the unnecessary division.Simulation is conducted on mountain car problem with discrete actions and continuous state space Results show that the proposed method can eliminate the influence of false division in the traditional tile coding method and achieve a more accurate adaptive partition of continuous state space.A higher convergence rate is achieved at the same time.
关键词
Keywords
references
SUTTON R S , BARTO A G . Reinforcement Learning:An Introduction [M ] . Cambridge:MIT Press , 1998 .
LIN C S , KIM H . Selection of learning parameters for CMAC-based adaptive critic learning [J ] . IEEE Trans Neural Networks , 1999 , 6 ( 3 ): 642 - 647 .
PELLEG D , MOORE A , SHROFF N B . X-means:extending K-Means with efficient estimation of the number of clusters [A ] . Proc of the 17th International Conf on Machine Learning [C ] . Boston:Morgan Kaufmann Press , 2000 . 727 - 734 .
PELLEG D , MOORE A . Accelerating exact k-means algorithms with geometric reasoning [A ] . Proc of the fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining [C ] . 1999 . 277 - 281 .
CHEN Z H , WEN F , NIE J B , et al . A reinforcement learning method based on node-growing k-means cluster algorithm [J ] . Journal of Computer Research and Development , 2006 , 43 ( 4 ): 661 - 666 .
WEN F , CHEN Z H , ZHUO R , et al . Reinforcement learning method of continuous state adaptively discretized based on K-means clustering [J ] . Control and Decision , 2006 , 21 ( 2 ): 143 - 147 .
GU D L , CHEN W D , XI Y G . A novel adaptive control algorithm based on reinforcement learning [J ] . Control and Decision , 2002 , 17 ( 4 ): 473 - 479 .
MOORE A W , ATKESON C G . The parti-game algorithm for variable resolution reinforcement learning in multidimensional state spaces [J ] . Machine Learning , 1995 , 21 ( 3 ): 199 - 233 .
UTHER W T B , VELOSO M M . Tree based discretization for continuous state space reinforcement learning [A ] . AAAI’98 [C ] . Madison,Wisconsin,United States , 1998
SHERSTOV A A , STONE P . Function Approximation Via Tile Coding:Automating Parameter Choice Abstraction,Reformulation and Approximation [M ] . Springer Berlin Heidelberg , 2005 : 194 - 205 .
WHITESON S , TAYLOR M E , STONE P . Adaptive tile Coding for Value Function Approximation [M ] . Computer Science Department,University of Texas at Austin , 2007 .
WHITESON S , STONE P . Evolutionary function approximation for reinforcement learning [J ] . The Journal of Machine Learning Research , 2006 , 7 : 877 - 917 .
NOKHBEH-ZAEEM M , KHASHABI D , TALEBI H A , et al . Adaptive tiled neural networks [A ] . 2011 IEEE International Conference on Systems,Man,and Cybernetics (SMC) [C ] . New Orleans,LA,USA , 2011 . 2543 - 2548 .
LIN S , WRIGHT R . Evolutionary tile coding:an automated state abstraction algorithm for reinforcement learning [A ] . AAAI Workshops [C ] . 2010 .