Mergeable adaptive tile coding method

Meng-yu SHI; Quan LIU; Qi-ming FU

doi:10.11959/j.issn.1000-436x.2015047

您当前的位置：

首页 >

文章列表页 >

Mergeable adaptive tile coding method

Academic papers | 更新时间：2024-06-05

- Mergeable adaptive tile coding method
- Journal on Communications Vol. 36, Issue 2, Pages: 186-192(2015)
- 作者机构：
  
  1. 苏州大学计算机科学与技术学院，江苏苏州 215006
  2. 吉林大学符号计算与知识工程教育部重点实验室，吉林长春 130012
- 作者简介：
- 基金信息：
  
  The National Natural Science Foundation of China(61272005);The National Natural Science Foundation of China(61472262);The Natural Science Foundation of Jiangsu Province(BK2012616)
- DOI：10.11959/j.issn.1000-436x.2015047
  CLC： TP181
- Online First：2015-02，
  
  Published：25 February 2015
- 稿件说明：
移动端阅览
Meng-yu SHI, Quan LIU, Qi-ming FU. Mergeable adaptive tile coding method[J]. Journal on Communications, 2015, 36(2): 186-192.
DOI：

Meng-yu SHI, Quan LIU, Qi-ming FU. Mergeable adaptive tile coding method[J]. Journal on Communications, 2015, 36(2): 186-192. DOI： 10.11959/j.issn.1000-436x.2015047.

摘要

针对自适应 tile coding 算法会产生多余划分的问题，提出一种支持合并的自适应 tile coding 算法——MATC。该算法能够消除传统自适应tile coding算法中产生的多余划分，进一步解决连续状态空间离散化的问题。将MATC算法应用于离散动作连续状态的Mountain Car问题上，实验结果表明，该算法在学习过程中能消除传统tile coding算法的误划分所产生的不良影响，更准确地自动调整划分的精度，并更快地收敛到最佳策略。

Abstract

In order to solve many unnecessary division

mergence supported adaptive tile coding algorithm was presented which would eliminate the unnecessary division.Simulation is conducted on mountain car problem with discrete actions and continuous state space Results show that the proposed method can eliminate the influence of false division in the traditional tile coding method and achieve a more accurate adaptive partition of continuous state space.A higher convergence rate is achieved at the same time.

关键词

Keywords

references

SUTTON R S , BARTO A G . Reinforcement Learning:An Introduction [M ] . Cambridge:MIT Press , 1998 .

LIN C S , KIM H . Selection of learning parameters for CMAC-based adaptive critic learning [J ] . IEEE Trans Neural Networks , 1999 , 6 ( 3 ): 642 - 647 .

PELLEG D , MOORE A , SHROFF N B . X-means:extending K-Means with efficient estimation of the number of clusters [A ] . Proc of the 17th International Conf on Machine Learning [C ] . Boston:Morgan Kaufmann Press , 2000 . 727 - 734 .

PELLEG D , MOORE A . Accelerating exact k-means algorithms with geometric reasoning [A ] . Proc of the fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining [C ] . 1999 . 277 - 281 .

陈宗海 , 文锋 , 聂建斌等 . 基于节点生长 k-均值聚类算法的强化学习方法 [J ] . 计算机研究与发展 , 2006 , 43 ( 4 ): 661 - 666 .

CHEN Z H , WEN F , NIE J B , et al . A reinforcement learning method based on node-growing k-means cluster algorithm [J ] . Journal of Computer Research and Development , 2006 , 43 ( 4 ): 661 - 666 .

文锋 , 陈宗海 , 卓睿等 . 连续状态自适应离散化基于K-均值聚类的强化学习方法 [J ] . 控制与决策 , 2006 , 21 ( 2 ): 143 - 147 .

WEN F , CHEN Z H , ZHUO R , et al . Reinforcement learning method of continuous state adaptively discretized based on K-means clustering [J ] . Control and Decision , 2006 , 21 ( 2 ): 143 - 147 .

顾冬雷 , 陈卫东 , 席裕庚 . 一种基于增强学习的自适应控制方法 [J ] . 控制与决策 , 2002 , 17 ( 4 ): 473 - 479 .

GU D L , CHEN W D , XI Y G . A novel adaptive control algorithm based on reinforcement learning [J ] . Control and Decision , 2002 , 17 ( 4 ): 473 - 479 .

MOORE A W , ATKESON C G . The parti-game algorithm for variable resolution reinforcement learning in multidimensional state spaces [J ] . Machine Learning , 1995 , 21 ( 3 ): 199 - 233 .

UTHER W T B , VELOSO M M . Tree based discretization for continuous state space reinforcement learning [A ] . AAAI’98 [C ] . Madison,Wisconsin,United States , 1998

SHERSTOV A A , STONE P . Function Approximation Via Tile Coding:Automating Parameter Choice Abstraction,Reformulation and Approximation [M ] . Springer Berlin Heidelberg , 2005 : 194 - 205 .

WHITESON S , TAYLOR M E , STONE P . Adaptive tile Coding for Value Function Approximation [M ] . Computer Science Department,University of Texas at Austin , 2007 .

WHITESON S , STONE P . Evolutionary function approximation for reinforcement learning [J ] . The Journal of Machine Learning Research , 2006 , 7 : 877 - 917 .

NOKHBEH-ZAEEM M , KHASHABI D , TALEBI H A , et al . Adaptive tiled neural networks [A ] . 2011 IEEE International Conference on Systems,Man,and Cybernetics (SMC) [C ] . New Orleans,LA,USA , 2011 . 2543 - 2548 .

LIN S , WRIGHT R . Evolutionary tile coding:an automated state abstraction algorithm for reinforcement learning [A ] . AAAI Workshops [C ] . 2010 .

Views

1814

下载量

CSCD

Alert me when the article has been cited

提交

Tools

Publicity Resources

Actor-critic algorithm with incremental dual natural policy gradient

RL-WGAN based method for 5G network anomalous data generation

Study on Co-EDCA mechanism for multi-AP collaboration in FTTR C-WAN architecture

Design and implementation of an IPv6+ based intelligent computing-network scheduling scheme for Internet of vehicles

Survey of node localization scheme in underwater wireless sensor network

Related Author

Peng ZHANG

Shan ZHONG

Jian-wei ZHAI

Wei-sheng QIAN

Ning Zhaolong

Zou Daoyuan

Zhou Li

Ouyang Ruiqi

Related Institution

Collaborative Innovation Center of Novel Software Technology and Industrialization

College of Electronic Science and Technology, National University of Defense Technology

School of Communications and Information Engineering, Chongqing University of Posts and Telecommunications

School of Electronic Information and Communication, Huazhong University of Science and Technology

Intelligent and Connected Vehicle Research Institute, China Unicom Smart Connection Technology Limited

AI问答

⁰