Citation: | YANG Xudong, LIU Quan, JING Ling, et al., “A Scalable Parallel Reinforcement Learning Method Based on Divide-and-Conquer Strategy,” Chinese Journal of Electronics, vol. 22, no. 2, pp. 242-246, 2013. |
R.M. Kretchmar, "Parallel reinforcement learning", Proc. of the 6th World Conference on Systemics, Cybernetics, and Informatics, Orlando, Florida, USA, pp.114-118, 2002.
|
D. Wingate, K.D. Seppi, "P3VI: A partitioned, prioritized, parallel value iterator", Proc. of the 21st International Conference on Machine Learning, Banff, Alberta, Canada, pp.109-116, 2004.
|
W. Meng, X.D. Han, "Parallel reinforcement learning algorithm and its application", Chinese Computer Engineering and Applications, Vol.45, No.34, pp.25-28, 2009.
|
M. Kaya, A. Arslan, "Parallel and distributed multi-agent reinforcement learning", Proc. of the 8th International Conference on Parallel and Distributed Systems, KyongJu City, Korea, pp.437-441, 2001.
|
J. Shi, J. Malik, "Normalized cuts and image segmentation", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.22, No.8, pp.888-905, 2000.
|
J.B. MacQueen, "Some methods for classification and analysis of multivariate observations", Proc. of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, California, USA, pp.281-297, 1967.
|
R.S. Sutton, A.G. Barto, Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, USA, 1998.
|
J. Holland, Adaptation in Natural and Artificial Systems. University of Michigan Press, Ann Arbor, Michigan, USA, 1975.
|
J.N. Tsitsiklis, "Asynchronous stochastic approximation and Q-learning", Machine Learning, Vol.16, No.3, pp.185-202, 1994.
|
R.M. Kretchmar, "Reinforcement learning algorithms for homogenous multi-agent systems", Workshop on Agent and Swarm Programming, Cleveland, OH, USA, 2003.
|
A.M. Printista, M.L. Errecalde, C.I. Montoya, "A parallel implementation of Q-learning based on communication with cache", Journal of Computer Science & Technology, Vol.6, No.1, pp.268-278, 2002.
|