Search results: resource list
q-learning.rar
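- A Q-learning algorithm for reinforcement learning. The algorithm progressively reinforces certain actions. Hopefully useful to anyone who wants to use Q-learning. (ReinforcementLearning, Q learning)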
Q-algorithm-of-RL
- MATLAB code for the Q algorithm in reinforcement learning, suited to finding an optimal path. It should give you an intuitive understanding of the Q algorithm.
ReinforcementLearning
- Source code for reinforcement learning; the quality is reasonable.
ReinforcementLearning
- A MATLAB simulation example of a reinforcement learning (Q-learning) algorithm. Hopefully it helps.
TicTacToe
- Applies a reinforcement learning algorithm to the game of Tic-tac-toe.
Brainstormers05PublicRelease
- The 2D simulation match program of the bs team. Its intelligent algorithms and reinforcement learning algorithms are worth studying.
rl_cmac
- A reinforcement learning algorithm that uses a CMAC neural network to map the reinforcement learning state space to the action space.
catandmouse
- A reinforcement learning example: catchmouse. The example uses several algorithms: the Monte Carlo method, the TD method, and the SARSA method.
Wanmen University reinforcement learning code: RW model + TD model
- Wanmen University, reinforcement learning. Python code implementing the RW model, V(CS) = V(CS) + A * (V(US) * us - V(CS) * cs), and the TD model, V(S(t)) = V(S(t)) + a * [R(t+1) + r * V(S(t+1)) - V(S(t))].
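The two update rules quoted in this entry can be sketched in Python as follows. This is a minimal illustration and not the code from the listed archive. The function names, the parameter values, and the reading of "RW" as the Rescorla-Wagner model (suggested by the CS/US notation) are assumptions.

```python
def rw_update(v_cs, v_us, cs, us, alpha):
    """One RW step: V(CS) <- V(CS) + A * (V(US) * us - V(CS) * cs).

    cs and us are presence indicators (1 if the stimulus occurred, else 0).
    """
    return v_cs + alpha * (v_us * us - v_cs * cs)


def td_update(values, s, s_next, reward, alpha, gamma):
    """One TD step: V(S(t)) <- V(S(t)) + a * [R(t+1) + r * V(S(t+1)) - V(S(t))].

    Updates the dict `values` in place and returns the TD error.
    """
    td_error = reward + gamma * values[s_next] - values[s]
    values[s] += alpha * td_error
    return td_error


if __name__ == "__main__":
    # Repeated pairing of CS and US pulls V(CS) toward V(US).
    v = 0.0
    for _ in range(50):
        v = rw_update(v, v_us=1.0, cs=1, us=1, alpha=0.1)
    print(v)
```

Here `alpha` plays the role of A (or a) in the formulas and `gamma` the role of r.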
Wanmen University reinforcement learning: GridEvalu model and GridPolicy model
- Wanmen University, reinforcement learning. Code implementing the Grid_world_evaluation model, V(S) = V(S) + A * (R(S) + r * V(new_S) - V(S)), and the Grid_world_Policy model, P(S) = P(S) + A * (R(S) + r * P(new_S) - P(S)).
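The evaluation update quoted in this entry can be tried on a toy grid, as sketched below. This is not the listed code. The grid size, the uniformly random walk, the step cost of -1 per non-goal state, and all names are assumptions made for illustration.

```python
import random


def evaluate_grid(rows=3, cols=3, goal=(2, 2), alpha=0.1, gamma=0.9,
                  episodes=200, seed=0):
    """Estimate V(S) under a random walk using
    V(S) <- V(S) + A * (R(S) + r * V(new_S) - V(S))."""
    rng = random.Random(seed)
    values = {(r, c): 0.0 for r in range(rows) for c in range(cols)}
    moves = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    for _ in range(episodes):
        s = (0, 0)
        while s != goal:
            dr, dc = rng.choice(moves)
            # Clamp to the grid, so a move into a wall leaves the agent in place.
            new_s = (min(max(s[0] + dr, 0), rows - 1),
                     min(max(s[1] + dc, 0), cols - 1))
            reward = -1.0  # assumed R(S): a fixed cost for every non-goal state
            values[s] += alpha * (reward + gamma * values[new_s] - values[s])
            s = new_s
    return values


if __name__ == "__main__":
    v = evaluate_grid()
    for r in range(3):
        print([round(v[(r, c)], 2) for c in range(3)])
```

The goal state is terminal and never updated, so its value stays at 0, while every visited state ends up with a negative value that reflects how many steps the random walk tends to need from there.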
A survey of deep reinforcement learning
- A survey article introducing deep reinforcement learning; very helpful for beginners.
Research on autonomous navigation of mobile robots based on reinforcement learning
- A study of reinforcement-learning-based autonomous navigation for mobile robots; useful for mobile robot research.
Reinforcement learning_inverted pendulum_Matlab program
- A reinforcement learning program implemented in MATLAB for controlling a single inverted pendulum. The program has been debugged and runs normally.
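Deep-reinforcement-learning-based algorithm for autonomous robot movement in complex environments
- This algorithm uses deep reinforcement learning to imitate human walking strategies and build a collision-avoidance model. It trains a robot chassis to generate speed and heading-angle commands in a human-like way in complex, changing environments, so that the robot moves autonomously.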
MATLAB code for solving optimal control with reinforcement learning
- MATLAB code that uses reinforcement learning to solve optimal control problems; the code runs.
Reinforcement learning
- A basic reinforcement learning Q-learning algorithm, helpful for beginners trying to understand Q-learning. The program includes a run script, a Q-learning algorithm script, and an environment script.
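The three-part layout this entry describes (environment, Q-learning algorithm, run script) might look roughly like the sketch below. It is not the listed program. `ChainEnv`, `q_learning`, the five-state corridor, and the hyperparameters are all hypothetical choices for illustration.

```python
import random


class ChainEnv:
    """Hypothetical environment: states 0..n-1 in a row, goal at the right end."""

    def __init__(self, n=5):
        self.n = n
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        # Action 0 moves left, action 1 moves right; walls clamp the position.
        delta = 1 if action == 1 else -1
        self.state = min(max(self.state + delta, 0), self.n - 1)
        done = self.state == self.n - 1
        reward = 1.0 if done else 0.0
        return self.state, reward, done


def q_learning(env, episodes=300, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(env.n)]
    for _ in range(episodes):
        s = env.reset()
        done = False
        while not done:
            # Explore with probability epsilon, and also when the two values tie.
            if rng.random() < epsilon or q[s][0] == q[s][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[s][0] > q[s][1] else 1
            s_next, r, done = env.step(a)
            target = r if done else r + gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])
            s = s_next
    return q


if __name__ == "__main__":
    # "Run script": train, then print the greedy action for each non-goal state.
    q_table = q_learning(ChainEnv())
    print([row.index(max(row)) for row in q_table[:-1]])
```

Keeping the environment behind `reset` and `step` means the same `q_learning` function can be reused with a different environment, as long as it has two actions and exposes the number of states as `n`.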
Reinforcement learning
- Implements policy gradient and Markov decision processes with reinforcement learning.
(Introduction to reinforcement learning) David Silver
- The Chinese edition of Deep Learning. It covers the various deep learning algorithms in detail, along with some of the required basics, such as matrix and vector norms.
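MATLAB code for reinforcement learning (grid-walking mini program)
- Study code, shared on: a small MATLAB program built with a reinforcement learning algorithm that learns to walk through a grid.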
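《深入浅出强化学习:原理入门》 (an accessible introduction to the principles of reinforcement learning), high-resolution PDF
- High-resolution PDF of 《深入浅出强化学习:原理入门》 with editable content (reinforcement learning).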