Search results
Wanmen University reinforcement learning algorithm code: RW model + TD model
- Wanmen University, reinforcement learning: Python implementation of the RW model algorithm, V(CS) = V(CS) + A * (V(US) * us - V(CS) * cs), and of the TD model, V(S(t)) = V(S(t)) + a * [R(t+1) + r * V(S(t+1)) - V(S(t))].
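The two update rules quoted in this entry can be sketched in a few lines of Python. This is a minimal illustration, not the uploaded course code: it assumes RW stands for Rescorla-Wagner, that `us` and `cs` are presence flags for the unconditioned and conditioned stimulus, that `A`/`a` is a learning rate, and that `r` is a discount factor. The function names `rw_update` and `td_update` are hypothetical.

```python
def rw_update(v_cs, v_us, alpha, us=1.0, cs=1.0):
    """One RW step: V(CS) <- V(CS) + A * (V(US) * us - V(CS) * cs).

    us and cs are assumed to be 1.0 when the stimulus is present, else 0.0.
    """
    return v_cs + alpha * (v_us * us - v_cs * cs)


def td_update(values, s, s_next, reward, alpha, gamma):
    """One TD step: V(S_t) <- V(S_t) + a * [R_{t+1} + r * V(S_{t+1}) - V(S_t)].

    values is a mutable list (or dict) of state values, updated in place.
    """
    values[s] += alpha * (reward + gamma * values[s_next] - values[s])
    return values[s]


# Acquisition: repeated CS-US pairings drive V(CS) toward V(US).
v_cs = 0.0
for _ in range(50):
    v_cs = rw_update(v_cs, v_us=1.0, alpha=0.2)

# A single TD step on a two-state chain with reward 1.0 on the transition.
values = [0.0, 0.0]
td_update(values, s=0, s_next=1, reward=1.0, alpha=0.5, gamma=0.9)
```

With both stimuli present, each RW step closes a fixed fraction `alpha` of the gap between V(CS) and V(US), so the loop above ends with `v_cs` very close to 1.0.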
Wanmen University reinforcement learning: GridEvalu model, GridPolicy model
- Wanmen University, reinforcement learning: code implementation of the Grid_world_evaluation model algorithm, V(S) = V(S) + A * (R(S) + r * V(new_S) - V(S)), and of the Grid_world_Policy model, P(S) = P(S) + A * (R(S) + r * P(new_S) - P(S)).
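As a rough sketch of the evaluation rule quoted in this entry, the Python below applies V(S) = V(S) + A * (R(S) + r * V(new_S) - V(S)) on a small grid. Everything about the environment is an assumption made for illustration, not taken from the uploaded code: a 3x3 grid, a goal in the bottom-right corner, a reward of -1 for every non-goal state, a uniformly random policy, and the hypothetical names `step`, `reward`, `update`, and `evaluate_random_policy`.

```python
import random

ROWS, COLS = 3, 3                              # assumed grid size
GOAL = (2, 2)                                  # assumed terminal state
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]   # up, down, left, right


def step(state, action):
    """Move one cell; moves that would leave the grid keep the agent in place."""
    row = min(max(state[0] + action[0], 0), ROWS - 1)
    col = min(max(state[1] + action[1], 0), COLS - 1)
    return (row, col)


def reward(state):
    """R(S): assumed to be -1 in every non-goal state and 0 at the goal."""
    return 0.0 if state == GOAL else -1.0


def update(table, s, new_s, alpha, gamma):
    """table(S) <- table(S) + A * (R(S) + r * table(new_S) - table(S))."""
    table[s] += alpha * (reward(s) + gamma * table[new_s] - table[s])


def evaluate_random_policy(episodes=500, alpha=0.1, gamma=0.9, seed=0):
    """Estimate V under a uniformly random policy, starting from (0, 0)."""
    rng = random.Random(seed)
    V = {(r, c): 0.0 for r in range(ROWS) for c in range(COLS)}
    for _ in range(episodes):
        s = (0, 0)
        while s != GOAL:
            new_s = step(s, rng.choice(ACTIONS))
            update(V, s, new_s, alpha, gamma)
            s = new_s
    return V


V = evaluate_random_policy()
```

The Grid_world_Policy rule in the entry has the same form with P in place of V, so under these assumptions `update` could be applied to a second table `P` without changes. How the course code actually derives or uses P is not stated in the listing.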
Wanmen LaTeX course materials
- Wanmen University LaTeX course materials, including source code and templates.