Resource search results
markov.m
- A simple illustration of a Markov field ("champ de Markov")
WindyGridWorldQLearning
- Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively
3STATSMS
- A MATLAB implementation of a 3-state transition probability matrix, based on the Markov chain concept
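
For readers unfamiliar with the idea behind the last entry, a 3-state Markov chain can be sketched as follows. This is not the code from 3STATSMS (which is a MATLAB package whose contents are not shown here); it is a minimal Python illustration, and the matrix values, function names, and seed are all assumptions chosen for the example.

```python
import random

# Hypothetical 3-state transition matrix (values are illustrative only).
# Row i holds the probabilities P(next state = j | current state = i).
P = [
    [0.5, 0.3, 0.2],
    [0.2, 0.6, 0.2],
    [0.1, 0.3, 0.6],
]


def step_distribution(dist, matrix):
    """Propagate a state distribution one step (row vector times matrix)."""
    n = len(matrix)
    return [sum(dist[i] * matrix[i][j] for i in range(n)) for j in range(n)]


def n_step_distribution(dist, matrix, steps):
    """Apply step_distribution repeatedly to get the distribution after `steps` steps."""
    for _ in range(steps):
        dist = step_distribution(dist, matrix)
    return dist


def simulate(matrix, start, steps, rng=None):
    """Sample one trajectory of state indices, starting from `start`."""
    rng = rng or random.Random(0)  # fixed seed (an assumption) for repeatability
    state = start
    path = [state]
    for _ in range(steps):
        # Draw the next state using the current state's row as weights.
        state = rng.choices(range(len(matrix)), weights=matrix[state])[0]
        path.append(state)
    return path
```

Calling `n_step_distribution([1.0, 0.0, 0.0], P, 10)` gives the state probabilities ten steps after starting in state 0, and `simulate(P, 0, 10)` samples one possible path of length 11.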