Markov Decision Process (MDP) is a () process, means action outcomes depend only on the current state.马可夫决策过程(MDP)是一种()过程,意味着动作结果仅仅依赖于当前状态。
Markov Decision Process (MDP) is a () process, means action outcomes depend only on the current state.马可夫决策过程(MDP)是一种()过程,意味着动作结果仅仅依赖于当前状态。
正确答案:discrete time stochastic control
离散时间随机控制