There are two optimal policies for Dynamic Programming, one is （）, and the other is policy iteration.动态规划有两种优化策略，一个是（），而另一种是策略迭代。_精华吧

There are two optimal policies for Dynamic Programming, one is （）, and the other is policy iteration.动态规划有两种优化策略，一个是（），而另一种是策略迭代。

精华吧→答案→慕课→未分类

There are two optimal policies for Dynamic Programming, one is （）, and the other is policy iteration.动态规划有两种优化策略，一个是（），而另一种是策略迭代。

正确答案：value iteration

Tag：人工智能原理策略动态时间：2022-01-15 21:44:51

相关答案

热门答案