Optimality Strategy of Average Cost Based Performance Potentials for Markov Control Process


ZHOU Ya-Ping, XI Hong-Sheng, YIN Bao-Qun, SUN De-Min

Citation:

ZHOU Ya-Ping, XI Hong-Sheng, YIN Bao-Qun, SUN De-Min. Optimality Strategy of Average Cost Based Performance Potentials for Markov Control Process. ACTA AUTOMATICA SINICA, 2002, 28(6): 904-910.


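1. Department of Management Science, University of Science and Technology of China, Hefei
2. Department of Automation, University of Science and Technology of China, Hefei

Corresponding author: ZHOU Ya-Ping

CLC number: TP202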
Publication history
- Received: 2000-12-07
- Published: 2002-06-20


Abstract: This paper addresses the average-cost optimal control problem for a class of discrete-time Markov control processes. Using basic properties of Markov performance potentials, the optimality equation for the infinite-horizon average-cost model on a compact action set is derived directly under quite general assumptions, and an existence theorem for its solution is proved. An iterative algorithm for computing an optimal stationary control strategy is proposed, and its convergence is discussed. Finally, a numerical example is analyzed to illustrate the application of the algorithm.

Keywords:
- Markov control process
- performance potentials
- average cost model
- optimal stationary strategy
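
As a rough illustration of the kind of iteration the abstract describes, the sketch below runs textbook policy iteration for an average-cost problem, using performance potentials in the evaluation step. It is not taken from the paper: it assumes a finite state space, a finite action set (the paper treats compact action sets), and that every stationary policy yields an ergodic chain. For a fixed policy it solves the Poisson equation g(i) + eta = f(i, a) + sum_j P(j | i, a) g(j) with the normalization g(0) = 0, then improves the policy state by state. The names `P`, `f`, `evaluate`, and `policy_iteration`, and the two-state example data, are hypothetical.

```python
def solve_linear(A, b):
    """Solve A x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(n):
            if r != col:
                factor = M[r][col] / M[col][col]
                for c in range(col, n + 1):
                    M[r][c] -= factor * M[col][c]
    return [M[i][n] / M[i][i] for i in range(n)]


def evaluate(P, f, policy):
    """Return (eta, g): average cost and potentials of a stationary policy.

    P[i][a][j] is the transition probability i -> j under action a,
    f[i][a] is the one-step cost. Potentials are normalized by g[0] = 0.
    """
    n = len(policy)
    A, b = [], []
    for i in range(n):
        a = policy[i]
        # Unknowns are (eta, g[1], ..., g[n-1]); g[0] is fixed at 0.
        row = [1.0] + [(1.0 if j == i else 0.0) - P[i][a][j]
                       for j in range(1, n)]
        A.append(row)
        b.append(f[i][a])
    x = solve_linear(A, b)
    return x[0], [0.0] + x[1:]


def policy_iteration(P, f, max_iter=100):
    """Alternate policy evaluation and greedy improvement until stable."""
    n = len(P)
    policy = [0] * n
    eta, g = evaluate(P, f, policy)
    for _ in range(max_iter):
        new_policy = []
        for i in range(n):
            def q(a):
                return f[i][a] + sum(P[i][a][j] * g[j] for j in range(n))
            best = min(range(len(P[i])), key=q)
            # Keep the current action on (near-)ties to avoid cycling.
            if q(policy[i]) <= q(best) + 1e-12:
                best = policy[i]
            new_policy.append(best)
        if new_policy == policy:
            break
        policy = new_policy
        eta, g = evaluate(P, f, policy)
    return policy, eta, g


# Hypothetical two-state, two-action example (all chains are ergodic).
P = [
    [[0.9, 0.1], [0.2, 0.8]],   # state 0: action 0, action 1
    [[0.5, 0.5], [0.1, 0.9]],   # state 1: action 0, action 1
]
f = [
    [1.0, 3.0],                 # costs in state 0
    [4.0, 1.5],                 # costs in state 1
]

if __name__ == "__main__":
    policy, eta, g = policy_iteration(P, f)
    print("policy:", policy, "average cost:", round(eta, 6), "potentials:", g)
```

The normalization g(0) = 0 is one common choice; potentials are determined only up to an additive constant, so any other normalization gives the same improved policy. With a compact (continuous) action set, the inner `min` over actions would have to be replaced by a continuous minimization, which this sketch does not attempt.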
