Optimal Policies for a Continuous Time MCP with Compact Action Set


XI Hong-Sheng, TANG Hao, YIN Bao-Qun

Citation: XI Hong-Sheng, TANG Hao, YIN Bao-Qun. Optimal Policies for a Continuous Time MCP with Compact Action Set. ACTA AUTOMATICA SINICA, 2003, 29(2): 206-211.

1. Department of Automation, University of Science and Technology of China, Hefei

Corresponding author: XI Hong-Sheng

CLC number: TP202

Publication history
- Received: 2001-10-08
- Published: 2003-02-20

Abstract: This paper studies optimal control policies for a class of continuous-time Markov control processes (CTMCPs) under the infinite-horizon average-cost criterion. Using basic properties of the infinitesimal generator and of performance potentials, we directly derive the optimality equation for the average-cost model on a compact action set and establish an existence theorem for its solutions. A fast value iteration algorithm that yields an ε-optimal stationary policy is proposed, and its convergence is proved. Finally, a numerical example illustrates the application of the method.

Keywords:
- performance potentials
- average-cost criteria
- compact action set
- value iteration
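
The abstract does not give the iteration itself, so the following is only an illustrative sketch of the general idea and not the authors' algorithm. It assumes a finite state space, approximates the compact action set by a finite grid, and applies standard relative value iteration to the uniformized chain. The names `relative_value_iteration`, `rate`, `cost`, and the two-state toy model are hypothetical and chosen only for illustration.

```python
def relative_value_iteration(n_states, actions, rate, cost,
                             eps=1e-8, max_iter=100_000):
    """Average-cost value iteration for a finite-state CTMCP (sketch).

    rate(i, a) -> list of transition rates q(j | i, a); entry i is ignored.
    cost(i, a) -> cost rate f(i, a).
    actions    -> finite grid approximating the compact action set (assumption).
    """
    # Uniformization constant, strictly above every total exit rate so that
    # the uniformized chain has self-loops and is aperiodic.
    lam = 1.0 + max(
        sum(q for j, q in enumerate(rate(i, a)) if j != i)
        for i in range(n_states) for a in actions
    )
    h = [0.0] * n_states  # relative values (play the role of potentials)
    for _ in range(max_iter):
        th, policy = [], []
        for i in range(n_states):
            best_val, best_a = float("inf"), None
            for a in actions:
                q = rate(i, a)
                exit_rate = sum(q[j] for j in range(n_states) if j != i)
                val = cost(i, a) / lam + (1.0 - exit_rate / lam) * h[i]
                val += sum(q[j] / lam * h[j]
                           for j in range(n_states) if j != i)
                if val < best_val:
                    best_val, best_a = val, a
            th.append(best_val)
            policy.append(best_a)
        diff = [t - x for t, x in zip(th, h)]
        h = [t - th[0] for t in th]  # normalize so that h[0] = 0
        # Stop when the span of (Th - h), rescaled to a cost rate, is small.
        if lam * (max(diff) - min(diff)) < eps:
            break
    eta = lam * (max(diff) + min(diff)) / 2.0  # average cost rate estimate
    return eta, h, policy


# Hypothetical toy model: state 0 moves to state 1 at rate 1 with no cost;
# in state 1 the controller picks a return rate a in [0.5, 1.5] and pays
# cost rate 3 + a^2.
def rate(i, a):
    return [0.0, 1.0] if i == 0 else [a, 0.0]


def cost(i, a):
    return 0.0 if i == 0 else 3.0 + a * a


actions = [0.5 + k / 100 for k in range(101)]
eta, h, policy = relative_value_iteration(2, actions, rate, cost)
print(eta, policy[1])
```

For this toy model the optimality equation can be solved by hand: the minimal average cost is 2, attained at rate a = 1 in state 1, so the iteration should reproduce these values up to the grid spacing and the stopping tolerance. The resulting policy is ε-optimal only with respect to the discretized action grid; handling the full compact set needs the continuity arguments that the paper develops.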
