学术报告

学术报告

当前位置：首页学术报告正文

【40周年校庆学术活动】学术报告八：Stochastic Linear Quadratic Optimal Control Problem: A Reinforcement Learning Method

时间：2023-03-27 15:42

菲律宾环球360注册账号学术报告[2023] 008号

(高水平大学建设系列报告779号)

报告题目: Stochastic Linear Quadratic Optimal Control Problem: A Reinforcement Learning Method

报告人：李娜（山东财经大学）

报告时间：2023.3.30下午14: 00- 15: 00

直播平台及链接：腾讯会议938-263-431 　

报告内容：This talk is to adopt a reinforcement learning (RL) method to solve infinite horizon continuous-time stochastic linear quadratic problems, where the drift and diffusion terms in the dynamics may depend on both the state and control. Based on the Bellman’s dynamic programming principle, we presented an online RL algorithm to attain optimal control with partial system information. This algorithm computes the optimal control, rather than estimates the system coefficients, and solves the related Riccati equation. It only requires local trajectory information, which significantly simplifies the calculation process. We shed light on our theoretical findings using two numerical examples.

报告人简历：李娜，二级教授，博士生导师，国家级青年人才，首届山东省科学技术青年奖获得者，山东财经大学统计与数学学院副院长；兼任中国自动化学会TCCT随机系统控制委员会委员、山东省大数据研究会理事、《Math Review》评论员等。近年来，在控制论领域国际三大顶级期刊《SIAM Journal on Control and Optimization》、《Automatica》、《IEEE Transactions on Automatic Control》等国际著名学术期刊发表高水平论文20余篇；先后主持国家自然科学基金项目3项、山东省自然科学基金项目2项、山东省高等学校科技项目2项；曾获山东省高等学校科学技术奖二等奖1项、山东省省级教学成果二等奖3项；主持教育部产学合作协同育人项目2项、山东省研究生教育优质课程建设项目1项。

欢迎感兴趣的师生参加！

数学与统计学院

2023年3月22日

上一篇：【40周年校庆学术活动】学术报告九：The Cohen-Macaulay Auslander algebra of string algebras

下一篇：【40周年校庆学术活动】学术报告七：完全对称多项式零点的新下界

主讲人		讲座时间
讲座地点		实际会议时间日
实际会议时间年月