Event-Triggered Integral Reinforcement Learning Control with State Constraints

2023, Vol. 31, No. 7, pp. 143-149

Affiliation: Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), Jiangnan University
CLC number: TP273
Funding: National Natural Science Foundation of China (No. 61833007)

Event-Triggered Integral Reinforcement Learning Optimal Control with State Constraints


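Abstract (translated from the Chinese):

To overcome the limitations of symmetric full-state constraints and frequent control-policy updates while optimizing an infinite-horizon cost function, a controller design method based on event-triggered integral reinforcement learning with state constraints is proposed for a class of continuous-time affine nonlinear systems with partially unknown dynamics. The method is a data-based online policy iteration method. A system transformation is introduced to convert the system with full-state constraints into an unconstrained one. Based on the event-triggering mechanism and the integral reinforcement learning algorithm, system transformation, policy evaluation, and policy improvement are executed alternately; as a result, while the full-state constraints are satisfied, the cost function and the control policy converge to their respective optimal values, and the update frequency of the control policy is reduced. In addition, the stability of the system and of the critic neural network weight error is rigorously analyzed by constructing a Lyapunov function. A simulation of a single-link robotic arm further illustrates the feasibility of the algorithm.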

Abstract:

To overcome the limitations of symmetric full-state constraints and frequent control-policy updates, and to optimize the infinite-horizon cost function, a controller design method based on event-triggered integral reinforcement learning with state constraints is proposed for a class of continuous-time affine nonlinear systems with partially unknown dynamics. It is a data-based online policy iteration approach. First, a system transformation is introduced to convert the system with full-state constraints into an unconstrained system. Next, based on the event-triggering mechanism and the integral reinforcement learning algorithm, system transformation, policy evaluation, and policy improvement are performed alternately, so that the system satisfies the full-state constraints while the cost function and the control policy converge to their optimal values. At the same time, the update frequency of the control policy is reduced. In addition, the stability of the system is rigorously analyzed by constructing a Lyapunov function. A simulation of a single-link robotic arm is given to verify the effectiveness of the proposed approach.
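
The abstract does not state the triggering condition, so the following is only a minimal sketch of the general event-triggered idea: the control input is held constant between events and refreshed only when the gap between the current state and the last sampled state grows too large. The scalar plant `dx/dt = -x + u`, the gain `k`, the relative threshold `sigma`, and the function name `simulate` are all hypothetical choices for illustration. The sketch contains no state constraints, system transformation, critic network, or learning, and is not the paper's controller.

```python
def simulate(sigma, steps=5000, dt=0.001):
    """Euler simulation of the hypothetical plant dx/dt = -x + u.

    The control is u = -k * x_held, where x_held is the state sampled at
    the most recent event. An event fires when |x - x_held| > sigma * |x|.
    With sigma = 0 the control is refreshed at (almost) every step, which
    serves as a time-triggered baseline for comparison.
    Returns the final state and the number of control updates.
    """
    k = 2.0        # hypothetical feedback gain, not taken from the paper
    x = 1.0        # initial state
    x_held = x     # state sampled at the last event
    updates = 1    # the initial sample counts as the first update
    for _ in range(steps):
        gap = abs(x - x_held)
        if gap > sigma * abs(x):
            # Event: resample the state, which changes the control input.
            x_held = x
            updates += 1
        u = -k * x_held          # control is held constant between events
        x += dt * (-x + u)       # explicit Euler step of the plant
    return x, updates


if __name__ == "__main__":
    x_event, n_event = simulate(sigma=0.5)
    x_periodic, n_periodic = simulate(sigma=0.0)
    print("event-triggered updates:", n_event, "final state:", x_event)
    print("per-step updates:", n_periodic, "final state:", x_periodic)
```

Comparing the two update counts shows the trade-off the abstract refers to: in this toy setting the event-triggered run still drives the state toward zero while refreshing the control far less often than the per-step baseline.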


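Cite this article

Tian Fenming (田奮銘), Liu Fei (劉飛). Event-triggered integral reinforcement learning control with state constraints [J]. Computer Measurement & Control, 2023, 31(7): 143-149.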

History

Received: 2023-02-24
Last revised: 2023-03-02
Accepted: 2023-03-02
Published online: 2023-07-12