結合類(lèi)腦導航的強化學(xué)習無(wú)人機自主導航

首頁(yè) > 過(guò)刊瀏覽>2024年第32卷第7期 >225-231

結合類(lèi)腦導航的強化學(xué)習無(wú)人機自主導航
DOI:
                        
                    
CSTR:
                        [cstr]
                    
作者:
                        
                        
                    
作者單位:成都信息工程大學(xué)
作者簡(jiǎn)介:
通訊作者:
中圖分類(lèi)號:
基金項目:四川省科技計劃資助項目

Reinforcement Learning Algorithms Combined with Brain-Inspired Navigation

Author:

Affiliation:

Fund Project:

摘要

圖/表

訪(fǎng)問(wèn)統計

參考文獻

相似文獻

引證文獻

資源附件

文章評論

摘要:

隨著(zhù)無(wú)人移動(dòng)平臺的不斷發(fā)展,為其賦予高效的自主導航能力變得尤為重要；針對無(wú)人機自主導航常用的端到端強化學(xué)習方法存在訓練效率低、泛化能力和通用性差等問(wèn)題,引入了類(lèi)腦導航模型,基于長(cháng)短時(shí)記憶(LSTM)神經(jīng)網(wǎng)絡(luò )構建了類(lèi)腦細胞導航模型,通過(guò)整合編碼無(wú)人機智能體的自運動(dòng)信息,實(shí)現了網(wǎng)格細胞和頭朝向細胞的編碼,進(jìn)一步將這些信息作為深度強化學(xué)習算法D3QN的狀態(tài)補充表示；通過(guò)在A(yíng)irSim仿真環(huán)境的實(shí)驗表明,類(lèi)腦導航模型的引入能夠有效提高算法的訓練能力和無(wú)人機智能體的導航性能,相較于原D3QN算法,在環(huán)境目標改變的情況下仍能尋找到新的目標點(diǎn),有效提升了算法的泛化能力。

Abstract:

With the continuous development of unmanned mobile platforms, it has become particularly important to endow them with efficient autonomous navigation capabilities. In response to the low training efficiency, poor generalization ability, and universality of the commonly used end-to-end reinforcement learning methods for autonomous navigation of UAV, a brain-inspired navigation model is introduced. Based on the long short-term memory (LSTM) neural network, a brain-inspired cell navigation model is constructed, which integrates the self-motion information of the UAV intelligent agent to encode grid cells and head direction cells, and further supplements this information as the state of the deep reinforcement learning algorithm D3QN. The experiments in AirSim simulation environment show that the introduction of the brain-inspired navigation model can effectively improve the training ability of the algorithm and the navigation performance of the UAV intelligent agent. Compared with the original D3QN algorithm, it can still find new target points when the environmental target changes, effectively improving the generalization ability of the algorithm.

參考文獻

相似文獻

引證文獻

引用本文

吳勇,彭輝,熊峰鑰.結合類(lèi)腦導航的強化學(xué)習無(wú)人機自主導航計算機測量與控制[J].,2024,32(7):225-231.

復制

文章指標

點(diǎn)擊次數:
下載次數:
HTML閱讀次數:
引用次數:

歷史

收稿日期:2023-07-03
最后修改日期:2023-08-01
錄用日期:2023-08-02
在線(xiàn)發(fā)布日期: 2024-08-02
出版日期:

国产欧美精品一区二区,中文字幕专区在线亚洲,国产精品美女网站在线观看,艾秋果冻传媒2021精品,在线免费一区二区,久久久久久青草大香综合精品,日韩美aaa特级毛片,欧美成人精品午夜免费影视

引用本文

分享

文章指標

歷史

文章二維碼