国产欧美精品一区二区,中文字幕专区在线亚洲,国产精品美女网站在线观看,艾秋果冻传媒2021精品,在线免费一区二区,久久久久久青草大香综合精品,日韩美aaa特级毛片,欧美成人精品午夜免费影视

基于自適應注意力機制的輕量化語(yǔ)義分割網(wǎng)絡(luò )
DOI:
CSTR:
作者:
作者單位:

北京工商大學(xué) 計算機與人工智能學(xué)院

作者簡(jiǎn)介:

通訊作者:

中圖分類(lèi)號:

TP183

基金項目:

重慶自然科學(xué)基金(CSTB2022NSCO-MSX1415)


Lightweight Semantic Segmentation Network Based On Adaptive Attention Mechanism
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 圖/表
  • |
  • 訪(fǎng)問(wèn)統計
  • |
  • 參考文獻
  • |
  • 相似文獻
  • |
  • 引證文獻
  • |
  • 資源附件
  • |
  • 文章評論
    摘要:

    針對語(yǔ)義SLAM(simultaneous localization and mapping)中語(yǔ)義分割速度較慢,實(shí)時(shí)性較低、占用資源過(guò)多等問(wèn)題,提出一種含有自適應通道注意力機制的輕量級Mask R-CNN網(wǎng)絡(luò ),由于原有的語(yǔ)義分割網(wǎng)絡(luò )里的殘差網(wǎng)絡(luò )復雜,且應用環(huán)境在室內,環(huán)境較為簡(jiǎn)單,故該輕量級網(wǎng)絡(luò )將原有復雜的主干網(wǎng)絡(luò )中的ResNet-50利用深度可分離卷積與分組卷積改進(jìn)為更加輕量的ResNet-DS-tiny(ResNet with depthwise separable convolutions),并加入自適應通道注意力機制。在自適應通道注意力模塊中,利用加權方式對輸入的RGB-D圖像從空間和通道賦予不同的權重,增強了特征的表達能力。此外,為了輕量化特征金字塔,使用使用不同空洞率的空洞卷積來(lái)提取不同大小感受野的特征信息,有效地獲取了多尺度的特征。相較于傳統的特征金字塔,空洞卷積減少了參數量。在更充分獲取 RGB 信息特征的同時(shí),提升了語(yǔ)義分割系統的實(shí)時(shí)性并減少了資源占用。

    Abstract:

    To address the issues of slow semantic segmentation speed, low real-time performance, and high resource consumption in semantic SLAM (simultaneous localization and mapping), a lightweight Mask R-CNN network with an adaptive channel attention mechanism is proposed. Given the complexity of the residual networks in existing semantic segmentation networks and the relatively simple indoor application environments, this lightweight network replaces the original complex backbone ResNet-50 with a more lightweight ResNet-DS-tiny (ResNet with depthwise separable convolutions) by incorporating depthwise separable convolutions and grouped convolutions. An adaptive channel attention mechanism is also introduced. In the adaptive channel attention module, a weighted approach is used to assign different weights to the input RGB-D images from both spatial and channel dimensions, thereby enhancing the feature representation capability. Additionally, to lighten the feature pyramid, dilated convolutions are employed to expand the receptive field, effectively aggregating multi-scale features with different dilation rates. Compared to traditional feature pyramids, the use of dilated convolutions reduces the number of parameters. This approach not only more effectively captures RGB information features but also improves the real-time performance of the semantic segmentation system while reducing resource consumption.

    參考文獻
    相似文獻
    引證文獻
引用本文

王艷莉,連曉峰,康毛毛.基于自適應注意力機制的輕量化語(yǔ)義分割網(wǎng)絡(luò )計算機測量與控制[J].,2024,32(12):223-228.

復制
分享
文章指標
  • 點(diǎn)擊次數:
  • 下載次數:
  • HTML閱讀次數:
  • 引用次數:
歷史
  • 收稿日期:2024-06-07
  • 最后修改日期:2024-07-19
  • 錄用日期:2024-07-19
  • 在線(xiàn)發(fā)布日期: 2024-12-24
  • 出版日期:
文章二維碼
哈巴河县| 马鞍山市| 驻马店市| 新闻| 界首市| 普定县| 图们市| 武宁县| 教育| 黔西县| 邢台市| 滕州市| 丰台区| 韶山市| 崇仁县| 醴陵市| 龙江县| 抚州市| 金平| 钟山县| 罗田县| 河东区| 祥云县| 平湖市| 西平县| 成安县| 上犹县| 班戈县| 贵南县| 方城县| 江门市| 洪雅县| 通榆县| 星子县| 土默特左旗| 昌图县| 二手房| 南丹县| 调兵山市| 南汇区| 清原|