国产欧美精品一区二区,中文字幕专区在线亚洲,国产精品美女网站在线观看,艾秋果冻传媒2021精品,在线免费一区二区,久久久久久青草大香综合精品,日韩美aaa特级毛片,欧美成人精品午夜免费影视

基于嵌入式注意機制的目標語(yǔ)音提取算法
DOI:
CSTR:
作者:
作者單位:

空軍工程大學(xué)航空機務(wù)士官學(xué)校

作者簡(jiǎn)介:

通訊作者:

中圖分類(lèi)號:

TN912

基金項目:


Target Speech Extraction Algorithm based on Embedded Atten-tion Mechanism
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 圖/表
  • |
  • 訪(fǎng)問(wèn)統計
  • |
  • 參考文獻
  • |
  • 相似文獻
  • |
  • 引證文獻
  • |
  • 資源附件
  • |
  • 文章評論
    摘要:

    摘要:針對說(shuō)話(huà)人語(yǔ)音提取問(wèn)題,提出了一種基于深度神經(jīng)網(wǎng)絡(luò )多任務(wù)學(xué)習的嵌入式注意機制單聲道說(shuō)話(huà)人語(yǔ)音提取方法。該算法將語(yǔ)音分離和語(yǔ)音提取統一到單個(gè)框架中,向頻譜映射分離模型中嵌入說(shuō)話(huà)人注意機制,并在引入說(shuō)話(huà)人輔助信息的注意機制中得到時(shí)變注意權重,利用時(shí)變注意權重分離出目標說(shuō)話(huà)人的內部嵌入向量,隨后采用提取模型對目標說(shuō)話(huà)人的嵌入向量進(jìn)行非線(xiàn)性處理運算,估計出目標說(shuō)話(huà)人對應的掩蔽,進(jìn)而提取出目標說(shuō)話(huà)人語(yǔ)音。同時(shí)借助TIMIT數據集,進(jìn)行了語(yǔ)音提取實(shí)驗。實(shí)驗結果驗證了所提算法的可行性和有效性,并在說(shuō)話(huà)人語(yǔ)音提取的性能上有明顯的優(yōu)越性。

    Abstract:

    Aiming at the problem of speaker speech extraction, a mono speaker speech extraction method based on deep neural network multi-task learning embedded attention mechanism is proposed. The algorithm unifies speech separation and speech extraction into a single framework, embedding the speaker attention mechanism into the spectrum mapping separation network, embeds the speaker attention mechanism in the spectrum mapping separation network, obtains the time-varying attention weight in the attention mechanism the speaker auxiliary information, uses the time-varying attention weight to separate the internal embedding vector of the target speaker, and then uses the extraction model to perform nonlinear processing operations on the embedding vector of the target speaker, estimates the mask corresponding to the target speaker, and then extracts the target speaker’s voice. At the same time, using the TIMIT dataset, speech extraction experiments are carried out. Experimental results verify the feasibility and effectiveness of the proposed algorithm, and have obvious superiority in the performance of speaker speech extraction.

    參考文獻
    相似文獻
    引證文獻
引用本文

郭志楷,楊明堃,蔣國峰,陶祁,劉歡歡,馬紅強.基于嵌入式注意機制的目標語(yǔ)音提取算法計算機測量與控制[J].,2023,31(10):174-181.

復制
分享
文章指標
  • 點(diǎn)擊次數:
  • 下載次數:
  • HTML閱讀次數:
  • 引用次數:
歷史
  • 收稿日期:2023-04-24
  • 最后修改日期:2023-06-02
  • 錄用日期:2023-06-02
  • 在線(xiàn)發(fā)布日期: 2023-10-26
  • 出版日期:
文章二維碼
红河县| 嵊泗县| 会东县| 万全县| 遂昌县| 讷河市| 阜康市| 常州市| 廉江市| 宝应县| 聂拉木县| 西畴县| 潮州市| 二连浩特市| 东港市| 乌拉特前旗| 安庆市| 阿巴嘎旗| 常宁市| 辉县市| 准格尔旗| 南靖县| 丹巴县| 松潘县| 湖南省| 郧西县| 贺州市| 含山县| 积石山| 冷水江市| 苗栗县| 拜泉县| 常宁市| 崇信县| 平罗县| 延长县| 古浪县| 涪陵区| 齐齐哈尔市| 娱乐| 邯郸县|