Research on load balancing of MapReduce laboratory system based on two-tier partition
Affiliation:

1. Shenzhen Academy of Inspection and Quarantine; 2. Shenzhen Academy of Inspection and Quarantine, Shenzhen

CLC number:

TP301.6

Fund Project:

National Key R&D Program of China (2019YFC1605401); General Administration of Customs project (2020HK109).

    Abstract:

    When a laboratory system processes massive volumes of raw data, real-world workloads often exhibit a high sampling rate and high skewness. A two-tier partitioning algorithm used to balance the load on Reducer nodes in a homogeneous environment cannot handle these cases effectively. To address this, MapReduce parallel processing is introduced to improve the utilization of sampled data in the laboratory system, and the ICSC (Improved Cluster Split Combination) partition scheduling algorithm is adopted to cope with high data skewness and a high sampling rate. Experiments show that the MapReduce load balancing algorithm based on two-tier partitioning effectively reduces the idle time of Mapper and Reducer nodes. As data skewness increases, the execution time of the algorithm remains essentially unchanged; that is, data skewness has little effect on execution time. As the sampling rate increases, the ICSC partition scheduling algorithm also maintains the lowest time overhead among the compared models. The algorithm therefore weakens the dependency between Reducer nodes and improves the execution efficiency and fault tolerance of MapReduce tasks, achieving efficient load balancing of data processing in laboratory systems under the MapReduce framework.
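The abstract does not give the details of ICSC, so the following is only an illustrative sketch of the general idea behind skew-aware partitioning, not the authors' algorithm. It assumes integer keys with known (for example, sampled) record counts, and the function names `hash_partition` and `split_combine_partition` are hypothetical. It contrasts plain modulo partitioning with a simple heuristic that splits oversized key clusters and then greedily combines the pieces onto the least-loaded Reducer.

```python
def hash_partition(key_counts, num_reducers):
    """Baseline: each integer key goes to reducer key % num_reducers."""
    loads = [0] * num_reducers
    for key, count in key_counts.items():
        loads[key % num_reducers] += count
    return loads


def split_combine_partition(key_counts, num_reducers):
    """Illustrative heuristic (an assumption, not ICSC itself):
    split clusters larger than the ideal per-reducer share, then place
    pieces, largest first, on the currently least-loaded reducer."""
    total = sum(key_counts.values())
    target = -(-total // num_reducers)  # ceiling of the ideal share

    # Split step: cut any cluster above the target into target-sized pieces.
    pieces = []
    for key, count in key_counts.items():
        while count > target:
            pieces.append((key, target))
            count -= target
        if count:
            pieces.append((key, count))

    # Combine step: greedy longest-processing-time-first assignment.
    pieces.sort(key=lambda piece: piece[1], reverse=True)
    loads = [0] * num_reducers
    assignment = [[] for _ in range(num_reducers)]
    for key, size in pieces:
        idx = loads.index(min(loads))
        loads[idx] += size
        assignment[idx].append((key, size))
    return loads, assignment


if __name__ == "__main__":
    # Hypothetical skewed workload: key 0 holds 70% of all records.
    counts = {0: 700, 1: 100, 2: 80, 3: 60, 4: 40, 5: 20}
    print("modulo partitioning loads:", hash_partition(counts, 4))
    print("split/combine loads:     ", split_combine_partition(counts, 4)[0])
```

Splitting one key across several Reducers is only valid when the reduce function can be applied to partial groups and merged afterwards (for example sums or counts), so a real system would need a second merge stage for the split keys.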

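Cite this article

Zheng Wenli, Xiong Beibei, Cheng Lixun, Cai Yina, Bao Xianyu. Research on load balancing of MapReduce laboratory system based on two-tier partition[J]. Computer Measurement & Control, 2023, 31(4): 252-257.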

History
  • Received: 2022-11-11
  • Last revised: 2022-12-19
  • Accepted: 2023-01-03
  • Published online: 2023-04-24