ty -jour a2 -mateos,cristian au -he,qinlu au -bian,genqing au -shao -shao -shao,bilin au -Zhang,weiqi py -2020 da -2020/10/14 ti-关于DEDUPLICATION SP -deDupleplication SP -sp -deDupleplication sp -sp-2020/10/148869237 VL -2020 AB-重复数据删除是存储系统中一种流行的数据减少技术,具有显着优势,例如查找和消除重复数据,减少所需数据存储容量,增加资源利用以及节省存储成本。文件功能是用于计算文件之间相似性的关键因素,但是单个功能计算出的相似性具有一定的限制,尤其是对于相似文件。存储节点功能反映了节点的负载条件,这是数据路由中要考虑的关键因素。本文介绍了多次数据路由策略(DRMF)。路由策略是根据群集的特征制定的,包括路由通信,文件相似性计算以及目标节点的确定。相互信息交换是通过路由通信,路由服务器和存储节点来实现的。存储节点计算存储的文件之间的相似性,然后根据路由服务器提供的信息路由文件。路由服务器根据相似的结果和节点加载功能确定路线的目标节点。系统原型设计和实施; also, we develop a system to process the feature of cluster and determine the specific parameters of various features of experiments. In the end, we simulate the multifeature data routing and single-feature data routing, respectively, and compare the deduplication rate and data slope between the two strategies. The experimental results show that the proposed data routing strategy using multiple features can improve the deduplication rate of the cluster and maintain a lower data skew rate compared with the single-feature-based routing strategy MCS; DRMF can improve the deduplication rate of the cluster and maintain a lower data skew rate. SN - 1058-9244 UR - https://doi.org/10.1155/2020/8869237 DO - 10.1155/2020/8869237 JF - Scientific Programming PB - Hindawi KW - ER -