taptap下载安装安卓学报 ›› taptap点点手机网页 , taptap点点手机网页 ›› Issue (4): 32-37.

• 民用航空 • 上一篇    下一篇

基于民航数据特性的重删固定长度分块方法

丁建立,李慧,曹卫东   

  1. (taptap下载安装安卓计算机科学与技术学院,天津300300)
  • 收稿日期:2021-04-13 修回日期:2021-05-13 出版日期:2022-08-15 发布日期:2023-10-28
  • 作者简介:丁建立(1963—),男,河南洛阳人,教授,博士,研究方向为智能仿生算法、智能信息处理及民航应用.
  • 基金资助:
    国家自然科学基金项目(U1833114);民航安全能力建设资金项目(SA2020280)

Deduplication fixed-length block method based on characteristics of civil aviation data

DING Jianli, LI Hui, CAO Weidong   

  1. (College of Computer Science and Technology, CAUC, Tianjin 300300, China)
  • Received:2021-04-13 Revised:2021-05-13 Online:2022-08-15 Published:2023-10-28

摘要: 针对民航数据在容灾备份时存在存储数据重复的问题,提出一种基于民航数据特性的重删固定长度分块方法。该方法根据民航数据类型的一致性,结合固定长度分块与可变长度分块的优势,设计了一种分块策略索引表的数据结构,为同种类型的数据提供分块策略,节省了分块时寻找数据块边界的时间,将备份时重复数据的模拟重删率提高到97.8%~99.3%,比固定长度分块方法高11.8%~12.5%,比可变长度分块方法高2.5%~3.0%;同时,为新的数据类型建立新的分块策略,便于后续数据流匹配,提高命中精度。

关键词: 民航数据, 容灾备份, 重复数据删除, 类型一致性, 分块策略, 模拟重删率

Abstract: Aiming at the problem of data duplication in the storage of civil aviation data during disaster recovery and back-up. A deduplication fixed-length block method based on the characteristics of civil aviation data is proposed. According to the consistency of civil aviation data types, this method combines the advantages of fixed-length block and variable-length block, and designs a data structure of the block strategy index table. A block strategy is provided for the same type of data, and time is saved to find the boundary of the data block during block division, the simulated data deduplication rate during backup increases to 97.8%~99.3%, which is 11.8%~12.5% higher than that of the fixed-length block method, and 2.5%~3.0% higher than that of the variable length block method; at the same time, a new block strategy is established for new data types to facilitate subsequent data stream matching and improve hit accuracy.

Key words: civil aviation data, disaster recovery backup, data deduplication, type consistency, block strategy, simulated deduplication rate

中图分类号: 

Baidu
map