The Open Electrical & Electronic Engineering Journal

2013, 7 : 90-97
Published online 2013 October 18. DOI: 10.2174/1874129001307010090
Publisher ID: TOEEJ-7-90

Improving Read Performance with BP-DAGs for Storage-Efficient File Backup

Tianming Yang , Jing Zhang and Ningbo Hao
International College, Huanghuai University, Zhumadian, Henan, 463000, China.

ABSTRACT

The continued growth of data and high-continuity of application have raised a critical and mounting demand on storage-efficient and high-performance data protection. New technologies, especially the D2D (Disk-to-Disk) deduplication storage are therefore getting wide attention both in academic and industry in the recent years. Existing deduplication systems mainly rely on duplicate locality inside the backup workload to achieve high throughput but suffer from read performance degrading under conditions of poor duplicate locality. This paper presents the design and performance evaluation of a D2D-based de-duplication file backup system, which employs caching techniques to improve write throughput while encoding files as graphs called BP-DAGs (Bi-pointer-based Directed Acyclic Graphs). BP-DAGs not only satisfy the ‘unique’ chunk storing policy of de-duplication, but also help improve file read performance in case of poor duplicate locality workloads. Evaluation results show that the system can achieve comparable read performance than non de-duplication backup systems such as Bacula under representative workloads, and the metadata storage overhead for BP-DAGs are reasonably low.

Keywords:

Data De-duplication, File Backup, Storage-Efficient.