EAD: elasticity aware deduplication manager for datacenters with multi-tier storage systems Article

Yang, Z, Wang, Y, Bhamini, J et al. (2018). EAD: elasticity aware deduplication manager for datacenters with multi-tier storage systems . CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 21(3), 1561-1579. 10.1007/s10586-018-2141-z

cited authors

  • Yang, Z; Wang, Y; Bhamini, J; Tan, CC; Mi, N

authors

abstract

  • The popularity of Big Data applications places pressures on storage systems to efficiently scale to meet the demand. At the same time, new developments like solid-state drives have changed to traditional storage hierarchy. Cloud storage systems are transitioning towards a hybrid architecture consisting of large amounts of memory, solid-state disks (SSDs), and traditional magnetic hard disks (HD). This paper presents elasticity aware deduplication (EAD), a data deduplication framework designed for multi-tier cloud storage architectures consisting of SSD and HD. EAD dynamically adjusts the deduplication parameters at runtime in order to improve performance. Experimental results indicate that EAD is able to detect more than 98% of all duplicate data, but it only consumes less than 5% of expected memory space. Additionally, EAD saves approximately 74% of overall IO access cost compared to the traditional design.

publication date

  • September 1, 2018

Digital Object Identifier (DOI)

start page

  • 1561

end page

  • 1579

volume

  • 21

issue

  • 3