Using Deduplicating Storage for Efficient Disk Image Deployment

Xing Lin; Mike Hibler; Eric Eide; Robert Ricci

doi:10.4108/icst.tridentcom.2015.259963

Using Deduplicating Storage for Efficient Disk Image Deployment

Authors

Xing Lin University of Utah
Mike Hibler University of Utah
Eric Eide University of Utah
Robert Ricci University of Utah

DOI:

https://doi.org/10.4108/icst.tridentcom.2015.259963

Keywords:

deduplication, image deployment

Abstract

Many clouds and network testbeds use disk images to initialize local storage on their compute devices. Large facilities must manage thousands or more images, requiring significant amounts of storage. At the same time, to provide a good user experience, they must be able to deploy those images quickly. Driven by our experience in operating the Emulab site at the University of Utah---a long-lived and heavily-used testbed---we have created a new service for efficiently storing and deploying disk images. This service exploits the redundant data found in similar images, using deduplication to greatly reduce the amount of physical storage required. In addition to space savings, our system is also designed for highly efficient image deployment---it integrates with an existing highly-optimized disk image deployment system, Frisbee, without significantly increasing the time required to distribute and install images. In this paper, we explain the design of our system and discuss the trade-offs we made to strike a balance between efficient storage and fast disk image deployment. We also propose a new chunking algorithm, called AFC, which enables fixed-size chunking for deduplicating allocated disk sectors. Experimental results show that our system reduces storage requirements by up to 3x while imposing only a negligible runtime overhead on the end-to-end disk-deployment process.

References

Downloads

Published

03-08-2015

Issue

Vol. 2 No. 6 (2015): EAI Endorsed Transactions on Scalable Information Systems

Section

Research articles

License

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

This is an open access article distributed under the terms of the CC BY-NC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.

How to Cite

Lin X, Hibler M, Eide E, Ricci R. Using Deduplicating Storage for Efficient Disk Image Deployment. EAI Endorsed Scal Inf Syst [Internet]. 2015 Aug. 3 [cited 2026 Jul. 26];2(6):e1. Available from: https://publications.eai.eu/index.php/sis/article/view/2295

Download Citation

Using Deduplicating Storage for Efficient Disk Image Deployment

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite

Make a Submission