Open Access Open Access  Restricted Access Subscription or Fee Access

Secure Data Deduplication at Client Side and in Cloud Storage with Application Aware System

Dipti Balasaheb Bansode, Amar Buchade

Abstract


Nowadays, cloud based storage services offer easy to access, secure, reliable, and low cost remote storage for file sharing, document suits and online backups. The leading cloud based storage services use data deduplication as the continuous and exponential increase of number of users and size of their data. Data deduplication removes redundant data and tries to ensure that only unique instance of the data is stored. The purpose of deduplication are to reduce the network bandwidth, to reduce storage space, to check user instances have original content, to reduce total cost of ownership and improves backup operation and reduce Recovery Time Objective. Here, in this paper, we are combining both file level data deduplication and chunk level data deduplication to achieve faster deduplication at client side deduplication. To reduce the index overhead we are using application aware index structure method. In order to give security with deduplication convergent encryption is applied. As for chunk level deduplication with encryption, key management issue arises. To solve key management issue we effectively use the metadata manager component. At cloud storage side the cross user data deduplication is done securely.


Keywords


Application Awareness, Cloud Storage, Convergent Encryption, Data Deduplication, Source Deduplication.

Full Text:

PDF

References


Yinjin Fu; Hong Jiang; Nong Xiao; Lei Tian; Fang Liu; Lei Xu, "Application-Aware Local-Global Source Deduplication for Cloud Backup Services of Personal Storage," Parallel and Distributed Systems, IEEE Transactions on , vol.25, no.5, pp.1155,1165, May 2014.

Daehee Kim; Sejun Song; Baek-Young Choi, "SAFE: Structure-aware file and email deduplication for cloud-based storage systems," Cloud Networking (CloudNet), 2013 IEEE 2nd International Conference on, vol., no., pp.130, 137, 11-13 Nov. 2013.

Puzio, P.; Molva, R.; Onen, M.; Loureiro, S., "ClouDedup: Secure Deduplication with Encrypted Data for Cloud Storage," Cloud Computing Technology and Science (CloudCom), 2013 IEEE 5th International Conference on, vol.1, no., pp.363, 370, 2-5 Dec. 2013.

Jin Li; Xiaofeng Chen; Xhafa, F.; Barolli, L., "Secure Deduplication Storage Systems with Keyword Search," Advanced Information Networking and Applications (AINA), 2014 IEEE 28th International Conference on , vol., no., pp.971,977, 13-16 May 2014.

Sun, Z., Shen, J. & Yong, J., “A novel approach to data deduplication over the engineering-oriented cloud systems,” Integrated Computer Aided Engineering, 2013.

Jin Li; Xiaofeng Chen; Mingqiang Li; Jingwei Li; Lee, P.P.C.; Wenjing Lou, "Secure Deduplication with Efficient and Reliable Convergent Key Management," Parallel and Distributed Systems, IEEE Transactions on , vol.25, no.6, pp.1615,1625, June 2014.

Dutch T. Meyer and William J. Bolosky. “A study of practical deduplication”, In Proceedings of the 9th USENIX conference on File and storage technologies. USENIX Association, Berkeley, CA, USA, Dec 2011

Fang Yan, YuAn Tan, “A Method of Object Based Deduplication, Journal of Networks”, Vol 6, No 12(2011), 1705-1712, Dec 2011.

Daehee Kim, Sejun Song, Baek-Young Choi, SAFE: Structure aware file and email deduplication for cloud based storage systems, Cloud Networking (CloudNet) IEEE 2nd International Conference, 2013.

en.wikipedia.org/wiki/Deduplication Material related to deduplication basic is given on the link.

Openstack: https://www.openstack.org/ OpenStack is a free and open-source cloud computing software platform. Users primarily deploy it as an infrastructure as a service (IaaS) solution.

Gethub: https://github.com/openstack/devstack downloaded devstack public cloud.

Danny Harnik, Benny Pinkas, Alexandra Shulman-Peleg, Side Channels in Cloud Services: Deduplication in Cloud Storage, IEEE Security and Privacy, vol.8, no. 6, pp. 40-47, Nov/Dec 2010.

Apache jcloud: http://jclouds.apache.org is referred for interface between openstack and java.

Mohammad Peyraviana, Allen Roginsky, Ajay Kshemkalyani, On Probabilities of Hash Value Matches, Elsevier Science Limited Computers 8 Security Vol. 17, No.2, pp. 171- 174, 1998.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.