1 An Update Model for Network Coding in Cloud Storage Systems 2012 50th Annual Allerton Conference on Communication, Control, and Computing Mohammad Reza.

Slides:



Advertisements
Similar presentations
An Array-Based Algorithm for Simultaneous Multidimensional Aggregates By Yihong Zhao, Prasad M. Desphande and Jeffrey F. Naughton Presented by Kia Hall.
Advertisements

Analysis and Construction of Functional Regenerating Codes with Uncoded Repair for Distributed Storage Systems Yuchong Hu, Patrick P. C. Lee, Kenneth.
XORs in The Air: Practical Wireless Network Coding
Alex Dimakis based on collaborations with Dimitris Papailiopoulos Arash Saber Tehrani USC Network Coding for Distributed Storage.
Henry C. H. Chen and Patrick P. C. Lee
BASIC Regenerating Codes for Distributed Storage Systems Kenneth Shum (Joint work with Minghua Chen, Hanxu Hou and Hui Li)
Simple Regenerating Codes: Network Coding for Cloud Storage Dimitris S. Papailiopoulos, Jianqiang Luo, Alexandros G. Dimakis, Cheng Huang, and Jin Li University.
Yuchong Hu1, Henry C. H. Chen1, Patrick P. C. Lee1, Yang Tang2
Lava: A Reality Check of Network Coding in Peer-to-Peer Live Streaming Mea Wang, Baochun Li Department of Electrical and Computer Engineering University.
Network coding techniques Elena Fasolo Network coding techniques Elena Fasolo PhD Student - SIGNET Group Wireless Systems - Lecture.
Availability in Globally Distributed Storage Systems
Network Coding in Peer-to-Peer Networks Presented by Chu Chun Ngai
1 Rateless codes and random walks for P2P resource discovery in Grids IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, NOV Valerio Bioglio.
Source-Location Privacy Protection in Wireless Sensor Network Presented by: Yufei Xu Xin Wu Da Teng.
Typhoon: An Ultra-Available Archive and Backup System Utilizing Linear-Time Erasure Codes.
Concealment of Whole-Picture Loss in Hierarchical B-Picture Scalable Video Coding IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 11, NO. 1, JANUARY 2009 Xiangyang.
Routing in WSNs through analogies with electrostatics December 2005 L. Tzevelekas I. Stavrakakis.
Cloud Scale Storage Systems Sean Ogden October 30, 2013.
Network Coding in P2P-Systems Christian Ortolf. Overview ● Introduction ● Galois fields ● Encoding/Decoding of Files ● Gain – Coupon Collector's problem.
1 Data Persistence in Large-scale Sensor Networks with Decentralized Fountain Codes Yunfeng Lin, Ben Liang, Baochun Li INFOCOM 2007.
Network Coding for Large Scale Content Distribution Christos Gkantsidis Georgia Institute of Technology Pablo Rodriguez Microsoft Research IEEE INFOCOM.
1 University of Freiburg Computer Networks and Telematics Prof. Christian Schindelhauer Mobile Ad Hoc Networks Network Coding and Xors in the Air 7th Week.
Beneficial Caching in Mobile Ad Hoc Networks Bin Tang, Samir Das, Himanshu Gupta Computer Science Department Stony Brook University.
Algorithms for Precomputing Constrained Widest Paths and Multicast Trees Paper by Stavroula Siachalou and Leonidas Georgiadis Presented by Jeremy Witmer.
Network Coding Project presentation Communication Theory 16:332:545 Amith Vikram Atin Kumar Jasvinder Singh Vinoo Ganesan.
Algorithms for Precomputing Constrained Widest Paths and Multicast Trees Paper by Stavroula Siachalou and Leonidas Georgiadis Presented by Jeremy Witmer.
Efficient replica maintenance for distributed storage systems Byung-Gon Chun, Frank Dabek, Andreas Haeberlen, Emil Sit, Hakim Weatherspoon, M. Frans Kaashoek,
A Hierarchical Energy-Efficient Framework for Data Aggregation in Wireless Sensor Networks IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 55, NO. 3, MAY.
Page 1 Page 1 Network Coding Theory: Tutorial Presented by Avishek Nag Networks Research Lab UC Davis.
On Error Preserving Encryption Algorithms for Wireless Video Transmission Ali Saman Tosun and Wu-Chi Feng The Ohio State University Department of Computer.
NCCloud: A Network-Coding-Based Storage System in a Cloud-of-Clouds
Network Coding for Distributed Storage Systems IEEE TRANSACTIONS ON INFORMATION THEORY, SEPTEMBER 2010 Alexandros G. Dimakis Brighten Godfrey Yunnan Wu.
Feng Lu Chuan Heng Foh, Jianfei Cai and Liang- Tien Chia Information Theory, ISIT IEEE International Symposium on LT Codes Decoding: Design.
1 Route Table Partitioning and Load Balancing for Parallel Searching with TCAMs Department of Computer Science and Information Engineering National Cheng.
By: Gang Zhou Computer Science Department University of Virginia 1 A Game-Theoretic Framework for Congestion Control in General Topology Networks SYS793.
Repairable Fountain Codes Megasthenis Asteris, Alexandros G. Dimakis IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 32, NO. 5, MAY /5/221.
Efficient Network-Coding-Based Opportunistic Routing Through Cumulative Coded Acknowledgments Dimitrios Koutsonikolas, Chih-Chun Wang and Y. Charlie Hu.
1 Lava: A Reality Check of Network Coding in Peer-to-Peer Live Streaming Mea Wang, Baochun Li Department of Electrical and Computer Engineering University.
Location-aware MapReduce in Virtual Cloud 2011 IEEE computer society International Conference on Parallel Processing Yifeng Geng1,2, Shimin Chen3, YongWei.
Network Coding and Media Streaming (Invited Paper)
1 Adapted from Pearson Prentice Hall Adapted form James A. Senn’s Information Technology, 3 rd Edition Chapter 7 Enterprise Databases and Data Warehouses.
Wireless Network Coding Martin Xu. Outline Introduction New Solutions – COPE – ANC Conclusions.
Shifted Codes Sachin Agarwal Deutsch Telekom A.G., Laboratories Ernst-Reuter-Platz Berlin Germany Joint work with Andrew Hagedorn and Ari Trachtenberg.
1 Network Coding and its Applications in Communication Networks Alex Sprintson Computer Engineering Group Department of Electrical and Computer Engineering.
User Cooperation via Rateless Coding Mahyar Shirvanimoghaddam, Yonghui Li, and Branka Vucetic The University of Sydney, Australia IEEE GLOBECOM 2012 &
Degraded-First Scheduling for MapReduce in Erasure-Coded Storage Clusters Runhui Li, Patrick P. C. Lee, Yuchong Hu th Annual IEEE/IFIP International.
Salah A. Aly,Moustafa Youssef, Hager S. Darwish,Mahmoud Zidan Distributed Flooding-based Storage Algorithms for Large-Scale Wireless Sensor Networks Communications,
Wireless Access and Networking Technology Lab WANT Opportunistic XOR Network Coding for Multihop Data Delivery in Underwater Acoustic Networks Haojie Zhuang,
Cooperative Recovery of Distributed Storage Systems from Multiple Losses with Network Coding Yuchong Hu, Yinlong Xu, Xiaozhao Wang, Cheng Zhan and Pei.
Multimedia Transmission Over Cognitive Radio Networks using Decode-and-Forward Multi-Relays and Rateless Coding Abdelaali Chaoub, Elhassane Ibn-Elhaj National.
Ahmed Osama Research Assistant. Presentation Outline Winc- Nile University- Privacy Preserving Over Network Coding 2  Introduction  Network coding 
LT Network Codes Mary-Luc Champel, Kevin Huguenin, Anne-Marie Kermarrec and Nicolas Le Scouarnec Technicolor, Rennes, France IEEE ICDCS (International.
Chapter 5 Uma Gupta Introduction to Information Systems  2000 by Prentice Hall. 5-1 Database Design and Management.
Paper Survey of DHT Distributed Hash Table. Usages Directory service  Very little amount of information, such as URI, metadata, … Storage  Data, such.
Layer-aligned Multi-priority Rateless Codes for Layered Video Streaming IEEE Transactions on Circuits and Systems for Video Technology, 2014 Hsu-Feng Hsiao.
UEP LT Codes with Intermediate Feedback Jesper H. Sørensen, Petar Popovski, and Jan Østergaard Aalborg University, Denmark IEEE COMMUNICATIONS LETTERS,
Multi-Edge Framework for Unequal Error Protecting LT Codes H. V. Beltr˜ao Neto, W. Henkel, V. C. da Rocha Jr. Jacobs University Bremen, Germany IEEE ITW(Information.
Network Information Flow Nikhil Bhargava (2004MCS2650) Under the guidance of Prof. S.N Maheshwari (Dept. of Computer Science and Engineering) IIT, Delhi.
Exact Regenerating Codes on Hierarchical Codes Ernst Biersack Eurecom France Joint work and Zhen Huang.
Nour KADI, Khaldoun Al AGHA 21 st Annual IEEE International Symposium on Personal, Indoor and Mobile Radio Communications 1.
20/10/ Cooperative Recovery of Distributed Storage Systems from Multiple Losses with Network Coding Yuchong Hu Institute of Network Coding Please.
A Fast Repair Code Based on Regular Graphs for Distributed Storage Systems Yan Wang, East China Jiao Tong University Xin Wang, Fudan University 1 12/11/2013.
Prioritized Distributed Video Delivery With Randomized Network Coding IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 13, NO. 4, AUGUST 2011 Nikolaos Thomos Jacob.
A Bandwidth Scheduling Algorithm Based on Minimum Interference Traffic in Mesh Mode Xu-Yajing, Li-ZhiTao, Zhong-XiuFang and Xu-HuiMin International Conference.
A Social-Network-Aided Efficient Peer-to-Peer Live Streaming System IEEE/ACM TRANSACTIONS ON NETWORKING, JUNE 2015 Haiying Shen, Yuhua Lin Dept. of Electrical.
Pouya Ostovari and Jie Wu Computer & Information Sciences
Trickle Irrigation: Congestion Relief for Communication with Network Coding: where Shannon Meets Lyapunov Dapeng Oliver Wu Department of Electrical and.
James A. Senn’s Information Technology, 3rd Edition
Salah A. Aly ,Moustafa Youssef, Hager S. Darwish ,Mahmoud Zidan
Xiaoyang Zhang1, Yuchong Hu1, Patrick P. C. Lee2, Pan Zhou1
Presentation transcript:

1 An Update Model for Network Coding in Cloud Storage Systems th Annual Allerton Conference on Communication, Control, and Computing Mohammad Reza Zakerinasab Mea Wang Department of Computer Science University of Calgary

2 Outline ﻪIntroduction ﻪRelated Works ﻪProposed System ﻪDifferential Update Model ﻪEvaluation ﻪConclusion

Network Coding (1/2) ﻪThere are different mechanisms for arranging file copies among storage nodes or devices ﻩstandard RAID architectures ﻩerasure code ﻩnetwork coding ﻪThe network coding in cloud storage systems allows storage nodes to collectively host multiple copies of a file. 3

Network Coding (2/2) ﻪIn a network-coding-assisted cloud storage system ﻩa file is divided into n blocks ﻯencoded using random coefficients. ﻩencoded blocks are distributed in the Cloud. ﻯdecoded the n encoded blocks from any subset of the storage nodes. 4

5 Problem Definition ﻪExisting works have been focusing on mechanisms for preserving the level of redundancy. ﻪHowever, the most frequent operations maintaining coded information in the system up to date performed on files. ﻩfile updates ﻪAny change in the file will impact all coded blocks in the system. ﻩreplace all traces of the file

Application ﻪGoogleDocs : online collaborative office suites, let users create, edit and publish a document collaboratively from around the world. ﻪWhen a file is updated, even changing a single byte can outdate all coded blocks in the system. ﻩre-computations ﻩre-deliveries 6

7 Problems ﻪRe-computing coded blocks is very CPU intensive. ﻪReplacing all the coded blocks consumes large amount of bandwidth.

Proposed Model ﻪSending only the modified parts with a minimum possible overhead. ﻪThe mathematical model of Differential Update Mechanism (DUM) was presented by this paper. ﻩupdate algorithms can be performed on all nodes. ﻪThe simulation results show that the proposed DUM saving a significant bandwidth in a cloud storage system. 8

9 Outline ﻪIntroduction ﻪRelated Works ﻪProposed System ﻪDifferential Update Model ﻪEvaluation ﻪConclusion

Related Works (1/2) ﻪCommercial cloud storage systems, such as Microsoft Azure [8] and Google Cloud [9], utilize source erasure codes. ﻪNetwork coding was originally proposed in information theory in 2000 [1]. ﻪIn contrast to source erasure codes, network coding applies coding at intermediate relay nodes throughout the network. 10

Related Works (2/2) ﻪThe benefits for coding at intermediate nodes include ﻩhigh throughput [1], [3] ﻩefficient routing algorithm design [17] ﻩenergy savings in wireless networking [18] ﻩsecurity [19] ﻪThe closest related works of update problem are on the repair problem ﻩprovide mechanisms for one or more nodes fail [25]. ﻩpreserve the level of redundancy. 11

Reference ﻪ[1] R. Ahlswede, N. Cai, S. R. Li, and R. W. Yeung, “Network Information Flow,” IEEE Transactions on Information Theory, vol. 46, no. 4, pp. 1204– 1216, July ﻪ[3] R. Koetter and M. Medard, “An Algebraic Approach to Network Coding,” IEEE/ACM Transactions on Networking, vol. 11, no. 5, pp. 782–795, October ﻪ[8] B. Calder, J. Wang, A. Ogus, N. Nilakantan, A. Skjolsvold, S. McKelvie, Y. Xu, S. Srivastav, J. Wu, H. Simitci, J. Haridas, C. Uddaraju, H. Khatri, A. Edwards, V. Bedekar, S. Mainali, R. Abbasi, A. Agarwal, M. F. ul Haq, M. I. ul Haq, D. Bhardwaj, S. Dayanand, A. Adusumilli, M. McNett, S. Sankaran, K. Manivannan,, and L. Rigas, “Windows Azure Storage: A Highly Available Cloud Storage Service with Strong Consistency,” in Proc. of the 23rd ACM Symposium on Operating Systems Principles (SOSP), Cascais, Portugal, October , pp. 143–

Reference ﻪ[9] D. Ford, F. Labelle, F. I. Popovici, M. Stokely, V.-A. Truong, L. Barroso, C. Grimes, and S. Quinlan, “Availability in Globally Distributed Storage Systems,” in Proc. of the 9th USENIX Symposium on Operating Systems Design and Implementation (OSDI), Vancouver, BC, October , pp. 1– 14. ﻪ[17] D. S. Lun, N. Ratnakar, R. Koetter, M. Medard, E. Ahmed, and H. Lee, “Achieving Minimum Cost Multicast: A Decentralized Approach Based on Network Coding,” in Proc. of the 24th Conference of the IEEE Communications Society (INFOCOM), Miami, FL, March , pp. 1607–1617. ﻪ[18] H. Rahul, W. Hu, D. Katabi, M. Medard, and J. Crowcroft, “XORs in the Air: Practical Wireless Network Coding,” IEEE/ACM Transactions on Networking, vol. 16, no. 3, pp. 497–510, June ﻪ[19] C. Gkantsidis and P. Rodriguez, “Cooperative Security for Network Coding File Distribution,” in Proc. of the 25th Conference of the IEEE Communications Society (INFOCOM), Barcelona, Spain, April , pp. 1–13. 13

14 Outline ﻪIntroduction ﻪRelated Works ﻪProposed System ﻪDifferential Update Model ﻪEvaluation ﻪConclusion

Modeling the Storage Cloud System 15 Storage Cloud End Hosts

Modeling the Storage Cloud System 16

Network Coding in the Storage Cloud System ﻪWith randomized network coding, a file is divided into n original blocks B = [b 1, b 2, …, b n ], where b i has a fixed number of bytes s. ﻪEncoding a new block c i ﻩthe source node first independently and randomly chooses a set of coding coefficients ε i = [ε i,1, ε i,2, …, ε i,n ] in the Galois field GF(2 8 ). ﻯ. 17 …… B = b 1, b 2, b 3,..…. b j c 1, c 2, c 3,......, c R*n b1,b2,b3,..bnb1,b2,b3,..bn

Network Coding in the Storage Cloud System 18

The Update Problem 19

20 Outline ﻪIntroduction ﻪRelated Works ﻪProposed System ﻪDifferential Update Model ﻪEvaluation ﻪConclusion

Differential Update Model (DUM) ﻪThey believe that the update problem is just as essential as the repair problem. ﻪThey propose the DUM to update coded blocks by delivering only the blocks that are affected by the updates. ﻩavoids transmissions of the entire file for each update. 21

Updating Coded Blocks 22

Updating Coded Blocks 23

Updating Storage Nodes ﻪA significant amount of bandwidth can be saved since most updates will affect only a smaller portion of a file. ﻪRecover Δ from Δ’ ﻩreconstructed by inserting the zero δ-vectors into Δ’ according to the update vector u. 24

Updating Storage Nodes ﻪSend the non-zero rows of Δ’ = [δ 1, δ 2, δ 3, …, δ n’ ] ﻪUpdate vector u v+1 = [u v+1,1, u v+1,2,..., u v+1,n ] ﻩ. ﻪEncode the matrix Δ’, ﻪDecode the matrix Δ’, 25

Updating Storage Nodes 26

Updating Target Nodes 27

Aggregating Updates Across Multiple Versions (1/4) 28

Aggregating Updates Across Multiple Versions (2/4) ﻪA coded block in version v may be expressed in terms of the coded blocks of version 0 and the summation of coded δ-blocks from version 0 to version m. 29

Aggregating Updates Across Multiple Versions (3/4) ﻪTo support such an aggregated update, the update table that stores ﻩthe update vectors ﻩthe coded δ- blocks ﻪIf a storage node misses one or more updates, then find the first non-empty entry following the empty entries. ﻩthe aggregated Δ’ containing changes across the missing versions. 30

Aggregating Updates Across Multiple Versions (4/4) ﻪComputational overhead ﻩgeneration of the aggregated update vector ﻯ. ﻩgeneration of n’ aggregated coded δ-vectors ﻯ. 31

32 Outline ﻪIntroduction ﻪRelated Works ﻪProposed System ﻪDifferential Update Model ﻪEvaluation ﻪConclusion

Numerical Analysis 33

Experiment Results (1/7) ﻪThe number of blocks n should be no more than 100 to ensure that network coding operates at a rate faster than a typical transmission rate in a network. ﻪWe compare the performance of conventional network coding update (NC) and DUM. 34

Experiment Results (2/7) ﻪBandwidth usages 35

Experiment Results (3/7) ﻪBandwidth usage and Computational cost 36

Experiment Results (4/7) ﻪComputational cost on storage nodes dominates the overall cost. 37

Experiment Results (5/7) ﻪAggregated updates 38

Experiment Results (6/7) ﻪUpdate affects 39

Experiment Results (7/7) ﻪSimulation study ﻪDiff [31], bsDiff [32] 40 [31] J. W. Hunt and M. D. McIlroy, “An Algorithm for Differential File Comparison,” Bell Laboratories 41, Computing Science Technical Report, June [32] C. Percival, “Matching with Mismatches and Assorted Applications,” Ph.D. dissertation, Wadham College, University of Oxford, 2006.

41 Outline ﻪIntroduction ﻪRelated Works ﻪProposed System ﻪDifferential Update Model ﻪEvaluation ﻪConclusion

Conclusion ﻪDUM saves both the communication and computational costs, unless the update affects almost the entire file ﻪDUM conserves CPU cycles for large files and when the data is more scattered in the Cloud. ﻪThis paper only considered n’ is smaller than n, what’s happened if n’ is large than n ? 42