HomeViews: P2P Middleware for Personal Data Sharing Applications Roxana Geambasu, Magdalena Balazinska, Steve Gribble, Hank Levy University of Washington.

Slides:



Advertisements
Similar presentations
EzScoreboard.com A Fully Integrated Administration Service.
Advertisements

P2P data retrieval DHT (Distributed Hash Tables) Partially based on Hellerstein’s presentation at VLDB2004.
Facilitating Communal Data Sharing in Public Clouds Roxana Geambasu Steve Gribble Hank Levy University of Washington.
Chapter 7 LAN Operating Systems LAN Software Software Compatibility Network Operating System (NOP) Architecture NOP Functions NOP Trends.
Distributed Systems 1 Topics  What is a Distributed System?  Why Distributed Systems?  Examples of Distributed Systems  Distributed System Requirements.
Chapter 6 User Protections in OS. csci5233 computer security & integrity (Chap. 6) 2 Outline User-level protections 1.Memory protection 2.Control of access.
Vanish: Increasing Data Privacy with Self-Destructing Data Roxana Geambasu Yoshi Kohno Amit Levy Hank Levy University of Washington.
Objektorienteret Middleware Presentation 2: Distributed Systems – A brush up, and relations to Middleware, Heterogeneity & Transparency.
Company Confidential 1 © 2005 Nokia V1-Filename.ppt / yyyy-mm-dd / Initials Towards a mobile content delivery network with a P2P architecture Carlos Quiroz.
PeerDB: A P2P-based System for Distributed Data Sharing Wee Siong Ng, Beng Chin Ooi, Kian-Lee Tan, Aoying Zhou Shawn Jeffery CS294-4 Peer-to-Peer Systems.
Name Services Jessie Crane CPSC 550. History ARPAnet – experimental computer network (late 1960s) hosts.txt – a file that contained all the information.
Cis e-commerce -- lecture #6: Content Distribution Networks and P2P (based on notes from Dr Peter McBurney © )
Spotlighting Decentralized P2P File Sharing Archie Kuo and Ethan Le Department of Computer Science San Jose State University.
CS-550 (M.Soneru): Protection and Security - 1 [SaS] 1 Protection and Security.
OCT1 Principles From Chapter One of “Distributed Systems Concepts and Design”
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 1: Introduction to Windows Server 2003.
EEC-681/781 Distributed Computing Systems Lecture 3 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
1 Client-Server versus P2P  Client-server Computing  Purpose, definition, characteristics  Relationship to the GRID  Research issues  P2P Computing.
OmStore Cloud API Harshit Agarwal Sohil Habib. About Us ●We are graduate students at CMU ●Currently at CMU Silicon Valley campus ●Working part time with.
What Can Databases Do for Peer-to-Peer Steven Gribble, Alon Halevy, Zachary Ives, Maya Rodrig, Dan Suciu Presented by: Ryan Huebsch CS294-4 P2P Systems.
Definition of terms Definition of terms Explain business conditions driving distributed databases Explain business conditions driving distributed databases.
Sanzaru Capability-Based Interactions for Web Applications Raluca Sauciuc Shaunak Chatterjee University of California, Berkeley Motivation Limitations.
Security Management.
Middleware for P2P architecture Jikai Yin, Shuai Zhang, Ziwen Zhang.
Understanding Active Directory
A centralized system.  Active Directory is Microsoft's trademarked directory service, an integral part of the Windows architecture. Like other directory.
Organizing and Sharing Distributed Web Objects with Menagerie Roxana Geambasu, Cherie Cheung, Alex Moshchuk, Steve Gribble, Hank Levy University of Washington.
MS Access Advanced Instructor: Vicki Weidler Assistant:
1 Networks, advantages & types of What is a network? Two or more computers that are interconnected so they can exchange data, information & resources.
1 Lecture 20: Parallel and Distributed Systems n Classification of parallel/distributed architectures n SMPs n Distributed systems n Clusters.
Networked Information Systems 1 Advantages of and classified by their size & architecture or design.
1 Multi Cloud Navid Pustchi April 25, 2014 World-Leading Research with Real-World Impact!
Designing Group Security Designing security groups Designing user rights.
Master Thesis Defense Jan Fiedler 04/17/98
Open Search Office Web Services Database Doc Mgt Sys Pipeline Index Geospatial Analysis Text Search Faceting Caching Query parsing Clustering Synonyms.
Publish / Subscribe Database Log Shipping over Bittorent P2P CS 848 Fall 2006 Univeristy of Waterloo Project Presentation by N. T c h e r v e n s k i.
PS Security By Deviprasad. Agenda Components of PS Security Security Model User Profiles Roles Permission List. Dynamic Roles Static Roles Building Roles/Rules.
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
OS2- Sem ; R. Jalili Introduction Chapter 1.
Kyung Hee University 1/41 Introduction Chapter 1.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Code Applications Tamas Kiss Centre for Parallel.
Freelib: A Self-sustainable Digital Library for Education Community Ashraf Amrou, Kurt Maly, Mohammad Zubair Computer Science Dept., Old Dominion University.
1 Peer-to-Peer Technologies Seminar by: Kunal Goswami (05IT6006) School of Information Technology Guided by: Prof. C.R.Mandal, School of Information Technology.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Has Agent Scripting Got You FRUSTRATED?. Agent Scripting Simplified!
ADVANTAGES OF DATA BASE MANAGEMENT SYSTEM. TO BE DICUSSED... Advantages of Database Management System  Controlling Data RedundancyControlling Data Redundancy.
Wireless and Mobile Security
PeerDB : A P2P-based System for Distributed Data Sharing DB Lab. M.S. 3 LEE MIN YOUNG Wee Siong Ng, Beng Chin Ooi, Kian-Lee Tan, Aoying Zhou, 19th International.
The world leader in serving science Overview of Thermo 21 CFR Part 11 tools Overview of software used by multiple business units within the Spectroscopy.
Key Management. Authentication Using Public-Key Cryptography  K A +, K B + : public keys Alice Bob K B + (A, R A ) 1 2 K A + (R A, R B,K A,B ) 3 K A,B.
 Distributed Database Concepts  Parallel Vs Distributed Technology  Advantages  Additional Functions  Distribution Database Design  Data Fragmentation.
第 1 讲 分布式系统概述 §1.1 分布式系统的定义 §1.2 分布式系统分类 §1.3 分布式系统体系结构.
Active Directory. Computers in organizations Computers are linked together for communication and sharing of resources There is always a need to administer.
Introduction to AFS IMSA Intersession 2003 An Overview of AFS Brian Sebby, IMSA ’96 Copyright 2003 by Brian Sebby, Copies of these slides.
Grid Execution Management for Legacy Code Architecture Exposing legacy applications as Grid services: the GEMLCA approach Centre.
Active Directory Domain Services (AD DS). Identity and Access (IDA) – An IDA infrastructure should: Store information about users, groups, computers and.
Securing Access to Data Using IPsec Josh Jones Cosc352.
ASHRAY PATEL Protection Mechanisms. Roadmap Access Control Four access control processes Managing access control Firewalls Scanning and Analysis tools.
1 Case Study: Business Intelligence & Customer Data Customer Support Web-based Dashboard VP Marketing SQL XSLT XML Data Grid Customer Data Customer Order.
Chapter 14: System Protection
Self Healing and Dynamic Construction Framework:
CHAPTER 3 Architectures for Distributed Systems
Network Services.
IIS.
Advanced Operating Systems
Outline Review of Quiz #1 Distributed File Systems 4/20/2019 COP5611.
Data Management Components for a Research Data Archive
Designing IIS Security (IIS – Internet Information Service)
Presentation transcript:

HomeViews: P2P Middleware for Personal Data Sharing Applications Roxana Geambasu, Magdalena Balazinska, Steve Gribble, Hank Levy University of Washington

2 HomeViews P2P personal data sharing Views for organizing and sharing personal data Lightweight protection using capabilities

3 Outline Motivation and goals HomeViews design Evaluation Conclusions

4 Motivation Bob 3 problems 1) How to organize all this data? 2) How to share files with friends? 3) How to integrate others’ shared files with their own?

5 1) How can users organize their data? Hierarchy of directories Views over the personal files  E.g. tools supporting views: Spotlight, WinFS, iTunes Jan 2006 Dec ChristmasParties Bob Goal: Use views to organize personal data Files

6 2) How can users share their data? Protected and selective sharing share Mom ? P2P sharing? Centralized sites? Christmas Goal: Lightweight P2P selective sharing of views Christmas Parties Files Bob

7 3) How can users integrate shared files with their own? Organize and share the global file collection further share Mom Christmas ItalyFrance Goal: Seamless access and composition of views ChristmasParties Bob Files

8 Goals Personal data organization using views Lightweight P2P selective sharing of views  No account registration or management Seamless access and composition of views

9 Outline Motivation and goals HomeViews design Evaluation Conclusions

10 HomeViews Middleware Models the file system as a relation with SQL-based QL to build views over personal files Lightweight protection  No user accounts  Enable P2P selective sharing Provides distribution transparency

11 Design Challenges 1) How to protect views in P2P, with no user accounts? 2) How to integrate the new protection with DB views? 3) How to execute distributed queries in new environment?

12 Challenge 1: Lightweight Protection Use capabilities from operating systems Capability = secure token that:  bundles an object name and a set of access rights  provides holder with authority to execute the specified actions on the named object Without a valid capability for an object, one cannot access that object object ID rights

13 HomeViews Capabilities Serve to access, protect, and share views Long random password ensures unforgeability of capability probabilistically Example rights: select, drop, alter, catalog lookup, etc. view ID password IP hint 128 bits 128 bits 32 bits CapTable (in local catalog) Capability view ID password rights lookup

14 Benefits of Capabilities Facilitate sharing  Grant access simply by ing capabilities  Sharing is similar to sharing private Web pages Ease management  No user accounts  No global protection structure Capabilities perfectly match our unmanaged P2P environment

15 Surprisingly easy to modify SQL to support capability-based protection: SQLCapa SQLCapa accommodates:  Capability-based naming of views  Creation of (restricted) capabilities  Capability revocation Challenge 2: Integrating OS Capabilities with DB Views

16 SQLCapa Use capabilities to name views in queries CREATE VIEW returns a fully-enabled capability  This capability can be used to query the new view  Copies of this capability can be shared Christmas Bob C SELECT * FROM WHERE occasion = `Christmas’ C B0 Bob_ BaseView C B0 CREATE VIEW Christmas => Returns C C Mom C' C Select all Christmas photos

17 SQLCapa Mom Italy Mom’s BaseView C M0 C IT Compose views seamlessly CREATE VIEW Italy AS SELECT * FROM C’ C WHERE place = `Italy’ UNION SELECT * FROM C M0 WHERE place = `Italy’ => Returns C IT C' C (to Bob’s Christmas view)

18 Challenge 3: Distributed Query Execution Two techniques: 1. Recursive evaluation Can be slow C1C1 Base views Files Queries Results C2C2 C3C3 C6C6 C5C5 C4C4 Peer node C Capability

19 Challenge 3: Distributed Query Execution Two techniques: 2. Query rewrite and evaluation Query rewrite yields better performance, but needs right to lookup view definition in catalog Files Step 1. Recursive lookups of view definitions & query rewrite Step 2. Query execution on base views C1C1 Base views C2C2 C3C3 C6C6 C5C5 C4C4

20 Putting it all together: HomeViews Architecture

21 Outline Motivation and goals HomeViews design Evaluation Conclusions

22 Evaluation Questions Q1: What are the system’s bottlenecks? Q2: Are distributed queries fast enough to be practical? Q3: How easy is it to build P2P file sharing apps on top of HomeViews?

23 Q1: What are the system’s bottlenecks? Answer: Beagle search engine and/or network

24  Distribute queries on multiple machines  Simulate broadband (2Mbps, 20ms) Q2: Are distributed queries fast enough to be practical? Method : Answer: Medium-size queries are responsive  Query rewrite technique scales well with number of nodes Query rewrite time (ms)

25 Q3: How easy is it to build P2P sharing apps on HomeViews? Modified Gallery to run atop HomeViews Easy to build  11 out of 787 files touched, 488 lines of code changed Gallery v.1ViewGallery CentralizedPeer-to-peer Static albumsDynamic albums Photos are uploaded by users Photos “appear” in album when they match album’s query Sharing with registered users Sharing with anybody by ing capability

26 Outline Motivation and goals HomeViews design Evaluation Conclusions

27 Conclusions HomeViews – middleware for building personal data sharing applications  Peer-to-Peer environment  Database Views for flexible data organization  OS Capabilities for lightweight selective sharing of views  Seamless composition of local and remote views Easy to build powerful personal data organization and sharing applications

28 Questions?

29 Related Work Personal data organization  Views: WinFS, Spotlight  Personal information management: Haystack [Karger et.al., CIDR '05] File sharing  P2P: Kazaa, BitTorrent Access control  ACLs  Cryptography for selective sharing [Miklau et.al., VLDB '03]  Capabilities [Wulf et.al., Comm.ACM 74; Pose, IEEE Comp. Society '01]

30 Capabilities versus ACLs Advantages of capabilities:  Facilitate sharing  Ease management by not requiring user accounts  Preferable for the unmanaged P2P home environment, in which users want to simplify selective sharing Advantages of ACLs:  Support tight access control  Enable accountability and auditing  Preferable in a business / banking environment Thus, capabilities are better suited for our environment and application

31 Future directions Caching Replication More applications User experience

32 Query execution performance with Spotlight Spotlight is much faster than Beagle The HomeViews overhead remains beyond 25% of the total local query time Result size (# filenames) Beagle time (ms) Spotlight time (ms) 10001, , ,465546

33 Performance of HomeViews queries with Spotlight vs. Beagle Estimated query performance with Spotlight HomeViews query performance with Beagle