Navigating and Sharing in a Decentralized World
Francisco Matias Cuenca-Acuna

People
Graduate students
  − Christopher Peery
  − Konstantinos Kleisouris
Faculty
  − Richard Martin
  − Thu Nguyen

Federated computing
The current trend toward ubiquitous Internet connectivity is driving a new model of federated computing
  − Computing systems that are geographically distributed and may span multiple organizations
Concurrently, deep penetration of computer usage
  − 500 million PCs in operation worldwide (1 for every 12 people)
  − 80% of them are in desktops
  − 40% annual growth
  − 600 million Internet users worldwide
→ Federated computing is appearing at every level
  − Social group-based sharing
  − P2P: Gnutella, KaZaA, DirectConnect
  − Web-based: eBay, Google groups, Yahoo groups, DMOZ
  − Scientific computing
  − Many emerging research grids:
  − Business-to-business e-commerce
Source

The challenge
Federated computing provides the opportunity to harness vast amounts of resources
  − Consider just data sharing
  − Users produce 740TB of information per year
  − Information per person is growing continuously
  − 80% annual growth in total disk capacity sold per year
→ Emergence of huge distributed data repositories
  − Local community of 3000 undergraduate students sharing 20TB of data
  − WWW: Google had indexed 1 billion pages (20TB of content) by 2000
  − The European Data Grid has only hundreds of nodes but petabytes of data
Challenge: how to manage and actually use these resources
  − Decentralized control
  − Widely distributed
  − Heterogeneous components
Source

The PlanetP Project
Information and resource management for networked communities
  − Data sharing
  − Provide content-based access & ranking of results
  − Allow users to cooperatively organize data
  − Provide predictable data availability
  − Deployment, monitoring, and management of federated services
  − Provide a common runtime environment
  − Distributively follow sysadmin guidelines for service deployment
  − Example: UDDI naming service for web services
Dealing with decentralization
  − Self-management & self-configuration
  − Autonomous cooperation
  − Loosely synchronized global information
  − Randomized algorithms

Current state of the project
  − Data propagation
  − Content indexing and ranking
  − Automatic replication for availability
  − Global namespace and storage management
  − Service management

Current state of the project: Data propagation
  − Based on epidemic communication
  − Very resilient to node/network failures
  − Membership management
  − Every node has a loosely synchronized view of the community
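The slide names epidemic communication but gives no protocol details, so the following is only a minimal sketch of the general idea: a push-pull gossip round in which each peer merges its membership view with one randomly chosen peer it already knows. The function names, the ring-shaped bootstrap, and the in-memory `views` dictionary are all assumptions made for illustration and are not taken from PlanetP.

```python
import random


def gossip_round(views, rng):
    """One push-pull round: each peer merges views with one peer it knows."""
    for peer in list(views):
        known_others = sorted(views[peer] - {peer})
        if not known_others:
            continue  # nobody to talk to yet
        partner = rng.choice(known_others)
        merged = views[peer] | views[partner]
        # Both sides end the exchange with the union of what they knew.
        views[peer] = set(merged)
        views[partner] = set(merged)


def rounds_until_converged(num_peers, seed=0, max_rounds=1000):
    """Count rounds until every peer's view lists the whole community."""
    rng = random.Random(seed)
    # Assumed bootstrap: each peer knows itself and one neighbour on a ring.
    views = {p: {p, (p + 1) % num_peers} for p in range(num_peers)}
    everyone = set(range(num_peers))
    for round_no in range(1, max_rounds + 1):
        gossip_round(views, rng)
        if all(view == everyone for view in views.values()):
            return round_no
    return None  # did not converge within max_rounds


if __name__ == "__main__":
    print(rounds_until_converged(64))
```

Between rounds the views differ from peer to peer, which is one way to read "loosely synchronized": every peer eventually learns the full membership, but there is no instant at which all peers are guaranteed to agree.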

Current state of the project: Content indexing and ranking
  − Distributed information ranking algorithm
  − Allows search-engine-like queries
  − Two-step search & rank to deal with outdated information
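The slide does not spell out the ranking algorithm, so the sketch below only illustrates what a two-step search and rank could look like. It assumes each peer advertises a set of terms through the gossiped global view (`summaries`, which may be stale) and keeps its real documents locally (`local_docs`). Step 1 ranks peers using the summaries alone; step 2 asks only the best-ranked peers to score their current documents, so stale summaries cost a wasted contact rather than a wrong result. The scoring formulas and all names are illustrative assumptions.

```python
import math
from collections import Counter


def rank_peers(query, summaries):
    """Step 1: score peers from the (possibly stale) gossiped term summaries."""
    n = len(summaries)
    scores = {}
    for peer, terms in summaries.items():
        score = 0.0
        for term in query:
            if term in terms:
                holders = sum(1 for t in summaries.values() if term in t)
                # Terms advertised by few peers count for more.
                score += math.log(1 + n / holders)
        if score > 0:
            scores[peer] = score
    return sorted(scores, key=lambda p: (-scores[p], p))


def rank_documents(query, peers, local_docs):
    """Step 2: the chosen peers score their current local documents."""
    hits = []
    for peer in peers:
        for name, text in local_docs.get(peer, {}).items():
            counts = Counter(text.lower().split())
            score = sum(counts[t] for t in query)
            if score > 0:
                hits.append((score, name, peer))
    return sorted(hits, key=lambda h: (-h[0], h[1]))


def search(query, summaries, local_docs, top_peers=2):
    """Run both steps: pick candidate peers, then rank their documents."""
    candidates = rank_peers(query, summaries)[:top_peers]
    return rank_documents(query, candidates, local_docs)


if __name__ == "__main__":
    summaries = {"a": {"gossip", "replica"}, "b": {"gossip"}, "c": {"ranking"}}
    # Peer "b" has since deleted its matching document: its summary is stale.
    local_docs = {"a": {"a1.txt": "gossip gossip protocol"}, "b": {}, "c": {"c1.txt": "ranking"}}
    print(search(["gossip"], summaries, local_docs))
```

In the usage example, peer "b" still advertises a term it no longer holds; it is contacted in step 2 but contributes no hits.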

Current state of the project: Automatic replication for availability
  − Allow users to specify data availability
  − Present a probabilistic availability model
  − Monitor availability as community changes
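The slide does not give the availability model itself. As a hedged illustration only, the sketch below uses the simplest possible model: every replica sits on a host that is online independently with the same probability `p_online`, so a file with `n` replicas is reachable with probability 1 - (1 - p_online)^n. Both the independence assumption and the function names are introduced here for illustration and may differ from the project's actual model.

```python
def availability(replicas, p_online):
    """Probability that at least one of `replicas` independent hosts is online."""
    return 1.0 - (1.0 - p_online) ** replicas


def replicas_needed(target, p_online, max_replicas=1000):
    """Smallest replica count whose availability reaches `target`, or None."""
    for n in range(1, max_replicas + 1):
        if availability(n, p_online) >= target:
            return n
    return None


if __name__ == "__main__":
    # A user asks for 99% availability while hosts are online half the time.
    print(replicas_needed(0.99, 0.5))
    # If monitoring shows hosts are now online only a quarter of the time,
    # the same target needs more replicas.
    print(replicas_needed(0.99, 0.25))
```

Re-running `replicas_needed` with a freshly measured `p_online` is one simple reading of "monitor availability as community changes".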

Work in progress: Global namespace and storage management
  − File system interface over communal content
  − Unlike the Web, the namespace is writable
  − Dynamic namespace management
  − Automated local storage management
  − Remove content if we can recover it
  − Hoarding for disconnected operation
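The slide describes this work only at the level of goals. Purely as an illustration of "remove content if we can recover it" combined with hoarding, the sketch below picks local copies to evict when space is needed: a file is a candidate only if it is not hoarded and enough replicas are believed to exist elsewhere. The record fields (`name`, `size`, `remote_replicas`, `hoarded`), the threshold, and the ordering rule are hypothetical.

```python
def choose_evictions(files, bytes_needed, min_remote_replicas=2):
    """Pick local copies to drop, never touching hoarded or unrecoverable files."""
    candidates = [
        f for f in files
        if not f["hoarded"] and f["remote_replicas"] >= min_remote_replicas
    ]
    # Drop the most widely replicated copies first; break ties by larger size.
    candidates.sort(key=lambda f: (-f["remote_replicas"], -f["size"], f["name"]))
    evicted, freed = [], 0
    for f in candidates:
        if freed >= bytes_needed:
            break
        evicted.append(f["name"])
        freed += f["size"]
    return evicted, freed


if __name__ == "__main__":
    files = [
        {"name": "a", "size": 100, "remote_replicas": 3, "hoarded": False},
        {"name": "b", "size": 500, "remote_replicas": 0, "hoarded": False},
        {"name": "c", "size": 200, "remote_replicas": 5, "hoarded": True},
        {"name": "d", "size": 50, "remote_replicas": 2, "hoarded": False},
    ]
    print(choose_evictions(files, bytes_needed=120))
```

In the usage example, file "b" is kept because no other copy exists, and file "c" is kept because the user hoarded it for disconnected operation, even though it is the most widely replicated.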

Work in progress: Service management
  − Distributed runtime for Web Services
  − Administrators just dictate the policy
  − They reason about capacity, availability, and privacy issues
  − Provide self-deployment and monitoring

The PlanetP Project
Questions?