1 AReNA: Adaptive Distributed Catalog Infrastructure Based On Relevance Networks Vladimir Zadorozhny, University of Pittsburgh, Pittsburgh, PA Avigdor.

Slides:



Advertisements
Similar presentations
High Performance Computing Course Notes Grid Computing.
Advertisements

4.1.5 System Management Background What is in System Management Resource control and scheduling Booting, reconfiguration, defining limits for resource.
A Server-less Architecture for Building Scalable, Reliable, and Cost-Effective Video-on-demand Systems Jack Lee Yiu-bun, Raymond Leung Wai Tak Department.
Topology Generation Suat Mercan. 2 Outline Motivation Topology Characterization Levels of Topology Modeling Techniques Types of Topology Generators.
Effective Coordination of Multiple Intelligent Agents for Command and Control The Robotics Institute Carnegie Mellon University PI: Katia Sycara
Zoetrope: Interacting with the Ephemeral Web Eytan Adar, Mira Dontcheva James Fogarty, Dan Weld University of Washington & Adobe Systems.
Copyright 2009 FUJITSU TECHNOLOGY SOLUTIONS PRIMERGY Servers and Windows Server® 2008 R2 Benefit from an efficient, high performance and flexible platform.
1 Virtual Machine Resource Monitoring and Networking of Virtual Machines Ananth I. Sundararaj Department of Computer Science Northwestern University July.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice S 3 : A Scalable Sensing Service.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
Wide-scale Botnet Detection and Characterization Anestis Karasaridis, Brian Rexroad, David Hoeflin.
Bgpmon real-time collection and distribution of BGP updates Dave Matthews, Yan Chen, Dan Massey Department of Computer Science Colorado State University.
Object Naming & Content based Object Search 2/3/2003.
CS218 – Final Project A “Small-Scale” Application- Level Multicast Tree Protocol Jason Lee, Lih Chen & Prabash Nanayakkara Tutor: Li Lao.
Mariam Salloum (YP.com) Xin Luna Dong (Google) Divesh Srivastava (AT&T Research) Vassilis J. Tsotras (UC Riverside) 1 Online Ordering of Overlapping Data.
Winter Retreat Connecting the Dots: Using Runtime Paths for Macro Analysis Mike Chen, Emre Kıcıman, Anthony Accardi, Armando Fox, Eric Brewer
© 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Automated Workload Management in.
New Challenges in Cloud Datacenter Monitoring and Management
23 September 2004 Evaluating Adaptive Middleware Load Balancing Strategies for Middleware Systems Department of Electrical Engineering & Computer Science.
Microsoft ® Official Course Monitoring and Troubleshooting Custom SharePoint Solutions SharePoint Practice Microsoft SharePoint 2013.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
Distributed Data Stores – Facebook Presented by Ben Gooding University of Arkansas – April 21, 2015.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
Felix Cuadrado Teaching: Big Data Processing (QMUL) Internet.
SCAN: a Scalable, Adaptive, Secure and Network-aware Content Distribution Network Yan Chen CS Department Northwestern University.
Application-Layer Anycasting By Samarat Bhattacharjee et al. Presented by Matt Miller September 30, 2002.
Social scope: Enabling Information Discovery On Social Content Sites
HERO: Online Real-time Vehicle Tracking in Shanghai Xuejia Lu 11/17/2008.
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
SeLeNe - Architecture George Samaras Kyriakos Karenos Larnaca – April 2003 THE UNIVERSITY OF CYPRUS.
PIER & PHI Overview of Challenges & Opportunities Ryan Huebsch † Joe Hellerstein † °, Boon Thau Loo †, Sam Mardanbeigi †, Scott Shenker †‡, Ion Stoica.
©NEC Laboratories America 1 Huadong Liu (U. of Tennessee) Hui Zhang, Rauf Izmailov, Guofei Jiang, Xiaoqiao Meng (NEC Labs America) Presented by: Hui Zhang.
Presentation of Master’s thesis Simulation and Analysis of Wireless Mesh Network In Smart Grid / Advanced Metering Infrastructure Philip Huynh.
SPREAD TOOLKIT High performance messaging middleware Presented by Sayantam Dey Vipin Mehta.
Benchmarking Interactive Social Networking Actions Shahram Ghandeharizadeh Director of Database Lab Computer Science Department University of Southern.
Client-Server Processing, Parallel Database Processing and Distributed Database Systems. KEVIN ROBERTS ANIKET MURLIDHARAN.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
Connecting different ethnomusicological archives with ethnoArc Maurice Mengel Music Archive of the Ethnological Museum, National Museum in Berlin (EMEM)
The Saguaro Digital Library for Natural Asset Management Dr. Sudha RamSudha Ram Advanced Database Research Group Dept. of MIS The University of Arizona.
Logistical Networking Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer.
A Peer-to-Peer Approach to Resource Discovery in Grid Environments (in HPDC’02, by U of Chicago) Gisik Kwon Nov. 18, 2002.
Freelib: A Self-sustainable Digital Library for Education Community Ashraf Amrou, Kurt Maly, Mohammad Zubair Computer Science Dept., Old Dominion University.
Validating an Access Cost Model for Wide Area Applications Louiqa Raschid University of Maryland CoopIS 2001 Co-authors V. Zadorozhny, T. Zhan and L. Bright.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
Copyright 2007, Information Builders. Slide 1 Machine Sizing and Scalability Mark Nesson, Vashti Ragoonath June 2008.
Globally Distributed Content Delivery Presenter: Baoning Wu 03/25/2003.
Peer-to-Peer Result Dissemination in High-Volume Data Filtering Shariq Rizvi and Paul Burstein CS 294-4: Peer-to-Peer Systems.
Knowledge Modeling and Discovery. About Thetus Thetus develops knowledge modeling and discovery infrastructure software for customers who: Have high-value.
Peter R Pietzuch and Jean Bacon Peer-to-Peer Overlay Networks in an Event-Based Middleware DEBS’03, San Diego, CA, USA,
Securing the Grid & other Middleware Challenges Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer.
Tapestry : An Infrastructure for Fault-tolerant Wide-area Location and Routing Presenter : Lee Youn Do Oct 5, 2005 Ben Y.Zhao, John Kubiatowicz, and Anthony.
Efficient Evaluation of Queries in a Mediator for WebSources Louiqa Raschid University of Maryland Joint work with Zadorozhny, Vidal, Urhan, Bright.
University of Maryland Scaling Heterogeneous Information Access for Wide area Environments Michael Franklin and Louiqa Raschid.
Societal-Scale Computing: The eXtremes Scalable, Available Internet Services Information Appliances Client Server Clusters Massive Cluster Gigabit Ethernet.
(2) Organize information processing centers environment, the various functions and details Information technology audit: An information technology audit,
LIOProf: Exposing Lustre File System Behavior for I/O Middleware
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Spark on Entropy : A Reliable & Efficient Scheduler for Low-latency Parallel Jobs in Heterogeneous Cloud Huankai Chen PhD Student at University of Kent.
AUTONOMIC COMPUTING B.Akhila Priya 06211A0504. Present-day IT environments are complex, heterogeneous in terms of software and hardware from multiple.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Hadoop Aakash Kag What Why How 1.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Data/Analysis Challenges in the Electronic Business Environment
Data/Analysis Challenges in the Electronic Business Environment
Data Warehousing and Data Mining
The Globus Toolkit™: Information Services
Batyr Charyyev.
Next-generation Internet architecture
Presentation transcript:

1 AReNA: Adaptive Distributed Catalog Infrastructure Based On Relevance Networks Vladimir Zadorozhny, University of Pittsburgh, Pittsburgh, PA Avigdor Gal, Technion, Haifa Louiqa Raschid, University of Maryland, College Park, MD Quiang Ye, University of Pittsburgh, Pittsburgh, PA Nebula Project:

query optimization evaluation output Statistics about data data Relevant statistics: response time, network delay, data transfer rate, etc. Data sources are remote, distributed, heterogeneous Network is not (well) predictable Statistics is not reliable Networked Query Processing

data Statistics about data query optimization evaluation output Networked Queries with Distributed Catalog Scalability ?

4 Performance Monitoring for Server Selection performance monitor client LEGEND: Handle system Object handle: content server Object Objective: maintaining comprehensive performance repository for WANs (e.g., access latencies). Motivated, in part, by the evolution of information-centric name resolution services, e.g., CNRI Handle system.Handle system Challenge: scaling to the presence of hundreds of servers and thousands of clients, managing millions of constantly changing Performance Profiles.

performance monitor performance profile-based cluster content server client LEGEND: Profile-Based Performance Monitoring PM Aggregation ?

6 Aggregated Latency Profiles A client/server pair is characterized by Individual Latency Profiles (iLP). iLPs capture latency distributions experienced by clients when connecting to a server. iLP1 = iLP2 = iLP3 = Similar non-randomly associated iLPs are aggregated in Relevance Networks iLP similarity measures: Correlation and Mutual Information iLP1 iLP3 iLP

7 Discovering Non-random Associations with Relevance Networks (RNs) LP1 LP4LP2 LP3 Threshold= LP1 LP4LP2 LP3 Threshold= We adopt RNs as a management tool, to manage large numbers of iLPs.

8 Relevance Networks

9 AReNA: Architecture Data Collection Data Preparation RN Generation and Analysis Performance Prediction VIZUALIZERVIZUALIZER AReNA dynamically analyzes and visualizes meaningful relationships among client/ server pairs using Relevance Networks (RNs). Relationships are evaluated using passive measurements made by client applications and gathered on a continuous basis. RNs allow AReNA managing thousands of constantly changing iLPs Large-Scale Experimental Testbeds CNRI Handle System PlanetLab Overlay Around Latency Profiles

10 AReNA: Screenshot

11 Demo Tuesday: 16:00-17:30 Friday: 09:00-10:30