Chapter 4 Realtime Widely Distributed Instrumentation System

Useful and robust operation of a realtime distributed system requires many capabilities:
- Automated management of data streams and distributed components
- Dynamic scheduling
- Resource reservation
These capabilities are built on a supporting architecture, middleware, and low-level services such as realtime cataloging.

Distributed realtime applications
- High-speed data streams result from online instruments and imaging systems.
- High-speed networks make it possible to collect, organize, store, analyze, and distribute the large data objects that result from such data streams.
- Health care imaging systems: both high data rates and realtime cataloging.
- High energy physics experiments: high data rates and volumes have to be processed and archived in realtime.

Problem characterization and prototypes
Realtime management of distributed systems involves:
- distributed data collection and management
- distributed data analysis and cataloging
Each of these requires a supporting infrastructure of middleware and of system and communication services. The required middleware services include:
- automated cataloging (chapter 5)
- automated monitoring and management of distributed components (chapters 14, 15)
- policy-based access control to support scheduling and resource allocation (chapter 19)

Nature of the remote operation
- A distributed instrument can be remote in space, scale, or time.
- Remote in space is the typical circumstance of network-distributed scientific collaboration.
- Another common circumstance is that the control function is remote in scale, so that direct control is not possible.

Cardioangiography
- A key aspect of realtime data is immediate and automated processing to organize and catalog the data.
- Data is generated in large volume and with high throughput, and the people generating the data are geographically separated from the people cataloging or using it.
- A realtime digital library system (WALDO) collects data from the instrument and automatically processes, catalogs, and archives each data unit together with its derived data and metadata.
- WALDO uses an object-oriented approach to the capture, cataloging, and management of large data objects.

WALDO software architecture
- Data collection system
- High-speed network-based cache providing intermediate storage for processing and high-speed application access
- Processing mechanisms for various sorts of data
- Data management for automatic cataloging and metadata generation
- Data access interfaces, including application-oriented interfaces
- Flexible mechanisms for providing various search strategies
- Transparent security that provides strong access control
- Transparent storage management for data components
- Curator interfaces for managing both the metadata and the large data object collection
- User access interfaces

Particle accelerator
This example is a detector system at a high energy physics particle accelerator. Modern detectors like STAR generate 20 to 40 MB/s. The data must be processed in two phases:
- Phase 1: the detector puts out a steady-state, high-data-rate stream.
- Phase 2: the data is analyzed. In this system the analysis uses DPSS (Distributed Parallel Storage System).
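
To give a sense of scale for the 20 to 40 MB/s figure, the arithmetic below converts a steady data rate into the volume accumulated over a period. The function name and the decimal convention (1 GB = 1000 MB) are assumptions made for illustration, and the one-hour period is an arbitrary choice.

```python
def volume_gb(rate_mb_per_s: float, seconds: float) -> float:
    """Volume in GB (assuming 1 GB = 1000 MB) produced at a steady rate."""
    return rate_mb_per_s * seconds / 1000.0


ONE_HOUR = 3600  # seconds

# Lower and upper ends of the 20 to 40 MB/s range quoted for the detector.
low = volume_gb(20, ONE_HOUR)
high = volume_gb(40, ONE_HOUR)
print(f"One hour of steady output: {low:.0f} to {high:.0f} GB")
```

At these rates a single hour of steady-state output is already tens of gigabytes, which is why the stream has to be cached and processed as it arrives.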

Electron microscopy
- This example concerns the remote control of electron microscopy based on image content.
- In situ electron microscopy experiments require dynamic interaction with the specimen under observation as it is excited with external stimuli.
- WANs do not offer realtime guarantees for data and command delivery, so remote control over a WAN is not practical for finely tuned adjustments.

Enabling remote control over a WAN
- Human interaction can easily be performed over a WAN.
- Dynamic control operations cannot: the control operation and the monitored response to the control or stimuli have to be coupled by low-latency communication, which is not possible on a WAN.
- Dynamic remote control applications therefore usually involve automated control operations performed near the instrument, which eliminates the WAN realtime delivery requirement.

In this section we describe some of the architectural and middleware approaches to implementing high-performance distributed instrument systems:
- A model data-intensive architecture
- Agent-based management
- Policy-based access control
The previous examples demonstrated the utility of a high-speed distributed cache. This cache-based approach provides a standard interface to a large, distributed storage system. Each data source deposits its data in the cache and each data consumer takes its data from the cache.
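
The deposit/take pattern described above can be sketched as follows. This is a minimal in-memory illustration, not the DPSS interface: the class `DataCache`, its methods `deposit` and `take`, and the `run_demo` helper are hypothetical names, and a bounded thread-safe queue stands in for the network cache.

```python
import queue
import threading


class DataCache:
    """Hypothetical stand-in for a network cache: sources deposit, consumers take."""

    def __init__(self, capacity: int):
        # A bounded queue makes a fast source wait when the cache is full.
        self._blocks = queue.Queue(maxsize=capacity)

    def deposit(self, block: bytes) -> None:
        self._blocks.put(block)

    def take(self) -> bytes:
        return self._blocks.get()


def run_demo(n_blocks: int = 5) -> list:
    cache = DataCache(capacity=2)
    received = []

    def instrument():
        # The data source only knows about the cache, not about the consumer.
        for i in range(n_blocks):
            cache.deposit(f"frame-{i}".encode())

    def analysis():
        # The consumer only knows about the cache, not about the instrument.
        for _ in range(n_blocks):
            received.append(cache.take())

    threads = [threading.Thread(target=instrument), threading.Thread(target=analysis)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return received


if __name__ == "__main__":
    print(run_demo())
```

The point of the design is decoupling: the instrument and the analysis code share only the cache interface, so either side can be replaced or relocated without changing the other.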

Network-based cache
- DPSS provides a highly distributed storage system that is usually used as a cache of data.
- DPSS is typically used to collect data from online instruments and supply that data to analysis applications.
- It provides high capacity and isolates applications from the tertiary storage system.
- It can be dynamically configured by aggregating workstations and disks from all over the network.

Agent-based management and monitoring
In a widely distributed system, by the time we observe that something has gone wrong, it is generally too late to react:
- because the needed information is no longer accessible
- because it would take too long to ask and answer all of the required questions
Agents not only provide standard access to comprehensive monitoring, they can also perform tasks such as keeping a state history.

Monitoring
- One successful monitoring methodology involves recording every event with a precision timestamp.
- This monitoring is designed to facilitate performance tuning, the characterization of distributed algorithms, and the management of functioning systems.
- When developing high-speed network-based distributed services, we often observe unexpectedly low network throughput or high latency, and the reason for the poor performance is not obvious.
- Precise and comprehensive monitoring is an invaluable tool for diagnosing such problems.
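
The idea of recording every event with a precision timestamp can be illustrated with a small sketch. The `EventLog` class, its method names, and the event names used below are assumptions for illustration and do not reproduce any particular monitoring toolkit. The clock is injectable so the behaviour can be checked deterministically; by default it uses `time.time_ns`.

```python
import time


class EventLog:
    """Hypothetical event recorder: every event gets a nanosecond timestamp."""

    def __init__(self, clock=time.time_ns):
        self._clock = clock
        self.events = []  # list of (timestamp_ns, component, event_name)

    def record(self, component: str, name: str) -> None:
        self.events.append((self._clock(), component, name))

    def elapsed_ns(self, start_name: str, end_name: str) -> int:
        """Time between the first occurrences of two named events."""
        start = next(t for t, _, n in self.events if n == start_name)
        end = next(t for t, _, n in self.events if n == end_name)
        return end - start


if __name__ == "__main__":
    log = EventLog()
    log.record("storage-server", "read.start")
    log.record("network", "send.start")
    log.record("storage-server", "read.end")
    print(log.elapsed_ns("read.start", "read.end"), "ns spent in read")
```

With timestamps on both ends of each operation, the time spent in each component can be compared directly, which is what makes an unexplained throughput drop diagnosable.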

Agent-based management of widely distributed systems
Agent-based management may be the key to keeping widely distributed systems running reliably. Initial experimentation with such agents in DPSS indicates several potential advantages:
- First, structured access to current and historical information.
- Second, reliability: the agent system not only keeps track of all components within the system, it also restarts any component that has crashed.
- Third, automatic reconfiguration: when new components are added, the agents do not have to be reconfigured.
- Fourth, information management.
- Fifth, user representation: agents can perform actions on behalf of a user (for example, when the data is not loaded in DPSS).
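
The first two advantages, keeping a state history and restarting crashed components, can be sketched in a few lines. This is a simplified single-process illustration: the `Component` and `Agent` classes and their methods are hypothetical, and a real agent would observe processes on remote hosts rather than Python objects.

```python
class Component:
    """Hypothetical managed component with a simple running/crashed state."""

    def __init__(self, name: str):
        self.name = name
        self.running = True

    def crash(self) -> None:
        self.running = False

    def restart(self) -> None:
        self.running = True


class Agent:
    """Hypothetical agent: tracks components, keeps history, restarts crashes."""

    def __init__(self):
        self.components = {}
        self.history = []  # list of (component_name, was_running)

    def add(self, component: Component) -> None:
        # New components are simply registered; no other reconfiguration needed.
        self.components[component.name] = component

    def check(self) -> list:
        """Record the state of every component and restart any that crashed."""
        restarted = []
        for name, component in self.components.items():
            self.history.append((name, component.running))
            if not component.running:
                component.restart()
                restarted.append(name)
        return restarted


if __name__ == "__main__":
    agent = Agent()
    disk = Component("disk-server-1")
    agent.add(disk)
    disk.crash()
    print("restarted:", agent.check())
```

Because the history is written on every check, the state that led up to a failure is still available afterwards, which addresses the problem of information that is no longer accessible once a fault is noticed.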

Policy-based access control
- The goal for access control in such distributed systems is to reflect the general principles that have been established in society for policy-based access control.
- A resource has multiple stakeholders, and each stakeholder imposes use conditions on the resource. All of the use conditions must be met simultaneously to satisfy the requirement for access.
- An approach that addresses these general goals can be based on authorization and attribute certificates.
- Users are permitted access to a resource based on their attributes.
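
The rule that every stakeholder's use conditions must be met simultaneously can be sketched as a set check. This is an illustration under simplifying assumptions: use conditions are modelled as sets of required attribute names, certificates and their verification are left out entirely, and the function names, stakeholder names, and attribute names are all hypothetical.

```python
def unmet_conditions(user_attributes: set, use_conditions: dict) -> list:
    """Return the stakeholders whose required attributes the user lacks.

    use_conditions maps a stakeholder name to the set of attributes
    that stakeholder requires for access to the resource.
    """
    return [
        stakeholder
        for stakeholder, required in use_conditions.items()
        if not required <= user_attributes
    ]


def access_allowed(user_attributes: set, use_conditions: dict) -> bool:
    """Access is granted only if every stakeholder's conditions are met."""
    return not unmet_conditions(user_attributes, use_conditions)


if __name__ == "__main__":
    # Hypothetical stakeholders and attributes for an instrument resource.
    conditions = {
        "instrument-owner": {"trained-operator"},
        "funding-agency": {"project-member"},
    }
    print(access_allowed({"trained-operator", "project-member"}, conditions))
    print(unmet_conditions({"trained-operator"}, conditions))
```

In a certificate-based system, the user's attribute set would be assembled from verified attribute certificates and each stakeholder's conditions from signed use-condition certificates before a check of this kind is made.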