MySQL and GRID Gabriele Carcassi STAR Collaboration 6 May 2002 - Proposal.

Slides:



Advertisements
Similar presentations
WP2: Data Management Gavin McCance University of Glasgow November 5, 2001.
Advertisements

Data Management Expert Panel - WP2. WP2 Overview.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 6 Managing and Administering DNS in Windows Server 2008.
Module 14: Implementing an Active Directory Infrastructure.
Software Delivery. Software Delivery Management  Managing Requirements and Changes  Managing Resources  Managing Configuration  Managing Defects 
Andrew McNab - EDG Access Control - 14 Jan 2003 EU DataGrid security with GSI and Globus Andrew McNab University of Manchester
CMS HLT production using Grid tools Flavia Donno (INFN Pisa) Claudio Grandi (INFN Bologna) Ivano Lippi (INFN Padova) Francesco Prelz (INFN Milano) Andrea.
Data Management for Physics Analysis in PHENIX (BNL, RHIC) Evaluation of Grid architecture components in PHENIX context Barbara Jacak, Roy Lacey, Saskia.
1 Web Content Delivery Reading: Section and COS 461: Computer Networks Spring 2007 (MW 1:30-2:50 in Friend 004) Ioannis Avramopoulos Instructor:
Printing Terminology. Requirements for Network Printing At least one computer to operate as the print server Sufficient RAM to process documents Sufficient.
11 MAINTAINING THE OPERATING SYSTEM Chapter 5. Chapter 5: MAINTAINING THE OPERATING SYSTEM2 CHAPTER OVERVIEW Understand the difference between service.
Event Viewer Was of getting to event viewer Go to –Start –Control Panel, –Administrative Tools –Event Viewer Go to –Start.
11 MAINTAINING THE OPERATING SYSTEM Chapter 5. Chapter 5: MAINTAINING THE OPERATING SYSTEM2 CHAPTER OVERVIEW  Understand the difference between service.
© 2012 IBM Corporation Tivoli Workload Automation Informatica Power Center.
70-294: MCSE Guide to Microsoft Windows Server 2003 Active Directory, Enhanced Chapter 7: Active Directory Replication.
Test Review. What is the main advantage to using shadow copies?
Microsoft Windows 2003 Server. Client/Server Environment Many client computers connect to a server.
Hands-On Microsoft Windows Server 2008 Chapter 5 Configuring, Managing, and Troubleshooting Resource Access.
XP New Perspectives on Microsoft Office Access 2003 Tutorial 12 1 Microsoft Office Access 2003 Tutorial 12 – Managing and Securing a Database.
Grid Data Management A network of computers forming prototype grids currently operate across Britain and the rest of the world, working on the data challenges.
Alexandre A. P. Suaide VI DOSAR workshop, São Paulo, 2005 STAR grid activities and São Paulo experience.
Objectives Configure routing in Windows Server 2008 Configure Routing and Remote Access Services in Windows Server 2008 Network Address Translation 1.
STAR scheduling future directions Gabriele Carcassi 9 September 2002.
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
September 18, 2002 Windows 2000 Server Active Directory By Jerry Haggard.
ITN Wake Tech1 ITN270 Advanced Internet Databases Lecture 15. General MySQL Administration Topics: –Securing a New MySQL Installation –MySQL Server.
Production Data Grids SRB - iRODS Storage Resource Broker Reagan W. Moore
IT 456 Seminar 5 Dr Jeffrey A Robinson. Overview of Course Week 1 – Introduction Week 2 – Installation of SQL and management Tools Week 3 - Creating and.
CHEP Sep Andrey PHENIX Job Submission/Monitoring in transition to the Grid Infrastructure Andrey Y. Shevel, Barbara Jacak,
Company Confidential 1 A Course on Global Catalog And Flexible Single Master Operations (Fsmo) Roles Prepared for: *Stars* New Horizons Certified Professional.
High Impact Technology February It’s Awesome!
Databases E. Leonardi, P. Valente. Conditions DB Conditions=Dynamic parameters non-event time-varying Conditions database (CondDB) General definition:
Event Data History David Adams BNL Atlas Software Week December 2001.
INFNGrid Constanza Project: Status Report A.Domenici, F.Donno, L.Iannone, G.Pucciani, H.Stockinger CNAF, 6 December 2004 WP3-WP5 FIRB meeting.
Grid User Management System Gabriele Carcassi HEPIX October 2004.
Chapter 1 Introduction to Databases. 1-2 Chapter Outline   Common uses of database systems   Meaning of basic terms   Database Applications  
A Brief Documentation.  Provides basic information about connection, server, and client.
© Wiley Inc All Rights Reserved. MCSE: Windows Server 2003 Active Directory Planning, Implementation, and Maintenance Study Guide, Second Edition.
Chapter 10 Chapter 10: Managing the Distributed File System, Disk Quotas, and Software Installation.
Database Role Activity. DB Role and Privileges Worksheet.
DGC Paris WP2 Summary of Discussions and Plans Peter Z. Kunszt And the WP2 team.
Trusted Virtual Machine Images a step towards Cloud Computing for HEP? Tony Cass on behalf of the HEPiX Virtualisation Working Group October 19 th 2010.
PHENIX and the data grid >400 collaborators 3 continents + Israel +Brazil 100’s of TB of data per year Complex data with multiple disparate physics goals.
MySQL and GRID status Gabriele Carcassi 9 September 2002.
IT System Administration Lesson 3 Dr Jeffrey A Robinson.
OVERVIEW OF ACTIVE DIRECTORY
Linux Operations and Administration
Introduction to Active Directory
SQL Server 2005 Implementation and Maintenance Chapter 6: Security and SQL Server 2005.
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
EMTTS UAT Day1 & Day2 Powered by:. Topics CoversTopics Remaining Comparison Network Infrastructure Separate EP Hosting Fault Tolerance.
Lecture On Introduction (DBMS) By- Jesmin Akhter Assistant Professor, IIT, Jahangirnagar University.
11 GLOBAL CATALOG AND FLEXIBLE SINGLE MASTER OPERATIONS (FSMO) ROLES Chapter 4.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Mario Reale – GARR NetJobs: Network Monitoring Using Grid Jobs.
Log Shipping, Mirroring, Replication and Clustering Which should I use? That depends on a few questions we must ask the user. We will go over these questions.
 Project Team: Suzana Vaserman David Fleish Moran Zafir Tzvika Stein  Academic adviser: Dr. Mayer Goldberg  Technical adviser: Mr. Guy Wiener.
DB Questions and Answers open session (comments during session) WLCG Collaboration Workshop, CERN Geneva, 24 of April 2008.
7.5 Using Stored-Procedure and Triggers NAME MATRIC NUM GROUP Muhammad Azwan Bin Khairul Anwar CS2305A Muhammad Faiz Bin Badrol Shah CS2305B.
Calgary Oracle User Group
Database Replication and Monitoring
Replica/File Catalog based on RAM SUNY
U.S. ATLAS Grid Production Experience
Data Management and Database Framework for the MICE experiment
THE STEPS TO MANAGE THE GRID
Global Catalog and Flexible Single Master Operations (FSMO) Roles
DEPLOYING SECURITY CONFIGURATION
PLANNING A SECURE BASELINE INSTALLATION
Status and plans for bookkeeping system and production tools
CNT 4603: System Administration Fall 2010
Presentation transcript:

MySQL and GRID Gabriele Carcassi STAR Collaboration 6 May Proposal

Why?  STAR uses MySQL to keep track of data files (file catalog)  There are already many projects concerning file catalogs  MAGDA already uses MySQL to store file catalog information  We will not concentrate on this aspect

Why?  STAR reconstruction jobs need access to both data files and databases Job DB Raw data file Reconstructed data file Calibration constants Detector geometry Detector readings

Database replication  DB is being replicated through MySQL replication  Master/slave approach Master DB Slave DB Slave DB Slave DB write access update forwarded read access

GRID and DB replicas  Database replication is essential because:  Increase data availability  Allows locations to run jobs independently (in case of network congestion)  Correction to the DB are available to all locations  In order to successfully execute jobs in a GRID environment, we need database replication to be somehow integrated

Steps 1.Aid DB administrator to manage complexity of database replication 2.Tools to install a mirror using GRID technology 3.Database catalogs and integration with file catalogs 4.Integrate GRID authentication

1. Manage complexity  The more server you have, the more complicated is to manage the system  Build a GUI that helps the database administrator (DBA) to have a general picture

1. Manage complexity  Keep track of all the servers, and the relationship between each other.  Help manage more sophisticated network topologies, for example: The arrows shows the direction of the updates NB: This is an example, not STAR topology. BNLLBL Master BNL slaves Master mirrorSlaves Less traffic over WAN

1. Manage complexity  Configurations  Aid the comparison between server characteristics (i.e. OS version, MySQL version)  Aid the comparison of the settings of the different database servers  User management  Create and delete users on a group of servers

1. Manage complexity  Consistency checks  Compare row counts of different replicas  Compare master log pointer with slaves  Check slave connections to the master  Evaluate the replication  Monitor CPU and network activity of the servers to help decide if the current number of servers is sufficient

2. Creating a new mirror  MySQL keeps the slave synchronized, but you have to manually copy the db files during slave initialization  When the DB is already in place, you might have to copy each file by itself, since the total might exceed 2GB  GRID technologies can be used to transfer the first copy of the database from site to site

2. Creating a new mirror  New mirror  Steps for mirror creation 1. Install MySQL (manual?) 2. Copy database files (through GRIDFtp) 3. Configure master and slave  Creating/deleting a database on an existing mirror

2. Creating a new mirror  Ease of use  Integration with the previous GUI  Hide as much details as possible and encourage good configuration policies (i.e. create a user with suitable permissions to be used by the slave connection)

3. Catalogs  Jobs need to know to which database server to connect (for now, this is done by XML files)  In a GRID environment, jobs will contact a file catalog to determine the location of the files to be used  The scheme for the database catalog shouldn’t be different, and it should be as connected as possible to the file catalog

3. Catalogs  Database catalog  A job should query the catalog to know to which database server to connect  If possible, file catalogs could be used directly. For example, instead of a physical file location, the catalog could return the parameters for the database connection (i.e. “mydb.star.bnl.gov:3301”)  If no connection can be established, the job might ask the catalog for an alternative server

3. Catalogs  Administration  The DBA should be able to establish the policies that decide to which server a given job should connect (i.e. IP address, user groups)  Integration with the GUI  The GUI would use the catalog to keep track of the different servers  The GUI would help the DBA to assign policies, and check that the catalog information is not corrupted

4. Authentication  GRID authentication should be integrated to the whole scheme  Connection between database servers  Connection to the catalog  Connection job-database server

4. Authentication  MySQL 4.0 will integrate SSL for both user connections and between server connections  MySQL can be instructed to accept only certificate authentication  MySQL 4.0 is still in alpha and SSL is not GSI, but:  Some issues should be already addressed  A patch to enable GSI on SSH is available and might provide further insights

4. Authentication  Integration with the GUI  Just one authentication at the beginning of the application  Same authentication used to authenticate to the servers  User management