1 By: Solomon Mikael (UMBC) Advisors: Elena Vataga (UNM) & Pavel Murat (FNAL) Development of Farm Monitoring & Remote Concatenation for CDFII Production.

Slides:



Advertisements
Similar presentations
Physics with SAM-Grid Stefan Stonjek University of Oxford 6 th GridPP Meeting 30 th January 2003 Coseners House.
Advertisements

Operating System.
Amber Boehnlein, FNAL D0 Computing Model and Plans Amber Boehnlein D0 Financial Committee November 18, 2002.
Essential Introduction to Computers. What is a Computer? An electronic device, operating under the control of instructions stored in its own memory, that.
Business Technology Applications Computer Basics.
1 Computing for Todays Lecture 22 Yumei Huo Fall 2006.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 8: Implementing and Managing Printers.
Guide To UNIX Using Linux Third Edition
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 8: Implementing and Managing Printers.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 8: Implementing and Managing Printers.
Cambodia-India Entrepreneurship Development Centre - : :.... :-:-
Computer Skills Preparatory Year Presented by: L.Obead Alhadreti.
Chapter 3 Software Two major types of software
Website Development for the Leave/Travel System and for Interfacing a Database PresentationBy Linda M’mayi Fort Valley State University.
Acceleratio Ltd. is a software development company based in Zagreb, Croatia, founded in We create innovative software solutions for SharePoint,
Advanced File Processing
Tutorial 10 Adding Spry Elements and Database Functionality Dreamweaver CS3 Tutorial 101.
CHAPTER 2 OPERATING SYSTEM OVERVIEW 1. Operating System Operating System Definition A program that controls the execution of application programs and.
Operating System. Architecture of Computer System Hardware Operating System (OS) Programming Language (e.g. PASCAL) Application Programs (e.g. WORD, EXCEL)
Shuei MEG review meeting, 2 July MEG Software Status MEG Software Group Framework Large Prototype software updates Database ROME Monte Carlo.
Operating Systems CS3502 Fall 2014 Dr. Jose M. Garrido
4 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Chapter Four UNIX File Processing. 2 Lesson A Extracting Information from Files.
Guide To UNIX Using Linux Fourth Edition
The SAMGrid Data Handling System Outline:  What Is SAMGrid?  Use Cases for SAMGrid in Run II Experiments  Current Operational Load  Stress Testing.
Remote Production and Regional Analysis Centers Iain Bertram 24 May 2002 Draft 1 Lancaster University.
Microsoft Office 2007 Essential Introduction to Computers.
Central Reconstruction System on the RHIC Linux Farm in Brookhaven Laboratory HEPIX - BNL October 19, 2004 Tomasz Wlodek - BNL.
CDF data production models 1 Data production models for the CDF experiment S. Hou for the CDF data production team.
November 7, 2001Dutch Datagrid SARA 1 DØ Monte Carlo Challenge A HEP Application.
Building a distributed software environment for CDF within the ESLEA framework V. Bartsch, M. Lancaster University College London.
Farm Management D. Andreotti 1), A. Crescente 2), A. Dorigo 2), F. Galeazzi 2), M. Marzolla 3), M. Morandin 2), F.
3rd Nov 2000HEPiX/HEPNT CDF-UK MINI-GRID Ian McArthur Oxford University, Physics Department
Module 10: Monitoring ISA Server Overview Monitoring Overview Configuring Alerts Configuring Session Monitoring Configuring Logging Configuring.
3rd June 2004 CDF Grid SAM:Metadata and Middleware Components Mòrag Burgon-Lyon University of Glasgow.
CHEP 2003Stefan Stonjek1 Physics with SAM-Grid Stefan Stonjek University of Oxford CHEP th March 2003 San Diego.
A Design for KCAF for CDF Experiment Kihyeon Cho (CHEP, Kyungpook National University) and Jysoo Lee (KISTI, Supercomputing Center) The International Workshop.
Interactive Job Monitor: CafMon kill CafMon tail CafMon dir CafMon log CafMon top CafMon ps LcgCAF: CDF submission portal to LCG resources Francesco Delli.
Advanced File Processing. 2 Objectives Use the pipe operator to redirect the output of one command to another command Use the grep command to search for.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 3: Operating-System Structures System Components Operating System Services.
Chapter Five Advanced File Processing Guide To UNIX Using Linux Fourth Edition Chapter 5 Unix (34 slides)1 CTEC 110.
An application architecture specifies the technologies to be used to implement one or more (and possibly all) information systems in terms of DATA, PROCESS,
1 Welcome to CSC 301 Web Programming Charles Frank.
International Workshop on HEP Data Grid Nov 9, 2002, KNU Data Storage, Network, Handling, and Clustering in CDF Korea group Intae Yu*, Junghyun Kim, Ilsung.
6/26/01High Throughput Linux Clustering at Fermilab--S. Timm 1 High Throughput Linux Clustering at Fermilab Steven C. Timm--Fermilab.
Grid Computing at Yahoo! Sameer Paranjpye Mahadev Konar Yahoo!
Outline: Tasks and Goals The analysis (physics) Resources Needed (Tier1) A. Sidoti INFN Pisa.
HIGUCHI Takeo Department of Physics, Faulty of Science, University of Tokyo Representing dBASF Development Team BELLE/CHEP20001 Distributed BELLE Analysis.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
UTA MC Production Farm & Grid Computing Activities Jae Yu UT Arlington DØRACE Workshop Feb. 12, 2002 UTA DØMC Farm MCFARM Job control and packaging software.
Outline: Status: Report after one month of Plans for the future (Preparing Summer -Fall 2003) (CNAF): Update A. Sidoti, INFN Pisa and.
DCAF(DeCentralized Analysis Farm) for CDF experiments HAN DaeHee*, KWON Kihwan, OH Youngdo, CHO Kihyeon, KONG Dae Jung, KIM Minsuk, KIM Jieun, MIAN shabeer,
International Workshop on HEP Data Grid Aug 23, 2003, KNU Status of Data Storage, Network, Clustering in SKKU CDF group Intae Yu*, Joong Seok Chae Department.
Chapter 1 Basic Concepts of Operating Systems Introduction Software A program is a sequence of instructions that enables the computer to carry.
A computer contains two major sets of tools, software and hardware. Software is generally divided into Systems software and Applications software. Systems.
Hans Wenzel CDF CAF meeting October 18 th -19 th CMS Computing at FNAL Hans Wenzel Fermilab  Introduction  CMS: What's on the floor, How we got.
CDF SAM Deployment Status Doug Benjamin Duke University (for the CDF Data Handling Group)
XP Creating Web Pages with Microsoft Office
Advanced Computing Facility Introduction
Compute and Storage For the Farm at Jlab
WP18, High-speed data recording Krzysztof Wrona, European XFEL
2. OPERATING SYSTEM 2.1 Operating System Function
Operating System.
Chapter 2: System Structures
Introduction to Operating System (OS)
IS3440 Linux Security Unit 9 Linux System Logging and Monitoring
Guide To UNIX Using Linux Third Edition
Chapter Four UNIX File Processing.
Chapter 2: Operating-System Structures
Chapter 2: Operating-System Structures
Presentation transcript:

1 By: Solomon Mikael (UMBC) Advisors: Elena Vataga (UNM) & Pavel Murat (FNAL) Development of Farm Monitoring & Remote Concatenation for CDFII Production Project

08/08/2006Solomon Mikael 2 Outline CDF Experiment CDF Production Farm Goals & Structure Issues with Concatenation My Contributions 1 Control & Monitoring My Contributions 2 Summary Acknowledgments

08/08/2006Solomon Mikael 3 Goal of CDF Prodution Farm ● The main goal of the Production Farm is the reconstruction of available data for physics analysis as soon as possible, reprocess data when necessary, and generate Monte Carlo events

08/08/2006Solomon Mikael 4 CDF Experiment The CDF ( collider detector experiment at FermiLab) is an international collaboration involving many universities and national laboratories The CDF ( collider detector experiment at FermiLab) is an international collaboration involving many universities and national laboratories 2 intense beams of protons and anti-protons meet head on in the middle of the 100 ton solenoidal CDF detector 2 intense beams of protons and anti-protons meet head on in the middle of the 100 ton solenoidal CDF detector In order to observe the particles there are layers of subdetectors in the CDF detector each layer responsible for the detection of a different particles properties. In order to observe the particles there are layers of subdetectors in the CDF detector each layer responsible for the detection of a different particles properties. Information from electronic channels is recorded Information from electronic channels is recorded

08/08/2006Solomon Mikael 5 Production Farm - Hardware the Production Farm consists of 150 dual CPU PC's with a total computing power of 800 GHz The high throughput Linux clusters are used for event reconstruction and analysis The production farm PC have a total of 25 TB of disk space

08/08/2006Solomon Mikael 6 ● the CDF production farm performs computing and network intensive tasks in a cost effective manner Production Farm - Software ● SAM (sequential data access via metadata) a data handeling system organized as set of servers working together to store and retrieve files. SAM mitigates the problem of one person hogging the tape drives and/or flooding the tape system. SAM provides tools for the database bookkeeping. ● CAF (CDF Analysis Farm) is software and control systems for batch job submission on top of Condor batch system.

08/08/2006Solomon Mikael 7 Stucture of CDF Farm concatenation

08/08/2006Solomon Mikael 8 ● In the present scheme concatenator and tape uploader are running at the same time resulting in limited I/O from the stager. ● Disk access rate depends on the number of simultaneous I/O operations from the disk RAID 5. ● My project at CDF entailed removing the load of concatenation from the stagers to CAF to achieve higher data flow rates. Issues with Concatenation tape transfer rate (MB/s) Entries tape transfer rate (MB/s)

08/08/2006Solomon Mikael 9 Structure Stager mergeSubmit.py 1.Analyze input directory 2.Creates.tcl 3.Send CAF job Worker 1.Script copies input files 2.Run binary code for concatenator 3.Copy output file to stager Monitoring

08/08/2006Solomon Mikael 10 Required Skills Before I could make any changes to the CDF farm it was imperative I learned how the individual parts of the farm operated and how they are interrelated: Effective use of bash scripts & awk text editor Learning python to modifying the concatenating script MergeSubmit.py Modifying Tikiwiki pages using the online Tikiwiki editor & web pages

08/08/2006Solomon Mikael 11 What’s BASH & AWK Shell is a program which interprets commands, either typed in directly by the user or contained in a file called a shell script. Awk named after its developers ( Aho, Weinberger, and Kernighan ) is a programming language which permits manipulation of structured data and generation of formatted reports. A pattern scanning and processing language

08/08/2006Solomon Mikael 12 the_num=2; export JOB_NUM=3 echo $SEGMENT_NUMBER echo $JOB_NUM temp.awk -v seg_num=$SEGMENT_NUMBER #!/bin/awk BEGIN { flag = 0; } /SEGMENT_NUMBER/ { if ($5 == seg) { flag = 1 } } /include/ { if (flag == 1) { print $0 } } { if ($1 == "}") { flag = 0 } # if (($4 == "==") && (ENVIRON["SEGMENT_NUMBER"] == $5)) {print $0} } #!/bin/bash name=`basename $0`../cdfopr/scripts/common_procedures../cdfopr/scripts/parse_parameters $* > temp_parse_log echo $TCL_FILE echo $PARAM_USER echo $PARAM_HOST echo $PARAM_PATH cmd="fcp -c ${RCP} E}."; $cmd STATUS=$?; if[ STATUS -ne 0 ];then echo "$TCL_FILE was not able to be copied" exit 1; fi for file_loc in `grep "include file /" ${TCL_FILE} | awk '{print $3}'`;do cmd="fcp -c ${RCP} "; $cmd STATUS=$?; if [ STATUS -ne 0 ];then echo "$file_loc was not able to be copide fi done

08/08/2006Solomon Mikael 13 if { $env(SEGMENT_NUMBER) == 1 } { # # OutputDir = "/export/data1/cdfmc/concatTest/Monte_Carlo_Test1/mergeLogs/hphysr_0y_01/tmp" ; # total size: # set DATASET xbck0y include file /export/data1/cdfmc/concatTest/Monte_Carlo_Test1/xbck0y/reco.xy0339c5.0284bck0 include file /export/data1/cdfmc/concatTest/Monte_Carlo_Test1/xbck0y/reco.xy0339c5.028ebck0 include file /export/data1/cdfmc/concatTest/Monte_Carlo_Test1/xbck0y/reco.xy0339c5.0298bck0 include file /export/data1/cdfmc/concatTest/Monte_Carlo_Test1/xbck0y/reco.xy0339c5.02a2bck0 include file /export/data1/cdfmc/concatTest/Monte_Carlo_Test1/xbck0y/reco.xy0339c5.02acbck0 } if { $env(SEGMENT_NUMBER) == 2 } { # # OutputDir = "/export/data1/cdfmc/concatTest/Monte_Carlo_Test1/mergeLogs/hphysr_0y_01/tmp" ; # total size: # set DATASET xbhd0y include file /export/data1/cdfmc/concatTest/Monte_Carlo_Test1/xbhd0y/reco.xy0339c5.027abhd0 include file /export/data1/cdfmc/concatTest/Monte_Carlo_Test1/xbhd0y/reco.xy0339c5.0284bhd0 include file /export/data1/cdfmc/concatTest/Monte_Carlo_Test1/xbhd0y/reco.xy0339c5.028ebhd0 include file /export/data1/cdfmc/concatTest/Monte_Carlo_Test1/xbhd0y/reco.xy0339c5.0298bhd0 } Example.tcl File

08/08/2006Solomon Mikael 14 Control & Monitoring ● Tikiwiki software is used for web based documentation ● The tiki database keeps a history of all changes to the Farm Projects ● Tiki pages enable users to: ● keep track of all existing projects ● Start or stop a project ● Change resource sharing between projects ● Redirect output to another stager ● Forward execution to CAF without having to connect to the main server ● Python’s extensive support for XML, , RSS feeds and many other Internet protocols make it effective for developing custom web solutions

08/08/2006Solomon Mikael 15 Monitoring – Web Page Interface ● to ensure CDF production farm runs smoothly the hardware performance including status reporting must be monitered ● this is done using the production farm web interface (PFWI) ● PFWI parses, calculates, and displays all major characteristics of the farm with online results

08/08/2006Solomon Mikael 16 Tikiwiki Contributions In this page it shows the disk space on the 32 partitions on the different servers p = string.find(output, '/export/data4')stringfindoutput usage = string.strip(output[p-24:p])usagestringstripoutput fp.write(""" fncdfsrv5 %20s /export/data4 """ % usage)fpwriteusage fp.write("\n")fpwrite percentage = string.strip(output[p-5:p-2])percentagestringstripoutput if string.atoi(percentage) > 90 : IsFull=1stringatoipercentage1 Edited pythons script df_disk.py

08/08/2006Solomon Mikael 17 Tiki Editor Using the online tiki editor modifications were made to improve the functionality of the ProjectConfiguration page

08/08/2006Solomon Mikael 18 Summary In these weeks: Implemented improvements to Production Farm monitoring. Participated in development of remote concatenation. Acquired new skills Learned about the physics inside the FermiLab laboratory

08/08/2006Solomon Mikael 19 Acknowledgements SIST committee for giving me this opportunity Elena Vataga & Pavel Murat Ms. Engram & Dr. Elliott McCrory Dr. Davenport & Jamieson Olsen

08/08/2006Solomon Mikael 20 BACKUP

08/08/2006Solomon Mikael 21 History of Production Farm ● Fermilab has used clusters of processors to provide large computing power with dedicated processors like the Motoroloa ● CDF Run 2 data was processed using the first developed Farm Processing System (FPS) using FBSNG batch system (1) ● Farm Processing System was the software that managed, controlled, and monitered the CDF production farm from

08/08/2006Solomon Mikael 22 Monitoring ● PARSING - this layer access MySQL or CAF output files and after processing text and performing calculations the data is fed to cache layer ● CACHE – this layer does statistical preprocessing and has an interface to easily visualize the data. The data is then stored. (1) ● WEB – displays all the information collected by the parseres and gathers data not needing pre-processing. -- Uses PHP4 to generaet the web pages. ● Python tiki