Building and Testing using Condor

Slides:



Advertisements
Similar presentations
Requirements for a UI Test Framework Stanislaw Wozniak Bernie Miles.
Advertisements

Private and Confidential ThinkControl & ProLiant Essentials Rapid Deployment Pack.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
The Open Grid Service Architecture (OGSA) Standard for Grid Computing Prepared by: Haoliang Robin Yu.
Derek Wright Computer Sciences Department, UW-Madison Lawrence Berkeley National Labs (LBNL)
NSDI/NBII Clearinghouse Server Training Slide 1 NSDI/NBII Clearinghouse Server Training Yellowstone to Yukon Initiative 7. December University of.
Computer Programming My Home Page My Paper Job Description Computer programmers write, test, and maintain the detailed instructions, called programs,
Linux Operations and Administration
Todd Tannenbaum Computer Sciences Department University of Wisconsin-Madison What’s New in Condor.
Open Science Grid Software Stack, Virtual Data Toolkit and Interoperability Activities D. Olson, LBNL for the OSG International.
Using the WDK for Windows Logo and Signature Testing Craig Rowland Program Manager Windows Driver Kits Microsoft Corporation.
OGCE Building and Testing on NMI Slides by: Rishi Verma.
Terry Henry IS System Manager, SharePoint SME Micron Technology Inc.
Copyright © 2011 EMC Corporation. All Rights Reserved. MODULE – 6 VIRTUALIZED DATA CENTER – DESKTOP AND APPLICATION 1.
Apache Web Server v. 2.2 Reference Manual Chapter 1 Compiling and Installing.
ETICS All Hands meeting Bologna, October 23-25, 2006 NMI and Condor: Status + Future Plans Andy PAVLO Peter COUVARES Becky GIETZEL.
CE Operating Systems Lecture 3 Overview of OS functions and structure.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Experiment Management System CSE 423 Aaron Kloc Jordan Harstad Robert Sorensen Robert Trevino Nicolas Tjioe Status Report Presentation Industry Mentor:
Testing Grid Software on the Grid Steven Newhouse Deputy Director.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Application Programming in Cloud via Swift Swift Tutorial, CCGrid 2013, Hour 2 Ketan Maheshwari.
11 Computers, C#, XNA, and You Session 1.1. Session Overview  Find out what computers are all about ...and what makes a great programmer  Discover.
Condor Project Computer Sciences Department University of Wisconsin-Madison Running Interpreted Jobs.
Jaime Frey Computer Sciences Department University of Wisconsin-Madison Condor and Virtual Machines.
Microsoft ® Official Course Module 6 Managing Software Distribution and Deployment by Using Packages and Programs.
 OpenBSD runs on many different hardware platforms.  OpenBSD is thought of by many security professionals to be the most secure UNIX-like operating system.
Open Science Grid Configuring RSV OSG Resource & Service Validation Thomas Wang Grid Operations Center (OSG-GOC) Indiana University.
 1- Definition  2- Helpdesk  3- Asset management  4- Analytics  5- Tools.
GIS IN THE CLOUD Cloud computing furnishes scalable GIS technology that is maintained off premises and delivered on demand as services via the Internet.
1 Chapter 1 INTRODUCTION TO WEB. 2 Objectives In this chapter, you will: Become familiar with the architecture of the World Wide Web Learn about communication.
Build and Test system for FairRoot
CMS DCS: WinCC OA Installation Strategy
Virtualisation for NA49/NA61
Dag Toppe Larsen UiB/CERN CERN,
ONYX 12.2.
Progress on NA61/NA49 software virtualisation Dag Toppe Larsen Wrocław
Dag Toppe Larsen UiB/CERN CERN,
Overview – SOE PatchTT November 2015.
The Open Grid Service Architecture (OGSA) Standard for Grid Computing
Work report Xianghu Zhao Nov 11, 2014.
Overview – SOE PatchTT December 2013.
Virtualisation for NA49/NA61
Operating Systems Overview
5.0 : Windows Operating System
GLAST Release Manager Automated code compilation via the Release Manager Navid Golpayegani, GSFC/SSAI Overview The Release Manager is a program responsible.
The Client/Server Database Environment
The ETICS Build and Test Service
Introduction to Operating System (OS)
LQCD Computing Operations
Building Grids with Condor
An easier path? Customizing a “Global Solution”
PHP / MySQL Introduction
Introduction of Week 3 Assignment Discussion
Using the Parallel Universe beyond MPI
Managing Clouds with VMM
Haiyan Meng and Douglas Thain
Semiconductor Manufacturing (and other stuff) with Condor
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Chapter 2: System Structures
An introduction to the Linux environment v
Overview of Workflows: Why Use Them?
Instructor: Xiuwen Liu Department of Computer Science
UFCEUS-20-2 Web Programming
Salesforce.com Salesforce.com is the world leader in on-demand customer relationship management (CRM) services Manages sales, marketing, customer service,
OU BATTLECARD: Oracle Linux Training and Certification
OU BATTLECARD: Oracle Systems Learning Subscription
Basic Setup Internet Firewall Master 7 Nodes Gigabit switch
Infokall Enterprise Solutions
WeBWork and Open Educational Resources (OER)
Presentation transcript:

Building and Testing using Condor

Overview Goals The Condor build and test process Transition to new NMI framework Future plans

Goals Automated nightly builds and tests of development and stable branches Ability to submit on-demand build and test runs View results of individual runs and overall status.

The build and test process Builds and tests are submitted as Condor jobs to desired platforms. For testing, the previously built Condor binaries are installed as a Personal Condor. Tests are compiled and submitted to the Personal Condor to run. Condor returns the results when everything has finished. We use Condor to build and test Condor.

Great results This system has worked great for us for some time. Automated nightly build and test runs. Condor works great to run our builds and tests. We’ve learned from our experience and found some things we could improve on. Perl example /usr/local vs. /usr/local/bin - which version and modules where. Add other software that is PATH dependent and no way to tell which version is used. Gcc - solaris vs. linux

Some lessons learned Pool machines used for Condor and other project builds and tests. Disk space and partitioning not well defined. Not readily apparent which versions of software exist on a system or where to find it. Perl example /usr/local vs. /usr/local/bin - which version and modules where. Add other software that is PATH dependent and no way to tell which version is used. Gcc - solaris vs. linux

Old framework problems Builds and tests tightly coupled to the framework. Difficult to make changes to scripts without undesirable side effects. We created a new build and test system to solve these problems. Perl example /usr/local vs. /usr/local/bin - which version and modules where. Add other software that is PATH dependent and no way to tell which version is used.

The new NMI framework Development funded by NMI - NSF Middleware Initiative - which includes groups such as the GRIDS Center NMI project uses Condor and the pool machines for building and testing software releases. NMI project benefits from new framework New system came out of our collaboration with NMI project.

What does the new NMI framework provide? The same generic framework is used for both builds and tests. “Glue” scripts are used to connect the builds and tests with the framework. Build and test scripts are simpler to write. Inputs are well-defined

Example submit file inputs = input_build_runid.src, condor_test.src pre_all = nmi_glue/test/pre_all remote_declare = nmi_glue/test/remote_declare remote_declare_args = all remote_pre = nmi_glue/test/remote_pre remote_task = nmi_glue/test/remote_task remote_post = nmi_glue/test/remote_post prereqs = perl-5.8.5, tar-1.14, patch-2.5.4, m4-1.4.1, make-3.80, gzip-1.2.4 platforms = x86_rh_9,ia64_sles_8 Flexible inputs - source tarball, build id, ftp site

Workflow NMI Pool x86_rh_9 ia64_sles_8 disk archive database submit file NMI Pool x86_rh_9 ia64_sles_8 Flexible inputs - source tarball, build id, ftp site disk archive database

Condor testing benefits Condor test runs are now submitted per platform as the builds complete. Condor tests for specific features or long-running tests may be run separately from the nightly runs. Ability to pass arguments to tests at submit time. Enhanced data storage of test results. Testclasses example - define group or “class” of tests in Condor source code. Group name is passed to source via glue scripts to run the correct tests. This allows flexibility in test options without directly modifying test scripts. no need to wait for a build to finish on all platforms before starting tests.

New NMI pool management Machines have minimum hardware and disk space requirements. Any software installed or settings modified after the OS install are managed with cfengine. Hawkeye used to advertise all of the software and versions installed in the machine ClassAds. Condor jobs specify which software and version they require in the submit file, and are matched to the correct machine. Hardware and disk requirements are to ensure enough disk space and memory is available for Condor builds and tests. Cfengine - configuration agent - high level policy language to administrate and configure computer systems. Cfengine, or the configuration engine is an agent/software robot and a high level policy language for building expert systems to administrate and configure large computer networks. Cfengine uses the idea of classes and a primitive intelligence to define and automate the configuration and maintenance of system state, for small to huge configurations. Cfengine is designed to be a part of a computer immune system, and can be thought of as a gaming agent.

The build and test pools The old pool The new pool Old pool=14 machines New pool=21 machines and growing

Future plans Collaborators build and test software using our framework/pool Multi-machine testing capabilities using the Parallel Universe Automated Condor-G tests External people already using pool for builds and tests. NMI, Globus, etc.

More information Attend BOF NMI session on Wednesday for more details on the NMI pool setup, framework and glue scripts.