MSc. Miriel Martín Mesa, DIC, UCLV. The idea Installing a High Performance Cluster in the UCLV, using professional servers with open source operating.

Slides:



Advertisements
Similar presentations
INSTALLING LINUX.  Identify the proper Hardware  Methods for installing Linux  Determine a purpose for the Linux Machine  Linux File Systems  Linux.
Advertisements

Planning Server Deployments
Module 1: Installing Windows XP Professional
Workshop: Using the VIC3 Cluster for Statistical Analyses Support perspective G.J. Bex.
Managing Your Network Environment © 2004 Cisco Systems, Inc. All rights reserved. Managing Cisco IOS Devices INTRO v2.0—9-1.
Leveraging WinPE and Linux Preboot for Effective Provisioning Jonathan Richey | Director of Development | Altiris, Inc.
ITE PC v4.0 Chapter 1 1 Operating Systems Computer Networks– 2.
Hands-On Microsoft Windows Server 2003 Administration Chapter 10 Monitoring and Troubleshooting Windows Server 2003.
Lesson 15 – INSTALL AND SET UP NETWARE 5.1. Understanding NetWare 5.1 Preparing for installation Installing NetWare 5.1 Configuring NetWare 5.1 client.
1.1 Installing Windows Server 2008 Windows Server 2008 Editions Windows Server 2008 Installation Requirements X64 Installation Considerations Preparing.
Using the P4-Xeon cluster HPCCC, Science Faculty, HKBU Usage Seminar for the 64-nodes P4-Xeon Cluster in Science Faculty March 24, 2004.
Quick Tutorial on MPICH for NIC-Cluster CS 387 Class Notes.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 8: Implementing and Managing Printers.
Chapter 1 Introducing Windows Server 2012/R2
IFIN-HH LHCB GRID Activities Eduard Pauna Radu Stoica.
Welcome Course 20410B Module 0: Introduction Audience
Windows Server 2003 Windows Server Family Products Windows Server 2003 Web Edition Windows Server 2003 Standard Edition Windows Server 2003 Enterprise.
16.1 © 2004 Pearson Education, Inc. Exam Managing and Maintaining a Microsoft® Windows® Server 2003 Environment Lesson 16: Examining Software Update.
70-293: MCSE Guide to Planning a Microsoft Windows Server 2003 Network, Enhanced Chapter 14: Problem Recovery.
Utilizing Condor and HTC to address archiving online courses at Clemson on a weekly basis Sam Hoover 1 Project Blackbird Computing,
Windows Server MIS 424 Professor Sandvig. Overview Role of servers Performance Requirements Server Hardware Software Windows Server IIS.
Deploying and Managing Windows Server 2012
Welcome Thank you for taking our training. Collection 6421: Configure and Troubleshoot Windows Server® 2008 Network Course 6690 – 6709 at
Abstract Load balancing in the cloud computing environment has an important impact on the performance. Good load balancing makes cloud computing more.
Introduction to HP LoadRunner Getting Familiar with LoadRunner >>>>>>>>>>>>>>>>>>>>>>
Operating Systems Networking for Home and Small Businesses – Chapter 2 – Introduction To Networking.
Hands-On Microsoft Windows Server 2008
Guide to Linux Installation and Administration, 2e1 Chapter 3 Installing Linux.
ISG We build general capability Introduction to Olympus Shawn T. Brown, PhD ISG MISSION 2.0 Lead Director of Public Health Applications Pittsburgh Supercomputing.
VIPBG LINUX CLUSTER By Helen Wang March 29th, 2013.
By Rashid Khan Lesson 10-From Here to There: Remote Installation of the Windows XP Professional Client.
Module 1: Installing and Configuring Servers. Module Overview Installing Windows Server 2008 Managing Server Roles and Features Overview of the Server.
Chapter 3 Installing Windows XP Professional. Preparing for installation Pre-installation requirement; ◦ Hardware requirements ◦ Hardware compatibility.
Module 1: Installing Microsoft Windows XP Professional.
Week #3 Objectives Partition Disks in Windows® 7 Manage Disk Volumes Maintain Disks in Windows 7 Install and Configure Device Drivers.
CHAPTER 2. Overview 1. Pre-Installation Tasks 2. Installing and Configuring Linux 3. X Server 4. Post Installation Configuration and Tasks.
Weekly Report By: Devin Trejo Week of June 14, 2015-> June 20, 2015.
CCNA4 v3 Module 6 v3 CCNA 4 Module 6 JEOPARDY K. Martin.
Microsoft Windows XP Professional MCSE Exam
Chapter 9: Networking with Unix and Linux. Objectives: Describe the origins and history of the UNIX operating system Identify similarities and differences.
ISG We build general capability Introduction to Olympus Shawn T. Brown, PhD ISG MISSION 2.0 Lead Director of Public Health Applications Pittsburgh Supercomputing.
2: Operating Systems Networking for Home & Small Business.
Automating Installations by Using the Microsoft Windows 2000 Setup Manager Create setup scripts simply and easily. Create and modify answer files and UDFs.
Running clusters on a Shoestring Fermilab SC 2007.
Wouter Verkerke, NIKHEF 1 Using ‘stoomboot’ for NIKHEF-ATLAS batch computing What is ‘stoomboot’ – Hardware –16 machines, each 2x quad-core Pentium = 128.
Active-HDL Server Farm Course 11. All materials updated on: September 30, 2004 Outline 1.Introduction 2.Advantages 3.Requirements 4.Installation 5.Architecture.
Operating Environment. Installation and Upgrade Options Solaris suninstall program Solaris Web Start Installation Custom Jumpstart procedure Standard.
Windows Certification Paths OR MCSA Windows Server 2012 Installing and Configuring Windows Server 2012 Exam (20410) Administering Windows Server.
Planning Server Deployments Chapter 1. Server Deployment When planning a server deployment for a large enterprise network, the operating system edition.
Windows Server 2003 { First Steps and Administration} Benedikt Riedel MCSE + Messaging
Using ROSSMANN to Run GOSET Studies Omar Laldin ( using materials from Jonathan Crider, Harish Suryanarayana ) Feb. 3, 2014.
Linux Introduction Linux was developed in the early 1990’s by Linus Torvald computer science student at the University of Helsinki Linux is distributed.
Advanced Computing Facility Introduction
Workstations & Thin Clients
Chapter 1 Introducing Windows Server 2012/R2
GRID COMPUTING.
DIT314 ~ Client Operating System & Administration
Welcome to Indiana University Clusters
Create setup scripts simply and easily.
Guide to Linux Installation and Administration, 2e
Virtualization in the gLite Grid Middleware software process
Integration of Singularity With Makeflow
Networking for Home and Small Businesses – Chapter 2
CCR Advanced Seminar: Running CPLEX Computations on the ISE Cluster
Networking for Home and Small Businesses – Chapter 2
Operating Systems Networking for Home and Small Businesses – Chapter 2 – Introduction To Networking.
The Neuronix HPC Cluster:
The EU DataGrid Fabric Management Services
Quick Tutorial on MPICH for NIC-Cluster
Presentation transcript:

MSc. Miriel Martín Mesa, DIC, UCLV

The idea Installing a High Performance Cluster in the UCLV, using professional servers with open source operating system

¿Why? The current researches require a large amount of computational resources that can not be obtained with a single computer. The need to make several runs of the experiments, without having to wait to finish the current run to execute the next The possibility of having an electric back that allows running jobs that require several days to finish

Current Hardware  7 nodes Dell R410 with:  2 Intel processors with 6 cores x processors  12 GB RAM,  250 GB hard drive  2 NIC Gbps,  10 Blade nodes Dell 1955 with:  2 Intel processors with 2 cores x processors  12 GB RAM,  36 GB HDD

Current Hardware 17 nodes with:  28 processors,  132 cores,  204 GB RAM,  1.3 TFLOPS (theoretical)

Cluster design Beowulf design

Basic Software S/O: Debian 7 Resource manager: TorquePBS Scheduler: MAUI Central user authentication: NIS Server

Cluster installation (Master and nodes) PXE (Preboot eXecution Environment) DHCP TFTP HTTP server DNS Server (BIND) Preseed script (Answers to installation questions)

Preseed code d-i mirror/protocol string http d-i mirror/country string manual d-i mirror/http/hostname string master.cluster.uclv.edu.cu d-i mirror/http/directory string /debian d-i mirror/http/proxy string d-i mirror/suite string wheezy d-i partman-auto/disk string /dev/sda d-i partman-auto/method string regular d-i partman-auto/choose_recipe select atomic d-i partman-auto/purge_regular_from_device boolean true d-i partman-regular/confirm boolean true d-i partman/confirm_write_new_label boolean true d-i partman/choose_partition select Finish partitioning and write changes to disk d-i partman/confirm boolean true # If the system has free space you can choose to only partition that space. tasksel tasksel/first multiselect minimal d-i pkgsel/include string openssh-server puppet d-i preseed/late_command string sed -i 's/no/yes/g' /target/etc/default/puppet d-i mirror/protocol string http d-i mirror/country string manual d-i mirror/http/hostname string master.cluster.uclv.edu.cu d-i mirror/http/directory string /debian d-i mirror/http/proxy string d-i mirror/suite string wheezy d-i partman-auto/disk string /dev/sda d-i partman-auto/method string regular d-i partman-auto/choose_recipe select atomic d-i partman-auto/purge_regular_from_device boolean true d-i partman-regular/confirm boolean true d-i partman/confirm_write_new_label boolean true d-i partman/choose_partition select Finish partitioning and write changes to disk d-i partman/confirm boolean true # If the system has free space you can choose to only partition that space. tasksel tasksel/first multiselect minimal d-i pkgsel/include string openssh-server puppet d-i preseed/late_command string sed -i 's/no/yes/g' /target/etc/default/puppet d-i mirror/protocol string http d-i mirror/country string manual d-i mirror/http/hostname string master.uclv.cu d-i mirror/http/directory string /debian d-i mirror/suite string wheezy d-i mirror/protocol string http d-i mirror/country string manual d-i mirror/http/hostname string master.uclv.cu d-i mirror/http/directory string /debian d-i mirror/suite string wheezy d-i partman-auto/disk string /dev/sda d-i partman-auto/method string regular d-i partman-auto/choose_recipe select atomic d-i partman-regular/confirm boolean true d-i partman/confirm_write_new_label boolean true d-i partman/choose_partition select Finish partitioning and write changes to disk d-i partman/confirm boolean d-i partman-auto/disk string /dev/sda d-i partman-auto/method string regular d-i partman-auto/choose_recipe select atomic d-i partman-regular/confirm boolean true d-i partman/confirm_write_new_label boolean true d-i partman/choose_partition select Finish partitioning and write changes to disk d-i partman/confirm boolean tasksel tasksel/first multiselect minimal d-i pkgsel/include string openssh-server puppet d-i preseed/late_command string sed -i 's/no/yes/g' /target/etc/default/puppet tasksel tasksel/first multiselect minimal d-i pkgsel/include string openssh-server puppet d-i preseed/late_command string sed -i 's/no/yes/g' /target/etc/default/puppet

Cluster Management Puppet Package management and configuration of the server and the nodes.

Cluster Management Module: commons Class packages-commons { $packages_commons = ["csh","flex","byacc","vim",tcsh","lsb", "lsb-core"] package { $packages_commons : ensure => installed }

Cluster Management Module: MPICH class mpich ($mpich_version ) { file {mpich: path => "${mpich_path}", owner => root, mode => 775, ensure => directory, } exec { "mpich_configure": cwd => "${mpich_source}-${mpich_version}/", command => "nice -19 sh configure ${mpich_prefix} ${mpich_with_torque}", onlyif => "test ! -e ${mpich_source}-${mpich_version}/config.log", } … }

cron { update_ntpdate: command=> "/usr/sbin/ntpdate ", user=> root, minute=> 0, hour=> '*/1', } service { cron: ensure => running, enable => true, } Cluster Management

Monitoring tools Ganglia Provides real-time monitoring and execution environment

Monitoring tools Icinga Monitors any network resource, notifies the errors, generates performance data for reporting and reports the status of resources

System Access

 Secure shell (SSH): # ssh

System Access

 Web page

System Access  Web page

System Access  Web page

System Access  Web page

Cluster applications

Example #!/bin/bash #PBS -N example1 #PBS -l nodes=2:ppn=4 #PBS -l walltime=01:20:00 #PBS -q default #PBS -m ae #PBS -M cd $PBS_O_WORKDIR module load mpich/3.0.4 mpirun./application

Cluster queues QueuenodesaccessCores Memory (GB)jobs/users Max Time(hours)Priority Default small1 Blade nodes medium1-3Any long1-4Any

To do Implement system of user quotas Add an external storage Continue installing applications demanded by users We always need to do more

Thank you Muchas Gracias