Introduction to CVMFS A way to distribute HEP software on cloud Tian Yan (IHEP Computing Center, BESIIICGEM Cloud Computing Summer School.

Slides:



Advertisements
Similar presentations
CHAPTER 15 WEBPAGE OPTIMIZATION. LEARNING OBJECTIVES How to test your web-page performance How browser and server interactions impact performance What.
Advertisements

Delivering Experiment Software to WLCG sites A new approach using the CernVM Filesystem (cvmfs) Ian Collier – RAL Tier 1 HEPSYSMAN.
Copyright 2009 FUJITSU TECHNOLOGY SOLUTIONS PRIMERGY Servers and Windows Server® 2008 R2 Benefit from an efficient, high performance and flexible platform.
Coda file system: Disconnected operation By Wallis Chau May 7, 2003.
Web Servers How do our requests for resources on the Internet get handled? Can they be located anywhere? Global?
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
What is it? Hierarchical storage software developed in collaboration with five US department of Energy Labs since 1992 Allows storage management of 100s.
Seafile - Scalable Cloud Storage System
1 Bridging Clouds with CernVM: ATLAS/PanDA example Wenjing Wu
1 Worker Nodes Web Proxies Grid Site Repository Mirrors (Stratum 1) HTTP CernVM-FS fuse module on WNs recommended deployment Worker Nodes Web Proxies Grid.
CVMFS: Software Access Anywhere Dan Bradley Any data, Any time, Anywhere Project.
CS 4700 / CS 5700 Network Fundamentals Lecture 17.5: Project 5 Hints (Getting a job at Akamai) Revised 3/31/2014.
Distributed Computing for CEPC YAN Tian On Behalf of Distributed Computing Group, CC, IHEP for 4 th CEPC Collaboration Meeting, Sep ,
Building service testbeds on FIRE D5.2.5 Virtual Cluster on Federated Cloud Demonstration Kit August 2012 Version 1.0 Copyright © 2012 CESGA. All rights.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
October, Scientific Linux INFN/Trieste B.Gobbo – Compass R.Gomezel - T.Macorini - L.Strizzolo INFN - Trieste.
Ceph Storage in OpenStack Part 2 openstack-ch,
1 Apache. 2 Module - Apache ♦ Overview This module focuses on configuring and customizing Apache web server. Apache is a commonly used Hypertext Transfer.
StoRM Some basics and a comparison with DPM Wahid Bhimji University of Edinburgh GridPP Storage Workshop 31-Mar-101Wahid Bhimji – StoRM.
By: Ashish Gohel 8 th sem ISE.. Why Cloud Computing ? Cloud Computing platforms provides easy access to a company’s high-performance computing and storage.
Configuration Management with Cobbler and Puppet Kashif Mohammad University of Oxford.
Support in setting up a non-grid Atlas Tier 3 Doug Benjamin Duke University.
Laboratoire LIP6 The Gedeon Project: Data, Metadata and Databases Yves DENNEULIN LIG laboratory, Grenoble ACI MD.
Predrag Buncic (CERN/PH-SFT) WP9 - Workshop Summary
Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Usage of virtualization in gLite certification Andreas Unterkircher.
Changes to CernVM-FS repository are staged on an “installation box" using a read/write file system interface. There is a dedicated installation box for.
NA61/NA49 virtualisation: status and plans Dag Toppe Larsen CERN
Breaking Barriers Exploding with Possibility Breaking Barriers Exploding with Possibility The Cloud Era Unveiled.
Ian Gable University of Victoria 1 Deploying HEP Applications Using Xen and Globus Virtual Workspaces A. Agarwal, A. Charbonneau, R. Desmarais, R. Enge,
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
DPM Python tools Ivan Calvet IT/SDC-ID DPM Workshop 10 th October 2014.
2012 Objectives for CernVM. PH/SFT Technical Group Meeting CernVM/Subprojects The R&D phase of the project has finished and we continue to work as part.
Maite Barroso - 10/05/01 - n° 1 WP4 PM9 Deliverable Presentation: Interim Installation System Configuration Management Prototype
Catalin Condurache STFC RAL Tier-1 GridPP OPS meeting, 10 March 2015.
Using CVMFS to serve site software Sarah Williams Indiana University 2/01/121.
Andrea Manzi CERN On behalf of the DPM team HEPiX Fall 2014 Workshop DPM performance tuning hints for HTTP/WebDAV and Xrootd 1 16/10/2014.
+ AliEn site services and monitoring Miguel Martinez Pedreira.
NA61/NA49 virtualisation: status and plans Dag Toppe Larsen Budapest
Image Distribution and VMIC (brainstorm) Belmiro Moreira CERN IT-PES-PS.
The CernVM Infrastructure Insights of a paradigmatic project Carlos Aguado Sanchez Jakob Blomer Predrag Buncic.
T3g software services Outline of the T3g Components R. Yoshida (ANL)
Testing CernVM-FS scalability at RAL Tier1 Ian Collier RAL Tier1 Fabric Team WLCG GDB - September
1 Cloud Services Requirements and Challenges of Large International User Groups Laurence Field IT/SDC 2/12/2014.
CernVM-FS Infrastructure for EGI VOs Catalin Condurache - STFC RAL Tier1 EGI Webinar, 5 September 2013.
CVMFS: Software Access Anywhere Dan Bradley Any data, Any time, Anywhere Project.
Feedback from CMS Andrew Lahiff STFC Rutherford Appleton Laboratory Contributions from Christoph Wissing, Bockjoo Kim, Alessandro Degano CernVM Users Workshop.
36 th LHCb Software Week Pere Mato/CERN.  Provide a complete, portable and easy to configure user environment for developing and running LHC data analysis.
Predrag Buncic (CERN/PH-SFT) CernVM Status. CERN, 24/10/ Virtualization R&D (WP9)  The aim of WP9 is to provide a complete, portable and easy.
EGI-InSPIRE RI EGI Webinar EGI-InSPIRE RI Porting your application to the EGI Federated Cloud 17 Feb
Replicazione e QoS nella gestione di database grid-oriented Barbara Martelli INFN - CNAF.
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
CernVM-FS – Best Practice to Consolidate Global Software Distribution Catalin CONDURACHE, Ian COLLIER STFC RAL Tier-1 ISGC15, Taipei, March 2015.
Advancing CernVM-FS and its Development Infrastructure José Molina Colmenero CERN EP-SFT.
Considerations on Using CernVM-FS for Datasets Sharing Within Various Research Communities Catalin Condurache STFC RAL UK ISGC, Taipei, 18 March 2016.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
© 2010 VMware Inc. All rights reserved Why Virtualize? Beng-Hong Lim, VMware, Inc.
CVMFS Alessandro De Salvo Outline  CVMFS architecture  CVMFS usage in the.
CernVM-FS vs Dataset Sharing
AWS Integration in Distributed Computing
Virtualisation for NA49/NA61
Dag Toppe Larsen UiB/CERN CERN,
Progress on NA61/NA49 software virtualisation Dag Toppe Larsen Wrocław
Dag Toppe Larsen UiB/CERN CERN,
StratusLab Final Periodic Review
StratusLab Final Periodic Review
Introduction to CVMFS A way to distribute HEP software on cloud
Virtualisation for NA49/NA61
HDV130/100 Upgrade Firmware March
WLCG Demonstrator R.Seuster (UVic) 09 November, 2016
CernVM Status Report Predrag Buncic (CERN/PH-SFT).
Presentation transcript:

Introduction to CVMFS A way to distribute HEP software on cloud Tian Yan (IHEP Computing Center, BESIIICGEM Cloud Computing Summer School 2015/9/9, IHEP, Beijing

Outline  Overview  How does it work?  How to install, configure, and debug it? Introduction to CVMFS, Tian Yan, 2015/09/092

OVERVIEW Part I 3

What is CVMFS?  CVMFS = C ERN VM File System  POSIX read-only network file system based on HTTP  designed and optimized for HEP software distribution  official site: Introduction to CVMFS, Tian Yan, 2015/09/094

Why we use it?  Widely used in HEP community developed in CERN and widely used by WLCG and other HEPs  Fast based on HTTP, allow exploitation of various web caches (e.g. squid, commercial content delivery networks) aggressively cached to reduce latency transfer data and metadata on demand  Flexible filesystem versioning and hotpatching file-by-file software in VM/Docker image need further packaged Introduction to CVMFS, Tian Yan, 2015/09/095

Why we use it?  Scalable one stratum 0 (release manager machine), many stratum 1 mirrors automatic mirror server selection based on geographic proximity use web/local cache to reduce direct access to server  Reliable it verifies data integrity by cryptographic hases  Easy to install, configure and maintain only outgoing HTTP, firewall friendly can be installed by one shell script (less than one page) it’s stable, so it rarely need maintain effort Introduction to CVMFS, Tian Yan, 2015/09/096

HOW DOES IT WORK? Part II 7

A glance  left side is client; right side is server with http interface  virtual filesystem based on FUSE in user space  load data only on access Introduction to CVMFS, Tian Yan, 2015/09/098

Openning a file on CVMFS Introduction to CVMFS, Tian Yan, 2015/09/099 FUSE: filesystem in user space metadata stored in SQLite DB

CVMFS server stratums Introduction to CVMFS, Tian Yan, 2015/09/0910 for cern.ch dormain

 CVMFS repository is a form of content-addressable storage  publish: create file catalog, compress data, calculate hash CVMFS repository Introduction to CVMFS, Tian Yan, 2015/09/0911

metadata Introduction to CVMFS, Tian Yan, 2015/09/0912

CVMFS building blocks Introduction to CVMFS, Tian Yan, 2015/09/0913

HOW TO INSTALL, CONFIGURE AND DEBUG IT? Part III 14

Install stratum 0 server  Install AUFS-enabled kernel # wget release-2-5.noarch.rpmhttps://ecsft.cern.ch/dist/cvmfs/cvmfs-release/cvmfs- release-2-5.noarch.rpm # yum install cvmfs-release*.rpm # yum –disablerepo=“*” –enablerepo=“cernvm-kernel” install kernel # yum –enablerepo=“cernvm-kernel” install aufs2-util # reboot  Install cvmfs packages # yum install cvmfs cvmfs-server Introduction to CVMFS, Tian Yan, 2015/09/0915

Creating a repository  create a repository (bes3.ihep.ac.cn) # cvmfs_server mkfs bes3.ihep.ac.cn  update the repository # cvmfs-server transaction bes3.ihep.ac.cn add/edit/delete files/directories in /cvmfs/bes3.ihep.ac.cn # cvmfs_server publish bes3.ihep.ac.cn  if you want to clear all changes and start over again: # cvmfs_server abort bes3.ihep.ac.cn Introduction to CVMFS, Tian Yan, 2015/09/0916

Paths in CVMFS server  Some notable file paths in cvmfs server  /srv/cvmfs, /srv/cvmfs/, /var/spool/cvmfs can be symlinked to another location before creating the repository Introduction to CVMFS, Tian Yan, 2015/09/0917 PathDescription /cvmfsrepository mount point, read-only AUFS mountpoints /srv/cvmfscentral repo. storage location /srv/cvmfs/ storage location of a specific repo. /var/spool/cvmfsinternal states of the repo. /etc/cvmfsconfiguration files and keychains

Install client  Install cvmfs packages: # yum install cvmfs cvmfs-config-default  create a file /etc/cvmfs/default.local with content: CVMFS_REPOSITORIES=bes3.ihep.ac.cn CVMFS_HTTP_PROXY=DIRECT CVMFS_CACHE_BASE=/path/to/your/cache/dir CVMFS_QUOTA_LIMIT=10240  create file /etc/cvmfs/config.d/bes3.ihep.ac.cn with content: CVMFS_SERVER_URL= CVMFS_PUBLIC_KEY=/etc/cvmfs/keys/bes3.ihep.ac.cn.pub Introduction to CVMFS, Tian Yan, 2015/09/0918

Install client  copy the key in CVMFS server to the same dir in client: /etc/cvmfs/keys/bes3.ihep.ac.cn.pub  setup # setenforce 0 # cvmfs_config setup # service autofs restart  check if it can be mounted # cvmfs_config probe Introduction to CVMFS, Tian Yan, 2015/09/0919

Debugging  Check if misconfiguration exist # cvmfs_config chksetup  show all the configuration parameters # cvmfs_config showconfig bes3.ihep.ac.cn  to exclude autofs/automount as a source of problem, try to mount repo. manually # mkdir /mnt/cvmfs_test # mount –t cvmfs bes3.ihep.ac.cn /mnt/cvmfs_test  to exclude SELinux as a source of problem, disable it by # setenforce 0 Introduction to CVMFS, Tian Yan, 2015/09/0920

Summary  CVMFS is a good way to distributed HEP software in grid, cloud, and cluster sites over WAN  It’s a POSIX read-only network filesystem based on FUSE, HTTP, AUFS etc.  You can try setting up a server and client on VMs. Introduction to CVMFS, Tian Yan, 2015/09/0921