
1 The Information System: GRIS, GIIS, BDII, information providers, installation, configuration, log files, testing & debugging
Valeria Ardizzone, INFN
EGEE Site Manager Tutorial, Bologna, 06-08 June 2007
Enabling Grids for E-sciencE (INFSO-RI-508833), www.eu-egee.org

2 Outline
– Overview of the LCG Information System
– Differences between Top and Site BDII
– Generic Information Provider
– BDII update process
– Installation, Configuration & Logging

3 LCG Information System

4 LCG Information System
LCG adopts a combination of solutions
– Globus MDS
  • At the lowest level of the information system
  • Used to discover and monitor resources and publish information
  • Grid Security Infrastructure (GSI) credentials
  • Caching
– BDII
  • At the highest level of the system
  • Introduced because MDS had scalability problems
  • Used by the Resource Broker for the matchmaking process
  • Can be configured by each VO
  • Queries the underlying systems periodically (every 2 minutes)
Hierarchical system
– Information is collected at the leaves of a hierarchical tree and travels towards the root
– Clients can query the hierarchical tree at every level
– The higher the level queried, the older the information obtained

5 Collecting Information
Information is gathered at different levels
– Lower level: Grid Resource Information Server (GRIS)
  • Collects information on the state of a given resource
  • One GRIS on top of each resource
  • A set of scripts and sensors that try to extract useful information about the resource
– Medium level: Grid Index Information Server (GIIS)
  • Collects information on the resources of a given site
  • One GIIS for each site
– Higher level: BDII
  • Collects information on the resources of a given VO
  • One BDII for each VO (suggested solution)
Way of collecting information
– Pull model (higher-level servers periodically query lower-level servers)
– LDAP query model

6 Globus MDS (The past)
Globus Monitoring and Discovery Service (MDS)
– It is a hierarchical system
– Based on LDAP servers
– GRISes are the leaves of the tree
– GIISes are the intermediate nodes of the tree
– The user can query the system at every level
– The higher the information is in the tree, the older it is
Grid Resource Information Service (GRIS)
– One for each Grid resource (CE or SE)
– Collects information on that resource
– Static or dynamic information
– Uses measurement techniques (such as sensors)
Grid Index Information Service (GIIS)
– One for each site
– Collects information from the site's GRISes
– Caches information according to its validity time
– Queries the GRISes or other GIISes when needed

7 BDII overview
The Berkeley Database Information Index (BDII)
– Developed within the context of the LCG project
– Solves the instability problems that MDS shows when the number of sites grows too large
– Sits on top of the site GIISes
– One for each VO
– Centralized system
– Three levels of hierarchy
– Accessed by the Workload Management System
Way of working
– One GRIS for each resource
– One GIIS for each site, collecting information from the GRISes below it
– One BDII for a given VO, collecting information from the GIISes below it
– Two LDAP servers, one for write access and one for read access
– Every two minutes a cron job runs a script that collects information from a list of site GIISes
– The list of GIISes is kept in the configuration file of the BDII

8 BDII overview
– Every node (except UI, BDII and WNs) runs a GRIS service to publish its information
– One node in every site collects the information from all site GRISes and publishes it through a site BDII, installed by default on the CE
– The top BDII collects the information from all site BDIIs
– Users can run a set of commands to query the top BDII

9 Top BDII vs Site BDII
Site BDII
– By default it is installed on the CE
– It collects all site GRISes* (for example SE, RB, LFC, etc.)
– The name of the service is bdii
Top BDII
– It collects all site BDIIs** (CE1, CE2, ... CEn)
– The name of the service is bdii
– It gives the RB all the information needed to match and dispatch users' jobs
– It can run on the same machine as the RB, which makes its answers faster
*GRIS = Grid Resource Information Service
**BDII = Berkeley Database Information Index

10 LCG Information System Hierarchy
[Figure: hierarchy diagram. Labels: BDII-A, BDII-B, BDII-C; Site 1, Site 2, Site 3; Site BDII; local GRIS on each RB, CE and SE node.]

11 Generic Information Provider
The Generic Information Provider, GIP, is a highly configurable information provider that separates static from dynamic information. It can be used to produce any kind of information for use with LDAP-based grid information systems.
An information provider in its simplest form is a script that prints an LDIF file to standard output. The LDIF file should conform to the schema used in the GIP.
The problem is with dynamic information. Usually there are only a few attributes that have to be found dynamically. The idea with the GIP is to use dynamic plug-ins to obtain these values and use a common framework for everything else.
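
As a rough illustration of "a script that prints an LDIF file to standard output", here is a minimal Python sketch. The function name render_ldif is hypothetical and not part of the GIP; the DN and attribute names are borrowed from the SiteInfo example later in this deck, and real providers would emit whatever their schema requires.

```python
import sys


def render_ldif(dn, attributes):
    """Return one LDIF entry as text: a dn line, then one 'name: value' line per value."""
    lines = [f"dn: {dn}"]
    for name, values in attributes:
        for value in values:
            lines.append(f"{name}: {value}")
    # A blank line terminates an LDIF entry.
    return "\n".join(lines) + "\n\n"


if __name__ == "__main__":
    # Illustrative values only; a real provider would read them from the site configuration.
    entry = render_ldif(
        "in=host.invalid/siteinfo,Mds-Vo-name=local,o=grid",
        [
            ("objectClass", ["SiteInfo", "DataGridTop"]),
            ("siteName", ["SITE-NAME"]),
        ],
    )
    sys.stdout.write(entry)
```

The sketch ignores LDIF details such as line folding and base64-encoded values, which a production provider may need.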

12 GIP configuration file
The GIP configuration file contains all the configuration parameters for the GIP.
– temp_dir: The directory to use as the GIP's cache.
– plugin_dir: The directory in which to put links to the dynamic plug-ins.
– static_dir: The directory in which to put links to the static LDIF files.
– provider_dir: The directory in which to put links to the information providers.
– freshness: How long to use the cache before running the dynamic plug-ins again.
– cache_ttl: How long the cache is valid.
– response: How long the GIP will wait for dynamic plug-ins before continuing.
– timeout: Timeout for the dynamic plug-ins in seconds.

13 GIP
The GIP script reads in all the LDIF files from the static_dir and runs all the providers in the provider_dir. It then runs any dynamic plug-ins found in the plugin_dir to obtain the dynamic values. It prints the LDIF to standard output, using any dynamic values returned by the plug-ins in place of the static ones.
Plug-ins and providers should not be put directly into these directories; instead, a symbolic link or wrapper script should be used.
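
The merge step described above can be sketched in Python as follows. This is an assumption-laden illustration, not the GIP's actual code: parse_ldif and overlay are hypothetical helpers, the parser handles only simple unfolded LDIF, and the rule "a dynamic value replaces the static value of the same attribute in the same entry" is one plausible reading of the slide.

```python
def parse_ldif(text):
    """Parse simple LDIF (no line folding, no base64) into {dn: {attr: [values]}}."""
    entries = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            current = None          # a blank line ends the current entry
            continue
        if line.startswith("#"):
            continue                # skip comment lines
        name, _, value = line.partition(":")
        name, value = name.strip(), value.strip()
        if name.lower() == "dn":
            current = entries.setdefault(value, {})
        elif current is not None:
            current.setdefault(name, []).append(value)
    return entries


def overlay(static, dynamic):
    """Copy the static entries, replacing attributes for which a plug-in supplied values."""
    merged = {dn: {a: list(v) for a, v in attrs.items()} for dn, attrs in static.items()}
    for dn, attrs in dynamic.items():
        if dn in merged:            # plug-in output for unknown entries is ignored here
            for attr, values in attrs.items():
                merged[dn][attr] = list(values)
    return merged
```

For example, a static file could declare an attribute with a placeholder value of 0 and a plug-in could print the same DN with the measured value; overlay would then keep every other static attribute untouched.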

14 SiteInfo Information Provider
    dn: in=host.invalid/siteinfo,Mds-Vo-name=local,o=grid
    objectClass: SiteInfo
    objectClass: DataGridTop
    objectClass: DynamicObject
    siteName: SITE-NAME
    sysAdminContact: admin@host.invalid
    userSupportContact: support@host.invalid
    siteSecurityContact: security@host.invalid
    dataGridVersion: LCG-2_2_0
    installationDate: 20040106120000Z

15 SiteInfo Schema
    objectClass: DataGridTop
    objectClass: SiteInfo
    in: information
    siteName: Name of site
    sysAdminContact: Contact details for local system administrator
    userSupportContact: Contact details for local user support
    siteSecurityContact: Contact details for site security
    dataGridVersion: Overall DataGrid software version
    installationDate: Date of installation
    cpuResourceDescription: Description of available CPU resources
    diskResourceDescription: Description of available disk resources
    supportedFileSystem: Supported file systems (AFS, NFS, local)
    batchSystem: Available batch system (PBS, LSF, BQS, Condor, fork, etc.)
    massStorageDescription: Description of available mass storage
    experimentalSoftware: Availability of software (ATLAS, CMS, ESA, etc.)

16 BDII Update Process (1)
The BDII consists of two or more standard LDAP databases that are populated by an update process.
The update process obtains LDIF (LDAP Data Interchange Format) either by running an ldapsearch against LDAP URLs or by running a local script that generates LDIF. The LDIF is then inserted into the LDAP database.
Options exist to update the list of LDAP URLs from a web page and to use an LDIF file from a web page to modify the data before inserting it into the database.
The daemon script starts the update process, bdii-update.

17 BDII Update Process (2)
The update process does the following in an endless loop:
– Update the configuration file that contains the LDAP URLs from the web.
– Fork off processes to obtain the LDIF from the URLs.
– Write the LDIF files in the temporary directory.
– Try to obtain an LDIF file that will modify (filter) the data.
– Modify the data read from the temporary files.
– Kill the oldest database instance along with any stale clients.
– Clear the corresponding database.
– Add the fresh data to a new database.
– Start the new database.
– Forward traffic from the BDII read port to the new database.
– Sleep for a while.
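
The rotation implied by this loop can be illustrated with a small Python simulation. It is only a sketch under stated assumptions: simulate_updates is a hypothetical name, the fetching and loading steps are left as a comment, and the order in which the ports are first used is assumed, not taken from the BDII code.

```python
from collections import deque


def simulate_updates(write_ports, cycles):
    """Return the write port that serves the read port after each update cycle."""
    order = deque(write_ports)      # leftmost = oldest database instance
    serving = []
    for _ in range(cycles):
        port = order.popleft()      # the oldest instance is the one stopped and cleared
        # ... fetch LDIF, apply the modify file, load the fresh data here ...
        order.append(port)          # the refreshed instance is now the newest
        serving.append(port)        # read-port traffic is forwarded to it
    return serving


# With three write ports, each database is refreshed on every third cycle.
print(simulate_updates([2171, 2172, 2173], 5))
```

The point of the design is that clients always read from a complete database while another one is being rebuilt, so a slow or failed refresh never exposes a half-loaded view.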

18 BDII server installation & configuration using YAIM

19 Installation Pre-requisites
Always verify the deployment of
– Java SDK
– NTP daemon
– APT
– Latest glite-yaim and ig-yaim (glite-yaim-3.0.1-15 and ig-yaim-3.0.1-2)
Check your host certificates in /etc/grid-security and their permissions:
    -rw-r--r-- 1 root root 1127 Jun 14 12:27 hostcert.pem
    -r-------- 1 root root  887 Jun 14 12:28 hostkey.pem
The firewall must be open for the BDII listening port (2170).
– In /etc/sysconfig/iptables add
    -A RH-Firewall-1-INPUT -m state --state NEW -m tcp -p tcp --dport 2170 -j ACCEPT
– And execute
    service iptables restart
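
The permission check on the host certificate and key can be scripted. The following Python sketch assumes the modes shown in the listing above (644 for hostcert.pem, 400 for hostkey.pem); permission_problems is a hypothetical helper, and it checks only the mode bits, not the owner.

```python
import os
import stat


def permission_problems(path, expected_mode):
    """Return a list of human-readable problems for one file (empty list if it looks fine)."""
    try:
        info = os.stat(path)
    except FileNotFoundError:
        return [f"{path}: missing"]
    mode = stat.S_IMODE(info.st_mode)   # keep only the permission bits
    if mode != expected_mode:
        return [f"{path}: mode {mode:o}, expected {expected_mode:o}"]
    return []


if __name__ == "__main__":
    checks = {
        "/etc/grid-security/hostcert.pem": 0o644,
        "/etc/grid-security/hostkey.pem": 0o400,
    }
    for path, expected in checks.items():
        for problem in permission_problems(path, expected):
            print(problem)
```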

20 Installation: site-info.def
Copy the site-info.def template file provided by ig_yaim into the root directory and customize it:
    cp /opt/glite/yaim/examples/ig-site-info.def /root/my-site-info.def
Open /root/my-site-info.def in a text editor and set the following values according to your grid environment:
    MY_DOMAIN=ct.infn.it
    BDII_HOST=
    BDII_HTTP_URL="http://grid018.ct.infn.it/fileadmin/bdii/bdii-update.conf"
BDII_HTTP_URL points to the list of the GIISes (or site BDIIs) from which the BDII server fetches the information.

21 Installation: site-info.def
BDII_REGIONS is the list of nodes publishing information on the BDII:
    BDII_REGIONS="CE SE RB LFC VOBOX PX FTS MON"
    BDII_CE_URL="ldap://$CE_HOST:2135/mds-vo-name=local,o=grid"
    BDII_SE_URL="ldap://$CLASSIC_HOST:2135/mds-vo-name=local,o=grid"
    …
    BDII_FTS_URL="ldap://$FTS_HOST:2170/mds-vo-name=resource,o=grid"
    BDII_MON_URL="ldap://$MON_HOST:2135/mds-vo-name=local,o=grid"
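
The naming pattern in this example suggests that each name in BDII_REGIONS is paired with a BDII_<NAME>_URL variable. A short Python sketch of a consistency check built on that assumption follows; region_urls is a hypothetical helper, it works on an already-parsed dictionary of settings, and it does not expand shell variables such as $CE_HOST.

```python
def region_urls(settings):
    """Map each name in BDII_REGIONS to BDII_<NAME>_URL and report the variables not set."""
    urls = {}
    missing = []
    for region in settings.get("BDII_REGIONS", "").split():
        key = f"BDII_{region}_URL"
        if key in settings:
            urls[region] = settings[key]
        else:
            missing.append(key)
    return urls, missing


settings = {
    "BDII_REGIONS": "CE SE FTS",
    "BDII_CE_URL": "ldap://$CE_HOST:2135/mds-vo-name=local,o=grid",
    "BDII_FTS_URL": "ldap://$FTS_HOST:2170/mds-vo-name=resource,o=grid",
}
urls, missing = region_urls(settings)
print(missing)  # names of the URL variables that still need to be defined
```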

22 BDII Installation
This command downloads and installs all the needed packages:
    /opt/glite/bin/ig_install_node /root/my-site-info.def ig_BDII
Now we can configure the node:
    /opt/glite/bin/ig_configure_node /root/my-site-info.def ig_BDII

23 BDII Configuration
The BDII is configured by the following files:
– /opt/bdii/etc/bdii.conf
– /opt/bdii/etc/bdii-update.conf
– /opt/bdii/etc/schemas
– /opt/bdii/var/bdii-fwd.conf
Services when the component is running:
– Service: bdii-update
  • Started by: /etc/init.d/bdii
  • Configuration: /opt/bdii/etc/bdii-update.conf, /opt/bdii/etc/bdii.conf
– Service: bdii-fwd
  • Started by: /etc/init.d/bdii
  • Configuration: /opt/bdii/etc/bdii.conf
– Service: slapd
  • Started by: /etc/init.d/bdii
  • Configuration: /opt/bdii/etc/bdii.conf
– Service: bdii
  • Started by: n/a
  • Configuration: /opt/bdii/etc/bdii.conf

24 bdii.conf
BDII settings in /opt/bdii/etc/bdii.conf:
    BDII_PORT_READ=2170 <- The port used for incoming queries.
    BDII_PORTS_WRITE="2171 2172 2173" <- The database ports. Each port specified creates a database.
    BDII_USER=edguser <- The user that runs the slapds.
    BDII_BIND=mds-vo-name=local,o=grid <- The top-level DN for the database.
    BDII_PASSWD=XXXXXXX <- The password for the slapds.
    BDII_SEARCH_FILTER='(|(objectClass=GlueSchemaVersion)(objectClass=GlueTop))' <- A search filter for use with the ldapsearches.
    BDII_SEARCH_TIMEOUT=30 <- Time to wait for LDAP URLs to respond.
    BDII_BREATHE_TIME=60 <- Time to wait before updating the next database.
    BDII_AUTO_UPDATE=yes <- Whether or not to update the file containing the LDAP URLs.
    BDII_AUTO_MODIFY=yes <- Whether or not to modify the data using an LDIF modify file.
    BDII_DIR=/opt/bdii/ <- The location and working directory of the bdii.
    BDII_UPDATE_URL=http://grid018.ct.infn.it/fileadmin/bdii/bdii-update.conf <- The URL to use to update the LDAP URLs file.
    BDII_UPDATE_LDIF=http://goc.grid-support.ac.uk/gridsite/bdii/BDII/www/bdii-update.ldif <- The URL to use to update the LDIF modify file.
    SLAPD=/usr/sbin/slapd <- The location of the slapd command.
    SLAPADD=/usr/sbin/slapadd <- The location of the slapadd command.
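
Because bdii.conf is a flat list of KEY=value assignments, it is easy to inspect from a script. The Python sketch below uses a hypothetical parse_conf helper that handles only plain assignments with optional surrounding quotes (no shell expansion). It derives the list of write ports, which, according to the slides, is also the number of databases the BDII keeps.

```python
def parse_conf(text):
    """Parse KEY=value lines, stripping one pair of matching quotes around the value."""
    settings = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")   # split on the first '=' only
        value = value.strip()
        if len(value) >= 2 and value[0] == value[-1] and value[0] in "'\"":
            value = value[1:-1]
        settings[key.strip()] = value
    return settings


sample = """BDII_PORT_READ=2170
BDII_PORTS_WRITE="2171 2172 2173"
BDII_BIND=mds-vo-name=local,o=grid
BDII_BREATHE_TIME=60
"""
conf = parse_conf(sample)
write_ports = [int(p) for p in conf["BDII_PORTS_WRITE"].split()]
print(len(write_ports), "database ports:", write_ports)
```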

25 bdii-update.conf
    GILDA-INFN-CATANIA ldap://grid010.ct.infn.it:2170/mds-vo-name=gilda-infn-catania,o=grid # 3.0.0
    INFN-PADOVA ldap://gilda-ce-01.pd.infn.it:2170/mds-vo-name=infn-padova,o=grid # 3.0.0
    INFN-CNAF ldap://grid011f.cnaf.infn.it:2170/mds-vo-name=infn-cnaf,o=grid # 3.0.0
    IUCC-LCG2 ldap://grid004.iucc.ac.il:2170/mds-vo-name=IUCC-LCG2,o=grid # 3.0.0
    CNR-ROMA ldap://grid0021.datagrid.cnr.it:2170/mds-vo-name=cnr-roma,o=grid #
    IHEP-BEIJING ldap://gilda01.ihep.ac.cn:2170/mds-vo-name=IHEP-BEIJING,o=grid # 3.0.0
    ICEAGE-CATANIA ldap://iceage-ce-01.ct.infn.it:2170/mds-vo-name=ICEAGE-CATANIA,o=grid # 3.0.0
    SZTAKI ldap://gn0.hpcc.sztaki.hu:2170/mds-vo-name=SZTAKI,o=grid # 3.0.0
    GILDA-TORINO ldap://ce-nano-37.to.infn.it:2170/mds-vo-name=GILDA-TORINO,o=grid # 3.0.0
    UNIDIST-GENOVA ldap://grid-ce.bio.dist.unige.it:2170/mds-vo-name=dist-genova,o=grid # 3.0.0
    GILDA-ING-MESSINA ldap://trigriden00.unime.it:2170/mds-vo-name=gilda-ing-messina,o=grid # 3.0.0
    IISAS-BRATISLAVA ldap://dgt01.ui.savba.sk:2170/mds-vo-name=iisas-gilda,o=grid # 3.0.0
    CE-CETA-CIEMAT ldap://ce-gilda-edu.ceta-ciemat.es:2170/mds-vo-name=CE-CETA-CIEMAT,o=grid # 3.0.0
    # Sites with problems
    #NeSC-Edinburgh ldap://tc06.nesc.ed.ac.uk:2170/mds-vo-name=NeSC-Edinburgh,o=grid # 3.0.0
    #GILDA-CECUM-MESSINA ldap://trigrid-ce00.unime.it:2170/mds-vo-name=gilda-cecum-messina,o=grid # 3.0.0
    #GILDA-ROMA3 ldap://gildace01.roma3.infn.it:2170/mds-vo-name=GILDA-ROMA3,o=grid # 3.0.0 waiting for fix
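
Each active line of this file pairs a site name with an LDAP URL, and everything after "#" is a comment. A hedged Python sketch of a reader for that layout follows; parse_update_conf is a hypothetical helper inferred from the listing above, not the parser the BDII itself uses.

```python
from urllib.parse import urlparse


def parse_update_conf(text):
    """Return one dict per active site line: name, host, port and LDAP search base."""
    sites = []
    for raw in text.splitlines():
        # Drop trailing comments; fully commented-out lines become empty.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        parts = line.split(None, 1)
        if len(parts) != 2:
            continue                # not a "NAME URL" pair, ignore it
        name, url = parts
        parsed = urlparse(url.strip())
        sites.append({
            "name": name,
            "host": parsed.hostname,
            "port": parsed.port,
            "base": parsed.path.lstrip("/"),
        })
    return sites
```

The host and port extracted this way can be fed to a connectivity check when a site stops appearing in query results.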

26 BDII running
The BDII is started by a daemon script: bdii
When the BDII service is running, the following processes should be present on your system:
– slapd: as many instances as there are ports listed in BDII_PORTS_WRITE
– bdii-update
– bdii-fwd: 1 master, plus 2 (usually short-lived) processes per active client

27 bdii.log
After starting the BDII, the update process can be monitored by looking at the log file /opt/bdii/var/bdii.log:
    Updating DB on port 2173
    Updated configuration.
    Waiting 30 s for query results.
    Time for searches: 30 s
    Time to update DB: 2 s
    Grabbing port 2170 for 2173
    Sat Sep 23 04:09:09 CEST 2006
    Sleeping for 60
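
The timing figures in such a log excerpt can be pulled out with a few regular expressions. This Python sketch is based only on the line formats visible in the sample above; summarize_cycle is a hypothetical helper and other BDII versions may log differently.

```python
import re


def summarize_cycle(log_text):
    """Extract the port and the timing values of the first update cycle found in the text."""
    patterns = {
        "port": r"Updating DB on port (\d+)",
        "search_seconds": r"Time for searches: (\d+) s",
        "update_seconds": r"Time to update DB: (\d+) s",
        "sleep": r"Sleeping for (\d+)",
    }
    summary = {}
    for key, pattern in patterns.items():
        match = re.search(pattern, log_text)
        if match:
            summary[key] = int(match.group(1))
    return summary
```

A search time equal to BDII_SEARCH_TIMEOUT, as in the sample, may indicate that at least one source did not answer before the timeout.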

28 BDII Server Testing & Customization

29 BDII testing
You can query the BDII to check that its database is running:
    ldapsearch -x -h <BDII host> -p 2170 -b mds-vo-name=local,o=grid
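
When the same query has to be run against several servers, the command line can be assembled programmatically. The Python sketch below only builds the argument list with the options used on this slide; ldapsearch_command is a hypothetical helper and the host name in the example is a placeholder. Newer OpenLDAP releases prefer -H with an ldap:// URI over -h and -p, so adjust the flags to your client version.

```python
import shlex


def ldapsearch_command(host, port=2170, base="mds-vo-name=local,o=grid"):
    """Build the anonymous (-x) ldapsearch invocation for one BDII."""
    return ["ldapsearch", "-x", "-h", host, "-p", str(port), "-b", base]


if __name__ == "__main__":
    # "bdii.example.org" is a placeholder host, not a real server.
    print(shlex.join(ldapsearch_command("bdii.example.org")))
```

The list can be passed directly to subprocess.run, which avoids shell quoting problems with the commas and equals signs in the search base.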

30 BDII testing
From your account on the UI, set the BDII environment:
    [plt@ui-1:~]$ export LCG_GFAL_INFOSYS=<BDII host>:2170
    [plt@ui-1:~]$ lcg-infosites -h
    [plt@ui-1:~]$ lcg-info -h
Both commands print help listing all the possible queries for obtaining information about the sites.

31 BDII Customization
Example:
– Change BDII_UPDATE_URL to:
    http://grid-deployment.web.cern.ch/grid-deployment/gis/lcg2-bdii/dteam/lcg2-all-sites.conf
– Restart the service with:
    /etc/init.d/bdii restart
– Look at the new information published by the service
/opt/bdii/etc/bdii-update.conf contains the list of GIISes:
    ...
    INFN-CATANIA ldap://grid010.ct.infn.it:2170/mds-vo-name=infn-catania,o=grid # 2.6.0
    INFN-CNAF ldap://grid011f.cnaf.infn.it:2170/mds-vo-name=infn-cnaf,o=grid # 2.6.0
    INFN-PADOVA ldap://gilda-ce-01.pd.infn.it:2170/mds-vo-name=infn-padova,o=grid # 2.6.0
    IISAS-BRATISLAVA ldap://dgt01.ui.savba.sk:2170/mds-vo-name=iisas-gilda,o=grid # 2.6.0
    ULA-MERIDA ldap://grid006.cecalc.ula.ve:2170/mds-vo-name=ula-merida,o=grid
    …

32 Troubleshooting

33 /opt/bdii/var/bdii.log
If a query to the BDII does not show the information about a site, look at the BDII log file. For example:
    EUN-CAIRO-3: ldap_bind: Can't contact LDAP server
Check that:
– globus-mds is up and running on the site (ps aux | grep 2135)
– The firewall setup allows the connection
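
Scanning the log for this error by hand gets tedious with many sites. A small Python sketch that lists the sites reported as unreachable is shown below; it assumes the "SITE: ldap_bind: Can't contact LDAP server" line format from the example above, and unreachable_sites is a hypothetical helper.

```python
import re

# Matches lines such as "EUN-CAIRO-3: ldap_bind: Can't contact LDAP server".
UNREACHABLE = re.compile(r"^(?P<site>\S+): ldap_bind: Can't contact LDAP server")


def unreachable_sites(log_text):
    """Return the site names with a bind failure, in order of first appearance."""
    seen = []
    for line in log_text.splitlines():
        match = UNREACHABLE.match(line.strip())
        if match and match.group("site") not in seen:
            seen.append(match.group("site"))
    return seen


if __name__ == "__main__":
    with open("/opt/bdii/var/bdii.log") as handle:
        for site in unreachable_sites(handle.read()):
            print(site)
```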

34 BDII References
LCG-2 User Guide
– https://edms.cern.ch/file/454439//LCG-2-UserGuide.html
gLite 3.0 User Guide
– https://edms.cern.ch/file/722398/1.1/gLite-3-UserGuide.pdf
gLite 3.0 installation manual
– http://grid-deployment.web.cern.ch/griddeployment/documentation/LCG2-Manual-Install
BDII Documentation
– http://lfield.home.cern.ch/lfield/cgi-bin/wiki.cgi?area=bdii&page=documentation
– https://uimon.cern.ch/twiki/bin/view/LCG/BdiiNotes
Information System Troubleshooting Guide
– http://lfield.home.cern.ch/lfield/trouble.html

