February, 20071 Databases Project Update J.Trumbo LSC/DBI/DBA February 27, 2007.

Slides:



Advertisements
Similar presentations
How We Manage SaaS Infrastructure Knowledge Track
Advertisements

Digital Edge Solutions Overview Services – Application Support.
ITEC474 INTRODUCTION.
DSG Database Administration Projects Presentation July 09, 2003 CD Projects Status Meeting.
Securing Oracle Databases CSS-DSG JTrumbo. Audit Recommendations -Make sure databases are current with patches. -Ensure all current default accounts &
June 23rd, 2009Inflectra Proprietary InformationPage: 1 SpiraTest/Plan/Team Deployment Considerations How to deploy for high-availability and strategies.
F Fermilab Database Experience in Run II Fermilab Run II Database Requirements Online databases are maintained at each experiment and are critical for.
Copyright Tim Antonowicz, This work is the intellectual property of the author. Permission is granted for this material to be shared for non- commercial,
CERN IT Department CH-1211 Genève 23 Switzerland t Next generation of virtual infrastructure with Hyper-V Michal Kwiatek, Juraj Sucik, Rafal.
VAP What is a Virtual Application ? A virtual application is an application that has been optimized to run on virtual infrastructure. The application software.
Chapter 10 : Designing a SQL Server 2005 Solution for High Availability MCITP Administrator: Microsoft SQL Server 2005 Database Server Infrastructure Design.
D0 DB Taking Stock ‘10 1 By Anil Garg – Database Services June 17, 2010.
Oracle Application Server 10g (9.0.4) Recommended Topologies Pavana Jain.
Clarity Educational Community Get the Results You Need When You Need Them Transitioning to CA PPM On Demand Presented by: Joshua.
M ODULE 2 D ATABASE I NSTALLATION AND C ONFIGURATION Section 1: DBMS Installation 1 ITEC 450 Fall 2012.
Fermilab Oct 17, 2005Database Services at LCG Tier sites - FNAL1 FNAL Site Update By Anil Kumar & Julie Trumbo CD/CSS/DSG FNAL LCG Database.
Chapter Oracle Server An Oracle Server consists of an Oracle database (stored data, control and log files.) The Server will support SQL to define.
Online Database Support Experiences Diana Bonham, Dennis Box, Anil Kumar, Julie Trumbo, Nelly Stanfield.
D0 Taking Stock1 By Anil Kumar CD/CSS/DSG July 10, 2006.
CD Databases & Systems Services & Lessons Learned Btev Workshop June 23, 2004 Updated July 2005 J.Trumbo Core Support Services, Database Systems Group.
Sofia, Bulgaria | 9-10 October SQL Server 2005 High Availability for developers Vladimir Tchalkov Crossroad Ltd. Vladimir Tchalkov Crossroad Ltd.
Maintaining File Services. Shadow Copies of Shared Folders Automatically retains copies of files on a server from specific points in time Prevents administrators.
CDF Taking Stock ‘08 1 By Anil Kumar CD/LSCS/DBI/DBA July 16, 2008.
CERN - IT Department CH-1211 Genève 23 Switzerland t Tier0 database extensions and multi-core/64 bit studies Maria Girone, CERN IT-PSS LCG.
CD FY09 Tactical Plan Review FY09 Tactical Plans for Database Services J.Trumbo Sept. 24, 2008.
PC MANAGER MEETING January 23, Agenda  Next Meeting  Training  Windows Policy  Main Topic: Windows AV Service Review.
Chris Wright Senior Systems Engineer, Lucity MOVING TO ONE DATABASE FOR SQL SERVER.
08/30/05GDM Project Presentation Lower Storage Summary of activity on 8/30/2005.
06/22/2005CDF Taking Stock CDF Taking Stock By Anil Kumar CD/CSS/DSG June 22, 2005.
ATLAS Detector Description Database Vakho Tsulaia University of Pittsburgh 3D workshop, CERN 14-Dec-2004.
CERN Physics Database Services and Plans Maria Girone, CERN-IT
Server Virtualization & Disaster Recovery Ryerson University, Computer & Communication Services (CCS), Technical Support Group Eran Frank Manager, Technical.
Chapter 10 Chapter 10: Managing the Distributed File System, Disk Quotas, and Software Installation.
CD FY10 Budget and Tactical Plan Review FY10 Tactical Plans for Database Services Nelly Stanfield October 7, 2009 Database Services3425-v1.
Elizabeth Gallas August 9, 2005 CD Support for D0 Database Projects 1 Elizabeth Gallas Fermilab Computing Division Fermilab CD Grid and Data Management.
CERN - IT Department CH-1211 Genève 23 Switzerland t Oracle Real Application Clusters (RAC) Techniques for implementing & running robust.
CERN-IT Oracle Database Physics Services Maria Girone, IT-DB 13 December 2004.
VMware vSphere Configuration and Management v6
CDF DB Taking Stock ‘10 1 By Anil Garg – Database Services Aug 18, 2010.
1 D0 Taking Stock By Anil Kumar CD/LSCS/DBI/DBA June 11, 2007.
Alwayson Availability Groups
CD FY08 Tactical Plan Status FY08 Tactical Plan Status Report for Database Administration J.Trumbo Presented by 06/16/08.
GOOMAZURE Mannheim, 6 th October 2015 Stamitz Saal, 2:30 – 3:15 pm.
D0 Taking Stock1 By Anil Kumar CD/CSS/DSG June 06, 2005.
Oracle for Physics Services and Support Levels Maria Girone, IT-ADC 24 January 2005.
CD FY09 Tactical Plan Status FY09 Tactical Plan Status Report for Neutrino Program (MINOS, MINERvA, General) Margaret Votava April 21, 2009 Tactical plan.
CD FY08 Tactical Plan Status FY08 Tactical Plan Status Report for LSCS-DBI-APP Dennis Box June
Oracle Applications 11i Concepts II Brian Hitchcock OCP 11i DBA -- OCP 10g DBA Sun Microsystems Brian Hitchcock.
Maria Girone CERN - IT Tier0 plans and security and backup policy proposals Maria Girone, CERN IT-PSS.
November 1, 2004 ElizabethGallas -- D0 Luminosity Db 1 D0 Luminosity Database: Checklist for Production Elizabeth Gallas Fermilab Computing Division /
CNAF Database Service Barbara Martelli CNAF-INFN Elisabetta Vilucchi CNAF-INFN Simone Dalla Fina INFN-Padua.
Site Services and Policies Summary Dirk Düllmann, CERN IT More details at
Log Shipping, Mirroring, Replication and Clustering Which should I use? That depends on a few questions we must ask the user. We will go over these questions.
DBS Monitor and DAN CD Projects Report July 9, 2003.
March, Database Projects J.Trumbo CSS-DSG May,
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
Software sales at U Waterloo Successfully moved software sales online Handle purchases from university accounts Integrated with our Active Directory and.
September Database Projects J.Trumbo CSS-DSG Sept
Calgary Oracle User Group
SQL Replication for RCSQL 4.5
Lead SQL BankofAmerica Blog: SQLHarry.com
By Anil Kumar CD/CSS/DSG June 06, 2005
Database Services at Fermilab
Oracle Solaris Zones Study Purpose Only
Introduction of Week 6 Assignment Discussion
Introduction of Week 3 Assignment Discussion
Klopotek is transitioning to a Global Organization
SpiraTest/Plan/Team Deployment Considerations
IT and Development support services
Presentation transcript:

February, Databases Project Update J.Trumbo LSC/DBI/DBA February 27, 2007

February, Outline What’s included San technology for databases Infrastructure machines Health of D0ora2 Oracle 10 upgrade Advanced Security Option D0 Online transition Backup & Recovery Cad Minos D0 luminosity SDSS Freeware Nova ESH Training Accomplishments in a Nutshell Moving Forward

February, San Technology D0ora2 Disks D0 experienced data corruption on the Clarion array over the holidays. Abandoning the mount point that was a common thread in the corruptions seemed to be the root cause. This issue amplified the urgency of purchasing new storage for d0 offline. A new san has been requisitioned to replace the Clarion array on d0ora2. Minimally sized, initial hardware purchase to move off d0ofprd1, excluding event data. Longer term, with additional purchases, more database instances can be added. Only database files will be on this san, no backups or other app files. Will be starting a plan soon!

February, Hardware Purchased 1x S400 Storage Array –2 nodes (controllers) –2 disk chassis –32 146GB FC disks –16x 500GB FATA disks –dynamic optimization –virtual copy –thin provisioning –1 year 24x7 maintenance and installation

February, San Features Next generation array –Reduce amount of storage use Thin provisioning R/W snapshots –Reduce maintenance outages Dynamic Optimization/Tuning Non-disruptive upgrades –Reduce cost Non-disruptive tiered-storage

February, Infrastructure Machines Requested new infrastructure machines were not purchased last year…will try again this year. CST applications being defined as ‘major’ applications and should be moved to a more isolated dev/int/prd hardware environment. Separate instance for high-availability applications (Helpdesk/Remedy), removing Remedy’s dependency on MISCOMP and ESHTRK database so Remedy is unaffected by miscomp downtimes. Separate hardware for this would be ideal.

February, Plans for Infrastructure Applications Purchase a new dev/prod database server boxes for Infrastructure databases. –Fncduh1/g1 are 5-6 years old. –Currently, fncdug1 has 3 production and 3 integration instances. Not terabytes of data, but lots of users, lots of applications and 6 instances on 1 machine. 2 int dbs are shutdown to preserve resources for production. –G1 apps include several apache servers, miser, matrix, users and growing, resources are tight. Allow g1 to continue to serve applications, but… –Move the databases off g1&h1 to a new box. Move the databases to an exclusive database server machine to release the database from the 3 rd party dependencies as well as maximize the database resources. This new production box will use the san for disk.

February, Health of D0ora2, improved! Last report the cpu load on d0ora2 was often at 100%. We still hit 100%, but are no longer consistently at 100%. DBI/DBA’s dream is to make d0ora2 a database server machine period, no other applications running on it, till then... Actions included: S.White has implemented a new version dbserver using Oracle 10 client from Oracle 8 client on apps side. Use of Oracle 8 client apps is minimized and being deprecated. Doubled the memory from 16g to 32g. Removed int & prod cron jobs that are not utilized. Removed most the dbservers from d0ora2. R.Herber thorough investigation into the queries on datafiles found full table scans being used due to high occurrence of identical characters in the 1 st 14 digits of filename, rendering index histograms from data analysis faulty. A special analysis on datafiles removing the histogram has been invoked. Started tracking long transactions and addressing them with users. Discontinued event recording to the database, Feb 6, What else was on the list? Fix the queries that come out of the dimensions code so they do not traverse the same table 2x.

February, Oracle v10 Upgrade Completed the upgrade to Oracle 10 on all databases with the exception of the infrastructure (miscomp) instances. Upgrade included: Completely new OEM (monitoring) tool. New streams functionality. New tuning parameters. New security methods. New optimizer. Rman configuration modifications. Infrastructure databases cannot be upgraded to v10 till Matrix is retired. They have been upgraded to the terminal release of v9.

February, Oracle Advanced Security Option Advanced Security (ASO) is the Oracle product to kerberize Oracle database access. ASO does not adhere to MIT kerberos standards, and thus, has been unusable at Fermilab. Oracle’s June Futamasa, is the DOE rep. She has promised help in getting ASO fixed. Oracle gotten our issues to ‘bug’ status. We have prepared the test environment and have deployed three ASO bug patches. We have been assigned a developer at Oracle. We are continuing work with J.Futamasa and Oracle development team, holding regular meetings.

February, D0 Online Transition D0 online databases and machine transition from D0 to DBI/DBA is under way. Steps include: –Adding addition space for both dev and prod –Moving the production database that has been running on the dev machine to the production machine, specifically, histdb. –Bringing machines to a current patch level (databases have been maintained to current patch levels). Actually an upgrade to the os needed for improved cluster functionality and the aging version. However, RedHat provides no upgrade patch for clustering. RH’s suggested upgrade path is ‘buy 3 new machines and reinstall’. We do not have hardware for that suggested upgrade path. –Getting rman backups established to enstore and tested –Understanding and documenting failover technology.

February, Backup & Recovery DBI/DBA has standardized on dcache/enstore for tape backups of rman files for our larger databases. Tibs is handling the small databases (<50G), infrastructure, cad. Our homegrown product rman_dcache sends and retrieves rman files from dcache. Rman_dcache still needs a bit development work. We have been too short handed to truly finish the product and make it database platform independent. Work is done as resources can manage. The isa-group suggested last fall that the databases get a dedicated dcache pool fy 07 to minimize problems. This solution was introduced a few weeks ago, backed out and will be rescheduled at a later date.

February, Backup/Recover Database backups going to dcache/entstore: D0ofprd1 (d0 offline) Halted D0ofprd1_readonly (d0 offline events deprecated) D0oflump (d0 luminosity) Minosprd (minos sam) Cdfonprd (cdf online) Cdfofpr2 (cdf offline) D0onl (d0 online) … not quite there yet D0onl (d0 online readonly … not quite there yet

February, Cad Supporting Cad’s 2 database machines and 2 new middle tier windows machines this year. SKovich and NStanfield setup the new Sun boxes for the databases, ARomero is supporting the Windows boxes. Larry Carpenter the cad manager, has left the lab. T.Parker is interim manager. Continuing hosting bi weekly meetings with cad. TD’s goal of implementation of Team Center by Dec was missed. Data scrubbing is still not complete, however a test implementation of Ideas to Team Center was about to be launched when L.Carpenter left. T.Parker will need to pick up the pieces and move forward. Stakeholders are the PPD, TD, CD, ADMS, ADCRYO.

February, Minos Minos sam is running just fine. L.Buckley-Geer has requested us to DBI/DBA minos’ MySql database. This is under discussion.

February, D0 Luminosity D0 luminosity application is running with no real issues for DBI/DBA. D0 lum application owners have been testing constant changes code. Space is tight on dev for testing. Additional disks have been purchased for the production machine. These additional disks were planned at time of the machine purchase. They have arrived in receiving, I believe they are in prep. We will be adding these disks to the machine as soon as the array becomes available.

February, SDSS S.Lebedeva, lead DBI/DBA, J.Platson DBI/DBA in training for SDSS. In the last 6 months SDSS dbas have: Loaded Dr6, backed up raw data to enstore. Used SqlServer 2005 for DR6, major upgrade. Continued documentation. Worked on the web interface. Reviewed monitoring tools. Purchased & installed Idera sql server monitoring tool. Incorporated the Blue Arc into SDSS processes adding flexibility and cheaper disk availability.

February, Freeware DBI/DBA is continually attempting to find time and resources to Update/maintain web documentation. Cross train DBI/DBAs as time allows. Establishment of test and production freeware database environments. With the reorg DBI/DBA took over support for the MySql database server which services ~12 users. Would like to host comparable service for Postgres, but need hardware, training and people. We are working toward this. DBI/DBA plans to establish deeper background and support level for freeware soon. Assuming no unforeseen issues, we should be able to start putting some effort into our freeware environment. The 5 year leased licenses being used to service non Fermi employees for Oracle expire June 1, If serious consideration is being given to deprecate the Oracle lease license by the 2010 expiration, the project to move SAM (and possibly other apps) to freeware needs to be resurrected and given resources. The project to prove Sam under Postgres was on the Taking Stock task list for several years, and was officially removed last taking stock meeting. A Postgres Sam schema exists, but has never been proven. Else, modify Sam and possibly other experiment apps, allowing only Fermi employees to access the database.

February, Nova DBI/DBA has begun attending regularly scheduled meetings to discuss Nova. Work thus far, includes requirements documents for the online database and design discussion at both the application and database levels. This project is in it infancy but is progressing. It is expected to demand additional resources. DBI/DBA intends to provide support and direction as needed and the project progresses. No doubt, there will be more on this next report!

February, ESH Standardized ESH environment on Windows. Completed general system documentation. Established rman backups for recovery consistency. Exports are continued, used to refresh dev with prod as needed. Tested recoveries. Setup recovery scripts. Participated in audit of ESH database. Documented recovery testing for audit.

February, Training 6 months ago I reported ‘DSG is too dependant on individuals with specific expertise in areas. There has been no time to cross train.’ Though SDSS is in much better shape, there has been no improvement in other areas, however, I have hope this situation will soon be easing. DBI/DBA needs to have resources to Cross train existing responsibilities Train, practice & master new technologies Attend classes

February, Accomplishments in a Nutshell Upgraded databases to Oracle v10. Upgraded OEM grid control to v10.2 Established a Cad working group, meeting every other Monday, making small steps to transition to team center. Requisition for San storage for D0ora2. D0 online transition under way. Established standard environment for ESH. Setup a production USCMS oracle instance to accommodate tier 1 data transfer, job request data.

February, Accomplishments in a Nutshell Moved the CMS Pixel databases to Cern. Continued maintenance, patching, refreshing, accounts, etc. of operating systems and databases with a > 99% uptime. I believe cdfonprd continues at a 100% level for over 1 year. Continued maintenance of Sam schema for 3 experiments. Deployed modifications to the Sam Request schema. Continued consult to application owners on schema design and implementation.

February, Accomplishments in a Nutshell SDSS Smooth transition of SDSS to SqlServer 2005 from Implementation of DR6. New monitoring software for SDSS. Jim Gray offer to include S.Lebedeva as co author in Microsoft Tech Report: "SkyServer Traffic Report - The First Five Years": ype=Technical%20Report&id= ype=Technical%20Report&id=1236 With help from R.Pasetes & A.Romero, resolved issues with using Blue Arch with SDSS Windows, saving time, space and preventing fragmentation on SDSS boxes.

February, Moving Forward Replacement of the Clarion array on d0ora2, moving the database to the new san. New hardware, a miscomp database server machines, for dev/int&prod. G1/h1 retained to server applications. New hardware for CST major apps machines. A 24x7 database app machine. (Remedy and others) New hardware for ESH oracle instances to move them off windows. Kerberizing Oracle Continue migration of D0 online responsibilities. Nova

February, Moving Forward Continue attempting to find resources to cross train. Minos MySql responsibilities and transition. Freeware More in-depth training in freeware. –Dba training. –Establish a dev/prod Postgres environment for Fnal small database users, modeled after the existing MySql environment. Requires hardware. –Establish and publish standards, procedures and best practices documents for freeware databases. –Strengthen security baselines.