1 CNAP 22nd March 2004 Summary of Atlas Petabyte Data Store User Group Meeting March 4 th 2004.

Slides:



Advertisements
Similar presentations
Jens G Jensen Atlas Petabyte store Supporting Multiple Interfaces to Mass Storage Providing Tape and Mass Storage to Diverse Scientific Communities.
Advertisements

The Atlas Petabyte Datastore A grid enabled, networked data storage system: CrystalGrid Workshop 15 th Sept 2004 David Corney.
Peter Berrisford RAL – Data Management Group SRB Services.
Chapter 20 Oracle Secure Backup.
James Crowley C3 – Crowley Computer Consulting. Presentation  Available at
A new standard in Enterprise File Backup. Contents 1.Comparison with current backup methods 2.Introducing Snapshot EFB 3.Snapshot EFB features 4.Organization.
HEPiX, CASPUR, April 3-7, 2006 – Steve McDonald TRIUMF Steven McDonald & Konstantin Olchanski TRIUMF Network & Computing Services
Protect Your Business and Simplify IT with Symantec and VMware Presenter, Title, Company Date.
Harris LiveVault® Online Backup System. Harris LiveVault 2  What is Harris LiveVault?  Why Harris LiveVault?  How Harris LiveVault works  Harris LiveVault.
INFSO-RI Enabling Grids for E-sciencE Grid & Data Preservation Boon Low System Development, EGEE Training National.
Technical Review Group (TRG)Agenda 27/04/06 TRG Remit Membership Operation ICT Strategy ICT Roadmap.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Time Series Analyst An Internet Based Application for Viewing and Analyzing Environmental Time Series Jeffery S. Horsburgh Utah State University David.
Barracuda Networks Confidential1 Barracuda Backup Service Integrated Local & Offsite Data Backup.
Deploying Visual Studio Team System 2008 Team Foundation Server at Microsoft Published: June 2008 Using Visual Studio 2008 to Improve Software Development.
- Travel Well Criteria.
Copyright © 2007 Quest Software The Changing Role of SQL Server DBA’s Bryan Oliver SQL Server Domain Expert Quest Software.
November 2009 Network Disaster Recovery October 2014.
Digital | Curation | Centre The UK Digital Curation Centre Michael Day UKOLN, University of Bath (with thanks to Peter Burnhill, Chris Rusbridge, et al.)
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
1 The SpaceWire Internet Tunnel and the Advantages It Provides For Spacecraft Integration Stuart Mills, Steve Parkes Space Technology Centre University.
David Giaretta Associate Director (Development) Funders: DCC Development Digital Curation Centre a centre of expertise in data curation and preservation.
D. Britton GridPP Status - ProjectMap 22/Feb/06. D. Britton22/Feb/2006GridPP Status GridPP2 ProjectMap.
Virtualization. Virtualization  In computing, virtualization is a broad term that refers to the abstraction of computer resources  It is "a technique.
Purpose Intended Audience and Presenter Contents Proposed Presentation Length Intended audience is all distributor partners and VARs Content may be customized.
Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.
Managing Research Data – The Organisational Challenge at Oxford James A J Wilson Friday 6 th December,
Planning and Designing Server Virtualisation.
CSED Computational Science & Engineering Department CHEMICAL DATABASE SERVICE The Current Service is Well Regarded The CDS has a long and distinguished.
Section 2 Section 2.1 Identify hardware Describe processing components Compare and contrast input and output devices Compare and contrast storage devices.
Windows Small Business Server 2003 Setting up and Connecting David Overton Partner Technical Specialist.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
12th November 2003LHCb Software Week1 UK Computing Glenn Patrick Rutherford Appleton Laboratory.
Nick Brook Current status Future Collaboration Plans Future UK plans.
Digital preservation activities at the NLW Sally McInnes 18 September 2009.
E.Soundararajan R.Baskaran & M.Sai Baba Indira Gandhi Centre for Atomic Research, Kalpakkam.
Services for Object Storage and Preservation March 2008 All content in these slides is considered work in progress. In no way does it represent an absolute.
Microsoft ® Windows ® Small Business Server 2003 R2 Sales Cycle.
An Agile Service Deployment Framework and its Application Quattor System Management Tool and HyperV Virtualisation applied to CASTOR Hierarchical Storage.
Owen SyngeTitle of TalkSlide 1 Storage Management Owen Synge – Developer, Packager, and first line support to System Administrators. Talks Scope –GridPP.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
Michael Doherty RAL UK e-Science AHM 2-4 September 2003 SRB in Action.
1 BCS, Oxfordshire, 19 February, 2004 WEB ARCHIVING issues and challenges Deborah Woodyard Digital Preservation Coordinator.
TSS Database Inventory. CIRA has… Received and imported the 2002 and 2018 modeling data Decided to initially store only IMPROVE site-specific data Decided.
OAIS Based Certification David Giaretta ERPANET WORKSHOP Antwerpen April 2004.
Integrating Active Directory with eDirectory ™ Using Novell Account Manager Reid Oakes Technical Team Manager Novell, Inc.
IT-IDT-5 Understand, communicate, and adapt to a digital world. File Management.
DISCUSSION DRAFT ONLY Data Management METRICS for NNDC and CLASS David Hermreck.
Welcome and CNAP News Thanks to Gareth Smith and RAL for hosting the meeting.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks NA3 Activity – Training and Induction Robin.
Data Preservation at Rutherford Lab David Corney 9 th July 2010 KEK.
The Storage Resource Broker and.
Introducing the RSP Chris Yates, University of Wales, Aberystwyth.
1 5/4/05 Fermilab Mass Storage Enstore, dCache and SRM Michael Zalokar Fermilab.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Office 365 is cloud- based productivity, hosted by Microsoft. Business-class Gain large, 50GB mailboxes that can send messages up to 25MB in size,
Implementing a Security Policy JISC – ICT Security Threats & Promises, April 2002 Mick Ismail ICT Services Manager City of Wolverhampton College.
Office 365 Upsell Paths.
KEEPS – a system for UELMA preservation and security
Planning for Application Recovery
KEEPS – a system for UELMA preservation and security
Office 365 is cloud-based productivity, hosted by Microsoft.
Upsell Small Business Customers to an Office 365 plan
PLM, Document and Workflow Management
Stream 2: Technical research Achievements and future plans
Backing up a Hard Disk Windows XP Tutorial 6.
Ákos Frohner EGEE'08 September 2008
Research Data Archive - technology
Robin Dale RLG OAIS Functionality Robin Dale RLG
Presentation transcript:

1 CNAP 22nd March 2004 Summary of Atlas Petabyte Data Store User Group Meeting March 4 th 2004

2 Summary of recent developments LHC, PP community and hardware upgrade, and media migration (Tim Folkes) SRB interface (Bonny Strong) SE interface for GRIDPP (Jens Jensen) Belt and braces: Improved environmental monitoring disaster recovery: New off site back-up service. OAIS, the RLG and trusted digital repositories (David Giaretta)

3 9940B connections Switch_1Switch_2 RS6000 fsc0fsc1 fsc0 9940B fsc1fsc0fsc1fsc rmt1 rmt4rmt3rmt2 rmt5-8 AAAAAAAA STK 9310 “Powder Horn” Gbit network 1.2TB

4 SRB Example: CMS Largest project using CCLRC SRB services at present is the CERN CMS experiment. SRB chosen for Pre-Challenge Production in 2003, producing data for Data Challenge ADS driver for SRB was developed to meet CMS immediate needs. SRB server installed for CMS which interfaces to ADS.

5 Future Plans for SRB to ADS The SRB driver developed for CMS will be expanded for use by other projects. ADS will run an SRB server for integration into any SRB domain. Will translate the SRB user name and/or domain name into an ADS owner name. Will use the pathtape server to map SRB collection names to ADS 6-character tape names.

6 APS Recent New Users & Potential New Users Recent New Users National Crystallography Service, Southampton University (~2TB/yr?) WASP (30TB/yr?) VIRGO Consortium (3TB/yr?) Potential New users Integrative Biology (15TB/yr?) Diamond? (1-3PB/yr?) BBSRC (BITS)? 10-20TB/yr?) Arts and Humanities Data Service? (2TB/yr)

7

8 Questionnaire responses 62% from CCLRC; 38% external 75% currently using ADS;25% not currently using or not users. Average years of use7.4 Max years of use20.0 Min years of use0.8 SD years of use6.6 Some role descriptions of those responding: “Sys admin”, “Data Analysis and data provision”, “Experiment coordinator”, “Archiver”, ”User”, “Project Data Storage Manager”, “Responsible for project back-ups”, “Project Manager”.

9 Questionnaire – Motivation and assessment “Convenient”, “Easy”, “Reliable”, “Support available”, “Secure”, “Long term back up”, “Large volume” “No need to get involved with tape storage”; “No perceived alternative” Mean Score (out of 10)8.2 Min5.0 Max10.0 SD1.8

10 Questionnaire – Web page usage Web page usage% Never21 Rarely14 Occasional57 Often 7

11 Questionnaire – Communication & Awareness Preferences for improved methods of communication % For% Against% Maybe Need for list server Need for user group meeting User awareness of recent developments Awareness ofAware (%)Not aware (%) Hardware upgrade7921 SE interface2971 SRB interface5050

12 Improvements or changes required to the service (1) Backup service available on wide platform i.e Windows PC etc Require SRM interface Need to store data sets with long names (I.e. > 6 chars) - and better than pathtape look-up is required Native support for full path names (ie. not having to use the pathtape service). Tiny tape names Use more for known downtimes etc Ability to store large files (> 2Gb)

13 Improvements or changes required to the service (2) More online storage / caching (depending on future requirements) Web / Grid interface User-queryable database of usage statistics, e.g. to find out my top-100 datasets, or to see how many times this year / month / etc a particular item has been accessed. Having this as a database that I can query using JDBC from my own management applications would be even better than static reports. Metadata lookups: it would be useful to check the file size directly from flfsys

14 Improvements or changes required to the service (3) Transparent file access (HSM) so that we could forget about (virtual) tapes Fix the problem between Solaris and the ADS software regarding multiple files on ADS datasets; Provide a backup and archive interface for NT servers. Really good tape changer driver mapped into Windows server (More support required) Quicker access to off line tapes to improve speed of restores. More documentation. More user-friendly commands for such things as rules Price control.

15 Ranked User issues questionUser specified IssueMean response (A-K) 3Need to store data sets with long names (I.e. > 6 chars) - and better than pathtape look-up is required 7.9 4Native support for full path names (ie. not having to use the pathtape service).Tiny tape names 7.7 6Ability to store large files (> 2Gb)7.3 18Price control.6.5 8Web / Grid interface6.4 5Use more for known downtimes etc6.2 16More documentation.6.1 7More online storage / caching (depending on future requirements)5.6 17More user-friendly commands for such things as rules5.6 1Backup service available on wide platform i.e Windows PC etc5.4 15Quicker access to off line tapes to improve speed of restores.5.3 9User-queryable database of usage statistics,5.0

16 Conclusions (1) Responses have been received mainly from technical, hands-on users with a good balance from both within CCLRC and from external users. The majority of responses have been received from people who are currently using the Data store. Most have many years of experience of using the Data Store. The responses received represent approximately 20% of the active users. (Total number of active[1] users = 84)[1] Given 1,2 and 3 above, the responses received are from a knowledgeable section of experienced users both internal and external to CCLRC, who comprise a representative proportion of all current active users. On this basis the responses can be believed and should be used reliably.

17 Conclusions (2) Most users understand the advantages of the ADS. I.e. they know what they want. Overall, most users get what they want from the service (8.2/10). We now have a measure from which to improve. Some of the improvements identified by the users have already or are now being addressed. Of those that are not, further clarification is required in order to understand how important the issue is to other users, and to clarify the problem adequately to consider appropriate solutions. What mechanisms could be used to achieve this? Most users were aware of the recent hardware upgrade, although a surprisingly high proportion of users (21%) were not. Most users were unaware of the SE interface, and only half were aware of the SRB interface. This matters because there are improved services coming on line from the development team, which some users may wish to take advantage of.

18 Conclusions (3) Most users (64%) use the web page at least occasionally, whereas 35% use it rarely or never. Communication between users and development team needs to be improved. Given that most users make at least occasional use of the web pages, the most simple and effective means of doing so is to keep the web site up-to-date with current developments. However, this will not be successful for around one third of users. Almost 80% of users are in favour of a list serv. Service. The combination of this with an improved web site should be adequate. Almost 60% of users are in favour of User group meetings. These should be continued, probably yearly.

19 Backups

20 Digital Curation Centre (DCC) Joint collaboration between CCLRC, UKOLN, and Edinburgh and Glasgow Universities. Provide advice, support, research and development into aspects of Digital Curation for the UK HE community Funded jointly by JISC and EPSRC - £1m/year for three years initially. Feb Establish collaboration with industrial partners…

21

/9940 Drive connections (old) STK 9310 ~6000 slots 3590 RS G216G108G 100Mbit Network 9940

23 Real drive performance Upgrade