Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 U.S. Department of the Interior U.S. Geological Survey Mission Support Team Storage Architectures Presented By Ken Gacke, SAIC* U.S. Geological Survey,

Similar presentations


Presentation on theme: "1 U.S. Department of the Interior U.S. Geological Survey Mission Support Team Storage Architectures Presented By Ken Gacke, SAIC* U.S. Geological Survey,"— Presentation transcript:

1 1 U.S. Department of the Interior U.S. Geological Survey Mission Support Team Storage Architectures Presented By Ken Gacke, SAIC* U.S. Geological Survey, EROS Data Center Sioux Falls, SD July, 2004 * Work performed under U.S. Geological Survey contract 03CRCN0001

2 2 Storage Architecture Agenda Online/Nearline Storage Architecture System Backup Architecture Onsite Short-term system recovery Offsite Disaster Recovery Archive Storage Long term data preservation

3 3 Storage Architecture Agenda Online/Nearline Storage Architecture System Backup Architecture Onsite Short-term system recovery Offsite Disaster Recovery Archive Storage Long term data preservation

4 4 Storage Architecture Online Direct Attached Storage (DAS) Just a Bunch Of Disk (JBOD): Intermediate processing. Redundant Array Independent Disk (RAID): Database, Web/ftp, and product generation. Network Attached Storage (NAS): Office automation. Storage Area Network (SAN): Clustered File System for High Performance Processing Nearline: Online disk cache with high performance tape backend

5 5 Storage Architecture EDCs historical nearline experience: EPOCH, AMASS – (1987 – 1993) Optical AMASS – (1992 – 2000) Quantum DLT 2000 UniTree – (1992 – 2001) StorageTek 3480/3490/D-3/9840 DMF, AMASS, LAM (2000 – Present) StorageTek 9840/9940B

6 6 Storage Architecture Multi Tiered Storage Vision Online Supported Configurations DAS – Local processing such as image processing NAS – Data sharing such as office automation SAN – Production processing Data accessed frequently Nearline Integrated within SAN Scalable for large datasets & infrequently accessed data Multiple Copies and/or Offsite Storage

7 7 Storage Architecture Decisions Optimized by individual program and program manager, not the enterprise. Requirements Factors Reliability – Data Preservation Performance – Data Access Cost – $/GB, Engineering Support, O&M Scalability – Data Growth, Multi-mission, etc. Compatibility with current Architecture Evaluated and recommended through engineering white papers and weighted decision matrices.

8 8 High Performance RAID Weighted Matrix

9 9 Bulk RAID Weighted Matrix

10 10 CR1 Storage in Terabytes – May 2004

11 11 CR1 SAN/Nearline Architecture DMF Server Product Distribution Tape Drives 8x9840 2x9940B 1Gb Fibre 2Gb Fibre Disk Cache /dmf/edc 68GB /dmf/doqq 547GB /dmf/guo 50GB /dmf/pds 223GB /dmf/pdsc 1100GB Ethernet

12 12 Future Seamless/Silo Architecture Ethernet DMF PDS Tape Library 8x9840 3x9940B FTP (lxs37) Web/Extract TP9300S 3TB TP9400 CIFS Mount Data Servers

13 13 Storage Architecture Agenda Online/Nearline Storage Architecture System Backup Architecture Onsite Short-term system recovery Offsite Disaster Recovery Archive Storage Long term data preservation

14 14 System Backup Architecture ITS is responsible for generating system backups to maintain system integrity. Promote centralized data backup solution to the Projects Legato is used for automated system backups for the Unix (SGI, SUN, Linux) platforms. ArcServe is used for automated system backups for the Windows based platform. Fully automated backup solution Tapes located within tape library Retention period is three months

15 15 System Backup Architecture Unix Server Weekly Full backups with daily incremental: System partitions Local and third party software packages Databases DORRAN, Earth Explorer, Inventory, Seamless Legato Oracle Module for Very Large Databases Quarterly Full backups with daily incremental RAID Datasets (DRG, Browse, Anonymous FTP) Backups with exclusion of image files and large files User data file systems

16 16 System Backup Architecture Windows Servers Typically full backups with daily incremental (no exclusions) Workstations and PCs Generally, no system backups Production workstations within CR1 are backed up (International, WBV)

17 17 System Backup Resources Sun E450 (4CPU) Legato -- Unix StorageTek L700, Six SDLT 220 Drives Dell 2550 (2 CPU) ArcServe -- Windows Overland Storage NEO 4100, Three LTO-2 Drives

18 18 Legato Monthly Data Backups

19 19 Offsite Backup Architecture ITS is responsible for generating offsite backups for disaster recovery Mission essential data written to media and stored offsite LTO-2 tape generated once per week Data written in an open format (tar) Retention period is three months Projects currently using offsite storage DORRAN Inventory EarthExplorer Digital Archive LAS Source Code Web Servers

20 20 Storage Architecture Agenda Online/Nearline Storage Architecture System Backup Architecture Onsite Short-term system recovery Offsite Disaster Recovery Archive Storage Long term data preservation

21 21 Archive Storage Digital Archive Media Trade Study To analyze offline digital archive technologies and recommend the next EDC archive media of choice. Criteria in decreasing order of importance: Reliability: A second copy will reduce risk somewhat, but a reliable technology is mandatory. Reliability is proven over time. Performance: High capacity saves significant space and high transfer rates speed up transcription. Cost: The actual drive cost is fairly insignificant, but the media cost is quite important.

22 22 Archive Media Weighted Matrix FY04 Revision

23 23 System Overview Quantity of data to be copied: 2 Copies Data SetScenesData VolumeDCTs / HDTs 9940B MSS-P 65,1283.2 terabytes118 DCTs36 MSS-A262,0889.5 terabytes277 DCTs100 TM-A13,7333.6 terabytes108 DCTs40 TM-R386,934102.2 terabytes2,357 DCTs1,040 TM-R ~150,000~40.4 terabytes~7,500 HDTs420 Total:877,883158.9 terabytes10,758 Tapes1,636 Number of HDTs currently transcribed on TMACS to DCT: 30,500 Quantity of HDTs that can be land filled after conversion: 38,000+

24 24 Big Changes Your Order is in What Box?!

25 25 (1 copy)(2 copies) 13 Semis½ cargo space in SUV 38,000 HDTs< 1800 STK 9940B Impact

26 26 (1 copy)(2 copies) 13 Semis1/3 space of STK Silo 38,000 HDTs< 1800 STK 9940B Impact


Download ppt "1 U.S. Department of the Interior U.S. Geological Survey Mission Support Team Storage Architectures Presented By Ken Gacke, SAIC* U.S. Geological Survey,"

Similar presentations


Ads by Google