Presentation is loading. Please wait.

Presentation is loading. Please wait.

CENTER FOR HIGH PERFORMANCE COMPUTING Introduction to I/O in the HPC Environment Brian Haymore, Sam Liston,

Similar presentations


Presentation on theme: "CENTER FOR HIGH PERFORMANCE COMPUTING Introduction to I/O in the HPC Environment Brian Haymore, Sam Liston,"— Presentation transcript:

1 CENTER FOR HIGH PERFORMANCE COMPUTING Introduction to I/O in the HPC Environment Brian Haymore, brian.haymore@utah.edu brian.haymore@utah.edu Sam Liston, sam.liston@utah.edu sam.liston@utah.edu Center for High Performance Computing Spring 2013

2 CENTER FOR HIGH PERFORMANCE COMPUTING Overview CHPCFS Topology 12/2/2015http://www.chpc.utah.eduSlide 2

3 CENTER FOR HIGH PERFORMANCE COMPUTING Overview Couplet Topology –Redundency –Performance –Used on: Redbutte Drycreek Greenriver Saltflats IBrix Scratch –Virt/Horiz Scalable 12/2/2015http://www.chpc.utah.eduSlide 3 Network Clients 10GigE/40GigE

4 CENTER FOR HIGH PERFORMANCE COMPUTING Overview SAN storage redundancy – RAID RAID = Redundant Array of Inexpensive Disks 12/2/2015http://www.chpc.utah.eduSlide 4

5 CENTER FOR HIGH PERFORMANCE COMPUTING Overview Types of storage available at CHPC –Home Directory (i.e. /uufs/chpc.utah.edu/common/home/uNID) Per department backed up (except CHPC_HPC file system) Intended for critical/volatile data Expected to maintain a high level of responsiveness –Group Data Space (i.e. /uufs/chpc.utah.edu/common/home/pi_grp) Optional per department archive Intended for active projects, persistent data, etc. Usage expectations to be set by group –Network Mounted Scratch (i.e. /scratch/serial) No expectation of data retention (It’s scratch) Expected to maintain a high level of I/O performance under significant load –Local Disk (i.e. /tmp) Most consistent I/O No expectation of data retention Unique per machine 12/2/2015http://www.chpc.utah.eduSlide 5

6 CENTER FOR HIGH PERFORMANCE COMPUTING Overview Good Citizens –Shared Environment Characteristics Many to one relationship (over-subscribed) Globally accessible Global resource are still finite Consider your usage impact when choosing a storage location Be aware of any usage policies Evaluate different I/O methodologies Seek additional assistance from CHPC 12/2/2015http://www.chpc.utah.eduSlide 6

7 CENTER FOR HIGH PERFORMANCE COMPUTING Best Practices 12/2/2015http://www.chpc.utah.eduSlide 7 Data Segregation; Where should files be stored? –Classify your Data How important is it? Can it be recreated? Is this dataset currently in use? Will this data be used by others? Does this data need to be backed up? –Put your data in the appropriate space Home Directory Group Space Scratch /tmp

8 CENTER FOR HIGH PERFORMANCE COMPUTING Considerations Backup Impact –Performance Characteristics Time (when, duration) Competition/concurrent access Capacity of files backed up Quantity of files backed up Unintended consequences 12/2/2015http://www.chpc.utah.eduSlide 8

9 CENTER FOR HIGH PERFORMANCE COMPUTING Considerations Network Performance 12/2/2015http://www.chpc.utah.eduSlide 9

10 CENTER FOR HIGH PERFORMANCE COMPUTING Considerations Data Migration –What are we moving; What does it look like? –From where to where are we moving the data? –What transfer performance expectation do we have? –What tool will we use to make the transfer? SSH/SFTP –Simple –Very portable rsync –Restart able –File verification tar via SSH –More efficient with many small files Compression? Secure 12/2/2015http://www.chpc.utah.eduSlide 10

11 CENTER FOR HIGH PERFORMANCE COMPUTING File Operations Directory Structure –Poor performance when too many files are in the same directory –Organizing files in a tree avoids this issue –Directory block count significance Network vs. Local –IOPS vs. Bandwidth –Network I/O Overhead Limited by network pipe More efficient for bandwidth vs. IOPS –Local I/O Limited size Not globally accessible Depending on hardware offers a fair balance between bandwidth and IOPS 12/2/2015http://www.chpc.utah.eduSlide 11

12 CENTER FOR HIGH PERFORMANCE COMPUTING File Operations Metadata Operations Performance Considerations –Create, Destroy, Stat, etc. –IOPS oriented performance Application I/O Performance Considerations –How often do we open and close files? –What I/O granularity do our applications write files? – Are we doing anything else silly? 12/2/2015http://www.chpc.utah.eduSlide 12

13 CENTER FOR HIGH PERFORMANCE COMPUTING Examples Code (user example) –Code wrote millions of small files (<10kB) to a single dir –Code wrote thousands of larger files (100s of MB) –Code uses a compression algorithm to speed up I/O of the larger files. Observations –Changing default r/w chunk of compression I/O from 16kB to 32kB/64kB improved performance 10%-20%. –Changing code to write files in a hierarchical directory structure produced a multiple times speed up. 12/2/2015http://www.chpc.utah.eduSlide 13

14 CENTER FOR HIGH PERFORMANCE COMPUTING Example s Single Directory vs. Hierarchical Directory Structure 12/2/2015http://www.chpc.utah.eduSlide 14 4.3GB file

15 CENTER FOR HIGH PERFORMANCE COMPUTING Example s Processing Many Small Files over Network vs. Local 12/2/2015http://www.chpc.utah.eduSlide 15 4.3GB file

16 CENTER FOR HIGH PERFORMANCE COMPUTING Example s Bonnie Test 12/2/2015http://www.chpc.utah.eduSlide 16 4.3GB file

17 CENTER FOR HIGH PERFORMANCE COMPUTING Example s Tar (Linux Kernel) 12/2/2015http://www.chpc.utah.eduSlide 17 4.3GB file

18 CENTER FOR HIGH PERFORMANCE COMPUTING Example s Compile (Linux Kernel) 12/2/2015http://www.chpc.utah.eduSlide 18 4.3GB file

19 CENTER FOR HIGH PERFORMANCE COMPUTING Example s Fine vs. Coarse I/O (Read) 12/2/2015http://www.chpc.utah.eduSlide 19 4.3GB file

20 CENTER FOR HIGH PERFORMANCE COMPUTING Example s Fine vs. Coarse I/O (Writes) 12/2/2015http://www.chpc.utah.eduSlide 20 4.3GB file

21 CENTER FOR HIGH PERFORMANCE COMPUTING Troubleshooting 12/2/2015http://www.chpc.utah.eduSlide 21 Diagnosing Slowness –Open a ticket (issues@chpc.utah.edu)issues@chpc.utah.edu –File system –System load –Network load Future Monitoring –Ganglia Additional Information –http://www.chpc.utah.eduhttp://www.chpc.utah.edu


Download ppt "CENTER FOR HIGH PERFORMANCE COMPUTING Introduction to I/O in the HPC Environment Brian Haymore, Sam Liston,"

Similar presentations


Ads by Google