W HAT I DID Installed compute manager and AIF on Eos Created test cases for PBS features Created test cases for User Inputs Submit feedback / bug reports with PBS Documented process for future implementations / troubleshooting
R ESULTS Good Easy to create different application forms Instant job monitoring Restrict input values Default input values Secure file transferring
R ESULTS Bad Easy to put results in insecure location Always copies the input files Missing a form entry can result in lost output files Spams the sudo log “Fixed in next version (Week after I leave)”
U PDATING HPC W IKI Moinmoin wiki (python) 1.8.8 to 1.9.4 Used temporary virtual machine to test update and fix issues Added support for viewing reports Deployed on hpcweb Note: Learn what type of service monitoring is being used before taking down a system.
W IKI R EPORTS Automatically generate a visual report of an XML document each month Created the XSL Putting data into charts Automation ('Right' way vs. Working way) Editing to reduce transcription errors
I NTEL C OMPILER I SSUE (ICC) Issue Compile times on Quark are much longer than Fission (head nodes) Quark should be faster (hardware wise) 17 minutes on Quark 8 minutes on Fission
I NTEL C OMPILER S TEPS Create test cases Determine effected systems Enable debugging Strace Wireshark Hardware Test Environment
ICC S OLUTION License files were resolved in the order License manager User's home directory /opt/intel /apps/intel/..../license 'Errors' in the license file cause the system to check all of the sources
ICC S OLUTION The /opt/intel license files pointed to the license manager This caused additional requests to the license manager (takes time) Quark's /opt/intel license files pointed to the license servers the most *Removed /opt/intel/license folder to fix the problem.
T HINGS L EARNED Python XSL Creating and Signing SSL Keys Unix permissions Strace Testing Refactoring Monitoring Vim!