Presentation is loading. Please wait.

Presentation is loading. Please wait.

Software Sustainability Institute www.software.ac.uk The Software Sustainability Institute 20 January 2015, HEP Software Foundation workshop Neil Chue.

Similar presentations


Presentation on theme: "Software Sustainability Institute www.software.ac.uk The Software Sustainability Institute 20 January 2015, HEP Software Foundation workshop Neil Chue."— Presentation transcript:

1 Software Sustainability Institute www.software.ac.uk The Software Sustainability Institute 20 January 2015, HEP Software Foundation workshop Neil Chue Hong (@npch), Software Sustainability Institute ORCID: 0000-0002-8876-7606 | N.ChueHong@software.ac.uk Where indicated slides licensed under Supported by Project funding from

2 Software Sustainability Institute www.software.ac.uk Of course, we don’t deliberately set out to create code that can’t be used by others Right?

3 Software isn’t special, it’s mainstream Survey of researchers from 15 Russell Group unis conducted by SSI between Aug- Oct 2014. 406 respondents covering representative range of funders, discipline and seniority. 69% 92%

4 This isn’t just about the “traditional” computational sciences Survey of researchers from 15 Russell Group unis conducted by SSI between Aug- Oct 2014. 406 respondents covering representative range of funders, discipline and seniority.

5 And it isn’t just using software, it’s researchers developing software too Survey of researchers from 15 Russell Group unis conducted by SSI between Aug- Oct 2014. 406 respondents covering representative range of funders, discipline and seniority. 56% 21%

6 So what’s the issue? Survey of researchers from 15 Russell Group unis conducted by SSI between Aug - Oct 2014. 406 respondents covering representative range of funders, discipline and seniority. Analysis of data from 49,650 grant titles and abstracts published on Gateway to Research covering 2010-2014. Analysis of job adverts posted to jobs.ac.uk in 1H2014. 71% 4% Of UK researchers have had no formal software development training Of jobs advertised in UK universities were software related 77% Of PIs had not included costs for software development in bids 30% Of UK research investment has been spent on research which relies on software … and then there are gender related issues

7 Career Paths in UK Careers outside academic sector Non-university Research (industry, government etc.) ProfessorPermanent Research Staff Early Career Research PhD students Source: The Scientific Century, Royal Society, 2010 (revised to reflect first stage clarification from “What Do PhD’s Do?” study) UK STEM graduate career paths

8 Software Sustainability Institute www.software.ac.uk Software is pervasive in research

9 Software Sustainability Institute www.software.ac.uk Pleiotropic loci – diabetes and Parkinson’s disease Selection at pleiotropic loci underlies disease co-occurrence in human populations. Navarro, Haley, Karosas et al. Submitted to Nature Genetics

10 Software Sustainability Institute www.software.ac.uk Behind every great piece of science… #go through each SNP of interest for(my $x = 0; $x < scalar @pos; $x++) { #and then each downstream SNP of interest for(my $y = $x+1; $y < scalar @pos; $y++) { #if SNPs within our chosen distance (500kb) and both present in the haplotypes file if((!($trait[$x] eq $trait[$y])) && (abs($pos[$x] - $pos[$y]) <= 500000) && (exists($legArrayPos{$pos[$x]})) && (exists($legArrayPos{$pos[$y]}))) { my $snp1ArrayPos = "”; my $snp2ArrayPos = "”; my $snp1All = "”; my $snp2All = "”; #create output file for this SNP pair my $filename = "ConditionedResults2/$chr[$x].$pos[$x]-$pos[$y].EHH.GBR.2.txt”; print "$filename\n”; unless (-e $filename) { open(OUT, ">$filename"); #####################CHANGE THESE IF NOT FOCUSING ON SECOND SNP######################### my $start = $pos[$y]-500000; if ($start < 1) { $start = 1; } my $end = $pos[$y]+500000; if ($end > $chrLengths{$chr[$x]}) { $end = $chrLengths{$chr[$x]}; }

11 Software Sustainability Institute www.software.ac.uk Just the Nature of the problem? Maintenance is not fun Producing new code is fun Statistics courtesy of Jo Hannay et al, “How Do Scientists Develop and Use Scientific Software? Published online 13 October 2010 | Nature 467, 775-777 (2010) doi:10.1038/467775a

12 Software Sustainability Institute www.software.ac.uk The modern researcher… … worries about:  Data management and analysis  Reproducible research  Scalable simulations  Integration of models and workflows  Collaboration Picture of Otto Stern courtesy of Emilio Segre Visual Archives

13 Software Sustainability Institute www.software.ac.uk reuse reproduce repeatreplicate same experiment same lab same experiment different lab same experiment different set up different experiment some of same test Figure by Carole Goble adapted from Drummond C, Replicability is not Reproducibility: Nor is it Good Science, online and Peng RD, Reproducible Research in Computational Science Science 2 Dec 2011: 1226-1227.

14 Software Sustainability Institute www.software.ac.uk SIGMOD Reproducibility SIGMOD conference offered to attempt to repeat/reproduce papers accepted at conference  2008-2012 “High burden on reviewers when setting up experiments”  Use of VMs advocated Bonnet et al, SIGMOD Record, June 2011 (Vol. 40, No. 2) doi: 10.1145/2034863.2034873

15 Software Sustainability Institute www.software.ac.uk People UK Research Computing Ecosystem Network/Collaboration Instruments Software Computing Data Communities … Communities …

16 Software Sustainability Institute www.software.ac.uk SSI Drivers and Themes Two key drivers which cause people to seek the SSI’s advice:  They want to be more productive in their research  They don’t want to be embarrassed by appearing worse than their peers Broadly, our work falls into a few key themes:  Developing the scientific computing / software development skill base  The role and reward of software in research  Recognition of software career paths  Re[peatable|producible|computable] research

17 Software Sustainability Institute www.software.ac.uk The Foundations of Digital Research Re- search Careers Recognition / Reward Skills and Capability

18 Software Sustainability Institute www.software.ac.uk The Software Sustainability Institute A national facility for cultivating world- class research through software Better software enables better research Software reaches boundaries in its development cycle that prevent improvement, growth and adoption Providing the expertise and services needed to negotiate to the next stage Developing the policy and tools to support the community developing and using research software Supported by EPSRC Grant EP/H043160/1

19 Software Sustainability Institute www.software.ac.uk SSI Objectives 1.Getting software on the research agenda 2.Supporting communities that want change 3.Increasing skills 4.Improving software Building a platform Engaging the community Being an authoritative voice

20 Communication Website & blog Campaigns Advice Guides Courses Workshops Fellowship Research Software Policy Training Community Consultancy 41 projects 92 evaluations 4 surgeries 33 UK SWC workshops 1000+ learners 50,000 readers 41 domain ambassadors 20+ workshops organised 740 researchers 50,000 grants analysed 150+ contributed articles 19,000 unique visitors per month 272 RSEs engaged1700 signatures 13 issues highlighted

21 Architectural Dominance www.software.ac.uk Image courtesy PDES Inc Slide from Sean Barker, BAE SYSTEMS, DPC Designed to Last

22 Software Sustainability Institute www.software.ac.uk SSI Organisation Community Engagement (Lead: Shoaib Sufi)  Fellowship Programme Fellowship Programme  Events and Roadshows Research Software (Lead: Steve Crouch)  Open Call for Consultancy Projects / Funded Collaborations Open Call for Consultancy Projects  Software Evaluation Software Evaluation Policy (Lead: Simon Hettrick)  Guides and Case Studies Guides and Case Studies  Best Practice and Policy Training (Lead: Aleksandra Pawlik)  Software Carpentry Software Carpentry  Software Surgeries Collaboration between universities of Edinburgh, Manchester, Oxford and Southampton.

23 Software Sustainability Institute www.software.ac.uk Water Swap Reaction Coordinate A water-swap reaction coordinate for the calculation of absolute protein-ligand binding free energies Woods CJ, Malaisree M, Hannongbua S, Mulholland AJ J. Chem. Phys. (2011) vol. 134, pp. 054114 http://dx.doi.org/10.1063/1.3519057

24 Software Sustainability Institute www.software.ac.uk Case Study: Ligand Binding Centre for Computational Chemistry, Bristol  New methods for rapid MC sampling of biomolecular systems modelled using QM/MM  Developed two codes ProtoMS (F77) + Sire (C++)  Water-Swap Reaction Coordinate method to calculate absolute protein-ligand binding free energies SSI’s work is helping to scale development  ProtoMS and Sire both single developer codes  ASPIRE/ACQUIRE framework has multiple devs Split architecture between ASPIRE (adaptive multiresolution hybrid MD simulation) and ACQUIRE (WorkPacket scheduling system with optimisation for time to result vs “green-ness” http://www.siremol.org/adaptive_dynamics

25 Software Sustainability Institute www.software.ac.uk Case Study: ICAT ICAT metadata catalogue, STFC  Support experimental data linking at Large Facilities  RAL UK (ISIS, DIAMOND, CLF), SNS US, ELLETRA Italy  ICAT operationally critical at sites, other projects looking to use  Undertook interview-based organisational review 92 observations, 32 recommendations SSI’s work means the project has move forward  Taking forward: steering group, induction process, expanded roadmapping activity, technical workshops via Skype, involved with deployment  1 year post-project 25 (75%) recommendations implemented Used by other major projects (PandataODI, CRISP) http://www.software.ac.uk/blog/2012-02-17-evaluating-software- behind-some-worlds-large-experimental-facilities http://www.software.ac.uk/blog/2012-02-17-evaluating-software- behind-some-worlds-large-experimental-facilities http://icatproject.org/

26 Software Sustainability Institute www.software.ac.uk Case Study: Fusion Plasma Culham Centre for Fusion Energy  GS-2 used to study low-frequency turbulence in magnetized plasma  No common visualisation across different groups  Deliver mutually agreeable framework that can be extended easily and can be maintained by the small fusion community SSI’s work means the software can be used between groups  Simplified & enhanced plasma visualisation tool Based on ParaView o/s tool For simulations using GS-2 o/s package  Aim to allow CCFE to contribute back to GS-2 community  “I am very confident the tool will be invaluable” Colin Roach, CCFE http://www.software.ac.uk/who-do-we-work/culham-centre-fusion-energy http://www.ccfe.ac.uk/

27 Software Sustainability Institute www.software.ac.uk Preservation vs sustainability Image courtesy of RGB Kew – not for reuse Image courtesy of London Permaculture under CC-by-nc-sa license Preservation? Sustainability? www.software.ac.uk

28 Software Sustainability Institute www.software.ac.uk Sustainable Communities Cohesion and Identity: Creating a community Tolerance and Diversity: Smart growth through collaboration Efficient use of resources: Leveraging infrastructure Adaptability to change: Governing sustainably www.software.ac.uk

29 Software Sustainability Institute www.software.ac.uk Case Study: Climate Policy Modelling CIAS team at Tyndall Centre for Climate Change Research, University of East Anglia  Develop linked climate and economic models for detailed analysis  Their software was not ready to be used by other groups One researcher/developer at UEA, several users SSI’s work means the software is robust enough that it can be installed and used by others  Enabled use of the software by the WWFN’s Climascope project and James Cook University Documented software to allow extensions by contributors Made it easier to maintain and backup Added job scheduling to improve modeling throughput New modelling framework enables new models i.e. new science http://www.tyndall.ac.uk/research/cias

30 Software Sustainability Institute www.software.ac.uk Case Study: Tinkering with APES NSF/EPSRC Funded  Polarisable empirical force fields (AMOEBA)  AMBER, Tinker, DL_POLY, ONETEP, Q-Chem SSI’s work is helping to coordinate development across many packages and many people  Different languages, licenses, styles, teams http://apes-soft.github.io/

31 Software Sustainability Institute www.software.ac.uk Case Study: Brain Imaging Brain Research Imaging Centre, Edinburgh  Develop PrivacyGuard software, a DICOM image deidentification toolkit  Created software to support new multispectral colouring modulation and variance identification technique (“MCMxxxVI”) to identify white matter lesions that are indicative of declining cognitive ability  BRIC are not principally software developers, but do provide software to other researchers SSI’s work means the software has been reviewed and refactored  Looked at exploitation Usability review, Naming/trademark review  Made it easier for BRIC staff to maintain and develop Move to standard repositories, testing and documentation processes Examination of licencing for MCMxxxVI Extraction and refactoring to create standalone tools http://www.software.ac.uk/who-do-we-work/brain-research-imaging-centre-edinburgh http://www.bric.ed.ac.uk/

32 Software Sustainability Institute www.software.ac.uk SSI Fellows 2014 2014: 16 fellows 2013: 15 fellows 2012: 10 fellows Range of subjects, career stages software.ac.uk/fellows

33 Software Sustainability Institute www.software.ac.uk Journal of Open Research Software http://openresearchsoftware.metajnl.com

34 Software Sustainability Institute www.software.ac.uk SSI Website Disseminating best practice, gathering information 19,000+ unique visits/month

35 Software Sustainability Institute www.software.ac.uk SSI Guides and Top Tips Guides provide in depth information  Licences  Software development  Project management  Repositories and project infrastructure  Open source  Community building  Publicising software  Policy Top Tips provide quick overviews  Software development  Repositories and project infrastructure  Software carpentry  Citing software  Data handling  Promoting and communicating your project  Community building and project management

36 Software Sustainability Institute www.software.ac.uk SSI Briefing Papers

37 Software Sustainability Institute www.software.ac.uk SSI Blog Articles on research software and related issues  Ask the Institute  A Day in the Software Life  Heroes of Software Engineering  Top Tips  Women in Software 150+ posts written by external contributors  10,000+ unique pageviews/month

38 Software Sustainability Institute www.software.ac.uk SSI Training Software Carpentry  http://software-carpentry.org http://software-carpentry.org  International initiative to teach basics of software engineering to computational researchers The “why” more than the “how”  Phenomenally successful – 2x oversubscription  Cheap to run but budget for 3x the coffee!  SSI are UK Coordinators for SWC  We ran 13 workshops in 2013 to 600+ learners Software Sustainability Surgeries  “Bring your own code”  “What makes Good Code good?”  Run at existing conferences, and for software funding programmes  Offering bespoke advice as well as training

39 Software Sustainability Institute www.software.ac.uk Creating a training community Bringing together 39+ organisations with interest in e- Infrastructure training Raising issues and enablers with RCUK, BIS software.ac.uk/policy

40 Software Sustainability Institute www.software.ac.uk Research Software Engineer Join the RSE community at http://www.rse.ac.uk/

41 Software Sustainability Institute www.software.ac.uk No-one sets out to write unsustainable software People need the skills and incentives to maintain software through its lifetime The Institute helps UK researchers benefit from more sustainable software

42 Work with us – www.software.ac.uk www.software.ac.uk


Download ppt "Software Sustainability Institute www.software.ac.uk The Software Sustainability Institute 20 January 2015, HEP Software Foundation workshop Neil Chue."

Similar presentations


Ads by Google