© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. Best Practices and Models for Sustainability.

Slides:



Advertisements
Similar presentations
Joint CASC/CCI Workshop Report Strategic and Tactical Recommendations EDUCAUSE Campus Cyberinfrastructure Working Group Coalition for Academic Scientific.
Advertisements

April 19, 2015 CASC Meeting 7 Sep 2011 Campus Bridging Presentation.
TITLE OF PROJECT PROPOSAL NUMBER Principal Investigator PI’s Organization ESTCP Selection Meeting DATE.
Session 5 Intellectual Merit and Broader Significance FISH 521.
What is Cyberinfrastructure?
Bill Barnett, Bob Flynn & Anurag Shankar Pervasive Technology Institute and University Information Technology Services, Indiana University CASC. September.
Data Gateways for Scientific Communities Birds of a Feather (BoF) Tuesday, June 10, 2008 Craig Stewart (Indiana University) Chris Jordan.
National Science Foundation: Transforming Undergraduate Education in Science, Technology, Engineering, and Mathematics (TUES)
1 Supplemental line if need be (example: Supported by the National Science Foundation) Delete if not needed. Supporting Polar Research with National Cyberinfrastructure.
ReStore: A sustainable web resources repository Arshad A. Khan National Centre for Research Methods
Institutional Perspective on Credit Systems for Research Data MacKenzie Smith Research Director, MIT Libraries.
How to Improve your Grant Proposal Assessment, revisions, etc. Thomas S. Buchanan.
Pti.iu.edu /jetstream Award # A national science & engineering cloud funded by the National Science Foundation Award #ACI Jetstream Overview.
Pti.iu.edu /jetstream Award # A national science & engineering cloud funded by the National Science Foundation Award #ACI Prepared for the.
Von Welch (PI) Susan Sons (HUBzero Engagement Lead) Hubbub September 2014 trustedci.org Cybersecurity for Cyberinfrastructure… and Science!
University of North Dakota Office of Institutional Research November 8, 2013 Drivers get ready - new dashboards are coming your way! Presented at the.
Campus Bridging: What is it and why is it important? Barbara Hallock – Senior Systems Analyst, Campus Bridging and Research Infrastructure.
Statewide IT Conference, Bloomington IN (October 7 th, 2014) The National Center for Genome Analysis Support, IU and You! Carrie Ganote (Bioinformatics.
Presenter: Karla Strieb Assistant Executive Director Transforming Research Libraries June 3, 2010 Supporting E-science: Progress at Research Institutions.
Craig Stewart 23 July 2009 Cyberinfrastructure in research, education, and workforce development.
THE HAPPINESS INITIATIVE TOOLKIT A Tool kit for creating your own Happiness Initiative How To Conduct A Happiness Initiative in your city or town.
Overview of NSF ACCI Task Force on Campus Bridging Report Craig Stewart Von Welch Presented at Coalition for Academic.
SYMPOSIUM ON OPEN EDUCATIONAL RESOURCES OPEN TEXT BOOKS – A VIGNETTE G. DHANARAJAN WAWASAN OPEN UNIVERSITY PENANG, MALAYSIA.
Overview: FY12 Strategic Communications Plan Meredith Fisher Director, Administration and Communication.
An Introduction to ORCID
Goodbye from Indianapolis, IUPUI, and Craig A. Stewart Executive Director, Pervasive Technology Institute Associate Dean, Research Technologies Indiana.
Chapter 6 : Software Metrics
Big Red II & Supporting Infrastructure Craig A. Stewart, Matthew R. Link, David Y Hancock Presented at IUPUI Faculty Council Information Technology Subcommittee.
Outcome Based Evaluation for Digital Library Projects and Services
XSEDE Campus Bridging Birds Of a Feather Rich Knepper Craig Stewart James Wade Ferguson Presented at TeraGrid ‘11,
Genomics, Transcriptomics, and Proteomics: Engaging Biologists Richard LeDuc Manager, NCGAS eScience, Chicago 10/8/2012.
EmergingLeadersAlliance.org. The ELA began as a joint effort of the engineering Founder Societies that was started in 2008 and has remained a collaborative.
Leveraging the National Cyberinfrastructure for Top Down Mass Spectrometry Richard LeDuc.
XSEDE12 Closing Remarks Craig Stewart XSEDE12 General Chair Executive Director, Indiana University Pervasive Technology Institute.
September 6, 2013 A HUBzero Extension for Automated Tagging Jim Mullen Advanced Biomedical IT Core Indiana University.
© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. The IQ-Table & Collection Viewer A.
Campus Cyberinfrastructure – Network Infrastructure and Engineering (CC-NIE) Kevin Thompson NSF Office of CyberInfrastructure April 25, 2012.
RNA-Seq 2013, Boston MA, 6/20/2013 Optimizing the National Cyberinfrastructure for Lower Bioinformatic Costs: Making the Most of Resources for Publicly.
Pti.iu.edu /jetstream Award # funded by the National Science Foundation Award #ACI Jetstream - A self-provisioned, scalable science and.
© 2015 Core Knowledge Foundation. This work is licensed under a Creative Commons Attribution- NonCommercial-ShareAlike 3.0 Unported License.
July 18, 2012 Campus Bridging Security Challenges from “Panel: Security for Science Gateways and Campus Bridging”
Funding your Dreams Cathy Manduca Director, Science Education Resource Center Iowa State University, 2005.
March 2, 2008 – GEC #2 Newcomerswww.geni.net1 The GPO Solicitation Process Feedback encouraged Chip Elliott GENI Project Director Clearing.
Pti.iu.edu /jetstream Award # funded by the National Science Foundation Award #ACI Jetstream Overview – XSEDE ’15 Panel - New and emerging.
Using Prior Knowledge to Improve Scoring in High-Throughput Top-Down Proteomics Experiments Rich LeDuc Le-Shin Wu.
INDIANAUNIVERSITYINDIANAUNIVERSITY Spring 2000 Indiana University Information Technology University Information Technology Services Please cite as: Stewart,
November 18, 2015 Quarterly Meeting 30Aug2011 – 1Sep2011 Campus Bridging Presentation.
February 27, 2007 University Information Technology Services Research Computing Craig A. Stewart Associate Vice President, Research Computing Chief Operating.
Assessment of Technology Options 1 Naomi Radke, seecon international GmbH.
1 Supplemental line if need be (example: Supported by the National Science Foundation) Delete if not needed. Grand Challenges Discussion 7 Oct 2015 Craig.
A national science & engineering cloud funded by the National Science Foundation Award #ACI Craig Stewart ORCID ID Jetstream.
© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. Update on EAGER: Best Practices and.
Award # funded by the National Science Foundation Award #ACI Jetstream: A Distributed Cloud Infrastructure for.
Jetstream: A new national research and education cloud Jeremy Fischer ORCID Senior Technical Advisor, Collaboration.
Funding: Staffing for Research Computing What staffing models does your institution use for research computing? How does your institution pay for the staffing.
A national science & engineering cloud funded by the National Science Foundation Award #ACI Craig Stewart ORCID ID Jetstream.
1 A national science & engineering cloud funded by the National Science Foundation Award #ACI Craig Stewart ORCID ID Jetstream.
© 2014 Core Knowledge Foundation. This work is licensed under a Creative Commons Attribution- NonCommercial-ShareAlike 3.0 Unported License.
Improving Research Proposals: Writing Proposals and the Proposal Review Process Heather Macdonald (based on material from Richelle Allen-King, Cathy Manduca,
Summary of HEP SW workshop Ian Bird MB 15 th April 2014.
© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. Informatics Tools at the Indiana CTSI.
RDA-WDS Publishing Data IG Data Bibliometrics Working Group.
Jetstream Overview Jetstream: A national research and education cloud Jeremy Fischer ORCID Senior Technical Advisor,
PAUL STACEY Except where otherwise noted these materials are licensed under a Creative Commons Attribution 3.0 (CC BY)CC BY Open Licensing Requirements.
1 Campus Bridging: What is it and why is it important? Barbara Hallock – Senior Systems Analyst, Campus Bridging and Research Infrastructure.
Research & Academic Computing Indiana University Statewide IT Conference 11 September 2003 Indianapolis IN.
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
Matt Link Associate Vice President (Acting) Director, Systems
funded by the National Science Foundation Award #ACI
Overview of working draft v. 29 January 2018
Presentation transcript:

© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. Best Practices and Models for Sustainability for Robust Cyberinfrastructure Software Craig A. Stewart Executive Director, Pervasive Technology Institute Associate Dean, Research Technologies Indiana University Orcid ID: February 2014

Starting points for Software Sustainability EAGER-funded project ACCI Campus Bridging Task Force report: – Finding 4: The existing, aggregate national cyberinfrastructure is not adequate to meet current or future needs of the US open science and engineering research community. – Finding 3: The current state of cyberinfrastructure software and current levels of expert support for use of cyberinfrastructure create barriers in use of the many and varied campus and national cyberinfrastructure facilities. These barriers prevent the US open science and engineering research community from using the existing, open US cyberinfrastructure as effectively and efficiently as possible workshop on software reusability – Cyberinfrastructure Software Sustainability and Reusability Workshop Final Report. C.A. Stewart, G.T. Almes, D.S. McCaulay and B.C. Wheeler, eds., Available from:

Our foci Cyberinfrastructure (CI) software Less emphasis on end-user applications Working definition of CI software: Software and libraries that are essential to the successful operation of other end-user applications, but which are typically not end user applications. CI software tends to lack a market of sufficient size to support commercialization.

Summary of our results No magic bullets Image credit: Bullet from Aaberg, Jason. Stock Xchng.

Initial discussions PIs naïvely hoped to find a formula – similar to the NASA Reuse Readiness Levels – that project leaders could follow and be confident that if executed well would lead to software being sustained Co-PI Von Welch channeled his inner Kierkegaard: Kierkegaard wrote something translated as “Life can only be understood backwards; but it must be lived forwards.” Welch asserted that there may well be characteristics that enable sustainability, but that “sustained” was an adjective that could be applied only after the fact.

2-phase study approach Survey of the community In-depth case studies

Survey results - What factors influence software selection? ① Software capabilities and features are the most important considerations when adopting a software package ② Total cost of ownership (purchase and annual license maintenance fees) ③ Long-term availability ④ Reliability and maturity ⑤ Initial purchase cost of the software

Survey results - Name well-sustained open source software projects ① Linux ② R-project ③ Apache ④ Mozilla ⑤ TeX & LaTeX ⑥ Python ⑦ GNU ⑧ Eclipse ⑨ OpenOffice ⑩ ImageJ

Case studies Software projectGovernance model KitwarePrivate leadership in private / public partnership; CEO ultimate authority R projectBoard of directors, large community with conferences LAPACKBenevolent dictatorship with strong leadership group HUBzeroFoundation structured as LLC KualiFoundation structured as 501(c)3 UnidataExecutive director, advisory group, with strong NSF support Globus OnlineSmall company, led by benevolent dictatorship & strong leadership group. Annual conference GalaxyStrong leadership team with flat organization; strong community guidance HTCondorBenevolent dictatorship with strong leadership group and user conference

Factors that did NOT seem to differ about well- sustained software compared with other software Open source licenses per se Use of good practices in code management NASA Reuse Readiness Levels

What software building, testing, and hardening methodologies have proved effective in enabling software to be widely used by the NSF research community? What licensing terms are preferred? All well-sustained projects we studied had good software management, build, and test processes All well-sustained projects produce software that is robust Most permissive (in terms of reuse) open source licenses policies possible

What administrative structures, governance processes, and operational infrastructure enable software to be sustainable in the long term? Strong, committed leadership Tendency for leadership to have a strong and easily identified core of key leaders (some tendency for the leaders to be geographically co-located) Well-sustained projects tended to have tight control over the official code stack

What community engagement activities (e.g. requirements gathering, prototyping) demonstrate relevance to the software’s eventual success? Developers engaged with users. A centralized development team with regular, routine contact with users and leadership was an essential part of the projects we studied that had been well sustained over the long term. Domain experts engaged with developers. Software initiatives that pair subject matter (domain) expertise with technological and engineering expertise have an increased probability of broad adoption and sustainability. Conferences and web-based self-help were important community engagement mechanisms for some projects.

What financial models provide for long-term sustainability of software as well supported infrastructure? In general, NSF funding matters a lot, but studied success stories align with Dan Katz’s view of sustainable. Continued and ongoing funding for Unidata seems really important but also an edge case. Globus Online, HUBzero, and HTCondor all gain in sustainability in part through very large user communities. Kuali is interesting and different – maybe a model for the scientific community in some ways Kitware is an interesting model but may not be easily replicable.

Example of the value of sustained software to NSF and the scientific community Basic XSEDE-compatible cluster build Based exclusively on open source Distributed as a Rocks Roll and as RPMs Allows sysadmins and IT support experts at small institutions, particularly MSIs, to leverage software and training materials created by XSEDE

Baseline (don’t leave home without these) Software should be licensed with a very permissive open source license (in particular, one that permits commercializing derivative works). Good software engineering, build, and test practices are essential, including using a single (but well backed up) and definitive code repository.

Recommendations to SI2 PIs part 1 Project leaders should maintain control over the definitive code repository, and employ thorough testing and quality control processes for code releases. Provide good documentation and some mechanism for self help or community-mediated help via the web. Software projects should have extensive engagement with their intended user community – with a conference? – which should drive a detailed software development plan. (“Make sure your software matters to someone – a lot”)

Recommendations to SI2 PIs, Part 2 Expect to devote much time and energy to leading a software project, and/or hire a delegate dedicated to leading the project. Have, obtain, or hire expertise in business management in a nonprofit or small business. Develop a careful and phased sustainability management plan, ranging from preliminary to advance; should be part of any proposal that requests funds for development, expansion, or ongoing maintenance of scientific software. Experiment with emerging options for developing income, beyond depending on NSF funding. NB: The last two bullets reflect the age-old question: Which matters more, the planning or the plan?

Sustained software and replicability Multiple models are promoted for replicability of analyses. Distribution of open source code is one option and a very simple one. Repasky example Suggestion for the community and NSF: Production of open source software should be a societal value. Replicability of analyses should be a societal benefit. They should be advocated and valued as a broader impact in proposals that aim to sustain software efforts in the long term.

Nothing unites like a common problem What universities and colleges don’t need to pay for along with their business software is sailboats. Island Graphic from B. Wheeler Speeding up on curves. EDUCAUSE Review. ©Brad Wheeler Used with permission.

Thinking about this issue from the standpoint of a community of practice The community of practice model seems potentially useful to the SI2 projects. Replicability and access seem to be common problems that SI2 projects can address collectively in a community of practice framework (may force changes in community views on what matters in a discipline). One of the things that all software projects seem to struggle with is assessments. There is a steep learning curve on some of the alternate funding mechanisms (e.g. NET+). Also the EU example of “supporting actions.”

License terms Please cite as: Stewart, C.A Update on EAGER: Best Practices and Models for Sustainability for Robust Cyberinfrastructure Software. Presented at SI2 PI meeting, 24 February Arlington VA. Available at Except where otherwise noted, by inclusion of a source url or some other note, the contents of this presentation are © by the Trustees of Indiana University. This content is released under the Creative Commons Attribution 3.0 Unported license ( This license includes the following terms: You are free to share – to copy, distribute and transmit the work and to remix – to adapt the work under the following conditions: attribution – you must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work). For any reuse or distribution, you must make clear to others the license terms of this work. 22

Acknowledgments This research supported by NSF Award EAGER: Best Practices and Models for Sustainability for Robust Cyberinfrastructure Software. Craig Stewart, PI; William K. Barnett, Richard Knepper, Von Welch, Eric Wernert Co- PIs. Julie Wernert conducted the case studies and the survey upon which this talk rests. This research was also supported by a generous grant from the Lilly Endowment, Inc. to the Indiana University Pervasive Technology Institute. Any opinions expressed here are those of the presenter and do not necessarily reflect positions of the National Science Foundation or the Lilly Endowment. All comparisons between Von Welch and Soren Kierkegaard are completely unauthorized by Von.

Further information Some final reports are online and available from Raw data from the survey are available from IUScholarWorks – linked from this page. One more report (summary of the case studies)will be published this week. A draft of the final overall report on results and findings will be posted soon for community comment, and finalized by March 28. Thank you…. Questions?