Www.openfabrics.org Improving the OFED Development Process.

Slides:



Advertisements
Similar presentations
1 The Path to the Ph.D. in IS: Part 3, Advanced coursework and dissertation research.
Advertisements

Uncovering Performance and Interoperability Issues in the OFED Stack March 2008 Dennis Tolstenko Sonoma Workshop Presentation.
Slide 1 of 10 Job Event Basics A Job Event is the name for the collection of components that comprise a scheduled job. On the iSeries a the available Job.
MTS Delivery Development © 2009 IBM Corporation EMEA GLOBAL Total Microcode Support (GTMS)
VisIt Software Engineering Infrastructure and Release Process LLNL-PRES Lawrence Livermore National Laboratory, P. O. Box 808, Livermore,
Systems Analysis and Design in a Changing World, 6th Edition
Configuration Management
Big Projects  Part of this class is about picking a cool software project and building it 1.
10 Tactics for Building Online Community. Empowering Online Community Since 2001 © 2007 GoLightly, Inc. Introduction - Presenters Heather McKeon Miller.
SAIC-F QA Internal Process (DRAFT ) Sudha Chudamani QA Team, Frederick National Lab Jan 2, 2013.
PopMedNet Software Development Life Cycle Chayim Herzig-Marx Harvard Pilgrim Health Care Institute Daniel Dee Lincoln Peak Partners.
Version Control with git. Version Control Version control is a system that records changes to a file or set of files over time so that you can recall.
Software Reliability: The “Physics” of “Failure” SJSU ISE 297 Donald Kerns 7/31/00.
3 Dec 2003Market Operations Standing Committee1 Market Rule and Change Management Consultation Process John MacKenzie / Darren Finkbeiner / Ella Kokotsis,
Craig Berntson Chief Software Gardener Mojo Software Worx Branches and Merges are Bears, Oh My!
StudioSysAdmins 2 nd Annual SIGGRAPH Birds-of-a-Feather John Hickson - 08/09/2011 StudioSysAdmins 2 nd Annual SIGGRAPH Birds-of-a-Feather John Hickson.
© 2007 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Novell SLES 9 on the xw460c Blade.
OFED 1.x Roadmap & Release Process November 06 Jeff Squyres, Woodruff, Robert J, Betsy Zeller, Tziporet Koren,
DBS to DBSi 5.0 Environment Strategy Quinn March 22, 2011.
OFA-IWG Interop Event March 2008 Rupert Dance, Arkady Kanevsky, Tuan Phamdo, Mikkel Hagen Sonoma Workshop Presentation.
Software Tools and Processes Training and Discussion October 16, :00-4:30 p.m. Jim Willenbring.
ADOPTING OPEN SOURCE INTEGRATED LIBRARY SYSTEMS Best Practices Presented by Vandana Singh, PhD Assistant Professor, School of Information Sciences University.
Current Status of OFED in SUSE Linux Enterprise Server John Jolly Senior Software Engineer SUSE.
1 Lecture 19 Configuration Management Software Engineering.
Open Fabrics BOF Supercomputing 2008 Tziporet Koren, Gilad Shainer, Yiftah Shahar, Bob Woodruff, Betsy Zeller.
OFED for Linux: Status and Next Steps 1 Betsy Zeller (Qlogic), Tziporet Koren (Mellanox) 3/16/2010.
OFED 1.2 Lessons, 1.3 Planning and Field Support May 07 Tziporet Koren.
2006 Sonoma Workshop February 2006Page 1 of (#) General Windows Update Gilad Shainer Mellanox Technologies Inc.
242/102/49 0/51/59 181/172/166 Primary colors 248/152/29 PMS 172 PMS 137 PMS 546 PMS /206/ /227/ /129/123 Secondary colors 114/181/204.
MyFloridaMarketPlace MyFloridaMarketPlace Change Request Board August 30, 2007.
EGEE is a project funded by the European Union under contract IST Testing processes Leanne Guy Testing activity manager JRA1 All hands meeting,
Plan Design Analyze Develop Test Implement Maintain Systems Development Life Cycle MAT Dirtbikes.
INTRODUCTION SOFTWARE HARDWARE DIFFERENCE BETWEEN THE S/W AND H/W.
CERN IT Department CH-1211 Genève 23 Switzerland t Windows Desktop Applications Life-cycle Management Sebastien Dellabella, Rafal Otto Internet.
Open Fabrics BOF Supercomputing 2008 Tziporet Koren, Gilad Shainer, Yiftah Shahar, Bob Woodruff, Betsy Zeller Rev. 0.9.
OFED - Status and Process November 2007 Tziporet Koren.
March 11, 2008 USCMS Tier-2 Workshop Oh Dear God Alain made a PowerPoint presentation 1.
© 2002 IBM Corporation Confidential | Date | Other Information, if necessary June, 2011 Made available under the Eclipse Public License v Mobile.
PM00. 6 Project Management Preparation for Success
Red Hat RDMA Integration and Testing Processes Doug Ledford.
OpenFabrics Enterprise Distribution (OFED) Update
RCE Platform Technology (RPT) Mark Arndt User Support.
System Maintenance Modifications or corrections made to an information system after it has been released to its customers Changing an information system.
Managing Challenging Projects Presented to the class of: Dr. Jane Mackay M.J. Neely School of Business.
Separate distribution of the analysis code (and more) P. Hristov 19/03/2014.
USM - IT BRANCHING PRESENTATION. Branch copying a codeline to create a new one codelines evolve independently //depot/main/...
INFSO-RI Enabling Grids for E-sciencE gLite Certification and Deployment Process Markus Schulz, SA1, CERN EGEE 1 st EU Review 9-11/02/2005.
Open Fabrics Interfaces Software Sean Hefty - Intel Corporation.
Process changes: Internal processes of CASA, external contributions, release schedule Mark G. Rawlings, CASA Build & Test Lead NRAO, Charlottesville Acknowledgements:
This slide deck is for LPI Academy instructors to use for lectures for LPI Academy courses. ©Copyright Network Development Group Module 01 Introduction.
Aaron Corso COSC Spring What is LAMP?  A ‘solution stack’, or package of an OS and software consisting of:  Linux  Apache  MySQL  PHP.
Repository Manager 1.3 Product Overview Name Title Date.
Component D: Activity D.3: Surveys Department EU Twinning Project.
Chapter 25 – Configuration Management 1Chapter 25 Configuration management.
보안 취약점 비교 Linux vs. Windows
Managing the Project Lifecycle
Project Center Use Cases Revision 2
Project Center Use Cases
Overview – SOE PatchTT November 2015.
Quality Assurance: Early Work Items
Overview – SOE PatchTT December 2013.
Releases and developments
Project Center Use Cases Revision 3
Project Center Use Cases Revision 3
X in [Integration, Delivery, Deployment]
System Review – The Forgotten Implementation Step
Git Best Practices Jay Patel Git Best Practices.
CSE 303 Concepts and Tools for Software Development
Proposal on TSC policy for ONAP release Maintenance
WORKSHOP Establish a Communication and Training Plan
Presentation transcript:

Improving the OFED Development Process

2 Panelists  Betsy Zeller, QLogic (Moderator)  Tziporet Koren, Mellanox  Cristoph Raisch, IBM  Dave Sommers, NetEffect  John Jolly, Novell  Many people, via , and …  You, out there in the audience!

3 Improving the OFED Development Process  Focus for this session is on OFED on Linux, rather than on Unix(s) or Windows  Focus is on process, rather than ways to improve features of release  Presentation based on input from panelists, OFED phone sessions, s to EWG list, and private s  Goals of session:  Make sure major issues are identified  Identify solutions where possible  Plan for followup

4 Agenda  Original OFED goals from 2006  How are we doing on meeting all these goals?  Process improvements since 2006  What issues have been raised?  Includes question of kernel.org/OFED!  Handling support – bug fixes and point releases  Open the floor to issues  Smoothing the way for distro integration  Next steps

5 Original OFED Goals (March, 2006)  Move away from each vendor providing their own unique snapshot of software.  Enable vendors to create a common and supportable enterprise-grade distribution of OpenFabrics SW - something they can stand behind and support 24x7. Note: Expectation was that eventually distros (RH and SUSE) would ramp up to provide support for enterprise customers.

6 How are we doing?  4 major OFED releases over last two years, installed on many sites.  Vendors are no longer pulling from top-of- trunk of SVN trees, delivering random disparate SW releases.  RH integrated OFED 1.3 into RHEL 5 U2  Novell SLES 10 SP2 to ship with OFED 1.3

7 Process improvements since ‘06  All developers now use git  Bi-weekly or weekly teleconferences, with detailed minutes  Feature list and schedule are discussed in advance, posted on OF website, and updated regularly  Processes are documented on website  Nightly builds allow vendors to test their changes before more public releases  Process is evolving for doing point releases  But …

8 Some issues have been raised  Release process is flexible, but:  Some major features have been integrated long after feature freeze, and even after the RC process has started. There’s a delicate balance between “holding to the rules”, while still meeting vendors needs for critical features and bug fixes.  OFED release dates tend to slide a little, not unlike many SW release dates!  Should OFED patches have to go through additional review process, especially after RC?

9 Issues (continued) Issue: How do you get a fast bug fix turnaround (eg 2-3 days) ? Proposed Solution:  For a kernel issue, add a patch to existing release using OFED patch script  For kernel or userland issue, send to list requesting sub-minor release (eg ). For vendor specific problem, this can be turned around fast. This gets rolled into the next release cycle, where it will be tested by all vendors.

10 Issue – Balance with kernel.org Issue: Currently, there is kernel SW in OFED which didn’t come in through kernel.org  SDP – no plan to get it in kernel.org – is this a problem?  In other components, there are fixes/changes which “missed the kernel.org train”. Proposed Solution: Topic for discussion

11 Issues (continued) Issue: What’s the process if a vendor delivers new HW off-cycle, and needs SW?  Has been a major issue for other vendors when a newer kernel version was required Proposed Solution: Topic for discussion

12 Issues (continued) Issue: Interoperability events use GA OFED SW. If a vendor fails to pass testing, they are off the Integrators List for a minimum of six months. Proposed Solution: Run initial interoperability testing with RC candidate. Do final “real” testing with GA version, which should have no surprises. Vendors can choose to be present at one or both of these events. Issue: “Final” versions of tests/test plans are not available until the event, which makes it a bit difficult to prepare in advance. Proposed Solution: Make “final” test plans available at least two weeks before event.

13 Issues (continued) Issue: There’s occasionally a difference between what’s in OFED, and what’s in latest package from maintainer. (eg, changes in verbs API to support XRC didn’t go into Roland’s released verbs package). Proposed Solution: Topic for discussion

14 Other Issues  Are there other issues about the Linux OFED development process you’d like to raise?

15 Proposed Solutions  Clearly publish (and reiterate) Feature Freeze dates from early on in the release. As part of this, clearly differentiate between “feature” and “bug fix”. (No, it’s not a bug that your feature is missing!)  Review new feature proposals, to understand implications on others  Clear discussion/negotiation of implied API changes related to new features.  Showcase process documents on OpenFabrics website, so they can all be easily found and accessed  As a community, either accept that some vendors will miss the opportunity to submit a change they care about, or accept that release dates will slip.  Run initial interoperability testing with RC candidate. Do final “real” testing with GA version, which should have no surprises. Vendors can choose to be present at one or both of these events.

16 Novell – Build Constraints  Packages That work well with 'quilt'  open source patch management utility   Use OpenSUSE Build Service  Consistent backport implementation  ofa-kernel uses kernel versions  ib-bonding uses specific distributions  #include_next can break distro use  Packaging

17 Issues – Packaging  Original plan for OFED was that it would go away when distros were ramped up to deliver and support Open Fabrics SW.  What would have to be true before the distros can handle everything?  Is there a better solution than backport patches?  Backport patches required so users don’t have to compile kernels  SUSE and RH can’t directly use OFED backport patches because “include_next” is not transparent.

18 Packaging Proposal  Kernel distribution  Aggregate kernel patches/modules in one package  Userspace distribution  Tar-balls + sample RPM spec files for releases/RCs  Use git (for daily builds) + pull script  Solution needs to meet the needs of distributors, vendors, and those who want to roll their own  Comments?

19 Point Releases  Release frequency: Between two to three months  Can be more frequent if a critical bug is found  Change guidelines: 1.Use the same kernel base of the major OFED release 2.No API changes (both in kernel and in user libs) 3.Core and ULPs (including MPI): Critical and high priority bug fixes only 4.Low level driver changes: responsibility of the HW vendor 5.Add backports to support a new OS (e.g. SLES10 SP2, FC8, etc.)

20 Point Releases (con’t)  Release verification:  All vendors should run at least basic QA/verification cycle  Full QA by any vendor who changes their low level driver  Release process:  Release manager will publish the release target date 4 weeks prior to the release  Patches will be sent against the major release git repositories  A release will be built and tested by all companies in the usual method

21 Next Steps  Collect a clear statement of the issues and proposed solutions.  Send these out to the EWG mailing list, as pending decisions from Sonoma Workshop. Deal with any issues which are raised.  Summarize feedback.  Execute!