ERCOT Project Update ERCOT Outage Evaluation Phase 2 (SCR745) TDTWG May 7, 2008.


2 PR60006_01 Phase 2 ERCOT Update - Overview

Background:
SCR 745: To achieve improved Market performance and reliability through a reduction of ERCOT Retail Systems unplanned outages. This effort was planned to be implemented in two subprojects:
- PR60006_01: ERCOT Outage Evaluation Phase I and Phase II
  - Phase I, NAESB and Proxy Clustered (Delivered 02/2007, Goal Achieved)
  - Phase II, Paperfree Clustered environment with File Server Redundancy and High Availability
- PR60006_02: Phase III, Database Clustered environment (Cancelled per recommendations at 04/02/2008 TDTWG)

Phase II Status:
- 02/10/2007 – Implemented Veritas clustered solution; resulted in rollback due to unsuccessful failover.
- 03/08/2008 – Implemented Polyserve clustered solution; resulted in rollback due to performance and stability issues (this would have delivered Redundancy and Failover).
- 05/07/2008 – Seeking recommendations from TDTWG for Next Steps.

3 PR60006_01 Phase 2 ERCOT Update – Next Steps

Recommendations from HP for Performance improvement will require Architectural changes, server rebuilds, and testing.

ERCOT Recommends pursuing one of the following Options:
1) Place project “On Hold” due to the following (preferred):
   - Stabilization of SAN Switch Replacement Project (Polyserve known issue with loss of connectivity to SAN)
   - Test Environment Lock down until December 2008 due to Ts and Cs, MarkeTrak, and Nodal
   - Resource constraints due to Ts and Cs, MarkeTrak, and Nodal
   - Eliminate additional Finance charges by placing project on Hold
   - Allow to move forward in 2009 with implementation that will deliver Failover capabilities (High Availability and Redundancy Goal of SCR)
2) Close project and complete effort as O & M:
   - Additional funding will be required for remaining efforts
   - Total Project estimated at $1M approved by Board in 2005
   - Committed approximately $885K; will require Board approval for additional funding

4 PR60006_01 Phase 2 ERCOT Update – Outages

Retail Transaction Processing Unplanned Outages by # of Incidents
[Table. Columns: NAESB, Seebeyond / TIBCO, Paperfree, Siebel, TML, Retail Databases, Total. Values not present in the transcript.]
* Based on IT Incident Report on 04/02/2008 and Metrics in SCR745

5 PR60006_01 Phase 2 ERCOT Update – Outages

Retail Transaction Processing Unplanned Outages by Approx. # of Minutes
[Table. Columns: NAESB, Seebeyond / TIBCO, Paperfree, Siebel, TML, Retail Databases. Values not present in the transcript.]
Based on IT Incident Report and SCR Metrics

6 PR60006_01 Phase 2 ERCOT Update – PF Outage Details (3yrs)

PaperFree Availability Metrics Prior to March 2008 as a result of 2007 Intermediate Resolutions
- Previous Logged incident for PaperFree file server – 02/2007.
- Until March 2008 – Paperfree Application was 100% available due to intermediate solutions (meeting SCR Goal for reliability).

Issue Date | Duration (mins) | SLA Impacted / Application Impacted | Issue Description | Root Cause | Service Impact | Service Impact Detail
-----------|-----------------|-------------------------------------|-------------------|------------|----------------|----------------------
9/25/06 | 829 | Retail / Paperfree | Paperfree File Server not responding | Infrastructure | Outage | Unplanned Outage
10/2/06 | 18 | Retail / Paperfree | Paperfree File Server network outage | Infrastructure | Outage | Unplanned Outage
1/3/07 | 130 | Retail / Paperfree | Memory failure in the clustered environment | Infrastructure | Outage | Unplanned Outage
1/5/07 | 270 | Retail / Paperfree | Problem pulling data from NAESB | Infrastructure | Outage | Unplanned Outage
1/8/07 | 195 | Retail / Paperfree | Attempted to replace the Paperfree architecture as identified by the on-going Paperfree issues analysis | Infrastructure | Outage | Unplanned Outage
2/7/07 | 85 | Retail / Paperfree | Connectivity issue between application and SAN | Infrastructure | Outage | Unplanned Outage
3/19/08 | 147 | Retail / Paperfree | SAN Hardware failure | Infrastructure | Outage | Unplanned Outage
3/20/08 | 105 | Retail Market | Degradation Issues Post SCR745 Phase 2 solution | Polyserve Application/PF | Outage | Unplanned Outage
3/22/08 | 240 | Retail Market | Rollback from SCR745 Phase 2 implementation | Polyserve Application/PF | Outage | Unplanned Outage
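As a rough cross-check of the PaperFree figures on this slide, the sketch below tallies the logged outage minutes per calendar year and finds the longest incident-free stretch. The dates and durations are copied from the table above; the names `INCIDENTS`, `minutes_by_year`, and `longest_gap_days` are illustrative only and are not part of any ERCOT reporting tool.

```python
from collections import defaultdict
from datetime import date

# (issue date, duration in minutes), copied from the PaperFree outage table
INCIDENTS = [
    (date(2006, 9, 25), 829),
    (date(2006, 10, 2), 18),
    (date(2007, 1, 3), 130),
    (date(2007, 1, 5), 270),
    (date(2007, 1, 8), 195),
    (date(2007, 2, 7), 85),
    (date(2008, 3, 19), 147),
    (date(2008, 3, 20), 105),
    (date(2008, 3, 22), 240),
]


def minutes_by_year(incidents):
    """Sum unplanned outage minutes per calendar year."""
    totals = defaultdict(int)
    for day, minutes in incidents:
        totals[day.year] += minutes
    return dict(totals)


def longest_gap_days(incidents):
    """Longest number of days between two consecutive logged incidents."""
    days = sorted(day for day, _ in incidents)
    return max((later - earlier).days for earlier, later in zip(days, days[1:]))


if __name__ == "__main__":
    print(minutes_by_year(INCIDENTS))
    print(sum(minutes for _, minutes in INCIDENTS))
    print(longest_gap_days(INCIDENTS))
```

Under these inputs, the longest gap falls between the 2/7/07 and 3/19/08 incidents, which is the period the slide describes as 100% available.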

7 PR60006_01 Phase 2 ERCOT Update – TDTWG Recommendations Discussion