EarthLink and Micromuse: Growing up Together Doug McClure EarthLink Operations Sr. Manager, Fault and Performance Mgmt June 3, 2004.

Presentation transcript:

EarthLink and Micromuse: Growing up Together Doug McClure EarthLink Operations Sr. Manager, Fault and Performance Mgmt June 3, 2004

Fault & Performance Mgmt 1 Overview
One of the Nation's Largest ISPs
  Headquarters in Atlanta, GA
    –Key facilities in Dallas, TX, Pasadena and San Jose, CA, Knoxville, TN and Seattle, WA
  Profitable, strong balance sheet
  Largest DSL footprint
  First-to-market with products that provide the best possible Internet experience
  Customer Advocacy: Fighting SPAM with technical solutions, litigation, legislative support, industry collaboration and consumer education
    –Howard Carmack, aka the "Buffalo Spammer," was sentenced to 3-1/2 to seven years in prison on May 27th after EarthLink received a $16.4M civil judgment in May
  th Anniversary ( ) –

Fault & Performance Mgmt 2 Overview
5.25M Customers
  ~4M Dialup (Premium ~3.5M, Value ~500K)
  ~1.2M Broadband (Cable, xDSL)
  ~160K Web Hosting (Unix, Windows)
  ~50K Wireless (Blackberry, PDA, Laptops, Wi-Fi)
Dial Access Coverage
  > 90% of US Population
  ~16K Local Dial Access Numbers
  ~500K Active Modem Ports (~50% ELNK, ~50% Outsourced)
  ~400 PoPs (18 Core Backbone PoPs, four data centers)
Broadband Coverage
  ~200 Markets with Broadband Offerings
Large and Diverse Infrastructure
  2300 Network Elements
  1500 Server Elements
  Thousands of Access Circuits, Hundreds of WAN Circuits

Fault & Performance Mgmt 3 Overview
Access Technology Innovation
  Premium and Value Dial-up
  Broadband (Cable, xDSL, Satellite)
  Voice (Converged Devices, VoIP)
  Wireless (WiFi, CDMA, Blackberry, PDA)
  Broadband over Power Lines (BPL)
Value Added Service and Product Innovation
  Blocker Family: spamBlocker, POP-UP Blocker, ScamBlocker, Virus Blocker, Spyware Blocker
  Parental Controls
  Webmail
  Web Accelerator

Fault & Performance Mgmt 4 Overview
Exceptional Customer Service
  2003 PC Magazine Readers' Choice Awards for both high-speed and dial-up services
  2003 highest ranking in customer satisfaction for the second year in a row for high-speed Internet service by J.D. Power and Associates in its Internet Service Provider Residential Customer Satisfaction Study (SM)
  2003 CNET Editors' Choice award

Fault & Performance Mgmt 5 Innovation = Constant Change
Drivers
  Speed to Market, Competition – Do more, faster
  Quality, Performance, Support Costs
  Compliance - Sarbanes-Oxley
Operational Challenges
  Release Management
  Change Management
  Service Level Management

Fault & Performance Mgmt 6 Operations Maturity: Growing Up
Production Improvement Program (PIP)
  Foundation in IT Service Management, ITIL, CobIT
  Focusing on four main areas: Service Level Mgmt, Change Mgmt, Release Mgmt, and Production Security
    –Over 10% of Operations staff have now attended ITIL Foundation Training
      1 Master Level Certified (more planned)
      9 Practitioner Level Trained in CCR Quadrant (pending certification results)
      114 Foundation Level Trained (most pending certification results)

Fault & Performance Mgmt 7 Operations Maturity: Growing Up
Service Level Management
  NOC, Help Desk
  Set and manage expectations internal/external to Operations
Change Management
  Provide oversight and control of the production environment
  Minimize risk and impact from change activities
Release Management
  Development Operations
  Minimize poor quality production releases
Enterprise Security
  Compliance, control, audit

Fault & Performance Mgmt 8 EarthLink and Micromuse Facts
Very Early Netcool Adopter
  EarthLink (Mindspring) was Micromuse's first US customer
    –Began evaluating Micromuse Netcool in 1996, official customer April 1997
Early Innovation
  Early joint innovation and development helped build foundation for many of Micromuse's key products
    –EarthLink and Micromuse are revitalizing joint development projects with emerging service and business activity monitoring products
Driving 3rd Party Vendor Integration & Partnerships
  EarthLink requires detailed integration with Micromuse suite – much more than just sending SNMP TRAPs
    –Quest Software, Compuware, PeopleSoft, Remedy, Cisco Systems, Arbor Networks
Current Deployment
  Netcool OMNIbus, Internet Service Monitors, Desktop Clients, Webtop, Impact, numerous Gateways, Probes, Data Source Adaptors
    –Two Senior System Engineers, Three System Engineers, Two System Analysts devoted to Fault and Performance Management (Netcool + Other)
  Services provided for NOC (3 shifts, 6 per shift), Systems Administration (3 shifts, 10 per shift), Network Engineering

Fault & Performance Mgmt 9 Moving Beyond MoM and Apple Pie
EarthLink's Early Micromuse Netcool Deployment
  Focused on Netcool as the "Manager of Managers" or MoM
  Needed during EarthLink's rapid growth and expansion
  Enabled event management, eliminated "swivel chair" NOC
"Apple Pie" is Event Correlation and Deduplication
  The Netcool "sweet spot" was providing EarthLink with event correlation and deduplication
    –Able to reduce the event stream from 100,000s to 1,000s per week
    –Further reduction expected to 100s per week through use of advanced Netcool/Impact policies and deployment of Netcool/Precision
  Enables NOC and support staffs to operate efficiently
Focus now on End-to-End Service Management
  Netcool Suite allows EarthLink to manage entire service
    –Understand service relationships, service levels, perform service modeling and service discovery
  Enables impact assessment, prioritization, understanding service delivery chain
  Eliminates "needle in the haystack" approach of event management
    –"This is the problem that needs attention now" (compared to "I think this is the event causing problems")
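
The deduplication idea on this slide can be sketched in a few lines of Python. This is a minimal illustration and not Netcool code: the `Event` fields, the `identifier` key format (node plus alarm type), and the sample events are all assumptions made for the example. Repeated events that share an identifier collapse into one row whose tally counts the repeats.

```python
from dataclasses import dataclass


@dataclass
class Event:
    identifier: str   # hypothetical dedup key, e.g. "node:alarmType"
    summary: str
    first_seen: int   # timestamp of the earliest occurrence
    last_seen: int    # timestamp of the latest occurrence
    tally: int = 1    # how many raw events were folded into this row


def deduplicate(raw_events):
    """Collapse raw (identifier, summary, timestamp) tuples into one row per identifier."""
    table = {}
    for identifier, summary, timestamp in raw_events:
        row = table.get(identifier)
        if row is None:
            table[identifier] = Event(identifier, summary, timestamp, timestamp)
        else:
            row.tally += 1
            row.first_seen = min(row.first_seen, timestamp)
            row.last_seen = max(row.last_seen, timestamp)
            row.summary = summary  # keep the most recently received text
    return list(table.values())


# Hypothetical sample stream: four raw events, two distinct problems.
raw = [
    ("router1:linkDown", "Link down on router1", 100),
    ("router1:linkDown", "Link down on router1", 160),
    ("mail3:diskFull", "Disk full on mail3", 170),
    ("router1:linkDown", "Link down on router1", 220),
]
rows = deduplicate(raw)
for row in rows:
    print(row.identifier, row.tally, row.first_seen, row.last_seen)
```

The operator then sees one row per distinct problem instead of one row per raw event, which is where the reduction in event volume comes from.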

Fault & Performance Mgmt 10 Service Management Complexity
[Diagram labels: Good Customer Experience? Performance? Infrastructure Events to Netcool]
Source: EarthLink Product Group

Fault & Performance Mgmt 11 Service Management Complexity
[Diagram labels: Number of Components; Time (24x7x365); System Changes; Infrastructure Events; System Agents; SNMP; ISM; ObjectServer; Impact; Precision (future); RAD (future)]
Identify key service elements
Instrument those elements
Consolidate & analyze data
Develop service model and SLAs
Dealing with EarthLink Service Complexity:
  The complexity and amount of data generated from end-to-end service management are enormous
  Networks, Firewalls, Servers, Applications, Switches, Routers, Load Balancers, Databases, etc.
  Netcool/ObjectServer is a must have for EarthLink to effectively manage and understand EarthLink's service event stream from end-to-end
  Impact 3.0's cluster capability will enable EarthLink to analyze, enrich, suppress, and manage event stream regardless of our growth
Source: EarthLink Product Group

Fault & Performance Mgmt 12 The Customer IS Important
Customer Experience Monitoring and Management
  The Micromuse Netcool Suite enables proactive, real-time monitoring of the customer's experience for core EarthLink services
    –Over 14K Internet Service Monitor (ISM) instances in operation covering all key services (HTTP, HTTPS, SMTP, POP3, IMAP) and dedicated customers (ICMP)
  Allows for customer experience monitoring information to be correlated, analyzed, and presented in real-time
    –Micromuse Netcool/ISMs, Keynote, Compuware Client Vantage, Quest Foglight
    –External/Internal Synthetic testing, system & network element monitoring, system and network port monitoring
  Immediate notification to support groups when the customer's experience degrades
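
A rough sketch of what a synthetic service check does, in Python. It is not how the Netcool ISMs are implemented: a real monitor performs protocol-level transactions (HTTP, SMTP, POP3 and so on), while this example only times a TCP connect. The function names and the warning/critical thresholds are assumptions chosen for illustration.

```python
import socket
import time


def probe_tcp(host, port, timeout=3.0):
    """Time a TCP connect to host:port; return (reachable, seconds_elapsed)."""
    started = time.monotonic()
    try:
        with socket.create_connection((host, port), timeout=timeout):
            reachable = True
    except OSError:
        reachable = False
    return reachable, time.monotonic() - started


def classify(reachable, elapsed, warn_after=0.5, critical_after=2.0):
    """Map one probe result onto a coarse service state (thresholds are illustrative)."""
    if not reachable or elapsed >= critical_after:
        return "critical"
    if elapsed >= warn_after:
        return "warning"
    return "ok"


# Classify some hypothetical measurements; no network access is needed for this part.
samples = [(True, 0.12), (True, 0.80), (False, 3.00)]
states = [classify(ok, secs) for ok, secs in samples]
print(states)
```

In use, `classify(*probe_tcp(host, port))` would run on a schedule for each monitored service, and any state other than "ok" would be forwarded as an event to the support group.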

Fault & Performance Mgmt 13 The Business IS Important
Business Activity Monitoring and Management
  Expands IT Operations visibility vertically and horizontally
  Ties IT Operations data and Business data together
    –System Downtime vs. Contact Center Call Volume
    –Real-Time Customer Subscriptions vs. Sales Forecasts
  Enables Real Time Monitoring and Management of Business and IT processes
    –Change and Downtime Management
    –Customer Registration Management

Fault & Performance Mgmt 14 Production Improvement Program
[Process flow diagram labels: Release Mgt (Release Planning; Dev / Procurement; Release Design, Build; Release Acceptance; Roll-out Planning; Comm, Prep, Training; Distribution/Installation); Prod Sec (Policy, Procedures, Standards & Guidelines; Security Consulting; Security Assessment; Security Monitoring; Security Test & Sign off); Change Mgt (REQUEST FOR CHANGE (RFC); Corp Project; Ops Project; Non-Project; STATUS CHANGE (1) Prioritization, Risk Assessment and Forward Schedule of Change; STATUS CHANGE (2) Change Approval and Proj. Service Availability; STATUS CHANGE (3) Final Change Approval and Implementation; STATUS CHANGE (4) Review Changes; CLOSED RFC); Metrics & Reporting]
Mutual Benefit from EarthLink's Innovation and Advanced Use of Micromuse Products
Micromuse OMNIbus, Impact, Webtop, RAD, NFSM
Source: EarthLink SLM Group

Fault & Performance Mgmt 15 Business Activity Monitoring Managing the Impact of Change and Downtime Activities on the Business and Operations

Fault & Performance Mgmt 16 Overview
Drivers
  Adoption of ITIL/COBIT Best Practices for Change Management
    –Production Improvement Program (PIP), SOX Compliance, etc.
    –Significant change for many groups – Fear, Uncertainty, Doubt (FUD)
  No Real-Time Visibility into Change/Downtime Management Activities
    –Business Process – Who, What, When, Where, Why, and How, Cost, Risk, and Impact
    –Workflow – Monitor Lifecycle, SLAs, Bottlenecks – Is the process enabling Operations or is it a bottleneck?
    –Impact on Infrastructure – False Positives, Contact Center Call Volume (COGS)
  Drive out False Positives from Production Monitoring Systems
    –Huge burden on NOC and other support staff
  Desire to have Automated Remedy Trouble Ticket Creation
    –Reduce time to address problems, reduce MTTR

Fault & Performance Mgmt 17 Overview
Solution
  Provide Real-Time Visibility into Change/Downtime Process
    –"There are 12 pending and 24 scheduled change requests for tonight, 6 are underway and 8 start in 15 minutes or less"
  Create Actionable Information
    –"Dept. 828 has five outstanding major change requests, attention is needed"
  Ensure Business Rules are Guiding/Enabling the Process – Not Hindering It
    –Eliminate FUD
  Report (dashboards, reports) on Process and Impact
    –NOC and other support groups know what's happening during change and downtime windows
    –Management has oversight and visibility
    –Business understands impact of change and downtime activity
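
The two example statements on this slide are simple aggregations over change request records. The Python sketch below shows one way such a summary could be computed. The record layout (`dept`, `status`, `major`, `start`), the status names, and the sample data are hypothetical; the slides do not describe the actual RFC system's schema.

```python
from collections import Counter
from datetime import datetime, timedelta


def summarize(requests, now, soon=timedelta(minutes=15)):
    """Count change requests by status, plus scheduled ones starting within `soon`."""
    by_status = Counter(r["status"] for r in requests)
    starting_soon = sum(
        1 for r in requests
        if r["status"] == "scheduled" and now <= r["start"] <= now + soon
    )
    return {
        "pending": by_status["pending"],
        "scheduled": by_status["scheduled"],
        "underway": by_status["underway"],
        "starting_soon": starting_soon,
    }


def departments_needing_attention(requests, threshold=5):
    """Departments with at least `threshold` outstanding major change requests."""
    open_major = Counter(
        r["dept"] for r in requests
        if r["major"] and r["status"] != "closed"
    )
    return sorted(dept for dept, count in open_major.items() if count >= threshold)


# Hypothetical records: five pending major changes, two scheduled, one underway.
now = datetime(2004, 6, 3, 22, 0)
requests = (
    [{"dept": "828", "status": "pending", "major": True,
      "start": now + timedelta(hours=2)}] * 5
    + [{"dept": "410", "status": "scheduled", "major": False,
        "start": now + timedelta(minutes=10)}] * 2
    + [{"dept": "410", "status": "underway", "major": False,
        "start": now - timedelta(minutes=30)}]
)
summary = summarize(requests, now)
flagged = departments_needing_attention(requests)
print(summary)
print(flagged)
```

A dashboard would rerun these aggregations whenever a request changes state, so the counts stay current through the change window.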

Fault & Performance Mgmt 18 Implementation
Micromuse Netcool/OMNIbus
  –Custom integration with Request for Change (RFC) and Downtime Management System
  –ObjectServer flexibility allows for definition of important business and IT data in each event to capture
    Change/Downtime Status
    Service Impact, Business Impact, Customer Impact, SLA, Restoral Priority, Escalation Path, etc.
Micromuse Netcool/Impact 3.0
  –Impact policies build lists in real time for all nodes listed in change/downtime request
  –As change/downtime activity progresses through its lifecycle, the change/downtime Netcool event changes states
  –Change/Downtime event suppression policy updates all incoming events that match node list during the maintenance window with Suppression Status and Change/Downtime Reference Number
Micromuse Netcool/Webtop 1.2 – RAD 2.0
  –Process owner (Change/Downtime Management Group) dashboard for monitoring and managing the overall end-to-end process, workflow, and business impact
  –Business group dashboards for monitoring change/downtime activities within area of control (Network Engineering, MIS, etc.)
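
The suppression behaviour described for the Impact policies can be illustrated with a short Python sketch. It is an approximation of the logic only, not an Impact policy: the `ChangeWindow` type, the event dictionary keys, the node names, and the reference number format are all invented for the example. An incoming event is tagged as suppressed, with the change reference attached, when its node is on a change request's node list and it arrives inside that request's maintenance window.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ChangeWindow:
    reference: str    # hypothetical change/downtime reference number
    nodes: frozenset  # nodes listed in the change/downtime request
    start: int        # maintenance window start (epoch seconds)
    end: int          # maintenance window end (epoch seconds)


def apply_suppression(event, windows):
    """Return a copy of `event` tagged with suppression status and change reference."""
    tagged = dict(event, suppressed=False, change_ref=None)
    for window in windows:
        in_window = window.start <= event["time"] <= window.end
        if event["node"] in window.nodes and in_window:
            tagged["suppressed"] = True
            tagged["change_ref"] = window.reference
            break
    return tagged


# One hypothetical maintenance window covering two nodes.
windows = [ChangeWindow("RFC-1001", frozenset({"rtr-atl-01", "sw-atl-07"}), 100, 200)]
incoming = [
    {"node": "rtr-atl-01", "time": 130, "summary": "Link down"},  # listed node, in window
    {"node": "mail-03", "time": 130, "summary": "Disk full"},     # node not listed
    {"node": "rtr-atl-01", "time": 250, "summary": "Link down"},  # after the window
]
results = [apply_suppression(event, windows) for event in incoming]
for result in results:
    print(result["node"], result["time"], result["suppressed"], result["change_ref"])
```

Tagging rather than discarding keeps the suppressed events available for reporting, which matches the slide's description of updating events with a status and a reference number.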

Fault & Performance Mgmt 19 Webtop 1.2 Presentation

Fault & Performance Mgmt 20 RAD 2.0 Presentation

Fault & Performance Mgmt 21 Netcool Event Management
[Screenshot labels: Change/Downtime Request Events; Suppressed Change/Downtime Activity Events; Change / Downtime Status; Event Suppressed by Change / Downtime; Change / Downtime ID]

Fault & Performance Mgmt 22 Future Enhancements
Planned Netcool/Impact Policies
  COGS Impact
    –Assess support cost impact due to change and downtime activities within Operations and Customer Support in Real-Time
  Data Gap Management
    –A common question: "Why does my chart or graph have gaps?"
    –The solution: Annotate graphs, charts, portals, etc. with the reason for data gaps caused by planned change/downtime activities
    –How: Integrate change and downtime event information with all performance, utilization, and capacity monitoring solutions via Impact 3.0
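
The data gap idea can be sketched as two small steps: find stretches where samples are missing, then label each gap with any planned change window that overlaps it. The Python below is a minimal illustration under assumed inputs. The sample timestamps, the 60-second polling interval, and the window records are hypothetical, and it does not show the Impact integration itself.

```python
def find_gaps(timestamps, expected_interval):
    """Return (start, end) pairs where consecutive samples are further apart than expected."""
    gaps = []
    for previous, current in zip(timestamps, timestamps[1:]):
        if current - previous > expected_interval:
            gaps.append((previous, current))
    return gaps


def annotate_gaps(gaps, windows):
    """Attach a reason to each gap: an overlapping planned change, else 'unexplained'."""
    annotated = []
    for gap_start, gap_end in gaps:
        reason = "unexplained"
        for window in windows:
            # Two intervals overlap when each one starts before the other ends.
            if window["start"] < gap_end and window["end"] > gap_start:
                reason = "planned change " + window["reference"]
                break
        annotated.append((gap_start, gap_end, reason))
    return annotated


# Hypothetical samples polled every 60 seconds, with two missing stretches.
timestamps = [0, 60, 120, 480, 540, 900, 960]
windows = [{"reference": "RFC-1001", "start": 150, "end": 450}]
gaps = find_gaps(timestamps, expected_interval=60)
annotated = annotate_gaps(gaps, windows)
for entry in annotated:
    print(entry)
```

A charting layer could then draw the annotated gaps as shaded regions with the reason as a label, leaving unexplained gaps visible for follow-up.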

Fault & Performance Mgmt 23 Business Activity Monitoring EarthLink Customer Registration, Provisioning, and Fulfillment Dashboards

Fault & Performance Mgmt 24 RAD 2.0 Joint Development Business Activity Monitoring: Real-Time Customer Registration Dashboard

Fault & Performance Mgmt 25 RAD 2.0 Joint Development Business Activity Monitoring: Real-Time Customer Registration Dashboard

Fault & Performance Mgmt 26 Continuous Improvement
Building better Network and Systems Management
  Founded Atlanta Network and Systems Management Technical User Group (ANSMTUG) in January 2004 –
    –Metro-Atlanta Fortune 100, Service Providers, Enterprise, Media, and Emerging Technology Companies
      Bell South, The Home Depot, EarthLink, Southern Company, N2 Broadband, eDeltacom, Delta, CNN, Cingular, E*Trade, Knology Broadband, Cox Communications
  Customers helping Customers
    –Use Micromuse and other NSM products better
    –Collectively drive product requirements and features into Micromuse and other NSM vendors
  Special Interest Groups (SIG) Forming
    –Best practices for NSM using Micromuse Netcool Suite
    –Aligning NSM solutions to ITIL, MOF, CobIT, etc.

Fault & Performance Mgmt 27 Challenges facing Micromuse
Product Development, Focus, and Release Cycle
  –Business * Monitoring (BAM, BSM, BI, BTI, B-I-N-G-O)
  –Performance Monitoring & Management Solution
  –Features vs. New Product – Finding the Right Balance
  –Licensing – Needs Review and Simpler Approach
  –Support New Technologies Sooner Across Core Products
  –Uniform Release Cycle (core architecture components and capabilities)
Discovery, Root Cause Analysis (RCA), Next-Gen Polling
  –Emerging Competition
  –Service / Application Discovery & RCA
  –Universal Poller Concept
Out of the Box Functionality and Updates
  –Appearance of Requiring Too Much Customization
    Competition is focusing on this
    Many customers have product still on the shelf
  –Ease of Use
    More out of the box: templates, examples, plug and play, wizards
    Tools and Utilities section on Support website is a start
  –Improving Documentation

Fault & Performance Mgmt 28 Closing and Q&A
Closing
Q&A
Doug McClure
Sr. Manager, Fault and Performance Mgmt
EarthLink Operations
(W) (C)