Content Analytics for Legacy Data Retention

Slides:



Advertisements
Similar presentations
Business Development Suit Presented by Thomas Mathews.
Advertisements

Chapter 1 Business Driven Technology
Presentation by Priyanka Sawarkar
© 2007 IBM Corporation Enterprise Content Management Integrating Content, Process, and Connectivity for Competitive Advantage Malcolm Holden October 2007.
Internal Control–Integrated Framework
© 2007 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice HP TRIM HP Information Management.
© Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. trans for ma tion : a.
Misys Treasury & Capital Markets
SAP BI ConnectorDuet Enterprise for Microsoft SharePoint and SAP SAP NetWeaver Gateway productivity accelerator for Microsoft Synch enterprise data.
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Developing a Records & Information Retention & Disposition Program:
1 © Copyright 2008 EMC Corporation. All rights reserved. Litigation Response Planning: eDiscovery Best Practices Stephen O’Leary Sr. eDiscovery and Compliance.
Integration and Insight Aren’t Simple Enough Laura Haas IBM Distinguished Engineer Director, Computer Science Almaden Research Center.
Enterprise Resource Planning ERP Systems
Chapter 4: Database Management. Databases Before the Use of Computers Data kept in books, ledgers, card files, folders, and file cabinets Long response.
Agenda Symantec Enterprise Vault 1 Today’s Management Challenges 1 Why Management? 2 The Solution: Symantec Enterprise Vault 3 Benefits & Closing.
Libraries and Institutional Content Management Systems
Microsoft Office Sharepoint Server 2007 (MOSS) Overview Momentum Microsoft November 15, 2007.
Best Practices Using Enterprise Search Technology Aurelien Dubot Consultant – Media and Entertainment, Fast Search & Transfer (FAST) British Computer Society.
Introduction to Content Analytics Ömer Sever IBM SWG Enterprise Content Mangaement.
Databases & Data Warehouses Chapter 3 Database Processing.
Getting Smarter with Information An Information Agenda Approach
By N.Gopinath AP/CSE. Why a Data Warehouse Application – Business Perspectives  There are several reasons why organizations consider Data Warehousing.
© 2011 IBM Corporation Smarter Software for a Smarter Planet The Capabilities of IBM Software Borislav Borissov SWG Manager, IBM.
Efficient BI Solution Presented by: Leo Khaskin, PowerCubes Lab Value of Information as Business Asset.
management “A Gen-i Perspective” Paul McTaggart - ECM Practice Manager.
1 EDMS 101 Speaker: Monica Crocker, DHS EDMS Coordinator Overview of current project(s) Objective of this section: This session outlines EDMS fundamentals.
1.Knowledge management 2.Online analytical processing 3. 4.Supply chain management 5.Data mining Which of the following is not a major application.
Navigation Customer Retention Customer Experience Customer Spending Customer retention rate Household retention rate Average customer tenure Satisfaction.
5.1 © 2007 by Prentice Hall 5 Chapter Foundations of Business Intelligence: Databases and Information Management.
Database Design - Lecture 1
Advance Case Management Brijesh Chopra, Industry Solutions Software IBM India, SA.
© 2008 IBM Corporation ® 1 ECM Product Vision & Strategy Ken Bisconti Vice President, ECM Products and Strategy IBM Software Group February 2009.
Chapter © 2012 Pearson Education, Inc. Publishing as Prentice Hall.
April, 2008 Better Together! Integrated GP & CRM AN INDEPENDENT MEMBER OF BAKER TILLY INTERNATIONAL 505 AFFILIATE OFFICES WORLDWIDE.
Presentation Path  Introduction to Ved Consultancy and OpenText  Current Challenges  The Valued Customers and Sectors  Our Solutions  Demo. Together,
Enterprise Resource Planning ERP Systems
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
Dan Grady The search for the killer productivity application is over… Copyright 2009, Information Builders. Slide 1.
Introduction – Addressing Business Challenges Microsoft® Business Intelligence Solutions.
© 2005 Princeton Softech, Inc. Princeton Softech Anatomy of an Archive Project Let’s Talk About Data!! April 18, 2007 Alan Schneider.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
© 2007 IBM Corporation IBM Information Management Accelerate information on demand with dynamic warehousing April 2007.
© 2010 IBM Corporation Business Analytics software Business Analytics Editable Text Editable Text Editable Text.
Project Portfolio Management Business Priorities Presentation.
Business Productivity Infrastructure Optimization Campaign 1 Agenda: BPIO Partner Sales Readiness Workshop Day 3: Topic: Enterprise Content management.
Enterprise Content Management
ControlPoint The Eleventh Hour Presentation and ControlPoint Demonstration Abdullah Noman October, 2015.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
Copyright © 2013 Avaali. All Rights Reserved. 1 SAP OpenText ECM Solutions: Travel Receipts Management.
BUSINESS ANALYTICS OVERVIEW. Data Mining  Data mining uses advanced statistical techniques and mathematical algorithms to analyze usually very large.
Database Principles: Fundamentals of Design, Implementation, and Management Chapter 1 The Database Approach.
Data Minimization Framework
CIM Modeling for E&U - (Short Version)
92% of the world’s data was created in the past 2 years
IBM Tivoli Web Site Analyzer Training Document
Brandon Botes #SPSJHB Records Management – Friend or Foe ???
Chapter 13 The Data Warehouse
Navision Business Analytics
IBM Content and Predictive Analytics for Healthcare How it works
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Vertex & EnterpriseOne
ECM for Legal & Law.
Archiving Legal eDiscovery Lotus Notes Domino
Compliant Information Management and the eDiscovery Challenge
Brandon Botes #SPSJHB Records Management – Friend or Foe ???
About Thetus Thetus develops knowledge discovery and modeling infrastructure software for customers who: Have high value data that does not neatly fit.
Presentation transcript:

Content Analytics for Legacy Data Retention The Dayhuff Group has a long history of providing enterprise content management solutions in a wide variety of industries. In business since 1997 IBM Premier Business Partner Software ValueNet Partner Over 180 projects at 80 companies 96% customer satisfaction rating Content Analytics for Legacy Data Retention The Dayhuff Group

The information lifecycle governance problem 98% $3M 17% Companies that cite defensible disposal as key result of governance programs Average cost to collect, cull and review information per legal case1 Amount of IT budget spent on storage3 22% 70% 44x Companies that can defensibly dispose today Portion of information unnecessarily retained2 Projected information growth, 2009-20204

Watson and IBM ECM Today Natural Language Processing (NLP) is the cornerstone to translate interactions between computers and human (natural) languages Watson uses IBM Content Analytics to perform critical NLP functions Unstructured Information Management Architecture (UIMA) is an open framework for processing text and building analytic solutions Several IBM ECM products leverage UIMA text analytics processing: IBM Content Analytics OmniFind Enterprise Edition IBM Classification Module IBM eDiscovery Analyzer

Going from raw information to rapid insight Uncover business insight through unique visual-based approach Aggregate and extract from multiple sources Organize, analyze and visualize Search and explore to derive insight … to form large text-based collections from multiple internal and external sources (and types), including ECM repositories, structured data, social media and more. … enterprise content (and data) by identifying trends, patterns, correlations, anomalies and business context from collections. … from collections to confirm what is suspected or uncover something new without being forced to build models or deploy complex systems.

Why Content Analytics? #1 problem all accounts have, “don’t know what content they have” #2 problem “Uncontrollable Storage Cost” Want to discover the value that may exist in their existing information / content resources Want to know who, where, when & how to leverage their information / content assets Believe they too can demonstrate a three month return on investment Non-threatening to IT Low Cost investment Content Analytics “Try & Buy”

LEVEL 5 (Transformational) Content Analytics Necessary Information Unnecessary Content In The Wild Content Analytics Dynamically analyze to know what you have Aggregate, correlate, visualize and explore your enterprise information to make rapid decisions about business value, relevance and disposition. Decommission what’s unnecessary Cut costs and reduce risk by eliminating obsolete, over-retained, duplicate, and irrelevant content – and the infrastructure that supports it. Preserve and exploit the content that matters Collect valued content to manage, trust and govern throughout its lifespan … policies that use rule-based metadata, advanced contextual classification, and advanced content analytics Slide 5: Charts and Graphs Use this slide when displaying charts or graphs that fill the slide. LEVEL 5 (Transformational) 6

How to decommission and preserve content Identify Content Sources to be assessed IT Initial Assessment to decommission irrelevant content LOB Specific Assessments to decommission over-retained and obsolete content … and to collect and classify valued and obligated content. System and Application decommissioning by IT Periodic audits by IT and LOB keep content environments optimized. 5 Periodic Audit 1 2 3 4 Identify Content Sources Initial Assessment Specific Assessments System & Application Decommissioning Content Collection Content Collection

Content Collection w/ Content Classification Content Analytics and RIM Step 1: Identify content sources Step 2: Exploration Examine sources and analyze content Records manager uses interface to explore & identify value-based content categories Define policies expressing required actions (delete, move, copy, ...) based on categories Step 3: Archival and Management of content IT manager encodes policies content collection mechanism Content identified by exploration process is collected Content collection is executed in an ongoing basis, as prescribed by policies Supports Selective Content Decommissioning: Operate on a subset of content in the original source Identify & extract records across the enterprise 1 Legacy ECM File SharePoint File 2 Content Exploration Content Collection w/ Content Classification 3 Policies Slide 5: Charts and Graphs Use this slide when displaying charts or graphs that fill the slide. Trusted ECM 8

Step 1: Identify content sources in the wild Content Analytics and Dynamic ESI Collection 1 Step 1: Identify content sources in the wild Step 2: Exploration Examines sources and analyzes content IT or Legal user explores & identifies content relevant to case User determines content to be collected into a case set and invokes collection process Collection tool (embedded ICC) copies identified content into ECM evidence repository Step 3: eDiscovery Cull, hold, audit-track, export ESI Analytics-driven Early Case Assessment across all relevant evidentiary ESI Content Analytics Multi-case Evidentiary ECM based Dynamic on demand reactive collection requests Policy based collections 2 Content Collection Slide 5: Charts and Graphs Use this slide when displaying charts or graphs that fill the slide. 3 eDiscovery Tools 9

Unlock valuable insight from content What our clients are doing with Content Analytics

Basic questions to consider regarding content Know your content Accelerate time to knowledge by providing greater accuracy and more complete business context with enterprise search Dynamically analyze what you have, decommission the unnecessary, and preserve the content that matters with content assessment Do we know what we have and can we find it? Is it properly managed and can we trust it? What does it all mean and how can we benefit from it? Trust your content Manage and govern content in trusted repositories, not in suspect environments, enabling confidence in your content Create and manage 360 degree trusted content views to enrich master data by connecting to enterprise content Leverage and Exploit your content Interactively discover content to derive unexpected business insights and take action with content analytics Exploit content analytics insights by enriching BI and predictive analytics as well as tailoring for industry and customer specific scenarios

Content Analytics for Legacy Data Retention Addresses Two Objectives: Securely Retain Records Requiring Retention Defensibly Decommission duplicate, non-business and information which has satisfied its retention requirements One LARGE ROI: Storage Cost Savings ROI in less than 3 months $44m in Storage Savings for one client And Delivers

How CA for Legacy Data Retention Delivers ... ...I need to dynamically collect electronically stored information (ESI) by knowing what I have, sorting out the case relevant information, declare a records and bring under hold management or decommission as necessary Content Assessment enables content-based decision making for: Decommissioning for cost savings – selected content or entire sources Dynamic collection Records Management for eDiscovery Ongoing proactive information governance Improving metadata & content organization Reduce Storage Cost

Content Analytics Admin Content Sources File Systems Content Repositories Databases Email Collaboration Web Content Web Pages Portals Content Integration (custom crawlers) ...

Content Analytics Admin Parse and Index Content Linguistic understanding of your content Industry and business specific dictionaries Understanding of Named Entities People, Places, Companies Integration with Classification Module Deep concept analysis Annotators specific to industry, business and specific uses Record Types Industry and company specific concepts Business specific concepts such as Employee Names , Products, etc. . . .

Content Analytics for Legacy Data Retention - How it works Analyzed Content (and Data) Financial Record to be Declared Extracted Concept Trade Action Day Reason Adjective Verb Noun Prep Phrase stocks rose Monday on comments from ... Source Information Internal (ECM, Files, DBMS, etc.) and External (Social, News, etc.) Automatic Visualization for Interactive Exploration and Assessment

Content Analytics for Defensible Decomissioning – How it works Analyzed Content (and Data) Non-record Decomissionable Extracted Concept Element of a plant Vegetation Reason Action Noun Noun Verb Noun stocks on the plants require trimming ... Source Information Internal (ECM, Files, DBMS, etc.) and External (Social, News, etc.) Automatic Visualization for Interactive Exploration and Assessment

Classification Process for Legacy Data Retention Analyze Collect information & context needed to make an inform a decision (declare vs. decommission) Decide Assess the collected information and select a category (declare vs. decommission), accurately & repeatably Take Action Use the selected category to determine & initiate an appropriate response (declare vs. decommission) Ensure actions are taken consistently & correctly, creating defensible process Enforce

Content Analytics for Legacy Data Retention Solution Overview IBM ECM 4 Enterprise Records 1 2 Content Collectors Content Analytics Electronic Discovery Classification Module 5 3

Demonstration of Content Analytics for Legacy Data Retention Content Analytics for Legacy Data Retention is a solution to inventory and locate legacy data requiring retention or disposition The Dayhuff Group has created this solution using IBM Content Analytics tools to perform the heavy lifting of mining legacy information It allows data retention analysts to view and analyze their source content based on familiar concepts such as how the content fits into records series’ within their file plan

Facets – Record Types This view exposes facets which categorize the content based on the business’ record types within record series’

Select documents to Decommission or Declare The analyst can graphically see content that is past it’s retention period available for decommissioning

Flag as Past Retention period Documents are flagged based on their retention requirements identified using Content Analytics and the Record Type facets, then exported to Content Collector

decommissioned based on analysis results in Content Analytics Collect to IER Content Collector Documents are decommissioned based on analysis results in Content Analytics or archived to Enterprise Records Decommission Unnecessary Content ECM Records Retention Programs & Policies

IBM Enterprise Records Finance - 2000 accounts payable treasury Budget & forecast payroll Documents are declared as records in Enterprise Records matching Record Type Facets analyzed in Content Analytics cash management Financial reporting tax accounts receivable customs general accounting insurance Cost accounting Sales & Marketing - 7000 sales marketing communication dealer support product management market research Service - 4000 warranty administration customer service quality assurance Legal - 5000

Content Analytics for Legacy Data Retention Benefits: Reduced risk Increased productivity Quantifiable and measurable results Reduced cost Defensible process for evaluation Reduction of information through disposition Reduction of duplicated and old information