0 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG.

Slides:



Advertisements
Similar presentations
Tivoli SANergy. SANs are Powerful, but... Most SANs today offer limited value One system, multiple storage devices Multiple systems, isolated zones of.
Advertisements

CLEARSPACE Digital Document Archiving system INTRODUCTION Digital Document Archiving is the process of capturing paper documents through scanning and.
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
Presentation by Priyanka Sawarkar
Driving change in information risk within the financial services industry Subtitle Date.
© Copyright 2012 STI INNSBRUCK Apache Lucene Ioan Toma based on slides from Aaron Bannert
QDV 7 Overview A powerful estimating tool designed to match up with your own specific methodologies.
Essbase Reporting Jim Kubik Senior Sales Consultant.
A Java Architecture for the Internet of Things Noel Poore, Architect Pete St. Pierre, Product Manager Java Platform Group, Internet of Things September.
11© 2011 Hitachi Data Systems. All rights reserved. HITACHI DATA DISCOVERY FOR MICROSOFT® SHAREPOINT ® SOLUTION SCALING YOUR SHAREPOINT ENVIRONMENT PRESENTER.
Web Services Presentation. Site Management Console (SMC)
ARCHIMÈDE Presented by Guy Teasdale Directeur, Services soutien et développement Bibliothèque de l’Université Laval CARL Workshop on Institutional Repositories.
Business Continuity and DR, A Practical Implementation Mich Talebzadeh, Consultant, Deutsche Bank
Information Retrieval in Practice
Chapter 3 Database Management
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Chapter 14 The Second Component: The Database.
With the Help of the Microsoft Azure Platform, Devbridge Group Provides Powerful, Flexible, and Scalable Responsive Web Solutions MICROSOFT AZURE ISV PROFILE:
Overview of Search Engines
Microsoft Office SharePoint Server Business Intelligence Tom Rizzo Director, Microsoft Office SharePoint Server
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 1 Preview of Oracle Database 12 c In-Memory Option Thomas Kyte
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization.
Collaboration and Content Customer solution case study The Yaroslavl region Government creates knowledge base of public authorities of the Yaroslavl region.
USING HADOOP & HBASE TO BUILD CONTENT RELEVANCE & PERSONALIZATION Tools to build your big data application Ameya Kanitkar.
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
1 Progress Software’s OpenEdge Platform Which database is right for your environment? Simon Epps.
Using the Powerful Microsoft Azure Platform, e-SUAP Properly and Securely Manages All Steps for Customizable Business Activities Permissions MICROSOFT.
MarkLogic Overview Clark D. Richey, Jr. – Technical Director,
Slide 1 Copyright © 2010 MarkLogic ® Corporation. All rights reserved. Introduction to MarkLogic 4.2 Kenneth Chestnut, Vice President of Product Marketing.
Master Thesis Defense Jan Fiedler 04/17/98
IMDGs An essential part of your architecture. About me
Designing and Deploying a Scalable EPM Solution Ken Toole Platform Test Manager MS Project Microsoft.
DATABASE MANAGEMENT SYSTEMS IN DATA INTENSIVE ENVIRONMENNTS Leon Guzenda Chief Technology Officer.
Open Search Office Web Services Database Doc Mgt Sys Pipeline Index Geospatial Analysis Text Search Faceting Caching Query parsing Clustering Synonyms.
(C) 2008 Clusterpoint(C) 2008 ClusterPoint Ltd. Empowering You to Manage and Drive Down Database Costs April 17, 2009 Gints Ernestsons, CEO © 2009 Clusterpoint.
MULTIMEDIA DATABASES -Define data -Define databases.
Middleware for FIs Apeego House 4B, Tardeo Rd. Mumbai Tel: Fax:
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
James Akrigg Microsoft Ltd Integrating InfoPath Forms Into Workflow Solutions And Business Processes.
10 1 Chapter 10 Distributed Database Management Systems Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel.
INNOV-10 Progress® Event Engine™ Technical Overview Prashant Thumma Principal Software Engineer.
Making Watson Fast Daniel Brown HON111. Need for Watson to be fast to play Jeopardy successfully – All computations have to be done in a few seconds –
Foundations of Information Systems in Business. System ® System  A system is an interrelated set of business procedures used within one business unit.
Hosting Websites and Web Applications with Microsoft ® SQL Server ® 2008.
Jorke Odolphi Product Technology Specialist WebCentral Using Microsoft Operations Manager To Monitor And Maintain Your Farm.
What we know or see What’s actually there Wikipedia : In information technology, big data is a collection of data sets so large and complex that it.
Extracting value from grey literature Processes and technologies for aggregating and analysing the hidden Big Data treasure of the organisations.
An Enterprise Clinical Data Search Solution. is Designed for: Informatics professionals, clinicians, statisticians, data managers and process/quality.
Data-Centric Security and User Access Controls for Hadoop on Microsoft Azure MICROSOFT AZURE APP BUILDER PROFILE: BLUETALON BlueTalon provides data-centric.
Big Data Analytics Are we at risk? Dr. Csilla Farkas Director Center for Information Assurance Engineering (CIAE) Department of Computer Science and Engineering.
uses of DB systems DB environment DB structure Codd’s rules current common RDBMs implementations.
VIEWS b.ppt-1 Managing Intelligent Decision Support Networks in Biosurveillance PHIN 2008, Session G1, August 27, 2008 Mohammad Hashemian, MS, Zaruhi.
MarkLogic The Only Enterprise NoSQL Database Presented by: Aashi Rastogi ( ) Sanket Patel ( )
Component 8/Unit 1bHealth IT Workforce Curriculum Version 1.0 Fall Installation and Maintenance of Health IT Systems Unit 1b Elements of a Typical.
Abstract MarkLogic Database – Only Enterprise NoSQL DB Aashi Rastogi, Sanket V. Patel Department of Computer Science University of Bridgeport, Bridgeport,
Ignite in Sberbank: In-Memory Data Fabric for Financial Services
Alfresco – Protecting Trafigura’s Corporate Assets Beverley Verster, Head of IT Back Office & Corporate Systems.
The Derivitec Risk Portal Provides Powerful, Cost-Effective Risk Management Solutions, Powered by Azure, that Deploy in Minutes MICROSOFT AZURE ISV PROFILE:
© 2009 Oracle Corporation – Proprietary and Confidential Agenda Reporting Overview Performance Workspace Dashboards Reports Drill thru Smartview Excel.
Information Retrieval in Practice
Univa Grid Engine Makes Work Management Automatic and Efficient, Accelerates Deployment of Cloud Services with Power of Microsoft Azure MICROSOFT AZURE.
2016 Citrix presentation.
Presentation Title.
One-Stop Shop Manages All Technical Vendor Data and Documentation and is Globally Deployed Using Microsoft Azure to Support Asset Owners/Operators MICROSOFT.
XtremeData on the Microsoft Azure Cloud Platform:
Quasardb Is a Fast, Reliable, and Highly Scalable Application Database, Built on Microsoft Azure and Designed Not to Buckle Under Demand MICROSOFT AZURE.
Taming the Big Data Fire Hose
Technical Capabilities
敦群數位科技有限公司(vanGene Digital Inc.) 游家德(Jade Yu.)
Presentation transcript:

0 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. Paul Preuveneers – Principal Technologist Lee Pollington – Principal Consultant The Only Operational Database Technology for Mission-Critical Big Data Applications

1 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. Agenda Big Data and MarkLogic What is MarkLogic? MarkLogic in Financial Services MarkLogic Integration Points (Connectors / Toolkits)

2 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. Volume Complexity Variability ValueVariety Petabyte / Exabyte Billions of items Social Media Machine data Data processes producing data 10Ks of transactions per second In & out Streams Bulk processing Patterns Inference Unstructured Disparate events Relationships Varied sources Varied data types Changing data types Value from decision support Value from operational efficiencies Velocity

3 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. Agenda Big Data and MarkLogic? What is MarkLogic? MarkLogic in Financial Services MarkLogic Integration Points (Connectors / Toolkits)

4 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. What is MarkLogic Server? Special Purpose DBMS for poly-structured information, with enterprise expectations ACID transactions Backup, Full/Partial Replication, Distributed Txns Search Engine Kernel, with enterprise expectations Full text Faceted navigation, at massive scale Boolean, proximity, stemming, tokenization, decompounding, case, diacritics, language… Application Server HTTP (including RESTful) XCC Java/.NET WebDAV

5 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. What makes MarkLogic DBMS Special? Not Relational (RDBMS) XML The Only Data Model Required Schema Agnostic Text a First-class Citizen among Data Types XQuery/XSLT Optimized Search Engine Algorithms Very Low DBA Overhead (0.5 FTE / 100 hosts) 5-Minute Install 5-Minute Scale-Out Database and Search Engine are the same

6 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. What makes MarkLogic Search Special? Transactional: Enterprise Scale (no index latency) Unicode (Internationalization) Multiple Query Types Analytics: Aggregation, Facets & Ranges, Co-occurrence, Geospatial Text Search: Boolean, Stemming, Word Lexicons, Dictionary & Thesauri Alerting: Profiles, Alerts, Filters, Tipping, Selectors, “Triggers” … Powerful Search Combination (e.g. Text + Analytics + Alerting) Processing Near the Data (fast search, low bandwidth) Database and Search Engine are the same

7 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. 123, 127, 129, 152, 344, , 125, 126, 129, 130, , 126, 130, 142, 143, , 130, 131, 135, 162, , 130, 167, 212, 219, Document References 126, 130, 167, … TermTerm List Range Indexes “accelerating” “creation” “content” “application” “agility” / product: MarkLogic Geospatial Search: Universal Index

8 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. MarkLogic Can Scale Scale Up: Typically 1 TB+ XML per Server Scale Out: Low Hundreds(++) of Servers in a Cluster Commodity Hardware 2-CPU x 6-core/hyperthreaded 32+ GB RAM 3x disk: local mount with failover OS Linux RHEL 5 Solaris 10 Windows 2003/8 (XP/Vista/7 for Dev)

9 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. E Host 1 partition 1 E Host 3 D Host 4 D Host 5 D Host 6 D Host k partition 2 partition 3 partition m E Host 2 partition 4 HA&DR AppServer Data Same Code- base Shared-Nothing Cluster

10 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. Agenda Big Data and MarkLogic What is MarkLogic? MarkLogic in Financial Services MarkLogic Integration Points (Connectors / Toolkits)

11 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. Financial Services Solutions Operational Data Store / Trade Store Highly Transactional ISDA Contract Analysis (Electronic & Paper) Document Analysis (e.g. Sales Process, Financial Directives) Situational Awareness Customer On-Boarding Content Aggregation & Discovery Research / Policy Authoring & Distribution Content Publishing

12 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. Operational Data Store / Trade Store - High Volume Trades (Derivatives, Equities, FX etc.) in siloes - Mostly represented in XML (e.g. FpML, FIXML) - Point-in-time queries (e.g. exposure by counterparty) - Risk Management (understand exposure, auditing) What is it? - High Performance with Native XML compared to RDBMS - We are a transactional DB (ACID + business continuity) - Less hardware required / commodity servers - No shredding of XML (lowers risk of corruption) - Can aggregate over multiple schemas - Easily accommodate new schemas, changes in schema Why are we good at it?

13 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. Operational Data Store / Trade Store Example: JP Morgan Chase ODS Live for 12+ months 2.25 million OTC Derivatives (450+ million documents) Strategic platform mandated for core transaction processing Short-listed for Best Investment Banking Initiative at The Banking Technology Awards 2011 Agile onboarding of new Derivatives products Huge reduction in time to process FO XML messages 20 Sybase systems replaced with 3-Node MarkLogic cluster

14 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. It's a Trade Processing Story Started with Derivatives Natural fit with documents Complex instruments, “low volume” instruments It’s a trade workflow engine Enterprise Service Bus / Component architecture New products Modifications to existing products Securities had a new challenge for us

15 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. ISDA Contract Analysis - Swaps / Derivatives Contracts - Risk Management (understand exposure) - Effect of Change (e.g. credit rating, termination events) What is it? - Contracts are combination data/text - Front-end solutions like Exari use Word for contract authoring but output structured XML - Good query functions for filtering and aggregation of exposure as well as other what-if scenarios Why are we good at it? - If in paper form, OCR and enrichment is required. This is hard, time-consuming and costly (up to $150 per doc for managed service) - Most contracts are in paper form (90+ percent) Where do we need help?

16 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. Document Analysis (e.g. Sales Process, Financial Directives) - Making sense of poly-structured data (avoid BIG fines) - Extracting patterns and trends (e.g. did we say the right thing to our customer at the right time? / PPI mis-selling) - Developing value calculations in hard-to-handle formats (i.e. aggregating and unlocking the calculations in Excel) What is it? - Good conversion tools for PDF, MS Office etc. - Great full-text search to analyse converted documents - Inclusion of external content where applicable (RSS, Social Media, Web Sites) - Group individual Excel spreadsheets for powerful analysis Why are we good at it? - Enrichment often requires substantial domain expertise Where do we need help?

17 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. Situational Awareness - Trading Decision Support - Amalgamation of internal/external poly-structured data - Heavy geospatial element - Analysis across datasets (vessels, pipes, weather, RSS) What is it? - Quick take-up of new sets of data - ML is good at geospatial queries - ML is good at incorporating external data (web, RSS etc.) Why are we good at it?

18 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. Situational Awareness

19 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. Customer On-Boarding - Content Aggregation from multiple CMS - KYC / Holistic view of customer (good communication) - Avoid duplication of effort (faster on-boarding) - Rapid search and retrieval What is it? - Feature-rich, fast search at volume - 30 Digits allows us to extract from multiple CMS - Flexible metadata-handling (dynamic facets) - Able to apply security model from underlying CMS Why are we good at it? - Lots of content is image-based / requires OCR and data enrichment Where do we need help?

20 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. Research / Policy Authoring & Distribution - Template-driven authoring - Ensuring consistency, validation and component re-use - Dynamic Publishing (VISA, Morgan Stanley, Citigroup) What is it? - Easy template creation and maintenance - Great integration with MS Office - Componentisation and versioning easy in ML - Dynamic assembly based on role/geography etc. Why are we good at it?

21 This document is CONFIDENTIAL and its circulation and use are RESTRICTED. © 2012 KPMG LLP, a UK limited liability partnership, is a subsidiary of KPMG Europe LLP and a member firm of the KPMG network of independent member firms affiliated with KPMG International, a Swiss cooperative. All rights reserved. Thank You – Questions?