CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware iCARE : A Framework for Big Data Based.

Slides:



Advertisements
Similar presentations
Performance Testing - Kanwalpreet Singh.
Advertisements

OLAP Services Business Intelligence Solutions. Agenda Definition of OLAP Types of OLAP Definition of Cube Definition of DMR Differences between Cube and.
Advance Analytics Capabilities
Week 9 Data Mining System (Knowledge Data Discovery)
GROUP 1 : DATO’ NABIL ABD KADIR SAYNUL ISLAM MOHAMMAD GHAZALI MOHD DAUD.
Fraud Detection in Banking using Big Data By Madhu Malapaka For ISACA, Hyderabad Chapter Date: 14 th Dec 2014 Wilshire Software.
Introduction to Building a BI Solution 권오주 OLAPForum
TOPIC 1: GAINING COMPETITIVE ADVANTAGE WITH IT (CONTINUE) SUPPLY CHAIN MANAGEMENT & BUSINESS INTELLIGENCE.
Lecture-8/ T. Nouf Almujally
Big Data A big step towards innovation, competition and productivity.
This presentation was scheduled to be delivered by Brian Mitchell, Lead Architect, Microsoft Big Data COE Follow him Contact him.
U.S. Department of the Interior U.S. Geological Survey David V. Hill, Information Dynamics, Contractor to USGS/EROS 12/08/2011 Satellite Image Processing.
© 2011 IBM Corporation Smarter Software for a Smarter Planet The Capabilities of IBM Software Borislav Borissov SWG Manager, IBM.
USING HADOOP & HBASE TO BUILD CONTENT RELEVANCE & PERSONALIZATION Tools to build your big data application Ameya Kanitkar.
Predict To Prosper In Digital Experience Rowan Curran, Researcher June 16, 2015.
Introduction to Hadoop and HDFS
Web Analytics Unit 4-1(2005 Fall) Managing the Digital Enterprise By Professor Michael Rappa.
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
Data Mining Knowledge on rough set theory SUSHIL KUMAR SAHU.
Managing Knowledge in Business Intelligence Systems Dr. Jan Mrazek.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
Big Data Analytics Large-Scale Data Management Big Data Analytics Data Science and Analytics How to manage very large amounts of data and extract value.
CRM - Data mining Perspective. Predicting Who will Buy Here are five primary issues that organizations need to address to satisfy demanding consumers:
+ Big Data IST210 Class Lecture. + Big Data Summary by EMC Corporation ( More videos that.
Advanced Analytics on Hadoop Spring 2014 WPI, Mohamed Eltabakh 1.
IT Architectures for Handling Big Data in Official Statistics: the Case of Scanner Data in Istat Gianluca D’Amato, Annunziata Fiore, Domenico Infante,
1 Melanie Alexander. Agenda Define Big Data Trends Business Value Challenges What to consider Supplier Negotiation Contract Negotiation Summary 2.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Big traffic data processing framework for intelligent monitoring and recording systems 學生 : 賴弘偉 教授 : 許毅然 作者 : Yingjie Xia a, JinlongChen a,b,n, XindaiLu.
ORCALE CORPORATION:-Company profile Oracle Corporation was founded in the year 1977 and is the world’s largest s/w company and the leading supplier for.
SAM for SQL Workloads Presenter Name.
CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware A Cloud Computing Methodology Study of.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
CISC 849 : Applications in Fintech Jin Gu Dept of Computer & Information Sciences University of Delaware Cyber-security & Finance.
3M Display & Graphics © 3M All Rights Reserved. Predict. Measure. Optimize. Control. Realizing the Full Potential of Digital Communication Networks.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
BIG DATA. Big Data: A definition Big data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database.
Foundations of Business Intelligence: Databases and Information Management Chapter 6 VIDEO CASES Case 1a: City of Dubuque Uses Cloud Computing and Sensors.
Dell Software Unified Communications Command Suite (UCCS) Provides Flexible, Cross-Platform Management, Reporting and Data Diagnostics MICROSOFT AZURE.
Leverage Big Data With Hadoop Analytics Presentation by Ravi Namboori Visit
Bleeding edge technology to transform Data into Knowledge HADOOP In pioneer days they used oxen for heavy pulling, and when one ox couldn’t budge a log,
From RDBMS to Hadoop A case study Mihaly Berekmeri School of Computer Science University of Manchester Data Science Club, 14th July 2016 Hayden Clark,
CNIT131 Internet Basics & Beginning HTML
CRM has been defined in a multiple ways
SNS COLLEGE OF TECHNOLOGY
Big Data Enterprise Patterns
ANOMALY DETECTION FRAMEWORK FOR BIG DATA
Bridging the Data Science and SQL Divide for Practitioners
IreckonU Offers a Powerful Hospitality Software Solution, Seamlessly Integrating Existing Hospitality Systems and Services on the Powerful Microsoft Azure.
Built Using the Powerful Microsoft Azure Platform, Mailytics Gives Marketers Competitor Data and Campaign Analysis to Better Develop Strategies MICROSOFT.
DEFECT PREDICTION : USING MACHINE LEARNING
IBM INFOSPHERE MDM online Training in Bangalore
Get Real Value and Insights from Your Data: Biin Solutions Provides Predictive Analytics, IoT, and Business Intelligence with Microsoft Azure Power MICROSOFT.
Customer Relationship Management
StorFly-IES Intelligent External Storage for Industrial-IoT ®
KMeans Clustering on Hadoop Fall 2013 Elke A. Rundensteiner
XtremeData on the Microsoft Azure Cloud Platform:
Big Data Young Lee BUS 550.
CRM has been defined in a multiple ways
Improve Patient Experience with Saama and Microsoft Azure
Top five challenges facing the Enterprise Data Warehouse (EDW)
Comparative Evaluation of SOM-Ward Clustering and Decision Tree for Conducting Customer-Portfolio Analysis By 1Oloyede Ayodele, 2Ogunlana Deborah, 1Adeyemi.
Big Data Analysis in Digital Marketing
Big DATA.
Pitch Deck.
Customer 360.
How Dell, SAP and SUSE Deliver Value Quickly
Copyright © JanBask Training. All rights reserved Get Started with Hadoop Hive HiveQL Languages.
Presentation transcript:

CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware iCARE : A Framework for Big Data Based Banking Customer Analytics

CISC 849 : Applications in Fintech Big Data Extremely Large Datasets that may be analysed computationally to reveal patterns,trends and associations. Three major characteristics of Big Data.

CISC 849 : Applications in Fintech Every digital process and social media exchange produces it. Systems, sensors and mobile devices transmit it.

CISC 849 : Applications in Fintech Customer Analytics Give organizations the customer insight necessary to deliver offers that are anticipated, relevant and timely. Decrease attrition by accurately predicting customers most likely to leave. Developing the right proactive campaigns to retain them.

CISC 849 : Applications in Fintech Challenges To handle the massive amount of complex data in a cost- effective and efficient way. To effectively generate business value from the analytics and obtain competitive advantages for banks.

CISC 849 : Applications in Fintech Intelligent Customer Analytics for the Recognition and Exploration (iCARE) framework. It is presented to analyze banking customer behaviors from banking big data. IBM products - IBM SPSS* Analytic Server and IBM InfoSphere BigInsights* are used. Analytical models are Customized and validated on the processed data according to specified business scenarios iCARE Framework

CISC 849 : Applications in Fintech BigInsights platform is an Apache Hadoop based hardware- agnostic software platform. IBM SPSS Analytic Server (AS) provides big data analytics capabilities. It also includes integrated support for unstructured predictive analytics from the Hadoop environment.

CISC 849 : Applications in Fintech Architecture

CISC 849 : Applications in Fintech There are four phases in the solution: Data acquisition Data preparation Data modeling Business applications iCare Solution Design

CISC 849 : Applications in Fintech A standard input format is defined in iCARE for structured data. Unstructured data can originate from inside a bank, including web log files, call records and external resources. The unstructured data is usually stored as files rather than database tables. Data Acquisition

CISC 849 : Applications in Fintech To enhance the data quality both unstructured and structured data needs additional data preparation. Big Sql tool is used to efficiently handle the incomplete, incorrect, or irrelevant data. o Use of statistical methods Data integration Data Preparation

CISC 849 : Applications in Fintech Integrated data is stored in a data warehouse. Data Conflict is resolved. Consolidated enterprise customer view is generated.

CISC 849 : Applications in Fintech The iCARE analytical models can be built for different business scenarios based on business objectives. Advantages : All the statistic and machine learning methods are customized to suit corresponding business scenarios. Use of Parallelized models iCARE Analytical Models

CISC 849 : Applications in Fintech Example Traditional K-means clustering algorithm. To divide n data points into K clusters with similar points to minimize the total distance between the points to their cluster centers. Steps : The algorithm first randomly selects K data points to be the cluster centers. Then assigns each data point to the closest cluster, and updates the center of each cluster by calculating weighted average of all data points in the cluster.

CISC 849 : Applications in Fintech A customized and parallelized K-means clustering algorithm. It is used to segment customers of a bank into several clusters based on their profile and transaction information. K data points are selected to be the cluster centers. Steps: First data point is used as first cluster center. For each data point, compute the minimum distance between it and each defined cluster center.

CISC 849 : Applications in Fintech The Manhattan distance is used here. Use of manhattan distance. For two D-dimensional data points x and y, the metric is defined as New cluster center is selected. Each data point is assigned to the closest cluster using standard algorithm with distance metric. Updated cluster center :

CISC 849 : Applications in Fintech Data points are redistributed to their closest cluster Parallelized further to follow map reduce model. The distances between data points and cluster centers are updated by the Mapper. Reducer adds up the partial sum to get new cluster center.

CISC 849 : Applications in Fintech Business Applications Customer segmentation and preference analysis Potential customer identification Customer network analysis Market potential analysis Channel allocation and operation optimization

CISC 849 : Applications in Fintech Conclusion It can be extended for other data analytics applications, not limited to customer relationship management or the banking industry. iCARE framework is scalable by adding more parallel analytical models.

CISC 849 : Applications in Fintech 20 terabytes of data was analyzed to help generate insights for retaining active online banking customers. Implemented for commercial bank in southeast China Start-up

CISC 849 : Applications in Fintech The model ran 12 times faster as a single host for the 4 GB test data sample with 1,600 instances.

CISC 849 : Applications in Fintech Thank You !