Fraud Detection with Machine Learning: A Case Study from Sift Science

Slides:



Advertisements
Similar presentations
Your Trade Exchange And
Advertisements

Weighing the Risks and Benefits of Online Financial Transactions
Creating the Ultimate Online Customer-Service Experience Stefan Beeli, Vice President ESP Computer Services Choosing the proper level of Technology A look.
AcqENGIN Functional Review © 2010 M2. All rights reserved.
Technology 2011 David Dickey. Agenda 2010 – What we did 2011 – What will we do Agent Websites Property Tracker Use Technology to get listings.
Welcome! Presenter: Sam Foster. Major Topics To Be Covered In This Presentation Content Management System Amazon and eBay Integration Shopping Networks.
1 Payables Efficiency Through… Access Online PAYMENT PLUS.
1.2.2.G1 © Take Charge Today – Revised May 2010 – Electronic Banking Bonanza – Slide 1 Funded by a grant from Take Charge America, Inc. to the Norton School.
29 Oded Moshe, Director of Product Management Beta Release May 3rd, 2010 Official Release May 24, 2010.
PayPal E-Business Technologies Prof.Dr. Eduard Heindl BCM SS Edit Molnar.
PayPal as an Additional Payment Option How Merchants Can Benefit When They Accept PayPal on Their Site.
OK.gov’s Online Registration System. The Online Registration System Allows CMS Users to: - Gather custom data from registrants - Restrict the number of.
Apps VS Mobile Websites Which is better?. Bizness Apps Survey Bizness Apps surveyed over 500 small business owners with both a mobile app and a mobile.
Knowledge Discovery & Data Mining process of extracting previously unknown, valid, and actionable (understandable) information from large databases Data.
Welcome! Presenter: Sam Foster. Major Topics To Be Covered In This Presentation Content Management System Amazon, eBay, and Gun Broker Integration Shopping.
Data Mining Dr. Chang Liu. What is Data Mining Data mining has been known by many different terms Data mining has been known by many different terms Knowledge.
CS490D: Introduction to Data Mining Prof. Chris Clifton April 14, 2004 Fraud and Misuse Detection.
What do you get when you combine two leaders in their respective fields?
1 iBizBuilder from ClearPath Welcome to the ClearPath Networks iBizaBuilder online Tool! This innovative tool will be an online source for automating your.
Mobile Apps For Small Businesses Your customers are mobile. Is your business? Myappcompany.com (555)
The Future of gps technology
E-Commerce Sell Your Stuff Online. What We’ll Be Talking About One of the most important parts of your business is collecting the cash, and Infusionsoft.
GOOGLE ANALYTICS Destinee Cushing DIG 4104C Spring 2014.
Welcome to corporate.eircom.ie This presentation aims to give you an overview of corporate.eircom.ie, what it offers you and how best to navigate your.
Mobile Apps For Small Businesses Your customers are mobile. Is your business?
Computerized Accounting Systems The new ways we “PLAY” with money.
Access Marketing- Mobile Apps For Small Businesses Your customers are mobile. Is your business?
Ryan Wilk October, 2014 – The Confluence of Many Challenges Ryan Wilk (385) NuData Security 1.
Mobile Apps for Small Businesses Your customers are mobile. Is your business? EDA Tech Comm
©2015 Apigee Corp. All Rights Reserved. Preserving signal in customer journeys Joy Thomas, Apigee Jagdish Chand, Visa.
Your customers are mobile. WHERE ARE YOU? Lani Minihan ApplifyYourself.com (808)
Introduction to SQL Server Data Mining Nick Ward SQL Server & BI Product Specialist Microsoft Australia Nick Ward SQL Server & BI Product Specialist Microsoft.
Leveraging Asset Reputation Systems to Detect and Prevent Fraud and Abuse at LinkedIn Jenelle Bray Staff Data Scientist Strata + Hadoop World New York,
CONFIDENTIAL1 Hidden Decision Trees to Design Predictive Scores – Application to Fraud Detection Vincent Granville, Ph.D. AnalyticBridge October 27, 2009.
Restaurants & Mobile Why Your Restaurant Needs A Mobile Experience.
Apple Inc ISO. Organisation chosen Apple Store The company designs, manufactures and markets personal computers, portable digital.
Payment processing re-invented Mark Bradbury, CEO.
Mobile Apps For Small Businesses Your customers are mobile. Is your business? Myappcompany.com (555)
Data Mining Copyright KEYSOFT Solutions.
1 Law, Ethical Impacts, and Internet Security. 2 Legal Issues vs. Ethical Issues Ethics — the branch of philosophy that deals with what is considered.
Cybersecurity Test Review Introduction to Digital Technology.
RECLAIM YOUR CHILDHOOD A VINTAGE TOY COMPANY. WEB SITE CREATION - WEEBLY BUSINESS ACCOUNT LOW COST, $20 PER MONTH INTEGRATED E-COMMERCE SHOPPING CART.
Copyright The Asian Banker All rights reserved Channel Mgmt/Distribution Copyright The Asian Banker All rights reserved 1/8 Asian Banker.
Zeus Virus By: Chris Foley. Overview  What is Zeus  What Zeus Did  The FBI investigation  The virus for phones  Removal and detection  Conclusion.
2014 Using machine learning to predict binding sites in proteins Jenelle Bray Stanford University October 10, 2014 #GHC
Talks! Acteva: Event Management Made Easy September 17, 2009 Audio is only available by calling this number: Conference Call: ; Access Code:
KASPERSKY INTERNET SECURITY FOR ANDROID. YOUR MOBILE DEVICES NEED PROTECTION More online communications and transaction are happening on tablets and phones.
9/24/2017 7:27 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Paypal PayPal is an e-commerce business allowing payments and money transfers to be made through the Internet. With a PayPal account, you can send and.
Gift Card Risk Mitigation – Presentation A
PAYMENT GATEWAY Presented by SHUJA ASHRAF SHAH ENROLL: 4471
Make Predictions Using Azure Machine Learning Studio
It’s All About Me From Big Data Models to Personalized Experience
Mobile Card Reader Process payments on the go!
Digital Analytics Best Practices for Financial Services Environments
Boost Your Grocery Business with Mobile App.
On Demand Laundry App Development Cost With Key Features.
A GACP and GTMCP company
Project 1 – Twitter Slang Term Extraction
Kathi Kellenberger Redgate Software
Mining and Analyzing Data from Open Source Software Repository
Introduction to Azure Machine Learning Studio
Erik de Nooij, IT Chapter Lead Fraud&Cybersec.
Data Analysis.
Friendly Fraud.
Azure Machine Learning
Inside a Bad Actor’s Studio
Do You Have Multiple Amazon Seller Accounts? Amazon Knows it! By EsellersCare Contact : +1 (855)
Credit Card Fraudulent Transaction Detection
Founded in 2002, Credit Abuse Resistance Education (CARE) educates high school and college students on the responsible use of credit and other fundamentals.
Presentation transcript:

Fraud Detection with Machine Learning: A Case Study from Sift Science #GHC14 Fraud Detection with Machine Learning: A Case Study from Sift Science Katherine Loh, Sift Science October 9th, 2014 2014

What is Sift Science? Sift Science fights fraud using large-scale machine learning Clarify that it’s mostly payments fraud, such as stolen credit cards

What is Fraud? Chargebacks

What is Fraud? Chargebacks From stolen credit cards

What is Fraud? Chargebacks From stolen credit cards Teams dedicated to fighting chargebacks

What is Fraud? Chargebacks From stolen credit cards Teams dedicated to fighting chargebacks Goods lost & fees (~$20)

What is Fraud? Chargebacks Spamming users

What is Fraud? Chargebacks Spamming users Fake listings

What is Fraud? Chargebacks Spamming users Fake listings Promo program abuse

How does Sift help? Site reports page, transaction, and custom events to the Sift API We build up a model of the site’s users in real-time Site may give guidance by labeling some users as “bad” or “not bad” Site consumes scores through the API or workflow tools

TLDR; Site sends data to Sift, Sift calculates fraud scores Site consumes fraud scores

Supervised ML Human judgments on historical data (labels) Statistical analysis of training data Model finds correlations between input data and observed labels Bad or Not Bad?

Real Time! Scores are necessary to process orders Must include latest events & labels Median score latency is under 200ms

How Large is Large? 1,000+ websites 700 events / second (at peak) 350M+ IP addresses roughly $3B of transaction volume analyzed each month 1,000+ features Millions of fraud patterns

SIFT

SIFT

Magic Algorithms Naïve Bayes Logistic Regression

Network vs Customer Models Customers start on our “Network Model” With 20 “bad” labels, they move to a customer-specific model

One User, One Purchase IP Address: 203.189.24.290 Billing Name: Katherine Loh Billing Address: San Francisco, CA Email Address: katherine@siftscience.com Credit Card: 4567xxxxxxxxxxxx Item Purchased: Sleeping Bag Cost: 50.00 USD Authorization Result: Success

One User, Over Time Account created Updated credit card info Updated settings Purchased Item Updated credit card info Purchased Item Purchased Item IP Address: 203.189.24.290 Billing Name: Katherine Loh Billing Address: San Francisco, CA Email Address: katherine@siftscience.com Credit Card: 6543xxxxxxxxxxxx Item Purchased: Sleeping Bag Cost: 50.00 USD Authorization Result: Success

One User, Over Time Account is 4 hours old Account created Updated credit card info Updated settings Purchased Item Updated credit card info Purchased Item Purchased Item IP Address: 203.189.24.290 Billing Name: Katherine Loh Billing Address: San Francisco, CA Email Address: katherine@siftscience.com Credit Card: 6543xxxxxxxxxxxx Item Purchased: Sleeping Bag Cost: 50.00 USD Authorization Result: Success

One User, Over Time 2 credit card updates in user’s history Account is 4 hours old Account created Updated credit card info Updated settings Purchased Item Updated credit card info Purchased Item Purchased Item IP Address: 203.189.24.290 Billing Name: Katherine Loh Billing Address: San Francisco, CA Email Address: katherine@siftscience.com Credit Card: 6543xxxxxxxxxxxx Item Purchased: Sleeping Bag Cost: 50.00 USD Authorization Result: Success

One User, Over Time 2 credit card updates in user’s history 3 transactions in the last hour Account is 4 hours old Account created Updated credit card info Updated settings Purchased Item Updated credit card info Purchased Item Purchased Item IP Address: 203.189.24.290 Billing Name: Katherine Loh Billing Address: San Francisco, CA Email Address: katherine@siftscience.com Credit Card: 6543xxxxxxxxxxxx Item Purchased: Sleeping Bag Cost: 50.00 USD Authorization Result: Success

One Site, Many Users taylor@siftscience.com jtan123@gmail.com time taylor@siftscience.com jtan123@gmail.com beyonce@gmail.com b.yonce@gmail.com katherine@siftscience.com

x = marked bad by site owner One Site, Many Users time taylor@siftscience.com jtan123@gmail.com beyonce@gmail.com b.yonce@gmail.com katherine@siftscience.com x x x = marked bad by site owner

Transacted from same IP One Site, Many Users time taylor@siftscience.com jtan123@gmail.com beyonce@gmail.com b.yonce@gmail.com katherine@siftscience.com x x Transacted from same IP

One Site, Many Users taylor@siftscience.com jtan123@gmail.com time taylor@siftscience.com jtan123@gmail.com beyonce@gmail.com b.yonce@gmail.com katherine@siftscience.com x x Similar email addresses Transacted from same IP

Many Sites, Many Users Site 1 Site 2 Site 3

Transacted from same IP Many Sites, Many Users Site 1 Transacted from same IP Site 2 Site 3

Features Event features State features Temporal features Graph features

Event Features Properties of user’s most recent event Credit card type, billing zip code, shipping type Billing address, shipping address, product SKU

State Features Properties of user’s current state Broad Attributes: Country, time of day, browser type Identity Features: IP address, device fingerprint, cookie, name

Temporal Features Properties of user’s time series up to that point Velocities: Number of purchases in the past hour? IP addresses? Sequence Features: Last 5 actions taken? Last few geo locations?

Graph Features How the user relates to others on the sites and other sites Number of other users using the same shipping address Similarity of this user with the seller of the item (for an online marketplace)

Graph Features normal less normal

Evaluating Features

Evaluating Features

Evaluating Features

Normal Users Eat Lunch

Fraudsters Skip Lunch

Fraudsters Are Night Owls

Fraudsters Don Multiple Identities

Lessons Learned Keep customers happy

Happy Customers? accurate scores great support customer easy to use product ??? customer happiness

Lessons Learned Keep customers happy Results must be understandable

Lessons Learned Keep customers happy Results must be understandable Humans expect stability and speed

Lessons Learned Keep customers happy Results must be understandable Humans expect stability and speed External knowledge changes over time

Data Changes Over Time User labels Exchange rates IP/Geo data New features New models

Lessons Learned Keep customers happy Results must be understandable Humans expect stability and speed External knowledge changes over time Noise is everywhere

Noise is EVERYWHERE Wrong labels Duplicate labels Bad integrations Incomplete integrations Missing fields Bugs System downtime

Questions? katherine@siftscience.com

Got Feedback? Rate and Review the session using the GHC Mobile App To download visit www.gracehopper.org This is the last slide and must be included in the slide deck