Managing Technical Talent: How to Find the Right Analyst for Your Problem Photo by mikebaird, www.flickr.com/photos/mikebaird Presentation to the Wolfram.

Slides:



Advertisements
Similar presentations
NHS Future Forum Event slidepack. The next phase for the NHS Future Forum The Prime Minister and Secretary of State for Health have announced that the.
Advertisements

Predictive modeling competitions
Predictive modeling competitions
What is On Time Booking? Reservation and distribution system for passenger transport companies (airlines and ferries ) Tool that helps you to manage the.
What ails the economy: turning New Zealand’s small size from a weakness to a strength 16 th March 2011 New Zealand Institute, Wellington Nicholas Gruen.
Strategic Decisions (Part II)
Career Opportunities in Statistical Computing. Two Perspectives on Careers in Statistical Computing 1.Software development opportunities at SAS 2.Emerging.
- GALILEO GALILEI. MATHEMATICS BREAKS THE WORLD DOWN INTO NUMBERS AND SYMBOLS.
Concept et Stratégies pour Gouvernement Ouvert et Open Data Jeff Kaplan - Senior Consultant, ICT Unit
Evaluating Inforce Blocks Of Disability Business With Predictive Modeling SOA Spring Health Meeting May 28, 2008 Jonathan Polon FSA
© 2014 Fair Isaac Corporation. Confidential. This presentation is provided for the recipient only and cannot be reproduced or shared without Fair Isaac.
Predictive Modeling for Disability Pricing May 13, 2009 Claim Analytics Inc. Barry Senensky FSA FCIA MAAA Jonathan Polon FSA
Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
Entrepreneurship I Class #3 Financing the Venture.
EECS 349 Machine Learning Instructor: Doug Downey Note: slides adapted from Pedro Domingos, University of Washington, CSE
CSE 546 Data Mining Machine Learning Instructor: Pedro Domingos.
Entrepreneurship I Class #3 Financing the Venture.
Sales forecasting with SAS Advanced Analytics for the Pharmaceutical sector. A business case.
The Exclusive Networks Group. Hands up VADs Everyone claims to be a VAD Overused, undervalued What do you mean you're a VAD?
ITU-T Informal Forum Summit San Francisco, July 2003 Global Standardisation Key to the success of Third Generation Mobile A UMTS Forum industry perspective.
Rated by senior insurance and reinsurance executives, Reactions is seen as a market leading information source Reaction is considered better than its competitors.
Enterprise systems infrastructure and architecture DT211 4
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Fleet management and logistics
Classifiers, Part 3 Week 1, Video 5 Classification  There is something you want to predict (“the label”)  The thing you want to predict is categorical.
Comparison of Classification Methods for Customer Attrition Analysis Xiaohua Hu, Ph.D. Drexel University Philadelphia, PA, 19104
Introducing NetFuel September NetFuel: Providing capital and talent to fuel Internet start-ups The goal: Highly successful client companies and.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Data Mining Techniques As Tools for Analysis of Customer Behavior
PIERS Global Intelligence Solutions Joe Davis, Pacific Regional Sales Manager.
Opening Keynote Presentation An Architecture for Intelligent Trading  Alessandro Petroni – Senior Principal Architect, Financial Services, TIBCO Software.
3 Objects (Views Synonyms Sequences) 4 PL/SQL blocks 5 Procedures Triggers 6 Enhanced SQL programming 7 SQL &.NET applications 8 OEM DB structure 9 DB.
Company factsheet May 2010 Experian is the leading global information services company, providing data and analytical tools. The company helps businesses.
Yield Management
0 COMPETITIVE INTELLIGENCE A PROCESS THAT CREATES COMPETITIVE ADVANTAGE NOT A REPORT THAT SITS ON A SHELF Plan Integrate Collect Analyze Communicate 4200.
Distribution Strategies Chap 05 王仁宏 助理教授 國立中正大學企業管理學系 ©Copyright 2001 製商整合科技中心.
Business Solutions. Agenda Overview Business Solutions Benefits Company Summary.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
© 2011 IBM Corporation STEM One industry perspective Maria Hernandez IBM Corp. Director, Strategy and Transformation January 2012.
Business Understanding the Big Picture. A Note on Advertising.
Converging Worlds – The Degree Apprenticeship Stella McKnight Director for Employer Partnerships University of Winchester Mark Jackson Talent Recruitment.
Instructor: Pedro Domingos
Presentation to the 9 th World Electronics Forum (WEF) Australia 2003 EMS Provider GPC Electronics Pty Limited © GPC Electronics 2003.
Soft Computing methods for High frequency tradin.
Copyright © 2001, SAS Institute Inc. All rights reserved. Data Mining Methods: Applications, Problems and Opportunities in the Public Sector John Stultz,
Institute of Automation and Control Systems KTU BS/2 Conference, Vilnius, 2008 June 13 Intelligent systems in banking industry: survey and future Rimvydas.
See where Mathematics can take you! Linda Galligan Senior Lecturer Department of Mathematics & Computing.
GameChanger’s Rate Quote Issue Solution is Deployed to Microsoft Azure for a Fast, Flexible Direct to Consumer Insurance Sales Solution MICROSOFT AZURE.
Jeremy Howard President, Kaggle web Machine learning competitions Photo by mikebaird,
Using Robotic Process Automation to Create a Digital Workforce Jeff Chandler, Sales Engineer, Kofax.
Analytics Reports Available on 500 Different U. S. Industries
Better decisions through data
Instructor: Pedro Domingos
Data Analytics for ICT.
WEBINAR The Rise Of Insights Services
Health Insurance Eligibility Verification and Authorization:
Department of intelligent systems
CSEP 546 Data Mining Machine Learning
Business Analysis for Data Science Teams
Machine Learning Training
CSEP 546 Data Mining Machine Learning
Analytics Reports Available on 500 Different U. S. Industries
CSEP 546 Data Mining Machine Learning
Using decision trees and their ensembles for analysis of NIR spectroscopic data WSC-11, Saint Petersburg, 2018 In the light of morning session on superresolution.
Solving Your Business Challenges Introduction October 2018
Copyright © JanBask Training. All rights reserved Why learn Hadoop & big data technology in 2019?
Radisson Blu, London Stansted
ΗΑΗΑΗΑΗΑΗΑ.
Built on the Powerful Azure Platform, Angoss Helps Businesses Turn Data into Actionable Insights That Reduce Risk, Increase Organizational Performance.
The Belgian experience on the detection of social contribution fraud
Presentation transcript:

Managing Technical Talent: How to Find the Right Analyst for Your Problem Photo by mikebaird, Presentation to the Wolfram Data Summit Washington DC, Friday, Sept 09,

genetic algorithms random forest Monte Carlo methods principal component analysis Kalman filter evolutionary fuzzy modelling neural networks logistic regression support vector machine decision trees ensemble methods adaBoost Bayesian networks Different users - different techniques.

“A discovery is... an accident meeting a prepared mind.” Albert Szent-Gyorgyi, 1937 Nobel Prize for Medicine ‣ Is the crown pure gold? ‣ We know its weight. ‣ How to measure its volume? Eureka!

4

Finding the world’s most perfectly prepared mind

Our User Base

Competition Mechanics Competitions are judged on objective criteria

1 23 Users create predictive models, submit these to Kaggle, and are scored on their accuracy. How Kaggle Works

Competitions are judged based on predictive accuracy

+ Genetic marker 4 Genetic marker 1 + Genetic marker 3 + Genetic marker 2 Which HIV patients will be sicker next week?

HIV LoadStock PricesChess Ratings Scouring the world for the best analysts for a problem. Traffic flowGrant Forecasting Dr. Derek Gatherer UK John Blatz Baltimore Edmund & Adrian London & USA Jason Trigg Pennsylvania Chih-Li Sung & Roy Tseng Penghu & Taipei Jure Zbontar Ljubljana Thomas Mahony Canberra Emir Delic Australia Glen Maher Canberra Chris Raimondi Batimore Claudio Perlich USA Gzegorz Swiszcz Gera Edmund & Adrian London & USA Rajstennaj Barrabas USA Jason Trigg Pennsylvania Lee Baker Las Cruces, NM Cole Harris Texas Nan Zhou Pittsburgh Uri Blass Tel-Aviv Giuseppe Ragusa Rome Robert Warsaw Ivan Russian Federation Chris DuBois Portland Philipp Emanuel Widmann Heidelberg, DE Dr. Christopher Hefele, New York Jeremy Howard Chris Raimondi Baltimore Tim Salimans Erasmus U

Global competitions 1½ weeks 70.8% Competition closes 77% State of the art 70% Predicting HIV progression US$500

HIV LoadStock PricesChess Ratings Where’s Wally? Scouring the world for the best analysts for a problem. Traffic flowGrant Forecasting Dr. Derek Gatherer UK John Blatz Baltimore Edmund & Adrian London & USA Jason Trigg Pennsylvania Chih-Li Sung & Roy Tseng Penghu & Taipei Jure Zbontar Ljubljana Chris Raimondi Batimore Claudio Perlich USA Gzegorz Swiszcz Gera Edmund & Adrian London & USA Rajstennaj Barrabas USA Jason Trigg Pennsylvania Lee Baker Las Cruces, NM Cole Harris Texas Nan Zhou Pittsburgh Uri Blass Tel-Aviv Giuseppe Ragusa Rome Robert Warsaw Ivan Russian Federation Chris DuBois Portland Philipp Emanuel Widmann Heidelberg, DE Dr. Christopher Hefele, New York Chris Raimondi Baltimore

HIV LoadStock PricesChess Ratings Where’s Wally? Scouring the world for the best analysts for a problem. Traffic flowGrant Forecasting Dr. Derek Gatherer UK John Blatz Baltimore Edmund & Adrian London & USA Jason Trigg Pennsylvania Chih-Li Sung & Roy Tseng Penghu & Taipei Jure Zbontar Ljubljana Chris Raimondi Batimore Claudio Perlich USA Gzegorz Swiszcz Gera Edmund & Adrian London & USA Rajstennaj Barrabas USA Jason Trigg Pennsylvania Lee Baker Las Cruces, NM Cole Harris Texas Nan Zhou Pittsburgh Uri Blass Tel-Aviv Giuseppe Ragusa Rome Robert Warsaw Ivan Russian Federation Chris DuBois Portland Philipp Emanuel Widmann Heidelberg, DE Dr. Christopher Hefele, New York Tim Salimans Erasmus U R’dam

Martin O’Leary

“In less than a week … a PhD student in glaciology outperformed the state- of-the-art algorithms”

We could not be happier with the result. The Kaggle approach has set a new benchmark in Government for the development of successful predictive models, delivered quickly and very cost effectively. In particular, the flexibility of the winning predictive model will enable its application to other major transport routes to the CBD and allow for the addition of other factors such as weather and incident. Susan Calvert Director, Strategy and Project Delivery Unit Department Premier and Cabinet

A Few Kaggle Projects Take historical medical claims and predict who will go to hospital. This competition has a $3 million prize. Predict which editors will stop contributing New algorithm for chess ratings. Has wide gaming and ranking significance Detect driver drowsiness Predict the likelihood of claims given different vehicle models Predict successful grant applications Predict shoppers’ next visit to supermarket

User base: 14,107 registered data scientists

Forecast Error (MASE) Combination of world’s best models Aug 92 weeks later 1 month later Competition End This competition (to forecast tourism demand) used one of the most heavily studied sets of time series data. It had previously been modeled using the leading commercial software and academic algorithms. Competitors quickly surpassed world’s best practice and found the frontier of what’s possible. Frontier reached after all information is extracted from the dataset Kaggle Competition Results

HIV LoadStock PricesChess Ratings Where’s Wally? Scouring the world for the best analysts for a problem. Traffic flowGrant Forecasting Dr. Derek Gatherer UK John Blatz Baltimore Edmund & Adrian London & USA Jason Trigg Pennsylvania Chih-Li Sung & Roy Tseng Penghu & Taipei Jure Zbontar Ljubljana Chris Raimondi Batimore Claudio Perlich USA Gzegorz Swiszcz Gera Edmund & Adrian London & USA Rajstennaj Barrabas USA Jason Trigg Pennsylvania Lee Baker Las Cruces, NM Cole Harris Texas Nan Zhou Pittsburgh Uri Blass Tel-Aviv Giuseppe Ragusa Rome Robert Warsaw Ivan Russian Federation Chris DuBois Portland Philipp Emanuel Widmann Heidelberg, DE Dr. Christopher Hefele, New York Jeremy Howard

From generating value => Making money 1.Open Comps: Unleashing the power of Crowdsourcing $Commission, consulting and performance fees 2.Consulting partnerships $revenue share 3.The platform as marketplace for technical talent $revenue share

Our market Business analytics = $107 bil market Outsourced business analytics = $38b [IDC] Public and third sector Revenue forecasts Traffic forecasting Energy demand Predicting crime Tax/social security fraud Hospital casualty demand Identifying great Teachers Hospitals Private Sector Sales forecasts Credit scoring Stock picking Risk modelling and pricing Identifying fraud Identifying best practice Production management Inventory management Logistic optimisation

First mover advantages of internet platforms Clients Analysts

Kaggle not for profit Kaggle public good competitions

“I keep saying the sexy job in the next ten years will be statisticians. ” Hal Varian Google Chief Economist 2009 No matter who you are, most of the smartest people work for someone else. Bill Joye Founder, Sun Microsystems 2009

Transforming the inefficient market for technical talent into the world’s largest Wally Photos by William Murphy (Flickr: infomatique)

Who We Are Anthony Goldbloom CEO / Founder the Australian Treasury & Reserve Bank of Australia Journalism The Economist. Nicholas Gruen Chairman Chairman of the Australian Gov. 2.0 taskforce Jeff Moser CTO Raytheon and widely read bloggerwidely read blogger Jeremy Howard Chief Scientist McKinsey and A.T. Kearney alumnus Founder of 2 successful startups: FastMail (exit to Opera) and Optimal Decisions Group (exit to Choicepoint)