Presentation on theme: "Data Mining in Practice: Techniques and Practical Applications"— Presentation transcript:
1 Data Mining in Practice: Techniques and Practical Applications Junling HuMay 14, 2013
2 What is data mining? Mining patterns from data Is it statistics? Functional form?Computation speed concern?Data sizeVariable sizeIs it machine learning?Big data issueNew methods: network miningE.g. stroke prediction
3 Examples of data mining Frequently bought togetherMovie recommendation
4 More examples of data mining Keyword suggestionsGenome & disease miningHeart monitoring
5 Overview of data mining Frequent pattern miningMachine LearningSupervisedUnsupervisedStream miningRecommender systemGraph miningUnstructured dataText,AudioImage and VideoBig data technology
10 Binary classification Input featuresOutput classCheckingDuration (years)Savings($k)Current LoansLoan PurposeRisky?Yes110TV24No575Car66Repair831199Data pointMillions of data points, hundreds of thousands of rows
33 Prediction Problems ? Rating Prediction Top-N Recommendation **** Given how an user rated other items, predict the user’s rating for a given itemTop-N RecommendationGiven the list of items liked by an user, recommend new items that the user might like?****
34 Explicit vs. Implicit Feedback Data Explicit feedbackRatings and reviewsImplicit feedback (user behavior)Purchase behavior: Recency, frequency, …Browsing behavior: # of visits, time of visit, time of staying, clicks
35 Collaborative Filtering HypothesesUser/Item SimilaritiesSimilar users purchase similar itemsSimilar items are purchased by similar usersMatching characteristicsMatch exists between user’s and item’s characteristics
36 User-User similarity User’s movie rating Out of Africa Star Wars Air Force OneLiar, LiarJohn451Adam2Laura?
37 Item-item similarity Out of Africa Star Wars Air Force One Liar, Liar John451Adam2Laura?