1 Data Mining in Practice: Techniques and Practical Applications. Junling Hu, May 14, 2013
2 What is data mining? Mining patterns from data. Is it statistics? Functional form? Computation speed concern? Data size; variable size. Is it machine learning? Big data issue. New methods: network mining. E.g. stroke prediction.
3 Examples of data mining: Frequently bought together; Movie recommendation
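As a rough illustration of the "frequently bought together" example, the sketch below counts how often each pair of items shows up in the same basket and keeps the pairs that reach a minimum count. The baskets, the `frequent_pairs` helper, and the `min_support` threshold are all assumptions made up for this sketch; the slides do not say how any particular retailer computes this.

```python
from collections import Counter
from itertools import combinations

# Hypothetical shopping baskets, invented for illustration only.
baskets = [
    {"camera", "sd card", "tripod"},
    {"camera", "sd card"},
    {"camera", "case"},
    {"sd card", "tripod"},
]

def frequent_pairs(baskets, min_support):
    """Count baskets containing each item pair; keep pairs seen >= min_support times."""
    counts = Counter()
    for basket in baskets:
        # sorted() puts every pair in one canonical order,
        # so (a, b) and (b, a) are counted as the same pair.
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    return {pair: n for pair, n in counts.items() if n >= min_support}

pairs = frequent_pairs(baskets, min_support=2)
print(pairs)
```

Counting pairs directly is only practical for small catalogs; frequent pattern mining algorithms exist to prune this search on large ones.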
4 More examples of data mining: Keyword suggestions; Genome & disease mining; Heart monitoring
5 Overview of data mining: Frequent pattern mining; Machine learning (supervised, unsupervised); Stream mining; Recommender system; Graph mining; Unstructured data (text, audio, image and video); Big data technology
10 Binary classification. Input features: Checking; Duration (years); Savings ($k); Current Loans; Loan Purpose. Output class: Risky? Table values: Yes110TV24No575Car66Repair831199. Data point; millions of data points, hundreds of thousands of rows
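To make the "input features, output class" setup concrete, here is a minimal sketch of one very simple binary classifier, a decision stump, which picks a single feature and threshold that best separate risky from non-risky rows. The two features, the six rows, and the function names are assumptions invented for this sketch; they are not the values from the slide's table, and the slides do not name a specific classifier at this point.

```python
from typing import List, Tuple

# Hypothetical rows: (duration_years, savings_k). Label 1 = risky, 0 = not risky.
# Invented for illustration; NOT the slide's table.
X: List[Tuple[float, float]] = [
    (1.0, 2.0), (2.0, 1.0), (5.0, 40.0), (6.0, 30.0), (3.0, 25.0), (1.5, 5.0),
]
y: List[int] = [1, 1, 0, 0, 0, 1]

def fit_stump(X, y) -> Tuple[int, float, int]:
    """Return (feature_index, threshold, label_if_at_or_below) with the fewest training errors."""
    best = None
    for j in range(len(X[0])):
        values = sorted({row[j] for row in X})
        # Candidate thresholds: midpoints between consecutive distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            for below in (0, 1):
                errors = sum(
                    (below if row[j] <= t else 1 - below) != label
                    for row, label in zip(X, y)
                )
                if best is None or errors < best[0]:
                    best = (errors, j, t, below)
    _, j, t, below = best
    return j, t, below

def predict(stump: Tuple[int, float, int], row) -> int:
    """Apply the stump: one label at or below the threshold, the other label above it."""
    j, t, below = stump
    return below if row[j] <= t else 1 - below

stump = fit_stump(X, y)
print(stump, predict(stump, (4.0, 50.0)))
```

A stump is deliberately the weakest useful model; it is shown here only because the whole learn-then-predict loop fits in a few lines.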
33 Prediction Problems. Rating prediction: given how a user rated other items, predict the user's rating for a given item. Top-N recommendation: given the list of items liked by a user, recommend new items that the user might like.
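The two problem statements can be contrasted with a small sketch: rating prediction returns a number for one (user, item) pair, while top-N recommendation returns a ranked list of items the user has not rated yet. The predictor here is a plain item-mean baseline chosen for brevity, not a method from the slides; the ratings and the names `item_means`, `predict_rating`, and `top_n` are assumptions for illustration.

```python
from collections import defaultdict

# Hypothetical explicit ratings (user -> item -> rating, 1-5 scale), invented for illustration.
ratings = {
    "u1": {"A": 5, "B": 3},
    "u2": {"A": 4, "C": 2},
    "u3": {"B": 4, "C": 5, "D": 1},
}

def item_means(ratings):
    """Mean rating of each item over the users who rated it."""
    totals, counts = defaultdict(float), defaultdict(int)
    for user_ratings in ratings.values():
        for item, r in user_ratings.items():
            totals[item] += r
            counts[item] += 1
    return {item: totals[item] / counts[item] for item in totals}

def predict_rating(ratings, user, item):
    """Rating prediction with a baseline: the item's mean rating.
    This baseline ignores `user`; a personalised predictor would not."""
    return item_means(ratings).get(item)

def top_n(ratings, user, n):
    """Top-N recommendation: rank the user's unrated items by predicted rating."""
    means = item_means(ratings)
    seen = ratings.get(user, {})
    unseen = [(item, m) for item, m in means.items() if item not in seen]
    unseen.sort(key=lambda pair: (-pair[1], pair[0]))  # best first, ties by item name
    return [item for item, _ in unseen[:n]]

print(predict_rating(ratings, "u1", "C"), top_n(ratings, "u1", 2))
```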
34 Explicit vs. Implicit Feedback Data. Explicit feedback: ratings and reviews. Implicit feedback (user behavior): purchase behavior (recency, frequency, …); browsing behavior (# of visits, time of visit, time of staying, clicks)
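As one possible reading of "recency, frequency", the sketch below turns a raw purchase log into two implicit-feedback features per user: days since the last purchase and number of purchases. The log entries, the reference date, and the `recency_frequency` helper are assumptions invented for this sketch.

```python
from collections import defaultdict
from datetime import date

# Hypothetical purchase log of (user, purchase_date) events, invented for illustration.
purchases = [
    ("u1", date(2013, 5, 1)),
    ("u1", date(2013, 5, 10)),
    ("u2", date(2013, 4, 20)),
]

def recency_frequency(events, today):
    """Per user: days since the most recent purchase, and total purchase count."""
    last_seen = {}
    count = defaultdict(int)
    for user, day in events:
        count[user] += 1
        if user not in last_seen or day > last_seen[user]:
            last_seen[user] = day
    return {
        user: {"recency_days": (today - last_seen[user]).days, "frequency": count[user]}
        for user in count
    }

features = recency_frequency(purchases, today=date(2013, 5, 14))
print(features)
```

Unlike explicit ratings, these values carry no direct "like" or "dislike" signal, so they are usually treated as evidence of interest rather than as ratings.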
35 Collaborative Filtering Hypotheses. User/item similarities: similar users purchase similar items; similar items are purchased by similar users. Matching characteristics: a match exists between the user's and the item's characteristics.
36 User-user similarity. User's movie ratings. Movies: Out of Africa; Star Wars; Air Force One; Liar, Liar. Ratings: John 4, 5, 1; Adam 2; Laura ?
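A minimal sketch of user-user similarity, assuming cosine similarity over co-rated movies and a similarity-weighted average for the missing rating. The slides do not state which similarity measure is used, so cosine is an assumption here. The user and movie names come from the slide, but every rating value below is invented for illustration and is not the slide's data.

```python
from math import sqrt

# Names taken from the slide; all rating values are INVENTED for this sketch.
ratings = {
    "John":  {"Out of Africa": 4, "Star Wars": 5, "Air Force One": 1, "Liar, Liar": 2},
    "Adam":  {"Out of Africa": 1, "Star Wars": 2, "Air Force One": 5, "Liar, Liar": 4},
    "Laura": {"Out of Africa": 5, "Star Wars": 4, "Air Force One": 2},
}

def user_similarity(a, b):
    """Cosine similarity between two users over the movies both have rated."""
    common = set(ratings[a]) & set(ratings[b])
    if not common:
        return 0.0
    dot = sum(ratings[a][m] * ratings[b][m] for m in common)
    norm_a = sqrt(sum(ratings[a][m] ** 2 for m in common))
    norm_b = sqrt(sum(ratings[b][m] ** 2 for m in common))
    return dot / (norm_a * norm_b)

def predict_user_based(user, movie):
    """Similarity-weighted average of other users' ratings for the movie."""
    num = den = 0.0
    for other, their in ratings.items():
        if other == user or movie not in their:
            continue
        s = user_similarity(user, other)
        num += s * their[movie]
        den += abs(s)
    return num / den if den else None

print(predict_user_based("Laura", "Liar, Liar"))
```

With these invented values Laura's tastes track John's more closely than Adam's, so the prediction lands nearer John's rating. Raw cosine on all-positive ratings discriminates weakly; mean-centering each user's ratings first is a common refinement.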
37 Item-item similarity. Movies: Out of Africa; Star Wars; Air Force One; Liar, Liar. Ratings: John 4, 5, 1; Adam 2; Laura ?
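The item-item view can be sketched the same way, this time comparing movies by the ratings they received from the same users and predicting from the target user's own ratings of similar movies. Cosine similarity is again an assumption rather than something the slides specify. The names come from the slide, while all rating values are invented for illustration and are not the slide's data.

```python
from math import sqrt

# Names taken from the slide; all rating values are INVENTED for this sketch.
ratings = {
    "John":  {"Out of Africa": 4, "Star Wars": 5, "Air Force One": 1, "Liar, Liar": 2},
    "Adam":  {"Out of Africa": 1, "Star Wars": 2, "Air Force One": 5, "Liar, Liar": 4},
    "Laura": {"Out of Africa": 5, "Star Wars": 4, "Air Force One": 2},
}

def item_vector(movie):
    """Ratings a movie received, keyed by user."""
    return {user: r[movie] for user, r in ratings.items() if movie in r}

def item_similarity(m1, m2):
    """Cosine similarity between two movies over the users who rated both."""
    v1, v2 = item_vector(m1), item_vector(m2)
    common = set(v1) & set(v2)
    if not common:
        return 0.0
    dot = sum(v1[u] * v2[u] for u in common)
    norm1 = sqrt(sum(v1[u] ** 2 for u in common))
    norm2 = sqrt(sum(v2[u] ** 2 for u in common))
    return dot / (norm1 * norm2)

def predict_item_based(user, movie):
    """Weighted average of the user's own ratings, weighted by similarity to the target movie."""
    num = den = 0.0
    for other_movie, r in ratings[user].items():
        if other_movie == movie:
            continue
        s = item_similarity(movie, other_movie)
        num += s * r
        den += abs(s)
    return num / den if den else None

print(predict_item_based("Laura", "Liar, Liar"))
```

The item-based and user-based predictions generally differ for the same matrix, because each weights a different set of known ratings.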