Lecture 6: Introduction to Machine Learning

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
1 Machine Learning: Lecture 10 Unsupervised Learning (Based on Chapter 9 of Nilsson, N., Introduction to Machine Learning, 1996)
Indian Statistical Institute Kolkata
Chapter 7 – Classification and Regression Trees
Chapter 7 – Classification and Regression Trees
Machine Learning CPSC 315 – Programming Studio Spring 2009 Project 2, Lecture 5.
Cluster Analysis.  What is Cluster Analysis?  Types of Data in Cluster Analysis  A Categorization of Major Clustering Methods  Partitioning Methods.
Three kinds of learning
Part I: Classification and Bayesian Learning
Chapter 5 Data mining : A Closer Look.
Evaluating Performance for Data Mining Techniques
1 © Goharian & Grossman 2003 Introduction to Data Mining (CS 422) Fall 2010.
More Machine Learning Linear Regression Squared Error L1 and L2 Regularization Gradient Descent.
Overview DM for Business Intelligence.
Machine Learning CUNY Graduate Center Lecture 1: Introduction.
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
COMMON EVALUATION FINAL PROJECT Vira Oleksyuk ECE 8110: Introduction to machine Learning and Pattern Recognition.
Machine Learning CSE 681 CH2 - Supervised Learning.
Chapter 9 – Classification and Regression Trees
CPSC 502, Lecture 15Slide 1 Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 16 Nov, 3, 2011 Slide credit: C. Conati, S.
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
Transfer Learning Task. Problem Identification Dataset : A Year: 2000 Features: 48 Training Model ‘M’ Testing 98.6% Training Model ‘M’ Testing 97% Dataset.
CS 445/545 Machine Learning Winter, 2012 Course overview: –Instructor Melanie Mitchell –Textbook Machine Learning: An Algorithmic Approach by Stephen Marsland.
CS 445/545 Machine Learning Spring, 2013 See syllabus at
Today Ensemble Methods. Recap of the course. Classifier Fusion
Clustering Instructor: Max Welling ICS 178 Machine Learning & Data Mining.
Data Mining and Decision Support
Learning Kernel Classifiers 1. Introduction Summarized by In-Hee Lee.
Introduction to Classification & Clustering Villanova University Machine Learning Lab Module 4.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Mixture Densities Maximum Likelihood Estimates.
Data Summit 2016 H104: Building Hadoop Applications Abhik Roy Database Technologies - Experian LinkedIn Profile:
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Bayes Rule Mutual Information Conditional.
Machine Learning Supervised Learning Classification and Regression
GLBIO ML workshop May 17, 2016 Ivan Kryukov and Jeff Wintersinger
Machine Learning with Spark MLlib
Machine Learning Models
Introduction to Classification & Clustering
Evaluating Classifiers
Data Mining 101 with Scikit-Learn
Neural Networks for Machine Learning Lecture 1e Three types of learning Geoffrey Hinton with Nitish Srivastava Kevin Swersky.
Introduction to Data Science Lecture 7 Machine Learning Overview
Nick Hauenstein Signals, Intelligence, and Intelligent Actions
Estimating Link Signatures with Machine Learning Algorithms
Dipartimento di Ingegneria «Enzo Ferrari»,
Intro to Machine Learning
CSE 4705 Artificial Intelligence
Text Categorization Document classification categorizes documents into one or more classes which is useful in Information Retrieval (IR). IR is the task.
Overview of Machine Learning
Dr. Unnikrishnan P.C. Professor, EEE
INTRODUCTION.
Classification and Prediction
Introduction to Data Science Lesson 1
Cluster Validity For supervised classification we have a variety of measures to evaluate how good our model is Accuracy, precision, recall For cluster.
CSCI N317 Computation for Scientific Applications Unit Weka
MIS2502: Data Analytics Clustering and Segmentation
Machine Learning in Practice Lecture 23
MIS2502: Data Analytics Clustering and Segmentation
LECTURE 21: CLUSTERING Objectives: Mixture Densities Maximum Likelihood Estimates Application to Gaussian Mixture Models k-Means Clustering Fuzzy k-Means.
Intro to Machine Learning
Model generalization Brief summary of methods
Machine learning overview
CS639: Data Management for Data Science
Lecture 21: Machine Learning Overview AP Computer Science Principles
Machine Learning in Business John C. Hull
Lecture 9: Machine Learning Overview AP Computer Science Principles
Morteza Kheirkhah University College London
Getting Started with Microsoft Azure Machine Learning
Presentation transcript:

Lecture 6: Introduction to Machine Learning Alireza Akhavan Pour CLASS.VISION چهارشنبه – ۱۶ اسفند ۱۳۹۶

Machine Learning Overview چهارشنبه – ۱۶ اسفند ۱۳۹۶

Let’s discuss some basic machine learning concepts to set a foundation for future lectures! We’ll talk about supervised learning, unsupervised learning, reinforcement learning, evaluation methods, and more. چهارشنبه – ۱۶ اسفند ۱۳۹۶

Unlike typical computer programs, machine learning techniques will iteratively learn from data. ML algorithms can find insights in data, even if they aren’t specifically instructed what to look for in the data. چهارشنبه – ۱۶ اسفند ۱۳۹۶

Unsupervised Learning Reinforcement Learning In this course we’ll discuss three major types of machine learning algorithms Supervised Learning Unsupervised Learning Reinforcement Learning We’ll also touch on other topics, such as word embeddings with Word2Vec چهارشنبه – ۱۶ اسفند ۱۳۹۶

Supervised Learning Supervised Learning uses labeled data to predict a label given some features. If the label is continuous its called a regression problem, if its categorical it is a classification problem. چهارشنبه – ۱۶ اسفند ۱۳۹۶

Supervised Learning Supervised Learning - Classification Features: Height and Weights Label: Gender Task: Given a person’s height and weight, predict their gender. چهارشنبه – ۱۶ اسفند ۱۳۹۶

Supervised Learning Supervised Learning - Classification Male Height Female Weight چهارشنبه – ۱۶ اسفند ۱۳۹۶

Supervised Learning Supervised Learning - Classification ? Male Height Female Weight چهارشنبه – ۱۶ اسفند ۱۳۹۶

Supervised Learning Supervised Learning - Classification Male Height Female Weight چهارشنبه – ۱۶ اسفند ۱۳۹۶

Supervised Learning Supervised Learning - Regression Features: Square Footage, Rooms Label: House Price Task: Given a house’s size and number of rooms, predict the selling price. چهارشنبه – ۱۶ اسفند ۱۳۹۶

Supervised Learning Supervised Learning - Regression Price Square Feet چهارشنبه – ۱۶ اسفند ۱۳۹۶

Supervised Learning Supervised Learning - Regression Price Square Feet چهارشنبه – ۱۶ اسفند ۱۳۹۶

Supervised Learning Supervised Learning - Regression Price Square Feet چهارشنبه – ۱۶ اسفند ۱۳۹۶

Supervised Learning Supervised Learning - Regression Price Square Feet چهارشنبه – ۱۶ اسفند ۱۳۹۶

Supervised Learning Supervised Learning has the model train on historical data that is already labeled (e.g. previous house sales). Once the model is trained, it can then be used on new data, where only the features are known, to attempt prediction چهارشنبه – ۱۶ اسفند ۱۳۹۶

Unsupervised Learning But what if you don’t have historical labels for your data? (You only have features) Since you have no “right answer” to fit on, you need to look for patterns in the data and find a structure. چهارشنبه – ۱۶ اسفند ۱۳۹۶

Unsupervised Learning Unsupervised Learning - Clustering Features: Heights and Weights for breeds of dogs. Label: No Label for unsupervised! Task: Cluster together the data into similar groups. It is then up to the data scientist to interpret the clusters. چهارشنبه – ۱۶ اسفند ۱۳۹۶

Unsupervised Learning Unsupervised Learning - Clustering Height Weight چهارشنبه – ۱۶ اسفند ۱۳۹۶

Unsupervised Learning Unsupervised Learning - Clustering Height Weight چهارشنبه – ۱۶ اسفند ۱۳۹۶

Unsupervised Learning Unsupervised Learning - Clustering Clustering won’t be able to tell you what the group labels should be. Only that the points in each cluster are similar to each other based off the features. Height Weight چهارشنبه – ۱۶ اسفند ۱۳۹۶

Reinforcement Learning What about machine learning tasks like have a computer learn to play a video game, drive a car, etc… ? Reinforcement learning works through trial and error which actions yield the greatest rewards. چهارشنبه – ۱۶ اسفند ۱۳۹۶

Reinforcement Learning Components Agent-Learning/Decision Maker Environment - What Agent interacts with Actions - What the Agent can do چهارشنبه – ۱۶ اسفند ۱۳۹۶

Reinforcement Learning The agent chooses actions that maximize some specified reward metric over a given amount of time. Learning the best policy with the environment and responding with the best actions. چهارشنبه – ۱۶ اسفند ۱۳۹۶

Reinforcement Learning Let’s walk through the basic machine learning process for a supervised learning problem. Afterwards we’ll discuss some key differences for unsupervised learning, as well discuss hold out data sets. چهارشنبه – ۱۶ اسفند ۱۳۹۶

Acquire Data from Some Source Acquisition چهارشنبه – ۱۶ اسفند ۱۳۹۶

Clean and Organize the Data Acquisition Data Cleaning چهارشنبه – ۱۶ اسفند ۱۳۹۶

Train Test Split Test Data Data Acquisition Cleaning Train چهارشنبه – ۱۶ اسفند ۱۳۹۶

Train/Fit Model on Training Data Test Data Acquisition Data Cleaning Train Model Train چهارشنبه – ۱۶ اسفند ۱۳۹۶

Evaluate Model on Test Data Acquisition Data Cleaning Train Model Evaluate Model Train چهارشنبه – ۱۶ اسفند ۱۳۹۶

Adjust Model Parameters Test Data Acquisition Data Cleaning Train Model Evaluate Model Train Adjust Model Parameters چهارشنبه – ۱۶ اسفند ۱۳۹۶

Deploy Model on New Incoming Data Test Data Acquisition Data Cleaning Train Model Evaluate Model Deploy Model Train Adjust Model Parameters چهارشنبه – ۱۶ اسفند ۱۳۹۶

Unsupervised Learning Data Acquisition Data Cleaning Train Model Evaluate Model Deploy Model Train Adjust Model Parameters چهارشنبه – ۱۶ اسفند ۱۳۹۶

Adjust Model Parameters Hold Out Sets Test Dev (Validation) Deploy Data Acquisition Data Cleaning Train Model Test Model Evaluate Model Train Adjust Model Parameters چهارشنبه – ۱۶ اسفند ۱۳۹۶

Finally let’s quickly discuss model evaluation, we’ll dive into more details for certain problems later on in the course. چهارشنبه – ۱۶ اسفند ۱۳۹۶

Supervised Learning -Classification Eval Accuracy , Recall, Precision Accuracy - Correctly Classified divided by total samples. Which metric is the most important depends on the specific situation چهارشنبه – ۱۶ اسفند ۱۳۹۶

Supervised Learning -Regression Eval MAE, MSE, RMSE All are measurements of: On average, how far off are you from the correct continuous value. چهارشنبه – ۱۶ اسفند ۱۳۹۶

Unsupervised Learning - Evaluation Much harder to evaluate, depends on overall goal of the task Never had “Correct Labels” to compare to Cluster Homogeneity, Rand Index چهارشنبه – ۱۶ اسفند ۱۳۹۶

Unsupervised Learning - Evaluation چهارشنبه – ۱۶ اسفند ۱۳۹۶

Types of Machine Learning Machine Learning Process Evaluation Metrics Review Machine Learning Types of Machine Learning Machine Learning Process Evaluation Metrics چهارشنبه – ۱۶ اسفند ۱۳۹۶

منابع https://docs.google.com/presentation/d/1kytN5gqrZe0Bcu3hExjZpA4PQipaFEKn26S-JhqncXk/edit#slide=id.g1ff36e20db_0_450 چهارشنبه – ۱۶ اسفند ۱۳۹۶