Machine Learning – a Probabilistic Perspective

Machine Learning – a Probabilistic Perspective: Introduction. Cui Jiaqi, 2018.4.8

The goal of machine learning is to develop methods that can automatically detect patterns in data, and then to use the uncovered patterns to predict future data or other outcomes of interest.

Types of machine learning
- Type 1: the predictive or supervised learning approach
- Type 2: the descriptive or unsupervised learning approach
- Type 3: reinforcement learning, i.e. learning how to act or behave when given occasional reward or punishment signals

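Supervised learning
- Classification: the goal is to make predictions on novel inputs. The best guess for the label is the most probable class, the MAP (maximum a posteriori) estimate: $\hat{y} = \operatorname{argmax}_{c} \, p(y = c \mid \mathbf{x}, \mathcal{D})$
- Regression: like classification, except that the response variable is continuous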
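As a minimal sketch of the MAP estimate on this slide: given the posterior probability of each class for one input, the prediction is the class with the largest probability. The function name `map_estimate` and the example posterior are illustrative assumptions, not part of the original slides.

```python
def map_estimate(posterior):
    """Return the class label with the highest posterior probability.

    `posterior` maps each class label c to p(y = c | x, D).
    """
    return max(posterior, key=posterior.get)


# Hypothetical posterior over three classes for a single test input.
posterior = {"cat": 0.2, "dog": 0.7, "bird": 0.1}
y_hat = map_estimate(posterior)
```

Here `y_hat` is the mode of the posterior; how the posterior itself is obtained depends on the model.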

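Unsupervised learning
We are just given output data, without any inputs. There are two differences from the supervised case:
- We model $p(\mathbf{x}_i \mid \boldsymbol{\theta})$ instead of $p(y_i \mid \mathbf{x}_i, \boldsymbol{\theta})$; that is, supervised learning is conditional density estimation, whereas unsupervised learning is unconditional density estimation.
- $\mathbf{x}_i$ is a vector of features, so we need to create multivariate probability models.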

Discovering clusters
- First goal: to estimate the distribution over the number of clusters, $p(K \mid \mathcal{D})$
- Second goal: to estimate which cluster each point belongs to. Let $z_i \in \{1, \dots, K\}$ represent the cluster to which data point $i$ is assigned.
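To make the assignment variable $z_i$ concrete, here is a minimal sketch that gives each point a hard cluster label by picking the nearest of a set of known centers. This nearest-center rule is a stand-in chosen for illustration (it is the assignment step used in K-means); the slides do not name a specific clustering algorithm, and the centers and points below are made up.

```python
import math


def assign_clusters(points, centers):
    """Return z, where z[i] is the index of the center closest to points[i]."""
    return [
        min(range(len(centers)), key=lambda k: math.dist(p, centers[k]))
        for p in points
    ]


# Hypothetical 2-D data with two assumed cluster centers.
points = [(0.0, 0.0), (0.2, 0.1), (5.0, 5.0), (4.8, 5.1)]
centers = [(0.0, 0.0), (5.0, 5.0)]
z = assign_clusters(points, centers)
```

A probabilistic model would instead compute a distribution over each $z_i$; the hard assignment above corresponds to taking its most probable value.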

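Discovering latent factors
- Dimensionality reduction; the most common approach is principal components analysis (PCA)

Discovering graph structure
- We measure a set of correlated variables and want to discover which ones are most correlated with which others. This can be represented by a graph $G$.
- Two uses: to discover new knowledge, and to get better joint probability density estimators

Matrix completion
- Filling in plausible values for the missing entries of a data matrix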
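For the dimensionality reduction item, the following is a rough sketch of finding the first principal component with power iteration on the sample covariance matrix, using only the standard library. The function name, the fixed iteration count, and the toy data are assumptions made for illustration; a production implementation would normally use an SVD or eigendecomposition routine.

```python
import math


def first_principal_component(data, iters=100):
    """Return (mean, v): the data mean and a unit vector along the
    direction of largest variance, found by power iteration."""
    n, d = len(data), len(data[0])
    mean = [sum(row[j] for row in data) / n for j in range(d)]
    centered = [[row[j] - mean[j] for j in range(d)] for row in data]
    # Sample covariance matrix (d x d).
    cov = [[sum(r[a] * r[b] for r in centered) / n for b in range(d)]
           for a in range(d)]
    # Assumes the starting vector is not orthogonal to the top eigenvector.
    v = [1.0] * d
    for _ in range(iters):
        w = [sum(cov[a][b] * v[b] for b in range(d)) for a in range(d)]
        norm = math.sqrt(sum(c * c for c in w))
        v = [c / norm for c in w]
    return mean, v


# Hypothetical 2-D points lying close to the line y = x.
data = [(0.0, 0.0), (1.0, 1.1), (2.0, 1.9), (3.0, 3.0)]
mean, v = first_principal_component(data)
# One-dimensional representation: projection of each centered point onto v.
z = [sum((x[j] - mean[j]) * v[j] for j in range(2)) for x in data]
```

Each 2-D point is summarized by the single number in `z`, which is the sense in which the data has been reduced to a lower-dimensional latent factor.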

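K-nearest neighbors
$p(y = c \mid \mathbf{x}, \mathcal{D}, K) = \frac{1}{K} \sum_{i \in N_K(\mathbf{x}, \mathcal{D})} \mathbb{I}(y_i = c)$
- $N_K(\mathbf{x}, \mathcal{D})$ are the (indices of the) $K$ nearest points to $\mathbf{x}$ in $\mathcal{D}$
- $\mathbb{I}(e)$ is the indicator function, defined as follows: $\mathbb{I}(e) = 1$ if $e$ is true and $0$ if $e$ is false
- Distance is typically Euclidean; the method does not work well with high-dimensional inputs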
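The KNN rule on this slide can be sketched in a few lines: find the K training points closest to the query under Euclidean distance, then report the fraction of them carrying each label. The function name `knn_class_probs` and the toy training set are illustrative assumptions.

```python
import math
from collections import Counter


def knn_class_probs(x, data, k):
    """Estimate p(y = c | x, D, K) as the fraction of the K nearest
    training points (Euclidean distance) whose label is c."""
    nearest = sorted(data, key=lambda pair: math.dist(pair[0], x))[:k]
    counts = Counter(label for _, label in nearest)
    return {label: n / k for label, n in counts.items()}


# Hypothetical training set of (features, label) pairs.
train = [((0.0, 0.0), "a"), ((0.1, 0.2), "a"), ((1.0, 1.0), "b"),
         ((0.9, 1.1), "b"), ((0.2, 0.1), "a")]
probs = knn_class_probs((0.15, 0.15), train, k=3)
```

Sorting the whole training set costs O(N log N) per query, which is fine for a sketch but is usually replaced by a spatial index for large datasets.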

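Linear regression
- Model: $y(\mathbf{x}) = \mathbf{w}^T \mathbf{x} + \epsilon$
- The connection between linear regression and Gaussians: with Gaussian noise $\epsilon \sim \mathcal{N}(0, \sigma^2)$, $p(y \mid \mathbf{x}, \boldsymbol{\theta}) = \mathcal{N}(y \mid \mathbf{w}^T \mathbf{x}, \sigma^2)$
- Polynomial regression: replace $\mathbf{x}$ with a non-linear basis expansion, e.g. $\phi(x) = [1, x, x^2, \dots, x^d]$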
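As a small illustration of the linear regression slide, the sketch below fits a one-dimensional line by least squares, which is the maximum likelihood solution when the noise is assumed Gaussian, and shows the polynomial basis expansion used for polynomial regression. The function names and the toy data are assumptions for illustration only.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = w0 + w1 * x for scalar inputs."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    w1 = sxy / sxx
    w0 = my - w1 * mx
    return w0, w1


def poly_features(x, degree):
    """Basis expansion phi(x) = [1, x, x^2, ..., x^degree]."""
    return [x ** j for j in range(degree + 1)]


# Hypothetical noise-free data generated from y = 1 + 2x.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]
w0, w1 = fit_line(xs, ys)
phi = poly_features(2.0, 3)
```

Polynomial regression is still linear in the weights: the same least-squares machinery applies once each input is replaced by `poly_features(x, degree)`.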

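Logistic regression
- Generalizes linear regression to the (binary) classification setting
- Replace the Gaussian distribution for $y$ with a Bernoulli distribution: $p(y \mid \mathbf{x}, \mathbf{w}) = \mathrm{Ber}(y \mid \mu(\mathbf{x}))$
- Compute a linear combination of the inputs, but then pass this through a function that ensures $0 \le \mu(\mathbf{x}) \le 1$ by defining $\mu(\mathbf{x}) = \mathrm{sigm}(\mathbf{w}^T \mathbf{x})$, where $\mathrm{sigm}(\eta) = \frac{1}{1 + \exp(-\eta)}$ is the sigmoid (logistic) function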
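The squashing step described on this slide can be sketched as follows: a linear combination of the inputs is passed through the sigmoid so that the result is a valid Bernoulli parameter. The weights and input below are made-up values, and `predict_proba` is a hypothetical helper name.

```python
import math


def sigm(eta):
    """Sigmoid function 1 / (1 + exp(-eta)), written to avoid overflow
    for large negative eta."""
    if eta >= 0:
        return 1.0 / (1.0 + math.exp(-eta))
    z = math.exp(eta)
    return z / (1.0 + z)


def predict_proba(w, x):
    """Return mu(x) = sigm(w^T x), the modelled probability that y = 1."""
    eta = sum(wj * xj for wj, xj in zip(w, x))
    return sigm(eta)


# Hypothetical weights and input; the first feature acts as a bias term.
w = [-1.0, 2.0]
x = [1.0, 0.5]
mu = predict_proba(w, x)
```

Thresholding `mu` at 0.5 turns the probability into a hard class decision, which gives a linear decision boundary in the input space.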

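Model selection
- Misclassification rate: the fraction of examples the classifier labels incorrectly, $\mathrm{err}(f, \mathcal{D}) = \frac{1}{N} \sum_{i=1}^{N} \mathbb{I}(f(\mathbf{x}_i) \ne y_i)$
- Validation set: it is common to use about 80% of the data for the training set and 20% for the validation set
- Cross validation: split the training data into $K$ folds; for each fold, train on all the other folds and test on the held-out one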
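The three ideas on this slide can be sketched with the standard library alone: a misclassification rate, an 80/20 train/validation split, and index generation for K-fold cross validation. All function names, the seed, and the interleaved fold layout are illustrative choices, not something the slides prescribe.

```python
import random


def misclassification_rate(y_true, y_pred):
    """Fraction of examples whose predicted label differs from the truth."""
    wrong = sum(1 for t, p in zip(y_true, y_pred) if t != p)
    return wrong / len(y_true)


def train_validation_split(n, train_fraction=0.8, seed=0):
    """Shuffle the indices 0..n-1 and split them into train and validation."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    cut = int(round(train_fraction * n))
    return idx[:cut], idx[cut:]


def k_fold_indices(n, k):
    """Yield (train, validation) index lists, one pair per fold."""
    folds = [list(range(i, n, k)) for i in range(k)]
    for i in range(k):
        train = [j for f in folds[:i] + folds[i + 1:] for j in f]
        yield train, folds[i]


train_idx, val_idx = train_validation_split(10)
folds = list(k_fold_indices(6, 3))
err = misclassification_rate([0, 1, 1, 0], [0, 1, 0, 0])
```

In practice a model is fit on each `train` index set and scored with `misclassification_rate` on the matching validation set, and the scores are averaged across folds to compare candidate models.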