Machine Learning Ali Ghodsi Department of Statistics


Machine Learning. Ali Ghodsi, Department of Statistics, School of Computer Science, University of Waterloo, aghodsib@uwaterloo.ca, Winter 2009

Two Problems. Classical statistics: infer information from small data sets (not enough data). Machine learning: infer information from large data sets (too much data).

“We are drowning in information and starving for knowledge.” (Rutherford D. Rogers)

Other Names for ML: data mining, applied statistics, adaptive (stochastic) signal processing, and probabilistic planning or reasoning are all closely related to the second problem.

Applications: Machine learning is most useful when the structure of the task is not well understood but can be characterized by a dataset with strong statistical regularity. Examples: search and recommendation (e.g. Google, Amazon); automatic speech recognition and speaker verification; text parsing; face identification; tracking objects in video; financial prediction and fraud detection (e.g. credit cards); medical diagnosis.

More Applications: further science and technology applications include handwriting identification; drug discovery (identifying the biological activity of chemical compounds using features that describe their chemical structures); and gene expression analysis (thousands of features with only dozens of observations).

Tasks. Supervised learning: given examples of inputs and corresponding desired outputs, predict outputs on future inputs (e.g. classification, regression). Unsupervised learning: given only inputs, automatically discover representations, features, structure, etc. (e.g. clustering, dimensionality reduction, feature extraction).
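The contrast between the two tasks can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the toy data, the labels "small" and "large", and the helper names `predict` and `two_means` are all hypothetical and do not come from the slides. A 1-nearest-neighbour rule stands in for supervised learning and a tiny two-cluster k-means stands in for unsupervised learning.

```python
# Supervised: labelled examples (input, desired output) are given.
# The data and labels are made up for illustration.
train = [(1.0, "small"), (1.2, "small"), (3.9, "large"), (4.1, "large")]

def predict(x):
    """1-nearest-neighbour: return the label of the closest training input."""
    _, label = min(train, key=lambda pair: abs(pair[0] - x))
    return label

# Unsupervised: only inputs are given; structure has to be discovered.
inputs = [1.0, 1.2, 3.9, 4.1]

def two_means(xs, iterations=10):
    """Tiny 1-D k-means with k=2, initialised at the smallest and largest input."""
    centers = [min(xs), max(xs)]
    groups = ([], [])
    for _ in range(iterations):
        groups = ([], [])
        for x in xs:
            nearest = 0 if abs(x - centers[0]) <= abs(x - centers[1]) else 1
            groups[nearest].append(x)
        # Keep the old center if a group happens to be empty.
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return centers, groups

label_for_new_input = predict(1.1)
centers, groups = two_means(inputs)
```

The supervised rule needs the desired outputs in `train`; the clustering routine sees only `inputs` and recovers the two groups on its own.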

The Big Picture (Weinberger et al., 2004)

Manifold and Hidden Variables

Dimensionality Reduction. Dimensionality: the number of measurements available for each item in a data set. The dimensionality of real-world items is very high; for example, the dimensionality of a 600 by 600 image is 360,000. The key to analyzing data is comparing these measurements to find relationships among this plethora of data points. Usually these measurements are highly redundant, and relationships among data points are predictable.
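The 360,000 figure follows from treating every pixel as one measurement. A minimal sketch, assuming a grayscale image with a single value per pixel (the all-zero placeholder image is hypothetical):

```python
height, width = 600, 600

# Placeholder grayscale image: one intensity value per pixel (an assumption).
image = [[0] * width for _ in range(height)]

# Flatten the 2-D grid into a single measurement vector, row by row.
vector = [pixel for row in image for pixel in row]

dimensionality = len(vector)  # 600 * 600 measurements for this one item
```

A colour image with three channels per pixel would have three times as many measurements under the same convention.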

Dimensionality Reduction. Knowing the value of a pixel in an image, it is easy to predict the value of nearby pixels, since they tend to be similar. Knowing that the word “corporation” occurs often in articles about economics but not very often in articles about art and poetry, it is easy to predict that it will not occur very often in articles about love. Although there are many measurements per item, far fewer are likely to vary. Using a data set that includes only the measurements likely to vary allows humans to quickly and easily recognize changes in high-dimensional data.
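The pixel redundancy described here can be checked numerically. The sketch below is an illustration only: the synthetic image (a smooth gradient plus a little Gaussian noise), the image size, and the helper names `pearson` and `neighbour_correlation` are assumptions, not material from the slides. It measures the correlation between each pixel and its right-hand neighbour, then repeats the measurement after shuffling the pixels so that the spatial structure is destroyed.

```python
import math
import random

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

def neighbour_correlation(img):
    """Correlation between every pixel and the pixel immediately to its right."""
    left = [row[c] for row in img for c in range(len(row) - 1)]
    right = [row[c + 1] for row in img for c in range(len(row) - 1)]
    return pearson(left, right)

random.seed(0)
size = 50

# Synthetic smooth image: intensity grows along rows and columns, plus noise.
image = [[r + c + random.gauss(0, 1) for c in range(size)] for r in range(size)]

# Same pixel values, but shuffled so neighbours are no longer related.
pixels = [p for row in image for p in row]
random.shuffle(pixels)
shuffled = [pixels[r * size:(r + 1) * size] for r in range(size)]

smooth_corr = neighbour_correlation(image)       # close to 1
shuffled_corr = neighbour_correlation(shuffled)  # close to 0
```

In the smooth image a pixel is a good predictor of its neighbour, which is the redundancy that dimensionality reduction exploits; in the shuffled image that predictability is gone even though the set of pixel values is identical.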

Data Representation

Hastie et al., 2001

[Figure: matrix dimensions 2 by 103, 644 by 103, and 644 by 2; two 23 by 28 images, each paired with a 2 by 1 vector; values shown: -2.19, -0.02, -3.19, 1.02]

Hastie et al., 2001

Tenenbaum et al., 2000

Roweis & Saul 2000

Arranging words: each word was initially represented by a high-dimensional vector that counted the number of times it appeared in different encyclopedia articles. Words with similar contexts are placed close together. (Roweis & Saul, 2000)
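The count-vector representation can be sketched as follows. This is a toy illustration under stated assumptions: the four short "articles" are invented stand-ins for encyclopedia articles, and `word_vector` and `cosine` are hypothetical helper names. The code only builds the count vectors and compares them with cosine similarity; it does not perform the low-dimensional embedding shown on the slide.

```python
import math

# Invented stand-ins for encyclopedia articles (illustration only).
articles = [
    "the corporation reported profit and the market rose",
    "profit fell as the corporation cut costs in a weak market",
    "the poem speaks of love and the sea",
    "a poem about love and loss",
]

def word_vector(word):
    """One dimension per article: how many times the word appears in it."""
    return [article.split().count(word) for article in articles]

def cosine(u, v):
    """Cosine similarity between two count vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

corporation = word_vector("corporation")
profit = word_vector("profit")
love = word_vector("love")

same_context = cosine(corporation, profit)   # words sharing articles
different_context = cosine(corporation, love)  # words never sharing an article
```

With a real corpus each vector would have one entry per article, so the representation is high-dimensional, and words that occur in the same articles end up with similar vectors.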

Roweis & Hinton 2001