CS 446: Machine Learning Dan Roth

Slides:



Advertisements
Similar presentations
Introduction to Machine Learning BITS C464/BITS F464
Advertisements

Godfather to the Singularity
INTRODUCTION TO MACHINE LEARNING David Kauchak CS 451 – Fall 2013.
CSE 5522: Survey of Artificial Intelligence II: Advanced Techniques Instructor: Alan Ritter TA: Fan Yang.
1 i206: Distributed Computing Applications & Infrastructure 2012
CS/CMPE 535 – Machine Learning Outline. CS Machine Learning (Wi ) - Asim LUMS2 Description A course on the fundamentals of machine.
CS 232 Geometric Algorithms: Lecture 1 Shang-Hua Teng Department of Computer Science, Boston University.
Machine Learning Bob Durrant School of Computer Science
Machine Learning (Extended) Dr. Ata Kaban
213: User Interface Design & Development Professor: Tapan Parikh TA: Eun Kyoung Choe
EECS 349 Machine Learning Instructor: Doug Downey Note: slides adapted from Pedro Domingos, University of Washington, CSE
Introduction to Artificial Intelligence Prof. Kathleen McKeown 722 CEPSR, TAs: Kapil Thadani 724 CEPSR, Phong Pham TA Room.
Learning Programs Danielle and Joseph Bennett (and Lorelei) 4 December 2007.
1 Introduction LING 575 Week 1: 1/08/08. Plan for today General information Course plan HMM and n-gram tagger (recap) EM and forward-backward algorithm.
COMP 14 – 02: Introduction to Programming Andrew Leaver-Fay August 31, 2005 Monday/Wednesday 3-4:15 pm Peabody 217 Friday 3-3:50pm Peabody 217.
Fall 2004 Cognitive Science 207 Introduction to Cognitive Modeling Praveen Paritosh.
CS 5941 CS583 – Data Mining and Text Mining Course Web Page 05/cs583.html.
Introduction to Artificial Neural Network and Fuzzy Systems
Introduction to machine learning
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
General Information Course Id: COSC6342 Machine Learning Time: MO/WE 2:30-4p Instructor: Christoph F. Eick Classroom:SEC 201
1 CS598 DNR FALL 2005 Machine Learning in Natural Language Dan Roth University of Illinois, Urbana-Champaign
CSCI 347 – Data Mining Lecture 01 – Course Overview.
1 CS546 Spring 2009 Machine Learning in Natural Language Dan Roth & Ivan Titov SC Wed/Fri 9:30  What’s.
Xiaoying Sharon Gao Mengjie Zhang Computer Science Victoria University of Wellington Introduction to Artificial Intelligence COMP 307.
CS 103 Discrete Structures Lecture 01 Introduction to the Course
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, 2.
1 CS 512 Machine Learning Berrin Yanikoglu Slides are expanded from the Machine Learning-Mitchell book slides Some of the extra slides thanks to T. Jaakkola,
Machine Learning An Introduction. What is Learning?  Herbert Simon: “Learning is any process by which a system improves performance from experience.”
Prof. Barbara Bernal NEW Office in J 126 Office Hours: M 4pm - 5:30 PM Class Lecture: M 6 PM - 8:30 in J133 Weekly Web Lecture between Tuesday to Sunday.
CMSC 671 Principles of Artificial Intelligence Course Overview Fall 2015.
Mehdi Ghayoumi MSB rm 132 Ofc hr: Thur, a Machine Learning.
Machine Learning Lecture 1. Course Information Text book “Introduction to Machine Learning” by Ethem Alpaydin, MIT Press. Reference book “Data Mining.
Lecture 10: 8/6/1435 Machine Learning Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
CS 6961: Structured Prediction Fall 2014 Course Information.
CS 445/545 Machine Learning Winter, 2012 Course overview: –Instructor Melanie Mitchell –Textbook Machine Learning: An Algorithmic Approach by Stephen Marsland.
CMSC 671 Introduction to Artificial Intelligence Course Overview Fall 2012.
INTRODUCTIONCS446 Fall ’15 CS 446: Machine Learning Dan Roth University of Illinois, Urbana-Champaign
1 Machine Learning (Extended) Dr. Ata Kaban Algorithms to enable computers to learn –Learning = ability to improve performance automatically through experience.
1 Machine Learning 1.Where does machine learning fit in computer science? 2.What is machine learning? 3.Where can machine learning be applied? 4.Should.
AdvancedBioinformatics Biostatistics & Medical Informatics 776 Computer Sciences 776 Spring 2002 Mark Craven Dept. of Biostatistics & Medical Informatics.
IST 210: Organization of Data
Course Overview  What is AI?  What are the Major Challenges?  What are the Main Techniques?  Where are we failing, and why?  Step back and look at.
Machine Learning Introduction. Class Info Office Hours –Monday:11:30 – 1:00 –Wednesday:10:00 – 1:00 –Thursday:11:30 – 1:00 Course Text –Tom Mitchell:
CS 445/545 Machine Learning Winter, 2014 See syllabus at
1 Intro to Artificial Intelligence COURSE # CSC384H1F Fall 2008 Sonya Allin Note: many slides drawn from/inspired by Andrew Moore’s lectures at CMU and.
Please initial the attendance roster near the door. If you are on the Wait List you will find your name at the bottom. If you are not on the roster, please.
SPELLING, RESEARCH, HYPERLINKS, PROPERTIES 10/16/13.
Introduction: What is AI? CMSC Introduction to Artificial Intelligence January 3, 2002.
Mining of Massive Datasets Edited based on Leskovec’s from
Introduction: What is AI? CMSC Introduction to Artificial Intelligence January 7, 2003.
General Information Course Id: COSC6342 Machine Learning Time: TU/TH 1-2:30p Instructor: Christoph F. Eick Classroom:AH301
FNA/Spring CENG 562 – Machine Learning. FNA/Spring Contact information Instructor: Dr. Ferda N. Alpaslan
Introduction to Machine Learning Original Version by Prof. Nati Srebro-Bartom Modifications by Nir Ailon Lecture 0: What is Machine Learning? What.
Brief Intro to Machine Learning CS539
Networking CS 3470, Section 1 Sarah Diesburg
CS 446: Machine Learning Dan Roth

Done Done Course Overview What is AI? What are the Major Challenges?
CH. 1: Introduction 1.1 What is Machine Learning Example:
Artificial Intelligence (CS 461D)
Artificial Intelligence (CS 370D)
CIS 519/419 Applied Machine Learning
Special Topics in Data Mining Applications Focus on: Text Mining
What is Pattern Recognition?
Other Problems due to Computers
CIS 519/419 Applied Machine Learning
Welcome! Knowledge Discovery and Data Mining
Course Introduction Data Visualization & Exploration – COMPSCI 590
CS201 – Course Expectations
Presentation transcript:

CS 446: Machine Learning Dan Roth University of Illinois, Urbana-Champaign danr@illinois.edu http://L2R.cs.uiuc.edu/~danr 3322 SC

CS446: Machine Learning Tuesday, Thursday: 12:30pm-1:45pm 1320 DCL Office hours: Tuesday 1:45-3:30 pm [my office] TAs: Kai-Wei Chang Bryan Lunt Chen-Tse Tsai Rongda Zhu Assignments: 7 +/- 1 Problems sets (Programming) Class exercises; Discussion sections Mid Term Exam Project Final Mitchell/Other Books/ Lecture notes /Literature Registration to Class

CS446 Machine Learning: Today What is Learning? Who are you? What is CS446 about?

What is Learning The Badges Game…… Who are you? Badges game Don’t give me the answer Start thinking about how to write a program that will figure out whether my name has + or – next to it.

An Owed to the Spelling Checker I have a spelling checker, it came with my PC It plane lee marks four my revue Miss steaks aye can knot sea. Eye ran this poem threw it, your sure reel glad two no. Its vary polished in it's weigh My checker tolled me sew. A checker is a bless sing, it freeze yew lodes of thyme. It helps me right awl stiles two reed And aides me when aye rime. Each frays come posed up on my screen Eye trussed to bee a joule...

Machine learning is everywhere

Applications: Spam Detection This is a binary classification task: Assign one of two labels (i.e. yes/no) to the input (here, an email message) Classification requires a model (a classifier) to determine which label to assign to items. In this class, we study algorithms and techniques to learn such models from data.

Ambiguity Resolution Can I have a peace of cake ? piece ...Nissan Car and truck plant is … …divide life into plant and animal kingdom Buy a car with a steering wheel (his money) (This Art) (can N) (will MD) (rust V) V,N,N The dog bit the kid. He was taken to a veterinarian hospital Learn a function that maps observations in the domain to one of several categories or <.

Comprehension (ENGLAND, June, 1989) - Christopher Robin is alive and well. He lives in England. He is the same person that you read about in the book, Winnie the Pooh. As a boy, Chris lived in a pretty home called Cotchfield Farm. When Chris was three years old, his father wrote a poem about him. The poem was printed in a magazine for others to read. Mr. Robin then wrote a book. He made up a fairy tale land where Chris lived. His friends were animals. There was a bear called Winnie the Pooh. There was also an owl and a young pig, called a piglet. All the animals were stuffed toys that Chris owned. Mr. Robin made them come to life with his words. The places in the story were all near Cotchfield Farm. Winnie the Pooh was written in 1925. Children still love to read about Christopher Robin and his animal friends. Most people don't know he is a real person who is grown now. He has written two books of his own. They tell what it is like to be famous. This is an Inference Problem; where is the learning? 1. Christopher Robin was born in England. 2. Winnie the Pooh is a title of a book. 3. Christopher Robin’s dad was a magician. 4. Christopher Robin must be at least 65 now.

Learning Learning is at the core of Learning has multiple purposes Understanding High Level Cognition Performing knowledge intensive inferences Building adaptive, intelligent systems Dealing with messy, real world data Learning has multiple purposes Knowledge Acquisition Integration of various knowledge sources to ensure robust behavior Adaptation (human, systems)

Learning = Generalization H. Simon - “Learning denotes changes in the system that are adaptive in the sense that they enable the system to do the task or tasks drawn from the same population more efficiently and more effectively the next time.” The ability to perform a task in a situation which has never been encountered before

Learning = Generalization The learner has to be able to classify items it has never seen before. Mail thinks this message is junk mail. Not junk

Learning = Generalization The ability to perform a task in a situation which has never been encountered before Classification Medical diagnosis; credit card applications; hand-written letters Planning and acting Navigation; game playing (chess, backgammon); driving a car Skills Balancing a pole; playing tennis Common sense reasoning Natural language interactions Generalization depends on Representation as much as it depends on the Algorithm used.

Why Study Learning? Computer systems with new capabilities. Develop systems that are too difficult to impossible to construct manually . Develop systems that can automatically adapt and customize themselves to the needs of the individual users through experience. Discover knowledge and patterns in databases, database mining, e.g. discovering purchasing patterns for marketing purposes.

Why Study Learning? Computer systems with new capabilities. Understand human and biological learning Understanding teaching better.

Why Study Learning? Computer systems with new capabilities. Understand human and biological learning Understanding teaching better. Time is right. Initial algorithms and theory in place. Growing amounts of on-line data Computational power available. Necessity: many things we want to do cannot be done by “programming”.

Learning is the future Looking for a student with strong interests and expertise in Programming Languages Learning techniques will be a basis for every application that involves a connection to the messy real world Basic learning algorithms are ready for use in applications today Prospects for broader future applications make for exciting fundamental research and development opportunities Many unresolved issues – Theory and Systems

Work in Machine Learning Artificial Intelligence; Theory; Experimental CS Makes Use of: Probability and Statistics; Linear Algebra; Statistics; Theory of Computation; Related to: Philosophy, Psychology (cognitive, developmental), Neurobiology, Linguistics, Vision, Robotics,…. Has applications in: AI (natural Language; Vision; Planning; HCI) Engineering (Agriculture; Civil; …) Computer Science (Compilers; Architecture; Systems; data bases) Very active field What to teach? The fundamental paradigms Some of the most important algorithmic ideas Modeling And: what we don’t know

Course Overview Who knows DTs ? Who knows NNs ? Introduction: Basic problems and questions A detailed example: Linear threshold units Two Basic Paradigms: PAC (Risk Minimization) Bayesian theory Learning Protocols: Supervised; Unsupervised; Semi-supervised Algorithms Decision Trees (C4.5) [Rules and ILP (Ripper, Foil)] Linear Threshold Units (Winnow; Perceptron; Boosting; SVMs; Kernels) [Neural Networks (Backpropagation)] Probabilistic Representations (naïve Bayes; Bayesian trees; Densities) Unsupervised /Semi supervised: EM Clustering; Dimensionality Reduction Who knows DTs ? Who knows NNs ?

CS446: Machine Learning Tuesday, Thursday: 12:30pm-1:45pm 1320 DCL Office hours: Tuesday 1:45-3:30 pm [my office] TAs: Kai-Wei Chang Bryan Lunt Chen-Tse Tsai Rongda Zhu Assignments: 7 +/- 1 Problems sets (Programming) Class exercises; Discussion sections Mid Term Exam Project Final Mitchell/Other Books/ Lecture notes /Literature Registration to Class Send me email after class Title: CS446 LastName, First Name, net id, Registration Body: how you are; any information you want to share

CS446: Machine Learning Theory of Computation What do you need to know: Theory of Computation Probability Theory Linear Algebra Programming (Java; your favorite language; some Matlab) Homework 0 – on the web Who is the class for? Future Machine Learning researchers/Advanced users

Note also the Schedule Page and our Notes CS446: Policies Cheating No. We take it very seriously. Homework: Collaboration is encouraged But, you have to write your own solution/program. (Please don’t use old solutions) Late Policy: You have a credit of 4 days (4*24hours); That’s it. Grading: Possibly separate for grads/undergrads. 5% Class work; 25% - homework; 30%-midterm; 40%-final; Projects: 25% (4 hours) Questions? Info page Note also the Schedule Page and our Notes

CS446 Team Dan Roth (3323 Siebel) Tuesday, 2:00 PM – 3:00 PM TAs (4407 Siebel) Kai-Wei Chang: Tuesday 5:00pm-6:00pm (3333 SC) Chen-Tse Tsai: Tuesday 8:00pm-9:00pm (3333 SC) Bryan Lunt: Tue 8:00pm-9:00pm (1121 SC) Rongda Zhu: Thursday 4:00pm-5:00pm (2205 SC) Discussion Sections: (starting next week; no Monday) Mondays:  5:00pm-6:00pm 3405-SC Bryan Lunt [A-F] Tuesday:  5:00pm-6:00pm 3401-SC Chen-Tse Tsai [G-L] Wednesdays:  5:30pm-6:30pm 3405-SC Kai-Wei Chang [M-S] Thursdays:  5:00pm-6:00pm 3403-SC Rongda Zhu [T-Z]

CS446 on the web Check our class website: Schedule, slides, videos, policies http://l2r.cs.uiuc.edu/~danr/Teaching/CS446-14/index.html Sign up, participate in our Piazza forum: Announcements and discussions https://piazza.com/class#fall2014/cs446 Log on to Compass: Submit assignments, get your grades https://compass2g.illinois.edu

What is Learning The Badges Game…… This is an example of the key learning protocol: supervised learning First question: Are you sure you got it? Why? Issues: Prediction or Modeling? Representation Problem setting Background Knowledge When did learning take place? Algorithm Badges game Don’t give me the answer Start thinking about how to write a program that will figure out whether my name has + or – next to it.