CS6501: Text Mining Course Policy

Slides:



Advertisements
Similar presentations
CS6501: Text Mining Course Policy
Advertisements

Introduction to CS170. CS170 has multiple sections Each section has its own class websites URLs for different sections: Section 000:
Welcome to MAT 170. Basic Course Information Instructor Office Office Hours Beth Jones PSA 725 Wednesday and Friday 10:40 am – 11:30 am and by appointment.
OBJECT ORIENTED PROGRAMMING I LECTURE 1 GEORGE KOUTSOGIANNAKIS
COMP 110 Introduction to Programming Jingdan Zhang June 20, 2007 MTWRF 9:45-11:15 am Sitterson Hall 014.
1 EEL 6935: Embedded Systems Seminar. 2 General Information Instructor: Ann Gordon-Ross Office: Benton Office Hours – By appointment.
CS6501 Information Retrieval Course Policy Hongning Wang
Cpt S 471/571: Computational Genomics Spring 2015, 3 cr. Where: Sloan 9 When: M WF 11:10-12:00 Instructor weekly office hour for Spring 2015: Tuesdays.
Math 125 Statistics. About me  Nedjla Ougouag, PhD  Office: Room 702H  Ph: (312)   Homepage:
1 EEL 6935: Embedded Systems Seminar. 2 General Information Instructor: Ann Gordon-Ross Office: Benton Office Hours – By appointment.
COMP Introduction to Programming Yi Hong May 13, 2015.
CS6501 Information Retrieval Course Policy Hongning Wang
General Physics1 Welcome to Phys 130! Blackboard blackboard.siena.edu.
CS598CXZ (CS510) Advanced Topics in Information Retrieval (Fall 2014) Instructor: ChengXiang (“Cheng”) Zhai 1 Teaching Assistants: Xueqing Liu, Yinan Zhang.
1 Computer Networks ECE/CS 7457 Fall 2013 Malathi Veeraraghavan Professor Charles L. Brown Department of Electrical & Computer Engineering Rice 407 Course.
Welcome to IIT and cs115!.
Understanding the Academic Structure of the US Classroom: Syllabus.
CSCI 51 Introduction to Computer Science Dr. Joshua Stough January 20, 2009.
1 Principles of Computer Science I Note Set 1 CSE 1341.
Introduction to Data Structures
CS 6961: Structured Prediction Fall 2014 Course Information.
Econ 3320 Managerial Economics (Fall 2015)
Lecture 1 Page 1 CS 111 Summer 2013 Introduction CS 111 Operating System Principles Peter Reiher.
Principles of Computer Science I Honors Section Note Set 1 CSE 1341 – H 1.
ACIS 3504 Accounting Systems and Controls. 2 Dr. Linda Wallace  Office: Pamplin 3092  
CIS 842: Specification and Verification of Reactive Systems Lecture ADM: Course Administration Copyright , Matt Dwyer, John Hatcliff, Robby. The.
Parallel and Distributed Computing Overview and Syllabus Professor Johnnie Baker Guest Lecturer: Robert Walker.
CMSC 2021 CMSC 202 Computer Science II for Majors Spring 2001 Sections Ms. Susan Mitchell.
General Physics1 Welcome to Phys 140!
1 Introduction to Data Communication Networks ISQS 3349, Spring 2000 Instructor: Zhangxi Lin Office: BA 708 Phone: Homepage:
CS151 Introduction to Digital Design Noura Alhakbani Prince Sultan University, College for Women.
Welcome to Astronomy 113 “ It would seem that you have no useful skill or talent whatsoever, he said.
Computer Science I ISMAIL ABUMUHFOUZ | CS 180. CS 180 Description BRIEF SUMMARY: This course covers a study of the algorithmic approach and the object.
INTE 290 Summer 2015.
Syllabus Highlights CSE 1310 – Introduction to Computers and Programming Alexandra Stefan University of Texas at Arlington 1.
CSc 120 Introduction to Computer Programing II
All important information will be posted on Blackboard
Andy Wang Object Oriented Programming in C++ COP 3330
CS6501 Advanced Topics in Information Retrieval Course Policy
Networking CS 3470, Section 1 Sarah Diesburg
CS101 Computer Programming I
MATH/COMP 340: Numerical Analysis I
EEL 6686: Embedded Systems Seminar
ACIS 3504 Accounting Systems and Controls
CMSC 471 Introduction to Artificial Intelligence section 1 Course Overview Spring 2017.
Syllabus Overview CSE 4309 – Machine Learning Vassilis Athitsos
CS598CXZ (CS510) Advanced Topics in Information Retrieval (Fall 2016)
Syllabus Overview CSE 6363 – Machine Learning Vassilis Athitsos
September 27 – Course introductions; Adts; Stacks and Queues
CS410: Text Information Systems (Spring 2018)
Cpt S 471/571: Computational Genomics
Andy Wang Object Oriented Programming in C++ COP 3330
MA Fall 2016 Instructor: Matt Weaver Office: MATH 615
Introduction to Programming Using C++
Cpt S 471/571: Computational Genomics
Accelerated Introduction to Computer Science
ACIS 3504 Accounting Systems and Controls
CS4501: Information Retrieval Course Policy
Topics in Database Systems
CS 3950 Introduction to Computer Science Research
Intro to CIT 594
Syllabus Highlights CSE 1310 – Introduction to Computers and Programming Alexandra Stefan University of Texas at Arlington.
Miss Tower Chemistry Grading Scale 10th and 11th grade
Course policy.
Lecture 1a- Introduction
CS Computer Science II: Data Structures and Abstraction Fall 2009
Course Introduction Data Visualization & Exploration – COMPSCI 590
CS 232 Geometric Algorithms: Lecture 1
MA Fall Instructor: Tim Rolling -Office: MATH 719 -
MA Fall 2018 Instructor: Hunter Simper Office: Math 607
Presentation transcript:

CS6501: Text Mining Course Policy Hongning Wang CS@UVa

Goal of this course Discuss fundamental problems in text mining research Building blocks of text mining algorithms Wide coverage of many applications Document classification/clustering Topic modeling Sentiment analysis/recommendation Get hands-on experience by developing practical systems/components Prepare students for doing cutting-edge research in text mining and related fields Open the door to the amazing job opportunities in data science industry CS@UVa CS6501: Text Mining

Letters from former students I wanted to catch up with you and let you know that I’ve been working on a variety of text mining projects, and that the knowledge and experience I gained in your class has been vital. I have your slides saved on my computer and I frequently revisit them. Most recently I reviewed the sections on sense signatures for an automated lexicography pipeline I’m working on. Thank you for the resources and for the wonderful instruction! CS@UVa CS6501: Text Mining

Letters from former students I was a student in your Text Mining course in 2016. I have to say first that your course has been my most marketable skill coming out of school. Companies are not impressed with the surface knowledge, but what I learned from your class sets me far above other candidates. I can't thank you enough for offering this course. CS@UVa CS6501: Text Mining

Structure of this course Lecture based Six major topics will be covered E.g., NLP pipelines, classification/clustering models, and probabilistic topic models Introduce state-of-the-art large-scale text analytics techniques E.g., MapReduce framework, Apache Spark and GraphLab CS@UVa CS6501: Text Mining

Prerequisites Programming skills – Important! Math background Basic data structures: CS 2150 or equivalent Java is required for machine problems Most open source packages are written in Java! Any language you choose for the rest of this course Math background Probability Discrete/continuous distributions, expectation, moments Linear algebra Vector, matrix, dot product, matrix factorization Optimization Gradient-based methods, optimality conditions CS@UVa CS6501: Text Mining

Pop-up quiz Let a=(1,2,3) and b=(2,3,-2), the inner product between a and b is 0 (b) 1 (c) 2 (d) 3 Let A = ( 1 2 2 1 ), what is A-1, (a) ( −1 −2 −2 −1 ) (b) ( − 1 3 2 3 2 3 − 1 3 ) (c) ( 1 0 0 1 ) (d) ( 2 1 1 2 ) CS@UVa CS6501: Text Mining

Pop-up quiz What is the expectation of random variables drawn from Gaussian distribution N(0, 1), 0 (b) 0.5 (c) 1 (d) 2 A biased coin with P(head)=0.2, in a sequence of 10 consecutive tossing, you have already got 9 tails, what is the probability you will have a head at the 10th tossing, (a) 0 (b) 0.1 (c) 0.2 (d) 0.2*0.89 CS@UVa CS6501: Text Mining

Pop-up quiz Let a=(1,2,3) and b=(2,3,-2), the inner product between a and b is 0 (b) 1 (c) 2 (d) 3 Let A = ( 1 2 2 1 ), what is A-1, (a) ( −1 −2 −2 −1 ) (b) ( − 1 3 2 3 2 3 − 1 3 ) (c) ( 1 0 0 1 ) (d) ( 2 1 1 2 ) (c) (b) CS@UVa CS6501: Text Mining

Pop-up quiz What is the expectation of random variables drawn from Gaussian distribution N(0, 1), 0 (b) 0.5 (c) 1 (d) 2 A biased coin with P(head)=0.2, in a sequence of 10 consecutive tossing, you have already got 9 tails, what is the probability you will have a head at the 10th tossing, (a) 0 (b) 0.1 (c) 0.2 (d) 0.2*0.89 (a) (c) (d) CS@UVa CS6501: Text Mining

Grading policy Homework (35%) In-class quizzes (15%) Machine problems (~4) In-class quizzes (15%) To review the learned concepts (~5) Paper presentation (15%) Graded by peer-review Course project (35%) Research/development-oriented No midterm/final exams! No curve will be applied in final grading! CS@UVa CS6501: Text Mining

Quizzes Format Schedule Closed book and closed notes True/False questions Multiple choice questions Short answer questions Schedule After each major lecture topic Will be informed one week before the quiz Closed book and closed notes No electronic aids or cheat sheets CS@UVa CS6501: Text Mining

Paper presentation Let students present the state-of-the-art research related to text mining Choosing from recommended readings, or your favorite paper outside the list Should be related to your course project 15-mins presentation including 2-mins Q&A One paper per a group of students Register your choice early, first come first serve Will be graded by the instructor and other students CS@UVa CS6501: Text Mining

Course project Appreciate research-oriented problems or “deliverables” Work in groups (required) Up to 3 students Project proposal (20%) Discuss your topic with peers or the instructor first Written report Project report (40%) Due before the final presentation Project presentation (40%) 15-mins in-class presentation 5-mins Q&A CS@UVa CS6501: Text Mining

Deadlines Machine problems Paper presentation Project Submit via Collab Due in 7-days after posting Paper presentation Sign up is due in the end of 4th week Presentation starts on the 6th week Project Proposal due in the end of 5th week Presentation in the last week of Spring semester CS@UVa CS6501: Text Mining

Late policy Homework Quizzes Paper presentation Course project Everyone will have one chance to ask for extension (extra three days after deadline) Request must be made before the deadline! Quizzes No make-up quizzes unless under emergency situation Paper presentation Must be presented on your selected date Course project Proposal due early in the semester (~5th week, no extension) Final report due before presentation (no extension) CS@UVa CS6501: Text Mining

Late policy If submit after the deadline without granted extension 15% late penalty will be applied within the first week of due date 30% late penalty thereafter Fairness among all the students will be guaranteed! CS@UVa CS6501: Text Mining

Classroom participation HIGHLY APPRECIATED! Helps me quickly remember your names Reminds me what is still confusing You can drive the lecture/discussion in this class! CS@UVa CS6501: Text Mining

TAs Lin Gong (lead) lg5bt@virginia.edu Yiling Jia qw2ky@virginia.edu CS@UVa CS6501: Text Mining

Contact information Lecture Instructor: Hongning Wang Time: Tuesday/Thursday 12:30pm to 1:45pm Location: Thornton Hall E316 Office hour Instructor’s Time: Tuesday/Thursday 2:00pm to 3:00pm Location: Rice Hall 408 TAs’ Time: Monday/Wednesday 3:00pm to 4:00pm Location: Rice Hall 514 (not finalized yet) Course website Website: http://www.cs.virginia.edu/~hw5x/Course/TextMining-2018Spring/_site/ Piazza: https://piazza.com/virginia/spring2018/cs6501004/home CS@UVa CS6501: Text Mining

Thank you! Questions? CS@UVa CS6501: Text Mining