CS & CS ST: Probabilistic Data Management Fall 2016 Xiang Lian Kent State University Kent, OH

Slides:



Advertisements
Similar presentations
Welcome to the seminar course
Advertisements

Master’s thesis – a piece of cake? Per Ertbjerg Department of Food and Environmental Sciences University of Helsinki, Finland.
CS 498 Senior Seminar Students will research a current topic in computer science, write a paper on that topic, and make an oral presentation.
CSE 322: Software Reliability Engineering Topics covered: Course outline and schedule Introduction, Motivation and Basic Concepts.
Introduction to SEG 5010 Hong Cheng 2009/10 Second Term.
Syllabus CS 765: Introduction to Database Management Systems Fall 2008 Text Database Management Systems Ramakrishnan/Gehrke, 3rd.
Math 125 Statistics. About me  Nedjla Ougouag, PhD  Office: Room 702H  Ph: (312)   Homepage:
CS 150 PERSONAL PRODUCTIVITY USING TECHNOLOGY Instructor: Xenia Mountrouidou.
Lecture 1 Page 1 CS 111 Summer 2015 Introduction CS 111 Operating System Principles.
T Seminar on Network Security Today’s agenda 1.Overview and organization 2.English support 3.Course theme 4.Project topics 5.Timetable.
Proposal for Term Project Operating Systems, Fall 2015 J. H. Wang Sep. 18, 2015.
Course Introduction Software Engineering
CS 498 Senior Seminar Students will research a current topic in computer science, write a paper on that topic, and make an oral presentation.
Introduction to Databases Computer Science 557 September 2007 Instructor: Joe Bockhorst University of Wisconsin - Milwaukee.
Proposal for Term Project Operating Systems, Fall 2012 J. H. Wang Nov. 13, 2012.
IST 210: Organization of Data
ITCS 6265 Details on Project & Paper Presentation.
COMP1927 Course Introduction 16x1
Cluster and Grid Computing Course Project. Project Proposal Submit a one-page proposal to me by Tuesday, October 12 Two-person teams are encouraged No.
Welcome to EECS 395/495 Online Advertising: A Systems Approach.
IST 210: ORGANIZATION OF DATA Introduction IST210 1.
Web-Based Information Retrieval Week 1: Administrivia Old Dominion University Department of Computer Science CS 895 Spring 2013 Michael L. Nelson 01/15/13.
CSE6339 DATA MANAGEMENT AND ANALYSIS FOR COMPUTATIONAL JOURNALISM CSE6339, Spring 2012 Department of Computer Science and Engineering, University of Texas.
Class Introduction IST 210: Organization of Data IST2101.
{ COMI Thursday night 5:30 – 10:30 Room 6054.
Math 1316 Prof. Thomas Jay. Homework: Homework problems are designed to reinforce the lecture/reading. If you miss a class, you should attempt to do.
Technical Writing (Applies to research papers and theses)
Term Project Proposal By J. H. Wang Apr. 7, 2017.
Welcome to CS 4390/CS5381: Introduction to Formal Methods
Proposal for Term Project Information Security, Fall 2016
CS & CS ST: Big Data Analytics
Course Information and Introductions
TJTS505: Master's Thesis Seminar
Course Information and Introductions
The progress of the world depends almost entirely upon education
TJTS505: Master's Thesis Seminar
Proposal for Term Project
Introduction to CS Senior Design Project I / II
Literature Surveys Source : : Keshav P. Dahal (Bradford University)
It’s called “wifi”! Source: Somewhere on the Internet!
Academic Spotlight: SCMS Academic Team
Objectives of the Course and Preliminaries
Digital Forensics Dr. Bhavani Thuraisingham
Introduction to Information Retrieval Week 1: Administrivia
CS5040: Data Structures and Algorithms
COMI Friday 9:00 – 1:50 Room 2108.
Cpt S 471/571: Computational Genomics
CS & CS Probabilistic Data Management
Introduction to CS Senior Design Project I / II
Introduction to CS Senior Design Project I / II
ECE 751: Embedded Computing Systems Prof
Introduction to Information Retrieval Week 1: Administrivia
Title: Proposal Template
How to Read Research Papers?
CS & CS Capstone Project & Software Development Project
Proposal for Term Project Operating Systems, Fall 2018
Wuhan University of Technology
Title: Thesis Template
Cpt S 471/571: Computational Genomics
CS & CS ST: Probabilistic Data Management
Research and Life in Sakurai Lab.
Web-Based Information Retrieval Week 1: Administrivia
CSC227: Operating Systems
Lecture 1a- Introduction
General recommendations
Introduction to CS Senior Design Project I / II
Data Engineering Research Group
CSCE 4143 Section 001: Data Mining Spring 2019.
Lecture 1a- Introduction
Web-Based Information Retrieval Week 2: Administrivia
Presentation transcript:

CS & CS ST: Probabilistic Data Management Fall 2016 Xiang Lian Kent State University Kent, OH

Probabilistic Data Management An Overview of Probabilistic Data Management Data Uncertainty Model Probabilistic Query Answering Over Probabilistic and Uncertain Databases Probabilistic Graph Databases Data Quality in Probabilistic Databases 2

Background Required Probability & statistics (math) Algorithms & data structure Database techniques (e.g., index) You need to be able to look up how to get things done (for example, read papers/surveys from online resources, using digital library, Google, Wikipedia, etc.) 3

Skills Required This course is a seminar course for Master & Ph.D. students, in the sense that you need to learn how to do research – Read papers – Do surveys – Write papers (Latex) Tackle research problems – Do experiments – Give presentations & demonstrations – Research collaborations 4

Study Group Please form a team with at most 3 team members The workload should be distributed evenly to each team member Each team needs to finish 1 survey + 1 research paper/report + 2 presentations: – A survey on a selected research topic (including a comprehensive reading list on this topic) – A presentation on 1-2 existing research papers in your selected research directions (20-25 minutes) – A paper-like report (including introduction, problem definition, related work, the proposed approaches, experimental evaluation, and conclusions); – A presentation & demonstration on your research paper 5

Survey & Research Paper I will post a reading list of papers – It does not include all related works, but only a few typical papers in different research directions – You need to search digital libraries (e.g., ACM portal, IEEE Xplore, etc.) and Google the Web to find more related works in each direction You need to decide which topics/problems you want to study Please make an appointment with me to discuss research directions of your teams (within the first 3 weeks of the semester; on or before Sept. 14) 6

Scoring and Grading 5% - Attendance & Questions 60% - Research Projects/Papers – Survey on papers for the selected research topics in recent database conferences/journals (25%) – Code and report for the research project in paper format (including introduction, related works, problem definition, solutions, experiments, and conclusions; 35%) 40% - Presentations – Presentation for 1-2 related works in the selected research direction (20%) – Presentation and demonstration for the proposed research project (20%) 5% - Bonus Points, rated by other team members 7

Scoring and Grading (cont'd) A = 90 or higher B = C = D = F = <60 The maximum score you can get is: 110! 8

Use of the Textbook No textbooks!! The only resources are papers!! – ACM digital library – IEEE Xplore Digital Library – DBLP – Database Conferences SIGMOD, PVLDB, ICDE, EDBT, CIKM – Database Journals TODS, VLDBJ, TKDE 9

The Schedule for the Class 10 I expect to give lectures and introduce the concepts and techniques of probabilistic data management for the first month (September) Then, each team (or student, depending on the available time and the number of teams) will present 1-2 related works in the literature during the class, and meanwhile prepare the survey (October) Finally, each team will start to identify research problems and find solutions. You need to write a report in the paper format, do experiments (comparing with the existing approaches), and present/demonstrate your paper in class (November & December).

Advices & Suggestions Tools: – Latex vs. Word Survey – Check "Related Work" sections in most recent papers, and you can obtain more related papers – Read abstract/introductions of papers, and classify papers into different categories (this will help you later to identify problems that have not been solved before) Paper – Even if you are not familiar with some topics, try to read as many related works as possible to understand the general problems and solutions in these topics (you can skip some part, if it is too hard to understand) – Stick to the problem you want to solve, and use any resource you can find to solve the problem (note: DO NOT simply apply previous techniques to your problem, since it is not counted as your contributions!!) 11

Advices & Suggestions (cont'd) Do not copy from any sources (even for the survey) – Any form of academic dishonesty will be strictly forbidden and will be punished to the maximum extent – Allowing another student to copy one's work will be treated as an act of academic dishonesty, leading to the same penalty as copying 12

Advices & Suggestions (cont'd) If the resulting surveys and papers are of high quality and novel, I highly recommend you to submit them to database conferences or journals After this class, self-motivated, hardworking, and creative students with good performance on surveys/papers may have the chance to join my lab (Big Data Science Research Lab)! 13

14