Welcome to the Course of Web and Document Databases

Welcome to the Course of Web and Document Databases (7102-001) Jan. 2017

Name: Yangjun Chen
Birthplace: China
Ph.D.: University of Kaiserslautern, Germany, 1995
Postdoctoral fellow: University of Chemnitz, Germany, 1995/7 - 1997/8
Senior engineer: German Research Center for Information Technology, 1997/9 - 2000/2
Postdoctoral fellow: University of Alberta, 2000/2 - 2000/6
Assistant Prof.: University of Winnipeg, 2000/7 - 2004/6
Associate Prof.: University of Winnipeg, 2004/7 - 2009/6
Full Prof.: University of Winnipeg, from 2009/7

Professor: Dr. Y. Chen
Office: 3D27
Home page: http://www.uwinnipeg.ca/~ychen2
E-mail: y.chen@uwinnipeg.ca
Phone: 786-9417
Meeting time: 11:30 am - 12:45 pm, Monday and Wednesday
Meeting location: 3D03
Office hours: 15:00 - 16:30, Monday and Wednesday; 10:00 - 17:00, Friday

Course Outline
Intro. to the design of Web and document databases, analysis, and applications
- Introduction to relational database theory: basic methods for designing relational databases, normalization theory, B+-tree, hashing, R-tree
- XML documents: semistructured-data model, programming languages for XML, databases and Internet
- Evaluation of tree pattern queries: tree searching and graph searching, unordered tree matching, ordered tree matching, XB-tree
- Reachability query evaluation: transitive closure, reachability checking, matrix multiplication, tree encoding, graph deduction and decomposition

Course Outline (continued)
- Data mining (finding most popular packages): signature files, signature trees, signature tree construction and search, SPD-trees, SPD-tree construction and search (SPD: Single Package Design)
- Evaluation of extended reachability queries: regular expressions, non-deterministic finite automata (NDFA), deterministic finite automata (DFA), simple paths, intersection graphs, restricted regular expressions, general query evaluation algorithm (A. O. Mendelzon and P. T. Wood, Finding Regular Paths in Graph Databases, SIAM J. Comput., Vol. 24, No. 6, pp. 1235-1258, Dec. 1995)

Reference books:
- Database Systems (the complete book), 2nd Ed., by Garcia-Molina, Ullman, Widom, Pearson Prentice Hall, 2009.
- Fundamentals of Database Systems, 5th Ed., by Elmasri, Navathe, Addison-Wesley.
- Introduction to Algorithms, 2nd Ed., by Cormen, Leiserson, Rivest, & Stein (CLRS), McGraw Hill, 2002.
- Lecture slides online
For evaluation of tree pattern queries:
- N. Bruno, N. Koudas, and D. Srivastava, Holistic Twig Joins: Optimal XML Pattern Matching, in Proc. SIGMOD Int. Conf. on Management of Data, Madison, Wisconsin, June 2002, pp. 310-321.
- Y. Chen and L. Zou, Unordered tree matching: the evaluation of tree pattern queries, Int. J. Information Technology, Communications and Convergence, 2011.

For evaluation of reachability queries:
- Warren, "A Modification of Warshall's Algorithm for the Transitive Closure of Binary Relations," Commun. ACM 18, 4 (April 1975), 218-220.
- H. Wang, H. He, J. Yang, P.S. Yu, and J. X. Yu, Dual Labeling: Answering Graph Reachability Queries in Constant Time, in Proc. of Int. Conf. on Data Engineering, Atlanta, USA, April 2006.
- H. Yildirim, V. Chaoji, and M.J. Zaki, GRAIL: Scalable Reachability Index for Large Graphs, in Proc. VLDB Endowment, 3(1), 2010, pp. 276-284.
- Y. Chen and Y.B. Chen, Decomposing DAGs into spanning trees: A new way to compress transitive closures, in Proc. 27th Int. Conf. on Data Engineering (ICDE 2011), IEEE, April 2011, pp. 1007-1018.
- Y. Chen, General Spanning Trees and Reachability Query Evaluation, in Proc. Canadian Conference on Computer Science and Software Engineering (C3S2E'09), Montreal, Canada, 2009, IEEE, pp. 243-252.

For evaluation of regular expressions:
- A. O. Mendelzon and P. T. Wood, Finding Regular Paths in Graph Databases, SIAM J. Comput., Vol. 24, No. 6, pp. 1235-1258, Dec. 1995.

Course Roadmap
Database basics
Indexes: B+-tree, hashing, R-tree, kd-trees, quad-trees, ...
Semistructured data model
Programming languages for XML
Databases over Internet
Evaluation of tree pattern queries
- Tree searching and graph searching
- Unordered tree matching
- Ordered tree matching
- XB-trees

Course Roadmap (continued)
Reachability query evaluation
- Transitive closure and reachability checking
- Matrix multiplication, Warren's algorithm
- Methods based on tree encoding
- Methods based on graph deduction and decomposition
Finding most popular packages
- Signature files
- Signature trees
- SPD-trees

Course Roadmap (continued)
Evaluation of extended reachability queries
- Regular expressions, NDFA (non-deterministic finite automata), DFA (deterministic finite automata)
- Intersection graphs and restricted regular expressions
- General query evaluation algorithm

Database Basics
Basic methods for designing relational databases
- Database system architecture, enhanced entity-relationship diagram (EERD)
- Rules for mapping an EERD to a relational schema
- Relational data model, SQL, database application: JDBC
Normalization theory
- 1st, 2nd, 3rd, and 4th normal forms
- Boyce-Codd normal form
Hashing, B+-tree and R-tree
- Hashing and linear hashing
- B+-tree, R-tree construction and maintenance
- Kd-trees, quad-trees, bitmaps, inverted files

XML Document Databases
- Semistructured-data model
- Programming languages for XML
- Databases over Internet
Goals
- Data storage in document databases
- Manipulation of data in document databases
- Data transfer over the Internet

Evaluation of Tree Pattern Queries
- Tree searching and graph searching
- Unordered tree matching
- Ordered tree matching
- XB-trees
Goals
- Evaluation of queries against document databases
- Indexing data to speed up information retrieval
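
Tree pattern query evaluation rests on cheap ancestor/descendant and parent/child tests between document nodes. The sketch below is not taken from the slides; it illustrates one common way to support those tests, a region (interval) encoding assigned by a single depth-first traversal. The names Node, encode, is_ancestor, and is_parent are hypothetical and chosen for illustration only.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    tag: str
    children: list["Node"] = field(default_factory=list)
    start: int = 0   # counter value when the traversal enters the node
    end: int = 0     # counter value when the traversal leaves the node
    level: int = 0   # depth in the tree (root is 0)


def encode(root: Node) -> None:
    """Assign (start, end, level) to every node in one depth-first pass."""
    counter = 0

    def visit(node: Node, level: int) -> None:
        nonlocal counter
        counter += 1
        node.start = counter
        node.level = level
        for child in node.children:
            visit(child, level + 1)
        counter += 1
        node.end = counter

    visit(root, 0)


def is_ancestor(a: Node, d: Node) -> bool:
    """a is a proper ancestor of d iff d's interval lies strictly inside a's."""
    return a.start < d.start and d.end < a.end


def is_parent(a: Node, d: Node) -> bool:
    """Parent/child is ancestor/descendant with a level difference of one."""
    return is_ancestor(a, d) and d.level == a.level + 1


# Small illustrative document: book(title, author(name))
name = Node("name")
title = Node("title")
author = Node("author", [name])
book = Node("book", [title, author])
encode(book)
print(is_ancestor(book, name), is_parent(book, name), is_parent(author, name))
```

With such labels, both structural tests cost a constant number of integer comparisons, so a matching algorithm never needs to walk the document tree to compare two nodes.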

Reachability Query Evaluation
Transitive closures and reachability checking
- Matrix multiplication
- Warren's algorithm
Method based on tree encoding
- Tree encoding
- Extension of tree encoding to DAGs
Method based on graph deduction and decomposition
Goals
- Efficient algorithms for checking reachability of nodes
- Basic operation used in many applications, such as web navigation, graph databases, biological databases
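
As a rough illustration of the transitive-closure approach, the sketch below computes the closure of a small directed graph in two ways: with Warshall's algorithm, and with the two-pass row-oriented variant usually attributed to Warren (1975). It is a minimal sketch and not code from the course; each matrix row is stored as a Python integer used as a bit set, and the function names are hypothetical.

```python
def initial_rows(n, edges):
    """Adjacency matrix as n integers; bit v of rows[u] is set iff u -> v."""
    rows = [0] * n
    for u, v in edges:
        rows[u] |= 1 << v
    return rows


def warshall_closure(n, edges):
    """Warshall: for each pivot k, every row that reaches k absorbs row k."""
    rows = initial_rows(n, edges)
    for k in range(n):
        for i in range(n):
            if rows[i] >> k & 1:
                rows[i] |= rows[k]
    return rows


def warren_closure(n, edges):
    """Warren's variant: two row-by-row passes, below and above the diagonal."""
    rows = initial_rows(n, edges)
    # Pass 1: for each row i, use columns j < i.
    for i in range(1, n):
        for j in range(i):
            if rows[i] >> j & 1:
                rows[i] |= rows[j]
    # Pass 2: for each row i, use columns j > i.
    for i in range(n - 1):
        for j in range(i + 1, n):
            if rows[i] >> j & 1:
                rows[i] |= rows[j]
    return rows


def reachable(rows, u, v):
    """True iff there is a non-empty path from u to v."""
    return bool(rows[u] >> v & 1)


edges = [(0, 2), (2, 1), (1, 3)]
closure = warren_closure(4, edges)
print(reachable(closure, 0, 3), reachable(closure, 3, 0))
```

Once the closure is stored, a reachability check is a single bit test, but the closure itself needs on the order of n^2 bits. That space cost is what the tree-encoding and decomposition methods listed above aim to reduce.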

Finding Most Popular Packages
- Signature files
- Signature trees, signature tree construction and search
- SPD-trees, SPD-tree construction and search
Goals
- A kind of data mining to find customer patterns to enlarge sales
- Binary tree search with pruning
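
The signature-file idea can be sketched with superimposed coding: each item is hashed to a few bit positions, a record's signature is the bitwise OR of its item signatures, and a query is matched by a bit-mask test followed by verification to discard false drops. The sketch below covers only this basic filter, not signature trees or SPD-trees. The signature length F, the bits-per-item count M, and the function names are assumptions made for illustration.

```python
import hashlib

F = 64  # signature length in bits (assumed value)
M = 3   # number of bit positions set per item (assumed value)


def word_signature(word):
    """Hash a word to at most M bit positions in an F-bit signature."""
    sig = 0
    for i in range(M):
        digest = hashlib.sha256(f"{i}:{word}".encode()).digest()
        sig |= 1 << (int.from_bytes(digest[:4], "big") % F)
    return sig


def record_signature(words):
    """Superimpose (OR together) the signatures of all words in a record."""
    sig = 0
    for word in words:
        sig |= word_signature(word)
    return sig


def search(records, query_words):
    """Return indices of records that contain every query word."""
    query_sig = record_signature(query_words)
    hits = []
    for idx, words in enumerate(records):
        # Filter step: every bit of the query signature must be set.
        if (record_signature(words) & query_sig) == query_sig:
            # Verification step: remove false drops caused by bit collisions.
            if set(query_words) <= set(words):
                hits.append(idx)
    return hits


records = [["milk", "bread", "eggs"], ["beer", "chips"], ["milk", "eggs"]]
print(search(records, ["milk", "eggs"]))
```

The filter can report a record that does not really match (a false drop) but never misses a matching one, which is why the verification step is kept. In practice the record signatures would be computed once and stored rather than recomputed per query.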

Evaluation of Extended Reachability Queries
- Regular expressions, NDFA, DFA
- Intersection graphs
- Restricted regular expressions
- General query evaluation algorithm
Goals
- Evaluation of queries in graph databases
- Get information from the Internet
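
One way to picture an extended reachability query is as a search over pairs (graph node, automaton state): a node is an answer if some path from the source spells a word accepted by the automaton. The sketch below is a simplified illustration and not the algorithm of Mendelzon and Wood. It allows arbitrary paths rather than only simple paths, it takes the automaton as a hand-written NFA without epsilon transitions, and all names and the data layout are assumptions.

```python
from collections import deque


def regular_path_targets(graph, nfa, start_state, accept_states, source):
    """Nodes reachable from source along a path whose labels the NFA accepts.

    graph: dict mapping node -> list of (label, next_node)
    nfa:   dict mapping (state, label) -> set of next states
    """
    seen = {(source, start_state)}
    queue = deque(seen)
    targets = set()
    while queue:
        node, state = queue.popleft()
        if state in accept_states:
            targets.add(node)
        for label, next_node in graph.get(node, []):
            for next_state in nfa.get((state, label), ()):
                pair = (next_node, next_state)
                if pair not in seen:
                    seen.add(pair)
                    queue.append(pair)
    return targets


# Edge-labeled graph and an NFA for the expression a b*
graph = {
    "x": [("a", "y"), ("c", "w")],
    "y": [("b", "z")],
    "z": [("b", "y")],
}
nfa = {(0, "a"): {1}, (1, "b"): {1}}
print(sorted(regular_path_targets(graph, nfa, 0, {1}, "x")))
```

Each (node, state) pair is visited at most once, so the search is bounded by the product of the graph size and the automaton size. Restricting answers to simple paths, as in the cited paper, is a harder problem and is not attempted here.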

Project
- Implementation of an algorithm for constructing an XB-tree
- Implementation of an algorithm for decomposing a DAG into a minimized set of chains
- Implementation of an algorithm for evaluating unordered tree pattern queries
- Implementation of an algorithm for evaluating ordered tree pattern queries

Important dates:
Wednesday, Jan. 04, 2017: First class
Monday, Feb. 27, 2017: Midterm examination
March 01, 2017: Final date to withdraw without academic penalty from a course that begins in Jan. and ends in April of the 2017 Winter term
April 03, 2017: Last class
Final examination: none (replaced by projects)

Course Evaluation:
3 assignments: 24%
1 midterm examination: 26%
1 project (or final exam.): 50%
All assignments are to be handed in at class on the due date. All work must be prepared using a word processor and placed in a folder. Late assignments are accepted up to 1 day late and receive a 25% penalty.

Academic dishonesty: Academic dishonesty is a very serious offense and will be dealt with in accordance with the University's discipline bylaw. Be sure that you have read and understood Regulations and Policies, #8 in the 2016-2017 UW Calendar.