Evacuating the Comfort Zone: (Via Curriculum Reform…)

Slides:



Advertisements
Similar presentations
Web Mining.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
1 i206: Distributed Computing Applications & Infrastructure 2012
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
Chapter 9 DATA WAREHOUSING Transparencies © Pearson Education Limited 1995, 2005.
Database Management: Getting Data Together Chapter 14.
DATA WAREHOUSING.
Chapter 14 The Second Component: The Database.
Chapter 5: Information Retrieval and Web Search
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
MIS 710 Module 0 Database fundamentals Arijit Sengupta.
D ATA, D ATABASE, DBMS, DBMS SOFTWARE, R ELATION, A PPLICATION, ENVIRONMENT, BENEFITS. Week 8 Mr.Mohammed Rahmath.
GeoPKDD Geographic Privacy-aware Knowledge Discovery and Delivery Kick-off meeting Pisa, March 14, 2005.
IT – DBMS Concepts Relational Database Theory.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
5.1 © 2007 by Prentice Hall 5 Chapter Foundations of Business Intelligence: Databases and Information Management.
Module Title? DBMS Introduction to Database Management System.
CIS 9002 Kannan Mohan Department of CIS Zicklin School of Business, Baruch College.
Database Architecture Introduction to Databases. The Nature of Data Un-structured Semi-structured Structured.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
1 Introduction to Database Systems. 2 Database and Database System / A database is a shared collection of logically related data designed to meet the.
Databases and Database Management Systems
Unit Seven Database 1.Passage One. Foundation of Database.
INF 141 COURSE SUMMARY Crista Lopes. Lecture Objective Know what you know.
1 COMPSCI 110 Operating Systems Who - Introductions How - Policies and Administrative Details Why - Objectives and Expectations What - Our Topic: Operating.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Succeeding with Technology Database Systems Basic Data Management Concepts Organizing Data in a Database Database Management Systems Using Database Systems.
Information Systems & Enhancing Decision Making for the Digital Firm
Intro – Part 2 Introduction to Database Management: Ch 1 & 2.
Chapter 6: Information Retrieval and Web Search
Database Systems Carlos Ordonez. What is “Database systems” research? Input? large data sets, large files, relational tables How? Fast external algorithms;
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
Computing Ontology Part II. So far, We have seen the history of the ACM computing classification system – What have you observed? – What topics from CS2013.
Data resource management
Big Data Analytics Carlos Ordonez. Big Data Analytics research Input? BIG DATA (large data sets, large files, many documents, many tables, fast growing)
1 CSC 594 Topics in AI – Text Mining and Analytics Fall 2015/16 6. Dimensionality Reduction.
Proteomics databases for comparative studies: Transactional and Data Warehouse approaches Patricia Rodriguez-Tomé, Nicolas Pinaud, Thomas Kowall GeneProt,
Database Architecture Course Orientation & Context.
IT Architectures for Handling Big Data in Official Statistics: the Case of Scanner Data in Istat Gianluca D’Amato, Annunziata Fiore, Domenico Infante,
Foundations of Business Intelligence: Databases and Information Management.
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
Advanced Database Concepts
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
CS798: Information Retrieval Charlie Clarke Information retrieval is concerned with representing, searching, and manipulating.
Data Mining Concepts and Techniques Course Presentation by Ali A. Ali Department of Information Technology Institute of Graduate Studies and Research Alexandria.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
IIS 645 Database Management Systems DDr. Khorsheed Today’s Topics 1. Course Overview 22. Introduction to Database management 33. Components of Database.
IT 5433 LM1. Learning Objectives Understand key terms in database Explain file processing systems List parts of a database environment Explain types of.
Managing Data Resources File Organization and databases for business information systems.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
COMPSCI 110 Operating Systems
Information Storage and Retrieval Fall Lecture 1: Introduction and History.
Chapter 13 The Data Warehouse
Information Retrieval and Web Search
Information Retrieval and Web Search
Information Retrieval and Web Search
Data Mining: Concepts and Techniques Course Outline
Information Retrieval
Data Warehousing and Data Mining
Parallel Analytic Systems
DATA RETRIEVAL IN ADHOC NETWORKS
Chapter 3 Database Management
Information Retrieval and Web Search
CS222/CS122C: Principles of Data Management UCI, Fall 2018 Notes #03 Row/Column Stores, Heap Files, Buffer Manager, Catalogs Instructor: Chen Li.
Presentation transcript:

Evacuating the Comfort Zone: (Via Curriculum Reform…)

Comfy Topics Logical data models and languages Query optimization/execution Consistency models and mechanisms Storage architectures Enterprise IT applications

Moving Outside the Zone Rethinking system architectures –Deep memory hierarchies, componentization, adaptive algorithms, extreme scales (nano to global) Embracing probabilistic reasoning –In data analysis, adaptive algorithms (again!), online user interactions, data modeling and integration, lossy compression

Course 1: Data Systems Not so radical: infect the OS course with the RSS –Traditional OS material (scheduling, protection, resource management) –File & Record storage –Transactions, Concurrency, Recovery –Storage Hierarchies –Dataflow architectures: query plans, NW support Big pedagogical benefit to merging this material –Two design targets (OS/FS vs. DBMS) –Leads to instructive architectural tradeoffs –Illustrates 2 design philosophies (bottom-up vs. top- down)

Course 2: Modeling & Analysis Relational + IR + Statistics + Information Theory Review of basic math –1st-order logic –Central Limit Theorem, Chernoff/Hoeffding bounds –Simple information theory: entropy, error metrics Data Modeling –Logical: Relational normalization, ontologies, IR bag-of-words –Probabilistic: simple graphical models (Bayes nets), IR vector space Data Analysis –Relational-style query optimization/execution, OLAP –Sampling and summarization –Boolean IR, ranked retrieval, link analysis, info extraction –Predictive analyses: classification, clustering –Data Visualization Pragmatics, Exercises: –Decision-support systems & tasks: queries, mining tools, etc.

Assertions Course 1 is a Good Idea –4 years at the grad level at Berkeley, it works great Course 2 is The Future –It’s in demand Think of what Business, BioEng, etc. really want! Do our systems students even know how to manage, exploit experimental data? –A curriculum, or a research agenda? KDD is a piece of this But fragmentary Opportunity here! –The DB textbook market is saturated :-)