Institute for Personal Robots in Education (IPRE)‏ CSC 170 Computing: Science and Creativity.

Slides:



Advertisements
Similar presentations
A database is a collection of data that is stored in a computer system. Databases allow their users to enter, access, and analyze their data quickly and.
Advertisements

ICDL Software Applications - Database Concepts. Unit 6 Data and Data Representation Database Concepts –File Structure –Relationships Database Design –Data.
Outline What is a data warehouse? A multi-dimensional data model Data warehouse architecture Data warehouse implementation Further development of data.
Calculations, Visualization, and Simulation 6.  2001 Prentice Hall6.2 Chapter Outline The Spreadsheet: Software for Simulation and Speculation Statistical.
Introduction to databases from a bioinformatics perspective Misha Taylor.
Chapter 14 The Second Component: The Database.
Attribute databases. GIS Definition Diagram Output Query Results.
MS Access 2007 IT User Services - University of Delaware.
Hadoop Team: Role of Hadoop in the IDEAL Project ●Jose Cadena ●Chengyuan Wen ●Mengsu Chen CS5604 Spring 2015 Instructor: Dr. Edward Fox.
D ATA, D ATABASE, DBMS, DBMS SOFTWARE, R ELATION, A PPLICATION, ENVIRONMENT, BENEFITS. Week 8 Mr.Mohammed Rahmath.
CHAPTER 9 DATABASE MANAGEMENT © Prepared By: Razif Razali.
Institute for Personal Robots in Education (IPRE)‏ CSC 170 Computing: Science and Creativity.
CPS120: Introduction to Computer Science Information Systems: Database Management Nell Dale John Lewis.
CS370 Spring 2007 CS 370 Database Systems Lecture 2 Overview of Database Systems.
1 INTRODUCTION TO DATABASE MANAGEMENT SYSTEM L E C T U R E
Simple Database.
CIS 103 — Applied Computer Technology Last Edited: September 17, 2010 by C.Herbert Using Database Management Systems.
Information Systems: Databases Define the role of general information systems Describe the elements of a database management system (DBMS) Describe the.
Unit 23 Database and Spreadsheets Prepared by :Nahed Al-Salah.
Data Mining – A First View Roiger & Geatz. Definition Data mining is the process of employing one or more computer learning techniques to automatically.
Computer Science 101 Database Concepts. Database Collection of related data Models real world “universe” Reflects changes Specific purposes and audience.
A Relational Approach to Incrementally Extracting and Querying Structure in Unstructured Data Eric Chu, Akanksha Baid, Ting Chen, AnHai Doan, Jeffrey Naughton.
Database Essentials. Key Terms Big Data Describes a dataset that cannot be stored or processed using traditional database software. Examples: Google search.
Introduction to Databases Trisha Cummings. What is a database? A database is a tool for collecting and organizing information. Databases can store information.
Spreadsheet Applications Calculations, Visualization, and Simulation.
Microsoft Access Database Software.
Database What is a database? A database is a collection of information that is typically organized so that it can easily be storing, managing and retrieving.
Chapter 5 Data Resource Management. 2 I. Why do organizations store data?  Data resources must be structured and organized in some logical manner so.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
Chapter 9 Database Systems Introduction to CS 1 st Semester, 2014 Sanghyun Park.
Chapter 9 Database Systems © 2007 Pearson Addison-Wesley. All rights reserved.
Today’s Goals Answer questions about homework and lecture 2 Understand what a query is Understand how to create simple queries using Microsoft Access 2007.
Motivation: Sorting is among the fundamental problems of computer science. Sorting of different datasets is present in most applications, ranging from.
DISCRETE COMPUTATIONAL STRUCTURES CSE 2353 Fall 2010 Most slides modified from Discrete Mathematical Structures: Theory and Applications by D.S. Malik.
INLS 200: Retrieving & Analyzing Information. + Today’s Agenda Typically a quiz – no quiz today. Review last week/ Administrivia Can everyone access Sakai?
Introduction to Information and Computer Science
Chapter 10 Database Management. Data and Information How are data and information related? p Fig Next processing data stored on disk Step.
Data Management Managing Big Data Briefing 10/2012 Will Graves US-VISIT Chief Biometric engineer Chair of Biometric Domain.
2/20: Ch. 6 Data Management What is data? How is it stored? –Traditional management storage techniques; problems –DBMS.
Chapter 13.3: Databases Invitation to Computer Science, Java Version, Second Edition.
Computer Science: A Structured Programming Approach Using C1 8-7 Two-Dimensional Arrays The arrays we have discussed so far are known as one- dimensional.
PREPARED BY: PN. SITI HADIJAH BINTI NORSANI. LEARNING OUTCOMES: Upon completion of this course, students should be able to: 1. Understand the structure.
ACCESS CHAPTER 2 Introduction to ACCESS Learning Objectives: Understand ACCESS icons. Use ACCESS objects, including tables, queries, forms, and reports.
Database Overview What is a database? What types of databases are there? How are databases more powerful than spreadsheets?
uses of DB systems DB environment DB structure Codd’s rules current common RDBMs implementations.
Databases. What is a Database? A database is an organized collection of information or data. Databases can be paper-based or electronic. Information (text.
Database Presentation BIM, Mrs. Bailey. **Database Notes** Use new sheet of paper! Microsoft Access - known as a database management system or DBMS Database.
OBIS Data Scenarios: Using Darwin Core to bring data into OBIS Philip Goldstein O CEAN B IOGEOGRAPHIC I NFORMATION S YSTEM May 5, 2014.
Data Mining Introduction to data mining concepts.
Big Data is a Big Deal!.
Fundamentals of Database
Learn about relations and their basic properties
Chapter 9 Database Systems
Lecture 8 Database Implementation
Database Management  .
Fundamentals of Database Design
September 11, Ian R Brooks Ph.D.
System And Application Software
Chapter 2 Database Environment.
Database.
Databases.
An Introduction to Data Warehousing
Structured Query Language
INTRODUCING DATABASES
Course Introduction CSC 576: Data Mining.
Spreadsheets, Modelling & Databases
DATABASES WHAT IS A DATABASE?
Microsoft Access Date.
Calculations, Visualization, and Simulation
Presentation transcript:

Institute for Personal Robots in Education (IPRE)‏ CSC 170 Computing: Science and Creativity

THE BIG DATA CHALLENGE How do we... process data sets that are too large for traditional algorithms and software tools? extract knowledge from large datasets in a wide variety of domains, from science to medicine to consumer data

DIKW PYRAMID Represents structural and/or functional relationships between d ata, i nformation, k nowledge, and w isdom Typically information is defined in terms of data, knowledge in terms of information, and wisdom in terms of knowledge wikipedia

DIKW PYRAMID Data : discrete raw facts, signals, observations of no use until...in a usable form Patient data, sensor data, scientific data

DIKW PYRAMID Data : discrete raw facts, signals, observations of no use until...in a usable form Patient data, sensor data, scientific data Information : processed, organized or structured data that is used for some purpose 14% of Americans are over 65

DIKW PYRAMID Data : discrete raw facts, signals, observations of no use until...in a usable form Patient data, sensor data, scientific data Information : processed, organized or structured data that is used for some purpose 14% of Americans are over 65 Knowledge and Wisdom: Thoughts in a persons mind believed to be true E = mc 2 God exists

DATA STORAGE HISTORY

SORTING AND SEARCHING Traditional Searching & Sorting Algorithms Data must fit into main memory (RAM) Data are processed sequentially (not in parallel)

ANALYZING DATA Spreadsheets Structured data: Columns and rows Small to large size data sets (50 MB - 2 GB) Analysis, visualization Widely used in business Code demo: Descriptive statistics in Google Sheets

VISUALIZING DATA Databases Structured Data: Tables Moderate to large data sets (2 GB - 2 TB) Storage and retrieval of relational data Logical searching and analysis of data Retrieve records for all accounts > 50000

STANDARD QUERY LANGUAGE SQL Data bases have tables: lists of records (1, 2,3) Record: a list of attributes (ID, Name, Address, City) Code Demo: Show a few data base queries from

DATASETS A data set (or dataset ) is a collection of data. Most commonly a data set corresponds to the contents of a single database table, or a single statistica data matrix, where every column of the table represents a particular variable, and eac row corresponds to a given member of the data set in question. The data set lists values for each of the variables, such as height and weight of an object, for each member of the data set. Each value is known as a datum. The data set may comprise data for one or more members, corresponding to the number of rows. wikipedia

GOOGLE PUBLIC DATA SETS Google Public Data This is the first part of Today’s lab Demo a bubble chart: Determine something interesting