GOOGLE FUSION TABLES: WEB- CENTERED DATA MANAGEMENT AND COLLABORATION HectorGonzalez, et al. Google Inc. Presented by Donald Cha December 2, 2015.

Slides:



Advertisements
Similar presentations
MICHAEL MARINO CSC 101 Whats New in Office Office Live Workspace 3 new things about Office Live Workspace are: Anywhere Access Store Microsoft.
Advertisements

Microsoft Excel 2003 Illustrated Complete Excel Files and Incorporating Web Information Sharing.
Microsoft Office Excel 2013 Core Microsoft Office Excel 2013 Core Courseware # 3253 Lesson 8: Macros, Importing and Exporting Data.
With Microsoft Access 2010© 2011 Pearson Education, Inc. Publishing as Prentice Hall1 PowerPoint Presentation to Accompany GO! with Microsoft ® Access.
Technical BI Project Lifecycle
Integrating Access with the Web and with Other Programs.
Fast Track to ColdFusion 9. Getting Started with ColdFusion Understanding Dynamic Web Pages ColdFusion Benchmark Introducing the ColdFusion Language Introducing.
Microsoft Visio is diagramming software for Microsoft Windows. It uses vector graphics to create diagrams. The 2007 Standard and Professional editions.
Tutorial 8 Sharing, Integrating and Analyzing Data
Copyright 2003 The McGraw-Hill Companies, Inc CHAPTER Application Software computing ESSENTIALS    
Chapter 14: Advanced Topics: DBMS, SQL, and ASP.NET
BUSINESS DRIVEN TECHNOLOGY
Tutorial 11: Connecting to External Data
Exploring Microsoft® Office Grauer and Barber 1 Committed to Shaping the Next Generation of IT Experts. Robert Grauer and Maryann Barber Using.
Building Data Integration Systems for the Web Alon Halevy Google NSF Information Integration Workshop April 22, 2010.
XP New Perspectives on Microsoft Access 2002 Tutorial 71 Microsoft Access 2002 Tutorial 7 – Integrating Access With the Web and With Other Programs.
Page 1 ISMT E-120 Introduction to Microsoft Access & Relational Databases The Influence of Software and Hardware Technologies on Business Productivity.
State of Connecticut Core-CT Project Query 4 hrs Updated 1/21/2011.
Databases & Data Warehouses Chapter 3 Database Processing.
Page 1 ISMT E-120 Desktop Applications for Managers Introduction to Microsoft Access.
II Course on GBIF Node Management Arusha, Tanzania 31 st October and 1 st November 2008 Tim ROBERTSON Systems Architect GBIF Secretariat Data Publishing.
SharePoint 2010 Business Intelligence Module 10: Reporting Services.
CS370 Spring 2007 CS 370 Database Systems Lecture 2 Overview of Database Systems.
OFC304 Excel 2003 Overview: XML Support Joseph Chirilov Program Manager.
1 Keith Vicens, Managing Consultant CRM Housing Solution Extending Your Case Management Capabilities.
ACOT Intro/Copyright Succeeding in Business with Microsoft Excel
Microsoft Access Illustrated Unit I: Importing and Exporting Data.
10-1 aslkjdhfalskhjfgalsdkfhalskdhjfglaskdhjflaskdhjfglaksjdhflakshflaksdhjfglaksjhflaksjhf.
Office Live Workspace Visio 2007 Outlook 2007 Groove 2007 Access 2007 Excel 2007 Word 2007.
Miscellaneous Excel Combining Excel and Access. – Importing, exporting and linking Parsing and manipulating data. 1.
Computer Science 101 Database Concepts. Database Collection of related data Models real world “universe” Reflects changes Specific purposes and audience.
Relational Databases (MS Access)
Bridging Communities and Data with ArcGIS Open Data Courtney Claessens, Product Engineer Daniel Fenton, Product Engineer.
1 Committed to Shaping the Next Generation of IT Experts. Chapter 8 Exchanging Data Between Access and Other Applications Exploring Microsoft Office Access.
EXTENDING DATABASE USABILITY Michelle Brown, MSc. Student.
Storing Organizational Information - Databases
Google Fusion Tables: Web-Centered Data Management and Collaboration Hector Gonzalez, Alon Y. Halevy, Christian S. Jensen, Anno Langen, Jayant Madhavan,
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
ITGS Databases.
Paperless Publishing web publishing. ebooks. digital paper.
AUC Technologies LINQ (Language Integrated Query) LINQ Presented By : SHAIKH SHARYAR JAVED Software Engineer (Daedalus Software Inc.) Technology Teacher.
Google Image Search, Code, Fusion Tables Audrey and Chris.
NSF DUE ; Wen M. Andrews J. Sargeant Reynolds Community College Richmond, Virginia.
Using Document Collaboration, Integration, and Charting Tools
Microsoft Office 2013 Try It! Chapter 4 Storing Data in Access.
XP New Perspectives on Microsoft Office Access 2003, Second Edition- Tutorial 8 1 Microsoft Office Access 2003 Tutorial 8 – Integrating Access with the.
1 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Enterprise Edition: Overview.
Computers Are Your Future Tenth Edition Spotlight 5: Microsoft Office Copyright © 2009 Pearson Education, Inc. Publishing as Prentice Hall1.
Analytics Plus Product Overview. Introduction Analytics Plus is a self-service Business Intelligence and advanced analytics software. On-premise reporting.
Introduction to the Power BI Platform Presented by Ted Pattison.
Excel Services Displays all or parts of interactive Excel worksheets in the browser –Excel “publish” feature with optional parameters defined in worksheet.
Microsoft Excel Illustrated Introductory Workbooks and Preparing them for the Web Managing.
1 Middle East Users Group 2008 Self-Service Engine & Process Rules Engine Presented by: Ryan Flemming Friday 11th at 9am - 9:45 am.
A Workshop on LibreOffice Er. Arvind Kumar Assistant Professor, Department of Computer Science & Engineering
Fusion Tables.
Mapping for the interwebs
Miscellaneous Excel Combining Excel and Access.
Leveraging BI in SharePoint with PowerPivot and Power View
Potter’s Wheel: An Interactive Data Cleaning System
INFS 3500 Martin, Brad, and John
Microsoft FrontPage 2003 Illustrated Complete
Microsoft Access 2003 Illustrated Complete
Tutorial 8 Objectives Continue presenting methods to import data into Access, export data from Access, link applications with data stored in Access, and.
Data Visualization Web Application
Exploring Microsoft® Access® 2016 Series Editor Mary Anne Poatsy
The Open Fiscal Data Package
Analytics Plus Product Overview 1.
Data Model.
Tutorial 7 – Integrating Access With the Web and With Other Programs
Donald Donais Minnesota SharePoint Users Group – April 2019
Presentation transcript:

GOOGLE FUSION TABLES: WEB- CENTERED DATA MANAGEMENT AND COLLABORATION HectorGonzalez, et al. Google Inc. Presented by Donald Cha December 2, 2015

THE WORLD WE SEE NOW World is “connected.” Proliferation of connected computing devices. Computation done on the Cloud Image Source: content/uploads/2015/04/Connected-World-1.jpg

FOUNDATIONS OF DBMS Established several decades ago. Focus on high-throughput business transactions. Processing of complex SQL queries Data belongs to a single enterprise.

DBMS NEEDS A CHANGE Need data management functionality for connected world But How? Needs to support collaboration among multiple users and multiple organizations. Needs to appeal to a broader audience of users (including those who are non- experts). Needs to be integrated seamlessly with Web application.

GOOGLE FUSION TABLES Cloud-based data management and integration service. Targeted Audience Organizations struggling to get their data available online Communities of users needing to collaborate on data management Novice users who are passionate about finding useful data and using data for integration Google Fusion Table can Upload data Collaborate data Visualize data Combine data

EXAMPLES OF GOOGLE FUSION TABLES

UNDERLYING PRINCIPLES OF GOOGLE FUSION TABLES Seamless Integration with Web Ease of Use Incentives for Sharing Data Support Collaboration

PROVIDE SEAMLESS INTEGRATION WITH THE WEB Publish visualization on the Web Bar charts, pie charts Timelines Display geo-spatial datasets on Google Maps Public datasets on Fusion Tables can be crawled by Google search engines Accessible through web search Can be integrated seamlessly with Google Documents and spreadsheets

EMPHASIZE EASE OF USE Pay-As-You-Go data management principles Immediate benefit of time invested when using the service Little or no initial installation to use the service No need to declare a schema

PROVIDE INCENTIVES FOR SHARING DATA Users desire to share data with others Users face problem when sharing data Loss of attribution Misuse and corruption of data

FACILITATE COLLABORATION Collaboration among different organizations table can provide valuable insight. Discuss and comment on the data Study different dataset of other users

DATA MANAGEMENT WITH FUSION TABLES Data Acquisition Data Sharing and Collaboration Data Manipulation and Visualization

DATA ACQUISITION Upload Capability of Different Formats CSV (Comma Separated Values) Spreadsheet (Excel, Open Office, Google Spreadsheets) KML (Keyhole Markup Language) Automatic Schema Detection Which row in the uploaded file is the header row Make as few query as possible to the users No need of user-defined schema and import method No need to specify data type for the column User can specify any description about the data for other users.

PROBLEMS WITH DATA ACQUISITION Can we trust Automatic Schema Detection? What if the data gets misplaced? How will we fix it? Possibly use human-interact data cleaning system like Potter’s Wheel

ATTRIBUTION, EXPORT, SHARING, AND INTEGRATION User can choose whom to share the data with. User can specify static attribute for the data Owner of the data can control exporting by other users. User can invite a set of collaborators to view, update, and comment on data.

CONTINUED Supports merging data from multiple sources of same entities.

PROBLEMS WITH DATA SHARING Users can make mistake If mistakes accumulate overtime by multiple users, hard to undo Users may not have the right skill set to contribute to a data Possible Solution Coordinate with others Educate other contributors

SEARCH Data must easily be discovered by interested users Public data discoverable by search engine Create a corresponding HTML page Advanced Search for tables in Fusion Tables For those who needs to explore specific dataset Based on an extension of the WebTables’ relation search

DISCUSSIONS Supports in-depth collaborations by elaborating discussions among multiple users about the data. Users can point out outliers Users can detect incorrect data Users can question about the underlying assumptions of data. Supports commenting data on all levels of granularity Row, column, and individual data Gives better context and keep track of the discussions Discussions are append-only Changes made are also appended

DATA MANIPULATION AND VISUALIZATION System recognizes a type of values in a column Geographical locations Date and time Numeric values Provides visualizations based on the type of data in a column Map viewing Timeline and Motion charts Bar charts and Pie charts Provides HTML snippet of generated data visualizations Allows multiple web property to use it (i.e. blog, , and etc)

DATA VISUALIZATION EXAMPLE

FUSION TABLES API Allows external developers to write applications that use Fusion Tables as its main database. Supports basic query and database modification SELECT, UPDATE, INSERT, and DELETE CREATE TABLE All access to database requires authentication

RELATED WORK Extended from parts of ManyEyes (multiple users upload data and visualize it) Several online database management tools DabbleDB Socrata Factual

CONCLUSION Google Fusion Tables encourage data owners to publish data to public so many others who need data can easily access Still needs Improvement Provide more expressive data modeling Query capability and performance on larger datasets

Image credit:

Image credit: