Visualizations, Mashups and Dashboards University of Illinois at Urbana-Champaign.

Slides:



Advertisements
Similar presentations
HATHI TRUST A Shared Digital Repository Delivering Data For New Generations of Research Strategies and Challenges Jeremy York NISO/BISG Forum ALA 2010.
Advertisements

1 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. An Introduction to Data.
AS ICT Finding your way round MS-Access The Home Ribbon This ribbon is automatically displayed when MS-Access is started and when existing tables.
Spelling Correction for Search Engine Queries Bruno Martins, Mario J. Silva In Proceedings of EsTAL-04, España for Natural Language Processing Presenter:
Book Recommender System Guided By: Prof. Ellis Horowitz Kaijian Xu Group 3 Ameet Nanda Bhaskar Upadhyay Bhavana Parekh.
1 Actuate Corporation © 2010 THE BIRT COMPANY THE BIRT COMPANY THE BIRT COMPANY THE BIRT COMPANY THE BIRT COMPANY THE BIRT COMPANY THE BIRT COMPANY THE.
Describing Process Specifications and Structured Decisions Systems Analysis and Design, 7e Kendall & Kendall 9 © 2008 Pearson Prentice Hall.
University of Illinois Visualizing Text Loretta Auvil UIUC February 25, 2011.
University of Illinois OCR Workshop Loretta Auvil UIUC October 18, 2011.
Information Retrieval in Practice
CIS392Semester Projects1 CIS392 Text Processing, Retrieval, and Mining Overview of Semester Projects.
University of Illinois Role of Mashups, Cloud Computing, and Parallelism for Visual Analytics Loretta Auvil.
Tutorial 8 Sharing, Integrating and Analyzing Data
Chapter 1 Getting Started With Dreamweaver. Explore the Dreamweaver Workspace The Dreamweaver workspace is where you can find all the tools to create.
SEASR Overview Loretta Auvil, Boris Capitanu National Center for Supercomputing Applications University of Illinois at Urbana-Champaign
Overview of Search Engines
Database Updates Made Easy In WebFocus Using SQL And HTML Painter Sept 2011 Lender Processing Services 1.
Dobrin / Keller / Weisser : Technical Communication in the Twenty-First Century. © 2008 Pearson Education. Upper Saddle River, NJ, All Rights Reserved.
B.A. (Mahayana Studies) Introduction to Computer Science November March Office Tools A look at the main tools most computer users.
SEASR Analytics and Zotero University of Illinois at Urbana-Champaign.
2012 National BDPA Technology Conference Creating Rich Data Visualizations using the Google API Yolanda M. Davis Senior Software Engineer AdvancED August.
Oracle Application Express (Oracle APEX)
The SEASR project and its Meandre infrastructure are sponsored by The Andrew W. Mellon Foundation SEASR Overview Loretta Auvil and Bernie Acs National.
8 Chapter Eight Server-side Scripts. 8 Chapter Objectives Create dynamic Web pages that retrieve and display database data using Active Server Pages Process.
SharePoint 2010 Business Intelligence Module 6: Analysis Services.
GIS technologies and Web Mapping Services
Server-side Scripting Powering the webs favourite services.
Team 1 Jesus Weibo Mina Yunming. Client’s Needs  Short term, medium term, long term plans  Simulation of energy system to provide affordable future.
The SEASR project and its Meandre infrastructure are sponsored by The Andrew W. Mellon Foundation SEASR Overview Loretta Auvil and Bernie Acs National.
Tutorial 1 Getting Started with Adobe Dreamweaver CS3
NPI Reporting 2010 Visions Conference July 28, 2010.
CSCI 1101 Intro to Computers 3. Common Productivity Software.
More HTRC Loretta Auvil, Boris Capitanu University of Illinois at Urbana-Champaign
Chapter 2 Architecture of a Search Engine. Search Engine Architecture n A software architecture consists of software components, the interfaces provided.
SEASR Applications and Future Work University of Illinois at Urbana-Champaign.
Introduction of Geoprocessing Topic 7a 4/10/2007.
Web site development: Basics & MS FrontPage. What I hope to demonstrate n n Basics of a good web site n n How to most effectively communicate via the.
Microsoft Office 2007 Intermediate© 2008 Pearson Prentice Hall1 PowerPoint Presentation to Accompany GO! With Microsoft ® Office 2007 Intermediate Chapter.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
Search Engines. Search Strategies Define the search topic(s) and break it down into its component parts What terms, words or phrases do you use to describe.
Robin Mullinix Systems Analyst GeorgiaFIRST Financials PeopleSoft Query: The Next Step.
Meandre Workbench National Center for Supercomputing Applications University of Illinois at Urbana-Champaign.
The College of Saint Rose CSC 460 / CIS 560 – Search and Information Retrieval David Goldschmidt, Ph.D. from Search Engines: Information Retrieval in Practice,
The SEASR project and its Meandre infrastructure are sponsored by The Andrew W. Mellon Foundation Meandre Workbench National Center for Supercomputing.
L JSTOR Tools for Linguists 22nd June 2009 Michael Krot Clare Llewellyn Matt O’Donnell.
Copyright © 2006 Pilothouse Consulting Inc. All rights reserved. Search Overview Search Features: WSS and Office Search Architecture Content Sources and.
Mashups and Dashboards National Center for Supercomputing Applications University of Illinois at Urbana-Champaign.
4BP1 Electronic & Computer Engineering Paul Gildea th Year Final Project Presentation.
Week 2- Overview of the internet The construction of a webpage Four Key Elements – how the internet works Elements and Design concepts Introduction to.
SEASR Overview Loretta Auvil, Boris Capitanu University of Illinois at Urbana-Champaign
More Text Analytics National Center for Supercomputing Applications University of Illinois at Urbana-Champaign.
HTRC Loretta Auvil, Boris Capitanu University of Illinois at Urbana-Champaign
CS5604: Final Presentation ProjOpenDSA: Log Support Victoria Suwardiman Anand Swaminathan Shiyi Wei Department of Computer Science, Virginia Tech December.
SEASR Analytics and Zotero University of Illinois at Urbana-Champaign.
Data analytics and mash-up Real time analytics of employment data Team Shadowfax 1/25/2016 CMPE Class Project 0.
Invitation to Computer Science 6 th Edition Chapter 10 The Tower of Babel.
Introduction of Geoprocessing Lecture 9 3/24/2008.
GROUP PresentsPresents. WEB CRAWLER A visualization of links in the World Wide Web Software Engineering C Semester Two Massey University - Palmerston.
 2001 Prentice Hall, Inc. All rights reserved. Outline 1 JavaScript.
1 New Perspectives on Access 2016 Module 8: Sharing, Integrating, and Analyzing Data.
Information Retrieval in Practice
COP4710 Database Systems Project Overview.
21 Essential Data Visualization Tools
SEASR Overview Loretta Auvil, Boris Capitanu
Team Project, Part II NOMO Auto, Part II IST 210 Section 4
An Introduction to Data Warehousing
Extracting Recipes from Chemical Academic Papers
CS4433 Database Systems Project.
CloudAnt: Database as a Service (DBaaS)
AI Discovery Template IBM Cloud Architecture Center
Presentation transcript:

Visualizations, Mashups and Dashboards University of Illinois at Urbana-Champaign

Outline Visualization SEASR as a service Text App: Airport Data Text App: Correlation & Ngram Viewer Text App: ProseVis Attendee Project Work

Visualization Components JavaScript –GIS: GoogleMaps –Temporal: Simile –InfoVis: Protovis – Parallel Coordinates, Link Node, Arcs –InfoVis: D3 – Line Graphs, Tag Cloud GWT –Dendogram –Table Viewer Flash –InfoVis: Flare Applets –Data Mining Results: Decision Tree, Naïve Bayes, Rule Association HTML –Reports

Mashups & Dashboards HTML driven dashboard Connect to DB to make the query 6 analytical services (flows) –4 different tag cloud views –2 entity relationship views Example: SEASR, Protovis

Usage of Service from HTML Simple

More Complex var servicePorts = [ 10000, 10001, 10002, 10003, 10004, ]; var iframes = ["frame1", "frame2", "frame3", "frame4", "frame5", "frame6”]; function searchService() { var searchterm = document.getElementById("searchterm").value; for (var i in servicePorts) window.frames[iframes[i]].location.href = " dev.leo.ncsa.edu:" + servicePorts[i] + "/service/post?query=" + searchterm; } …

Correlation-Ngram Viewer

Pearson Correlation Algorithm 5 Flows run as ZigZag flows

Correlation-Ngram Viewer Words whose frequencies correlate with each other in a large dataset often turn out to have contextual associations as well Corrected version of the Google ngrams dataset (the English corpus)

Correlation-Ngram Viewer New version of the Google ngrams viewer (for 1 grams) –addresses case-sensitivity –period spellings Normalized to British practice, and period spellings have been normalized to modern practice (“honor” -> “honour,” “enow” -> “enough,” “publick” -> “public”) –past-tense syncope (' d) based on 3-gram dataset to correct the treatment of syncopated past tenses (occurrences of “resolv’d” are now folded into the occurrences of “resolved”) –f/s substitution as well as other OCR issues Searches within already stored correlation results (using Pearson) results for top 10K ngrams Computes correlation (using Pearson) results for given word against top 1K ngrams

Correlation-Ngram Viewer Flows Service Search Pearson for Top 10k Google Ngrams –Uses the web service components to accept a query and executes a SQL query with this term against the table that stores the calculated Pearson results. Service Compute Pearson for Top 10k Google Ngrams –Uses the web service components to accept a query and executes a SQL query to calculate the correlation of this search term against the top 10,000 ngrams. Service Query Pearson Results –Shows the set of ngrams that have the highest Pearson score within the 10,000 result set. Service Google Ngrams Multi Line Chart –Uses the web service components to accept a query, execute the SQL query with this term(s) to retrieve the frequency data and create a visualization showing the frequency graph for each ngram. The graph has a smoothing moving average that can be user adjusted. Service Google Ngrams List –Uses the web service components to accept a query, execute the SQL query to identify the term(s) included in the graph and their total frequency counts. The list displays normalization for case, OCR errors from the transformation rules, and period spellings normalized to British.

Service Google Ngrams Multi Line Chart

Service Query st?query=“dark”&query=“bright”&min_year=1700 &max_year=1899

HTML code calling Meandre Service var servicePorts = [10012,10013,10014,10015]; var iframes = [ "frame0","frame1","frame2","frame3" ]; function searchService() { var searchterm = document.getElementById("searchterm").value; searchterm = searchterm.replace(/,,/g, ","); //remove extra commas searchterm = searchterm.replace(/ /g, ""); searchterm = searchterm.toLowerCase(); var gsearchterm = searchterm.replace(/,/g, "%22&query=%22"); searchterm = searchterm.replace(/,/g, "%22,%22"); window.frames[iframes[0]].location.href = " + servicePorts[0] + "/service/post?query=%22" + searchterm + "%22&min_year=1700&max_year=1899"; window.frames[iframes[1]].location.href = " + servicePorts[1] + "/service/post?query=%22" + gsearchterm + "%22&min_year=1700&max_year=1899"; }

ProseVis

Visualization tool Seeks to identify other features than the "word" to analyze texts Features comprise sound –parts-of-speech –accent –phoneme –stress –tone –break index

ProseVis: Output Data

ProseVis: Data Processing Flow

ProseVis: Prosody Features “Matisse”“Picasso”

Prosody Comparisons

ProseVis: Prosody Comparisons Tender Buttons

ProseVis Calls Meandre service for data processing –Identify Prosody Features of Document –Compare Prosody Features of Multiple Documents

Other Noteworthy Data Cleaning Tools Data Wrangler – –“Wrangler is an interactive tool for data cleaning and transformation. Spend less time formatting and more time analyzing your data.” Google Refine – refine/ refine/ –“is a power tool for working with messy data, cleaning it up, transforming it from one format into another”

Demonstration Visualizations Text App: Correlation & Ngram Viewer Text App: ProseVis

Attendee Project Plan Study/Project Title Team Members and their Affiliation Procedural Outline of Study/Project –Research Question/Purpose of Study –Data Sources –Analysis Tools Activity Timeline or Milestones Report or Project Outcome(s) Ideas on what your team needs from SEASR staff to help you achieve your goal.

Learning Exercises Create a web page that accesses SEASR services –Customize an HTML template that we have constructed to leverage 2-3 flows that you develop and execute

Discussion Questions What types of mashups or dashboards would you like to see?