CoScripter and Topes: Putting Data into Usable Formats Christopher Scaffidi Carnegie Mellon University With Allen Cypher and Jimmy Lin IBM Almaden.

Presentation transcript:

CoScripter and Topes: Putting Data into Usable Formats Christopher Scaffidi Carnegie Mellon University With Allen Cypher and Jimmy Lin IBM Almaden

2 Data may be Incorrectly Formatted
Example: In a contextual inquiry, an end user needed to copy the job title, phone number, and address of each staff member into a spreadsheet. Notice the mis-formatted phone number and address. A web macro for this task would need to help the user fix the data.
formats ● coscripter ● topes

3 Data may be Inconveniently Formatted
Consider all the ways that we write dates. Reformatting may be necessary when reusing a date from one web site to fill out a form on another web site.
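The date problem on this slide can be illustrated with a minimal Python sketch. It is not part of CoScripter or of the topes tools; the function name `reformat_date` and the list `KNOWN_DATE_FORMATS` are hypothetical, and the four input formats are only a small sample of the ways dates are written.

```python
from datetime import datetime

# Hypothetical sample of input formats; a real tool would need many more.
# Month names are assumed to be English (the default C locale).
KNOWN_DATE_FORMATS = ["%Y-%m-%d", "%m/%d/%Y", "%B %d, %Y", "%d %b %Y"]


def reformat_date(text, target_format="%Y-%m-%d"):
    """Rewrite text in target_format, or return None if no known format matches."""
    for fmt in KNOWN_DATE_FORMATS:
        try:
            parsed = datetime.strptime(text.strip(), fmt)
        except ValueError:
            continue  # this format does not match; try the next one
        return parsed.strftime(target_format)
    return None


if __name__ == "__main__":
    # A date copied from one site, rewritten for a form that expects ISO dates.
    print(reformat_date("March 5, 2009"))
    # The same idea with a different target format.
    print(reformat_date("03/05/2009", "%d %b %Y"))
```

Trying formats in a fixed order is the simplest possible design; it silently picks the first match for ambiguous inputs such as 03/05/2009, which is one reason a richer model of formats is attractive.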

4 Limitations of Web Macro Tools
Right now, CoScripter cannot
– Clean up incorrectly formatted data
– Reformat inconveniently formatted data
Like most web macro tools, CoScripter treats data as strings, so it cannot clean up or reformat data. It does not recognize
– Phone numbers
– Addresses
– Country currency codes
– Dates, etc.

5 Topes
A tope is a kind of data that has a natural place in the problem domain (“tope” = Greek for “place”).
E.g.: phone number, state name, person name
Many topes have several common formats:
(408) /
California; CALIFORNIA; CA; Calif.
John von Neumann; JOHN VON NEUMANN; von Neumann, John; VON NEUMANN, JOHN
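As a rough illustration of one kind of data with several formats, the sketch below stores the four spellings of the state name from this slide and converts between them. The table `STATE_FORMATS`, the format labels ("full", "upper", "postal", "abbrev"), and both function names are assumptions made for this example, not names from the topes tools.

```python
# Hypothetical table with a single state; the four spellings come from the slide.
STATE_FORMATS = {
    "California": {
        "full": "California",
        "upper": "CALIFORNIA",
        "postal": "CA",
        "abbrev": "Calif.",
    },
}


def find_state(text):
    """Return (canonical name, format label) for a known spelling, else None."""
    candidate = text.strip()
    for canonical, spellings in STATE_FORMATS.items():
        for label, spelling in spellings.items():
            if candidate == spelling:
                return canonical, label
    return None


def convert_state(text, target_label):
    """Rewrite a recognized state spelling in the requested format, else None."""
    match = find_state(text)
    if match is None:
        return None
    canonical, _ = match
    return STATE_FORMATS[canonical][target_label]


if __name__ == "__main__":
    print(find_state("Calif."))
    print(convert_state("CA", "full"))
```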

6 A tope is modeled as a graph of formats
An example tope for CMU room numbers
– 3 formats (called “isa” functions, which recognize data)
– 4 transformations (called “trf” functions, which reformat data)
– Most topes have enough trfs to form a connected graph
Formats in the example:
– Formal building name & room number: Elliot Dunlap Smith Hall 225
– Building abbreviation & room number: EDSH 225
– Colloquial building name & room number: Smith 225
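The graph model can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the authors' implementation: the isa functions here return a plain True/False, the building table holds only the one building named on the slide, and the choice of which four trf edges exist (formal to abbreviation and back, formal to colloquial and back) is a guess, since the transcript does not say. All identifiers (`BUILDINGS`, `ISA`, `TRF`, `recognize`, `reformat`) are hypothetical.

```python
from collections import deque

# Hypothetical lookup table with the single building taken from the slide.
BUILDINGS = [
    {"formal": "Elliot Dunlap Smith Hall", "abbr": "EDSH", "colloquial": "Smith"},
]


def make_isa(fmt):
    """Build a recognizer for '<building name in fmt> <room number>'."""
    def isa(text):
        name, _, room = text.rpartition(" ")
        return room.isdigit() and any(b[fmt] == name for b in BUILDINGS)
    return isa


def make_trf(src, dst):
    """Build a transformation that rewrites a string from format src to dst."""
    def trf(text):
        name, _, room = text.rpartition(" ")
        building = next(b for b in BUILDINGS if b[src] == name)
        return f"{building[dst]} {room}"
    return trf


# Nodes of the graph: one isa function per format.
ISA = {fmt: make_isa(fmt) for fmt in ("formal", "abbr", "colloquial")}

# Edges of the graph: four trf functions (assumed set, see lead-in).
TRF = {
    (src, dst): make_trf(src, dst)
    for src, dst in [
        ("formal", "abbr"),
        ("abbr", "formal"),
        ("formal", "colloquial"),
        ("colloquial", "formal"),
    ]
}


def recognize(text):
    """Return the name of the first format whose isa accepts text, else None."""
    for fmt, isa in ISA.items():
        if isa(text):
            return fmt
    return None


def reformat(text, target):
    """Breadth-first search over trf edges from the recognized format to target."""
    start = recognize(text)
    if start is None:
        return None
    queue = deque([(start, text)])
    seen = {start}
    while queue:
        fmt, value = queue.popleft()
        if fmt == target:
            return value
        for (src, dst), trf in TRF.items():
            if src == fmt and dst not in seen:
                seen.add(dst)
                queue.append((dst, trf(value)))
    return None


if __name__ == "__main__":
    # No direct abbr -> colloquial edge exists, so the search goes via "formal".
    print(reformat("EDSH 225", "colloquial"))
```

Because the graph is connected, a string in any recognized format can reach any other format by chaining trf functions, even when no direct edge exists between the two.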

7 Topes + CoScripter / Vegemite
We have started integrating topes with CoScripter
– To recognize a string as a certain kind of data
– To clean up strings when incorrectly formatted
– To reformat strings to a more convenient format
Quick Demos

8 Other Topes Research
Prior work:
– Inference of formats from example strings
– UI so that end users can define new topes
– Using topes to validate and reformat data in spreadsheets, databases, and web application UIs
Future work:
– Repository so that users can share topes with one another
– Statistical techniques for automatically identifying and correcting incorrectly implemented topes