Presentation is loading. Please wait.

Presentation is loading. Please wait.

. gov Toward Digital Government: The Case of Government Statistics Gary Marchionini University of North Carolina at Chapel Hill www.ils.unc.edu/govstat.

Similar presentations


Presentation on theme: ". gov Toward Digital Government: The Case of Government Statistics Gary Marchionini University of North Carolina at Chapel Hill www.ils.unc.edu/govstat."— Presentation transcript:

1 . gov Toward Digital Government: The Case of Government Statistics Gary Marchionini University of North Carolina at Chapel Hill www.ils.unc.edu/govstat NSF Grants EIA 0131824 and EIA 0129978 Principal Investigators: Gary Marchionini, Stephanie Haas, Ben Shneiderman, Catherine Plaisant, and Carol Hert

2 . gov Digital Government: Leveraging IT Government information dissemination –Websites –Other publications (no mass emailings yet) Transactions –Registrations –Census, regulatory filings –Taxes Policy making –E-voting –E-rules Our work focuses on statistical information and agencies as many important decisions by policy makers and citizens depend on statistics

3 . gov Preliminary Work 1996-2000 Human needs –Interviews (agencies, public) –Transaction log analysis –Email content analysis System development and testing –Novel interfaces –Information architecture –Usability studies

4 . gov Focus on Tables 1998-2000 Table browser –Java applet –DTD for tables (DC and DDI influence) –XML protocol –Mapping metadata elements to interface control mechanisms –Piping data from large databases to applet –User studies Metadata to aid understanding

5 . gov Statistical Knowledge Network 2003-2006 Create SKN prototype with agency partners Integration –Horizontal integration across federal agencies (BLS, EIA, NCHS, Census, SSA, NASS) –Vertical integration from local/state Focus on non-specialists –Help crucial –Metadata drives help User interfaces are the intermediaries to link people and data Find what you need, understand what you find

6 . gov Data Flow agency data with integrated metadata agency with multiple metadata repositories agency backend data and metadata Distributed Public Intermediary: variable/concept level, XML-based incorporating ISO 11179 and DDI providing java-based statistical literacy tools to user interfaces Statistical Ontology firewall Domain Experts End User Communities Domain Ontologies I n t e r f a c e s U s e r end user end users: interact with data from information/concept perspective, not agency perspective membrane end user end user end user end user

7 . gov Statistical Knowledge Network Architecture Agencies SKN Registry Actions Contribute Find Display Annotate Understand Manipulate Collaborate ….. …………. Objects Actions Private Work Space Objects Actions Private Work Space Objects Actions Private Work Space OntologyRules & Constraints SKN Consortium …... gov Objects Reports metadata Tables metadata People metadata Glossary Annotations

8 . gov Interface Prototypes: Find, Display, Understand; Leverage Metadata, Glossary, Ontology Relation Browser Mulitlayered help: treemaps, video help Animated Glossary Contextualizer PairTrees Spatial audio for maps Missing Data

9 . gov Use Case Scenarios to Guide Design Based on discussions with agency partners 20 scenarios 4 detailed with in depth resources located Used to ground ongoing work

10 . gov Relation Browser++ displaying all webpages EIA

11 . gov RB++ with Cursor Over Residential Sector

12 . gov RB++ showing ‘hous’ typed in title field

13 . gov Multi-layered interfaces 1 level 3 levels of growing complexity map+table +filters map+table +filters +scatterplot map+table +filters +scatterplot

14 . gov Animated Demonstration Features

15 . gov Script Guidelines Base the script on a live demonstration (never on a written description) –Focus on tasks (not tours of widgets or conceptual overviews) –Act out the interaction (with minimum description) then describe results in context of task –Start with a tour of main screen components (orient and introduce vocabulary) 5-10 sec. max –Plan a linear sequences made of very short autonomous chunks (15-60 sec.) Map the chunks to existing online documentation Show text title at beginning of each chunk Carefully synchronize voice and visual (hard when alone) Provide duration and file size for individual chunk

16 . gov Interactive Glossary Development Tools Provide foundation for content development Separate content development from presentation development Reduce overall development time Maximize reuse of existing elements Create multiple presentations from a single content development effort

17 . gov Animation Template

18 . gov Content Foundation Template (SIG) Question initial motivation Answer overview, definition Process explanation, equation Example Result statistic, answer Review summary, interpretation

19 . gov Animation Template Consistent display and interaction for all animations Presents animation and explanatory text simultaneously Navigate (forward and back) through animation segments Complete review of text at any time

20 . gov Animation Template Three pieces: text, animations, template Text is tagged with content section tags in a separate text file Animation consists of segments in individual animation files Text and animation segments coordinated by placement in template

21 . gov ontology Semantic level Classes Relationships Constraint rules DTD/XML Schema Structural level Elements Attributes Datatypes SKN Ontology DTD / XML Schema Interface Tools Statistical Interactive Glossary (SIG) Ontology Applications  Knowledge organization  Content and terminology control  Data integration  Query support  Automatic classification support  Reasoning mechanism  Others modeling implementation

22 . gov unit aged unit aged unit married couples living together, with husband or wife aged 65 or older age SSA household Domain knowledge Operational knowledge estimate poverty estimate poverty benefit Census Bureau FIFARS earning salary wage income family distribution

23 . gov Project DTD Investigate DDI and ISO 11179 Leverage DDI and data cubes Markup a set of objects –Tables –Reports/press releases Use markup to build added value search (find what you need) and help (understand what you find) support into interfaces

24 . gov The Basic Structure entDscr_1: description of an entity within the marked up document docDscr : description of the markup-what is being marked-up, who marked it up, etc. entDscr_2: description of an entity within the marked up document varDscr_1: description of each variable within an entity, study group or document stdygrpDscr: describes the “group” to which an entity or document belongs such as a survey program nCubeDscr: used when entity is an aggregated table fileDscr: descripes physical file structures for nCubes varDscr_2: description of each variable within an entity, study group or document

25 . gov One Example of How the DTD Helps The DTD can help bring the “expert knowledge” to the less expert user and bring relevant information together by enabling searching via variables as well as subjects/keywords

26 . gov Median income, by age, 2001 age persons Age 1 65-69 2 70-74 3 75-79 4 80 or older

27 . gov Discovering Metadata Hybrid machine learning approach –Crawl website –Create term document matrices –Use k-means clustering with small K to fit on screen in RB++ –Revise Use structure in the existing sites to train a classifier For small n of concepts, classify site

28 . gov Combining Machine Learning and Dynamic Interfaces What should these topics be, and how do we know if we’ve found the right names for them?

29 . gov Combining Machine Learning and Dynamic Interfaces How do we assign thousands of documents to their respective topics?

30 . gov Initial, Unstructured Approach doc

31 . gov Initial, Unstructured Approach doc

32 . gov Initial, Unstructured Approach doc This approach yielded intuitively coherent clusters. But the clusters fall at too fine a level of granularity, while also wasting large portions of the data. Clustering Based on Word Distributions

33 . gov New Approach, Semi-Supervised

34 . gov New Approach, Semi-Supervised doc

35 . gov New Approach, Semi-Supervised doc This approach capitalizes on the agencies’ efforts and expertise, and so far seems to yield superior results. However, the amount of training data is very sparse, and the observed categories have high correlation in some cases. Our current work addresses these tuning issues.

36 . gov State Statistical Office USDA / NASS State Cooperative Agency (Dept. of Agriculture,etc.) Farmers & Producers Statistical Consumers Supply data to agencies Obtain data from agencies Collection agents Vertical Integration: Agriculture

37 . gov Multiple Research Threads for the SKN Interfaces Metadata and Ontology Multi-leveled help Automatic slicing and dicing User needs and user testing Cross agency cooperation See www.ils.unc.edu/govstatwww.ils.unc.edu/govstat


Download ppt ". gov Toward Digital Government: The Case of Government Statistics Gary Marchionini University of North Carolina at Chapel Hill www.ils.unc.edu/govstat."

Similar presentations


Ads by Google