METRIDOC: A Framework for Managing and Exposing Library Event Data With the support of University of Pennsylvania Libraries.

Presentation transcript:

METRIDOC: A Framework for Managing and Exposing Library Event Data With the support of University of Pennsylvania Libraries

METRIDOC University of Pennsylvania Libraries Metrics start with a basic abstraction: The Event

The Event as raw data
xxx.xx.xxx.xxx|-|zucca|[26/Jul/2007:15:41: ]| GET tp:// t=psycinfo&adv=1 HTTP/1.1| 302|0| Mozilla/5.0 (Macintosh; U; PPC Mac OS X; en) AppleWebKit/ (KHTML, like Gecko) Safari/419.3| NGpmb6dT6JXswQH|__utmc= ;ezproxy=NGpmb6dT6JXswQH; hp=/; proxySessionID= ; __utmc= ; __utmz= utmccn=(direct)|utmcsr=(direct)|utmc md=(none);UPennLibrary=AAAAAUaWP5oAACa4AwOOAg==; sfx_session_id=s6A37A3E0-3B8E-11DC-80E985076F88F67F
Caption: Viewing an e-journal article.
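A minimal sketch of how a pipe-delimited log line like the one above might be split into named fields. The field names and their order are assumptions read off the slide's sample, not a documented log format, and `SAMPLE` is an invented line rather than real data. The cookie field itself contains pipes, so only the first nine delimiters are treated as separators.

```python
# Field names are hypothetical labels chosen for this sketch.
FIELDS = (
    "ip", "ident", "user", "timestamp", "request",
    "status", "bytes", "user_agent", "session", "cookies",
)


def parse_event(line: str) -> dict:
    """Split one raw log line into a dict keyed by FIELDS."""
    # Limit the split so pipes inside the trailing cookie field survive.
    parts = line.rstrip("\n").split("|", len(FIELDS) - 1)
    if len(parts) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} fields, got {len(parts)}")
    record = dict(zip(FIELDS, (part.strip() for part in parts)))
    record["timestamp"] = record["timestamp"].strip("[]")
    record["status"] = int(record["status"])
    record["bytes"] = int(record["bytes"])
    return record


# Invented example line; every value here is a placeholder.
SAMPLE = (
    "192.0.2.10|-|jdoe|[26/Jul/2007:15:41:00 -0400]|"
    "GET http://example.org/article?id=1 HTTP/1.1|302|0|"
    "Mozilla/5.0 (Macintosh)|SESSION123|"
    "ezproxy=SESSION123; utmccn=(direct)|utmcsr=(direct)"
)

event = parse_event(SAMPLE)
```

Capping the number of splits is the main design choice here: it keeps the parser simple while tolerating delimiters inside the last field.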

An Event Abstracted
User & Program Parameters: College | Dept; Rank; Course; Host College; Host Dept; Instructor; Grant Sponsor
Library Parameters: Service; Genre; Cognizant Staff; Organizational Unit; Budget Center
Bibliographic Parameters: Title; URI; Format; Cost | Supplier
Environmental Parameters: Date | Time; Location; IP Domain; URL
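One way to picture the abstracted event is as a record with four parameter groups. The sketch below is illustrative only: the class name, attribute names, and the `flatten` helper are hypothetical and do not come from the slides.

```python
from dataclasses import dataclass, field


@dataclass
class Event:
    """An event described by the four parameter groups on the slide."""
    user_program: dict = field(default_factory=dict)
    library: dict = field(default_factory=dict)
    bibliographic: dict = field(default_factory=dict)
    environmental: dict = field(default_factory=dict)

    def flatten(self) -> dict:
        """Merge the groups into one row, prefixing each key with its group."""
        row = {}
        for group in ("user_program", "library", "bibliographic", "environmental"):
            for key, value in getattr(self, group).items():
                row[f"{group}.{key}"] = value
        return row


# Placeholder values for illustration.
example = Event(
    user_program={"rank": "faculty"},
    environmental={"url": "http://example.org"},
)
```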

The “Event” is represented in machine-readable data, stored in a plethora of business systems.
Source / Target: Link resolver; Proxy server; COUNTER; ILS (Voyager, I3, Kuali-OLE); Resource sharing system; Web server; Social networking services; Spreadsheets, databases; Other targets…
Event Types: E-resource use by service, demographic, package; Expenditures & inventory; Planning / reader interest data; Supply chain data; Discovery systems & content use; Research & instructional data; Learning management; Other events…

MetriDoc is a framework for:
Extracting event data from systems
Transforming those data into readable, normalized formats
Loading the transformed, normalized payload into a repository
Supporting analysis through local and collaborative dissemination channels
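The four steps above can be sketched as a small pipeline. This is not MetriDoc code: the function names, the three-column input layout, the `event` table, and the sample rows are all assumptions made for illustration, with SQLite standing in for the repository.

```python
import sqlite3
from typing import Iterable, Iterator


def extract(lines: Iterable[str]) -> Iterator[list]:
    """Pull non-empty rows out of a raw, pipe-delimited source."""
    for line in lines:
        line = line.strip()
        if line:
            yield line.split("|")


def transform(rows: Iterable[list]) -> Iterator[dict]:
    """Normalize case, whitespace, and types."""
    for patron, service, status in rows:
        yield {
            "patron": patron.strip().lower(),
            "service": service.strip().lower(),
            "status": int(status),
        }


def load(records: Iterable[dict], conn: sqlite3.Connection) -> int:
    """Write normalized records to the repository; return how many were written."""
    rows = list(records)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS event (patron TEXT, service TEXT, status INTEGER)"
    )
    conn.executemany("INSERT INTO event VALUES (:patron, :service, :status)", rows)
    conn.commit()
    return len(rows)


def use_by_service(conn: sqlite3.Connection) -> list:
    """A simple analysis query: event counts per service."""
    return conn.execute(
        "SELECT service, COUNT(*) FROM event GROUP BY service ORDER BY service"
    ).fetchall()


# Invented input rows.
RAW = ["JDoe | EZproxy | 200", "", "asmith|SFX|302", "jdoe|ezproxy|302"]
```

Keeping each stage a plain function over iterables means a stage can be swapped (a different source, a different repository) without touching the others.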

Improved Data Resolution Through Integration
Increased scope of sources
Synthesis of vectors, e.g. expenditure per use; resource use by communities
Contextualized data with greater statistical dimension and descriptive power
Collaborative assessment
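As an illustration of synthesizing two vectors, expenditure per use divides what was spent on a resource by how often it was used. The function name and the figures below are invented for this sketch; resources with no recorded use are reported as `None` rather than dividing by zero.

```python
from typing import Optional


def cost_per_use(expenditures: dict, uses: dict) -> dict:
    """Combine an expenditure vector and a use vector, keyed by resource."""
    result: dict = {}
    for resource, cost in expenditures.items():
        count = uses.get(resource, 0)
        value: Optional[float] = round(cost / count, 2) if count else None
        result[resource] = value
    return result


# Placeholder numbers, not real library data.
spend = {"journal_a": 1200.0, "journal_b": 300.0}
use = {"journal_a": 400}
ratios = cost_per_use(spend, use)
```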

Our legacy system: Datafarm
[Diagram: three Perl cron jobs; Voyager, Farmer, Quaker, App Logs]

Datafarm Shortcomings
Maintainability issues: scripts that depend on each other are located in different places; Perl is very productive as long as you are maintaining your own code; the same work is done over again, with no code reuse; there is no notification of success or failure.
Not shareable: no safe way to expose data for collaboration; generating data for a report can be a job in itself; schemas are not stored in a shareable format.
Not reusable: the same things are done over and over without building libraries for common tasks; there is no central code repository to share libraries within and outside of UPenn.

What we need: Who takes care of it
A central scheduler: Jenkins
Notifications of job success or failure: Jenkins
Batch job / ETL scripting framework: Metridoc
Exposing data: Metridoc (Google data format)
Reporting / graphs: Google Charts / R / Tableau / other stat packages
Central code repository: Maven Central via Sonatype hosting
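The slides do not show what "Google data format" looks like. Assuming it refers to the DataTable JSON layout that Google Charts accepts (a `cols` list describing columns and a `rows` list of cells), a sketch of producing it might look like the following. The helper name and the sample figures are hypothetical.

```python
import json


def to_datatable(columns: list, rows: list) -> str:
    """Serialize tabular data in the Google Charts DataTable JSON layout.

    columns: list of (id, label, type) tuples, e.g. ("uses", "Uses", "number")
    rows:    list of sequences, one value per column
    """
    table = {
        "cols": [
            {"id": cid, "label": label, "type": ctype}
            for cid, label, ctype in columns
        ],
        "rows": [{"c": [{"v": value} for value in row]} for row in rows],
    }
    return json.dumps(table)


# Invented example data.
payload = to_datatable(
    [("service", "Service", "string"), ("uses", "Uses", "number")],
    [("ezproxy", 2), ("sfx", 1)],
)
```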

Current System: Metridoc
[Diagram: three Perl cron jobs; Voyager, Farmer, Quaker, App Logs]

Metridoc Philosophy

Scripting Framework

Scripting Example

Scripting Example

Exposing data

Metrics on the cheap (Google Charts)

Thoughts on complex statistics

The future

Abstracts 4 key functions, exposes interfaces for interoperability
1. Extract: target source (e.g. Relais, ILLiad, ILS); Ingest Log Parse Format; refined output
2. Transform: resolution sources (e.g. IdM, WorldCat); resolve codes & IDs; normalize; refined output
3. Load: query service; data repository
4. Query: user interface; local data stores; query document; results document
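To make the transform and query functions concrete, the sketch below resolves an opaque patron ID against a lookup table (standing in for a resolution source such as an identity management system), normalizes a field, loads the result, and answers a simple query. All names, the in-memory `Repository` class, and the data are assumptions for illustration and are not MetriDoc interfaces.

```python
from typing import Callable, Iterable


def make_resolver(lookup: dict) -> Callable[[dict], dict]:
    """Build a transform step that resolves patron IDs to departments."""
    def resolve(record: dict) -> dict:
        out = dict(record)
        out["dept"] = lookup.get(record["patron_id"], "unknown")
        return out
    return resolve


def normalize(record: dict) -> dict:
    """Normalize the source system name."""
    out = dict(record)
    out["source"] = record["source"].strip().lower()
    return out


class Repository:
    """In-memory stand-in for the data repository and its query service."""

    def __init__(self) -> None:
        self._rows: list = []

    def load(self, records: Iterable[dict]) -> None:
        self._rows.extend(records)

    def query(self, **criteria) -> list:
        """Return rows whose fields equal every given criterion."""
        return [
            row for row in self._rows
            if all(row.get(key) == value for key, value in criteria.items())
        ]


# Invented extracted records and lookup table.
extracted = [
    {"patron_id": "p1", "source": " ILLiad "},
    {"patron_id": "p2", "source": "Relais"},
    {"patron_id": "p9", "source": "ILS"},
]
resolve = make_resolver({"p1": "history", "p2": "biology"})

repo = Repository()
repo.load(normalize(resolve(record)) for record in extracted)
history_rows = repo.query(dept="history")
```

Because each step takes and returns plain records, a different resolution source or repository can be substituted behind the same call shape, which is one reading of "exposes interfaces for interoperability".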

Partners are welcome Sponsor More at