Repository Statistics Peter Millington Technical Development Officer SHERPA, University of Nottingham.

Slides:



Advertisements
Similar presentations
Using the SQL Access Advisor
Advertisements

Más de 1000 repositorios a su disposición OpenDOAR y ROAR IX Worksop REBIUN, Salamanca, 2-3 Oct.2009 Peter Millington SHERPA Technical.
1
1 L U N D U N I V E R S I T Y Integrating Open Access Journals in Library Services & Assisting Authors in choosing publishing channels 4th EBIB Conference.
Select from the most commonly used minutes below.
Copyright © 2003 Pearson Education, Inc. Slide 7-1 Created by Cheryl M. Hughes The Web Wizards Guide to XML by Cheryl M. Hughes.
Myra Shields Training Manager Introduction to OvidSP.
EPrints.FRI – a case study Open Access Repositories with EPrints EIFL-FOSS and EIFL-OA free online workshop 23 May 2011
Counting on OpenDOAR Peter Millington SHERPA Technical Development Officer CRC, University of Nottingham
COUNTER Update Peter Shepherd Project Director COUNTER STM Innovations Seminar, 2 December 2005.
The New Improved OpenDOAR Directory of OA Repositories Peter Millington SHERPA Technical Development Officer University of Nottingham, England.
Peter Millington SHERPA Technical Development Officer University of Nottingham, England Policy Tool Digital Repositories: Dealing with the Digital Deluge,
EIFL Open Access Workshop, 21-Sep-2006, Poznan OpenDOAR The Directory of OA Repositories Peter Millington SHERPA Technical Development Officer University.
Version Policies and the OpenDOAR Policies Tool Peter Millington, University of Nottingham Version Identification Workshop, London, 22-Apr-2008.
Implications of Release 3 of the COUNTER Code of Practice Vendor Usage Reports: Are we all on the same page now? Charleston Conference November 6, 2008.
Search, access and impact: Web citation services Tim Brody Intelligence, Agents, Multimedia Group University of Southampton.
EPrints Web Configuratio n Management. SQL database Web server Scripts to configure repository activities Configuration files EPrints - the Administrator's.
28 April 2004Second Nordic Conference on Scholarly Communication 1 Citation Analysis for the Free, Online Literature Tim Brody Intelligence, Agents, Multimedia.
Rclis in vision and reality Thomas Krichel
The IR on the International Stage Mary Robinson SHERPA, University of Nottingham Embedding Repositories event, University of Lincoln,
Introduction to HTML, XHTML, and CSS
Local Customization Chapter 2. Local Customization 2-2 Objectives Customization Considerations Types of Data Elements Location for Locally Defined Data.
UKCoRR meeting Kingston University, November 2007 Mary Robinson European Development Officer University of Nottingham, UK
OpenDOAR and ROAR RSP Services Day, Bath, 15 th Jan.2009 Peter Millington SHERPA Technical Development Officer SHERPA, University.
SHERPA Din guide til det åpne landskapet 31. oktober 2007 Peter Millington SHERPA Technical Development Officer SHERPA, University.
RoMEO, JULIET & OpenDOAR Services that can enhance your repository JISC Repositories & Preservation Programme Meeting, Bristol,
Open Scholarship 2006 Bielefeld Academic Search Engine a Scientific Search Service for Institutional Repositories Open Scholarship 2006 New Challenges.
Corpus Linguistics Richard Xiao
LIBRARY WEBSITE, CATALOG, DATABASES AND FREE WEB RESOURCES.
Photo Slideshow Instructions (delete before presenting or this page will show when slideshow loops) 1.Set PowerPoint to work in Outline. View/Normal click.
Welcome. © 2008 ADP, Inc. 2 Overview A Look at the Web Site Question and Answer Session Agenda.
Break Time Remaining 10:00.
PP Test Review Sections 6-1 to 6-6
Campaign Overview Mailers Mailing Lists
1 DARTBOARD Tutorial: DARTBOARD Access and Use for Faculty and Staff Tutorial: DARTBOARD Access and Use for Faculty and Staff.
1 The information industry and the information market Summary.
Svetlin Nakov Telerik Corporation
Physical Aspects [Reflection Modelling] Hauptseminar: Augmented Reality for Driving Assistance in Cars.
Collections and services in the information environment JISC Collection/Service Description Workshop, London, 11 July 2002 Pete Johnston UKOLN, University.
Operations to Serve You 05/17/ The Service Desk Provides an Announcement Page? The Service Desk houses a library of SOLUTIONS that are available.
XML and Databases Exercise Session 3 (courtesy of Ghislain Fourny/ETH)
1 What is JavaScript? JavaScript was designed to add interactivity to HTML pages JavaScript is a scripting language A scripting language is a lightweight.
RoMEO, JULIET and OpenDOAR: A Tale with a Happy Ending!
EPrints.FRI – a case study Open Access: Maximising Research Impact in Sofia Sofia, 23 April 2009
Sample Service Screenshots Enterprise Cloud Service 11.3.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
1..
Mobility Tool Fremtidens afrapportering 2013 – Erasmus Mobilitet / IP 2014 – Erasmus+ aktioner.
Facebook Pages 101: Your Organization’s Foothold on the Social Web A Volunteer Leader Webinar Sponsored by CACO December 1, 2010 Andrew Gossen, Senior.
By CA. Pankaj Deshpande B.Com, FCA, D.I.S.A. (ICA) 1.
Macromedia Dreamweaver MX 2004 – Design Professional Dreamweaver GETTING STARTED WITH.
2004 EBSCO Publishing Presentation on EBSCOadmin.
: 3 00.
5 minutes.
DIKLA GRUTMAN 2014 Databases- presentation and training.
WorkKeys Internet Version Training
Clock will move after 1 minute
CINAHL Keyword Searching. This presentation will take you through the procedure of finding reliable information which can be used in your academic work.
Chapter 13 Web Page Design Studio
Select a time to count down from the clock above
RefWorks: The Basics October 12, What is RefWorks? A personal bibliographic software manager –Manages citations –Creates bibliogaphies Accessible.
CFR 250/590 Introduction to GIS, Autumn 1999 Data Search & Import © Phil Hurvitz, find_data 1  Overview Web search engines NSDI GeoSpatial Data.
Scientific writing (81-933) Lecture 6: References Dr. Avraham Samson Faculty of Medicine in the Galilee 1.
Introduction Peter Dolog dolog [at] cs [dot] aau [dot] dk Intelligent Web and Information Systems September 9, 2010.
Electronic Theses at Rhodes University presented by Irene Vermaak Rhodes University Library National ETD Project CHELSA Stakeholder Workshop 5 November.
EPrints statistics at the University of Northampton Statistics for repositories: DSpace and Eprints 26/2/2013
COUNTER Update February 2006.
OpenDOAR and ROAR RSP Services Day, Nottingham, 23rd Apr.2008
Presentation transcript:

Repository Statistics Peter Millington Technical Development Officer SHERPA, University of Nottingham

Overview Introduction Global statistics The what & why of repository statistics Benchmarks & data sources Compilation methods Web usage logging tools Google Analytics demo Problems and solutions Group session – Key issues

Global Repository Statistics Data Sources – Global lists of repositories OpenDOAR- ROAR- Repository66- May be useful for advocacy work Examples of types of chart & presentation

ROAR – Individual Growth Charts

ROAR – Individual Source Data MonthRecords Archives MonthRecords Archives

Delegates What and Why of Statistics Rate of growth For advocacy Measure of success – for our paymasters Rate of usage Targeting weak areas – departments Measure of success Justifying funding Most downloaded author/paper Promotes interest and engagement from authors

Delegates What and Why of Statistics Where are visitors coming from – referrers Curiosity – is it being seen by the right people Citation statistics To demonstrate the beneficial impact of repositories Drilling down for more detail For a sense reality Steep slopes, animation, etc Glitzy marketing

Individual Repositories - Content Growth & Deposition rates Measure of progress Impact of advocacy events Impact of mandatory deposition Types of document or item Trend-watching? Breakdown by department and/or author How much is everyone contributing? Proportion of full text v metadata only Measure of usefulness

Item types: Universidade do Minho

Individual Repositories - Performance Proportion of publications deposited How comprehensive is the archive? Proportion of authors who are depositing Are they complying with local mandates? Compliance with funders mandates Are you meeting your obligations? Repository administration Are your turn round times acceptable?

Compliance with the CERN Mandate

Compliance Benchmarks Counting publications Institution-wide bibliographies e.g. Maintained by research managers Publication lists on departmental web pages Public/Commercial databases – ISI, Medline, etc Counting authors Who qualifies as an author? Academic staff, Research students, Managers University Calendars & Departmental staff lists

Individual Repositories - Usage Rates of usage Measure of usefulness Impact of news-related items Most downloaded items Identifying research(ers) with most impact? Engendering competition between authors? Downloads according to author Performance reviews? Geographical distribution of users Are you reaching your intended audience?

Sources of Data Repositorys own database OAI-PMH Servers access log Remote logging

Compilation Methods Repositorys own database Copying from the human interface Interactive SQL commands

Copying from the Human Interface

Interactive SQL Commands mysql> SELECT type,COUNT(*) FROM eprint GROUP BY type; | type | COUNT(*) | | article | 456 | | book | 5 | | book_section | 39 | | conference_item | 173 | | exhibition | 1 | | monograph | 18 | | other | 3 | | thesis | 4 | rows in set (0.00 sec)

Compilation Methods Repositorys own database Copying from the human interface Interactive SQL commands OAI-PMH Harvesting programs – e.g. ROARs Celestial

OAI-PMH ListIdentifiers

OAI-PMH ListRecords

ROAR - Celestial dateidentifierurl oai:bora.uib.no:1956/2270Department of Earth Science oai:bora.uib.no:1956/2272Department of History oai:bora.uib.no:1956/2273Department of the History of Religions oai:bora.uib.no:1956/2274Section for Endocrinology oai:bora.uib.no:1956/2275Department of the History of Religions oai:bora.uib.no:1956/2276Department of the History of Religions oai:bora.uib.no:1956/2277Department of the History of Religions oai:bora.uib.no:1956/2278Department of the History of Religions oai:bora.uib.no:1956/2279Department of Oral Sciences oai:bora.uib.no:1956/2281Department of the History of Religions oai:bora.uib.no:1956/2282Department of Sociology oai:bora.uib.no:1956/2283Else Æyen oai:bora.uib.no:1956/2284Section for Art History oai:bora.uib.no:1956/2285Section for Russian oai:bora.uib.no:1956/2286Department of Geography oai:bora.uib.no:1956/2287Department of Greek, Latin and Egyptology oai:bora.uib.no:1956/2288Section for Spanish oai:bora.uib.no:1956/2289Department of Mathematics oai:bora.uib.no:1956/2290Department of Geography oai:bora.uib.no:1956/2291Department of Geography oai:bora.uib.no:1956/2292Department of Biology oai:bora.uib.no:1956/2293Department of Biology

Compilation Methods Repositorys own database Copying from the human interface Interactive SQL commands OAI-PMH Harvesting programs – e.g. ROARs Celestial Servers access log Web usage statistics tools

Raw Web Access Logs [10/Apr/2005:05:34: ] "GET /portfolio.css HTTP/1.0" "-" "ia_archiver" [10/Apr/2005:07:16: ] "GET /DAWN_Index.htm HTTP/1.0" "-" "ia_archiver" [10/Apr/2005:07:17: ] "GET /Eric.htm HTTP/1.0" "-" "ia_archiver" [10/Apr/2005:07:21: ] "GET /Library_Form.htm HTTP/1.0" "-" "ia_archiver" [10/Apr/2005:07:22: ] "GET /cleansing.htm HTTP/1.0" "-" "ia_archiver" [10/Apr/2005:07:25: ] "GET /index.htm HTTP/1.0" "-" "ia_archiver" [10/Apr/2005:07:28: ] "GET /integration.htm HTTP/1.0" "-" "ia_archiver" [10/Apr/2005:07:31: ] "GET /merging.htm HTTP/1.0" "-" "ia_archiver" [10/Apr/2005:07:34: ] "GET /publication.htm HTTP/1.0" "-" "ia_archiver" [10/Apr/2005:08:22: ] "GET /ABACUS_Index.htm HTTP/1.0" "-" "ia_archiver" [10/Apr/2005:08:27: ] "GET /limitations.htm HTTP/1.0" "-" "ia_archiver" [20/Dec/2004:13:22: ] "GET /robots.txt HTTP/1.1" "-" "gazz/ [20/Dec/2004:13:23: ] "GET / HTTP/1.1" "-" "gazz/ [20/Dec/2004:13:25: ] "GET /Logo.gif HTTP/1.1" "-" "gazz/ [20/Dec/2004:13:27: ] "GET /contact.htm HTTP/1.1" "-" "gazz/ [20/Dec/2004:13:29: ] "GET /profile.htm HTTP/1.1" "-" "gazz/ [20/Dec/2004:13:37: ] "GET /index.htm HTTP/1.1" "-" "gazz/ [20/Dec/2004:13:47: ] "GET /publication.htm HTTP/1.1" "-" "gazz/ [20/Dec/2004:13:49: ] "GET /InsideInfo.jpg HTTP/1.1" "-" "gazz/5.0 Recorded fields include: IP Address of the computer requesting a file Date & time transaction completed Name of file requested Success code – usually 200 for successfully completed File size in bytes

Web Usage Statistics Tools Analog Webalizer AWStats etc.

Sample output from the Analog Statistics Package

Sample output from the Webalizer Statistics Package

Sample output from the AWStats Statistics Package

Compilation Methods Repositorys own database Copying from the human interface Interactive SQL commands OAI-PMH Harvesting programs – e.g. ROARs Celestial Servers access log Web usage statistics tools Remote logging Google Analytics

Sign up to a Google Account Specify the URL to be logged Obtain snippet of JavaScript code Insert snippet into HTML of pages to be logged Ideally into a template file Make sure the modified pages are live! Logging starts automatically Log in to your account to view the analytics

Google Analytics JavaScript snippet var gaJsHost = ((" == document.location.protocol) ? " : " document.write(unescape("%3Cscript src='" + gaJsHost + "google-analytics.com/ga.js' type='text/javascript'%3E%3C/script%3E")); var pageTracker = _gat._getTracker("UA "); pageTracker._initData(); pageTracker._trackPageview(); Find URL Containing/Excluding String e.g.pdf Regular expressions e.g./[0-9]*/for EPrints IDs

Problems Web bots and crawlers Inflating usage volume Scewing usage time series Auxiliary files & non-eprint pages CSS style sheet files Image files – jpeg, gif, etc. Index pages Linking URLs to bibliographic references What does that eprint number mean?

Problems and Solutions Web bots and crawlers Use robots.txt & meta robots tags to prevent crawling Filtering out known bots Still leaves maverick hackers & students bots Auxiliary files & non-eprint pages Configuring & tuning the analysis tool Filter using regular expressions Linking URLs to bibliographic references Programmatic concordance e.g. IRStats

Over to Chris for DSpace statistics…

What are your priorities for statistics?

Peter Millington