Presentation is loading. Please wait.

Presentation is loading. Please wait.

March 20, 2008Electronic Resources and Libraries College Center for Library Automation Tallahassee, FL Susan B. Campbell Susan.

Similar presentations


Presentation on theme: "March 20, 2008Electronic Resources and Libraries College Center for Library Automation Tallahassee, FL Susan B. Campbell Susan."— Presentation transcript:

1 March 20, 2008Electronic Resources and Libraries College Center for Library Automation Tallahassee, FL Susan B. Campbell (scampbell@cclaflorida.org) Susan B. Campbell (scampbell@cclaflorida.org) Jim McGill Jim McGill(jmcgill@cclaflorida.org)

2 March 20, 2008Electronic Resources and Libraries automating retrieval and reporting of database usage statistics for a consortium CCLA provides and maintains the Library Information Network for 28 Community Colleges (LINCC) for Florida's 65+ community college libraries. CCLA provides and maintains the Library Information Network for 28 Community Colleges (LINCC) for Florida's 65+ community college libraries. db statistics we’re collecting and reporting db statistics we’re collecting and reporting 19 vendors 19 vendors over 200 databases over 200 databases monthly reports by database, campus, statewide monthly reports by database, campus, statewide on demand on demand customers for monthly reports customers for monthly reports 28 community colleges in Florida 28 community colleges in Florida internal reports internal reports

3 March 20, 2008Electronic Resources and Libraries automating retrieval and reporting of database usage statistics for a consortium problem problem what we were doing and why it doesn’t work what we were doing and why it doesn’t work solution solution the pieces, the parts and how they fit together the pieces, the parts and how they fit together future future what we’ve learned and our expectations what we’ve learned and our expectations problem problem what we were doing and why it doesn’t work what we were doing and why it doesn’t work

4 March 20, 2008Electronic Resources and Libraries excel excess excel excess the problem

5 March 20, 2008Electronic Resources and Libraries the problem vendor variety vendor variety repeat 28 times or more for each vendor (and sometimes each database)

6 March 20, 2008Electronic Resources and Libraries

7 March 20, 2008Electronic Resources and Libraries

8 March 20, 2008Electronic Resources and Libraries

9 March 20, 2008Electronic Resources and Libraries automating retrieval and reporting of database usage statistics for a consortium problem problem what we were doing and why it doesn’t work what we were doing and why it doesn’t work solution solution the pieces, the parts and how they fit together the pieces, the parts and how they fit together future future what we’ve learned and our expectations what we’ve learned and our expectations

10 March 20, 2008Electronic Resources and Libraries the solution automating automating maintenance utilities maintenance utilities handling retrieved data handling retrieved data reporting in multiple formats reporting in multiple formats retrieval of vendor data retrieval of vendor data

11 March 20, 2008Electronic Resources and Libraries intranet web interface

12 March 20, 2008Electronic Resources and Libraries

13 March 20, 2008Electronic Resources and Libraries

14 March 20, 2008Electronic Resources and Libraries Vendor not responding

15 March 20, 2008Electronic Resources and Libraries

16 March 20, 2008Electronic Resources and Libraries

17 March 20, 2008Electronic Resources and Libraries

18 March 20, 2008Electronic Resources and Libraries

19 March 20, 2008Electronic Resources and Libraries

20 March 20, 2008Electronic Resources and Libraries

21 March 20, 2008Electronic Resources and Libraries

22 March 20, 2008Electronic Resources and Libraries reporting

23 March 20, 2008Electronic Resources and Libraries

24 March 20, 2008Electronic Resources and Libraries creating retrieval scripts “nuts and bolts”

25 March 20, 2008Electronic Resources and Libraries

26 March 20, 2008Electronic Resources and Libraries

27 March 20, 2008Electronic Resources and Libraries This is a manual process to create the Perl script that will accept variables and create GetWebPage_VENDOR.pl step 1. capture HTTP headers Process Trace File (ParseHTTPTrace.pl) Generic Web Page retrieval (GetWebPage_VENDOR.pl)

28 March 20, 2008Electronic Resources and Libraries step 2. modify Perl script to accept command line variables to reformat standard YYYYMM format to two separate variables: MM and YYYY for URL $Period=$ARGV[0]; $ScopeCustID=$ARGV[1]; $UserName=$ARGV[2]; $Password=$ARGV[3]; #$ScopeCustID="bcc"; #$Period="200701"; $yr=substr($Period,0,4); $mon=substr($Period,4,2); if ($mon < 10) {$mon=~s/0//gi;}; YYYYMM - our DB format vendor specific scope customer ID remarks - unremarked for testing

29 March 20, 2008Electronic Resources and Libraries Step 3. modify script with command line variables and parse runtime variables... iodFromMonth='. $mon. '&timePeriodFromYear='. $yr. '&timeP... $content0=$resp5->content; $pos=index($content0,"VIEWSTATE")+13; $pos2=substr($content0,$pos,5000); $pos3=index($pos2,"value")+7; $pos4=index($pos2,"\/>"); $VIEWSTATE=substr($pos2,$pos3,$pos4-$pos3-2); $VIEWSTATE=~s/\//\%2F/gi; $VIEWSTATE=~s/\+/\%2B/gi; $VIEWSTATE=~s/\=/\%3D/gi; $pos=index($content0,"EVENTVALIDATION")+13; $pos2=substr($content0,$pos,2000); $pos3=index($pos2,"value")+7; $pos4=index($pos2,"\/>"); $EVENTVALIDATION=substr($pos2,$pos3,$pos4-$pos3-2); $EVENTVALIDATION=~s/\//\%2F/gi; $EVENTVALIDATION=~s/\+/\%2B/gi; $EVENTVALIDATION=~s/\=/\%3D/gi; SECURITY CODES some codes are session based & must be parsed out to pass to subsequent pages

30 March 20, 2008Electronic Resources and Libraries step 4. create page parser (part 1) creating ProcessVendor.pl script include file with needed subroutines $col=$ARGV[0]; $vendor=“vendorname"; $VDBSuffix=“VENDOR"; $jumpin=" Site:"; $jumpout="Grand Total"; require ("../VDBProcs.pl"); anonymized ( for this presentation ) vendor name college name – when needed points to begin and stop processing file

31 March 20, 2008Electronic Resources and Libraries After processing, each table row is on one line with all carriage returns, linefeeds, and tabs removed. Blank lines and page feeds are not output, code outside jump* is ignored. Period, college name and other variables are passed from the database by the VDBProc.pl file. Validation is run on SQL log file to look for error messages and write to log. Entries are made for no data, change from previously retrieved period value or other potential problems. Step 4. create page parser (part 2) procedures called from common include file

32 March 20, 2008Electronic Resources and Libraries automated process

33 March 20, 2008Electronic Resources and Libraries handling retrieved data delete from VDBStatistics where vendor=‘VENDOR' and college='VALENCIA COMM COLLEGE' and datasource=‘SOME VENDOR DATABASE' and datatype='Sessions' and subdatatype='0' and period='200802' insert into VDBStatistics ( sourcefile, vendor, college, period, datatype, subdatatype, datasource, quantity ) values ('ProcessVENDOR.sql',‘VENDOR','VALENCIA COMM COLLEGE‘,'200802','Sessions','0',SOME VENDOR DATABASE','4348')

34 March 20, 2008Electronic Resources and Libraries handling retrieved data where/how we store what we retrieve where/how we store what we retrieve

35 March 20, 2008Electronic Resources and Libraries daily backup of database via windows scheduler * SQL Server Express does not support SQL Agent handling retrieved data

36 March 20, 2008Electronic Resources and Libraries tools

37 March 20, 2008Electronic Resources and Libraries software used retrieval of data – free retrieval of data – free Internet Explorer Internet Explorer Perl Perl LWP library (Library for the WWW for Perl) LWP library (Library for the WWW for Perl) ieHTTP Headers ieHTTP Headers ParseHTTPTrace.pl ParseHTTPTrace.pl SQLExpress and manager SQLExpress and manager Intranet Site (IIS,.asp, vbscript, java) Intranet Site (IIS,.asp, vbscript, java) reporting – some cost reporting – some cost EZView (low cost) EZView (low cost) Crystal Reports (had it) Crystal Reports (had it)

38 March 20, 2008Electronic Resources and Libraries structure environment environment each vendor has its own working directory each vendor has its own working directory each vendor has several files in this directory each vendor has several files in this directory batch file (called from SQL Server) batch file (called from SQL Server) Perl script (gets web page) Perl script (gets web page) Perl script (makes sql to load data) Perl script (makes sql to load data) log files (troubleshoot) log files (troubleshoot)

39 March 20, 2008Electronic Resources and Libraries activePerl 5.8.6 build 811 to download webpages activePerl 5.8.6 build 811 to download webpages run from command prompt in development and testing run from command prompt in development and testing ieHTTPHeaders - an add-on for IE that displays HTTP Headers ieHTTPHeaders - an add-on for IE that displays HTTP Headers  http://www.blunck.se/iehttpheaders/iehttpheaders.html http://www.blunck.se/iehttpheaders/iehttpheaders.html once trace file is captured with ieHTTPHeaders add-on, use ParseHTTPTrace.pl to create GetWebPage_VENDOR.pl file. once trace file is captured with ieHTTPHeaders add-on, use ParseHTTPTrace.pl to create GetWebPage_VENDOR.pl file. http://www.codeproject.com/KB/perl/webautomaton.aspx http://www.codeproject.com/KB/perl/webautomaton.aspx http://www.codeproject.com/KB/perl/webautomaton.aspx retrieval of vendor data

40 March 20, 2008Electronic Resources and Libraries automating retrieval and reporting of database usage statistics for a consortium problem problem what we were doing and why it doesn’t work what we were doing and why it doesn’t work solution solution the pieces, the parts and how they fit together the pieces, the parts and how they fit together future future what we’ve learned and our expectations what we’ve learned and our expectations

41 March 20, 2008Electronic Resources and Libraries what have we learned? large change in service requires staffing and support large change in service requires staffing and support project name should be closely related to the service project name should be closely related to the service administration understanding of needs administration understanding of needs assignment of priorities assignment of priorities proof-of-concept proof-of-concept need for ongoing support –vendor changes, local needs need for ongoing support –vendor changes, local needs moving from proof-of-concept is NOT trivial moving from proof-of-concept is NOT trivial data checking/revisions/data checking/revisions data checking/revisions/data checking/revisions handoff from development to maintenance handoff from development to maintenance

42 March 20, 2008Electronic Resources and Libraries expectations future use future use until SUSHI is widespread OR until SUSHI is widespread OR until data collection and reporting in ERM products is mature OR until data collection and reporting in ERM products is mature OR until existing automated systems have reasonable consortial pricing until existing automated systems have reasonable consortial pricing future plans future plans customer/college interface customer/college interface hope… hope…

43 March 20, 2008Electronic Resources and Libraries

44 March 20, 2008Electronic Resources and Libraries Thank you College Center for Library Automation 1753 W. Paul Dirac Drive Tallahassee, Florida 32310 Susan Campbell scampbell@cclaflorida.org Jim McGill jmcgill@cclaflorida.org


Download ppt "March 20, 2008Electronic Resources and Libraries College Center for Library Automation Tallahassee, FL Susan B. Campbell Susan."

Similar presentations


Ads by Google