Toro 1 EMu Hacking at the Peabody Museum. Yale campus.

Slides:



Advertisements
Similar presentations
Intel® RPIER 3.1 User Training Joe Schwendt Steve Mancini 7/31/2006.
Advertisements

ImageNow at LaSalle University Julie Riganati
Accessing electronic journals from off- campus This causes lots of headaches, but dont despair, heres how to do it! (Please note – this presentation is.
Create a Simple Game in Scratch
Meet Manager Training Updated April Step 1: Set up your meet Download the meet template Restore the template Purge Old Data Setup Meet Information,
Toro 1 EMu on a Diet Yale campus Peabody Collections Counts & Functional Cataloguing Unit Anthropology 325,000Lot Botany 350,000Individual Entomology1,000,000Lot.
XIr2 Recommended Performance Tuning Andy Erthal BI Practice Manager.
Peabody Museum of Natural History
MvCIS - Forbes Hawkins – Copyright © 2004 Museum Victoria Forbes Hawkins Collection Systems Developer Museum Victoria - Melbourne, Australia Museum Victoria.
Create a Simple Game in Scratch
Welcome to the Award Winning Easiest to Use & Most Advanced View, Manage, and Control Security, Access Control, Video, Energy & Lighting Systems, & Critical.
1 of : Multi-Currency Payments / DA0813 Last updated: Project Walkthrough: Multi-Currency Payments Multi-Currency Payments.
SQL Migration Daniel Abbott Director of Admin Info Systems Contra Costa Comm College Dist Randy Wallace Senior Technical Architect.
I / O: Care & Feeding of Your EMu Larry Gall Computer Systems Office Peabody Museum of Natural History Yale University.
The Caught and Coloured website: its EMu origins Alex Chubaty – Collection Information Systems Craig Churchill – IT Software Development Museum Victoria.
Best Practices for Managing & Motivating the Digitizers Larry Gall Computer Systems Office Yale Peabody Museum of Natural History.
Copyright © 2003 Americas’ SAP Users’ Group Custom Archiving 101 Session Code 108 Karin Tillotson Sr. Basis Administrator Tuesday, May 20 th, 2003.
Office Timesheets Tool Or go to and click on the Office Timesheets link in the footer.
Analysis and Performance Information Systems 337 Prof. Harry Plantinga.
EMu and Fotoware: Integrating the EMu Collections Management Program with Image Management Software - Dr. Lance Wilkie, EMu Unit, Australian Museum.
+ Photoshop – Monitor and Printer Calibration! Drycreekphoto.com.
batch Reports, Statements, Letters and more … eXPLODE and your documents – Put important documents in the hands of Clients faster to improve.
The Paperless Classroom with Google Docs Eric Curts - North Canton City Schools ericcurts.com - - gplus.to/ericcurts.
Global Update with Confidence Mary M. Strouse Innovative Users Group May 19, 2009.
Chapter 10 : Designing a SQL Server 2005 Solution for High Availability MCITP Administrator: Microsoft SQL Server 2005 Database Server Infrastructure Design.
Information guide.
FireRMS SQL Audit, Archiving & Purging Presented by Laura Small FireRMS Quality Assurance.
Copyright 2003 Accenture. All rights reserved. Accenture, its logo, and Accenture Innovation Delivered are trademarks of Accenture. Data Migration in Oracle.
Moving into the Testing Phase Revised for October 22, 2008.
Black Box Larry Gall -- Peabody Museum of Natural History.
Automating Tasks with Visual Basic. Introduction  When can’t find a readymade macro action that does the job you want, you can use Visual Basic code.
1 Team Leader TKS Job Aid. 2 Viewing the On-line Presentation If you are viewing this presentation via Internet Explorer for best results resize the “Notes”
Thank you for your decision to try vTreePro Practice Tree Software! Your satisfaction is very important to us. Please feel free to send us an if.
Dam It ! Larry Gall Peabody Museum of Natural History Yale University.
DTS Conversion to SSIS Conversion Best Practices Mike Davis
Win202 Database Administration. Introduction Welcome to OpenEdge. Type 2 Storage Areas. One of the big selling points for the OpenEdge platform and Win202.
Sage ACT! 2013 SDK Update Brian P. Mowka March 23, 2012 Template date: October 2010.
QuikTrac 5.5, a validated Motorola Software Solution, allows you to take your Host ERP screens and extend them out to fixed or mobile devices including.
FIX Eye FIX Eye Getting started: The guide EPAM Systems B2BITS.
Types of Processing of Data www. ICT-Teacher.com.
Introduction to KE EMu
Collections Management Museums EMu Searching EMu Searching Explained (What’s going on under the hood!) Bernard Marshall Chief Technical Officer KE Software.
IT1001 – Personal Computer Hardware & system Operations Week7- Introduction to backup & restore tools Introduction to user account with access rights.
Data Migration Training Page 1 KE EMu Data Migration
Introduction to KE EMu Unit objectives: Introduction to Windows Use the keyboard and mouse Use the desktop Open, move and resize a.
Introduction to KE EMu Unit objectives: Introduction to Windows Use the keyboard and mouse Use the desktop Open, move and resize a.
When the program is first started a wizard will start to setup your Lemming App. Enter your company name and owner in the fields designated “Company Name”
Or how to make 11=1 Dr Penny Berents Australian Museum EMu Project.
How to kill SQL Server Performance Håkan Winther.
HTBN Batches These slides are intended as a starting point for further discussion of how eTime might be extended to allow easier processing of HTBN data.
Outlook / Exchange Training. Outlook / Exchange: Agenda What Can Microsoft Exchange Do / How works at UST? and Inbox Mailbox Quota Archiving.
Principles of Demonstrative Instructional Video Peyton R. Glore Assistant Professor School of Information Technology Macon State College October 17, 2007.
You Inherited a Database Now What? What you should immediately check and start monitoring for. Tim Radney, Senior DBA for a top 40 US Bank President of.
Start-SPPowerShell – Introduction to PowerShell for SharePoint Admins and Developers Paul BAker.
Application Maintenance Toolset (AMT) Applying Patches (CTPs)
Exploring Computer Science - Lesson 3-4
You Inherited a Database Now What?
Express Checkout January 2013.
Exploring Computer Science - Lesson 3-4
CONTENT MANAGEMENT SYSTEM CSIR-NISCAIR, New Delhi
Adding animations. To add an animations to text or picture you have to click on the text or animation and on the top navigation bar click on animation.
Informatica PowerCenter Performance Tuning Tips
Overview of VAdata Virginia’s Sexual and Domestic Violence Data Collection System.
Bulk update E. Camelback Road #559, Phoenix, AZ Phone: Fax:
Exploring Computer Science - Lesson 3-4
Multi-Factor Authentication (MFA)
Turbo-Charged Transaction Logs
C/S Windows Overview Nigel Pilsbury.
You Inherited a Database Now What?
Presentation transcript:

Toro 1 EMu Hacking at the Peabody Museum

Yale campus

Peabody Collections Counts & Functional Cataloguing Unit Anthropology 325,000Lot Botany 350,000Individual Entomology1,000,000Individual Invertebrate Paleontology 300,000Lot Invertebrate Zoology 300,000Lot Mineralogy 35,000Individual Paleobotany 150,000Individual Scientific Instruments 2,000Individual Vertebrate Paleontology 125,000Individual Vertebrate Zoology 185,000Lot / Individual 2.7 million database-able units => ~11 million items

Peabody Collections Functional Units Databased Anthropology 325, % Botany 350,000 1 % Entomology1,000,000 1 % Invertebrate Paleontology 300, % Invertebrate Zoology 300, % Mineralogy 35, % Paleobotany 150, % Scientific Instruments 2, % Vertebrate Paleontology 125, % Vertebrate Zoology 185, % 940,000 of 2.7 million => 37 % overall

EMu migration in '05 (all disciplines went live simultaneously) Physical move in '00-'02 (primarily neontological disciplines) Big events

The four YPM buildings Peabody (YPM) Environmental Science Center (ESC) Geology / Geophysics (KGL) 175 Whitney (Anthropology)

VZ Kristof Zyskowski (Vert. Zool. - ESC) Greg Watkins-Colwell (Vert. Zool. - ESC)

HSI Shae Trewin (Scientific Instruments – KGL )

VP Mary Ann Turner (Vert. Paleo. – KGL / YPM)

ANT Maureen DaRos (Anthro. - YPM / 175 Whitney)

EMu Hacking at Peabody Hacking – in a laudatory programming sense, not a criminal sense

Mitnick Often we tend to think of “hackers” in this mode

Mitnick modified cracker A better moniker

Mitnick modified w/EMu cracker Crackers often have unnamed accomplices…

3 Vignettes of YPM EMu “hacks” An issue of functionality (background script) An issue of performance (tweaking the catalogue) An issue of user behavior & cost (another script…)

Hack Vignette #1 Multimedia module - JPEG 2000 support

- non-proprietary compression standard - lossless mode (much smaller files) - lossy mode (vastly smaller files) - potential space/bandwidth savings

JP2 spicebush with J2K and tail target

JP2 spicebush tails with file sizes 1.54 mB (native TIFF) 15 kB (heavily squeezed JP2)

HERBIS images 261 kb – <1%1,302 kb – 2% 5,166 kb – 12%62,640 kb – 100%

JP2 – no thumbnail In EMu, oops… no thumbnail

JP2 – script coding find imagedir –name *.jp2 –mtime -2 –print loop on the matches and test to see which recently loaded JP2 files are missing a thumbnail JPG, or which JP2 files have been modified more recently than their existing thumbnail JPG ; then build filenames for any qualifying target JPGs ; execute script several times per hour from cron jasper –f match –F tempfile convert tempfile –resize 90x90 target

JP2 – prior, without script wakes up every 20 minutes…

JP2 – now, with makes the thumbnail…

JP2 – Tiled View JP2 files now behave just like all other standard multimedia

JP2 – Photoshop opens Double click and the Photoshop handler kicks in

JP2 – V1 V. 1 – simply generated thumbnails in the background

JP2 – V2 V. 2 – also inserted suitable metadata into records via texload (next version, script to be called directly in validation code at file time)

Hack Vignette #1 Moral #1 = EMu is extensible, you may be able to implement significant changes yourself in whole or in part, without delay

Catalogue module - performance issues Hack Vignette #2

Default EMu “cron” job configuration late night workday evening = emulutsrebuild = emumaintenance batch = emumaintenance compact Mo Tu We Th Fr Sa Su Orange is time EMu busy running background jobs. Interfering with workday work, and leaving Sunday processing time idle/unused.

The ecatalogue database is a rate limiter File NameFunction ~/emu/data/ecatalogue/datathe actual data ~/emu/data/ecatalogue/recindexing (part) ~/emu/data/ecatalogue/segindexing (part) At YPM, the combined size of these was >10 gB, with 4 gB in data and 3 gB in both rec and seg

Touch many types of records in EMu… e.g., Party record add middle name e.g., Bibliography recordadd author e.g., Collecting Events recordadd collector …automatic changes subsequently propogate to numerous records in the ecatalogue database …ecatalogue can grow a lot and slow EMu to varying degrees between maintenance runs

How to make ecatalogue go faster ?

maybe save 20+% ? Make it smaller - trim nulls from Legacy Data ?

Repetitive scripting of texexport & texload jobs Conducting around a million re-imports of records Manual adjustment of nightly cron jobs to accommodate Do the work at nighttime over a month-long period Watched ecatalogue closely to keep from exploding disk Make it smaller - trim nulls from Legacy Data ?

data rec seg Starting situation at YPM for ecatalogue (gB on y axis)

data rec seg delete nulls from AdmOriginalData

sites – round 2 constant data lengthy prefixes … not satisfied with just that… here are some other things to possibly trim!

data rec seg delete nulls from AdmOriginalData shorten prefix on AdmOriginalData selectively delete AdmOriginalData >55 % !

catalogue – round 2 data rec seg What ecatalogue AdmOriginalData looks like post scripting

Default EMu “cron” job configuration late night workday evening = emulutsrebuild = emumaintenance batch = emumaintenance compact Mo Tu We Th Fr Sa Su BEFORE

Modified EMu “cron” job configuration Mo Tu We Th Fr Sa Su late night workday evening = emulutsrebuild = emumaintenance batch = emumaintenance compact *** AFTER Can now squeeze all maintenance into wee hours of night, use Sunday, and fully compact ecatalogue every other day (asterisks)!

Quick backup Also, all of YPM EMu can now be squeezed onto a thumbdrive

Hack Vignette #2 Moral #2 = know your data, you can put aspects of EMu on a diet and your computer system is likely to thank you

Hack Vignette #3 EMu sessions - licensing and user behavior

Dreaded WARNING! 2 KE EMu user(s) are currently being denied access because all 10 of your KE EMu licenses are in use. For license upgrades, please contact Dreaded for sysadmins

Museum Director: "Go license shopping at KE!" Systems Admin: "VISA or MasterCard?" The conversation you dream of but of course never have…

What do you need ? Guaranteed license seat for every potential user ? Cover maximal number of expected concurrent users ? Minimize expenses by minimizing license seats ?

Jess & Lourdes fight (2) My turn to log in ! %}&$ Dream on, loser ! 3rd option is dangerous… if you have this you probably have too few licenses

Even with a moderate number of licenses… … inactive EMu sessions can and will accumulate

Critical research VARIANT 1: critical research needed, EMu session put on hold

VARIANT 2: both people and computers crash… Life intervenes Mon cherie IRN View >Attachments

…enter the EMu Grim Reaper Script seeks out inactive EMu sessions

reaper – script coding texlicstatus ps -ef -Grim Reaper wakes up frequently throughout the day -keeps a running table of statistics about each texserver -compares each texserver against a countdown timer -adjusts timer based on activity since last wake up -if some new activity, resets the countdown timer -if no activity, increments the countdown timer -if countdown timer max is reached, kill the texserver kill –9 texserver_process_id

Tuning the Emu Grim Reaper Script Change time between wakeup checks Change number of wakeup check intervals Tell reaper to ignore certain users Amend reaper behavior by time of day Alter how much inactivity is considered bad 32 regular YPM users, 13 runtime licenses

New sessions started per hour, Real data prior two weeks in October 2006

Cumulative new sessions started, Real data prior two weeks in October 2006

Active sessions, : three slow days Real data prior two weeks in October 2006

Active sessions, : three fast days Real data prior two weeks in October 2006

Cope on phone It’s telling me, “Licenses Exceeded?!” No more worrries

Hack Vignette #3 MORAL = find a licensing balance, but also consider training your users and EMu system

Happy Scripting, Happy Campers