How Zones Served Queen Victoria Peter Tribble ProQuest.

Slides:



Advertisements
Similar presentations
B. Ramamurthy 4/17/ Overview of EC2 Components (fig. 2.1) 10..* /17/20152.
Advertisements

Kentico CMS 5.5 R2 What’s New. Highlights Intranet Solution Document management package – WebDAV support – Project & task management – Document libraries.
Calendar Browser is a groupware used for booking all kinds of resources within an organization. Calendar Browser is installed on a file server and in a.
14.1 © 2004 Pearson Education, Inc. Exam Planning, Implementing, and Maintaining a Microsoft Windows Server 2003 Active Directory Infrastructure.
Keith Burns Microsoft UK Mission Critical Database.
1 of 6 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
Fall 2011 Nassau Community College ITE153 – Operating Systems Session 24 NTFS Permissions and Sharing Printers 1.
Week:#14 Windows Recovery
Guide to Operating System Security Chapter 2 Viruses, Worms, and Malicious Software.
Microsoft Load Balancing and Clustering. Outline Introduction Load balancing Clustering.
NovaBACKUP 10 xSP Technical Training By: Nathan Fouarge
Android 4: Creating Contents Kirk Scott 1. Outline 4.1 Planning Contents 4.2 GIMP and Free Sound Recorder 4.3 Using FlashCardMaker to Create an XML File.
Linux Operations and Administration
Configuring File Services Lesson 6. Skills Matrix Technology SkillObjective DomainObjective # Configuring a File ServerConfigure a file server4.1 Using.
Motivation. Part of Deutsche Telekom project:
Eucalyptus Virtual Machines Running Maven, Tomcat, and Mysql.
DNN Performance & Scalability Planning, Evaluating & Improving : Part 2.
Module 13: Configuring Availability of Network Resources and Content.
Business Continuity and Disaster Recovery Chapter 8 Part 2 Pages 914 to 945.
Hands-On Virtual Computing
5 Chapter Five Web Servers. 5 Chapter Objectives Learn about the Microsoft Personal Web Server Software Learn how to improve Web site performance Learn.
Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.
A Web Crawler Design for Data Mining
Integrating with UCSF’s Shibboleth system
Maintaining File Services. Shadow Copies of Shared Folders Automatically retains copies of files on a server from specific points in time Prevents administrators.
SUSE Linux Enterprise Desktop Administration Chapter 12 Administer Printing.
Cloud Computing & Amazon Web Services – EC2 Arpita Patel Software Engineer.
Guide to Linux Installation and Administration, 2e1 Chapter 2 Planning Your System.
Module 11: Implementing ISA Server 2004 Enterprise Edition.
Team 708 – Hardwired Fusion Created by Nam Tran 2014.
1 Week #10Business Continuity Backing Up Data Configuring Shadow Copies Providing Server and Service Availability.
Module 3 Planning and Deploying Mailbox Services.
1 MONGODB: CH ADMIN CSSE 533 Week 4, Spring, 2015.
Lecture 4 Page 1 CS 111 Online Modularity and Virtualization CS 111 On-Line MS Program Operating Systems Peter Reiher.
Apache Web Server Architecture Chaitanya Kulkarni MSCS rd April /23/20081Apache Web Server Architecture.
Revision Unit 1 – The Online World Online Services Online Documents Online Communication Cloud Computing The Internet Internet Infrastructure Internet.
Virtual Machines Module 2. Objectives Define virtual machine Define common terminology Identify advantages and disadvantages Determine what software is.
Introduction of Wget. Wget Wget is a package for retrieving files using HTTP and FTP, the most widely-used Internet protocols. Wget is non-interactive,
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
INTRODUCTION TO AMAZON WEB SERVICES (EC2). AMAZON WEB SERVICES  Services  Storage (Glacier, S3)  Compute (Elastic Compute Cloud, EC2)  Databases (Redshift,
Sun Tech Talk 3 Solaris 10 and OpenSolaris Pierre de Filippis Sun Campus Evangelist
1 Remote Installation Service Windows 2003 Server Prof. Abdul Hameed.
Solaris containers (Zones) Server virtualization What zones are and how they are used in ECE/CIS at the University of Delaware Ben Miller.
GENERAL SCALABILITY CONSIDERATIONS
 1- Definition  2- Helpdesk  3- Asset management  4- Analytics  5- Tools.
April 1st, 2009 Cobbler Provisioning Made Easy Jasper Capel.
Migration and Preparation
Calgary Oracle User Group
Seven Problems of Linux Containers
SQL Database Management
DISCOVERING COMPUTERS 2018 Digital Technology, Data, and Devices
Computers: Tools for an Information Age
Configuring File Services
Scalable Web Apps Target this solution to brand leaders responsible for customer engagement and roll-out of global marketing campaigns. Implement scenarios.
Dag Toppe Larsen UiB/CERN CERN,
Pinterest Clone Features
Deploy, Manage, and Scale Your Apps with OpsWorks, Elastic Beanstalk, and CodeDeploy Part 1 – Elastic Beanstalk © 2017 Amazon Web Services, Inc. and.
Dag Toppe Larsen UiB/CERN CERN,
Outline What does the OS protect? Authentication for operating systems
Platform as a Service.
Introduction to the Junos Operating System
Scalable Web Apps Target this solution to brand leaders responsible for customer engagement and roll-out of global marketing campaigns. Implement scenarios.
Outline What does the OS protect? Authentication for operating systems
Utilization of Azure CDN for the large file distribution
2018 Real Dell EMC E Exam Questions Killtest
Managing Clouds with VMM
Data Security for Microsoft Azure
Outline Virtualization Cloud Computing Microsoft Azure Platform
Tivoli Common Reporting v1.2 Overview
Ed Ferrara, MSIA, CISSP MIS 5208 Processing and Analyzing Data Ed Ferrara, MSIA, CISSP
Presentation transcript:

How Zones Served Queen Victoria Peter Tribble ProQuest

The speaker... ● Full time sysadmin – Solaris, Linux, other unices... – Jack of all trades – Application focus ● Part time developer/enthusiast – OpenSolaris – Governing Board Member

ProQuest ● Online Publisher, US parent ● I deal with mostly literature and historical data – Early English Books, Early European Books – Vogue – Literature Online, Film, media, video – Periodicals ● Digital scans, indexing, strong search ● Customers – academic, universities, research – Not individuals or the public

One day ● “We've got this project...” ● Short term, high traffic ● Very public, on national media ● High profile, fantastic PR ● Potential for egg on face ● Can we build it?

Queen Victoria's Journals ● Princess Victoria started a journal in 1832 ● Carried on as Queen until her death in 1901 ● 141 volumes, pages ● Digitally scanned

Initial Evaluation ● Let's assume it's on the BBC News – And national newspapers ● And 1 million page views per day ● Given existing similar sites ● System infrastructure will be easy ● Network load might be interesting – Maybe 300M/s

Handling the data ● Scanned/photographed ● Stored on ZFS ● ImageMagick ● Zoomify ● Replicated image servers ● Reprocess until customer happy ● Keyed data for search

Handwriting ● Handwritten texts only exist from the time before Victoria became queen ● Only 13 small purple and marbled volumes survive

Standard Architecture ● Use Solaris with Zones for everything ● Sparse root, very lightweight – Guarantees every system is identical ● Application lives in one place – /opt/proquest ● Private application stack – Independence from vendor, patches ● Replicate horizontally – Just copy the one directory (+extras)

Application layout ● /opt/proquest – Same on every zone ● /opt/proquest/{apache,java,tomcat} – Read-only (root -owned) ● /opt/proquest/qvj/{apache,tomcat} – Configuration and logs only ● Virtualization at every level – System, application, application instance

Princess Beatrice Transcripts ● Manually transcribed after Victoria's death ● On her instructions ● Sanitised and edited ● Originals destroyed ● Sketches and doodles cut and pasted ● Finished in 1940

Build a zone ● zonecfg -z qvj -f zconfig.qvj create set zonepath=/storage/zones/qvj set autoboot=true add net set physical=igb0 set address= /24 end add fs set dir=/opt/sfw set special=/opt/sfw set type=lofs set options=ro end add fs set dir=/opt/proquest set special=/storage/qvj/opt/proquest set type=lofs end verify commit

What we do NOT do ● Manage the zone – No packaging, no storage delegation – It's all about the application ● Patch zones – Migrate to newly patched host, then patch server ● Migrate zones – Save app, destroy zone, build new zone, restore

Lord Esher transcripts ● First Keeper of the Royal Archives ● Typewritten ● Unknown to Beatrice ● Unabridged ●

Production Infrastructure ● Load balancer handles individual failures ● N web zones (2, 4, 8) apache/tomcat ● M search zones (Solr in this case) ● Separate url and zones for images – Offsite at Everycity during launch ● Our normal site has MySQL – this doesn't – Authentication (free to UK) – History and personalization (remove)

System Capacity ● Most of our sites use tomcat, so each zone needs approx 2G of RAM ● OS overhead is essentially zero ● Typically low traffic, so memory is the constraint ● Even on busy sites, the application is a bottleneck ● So we basically size for RAM, and add zones until (just before) we start to swap ● Actually need to add lots of swap

The Artist ● Queen Victoria was a keen artist ● Scanned many of her sketches ● Also many illustrations in the journals

Testing, testing... ● Lots of new technologies (in our context) – Solr, no mysql, GeoIP – And go to current versions in the stack ● Need to work on each of those – Easy – create a zone – Actually, lots of zones... ● Use our standards to create lots of clones – Clone the app, not the zone

Performance ● Normally, we are low usage (but high value) ● Critical bottleneck – MySQL – was obvious – Removed very early ● Complete environment cloned using zones – And a whole bunch of load generators ● Actual testing almost trivial – Plain wget + ab – Components + whole stack

Fixes ● 1 page was horrifically expensive – >2s to render afresh with each request – Generate once, save, serve that ● HTML is extremely verbose – mod_gzip made a huge difference ● Design images unoptimized – 5x improvement possible ● Timeline sent twice for Mobile – Detect before sending the data

Other tricks - ZFS ● Each zone has its own zfs file system – Just 1, so managed as a single unit – Automatic snapshots (daily) – Compression enabled (great for logs) ● Corrupted zone? Roll back – Shut down zone – zfs rollback – Boot zone

Other Tricks - SMF ● All services managed using SMF – Trivial start/stop – Clean application environment – Dramatic reduction of trivial call-outs ● Developers/app owners have privilege – Just to restart, not to modify boot state ● Assign net_privaddr if appropriate – All applications run on the “official” port ● SMF still has rough edges

Thank You