First Indico Workshop Conversion Server Thomas Baron 29-27 May 2013 CERN.

Slides:



Advertisements
Similar presentations
Other Web Application Development Technologies. PHP.
Advertisements

using SharePoint Automation Services;
Enterprise Content Management Departmental Solutions Enterprisewide Document/Content Management at half the cost of competitive systems ImageSite is:
Overview Features & Functions 7/13/12. Foundations Need A Document Processing Solution That… Collects files automatically? Scans to Word? Scans to Desktop?
RE Adapter for Encompass (v1.0)‏ Encompass and The Raiser's Edge® Integrated Data Solution.
UNDERSTANDING JAVA APIS FOR MOBILE DEVICES v0.01.
Remote Unit Testing Brian Pruitt-Goddard Alex Riordan.
© 2010, Robert K. Moniot Chapter 1 Introduction to Computers and the Internet 1.
Alon Blich A.B.C.  Printer Languages (Escape Codes) ◦ PCL, PostScript, Canon etc.  ActiveX/OLE Automation Server  PDF Utilities ◦ PDFInclude, PDFlib.
Web Servers How do our requests for resources on the Internet get handled? Can they be located anywhere? Global?
The World Wide Web and the Internet Dr Jim Briggs 1WUCM1.
How Clients and Servers Work Together. Objectives Learn about the interaction of clients and servers Explore the features and functions of Web servers.
Introducing…. Business Problem Are you working as an individual, in a workgroup or with an enterprise having time restraints, limited resources and want.
Charlie Crocker Vice President Farallon Geographics, Inc. An Overview of Internet Mapping Technology.
Delivering Value Driven Document Management. The Business Case An unfulfilled need in the market for a powerful, comprehensive and value driven document.
INTRO TO MAKING A WEBSITE Mark Zhang.  HTML  CSS  Javascript  PHP  MySQL  …That’s a lot of stuff!
Virtual techdays INDIA │ august 2010 UNDERSTANDING OFFICE WEB APPS Vedant Kulshreshtha │ TSP – Collaboration Platform Microsoft India.
TOPIC 1 – SERVER SIDE APPLICATIONS IFS 234 – SERVER SIDE APPLICATION DEVELOPMENT.
INTRODUCTION TO WEB DATABASE PROGRAMMING
Computer Concepts 2014 Chapter 7 The Web and .
Submitted by: Madeeha Khalid Sana Nisar Ambreen Tabassum.
8 Chapter Eight Server-side Scripts. 8 Chapter Objectives Create dynamic Web pages that retrieve and display database data using Active Server Pages Process.
Architecture Of ASP.NET. What is ASP?  Server-side scripting technology.  Files containing HTML and scripting code.  Access via HTTP requests.  Scripting.
JavaScript, Fourth Edition Chapter 12 Updating Web Pages with AJAX.
2-6th June 2003 WP6 MAMA project WWW WP6 MAMA WWW 4 rd MAMA Meeting ROME, 3-6th June 2003 IFREMER coordinator Responsible
Overview Features & Functions 07/2013. A family of workflow automation products developed by Konica Minolta’s Solutions Engineering Center Easy-to-use.
Putting What We Learned Into Context – WSGI and Web Frameworks A290/A590, Fall /16/2014.
Nightly Releases and Testing Alexander Undrus Atlas SW week, May
Overview Features & Functions 07/2013. Foundations Need A Document Processing Solution That… Collects files automatically? Scans to Word? Scans to Desktop?
1 In the good old days... Years ago… the WWW was made up of (mostly) static documents. –Each URL corresponded to a single file stored on some hard disk.
THttpServer class Sergey Linev (GSI). Some history Development was inspired by JSRootIO why not achieve similar functionality with online ROOT application?
20-753: Fundamentals of Web Programming 1 Lecture 1: Introduction Fundamentals of Web Programming Lecture 1: Introduction.
Dynamic Content On Edge Cache Server (using Microsoft.NET) Name: Aparna Yeddula CS – 522 Semester Project Project URL: cs.uccs.edu/~ayeddula/project.html.
Creating Dynamic Web Pages Using PHP and MySQL CS 320.
Digital Filing A Simple Way to Digitally Centralize and Distribute Documents.
First Indico Workshop An Introduction to the Indico Software Thomas Baron May 2013 CERN.
Copyright © 2012 UNICOM Systems, Inc. Confidential Information z/Ware Product Overview illustro Systems International A Division of UNICOM Global.
Asynchronous Javascript And XML AJAX : an introduction UFCEUS-20-2 : Web Programming.
Delta Encoding for HTTP/1.1 Sloutsky Alexander Fink Dmitry Supervised by Lavy Libman.
INTERNET PROTOCOLS. Microsoft’s Internet Information Server Home Page Figure IT2031 UNIT-3.
Alfresco Daeja Integration Yong Qu Chief Solutions Architect
Web Design and Development. World Wide Web  World Wide Web (WWW or W3), collection of globally distributed text and multimedia documents and files 
GOSS iCM Forms Gary Ratcliffe. 2 Agenda Webinar Programme Form Groups Publish Multiple Visual Script Editor Scripted Actions Form Examples.
Quick Launch. Google Drive 30 GB Cloud Space Document.
SPI NIGHTLIES Alex Hodgkins. SPI nightlies  Build and test various software projects each night  Provide a nightlies summary page that displays all.
.NET Mobile Application Development XML Web Services.
Geant4 is a toolkit to simulate the passage of particles through matter, and is widely used in HEP, in medical physics and for space applications. Ongoing.
Module 1: Introduction to Microsoft SQL Server Reporting Services
Pavel Nevski DDM Workshop BNL, September 27, 2006 JOB DEFINITION as a part of Production.
Integrating and Extending Workflow 8 AA301 Carl Sykes Ed Heaney.
/16 Final Project Report By Facializer Team Final Project Report Eagle, Leo, Bessie, Five, Evan Dan, Kyle, Ben, Caleb.
VNew PDF Converter “A PDF Converter to convert PDF files to images Manually and Automatically“
Indico – CERN-UNOG meeting – 28 Feb CERN – IT 1 INDICO Event Management and Archival Thomas Baron CERN-UNOG Meeting 28 th February 2012.
Esri UC 2014 | Technical Workshop | Administering ArcGIS for Server with Python Jon Bodamer.
//liveVirtualacademy2011/ What’s New for ASP.NET 4.5 and Web Development in Visual Studio 11 Developer Preview Γιώργος Καπνιάς MVP, MCT, MCDP, MCDBA, MCTS,
Frank DiMauro Web App Specialist Information Services Division Case Study: Updating Plone and Replacing Apache.
Redmond Protocols Plugfest 2016 Jinghui Zhang Office Interoperability Test Tools (Test Suites and Open Source Projects) Software Engineer Microsoft Corporation.
XNAT 1.7: Getting Started 6 June, Introduction In this presentation we’ll discuss:  Features and functions in XNAT 1.7  Requirements  Installing.
1 Chapter 1 INTRODUCTION TO WEB. 2 Objectives In this chapter, you will: Become familiar with the architecture of the World Wide Web Learn about communication.
z/Ware 2.0 Technical Overview
Extended DMS PUG 2017 April 2nd - 4th.
Modernising CERN Document Conversion service
IT Partners Conference Oliver Thomas 19 April 2005
Ambir nSign Presentation Synopsis
Extended Document Management System (EDMS)
Exploring the Power of EPDM Tasks - Working with and Developing Tasks in EPDM By: Marc Young XLM Solutions
Part of Chapter 1 Key Concepts Networks
Online Translation Service Capstone Design
GIL Users Group Meeting
Click2Export Export & Dynamics 365/CRM Reports/Word/Excel Templates in 1 Click
Presentation transcript:

First Indico Workshop Conversion Server Thomas Baron May 2013 CERN

Service Description Architecture Conversion Alternatives Future Directions

Service description

Goal Provide a PDF version of all textual documents uploaded to Indico Long-term preservation Multi-platform reading Converted formats:.ppt,.pptx,.doc,.docx,.sxi,.odp

Interface On user request only

Service parameters Asynchronous About 30 seconds in average At CERN: An average of 165 conversions per day

architecture

General overview Conversion server Indico PDFs files to convert HTTP API

Integration to indico Currently all entangled to Indico’s core Configuration in indico.conf conversion server URL: FileConverter[‘conversion_server’] callback URL: FileConverter[‘response_url’] Conversion handled by the Makac.export.fileConverter class convert function : sends the file storeConvertedFile function: gets the converted file back Conversion server Indico HTTP API Makac.export.fileConverter convert storeConvertedFile Indico.conf PDFs files to convert

Conversion server side A dedicated server running non-indico code and software Web server: IIS (previously Apache) Listener script: getSegFile.py ; python; receives the file, saves it locally, creates the conversion task (a text file) Conversion Daemon: Engine.py ; python script in scheduled tasks; parses the conversion task files, and for each of them launch the conversion, wait for its completion and send the file back to the callback URL (to Indico) Conversion software: performs the conversion Conversion server Indico HTTP API Makac.export.fileConverter convert storeConvertedFile Configuration.py Conversion software getSegFile.py Engine.py www task mas ter PDFs files to convert

Conversion alternatives

Fully home made Was the case at CERN until 2009 Using direct OLE-automation of Microsoft Office applications python scripts Example

Using commercial products Example at CERN: Neevia Document Converter Pro OLE automation Pros: More reliable Better error management Regular updates More extensible More formats Hot folders Can be used for other services Features: supports 300 file types Com, hot folder, interfaces Watermark, stamping etc. Convert to PDF, PostScript, TIFF (including Class F), BMP, PNG, PCX, JPEG

NEEVIA doc Converter Pro Simplified automation code in python NDocConverter = win32com.client.Dispatch("docConverter.docConverterClass") NDocConverter.DocumentOutputFormat = "PDF" NDocConverter.DocumentOutputFolder = output_dir NDocConverter.JobOption = "printer" rv = NDocConverter.SubmitFile( file_path, "") rv = NDocConverter.CheckStatus( file_path, "") NDocConverter = win32com.client.Dispatch("docConverter.docConverterClass") NDocConverter.DocumentOutputFormat = "PDF" NDocConverter.DocumentOutputFolder = output_dir NDocConverter.JobOption = "printer" rv = NDocConverter.SubmitFile( file_path, "") rv = NDocConverter.CheckStatus( file_path, "")

Future directions

What should be coming Unfortunately the feature is not directly usable by external instances Rewrite the conversion server-side code Very old! Improve the conversion server monitoring at CERN Not planned yet Replace current implementation with a plugin on the Indico side v1.5 (2015?)

Thomas Baron