Cyberthèses and Cyberdocs The EADI Information Management Working Group's 27th annual meeting – Dublin – September 11th and 12th Martin Sévigny – AJLSM.

Slides:



Advertisements
Similar presentations
Aquitaine Patrimoines & Cyberdocs 4th Open Archives Forum Workshop September 5, 2003 Rasik Pandey – AJLSM
Advertisements

Lund University Libraries, Sweden Jessica Lindholm, Lund University Libraries, Sweden E-theses Breakout Session.
28 March 2003e-MapScholar: content management system The e-MapScholar Content Management System (CMS) David Medyckyj-Scott Project Director.
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
Standards and Increasing Maintainability on Web- based Systems James Eaton SE4112/16/2006.
IAEA International Atomic Energy Agency United Nations Library and Information Network for Knowledge Sharing (UN-LINKS) September 2013, Geneva.
Harvesting Metadata for Use by the geodata.gov Portal Doug Nebert FGDC Secretariat Geospatial One-Stop Team.
LoboVault UNM's Searchable Online Archive New Mexico Data Users Conference Albuquerque, NM Larry Compton BBER Data Bank University of New Mexico.
FAO and UNESCO-IOC/IODE Combine Efforts in their Support of Open Access Written by Marc Goovaerts, U. Hasselt, BE.
IAEA International Atomic Energy Agency ICSTI 2013 Annual Members’ Meeting March 2013.
1 Uppsala University Library Eva Müller Peter Hansson Stefan Andersson Uwe Klosa Electronic Publishing Centre Krister Östlund Waller project.
ARCHIMÈDE Presented by Guy Teasdale Directeur, Services soutien et développement Bibliothèque de l’Université Laval CARL Workshop on Institutional Repositories.
Bibliothèque de l’Université LavalFaculté des études supérieures Guy Teasdale Access 2003 Vancouver - October 4, 2003.
1 Archiving Workflow between a Local Repository and the National Library Archive Experiences from the DiVA Project Eva Müller, Peter Hansson, Uwe Klosa,
TC3 Meeting in Montreal (Montreal/Secretariat)6 page 1 of 10 Structure and purpose of IEC ISO - IEC Specifications for Document Management.
Formation of ETD‘s and releated issues 6th ETD Conference May 20 – , Berlin Dr. Nikola Korb, Co-ordination Agency DissOnline Deutsche Bibliothek.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
Introduction to Implementing an Institutional Repository Delivered to Technical Services Staff Dr. John Archer Library University of Regina September 21,
AgriDrupal - a “suite of solutions” for agricultural information management and dissemination, built on the Drupal CMS; - the community of practice around.
Africa Information Highway and SDMX implementation in Africa Beejaye Kokil Economic & Social Statistics Division African Development Bank
Content Management at Grainger Engineering Library Case studies from various digital library research projects Tom Habing
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
8th International Symposion on Electronic Theses and Dissertations, ETD2005, Sydney SCOPE An XML Based Publishing Platform Uwe Müller, Manuel Klatt Humboldt-Universität.
Dr. Kurt Fendt, Comparative Media Studies, MIT MetaMedia An Open Platform for Media Annotation and Sharing Workshop "Online Archives:
Global & Regional Initiatives on Information Management Eero Mikkola(IUFRO) Joris Siermann (CIFOR) Global Forest.
Addressing Metadata in the MPEG-21 and PDF-A ISO Standards NISO Workshop: Metadata on the Cutting Edge May 2004 William G. LeFurgy U.S. Library of Congress.
Geoff Payne ARROW Project Manager 1 April Genesis Monash University information management perspective Desire to integrate initiatives such as electronic.
Open Textbooks and Electronic Publishing Formats/Standards Arctic Virtual Learnng Tools
Collaborative Approach to Open Access: Experience from Bioline International Leslie Chan Associate Director Bioline International University of Toronto.
1. 2 introductions Nicholas Fischio Development Manager Kelvin Smith Library of Case Western Reserve University Benjamin Bykowski Tech Lead and Senior.
ETD 2001
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
IUScholarWorks is a set of services to make the work of IU scholars freely available. Allows IU departments, institutes, centers and research units to.
Publisher’s Perspective: Digitization of print resources, and archiving of digital resources Judy Best, June 13, 2006.
Ms. Irene Onyancha ISTD/Library & Information Management Services United Nations Economic Commission for Africa The Second Session of the Committee on.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
The DiVA System: Current Status and Ongoing Development Uwe Klosa Electronic Publishing Centre, Uppsala University, Sweden Eva Müller.
TEI and Scholarly publishing Laurent Romary INRIA & HUB-ISDL TEI council, chair.
Digital Archiving in the Hungarian Széchényi Library The story and the plans of the Hungarian Electronic Library Rome, 21. Oct István Moldován OSZK,
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Digital Commons & Open Access Repositories Johanna Bristow, Strategic Marketing Manager APBSLG Libraries: September 2006.
Planning for Life after OCLC Passport for Cataloging An overview of the new OCLC cataloging service Revised April 2002.
The Development of Open Access to ETDs in Canada: a Partnership between Canadian Universities and Library and Archives Canada 8 th International Symposium.
Practical Experiences With the Adoption of XML in Commercial Publishing Richard Kidd Neil Hunter
IR Applications at University of Saskatchewan Library: present and future CARL Institutional Repository Luncheon Saskatoon, SK June 8, 2005 David Fox Head,
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Economists Online researchers and libraries collaborate. A subject-specific service model. Benoit Pauwels Université Libre de Bruxelles.
Digital Library The networked collections of digital text, documents, images, sounds, scientific data, and software that are the core of today’s Internet.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Archiving microdata Standards and good practices United Nations Statistics Commission New York, February 26, 2009 Olivier Dupriez World Bank, Development.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Greater Visibility, Greater Access QSpace QSpace Queen’s University Research & Learning Repository.
Open Repository Claire Bundy OAI6 Geneva Overview BioMed Central: who we are About Open Repository Is Open Repository right for you? Questions and.
1 Technical Update Kathi Fletcher, Cameron Cooper, Ross Reedstrom (and the Connexions staff) Objectives: 1.Community College Consortium 2.University Press.
CERN Document Server 19 tth January 2006 CERN Document Server Jean-Yves Le Meur 19 th January 2006.
CONTENTdm A proven solution September A complete digital collection management software solution Stores, manages and provides access for all digital.
GNU EPrints 2 Overview Christopher Gutteridge 19 th October 2002 CERN. Geneva, Switzerland.
Beyond HTML: Extensible Markup Language (XML)
GeoNetwork OpenSource: Geographic data sharing for everyone
Project 1 Introduction to HTML.
A look at the digital initiatives of Laval University Library
Building A Repository for Digital Objects
Markup of Educational Content
VI-SEEM Data Repository
Introduction to Implementing an Institutional Repository
Presentation transcript:

Cyberthèses and Cyberdocs The EADI Information Management Working Group's 27th annual meeting – Dublin – September 11th and 12th Martin Sévigny – AJLSM Publishing structured electronic documents

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Overview Cyberthèses: an ETD project Cyberdocs: tools for publishing structured electronicu documents Development process and philosophy

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Historical backgrounds Cyber?  “Cyberthèses” is now a six year old project Publishing electronic theses and dissertations using a standard-based and structured approach  “Cyberdocs” is a brand new open source publishing platform In 2002/2003, the Cyberthèses project processing tools have undergone a major upgrade  Completely open source  New dynamic publishing module  Not only theses, but any word processor document...  … Cyberdocs!

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Historical backgrounds Origin: information processing platform for scholarly publishing, Presses de l’Université de Montréal, 1997 Soon after  Université Lumière Lyon 2 joins the project, with other institutions  Important financing and implication from the Agence internationale de la francophonie (Fonds francophone des inforoutes) Since then  Continuing development of the project and the tools  Strong implication of Lyon 2 and AIF  Disseminiation towards other countries

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Cyberthèses publishing model Processing and document model (first version of the tools)  From word processor documents to logically structured documents in ISO-8879 (SGML), using TEILite DTD Theses are important scientific documents, their use (archiving, searching and retrieving, browsing, reading,...) should be based on standard and structured documents Because of the wide variety of authors, even within an institution, the passage to a structured document in SGML should be as easy as possible for them  Human styling of word processor documents  Processing mostly based on Omnimark scripts  Static HTML publishing on the Web, SGML publishing with Softquad Panorama

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Cyberthèses project today Nine institutional partners  France, Canada, Egypt, Chile, Switzerland, … Around 40 institutional users  France, Morocco, Algeria, Lebanon, Burkano Faso, Vietnam, Mauritius, Madagascar, Senegal, Mali, … Very strong impact in France, but also... ... in Africa and Indian Ocean ... in South-America, under the initiative of Universidad de Chile But what about tomorrow…?

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Objectives of the open source transition Economical  Increase the dissemination of the project and its platform at very low costs Technological  Review the platform and add functionalities where possible XML? Dynamic Web site? Unicode? MathML? SVG? OAI?

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Objectives of the open source transition Long-term viability of the project  Benefit from the four freedoms of Free software Freedom to run the software, for any purpose Freedom to study the program, and adapt it to your needs Freedom to redistribute copies Freedom to improve the program (according to the Free Software Foundation) Cyberthèses had already addressed the long-term viability of its documents, the theses, by using open and structured standards. It now addresses the long-term viability of its tools by using free software and by building a community around them.

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Cyberdocs now A complete platform for publishing structured electronic documents  Conversion module: from word processor to TEILite XML  Management module: Web interface for driving conversions  Publication module: dynamic Web application for publishing documents First release candidate available ( GNU Public Licence (GPL)

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Description Conversion module: from word processor to structured documents  Word processor documents are styled to identify important contents and structures  Flat XML extraction using OpenOffice.org office suite  Conversion to TEILite based on XSLT transformations  Production of support files for the dynamic Web application  Production of static HTML, XHTML and PDF files from XML Used in the dynamic Web application for printing May be used to publish using other systems

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Description Management module: Web interface for driving conversions  May serve multiple institutions  Administrators can create users  Workers can create workspaces for documents Upload files (word processor documents, images, etc.) Start the conversion process, at any step See the results of the conversion, at any step, with adapted error and information messages Make necessary corrections and restart if necessary Forms to add metadata (Cyberthèses, Dublin Core, ETDMS, …) Publish the document if the dynamic Web application is used

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Description Publication module: dynamic Web application  Underlying infrastructure SDX, an XML search engine and publishing framework Cocoon, an XML-based infrastructure for building dynamic Web applications  Functionalities Search documents using simple queries or complex forms  Search in metadata  Search in fulltext  Search in specific zones (section titles, figure legends, table captions, …) List documents by institutions, dates, subjects, … Browse documents  Interactive table of contents, list of tables, list of figures, …  Query terms highlighting  Search within a document

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Description Publication module: dynamic Web application  Easily customizable Translations Skins Metadata labels Colours using per institution CSS  OAI-PMH repository Built-in OAI-PMH support in the SDX platform Sends metadata in Dublin Core (mandatory), ETDMS  One could easily add OAI-PMH harvesting capabilities

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Benefits Technical  Some small indirect technical benefits MathML Unicode Easier graphics capabilities Dynamic Web site OAI repository  Benefit from other dynamic open source efforts In XML processing In TEILite tools  Participate in the huge global effort within the open source community towards implementing open technologies based on standards

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Benefits Organizational  Easier cooperation The Cyberthèses program is already centred around cooperation Now its platform favours such cooperation and exchange of expertise  Lower costs More important in the long term  Attractiveness New developers New contributors  Documentation  Tutorials  Translations New areas of application

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Benefits Organizational  New economic model In the past and currently: major sponsoring from public institutions or organizations In the future: small contributions from a variety of individuals and organizations?

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Future developments No specific plans yet, but should come before the end of the year Functionalities that could be added  Support for OAI-PMH harvesting  Better support for various document types Reports Electronic journal and newletters Monographs  Support other DTDs Conversion module Publication module  Support a wider range of word processor features

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Future developments Main objectives of future developments  Make Cyberdocs not only a great publishing platform for electronic theses and dissertations, but for any kind of scientific literature  Increase the number of supported languages in the various modules by translating messages, in order to broaden the distribution around the world  Attract new developers and financing institutions interested in various uses of the platform

Cyberthèses and Cyberdocs - Martin Sévigny - EADI-IMWG 2003 – September 12th Conclusion Three phases of open source projects  Phase 1: initial development Few developers and sponsors Few users  Phase 2: growing user base Still few developers and sponsors More users  Phase 3: sustainable development Various developers and sponsors More and more users Cyberdocs will try to reach phase 3 by developing user and developer communities