Grid Content Management Jim Myers PNNL. GFS-WG Aims to –describe and manage the namespace of federated data sets, access control mechanisms, and meta-

Slides:



Advertisements
Similar presentations
Data Format Description Language (DFDL) WG Martin Westhead EPCC, University of Edinburgh Alan Chappell PNNL
Advertisements

Meta Data Larry, Stirling md on data access – data types, domain meta-data discovery Scott, Ohio State – caBIG md driven architecture semantic md Alexander.
OASIS OData Technical Committee. AGENDA Introduction OASIS OData Technical Committee OData Overview Work of the Technical Committee Q&A.
Welcome to Middleware Joseph Amrithraj
Data Management Expert Panel - WP2. WP2 Overview.
GridVine: Building Internet-Scale Semantic Overlay Networks By Lan Tian.
TCP/IP Protocol Suite 1 Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 22 World Wide Web and HTTP.
Lecture-7/ T. Nouf Almujally
HEP Data Sharing … … and Web Storage services Alberto Pace Information Technology Division.
HyperContent 2.0 JA-SIG Winter Conference December 5, 2005 Alex Vigdor, Columbia University.
Notes to the presenter. I would like to thank Jim Waldo, Jon Bostrom, and Dennis Govoni. They helped me put this presentation together for the field.
Next Generation Node (NGN) Technical Overview April 2007.
1 Workshop on Metadata Interoperability for Electronic Records Management November 15, 2001 Archives II, College Park, MD.
Mapping Physical Formats to Logical Models to Extract Data and Metadata Tara Talbott IPAW ‘06.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Application Standards for ‘Push’ Content and Streaming Media Hadi Partovi Microsoft Corporation.
Introduction to UDDI From: OASIS, Introduction to UDDI: Important Features and Functional Concepts.
HTML Comprehensive Concepts and Techniques Intro Project Introduction to HTML.
Overview of the ODP Data Provider Sergey Sukhonosov National Oceanographic Data Centre, Russia Expert training on the Ocean Data Portal technology, Buenos.
Web-based Software Development Web-based Distributed Authoring and Versioning Jul 19, 2005 Shin Young Ahn.
Electronic Notebooks: An Interface Component for Semantic Records Systems James D. Myers, Michael Peterson, K Prasad Saripalli, Tara Talbott Mathematics.
DATABASE and XML Moussa Mané. Learning Objectives ● Learn about Native XML Databases ● Learn about the conversion technology available ● Understand New.
Copyright © IBM Corp., All rights reserved; made available under the EPL v1.0 | March 20, 2008 | Short Talk Standards based systems management: An.
Digital Object Architecture
Microsoft Active Directory(AD) A presentation by Robert, Jasmine, Val and Scott IMT546 December 11, 2004.
Java Servlets CS-422. Application Mapping Your servlet application will be mapped to a directory structure: –“myapp” maps to some directory C:/docs/apps/myapp.
SITools Enhanced Use of Laboratory Services and Data Romain Conseil
A Metadata Catalog Service for Data Intensive Applications Presented by Chin-Yi Tsai.
第十四章 J2EE 入门 Introduction What is J2EE ?
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
WebDAV Issues Munich IETF August 11, Property URL encoding At present, spec. allows encoding of the name of a property so it can be appended to.
San Diego Supercomputer Center National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center National Partnership for Advanced.
Interfacing Registry Systems December 2000.
Java Naming and Directory Interfaces. A naming service is an entity that performs the following tasks:  It associates names with objects. Similar to.
HyperContent 2.0 Common Solutions Group September 21, 2005 Alex Vigdor, Columbia University.
XML Registries Source: Java TM API for XML Registries Specification.
Fisheries Oceanography Collaboration Software Donald Denbo NOAA/PMEL-UW/JISAO Presented by Nancy Soreide NOAA/PMEL AMS 2002/IIPS 10.3.
New Ideas for IA Readings review - How to manage the process Content Management Process Management - New ideas in design Information Objects Content Genres.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Content Repositories with CMIS and Apache Chemistry Stephan Klevenz, SAP AG November 2011.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
Data Provenance and Annotation Dec. 2, 2003 Collaboratory for Multi-scale Chemical Science (CMCS): A Knowledge Grid/ Adaptive Informatics Infrastructure.
Struts 2 introduction. Struts 2 framework Struts 2 A full-featured web application framework for the Java EE platform The Java Servlet API exposes the.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
Scientific Annotation Middleware (SAM) Jim Myers, Elena Mendoza PNNL Al Geist, Jens Schwidder ORNL.
1 Registry Services Overview J. Steven Hughes (Deputy Chair) Principal Computer Scientist NASA/JPL 17 December 2015.
Adapting the Electronic Laboratory Notebook for the Semantic Era Tara Talbott, Michael Peterson, Jens Schwidder, James D. Myers 2005 International Symposium.
WebDAV Working across the Internet: Peter Pierrou, Excosoft.
WEB SERVER SOFTWARE FEATURE SETS
Oracle HFM Implementation Boot Camp
1 Shuaib Karim Institute of Software Technology & Interactive Systems (Vienna University of Technology)
Session #2482 Implementing WebDAV Using J2EE TM and Open Source Technologies Juergen Pill Team Leader Software AG Remy Maucherat Software Engineer Sun.
The ECOST Web-based platform for data providers and for data users.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Document Management Alliance (DMA)
© 2010 IBM Corporation RESTFul Service Modelling in Rational Software Architect April, 2011.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
ODP V2 Data Provider overview. 22 Scope Data Provider provides access to data and metadata of the local data systems. Data Provider is a wrapper, installed.
BOF-1147, JavaTM Technology and WebDAV: Standardizing Content Management Java and WebDAV Juergen Pill Team Leader Software AG Remy Maucherat Software Engineer.
Integrating CMS/DMS into OpenOffice Michael E. Bohn Consultant Office Migration Sun Microsystems GmbH.
SmartCode Brad Argue INLS /19/2001.
Warm Handshake with Websites, Servers and Web Servers:
Web Services Primer Overview of Web Services
XML in WebDAV or, a Tale of Two Standards
Document Management Reference Model
Web Page Concept and Design :
Database Systems Instructor Name: Lecture-3.
Presentation transcript:

Grid Content Management Jim Myers PNNL

GFS-WG Aims to –describe and manage the namespace of federated data sets, access control mechanisms, and meta- data –(a) virtualized hierarchical namespaces for files or data sets, (b) efficient and transparent file sharing, and (c) access control with flexible capabilities management, and (d) ability to manage other metadata. –(e.g. data in file systems, FTP server, WWW sites, streams, etc.,) or semi-structured data (XML repositories).

Why a GFS? Familiar metaphor supporting user control of –data organization, –access control, –file metadata Value added beyond a UUID/address

Content Management What other aspects of data management should/could be virtualized? –User metadata –Granularity –Versioning –Searching –Locking –Observation –Content Typing –(Semantic) Linking –Packaging –Transactions

Questions for GFS-WG Are these useful services to think about? Are they logically dependent on/connected to a virtual namespace? Do they require significant additional capabilities to implement?

Are they useful? Ive been influenced by JSR 170 and WebDAV… Yes!

JSR-170 Expert Group Major CM, DM & Repository Vendors Content Application Vendors Application Server Vendors Integration Experts Open Source Community Representatives

JSR-170 Expert Group Apache Software Foundation Art Technology Group Inc.(ATG) BEA Systems Broadvision Inc. Day Divine Documentum, Inc. Filenet Corporation Fujitsu Limited Griffin, Sean Hewlett-Packard IBM Intalio, Inc. Interwoven Kandzior, Alexander Macromedia, Inc. Mark, Scott Mediasurface Ltd. Myers, James D. Novell, Inc. Oracle Rational Software SAP AG SAS Institute Inc. Shin, Simon Y.S. Software AG Stellent, Inc. Sun Microsystems, Inc. Thodla, Dorai Venetica Corporation Vignette

Simplified Content Model

Web Distributed Authoring and Versioning (WebDAV) An early web service (XML Payloads over HTTP) Put/Get data with arbitrary properties (dynamic) Properties can be discovered and accessed independently DASL, Versioning, Transactions, …

Scientific Annotation Middleware

Supporting A Wide range of Applications File View –Implemented by DAVfs, MS WebFolders Content View –DAVExplorer views properties, versioning Provenance View –SAM/CMCS generates provenance graphs, etc. Fortran Application Local Disk DAV Store DAV+ JMS Resource + Key/value metatadata

What is required? Arbitrary metadata associated with a logical name –Not much more than is requires to support a file system view Interpreting metadata to implement specific capabilities could be separable (level1,2 compliance)

Questions for GFS-WG Are these useful services to think about? Are they logically dependent on/connected to a virtual namespace? Do they require significant additional capabilities to implement? Should they be considered in this WG? Use a level 1, level 2 compliance scheme?

If yes Do we need a document describing content management in more detail? –Concept –Benefits (higher level services such as provenance, …) –Mapping(s) to virtual file directory service? –Grid-related practice?

Hierarchy Support Sample API getNode(String path) addNode(String path) removeNode(String path) getNodes() moveTo(String absPath) copyTo(String absPath)

Make the case for other services, Argue why they apply to a virtual namespace Note that they may rely on lower level services tied to the UUID Argue that most can be implemented using properties to store state Argue that GFS should be GCM – level 1 ala JSR 170.

XML Serialization Example DTD: <!ATTLIST node name CDATA #REQUIRED> <!ATTLIST property name CDATA #REQUIRED type (String|Date|SoftLink|Binary|Double|Long|Boolean) "String" onVersion (copy|noCopy) "copy" pattern CDATA ".*" defaultValue CDATA "">

Scope of Level 2 Spec What does an extended Content Repository do? –Versioning –Searching –Locking –Observation –Content Typing –Linking –Packaging –Transactions –Access Control