Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.

Slides:



Advertisements
Similar presentations
X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo
Advertisements

National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids for Collection Federation Reagan W. Moore University.
Grid Content Management Jim Myers PNNL. GFS-WG Aims to –describe and manage the namespace of federated data sets, access control mechanisms, and meta-
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Capturing and Supporting Contexts for Scientific Data Sharing via the Biological Sciences Collaboratory George Chin Jr. and Carina S. Lansing (PNNL) Appeared.
Mapping Physical Formats to Logical Models to Extract Data and Metadata Tara Talbott IPAW ‘06.
What is Asset Bank? Asset Bank is an enterprise-scale Digital Asset Management system A fully searchable, categorised library of digital images, videos.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Talend 5.4 Architecture Adam Pemble Talend Professional Services.
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
Research sponsored by Mathematics, Information and Computational Sciences Office U.S. Department of Energy Al Geist Jens Schwidder David Jung Computer.
Electronic Notebooks: An Interface Component for Semantic Records Systems James D. Myers, Michael Peterson, K Prasad Saripalli, Tara Talbott Mathematics.
GRACE Project IST EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Digital Object Architecture
A Metadata Catalog Service for Data Intensive Applications Presented by Chin-Yi Tsai.
Towards a Provenance Architecture Karen Schuchardt PNNL.
material assembled from the web pages at
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Environmental Molecular Sciences LaboratoryDOE Security Workshop Electronic Notebooks (Collaboratories) James D. Myers EMSL Collaboratory Project Pacific.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness TA Weijing Chen Semantic eScience Week 10, November 7, 2011.
Integrated Collaborative Information Systems Ahmet E. Topcu Advisor: Prof Dr. Geoffrey Fox 1.
SEEK EcoGrid l Integrate diverse data networks from ecology, biodiversity, and environmental sciences l Metacat, DiGIR, SRB, Xanthoria,... l EML is the.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
ILDG Middleware Status Chip Watson ILDG-6 Workshop May 12, 2005.
Design of a Search Engine for Metadata Search Based on Metalogy Ing-Xiang Chen, Che-Min Chen,and Cheng-Zen Yang Dept. of Computer Engineering and Science.
Fisheries Oceanography Collaboration Software Donald Denbo NOAA/PMEL-UW/JISAO Presented by Nancy Soreide NOAA/PMEL AMS 2002/IIPS 10.3.
1 Windows 2008 Configuring Server Roles and Services.
GEM Portal and SERVOGrid for Earthquake Science PTLIU Laboratory for Community Grids Geoffrey Fox, Marlon Pierce Computer Science, Informatics, Physics.
Information Builders : SmartMart Seon-Min Rhee Visualization & Simulation Lab Dept. of Computer Science & Engineering Ewha Womans University.
DataNet – Flexible Metadata Overlay over File Resources Daniel Harężlak 1, Marek Kasztelnik 1, Maciej Pawlik 1, Bartosz Wilk 1, Marian Bubak 1,2 1 ACC.
Middleware for Grid Computing and the relationship to Middleware at large ECE 1770 : Middleware Systems By: Sepehr (Sep) Seyedi Date: Thurs. January 23,
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
Alexandria Digital Earth ProtoType DIGITAL LIBRARIES AND ENVIRONMENTAL INFORMATION Terence R. Smith Alexandria Digital Library Project.
Extending Access To Information Resource Discovery Service William E. Moen, Ph.D. Kathleen R. Murray, Ph.D. School of Library and Information Sciences.
JCR Content Management Jukka Zitting
Jamie Hall (ILL). SciencePAD Persistent Identifiers Workshop PANData Software Catalogue January 30th 2013 Jamie Hall Developer IT Services, Institut Laue-Langevin.
Policy Based Data Management Data-Intensive Computing Distributed Collections Grid-Enabled Storage iRODS Reagan W. Moore 1.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Data Provenance and Annotation Dec. 2, 2003 Collaboratory for Multi-scale Chemical Science (CMCS): A Knowledge Grid/ Adaptive Informatics Infrastructure.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
A radiologist analyzes an X-ray image, and writes his observations on papers  Image Tagging improves the quality, consistency.  Usefulness of the data.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
May 6, 2002Earth System Grid - Williams The Earth System Grid Presented by Dean N. Williams PI’s: Ian Foster (ANL); Don Middleton (NCAR); and Dean Williams.
Cole David Ronnie Julio. Introduction Globus is A community of users and developers who collaborate on the use and development of open source software,
National Computational Science National Center for Supercomputing Applications National Computational Science GSI Online Credential Retrieval Requirements.
Scientific Annotation Middleware (SAM) Jim Myers, Elena Mendoza PNNL Al Geist, Jens Schwidder ORNL.
Enabling e-Research in Combustion Research Community T.V Pham 1, P.M. Dew 1, L.M.S. Lau 1 and M.J. Pilling 2 1 School of Computing 2 School of Chemistry.
Strictly Business Using “StrictlyFused” to Create an Extensible Knowledge Portal.
1 Registry Services Overview J. Steven Hughes (Deputy Chair) Principal Computer Scientist NASA/JPL 17 December 2015.
14 1 Chapter 14 Web Database Development Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel.
NeuroLOG ANR-06-TLOG-024 Software technologies for integration of process and data in medical imaging A transitional.
Adapting the Electronic Laboratory Notebook for the Semantic Era Tara Talbott, Michael Peterson, Jens Schwidder, James D. Myers 2005 International Symposium.
DSpace - Digital Library Software
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
SEEK Science Environment for Ecological Knowledge l EcoGrid l Ecological, biodiversity and environmental data l Computational access l Standardized, open.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
Partnerships in Innovation: Serving a Networked Nation Grid Technologies: Foundations for Preservation Environments Portals for managing user interactions.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
PROGRESS: GEW'2003 Using Resources of Multiple Grids with the Grid Service Provider Michał Kosiedowski.
March 2004 At A Glance The AutoFDS provides a web- based interface to acquire, generate, and distribute products, using the GMSEC Reference Architecture.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
Scientific Annotation Middleware: Data/Metadata Access/Transformation Services On/Off the Grid James D. Myers Pacific Northwest National Laboratory.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
AMGA Web Interface Vincenzo Milazzo
About Thetus Thetus develops knowledge discovery and modeling infrastructure software for customers who: Have high value data that does not neatly fit.
Presentation transcript:

Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder Tara D. Gibson Computer Science Research Group Computer Science and Mathematics Division James D. Myers National Center for Supercomputing Applications

2 Schwidder_SAM_SC07 Scientific Annotation Middleware (SAM) objectives  Develop a lightweight, flexible middleware to support the creation and use of metadata and annotations  Support sharing of annotations among scientific applications, portals, problem-solving environments, and electronic notebooks  Improve the completeness, accuracy, and availability of the scientific record  Support mapping between the annotation schemas of different scientific domains, thus enabling collaboration

3 Schwidder_SAM_SC07 Middleware approach Various client and storage interfaces provide transparent integration of legacy applications as well as new applications using new, more powerful semantics

4 Schwidder_SAM_SC07 Characteristics Features  Middleware design capable of integrating into multiple service architectures  “Schema-less” store that accepts arbitrary content and metadata  Dynamic metadata/data translations to support evolving standards and lightweight integration  Layered design to allow basic and advanced clients and interactions between them  Meta–data translation/extraction  Semantic services  Distributed Authoring and Versioning (DAV)  Notebook services and user interfaces  Event notification using Java Messaging Service (JMS)  Prototype implementation of Java Content Repository (JCR)(JSR 170) -based SAM layer that allows adding SAM capabilities to JCRs

5 Schwidder_SAM_SC07 Benefits of the SAM system  Rich, accessible, integrated scientific records  Support for system-science cyber environments and collaboration across disciplines  Increased automation of metadata capture and data/metadata translation  Integrated electronic notebook, semantic relationship (e.g., provenance) tracking, and third-party annotation services  Open source, standards-based scientific content management services  Flexible authentication and authorization support

6 Schwidder_SAM_SC07 SAM-based electronic notebooks  Take advantage of advanced SAM features, such as data translation  Provide hierarchical chapters/pages/notes  Provide add/view/search notes  Provide multiple client interfaces  Internationalized Electronic Laboratory Notebook (ELN) client  HTML-based Web interfaces  Enable applications to provide notebook functionality using SAM notebook API/components  Can serve as record with electronic signatures  Allow scientists to share notes in distributed teams  Allow notifications

7 Schwidder_SAM_SC07 Community interactions Data Format Description Language (DFDL) standardization within the Global Grid Forum JCR (JSR 170) standardization within the Java Community Process Battelle records managers DOE2000 electronic notebook (Enote and ELN) communities PNNL Computational Science and Mathematics Division Semantic data grid to store, generate, and query provenance information Collaboratory for Multiscale Chemical Science (CMCS)—using SAM to support a portal-based community knowledge grid SAM-based internationalized grid-capable notebook Automated experiment records, user annotations, and customized instrument logs MAEviz—“Consequence-based Risk Management Cyberenvironment”—using SAM to support shared data and provenance

8 Schwidder_SAM_SC07 CMCS use of SAM Powers CMCS knowledge management Provides a node plus metadata/relationship view of underlying data sources Supports put/get/search/access control of arbitrary data/metadata Enables configurable metadata extraction from binary/ASCII/XML files Enables semantic/graph queries More information on CMCS at Fortran application Fortran application ‘Local disk’ Data grid DAV DAV+ JMS ELN CMCS

9 Schwidder_SAM_SC07 SAM release  DFDL, Web service, and XSLT-based metadata extraction and data translation capabilities  Improved semantic search capabilities using an extension of DAV searching and location and Lucene indexing  JDBC databases, file systems as data/metadata stores  Simple Web-based SAM and notebook administration  Internationalized ELN client (accepts UNICODE for Chinese/Japanese character sets)  Optional fully Web-based version of the ELN client  JAAS-based single-sign-on capabilities  Notarization server and proxy implementation  Command-line client and client API library  Jakarta Slide 2.1 code base  Requirements: Java 1.4 (or higher) and Tomcat 5.x

10 Schwidder_SAM_SC07 Spallation Neutron Source (SNS) Notebooks  The electronic notebook software for the SNS is being developed based on the research done in the SAM project.  Support for different types of notebooks:  Instrument notebooks  Record events and annotations regarding an instrument.  Structure fixed; entries can’t be edited, but can be annotated.  Proposal notebooks  Contain research annotations for a proposal and its experiments.  Structure and editing policies under control of proposal PI.  Layered access control for SNS users and groups:  Personal notebooks.  Shared proposal and instrument notebooks.  Web-based user interfaces using AJAX.  Support for Wiki-formatting to support easy input of structured text.  JCR-based storage system.

11 Schwidder_SAM_SC07 More information about SAM Project information at SAM source code hosted at BSD/Apache-style open source license

12 Schwidder_SAM_SC07 Contact Jens Schwidder Computer Science and Mathematics Division Computer Science Research Group (865) Schwidder_SAM_SC07