Exeter – Implementation of a Crosswalk Connector
S. Trowell, University of Exeter, Nov 2013


The problem from an Institutional perspective

The three UKRISS partner institutions (King’s, Brunel and Exeter) all have different organizational structures, methods of data processing and system architectures. Such differences are common across the sector; in fact, no two institutions will be the same in this regard. However, it is also very apparent that we all share the same issues:

- Many manual processes support all forms of information flow between the institution and funders, particularly in support of research reporting.
- Responsibility for completing many reporting streams is split between PIs and central research support administrative staff, with little visibility of the data entered.
- Much of the required data resides across a range of internal systems, each storing the data in different database structures and with different naming conventions for data fields.
- Institutions recognize that improving information flow processes would yield benefits, yet competing institutional priorities mean only limited resources can be invested in this area.

Exeter – Implementation aims

- Deliver a proof of concept to demonstrate principles for automating the flow of data from institutions.
- Since all institutions differ in systems architecture, a ‘Universal Connector’ is needed.
- Demonstrate how an open-source crosswalk connector can be used to:
  - Extract information from existing institutional source systems.
  - Arrange the data into templates that reproduce the data required for ROS and Research Fish.
  - Convert this native data into a standardised data model [CERIF].
  - Deposit the outputs in an accessible location so that the recipient [funding body] can easily and securely access the data and decode it into a suitable format for ingest into the funder’s internal systems.
- Demonstrate how the connector can be used in conjunction with data validation tools to improve the underlying data quality.

The approach

- Focus on the ROS and Research Fish requirements by way of an example.
- Reproducing the current ROS and RF outputs, define a global data set that encompasses all ROS and RF requirements.
- Work with a local [Exeter] software supplier, CERTUS Technology Associates, to develop an open-source connector tool.
- Demonstrate this tool working in conjunction with Cottage Labs’ data validation tools.
- Demonstrate how this approach may be used for data sets other than ROS and Research Fish and therefore directly benefit all stakeholders [funders, public bodies and institutions].

Connector Detail – what is it?

- Open Source Pentaho Data Integration Community Edition (PDI CE), sometimes referred to as Kettle (Kettle Extraction Transformation Transport Load Environment).
- Graphical ‘drag and drop’ environment to combine extract, transform and load steps.
- Able to schedule ‘jobs’ to run at a pre-determined time.

UKRISS Connector methodology

[Diagram; labels: Institution X (Research System A, Research System B, HR System, Finance System, Student System); Connector Tool; Secure ftp; External to Institution, hosted database; Reporting capability; Sponsors / Funding Body; Destination server]

UKRISS Connector methodology

[Diagram; labels: Institution X (Research System A, Research System B, HR System, Finance System, Student System); Connector Tool: Map data fields, Convert to CDM, Create output (e.g. combined ROS/RF output), Map to CERIF, Apply data validation tools]

Connector Detail – how does it work?

- Extract – mapping data fields:
  - Integrating multiple sources of information relating to the same entity [e.g. a project or person] requires the ability to identify matching records across these systems.
  - For each data field, UKRISS has agreed a name, type, format, meaning, vocabulary, and representation in CERIF.
- Transform – convert to CDM. Source data may be in a range of formats:
  - Database input to CDM
  - CSV file input to CDM
  - XML to CDM
- Transform – convert the CDM into the latest version of CERIF XML.
- Load – output to one or more destinations, including a security layer. Output data may be in a range of formats:
  - CDM to database
  - CDM to CSV file
  - CDM to CERIF XML

[Diagram; labels: Connector Tool: Map data fields, Convert to CDM, Create output, Map to CERIF]
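The extract and load steps described on this slide can be sketched outside Pentaho in a few lines of Python. This is only an illustration of the idea, not the UKRISS connector itself: the two source exports, every column name, the CDM field names and the helpers `extract`, `join_on` and `to_csv` are all hypothetical, and the actual field definitions agreed by UKRISS are not reproduced here.

```python
import csv
import io

# Hypothetical exports from two institutional systems. All column names
# and values are invented for illustration.
RESEARCH_CSV = """proj_ref,proj_title,pi_staff_no
R100,Coastal Erosion Modelling,S01
R200,Medieval Trade Routes,S02
"""
HR_CSV = """StaffNumber,Surname,Forename
S01,Ahmed,Leila
S02,Jones,Tom
"""

# Field maps: source column -> CDM field name (hypothetical names).
RESEARCH_MAP = {"proj_ref": "project_id", "proj_title": "project_title",
                "pi_staff_no": "pi_id"}
HR_MAP = {"StaffNumber": "pi_id", "Surname": "pi_family_name",
          "Forename": "pi_given_name"}

# Column order of the CDM output (assumed).
CDM_FIELDS = ["project_id", "project_title", "pi_id",
              "pi_family_name", "pi_given_name"]


def extract(csv_text, field_map):
    """Read CSV rows and rename the mapped columns to CDM field names."""
    rows = csv.DictReader(io.StringIO(csv_text))
    return [{field_map[k]: v.strip() for k, v in row.items() if k in field_map}
            for row in rows]


def join_on(left, right, key):
    """Match records that describe the same entity via a shared key."""
    index = {r[key]: r for r in right}
    return [{**rec, **index.get(rec[key], {})} for rec in left]


def to_csv(records):
    """Load step: serialise CDM records to CSV text."""
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=CDM_FIELDS, restval="",
                            lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return out.getvalue()


cdm_records = join_on(extract(RESEARCH_CSV, RESEARCH_MAP),
                      extract(HR_CSV, HR_MAP), "pi_id")
print(to_csv(cdm_records))
```

Keeping the field maps as plain data rather than code mirrors the point made on the slide: once the name and meaning of each field are agreed, supporting another source system should mostly be a matter of writing another map.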

Simple example of a transform to generate CERIF XML

[Figure not included in transcript]
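As a rough stand-in for the kind of transform this slide refers to, the sketch below turns flat project records into CERIF-style XML using Python's standard library. It is a minimal sketch under stated assumptions: the input field names are hypothetical, the element and attribute names (`cfProj`, `cfProjId`, `cfTitle`, `cfLangCode`, `cfTrans`) are modelled on CERIF XML and should be checked against the schema release being targeted, and the namespace and release attributes that a real CERIF file declares are omitted for brevity.

```python
import xml.etree.ElementTree as ET

# Hypothetical CDM records; the field names are assumptions.
cdm_projects = [
    {"project_id": "R100", "project_title": "Coastal Erosion Modelling"},
    {"project_id": "R200", "project_title": "Medieval Trade Routes"},
]


def cdm_to_cerif(records):
    """Build a simplified, namespace-free CERIF-style document."""
    root = ET.Element("CERIF")
    for rec in records:
        proj = ET.SubElement(root, "cfProj")
        ET.SubElement(proj, "cfProjId").text = rec["project_id"]
        # Language and translation attributes are assumed defaults here.
        title = ET.SubElement(proj, "cfTitle",
                              {"cfLangCode": "en", "cfTrans": "o"})
        title.text = rec["project_title"]
    return ET.tostring(root, encoding="unicode")


cerif_xml = cdm_to_cerif(cdm_projects)
print(cerif_xml)
```

A production transform would also validate the result against the CERIF XML schema before depositing it.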

Specifying the fields to be ‘CERIFised’

[Figure not included in transcript]

A Flexible approach… different data, different directions

- This concept may be used for data sets other than ROS and Research Fish, e.g. the HE-BCI survey.
- The Crosswalk Connector functionality is reversible, so funders could use it to pick up institutions’ outputs and ingest them into their existing internal systems.

[Diagram; labels: Institution X (Research System A, Research System B, HR System, Finance System, Student System); Connector Tool; Secure ftp; External to Institution, hosted database; Reporting capability; Sponsors / Funding Body; Destination server]

A Flexible approach… different data, different directions

[Diagram; labels: Institution X, Institution Y and Institution Z, each with a ROS or RF report; Connector Tool; External to Institution; Reporting capability; Sponsors / Funding Body]

A Flexible approach… different data, different directions

- This concept may be used for data sets other than ROS and Research Fish and therefore directly benefit all stakeholders [funders, public bodies and institutions].
- The Crosswalk Connector functionality is reversible, so funders could use it to pick up institutions’ outputs and ingest them into their existing internal systems.
- Alternatively, it could be used to aggregate funding opportunity announcements for coordinated dissemination to institutions.

Receiving CERIF XML messages

[Figure not included in transcript]
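The receiving direction can be illustrated in the same hedged way: a funder-side script reads CERIF-style XML from several institutions and flattens it into rows ready for ingest. The sample messages, the institution labels and the output field names are invented for illustration, and the XML is a simplified, namespace-free approximation of CERIF rather than a schema-valid message.

```python
import xml.etree.ElementTree as ET

# Hypothetical messages received from two institutions.
MESSAGE_X = """<CERIF>
  <cfProj>
    <cfProjId>R100</cfProjId>
    <cfTitle cfLangCode="en" cfTrans="o">Coastal Erosion Modelling</cfTitle>
  </cfProj>
</CERIF>"""
MESSAGE_Y = """<CERIF>
  <cfProj>
    <cfProjId>P7</cfProjId>
    <cfTitle cfLangCode="en" cfTrans="o">Soil Carbon Survey</cfTitle>
  </cfProj>
  <cfProj>
    <cfProjId>P8</cfProjId>
    <cfTitle cfLangCode="en" cfTrans="o">Tidal Energy Storage</cfTitle>
  </cfProj>
</CERIF>"""


def cerif_to_records(xml_text, institution):
    """Flatten each cfProj element into a row tagged with its sender."""
    root = ET.fromstring(xml_text)
    return [{"institution": institution,
             "project_id": proj.findtext("cfProjId"),
             "project_title": proj.findtext("cfTitle")}
            for proj in root.iter("cfProj")]


# Aggregate the reports from all senders into one list for ingest.
received = (cerif_to_records(MESSAGE_X, "Institution X")
            + cerif_to_records(MESSAGE_Y, "Institution Y"))
for row in received:
    print(row)
```

Because every sender uses the same element names, one parser serves all institutions, which is the practical benefit of exchanging data in a shared model.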

In summary

- Implementations are only at a proof-of-concept stage. Production solutions would require further investment and development, and consensus across the sector.
- The UKRISS ‘Universal Connector’ is a modular approach, fully compatible with additional data validation tools.
- It provides a flexible and sustainable systems architecture that can grow as CERIF standards evolve.

Acknowledgement: thanks to Dr Brian Lings, Certus Technology Associates.

For further information contact Dr Steve Trowell, Exeter: