RDA 9th Plenary Breakout 3, 5 April :00-17:30

Slides:



Advertisements
Similar presentations
Agenda 2 factor authentication Smart cards Virtual smart cards FIM CM
Advertisements

Chapter 13 Review Questions
Information Systems Analysis and Design
Service Oriented Architecture Concepts March 27, 2006 Chris Armstrong
Web Application Architecture: multi-tier (2-tier, 3-tier) & mvc
Process-oriented System Automation Executable Process Modeling & Process Automation.
SOA, BPM, BPEL, jBPM.
ESB Guidance 2.0 Kevin Gock
DATA FOUNDATION TERMINOLOGY WG 4 th Plenary Update THE PLUM GOALS This model together with the derived terminology can be used Across communities and stakeholders.
SOFTWARE DESIGN AND ARCHITECTURE LECTURE 09. Review Introduction to architectural styles Distributed architectures – Client Server Architecture – Multi-tier.
Margherita Forcolin (Insiel S.p.A.) Thessaloniki, 13 October 2011.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
1 Advanced Software Architecture Muhammad Bilal Bashir PhD Scholar (Computer Science) Mohammad Ali Jinnah University.
1 Geospatial and Business Intelligence Jean-Sébastien Turcotte Executive VP San Francisco - April 2007 Streamlining web mapping applications.
Extending Access To Information Resource Discovery Service William E. Moen, Ph.D. Kathleen R. Murray, Ph.D. School of Library and Information Sciences.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Distribution and components. 2 What is the problem? Enterprise computing is Large scale & complex: It supports large scale and complex organisations Spanning.
Effort.vs. Software Product “Quality” Effort Product “Quality” Which curve? - linear? - logarithmic? - exponential?
ATIS’ Service Oriented Networks (SON) Activity Andrew White, Nokia Siemens Networks DOCUMENT #:GSC15-PLEN-81r1 FOR:Presentation SOURCE:ATIS AGENDA ITEM:PLEN.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Data Foundation IG DF Organizing Chairs: Gary Berg-Cross & Peter Wittenburg.
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
BI Performance Management. Business Issues Too much information: Create confusions Multiple version of Truth: Lack of Trusted information: Incomplete,
System Software Laboratory Databases and the Grid by Paul Watson University of Newcastle Grid Computing: Making the Global Infrastructure a Reality June.
Bringing visibility to food security data results: harvests of PRAGMA and RDA Quan (Gabriel) Zhou, Venice Juanillas Ramil Mauleon, Jason Haga, Inna Kouper,
1 This slide indicated the continuous cycle of creating raw data or derived data based on collections of existing data. Identify components that could.
Intentions and Goals Comparison of core documents from DFIG and Publishing Workflow IG show that there is much overlap despite different starting points.
Workshop on Brokering in Data Fabrics - community perspectives -
VisIt Project Overview
DFIG and Workflows Tobias Weigel, Peter Wittenburg, Larry Lannom, Jay Pearlman, Stefano Nativi, Christine Staiger, Reagan Moore, Bridget Almas, Rainer.
Approaches and Challenges in Managing Persistent Identifiers
CIIT-Human Computer Interaction-CSC456-Fall-2015-Mr
RDA Data Fabric (DF) Interest Group Peter Wittenburg & Gary Berg-Cross
The Impact of Cloud Technology on Multi-Unit Operators
StratusLab Final Periodic Review
StratusLab Final Periodic Review
Data Ingestion in ENES and collaboration with RDA
PaaS Core Session (Notes from UPV)
OO Methodology OO Architecture.
Software Design and Architecture
Distribution and components
EIN 6133 Enterprise Engineering
Designing software applications
Agenda Welcome and overview (Peter)
C2CAMP (A Working Title)
Ramesh Baral Team: Marjani Peterson, Andre Guerrero
Chapter 2 Database Environment Pearson Education © 2009.
The GEO DAB possible contributions
Oracle Architecture Overview
Architectural Roadmap
Lecture 1: Multi-tier Architecture Overview
CSSSPEC6 SOFTWARE DEVELOPMENT WITH QUALITY ASSURANCE
Agenda welcome and goals (Peter)
From Observational Data to Information (OD2I IG )
Service Oriented Architecture (SOA)
Introduction to Systems Analysis and Design Stefano Moshi Memorial University College System Analysis & Design BIT
SAMANVITHA RAMAYANAM 18TH FEBRUARY 2010 CPE 691
CMIP6 use case and adoption of RDA outputs
Joint DFIG – Broker Meeting The DFIG view Peter Wittenburg
Introduction to SOA and Web Services
Bird of Feather Session
NextGRID: From Compute Grids to Grid SOAs and beyond
Brokering as a Core Element of EarthCube’s Cyberinfrastructure
SOA initiatives at Istat
Digital Object Management for ENES: Challenges and Opportunities
Leveraging PIDs for object management in data infrastructures RDA UK Node Workshop, July Tobias Weigel (DKRZ)
Chapter 2: Building a System
Building a “System” Moving from writing a program to building a system. What’s the difference?! Complexity, size, complexity, size complexity Breadth.
Presentation transcript:

RDA 9th Plenary Breakout 3, 5 April 2017 16:00-17:30 Joint meeting: IG Brokering, IG Data Fabric “Advancing Data Fabric with Brokering services” Wrap-up and next steps (Stefano Nativi, Bridget Almas, Peter Wittenburg, Jay Pearlman, Larry Lannom, ….)

Why are we here? Data Fabric Core Components, such as the PIT and DTR, provide essential pieces of infrastructure for managing and sharing Research Data But in order to be useful, they must be connected to provide functionality in research workflows Brokers have traditionally served this role in SOA architectures, providing facade interfaces, mediation and abstraction

GAP Composition Abstractions and Implementations PID Abstract Composition Data Brokers Executable Workflow(s) Objet Types Processing Brokers PID and Object typing help fill the gap Brokering services (e.g. Data services brokering and Processing services brokering) help fill the gap.

From Human-Controlled Process to Type-Triggered Automatic Processing End-to-end architecture Community Services architecture N M N + M N M N x M Data Server Client applications Client applications Harmonization task Mediation task Intermediation services Community (Third-party) Human Controlled Processing (HCP) Type-Triggered Automatic Processing (T-TAP) Web-as-a-Platform Mediation task Mediation task Data Servers 4

Client-Broker-Server architecture Community Services architecture Data Server Client applications Community services (Third-party) Web-as-a-Platform Intermediation services Broker(s)

Useful (architectural and Software) engineering Patterns Separation of concerns (SoC) separate architecture into distinct components, such that each component addresses a separate concern Information Expert Assign a responsibility to the component that has the information needed to fulfill it Low coupling Implement low dependency between the components

Brokering Services Provide the intermediation functionalities to pass from Abstraction to Implementations and vice versa Apply Service-Oriented Architecture approach Utilize Web-as-a-Platform (Web 2.0 patterns) Intermediation-as-a-Service Third-party services: not managed by Client/Server organizations

Brokering Services Benefits Challenges Multi-purpose, Reusable, Participate in Composition Sustainable through ability to evolve Flexible, Configurable, Extensible Challenges Trust, Governance New cultural and business model Requires specialist knowledge Types Discovery and Access Processing and Transformation

Future: Data and workflow integration Goal: Make the life easier for scientists who are no experts in programming and handling data Portals: integrate data and compute workflows Data Workflow Result Preview: Set 1 ✔ Set 2 ✔ Set 3 Workflow diagram: Diana tools – Taverna Heatmap: wikipedia … Data: <PID> 9

Future: Data and workflow integration Label data with PID Label (parts of) workflows with PIDs #Load data mRNA = W1(D1) miRNA = W1(D2) #Analysis res1 = W2(mRNA, miRNA) res = W3(res1) #PID for result file = writeToFile(res) create_PID(file) #Plot for preview Plot(res) PID D1 PID D2 PID D3 PID W1 PID W2 Portal builds the script and executes it based on PIDs for data and PIDs for code snippets. PID W3 PID W4 10

Matching in federation environment Repository metadata of DO bit sequences of DO PID Resolver MD bs PID record 1 DTR 5 PID Rights DB Type Record 3 2 12 4 rights record 6 13 10 Agent 9 Y 8 7 11 Controller WF 14 Matcher 14 14 Broker Profiles Broker Broker

Scalability, efficiency, flexibility, sustainability, close to the Client needs Clients Application Community services & Interface new breed of organizations providing intermediation services to “abstract” current “implementations” and enable composition Composition Intermediation Supply system

PID Centric Data Management and Access Brokering & Mediation services

Biodiversity Use Case

Climate data processing use case (I) Challenge: Worsening ratio of bandwith vs. volume Server-side processing – cultural change in scientific work Solution: Favour support for most common processing tasks over complexity/variety Service innovation rather than technical innovation Understand users, train influencers, reach high quality

Climate data processing use case (II) Motivation: multiple initiatives to build data processing solutions – some exist, some to come variety of data sources: CMIP6, Copernicus, ... some standardization across them, but detail differences brokering as middleware service – not for end- users, but software agents

WPS (final, preprocessing) Climate data processing use case (III) Agent (user need) WPS (final, preprocessing) Data sources Broker Broker component with a specific role: Middleware service catalogs will come; defines broker role Needed: Clear definition for broker component, respecting multiple roles common interface for broker tasking and output format (to be fed into controller)

Broker acting as Controller

Discussion

Thank you !