©STFC/Keith G JefferyELPUB 2007 Keynote Technical Infrastructure and PolicyFramework for Maximising the Benefits from Research Output Keith G Jeffery Science and Technology Facilities Council Rutherford Appleton Laboratory, OX11 0QX UK
©STFC/Keith G JefferyELPUB 2007 Keynote Structure 1. Introduction 2. Maximising Benefits from Research Output 3. Technical Infrastructure 4. Policy Framework 5. The Way Forward 6. Conclusion
©STFC/Keith G JefferyELPUB 2007 Keynote Director, IT & Internatio nal Strategy Director IT & Head BITD (IT, library, photorepro; > 1000 servers, users)
©STFC/Keith G JefferyELPUB 2007 Keynote European countries - major labs or consortia of universities ICT researchers Working groups Fellows programme Cor Baayen Award Strategy documents for EC and national governments R&D projects, networks of excellence etc > 100 spin-out companies European Office(s) of W3C ERCIM News ERCIM European Research Consortium for Informatics and Mathematics
©STFC/Keith G JefferyELPUB 2007 Keynote Linking together systems in each country managing research information –Funders of research –Organisations performing research For –Strategic decision-making about what research to fund /do –Finding research partners and competitors –Finding innovative ideas for technology transfer / exploitation –Informing the media / public CERIF: EU recommendation to member states
©STFC/Keith G JefferyELPUB 2007 Keynote Structure 1. Introduction 2. Maximising Benefits from Research Output 3. Technical Infrastructure 4. Policy Framework 5. The Way Forward 6. Conclusion
©STFC/Keith G JefferyELPUB 2007 Keynote Requirement Actors Researcher Research Manager Funding Agency Policymaker Innovator Educator Student Media Roles Review existing material ideas, techniques Evaluation researcher, organisation Search for innovative ideas Discover teaching material Input to ‘stories’ for public interest, ethics
©STFC/Keith G JefferyELPUB 2007 Keynote Requirement User: Fast Easy Homogeneous Sharing Legal Cost-effective middleware application Technical: GRIDs Formalised metadata Canonical syntax/semantics
©STFC/Keith G JefferyELPUB 2007 Keynote Benefits Faster research turnround – more progress Originator improved quality – access & review Community improved quality – access & review Improved innovation Improved education Improved public engagement Improved PR for institution => wealth creation / quality of life improvement
©STFC/Keith G JefferyELPUB 2007 Keynote Scenario middleware application CRIS project, person, organisational unit, research output (products, patents, publications), funding, facilities, equipment, events…… e-Research repository research datasets, software e-Research control experiments, take data, visualistaion, in-silico experiments (simulation) e-Process Workflows, research applications, travel requests, claims Not only work with the e-literature repository but also…..
©STFC/Keith G JefferyELPUB 2007 Keynote Scenario middleware application CRIS project, person, organisational unit, research output (products, patents, publications), funding, facilities, equipment, events…… e-Research repository research datasets, software e-Research control experiments, take data, visualistaion, in-silico experiments (simulation) e-Process Workflows, research applications, travel requests, claims Not only work with the e-literature repository but also…..
©STFC/Keith G JefferyELPUB 2007 Keynote Structure 1. Introduction 2. Maximising Benefits from Research Output 3. Technical Infrastructure 4. Policy Framework 5. The Way Forward 6. Conclusion
©STFC/Keith G JefferyELPUB 2007 Keynote Technical Infrastructure e-infrastructure CRIS Repositories Metadata & Curation Integration
©STFC/Keith G JefferyELPUB 2007 Keynote The GRIDs Architecture Knowledge Layer Information LayerComputation / Data Layer Data to Knowledge Control
©STFC/Keith G JefferyELPUB 2007 Keynote A POSSIBLE ARCHITECTURE U:USER S:SOURCE R:RESOURCE Rm:Resource Metadata Ra:Resource Agent Ua:User Agent Um:User Metadata Sm:Source Metadata Sa:Source Agent brokers The GRIDs Environment
©STFC/Keith G JefferyELPUB 2007 Keynote NGG1 Requirements Transparent and reliable Open to wide user and provider communities Pervasive and ubiquitous Secure and provide trust across multiple administrative domains Easy to use and to program Persistent Based on standards for software and protocols Person-centric Scalable Easy to configure and manage
©STFC/Keith G JefferyELPUB 2007 Keynote NGG2 NGG1 left some undefined research areas Call2 projects did not address all areas of research opportunity NGG2 convened to update the vision: –Particularly security / trust –Particularly self-* properties –Particularly semantic description of components Report September 2004
©STFC/Keith G JefferyELPUB 2007 Keynote Application AApplication BApplication C Grids Middleware Services Needed for A Grids Middleware Services Needed for B Grids Middleware Services Needed for C Grids Foundations for Operating System X Grids Foundations For Operating System Y Operating System X Operating System Y Grids Operating System (including Foundations) Modular and dynamically loadable NGG2 Architecture
©STFC/Keith G JefferyELPUB 2007 Keynote NGG2 Problems Layering architecture complex; need more flexibility Require functional software components - easy, fast development - re-use
©STFC/Keith G JefferyELPUB 2007 Keynote NGG3: SOKU Interfaces Computing Infrastructure Services Non SOKU
©STFC/Keith G JefferyELPUB 2007 Keynote SOKU Descriptive metadata Restrictive metadata Functional program code Descriptive metadata Restrictive metadata Functional program code Descriptive metadata Restrictive metadata Descriptive metadata Restrictive metadata Functional program code Descriptive metadata Restrictive metadata Functional program code Descriptive metadata Restrictive metadata Functional program code Descriptive metadata Restrictive metadata Functional program code Composed SOKU SOKU
©STFC/Keith G JefferyELPUB 2007 Keynote Technical Infrastructure e-infrastructure CRIS Repositories Metadata & Curation Integration
©STFC/Keith G JefferyELPUB 2007 Keynote The R&D Process Workprogramme Proposal Project Results Exploitation WealthCreation Note: some CRIS developers limit recording of outputs from the process to areas indicated Nirvana
©STFC/Keith G JefferyELPUB 2007 Keynote PROJECT ORGUNIT Skills CV General Facility Particular Equipment Contact Results Publication Results Patent Results Product Service Funding Programme Event Classification Prize/Award PERSON CERIF: EU Recommendation to Member States
©STFC/Keith G JefferyELPUB 2007 Keynote PROJECT ORGUNIT PERSON Result_Publication Can Express: (where DT-date/time) Person A (DT1 - DT2) (is author of) Publication X Orgunit O (DT1 - DT2) (is owner of IPR in) Publication X Person A (DT1 - DT2) (is employee of ) Orgunit O Person A (DT1 - DT2) (is project leader of) Project P Person A (DT1-DT2) (is member of) Orgunit M Person A (DT1-DT2) (is member of) Orgunit N Orgunit M (DT1-DT2) (is part of) Orgunit O Orgunit N (DT1-DT2) (is part of) Orgunit O Secondary Base Entities: example: RESULT_PUBLICATION
©STFC/Keith G JefferyELPUB 2007 Keynote Result_Publication Instance Diagram Person A Publication X OrgUnit O OrgUnit M OrgUnit N Project P member employee Part of owns IPR author Project leader
©STFC/Keith G JefferyELPUB 2007 Keynote Result_Publication Instance Diagram Person A Publication X OrgUnit O OrgUnit M OrgUnit N Project P member employee Part of owns IPR author Project leader repository HR System webpages Project Management Finance CERIF encourages interfacing to external systems
©STFC/Keith G JefferyELPUB 2007 Keynote Technical Infrastructure e-infrastructure CRIS Repositories Metadata & Curation Integration
©STFC/Keith G JefferyELPUB 2007 Keynote Repositories Document / article repositories - simple metadata (discovery, description) - ePrints, DSpace, Fedora, ePubs…. e-Research repositories - more complex metadata (discovery, description, usage control, software parameters…) - ‘homebrew’ systems – portals to research datasets and software
©STFC/Keith G JefferyELPUB 2007 Keynote Technical Infrastructure E-infrastructure CRIS Repositories Metadata & Curation Integration
©STFC/Keith G JefferyELPUB 2007 Keynote Classification of Metadata data (document) SCHEMANAVIGATIONALASSOCIATIVE how to get it constrain it view to users
©STFC/Keith G JefferyELPUB 2007 Keynote Classification of Metadata data (document) SCHEMANAVIGATIONALASSOCIATIVE how to get it constrain it view to users Metadata must have formal syntax and semantics to be machine- understandable as well as machine- readable
©STFC/Keith G JefferyELPUB 2007 Keynote Dublin Core Simple DC elements, some syntax, no semantics Qualified DC better syntax, namespaces More Recent proposals (2007) abstract data model, RDF
©STFC/Keith G JefferyELPUB 2007 Keynote DC Problems In parallel (1999-present) criticism of DC: 1.Syntax and semantics not sufficiently formal 2.,, are ROLES of person or organisational unit not base entities 3. : extremely general 4. : is a variant of a role-based relationship object<>object 5. recently separated into geographic and temporal but needs formalisation Formalised version of DC proposed, considered, now in CERIF Note: recent (2007) work on DC and SWAP (JISC) going in this direction
©STFC/Keith G JefferyELPUB 2007 Keynote Publications UniqueId Subject Keywords Description Resource Type Coverage Temporal Coverage Spatial Title PersonOrgUnit Security Privacy AccessLevel Charge Annotation Classification Domain of CERIF Restrictive Quality Assessment Project ResourceIdentifier Descriptive Navigational
©STFC/Keith G JefferyELPUB 2007 Keynote But the problem with metadata is It takes too much effort for the researcher to put it in (many web-form-screens) So have to input incrementally, no repetition, using the workflow.. And not re-keying data stored already elsewhere in other (linked-up) systems
©STFC/Keith G JefferyELPUB 2007 Keynote Progressive Recording Grey Document Grey doc Publication metadata Person Project OrgUnit new
©STFC/Keith G JefferyELPUB 2007 Keynote Progressive Recording White document Grey doc Publication metadata Person Project OrgUnit White doc Publication metadata new
©STFC/Keith G JefferyELPUB 2007 Keynote Curation Problem fast changing media – need media conversion digital fading – need for refresh metadata to understand later Answer OAIS : but provides only an architecture: no interoperation metadata
©STFC/Keith G JefferyELPUB 2007 Keynote Technical Infrastructure e-infrastructure CRIS Repositories Metadata & Curation Integration
©STFC/Keith G JefferyELPUB 2007 Keynote CRIS + Repositories at 1 institution CRIS Research Context [projects, persons, organisational units funding, products, patents, publications facilities, equipment, events] OA Repository (hypermedia) Documents e-Research repository Datasets and Software OAI- PMH Various protocols End-User CERIF
©STFC/Keith G JefferyELPUB 2007 Keynote ….and multiple institutions CRIS OA repository e-Research repository CRIS OA repository e-Research repository CRIS OA repository e-Research repository End-User Institution AInstitution BInstitution C
©STFC/Keith G JefferyELPUB 2007 Keynote CERIF-CRIS at the Centre Portal with knowledge-assisted user interface Digital Curation Facility SCIENTIFIC DATASETS Data Information Knowledge PUBLICATIONS Data Information Knowledge metadata publish validate GRIDs Ambient, Pervasive Access
©STFC/Keith G JefferyELPUB 2007 Keynote CERIF-CRIS : Enhancing and Enabling CRIS Finance Human Resources Project Management Publications Patents Products
©STFC/Keith G JefferyELPUB 2007 Keynote Structure 1. Introduction 2. Maximising Benefits from Research Output 3. Technical Infrastructure 4. Policy Framework 5. The Way Forward 6. Conclusion
©STFC/Keith G JefferyELPUB 2007 Keynote Policy OA Benefits Ethics: public access to publicly funded research Research Impact: greater access and use Costs and economic benefit: reduced costs and clear benefits to economy of open access Metrics: easier to get real metrics of usage Added value: link OA repositories to CRIS etc Just reward: overcomes publishers profiting from scholarly work provided free
©STFC/Keith G JefferyELPUB 2007 Keynote Policy Barriers to OA Loss of publisher income: publishers fear catastrophic cancellations of subscriptions Copyright: transfer author publisher so cannot re-use (in fact mostly can) Access Difficulties: DC metadata insufficient Completeness: 8-15% fill: need mandates and better workflowed input/update systems
©STFC/Keith G JefferyELPUB 2007 Keynote Policy Many OA declarations (Budapest onwards) Increasing use of green institutional OA repositories – publisher permissions (embargoes) Publishers offering OA – but author/institution pays (gold) Note: for highly productive institutions gold costs more than subscription models
©STFC/Keith G JefferyELPUB 2007 Keynote Mandates Progressively more mandates – institutional and funding organisations The preferred, optimal and recommended procedure is : immediately upon acceptance for publication the metadata and full article are deposited in an institutional repository. If the publisher does not demand an embargo period both are set to open access; if an embargo period is demanded then only the metadata is made visible until the end of the embargo period. Of course, associated with the metadata record there can be (and ePrints provides) a ‘request button’ so that the material can be sent automatically to any researcher who requests it under the usual ‘fair use’ conditions.
©STFC/Keith G JefferyELPUB 2007 Keynote Integration Need all funding organisations to mandate OA in institutional repositories Resistance from publishers (including learned societies as publishers) Engage with them to find new business models
©STFC/Keith G JefferyELPUB 2007 Keynote Structure 1. Introduction 2. Maximising Benefits from Research Output 3. Technical Infrastructure 4. Policy Framework 5. The Way Forward 6. Conclusion
©STFC/Keith G JefferyELPUB 2007 Keynote Way Forward We cannot allow progress to be delayed by particular commercial interests Their commercial interests are not above the public need for improved wealth creation and quality of life
©STFC/Keith G JefferyELPUB 2007 Keynote Speculation Author deposits in green OA IR Push technology informs learned society Referees access and record reviews Learned society places ‘kitemark ’ Or anyone can referee and record review? Note: JISC OJIMS (Overlay Journal Infrastructure for Meterological Sciences) exploring this space from March 2007
©STFC/Keith G JefferyELPUB 2007 Keynote Structure 1. Introduction 2. Maximising Benefits from Research Output 3. Technical Infrastructure 4. Policy Framework 5. The Way Forward 6. Conclusion
©STFC/Keith G JefferyELPUB 2007 Keynote Take-home message Despite protests and obstacles to improved access to research material over the centuries from religious, commercial, professional or labour groups, none delayed for long progress to meet the requirement as defined by the research community. Electronic publishing must take its place in the modern world of integrated e-infrastructure, research output, CRIS. All within the new e-research environment
©STFC/Keith G JefferyELPUB 2007 Keynote Prof. Keith G Jeffery CEng, CITP, FBCS, FGS, HFICS Director, IT & International Strategy Science and Technology Facilities Council Rutherford Appleton Laboratory