The OpenAIRE Infrastructure On Measuring Research Impact - The EGI use-case - Paolo Manghi Natalia Manola


Outline
- The What and How of OpenAIRE
- Supporting research communities
- Contexts, categories and concepts
- User input
- Results and analytics
- Looking Ahead
Developing the Open Science Commons - Sept 25, Amsterdam

OpenAIRE in a nutshell
European data infrastructure for scholarly communication
- Facilitates discovery of research outcomes across disciplines
- Promotes and implements Open Access
- Interlinks and contextualizes research outcomes
- Integrates publication, data, and software repositories and CRIS systems
- Monitors research outputs and measures research impact
- Open Access policy evaluation
- Funding schemes: return on investment through impact
- Research initiatives: research impact
Providing both human and technical infrastructure to make this possible!

[Diagram: OpenAIRE infrastructure overview]
- Portal services: search & browse; deposit publications & data; curate & collaborate; visualize and manage enhanced publications; linked content; statistics; research impact (citations, usage statistics); APIs; support (NOADs)
- Data sources: publication repositories (institutional & thematic), Open Access journals, data repositories, data journals, institutional CRIS systems, CERN/OpenAIRE "catch-all" repository
- Content collected: metadata and PDFs, metadata on data, usage data
- Scale: 8,700,000 OA publications; 460 validated repositories
- Funding information: EC funding, national funding
- Guidelines: for data interoperability; for use of services
- Researcher actions: deposits in institutional or thematic repository; publishes in OA journal; publishes data; "Fully compliant?"
- Processing: mine for project; mine for other info; de-duplicate; link; enrich
- Entities: Organizations, Projects, Authors, Datasets, Publications, Data Providers
- Services for project coordinators and funders; infrastructure coordination

Added Value: Integrated Scientific Information System
- 8.7 million publications
- 7 million authors
- 460+ data providers
- 33K organizations
- 2 funders
- 90K publications linked to projects
- 700 datasets linked to publications
- 2,731 publications linked to EGI
Entities: Organizations, Projects, Authors, Datasets, Publications, Data Providers, Research Communities

BEHIND THE SCENES

Internal data flow
[Diagram. Stages shown: data source import (harvesting), end-user claims, Native Information Space, de-duplication, Public Information Space, data inference (off-line), human data curation, Enriched Information Space, OpenAIRE Portal (discovery and impact measurement).]

RESEARCH ANALYTICS

Monitoring OA policy: Research Output Measures
FP7: 66K publications, 7.5K projects
[Charts: FP7 timeline (total); FP7 breakdowns]

Classification
Text mining: supervised techniques

Beyond the Obvious
Text mining: unsupervised techniques (topic modeling)
Example 1: FP7 programmes connected through scientific publications
- Research trends
- Structural effects
- Interactive graphs
- Providing overview

Example 2: How FP7 programme areas are related

EGI & OPENAIRE
- 1-year pilot ended in May 2014
- Official service release: Oct

Supporting communities
Enriched OpenAIRE data model:
- Context (e.g. "EGI")
- Category (e.g. "Virtual Organizations")
- Concept (e.g. "alice")
Text mining algorithms tailored to community needs, integrated into the OpenAIRE text mining framework
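The context, category and concept hierarchy can be pictured as three nested record types. The following is only a minimal sketch: the class names, the `id` values and the path format are assumptions made for illustration, not the actual OpenAIRE data model.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Concept:
    id: str     # hypothetical identifier, e.g. "alice"
    label: str


@dataclass
class Category:
    id: str     # hypothetical identifier, e.g. "vo"
    label: str
    concepts: List[Concept] = field(default_factory=list)


@dataclass
class Context:
    id: str     # hypothetical identifier, e.g. "egi"
    label: str
    categories: List[Category] = field(default_factory=list)

    def concept_paths(self) -> List[str]:
        """Return one 'context/category/concept' path per concept."""
        return [
            f"{self.id}/{category.id}/{concept.id}"
            for category in self.categories
            for concept in category.concepts
        ]


# The example from the slide: context "EGI", category "Virtual Organizations",
# concept "alice".
egi = Context(
    id="egi",
    label="EGI",
    categories=[
        Category(
            id="vo",
            label="Virtual Organizations",
            concepts=[Concept(id="alice", label="alice")],
        )
    ],
)
```

A publication could then be tagged with one of the paths returned by `egi.concept_paths()` to record which community concept it belongs to.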

What OpenAIRE does (behind the scenes)
- Extract the full text from publications; if the text is structured, use the "funding" and "acknowledgements" fields
- Scan the text for matches against any of the EGI organization names provided
- For each match, search the surrounding context for general terms and suggested acknowledgements (using word pairs) to assign a confidence value to the match and eliminate false matches
- For EC projects, search not only for the project acronym (e.g. EGI-InSPIRE) but also for the grant ID (261323)
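The matching steps described on this slide can be sketched roughly as follows. This is not the OpenAIRE implementation: the word-pair list, the window size, the scoring formula and the threshold are all assumptions chosen to make the idea concrete. Only the EGI-InSPIRE acronym and grant ID come from the slide.

```python
import re

# Names and grant IDs to look for. The EGI-InSPIRE values come from the slide;
# the dictionary layout itself is hypothetical.
ORGANISATIONS = {
    "EGI-InSPIRE": {"names": ["EGI-InSPIRE"], "grant_id": "261323"},
    "ALICE": {"names": ["ALICE"], "grant_id": None},
}

# Assumed word pairs taken from typical acknowledgement phrasing.
CONTEXT_PAIRS = {
    ("resources", "provided"),
    ("grid", "infrastructure"),
    ("gratefully", "acknowledges"),
}
WINDOW = 10           # words inspected on each side of a match (assumed)
MIN_CONFIDENCE = 0.5  # matches scoring below this are dropped (assumed)


def tokenize(text):
    """Lower-case the text and split it into word-like tokens."""
    return re.findall(r"[a-z0-9-]+", text.lower())


def find_matches(text):
    """Return {organisation: confidence} for names or grant IDs found in text."""
    tokens = tokenize(text)
    results = {}
    for org, info in ORGANISATIONS.items():
        keys = {name.lower() for name in info["names"]}
        if info["grant_id"]:
            keys.add(info["grant_id"])
        for i, token in enumerate(tokens):
            if token not in keys:
                continue
            # Score the match by the acknowledgement-style word pairs nearby.
            window = tokens[max(0, i - WINDOW): i + WINDOW + 1]
            pairs = set(zip(window, window[1:]))
            confidence = min(1.0, len(CONTEXT_PAIRS & pairs) / 2)
            results[org] = max(results.get(org, 0.0), confidence)
    return {org: c for org, c in results.items() if c >= MIN_CONFIDENCE}
```

With these assumed settings, a sentence that mentions "Alice" without any acknowledgement phrasing nearby scores 0 and is discarded, which is the kind of false match the context check is meant to remove.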

How to identify EGI
Identify publications associated with EGI in terms of:
- Association with EGI projects
  - Publication "enabledBy EGI:XYZ"
  - Publication "supportedBy EGI:XYZ"
- Association with a certain Virtual Organisation (VO) or National GRID Infrastructure (NGI)
  - Publication "used EGI"
  - Publication "used NGI:XYZ"
  - Publication "producedBy VO:XYZ"
- Association with a certain EGI scientific discipline
  - Publication "related to EGI Scientific Discipline:XYZ"
Sources: text mining on PDFs from repositories, publisher metadata

What the EGI community should do
STEP 1: Use proper acknowledgement in the publication

Organisation Name: WeNMR
Type: EC Project
Grant ID: [...]
Suggested Acknowledgement: "The WeNMR project (European FP7 e-Infrastructure grant, contract no. [...], supported by the European Grid Initiative (EGI) through the national GRID Initiatives of Belgium, France, Italy, Germany, the Netherlands (via the Dutch BiG Grid project), Portugal, Spain, UK, South Africa, Taiwan and the Latin America GRID infrastructure via the Gisela project is acknowledged for the use of web portals, computing and storage facilities." The following article describing the WeNMR portals should also be cited: Wassenaar et al. (2012). WeNMR: Structural Biology on the Grid. J. Grid. Comp., 10: [...]

Organisation Name: EGI-InSPIRE
Type: EC Project
Grant ID: [...]
Suggested Acknowledgement: The authors acknowledge the use of resources provided by the European Grid Infrastructure. For more information, please reference the EGI-InSPIRE paper ([...]

Organisation Name: ALICE
Type: VO
Grant ID: n/a
Suggested Acknowledgement: The ALICE collaboration gratefully acknowledges the resources and support provided by all Grid centres and the Worldwide LHC Computing Grid (WLCG) collaboration.

Organisation Name: LHCb
Type: VO
Grant ID: n/a
Suggested Acknowledgement: The Tier1 computing centres are supported by IN2P3 (France), KIT and BMBF (Germany), INFN (Italy), NWO and SURF (The Netherlands), PIC (Spain), GridPP (United Kingdom). We are thankful for the computing resources put at our disposal by Yandex LLC (Russia), as well as to the communities behind the multiple open source software packages that we depend on.

Organisation Name: NGI:PT
Type: NGI
Grant ID: n/a
Suggested Acknowledgement: This work makes use of results produced with the support of the Portuguese National Grid Initiative. More information in [...]

What the EGI community should do
STEP 2
- Option 1: follow the OpenAIRE guides. Publish in an OA journal or deposit in an OA repository, preferably one compatible with the OpenAIRE 2.0+ guidelines (i.e., with a link to funding)
- Option 2: use the OpenAIRE portal "claiming" service to associate any publication (within OpenAIRE or not) with EGI results and with additional EGI information: VO, classification, relationship

User Input


What does it look like

Aggregated statistics

Lessons learned & Best practices
- Mandates on how to write acknowledgements are crucial but often missing.
- Collect beforehand as much information as possible that may help with the mining. Even information that seems unhelpful may prove useful in the end.
- Clean and normalize your input data (character encoding, stop-word removal, character case, special characters, etc.).
- Design your data mining methods to be very tolerant. In our case, suggested acknowledgements never appeared in the input texts exactly as given.
- Manually curate the results to tune your data mining methods. Yes, it is very labor-intensive, but without it you'll be blind to your mistakes.
- Design and implement your data processing methods to work in a streamed fashion and to be performant. Streamed design solves the "data bigger than memory" problem; performant design solves the "having to wait one week for results" problem.
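Three of these lessons (normalize the input, match tolerantly, process in a stream) can be illustrated together. The sketch below uses Python's standard `unicodedata` and `difflib` modules as a stand-in for whatever the OpenAIRE mining framework actually uses. The stop-word list and the 0.8 threshold are assumptions.

```python
import difflib
import re
import unicodedata

# A deliberately tiny, assumed stop-word list.
STOP_WORDS = {"the", "of", "by", "and", "a", "an", "for", "in", "to"}


def normalize(text):
    """Fold accents and case, drop special characters and stop words."""
    decomposed = unicodedata.normalize("NFKD", text)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    words = re.findall(r"[a-z0-9]+", stripped.lower())
    return " ".join(word for word in words if word not in STOP_WORDS)


def similarity(suggested, candidate):
    """Similarity in [0, 1] between two texts after normalization."""
    matcher = difflib.SequenceMatcher(None, normalize(suggested), normalize(candidate))
    return matcher.ratio()


def stream_matches(lines, suggested, threshold=0.8):
    """Yield (line, score) lazily, so the whole corpus never sits in memory."""
    for line in lines:
        score = similarity(suggested, line)
        if score >= threshold:
            yield line.strip(), score
```

Because `stream_matches` is a generator, it can be fed an open file object directly (`stream_matches(open(path), suggested)`, where `path` is a hypothetical file of candidate sentences), and lines are compared one at a time rather than loaded all at once.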

Roadmap
Release
- Results of inference visible from the portal
- Claim user interfaces available from the portal
Plan
- Production release: ready by 1st of October 2013
- Add more communities (e.g., FET)

Thank you! Looking forward to your questions and feedback
facebook.com/groups/openaire
linkedin.com/groups/OpenAIRE