Presentation is loading. Please wait.

Presentation is loading. Please wait.

Www.cineca.it ~ Integrate external services in DSpace submission process How to make self-deposit easy and improve metadata quality and presence of full-text.

Similar presentations


Presentation on theme: "Www.cineca.it ~ Integrate external services in DSpace submission process How to make self-deposit easy and improve metadata quality and presence of full-text."— Presentation transcript:

1 ~ Integrate external services in DSpace submission process How to make self-deposit easy and improve metadata quality and presence of full-text Andrea Bollini – Susanna Mornati

2 Topics Some context: CINECA a brief overview DSpace as part of a CRIS solution | Integrate external services in DSpace submission process | OR2013| July 2013 Make the repository an active actor: Discovering missing content Improve Fulltext presence Integration of external services: Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc. Publishers policy: Sherpa/Romeo

3 Owned companies: Kion, SCS. Employees: 400 (+150 Kion) Total turnover: 70M The Company Interuniversity Consortium No-Profit Founded in 1969 Headquarter in Bologna 57 Members 54 Universities 2 Research institutes MIUR as last week! | Integrate external services in DSpace submission process | OR2013| July 2013

4 The merging process of the three Italian Consortia started in September 2012 It was concluded in July 1st 2013 (last week!) The Merge Members More than 700 employees (+ 150 Kion) The only Italian Interuniversity Consortium | Integrate external services in DSpace submission process | OR2013| July 2013

5 Higher Education Solutions & Services for the University Administration Services for the Ministry of Education, University and Research (MIUR) Scientific Research High Performance Computing – FERMI: 2° in EU / 7° WW) Scientific Visualization & Interactive Virtual Environments Technological Innovation Data Center Information and Knowledge Management Services Health Care Systems What CINECA does | Integrate external services in DSpace submission process | OR2013| July 2013

6 Cineca Board of Directors Product Managers Board Product Managers Board U-GOV & SURplus Restricted Board Customer Service Board Customer Service Board Technical & Delivery Board Apps Road Map Apps Road Map Tech Road Map Tech Road Map University Customers Focus Groups University Customers Focus Groups University Customers Cineca Technical Board University Customers Cineca Technical Board Requirements How we work with Universities | Integrate external services in DSpace submission process | OR2013| July 2013

7 Solutions for HE = ERP = Best of Breed | Integrate external services in DSpace submission process | OR2013| July 2013

8 SURplus: supporting the World of Research Collect institutional research output for evaluation and assessment purposes Measure research results for benchmarking Disseminate data to enhance impact and visibility Preserve ICT investments and maximize ROI | Integrate external services in DSpace submission process | OR2013| July 2013

9 The adoption of open- source solutions allows the SURplus team to customize and enhance the source code depending on the Institutions needs. The OS community provides innovative, high-quality and safe software and it is challenging to work with & for them Why Open Source? | Integrate external services in DSpace submission process | OR2013| July 2013

10 SURplus: CINECA CRIS System An interoperable infrastructure made of different components Ingestion of data from any legacy systems adopted by an institution Maintenance of specific functional requirements, data model and preferred technologies at the level of applications Data warehouse and Business Intelligence tools to facilitate aggregations of data and the application of measurement parameters and algorithms | Integrate external services in DSpace submission process | OR2013| July 2013

11 SURplus: Dimension Beginning of activities: institutions 22 institutional repositories Total modules: 77 | Integrate external services in DSpace submission process | OR2013| July 2013

12 Topics Integration of external services: Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc. Publishers policy: Sherpa/Romeo | Integrate external services in DSpace submission process | OR2013| July 2013 Make the repository an active actor: Discovering missing content Improve Fulltext presence Some context: CINECA a brief overview DSpace as part of a CRIS solution

13 CINECA is a registered service provider at DuraSpace Long-term collaboration with DSpace community, since 2003 Upgrades are periodically released to the open source community DSpace: SURplus Open Archive Module Manages collection and dissemination of research results Simplifies data collections processes Service Integration The OA Module, developed on DSpace: | Integrate external services in DSpace submission process | OR2013| July 2013

14 dissemination of entities descriptions in the research environment which go beyond publications DSpace-CRIS : SURplus Expertise & Skills DSpace-CRIS: designed together with the Hong Kong University & released as open-source | Integrate external services in DSpace submission process | OR2013| July 2013

15 IR as part of a CRIS system: what change? | Integrate external services in DSpace submission process | OR2013| July 2013 Benefits: Strong deposit mandate More funding Issues to mitigate: IR become a critical application Author have a requirements perception Wasting time Late submission Professional support HA infrastructure Dedicated team advocacy Make the submission process easy The information already exists in other database!

16 Topics Integration of external services: Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc. Publishers policy: Sherpa/Romeo | Integrate external services in DSpace submission process | OR2013| July 2013 Make the repository an active actor: Discovering missing content Improve Fulltext presence Some context: CINECA a brief overview DSpace as part of a CRIS solution

17 New first submission step | Integrate external services in DSpace submission process | OR2013| July 2013 Available providers: each provider is a spring service Free search form Main metadata common to all publication types (article, book, etc.) Title of the contribution Year Authors/Editors

18 New first submission step | Integrate external services in DSpace submission process | OR2013| July 2013 Lookup by unique identifier Each provider declares which identifiers is able to manage

19 New first submission step | Integrate external services in DSpace submission process | OR2013| July 2013 For each result providers are shown that match the record. Grouping is done via DOI

20 Modal box publication details | Integrate external services in DSpace submission process | OR2013| July 2013 Records from different providers are merged to get richer metadata The system guesses a collection for the submission but the user can change it if required

21 Manual submission | Integrate external services in DSpace submission process | OR2013| July 2013 When lookup fails the user can always proceed manually

22 Batch import from external source | Integrate external services in DSpace submission process | OR2013| July 2013 Import data (identifiers or structured text) can be inputed manually or uploaded as a file Format/provider must be specified by the user

23 Batch import from external source | Integrate external services in DSpace submission process | OR2013| July 2013 Request are processed: Inline for specific providers and/or within configured data limits Submitter can immediately complete the pre-filled submissions In a background process Submitter will receive a summary with import result Pre-filled submissions are available as in-progress submission in the MyDSpace The legacy batch import feature for JSPUI has been already shared as pull request on GitHub, see DS-1252DS-1252

24 Enhanced Describe step: showing metadata source | Integrate external services in DSpace submission process | OR2013| July 2013

25 Translation logic original normalized Technical details PubMed Lookup Provider | Integrate external services in DSpace submission process | OR2013| July 2013 PubMed record JAVA Bean Mapping file DSpace Item Normalized record Enhancer plugins Split, aggregate fields Derive data ISSN Journal title … Split, aggregate fields Derive data ISSN Journal title … arXiv Lookup Provider arXiv record JAVA Bean Mapping file Scopus Lookup Provider Scopus record JAVA Bean Mapping file … Translation logic Normalized Repository Mapping file implements SubmissionLookupProvider public class PubmedLookupProvider extends ConfigurableLookupProvider public abstract class ConfigurableLookupProvider public class PubmedItem { private String pubmedID; private String doi; private String issn; private String eissn; private String journalTitle; private String title; private String pubblicationModel; private String year; private String volume; private String issue; private String language; private List type; private List primaryKeywords; private List secondaryKeywords; …

26 Topics Integration of external services: Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc. Publishers policy: Sherpa/Romeo | Integrate external services in DSpace submission process | OR2013| July 2013 Make the repository an active actor: Discovering missing content Improve Fulltext presence Some context: CINECA a brief overview DSpace as part of a CRIS solution

27 Enhanced upload step | Integrate external services in DSpace submission process | OR2013| July 2013 Using the ISSN or EISSN provided in the describe step the upload form is improved showing on the right side the publisher policy from the Sherpa/Romeo database

28 Enhanced upload step | Integrate external services in DSpace submission process | OR2013| July 2013 Access policy for the bitstream: Open access, embargo, intranet, etc. Deposit of fulltext to the national database for individual CVs

29 Topics Integration of external services: Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc. Publishers policy: Sherpa/Romeo | Integrate external services in DSpace submission process | OR2013| July 2013 Make the repository an active actor: Discovering missing content Improve Fulltext presence Some context: CINECA a brief overview DSpace as part of a CRIS solution

30 What is the problem? | Integrate external services in DSpace submission process | OR2013| July 2013 (very) late submissions produce some issues for the repository both at technical and organization level: /The system is subjected to periods of intense input activities. DSpace, but in general IR software, scales well for read operations less well for write operations /IR staff involved in workflow get lot of task to perform in small period Get researcher aware Remind researcher about IR presence Intercept early new content

31 How we plan to mitigate the problem? | Integrate external services in DSpace submission process | OR2013| July 2013 Citation databases provide APIs to perform search (we already use them for the lookup) and in some cases they provide additional APIs or search filters/indexes to make more raffinated search and allow scanning of the database. The interesting filters/indexes are: /Time based (much better if related to insertion in the citation database) /Author ID (better if related to a «standard/common» identifier as ORCID) /Affiliation /Subject category

32 Implementation idea | Integrate external services in DSpace submission process | OR2013| July 2013 Allow the researcher to store personal preferences about scanning: /Enabled providers (e.g disable arXiv if you are not a physicist) /Frequencies /Subject categories filters AuthorIDs will be stored/retrieved from the Researcher profile. Subject categories could be proposed from previous items or researcher profile.

33 DSpace-CRIS: Researcher profile | Integrate external services in DSpace submission process | OR2013| July 2013

34 Who are the potential targets? | Integrate external services in DSpace submission process | OR2013| July 2013 ORCID Scopus Web of Science arXiv PubMed Central DBLP REPEC The Repository itself!

35 The repository as source of missing content? | Integrate external services in DSpace submission process | OR2013| July 2013 The submitter has to match authors of publication with the University staff to higthlight internal authors Sometimes matches are missing Othertimes matches are wrong (homonymous) External authors could become «internal» at some point in the future

36 The repository as source of missing content? | Integrate external services in DSpace submission process | OR2013| July 2013 Send to internal «co-authors» when a submission is done prevent wrong attribution (and reduce duplication) Allow researcher to unclaim publications from her profile last chance to fix wrong attribution Allow researcher to claim publications fix missing attribution and/or engagement of new researcher The last two features are included in the DSpace-CRIS addon

37 Current implementation: claim/unclaim publications in the repository | Integrate external services in DSpace submission process | OR2013| July 2013 This is the current status of the publication U Unlinked You can claim it A Active, simple claim S Make it a selected publication H Claim it but hide from you public profile

38 Current implementation: claim/unclaim publications in the repository | Integrate external services in DSpace submission process | OR2013| July 2013 You can unclaim a publication U Unlink

39 Current implementation: claim/unclaim publications in the repository | Integrate external services in DSpace submission process | OR2013| July 2013

40 Topics Integration of external services: Bibliographic database: Scopus, PubMed, CrossRef, ArXiv, etc. Publishers policy: Sherpa/Romeo | Integrate external services in DSpace submission process | OR2013| July 2013 Make the repository an active actor: Discovering missing content Improve Fulltext presence Some context: CINECA a brief overview DSpace as part of a CRIS solution

41 Improve fulltext presence | Integrate external services in DSpace submission process | OR2013| July 2013 Use the Sherpa/Romeo policy database to analyze repository content Use external database API to find an actual fulltext (arXiv, pubmed,...why not the publisher version via library subscription?) Send to researcher to validate found PDFs or ask for an «author» versions Use statistics to encourage upload

42 Sherpa/Romeo Statistics (Example) | Integrate external services in DSpace submission process | OR2013| July % ISSN 36% Not in Sherpa items 7,3% have a fulltext… 5,3% open access 32% green items

43 | Innovative Open Source Technologies for a CRIS: SURplus | euroCRIS | May 2013 SURplus: prevision institutional repositories (DSpace) 10 research portals (DSpace-CRIS)

44 ~ Thank you! Andrea Bollini SURplus - DSpace-CRIS -


Download ppt "Www.cineca.it ~ Integrate external services in DSpace submission process How to make self-deposit easy and improve metadata quality and presence of full-text."

Similar presentations


Ads by Google