© Keith G Jeffery / Anne Asserson
Supporting the Research Process with a CRIS
  Keith G Jeffery, Director IT, CLRC; President, euroCRIS
  Anne Asserson, Senior Executive Officer, University of Bergen
Agenda
  The Issue
  The Proposition
  The Research Process
  Dealing With the Issue
  The Metadata
  Conclusion
The Issue
  Increasing numbers of researchers
  Increasing output per researcher
    – Publications
    – Patents
    – Products
  Especially research datasets from automated equipment
  Effort to catalog (input metadata)
    – Too great (for the user)
    – Does not scale (with increasing numbers)
The Proposition
  The research process provides the context
    – Link the CERIF-CRIS information to the research output information
        Provides context
        Provides some of the required metadata
    – Collect metadata fragments
        Only once
        As early as possible (as they are generated)
  Result
    – Research output
        Publications, patents, products
    – Linked together in context by the CERIF-CRIS
        Person, Project, OrgUnit, Funding, Event, Facility, Equipment
    – With provenance and curation managed automatically
The Notion
  The research process is a workflow with e-forms
    – At each step (meta) information is required and stored incrementally (re-use, minimal effort)
  The researcher sees benefit from the process: examples
    – Automated CV
    – Automated publication list
    – Tracking competing and cooperating teams
    – Research visible to intermediaries for exploitation
    – Boilerplate information for research proposals
The R&D Process: Recording
  [Diagram: Workprogramme, Proposal, Project, Results, Exploitation, Wealth Creation; each stage recorded in the CRIS database; information from external systems and CRIS]
The R&D Process: Recording Workprogramme
  Recorded in the CRIS database: ProgrammeName, Funding, OrgUnit, Person, Workprogramme document
The R&D Process: Recording Proposal
  Recorded in the CRIS database: Title, Abstract, Person(s), OrgUnit(s), Proposal document
The R&D Process: Recording Project
  Recorded in the CRIS database: Title, Abstract, Person(s), OrgUnit(s), Funding, Project plan
The R&D Process: Recording Results (Product)
  Recorded in the CRIS database: Person(s), OrgUnit(s), Project(s), Product(s), Product description
The R&D Process: Recording Results (Patent)
  Recorded in the CRIS database: Person(s), OrgUnit(s), Project(s), Patent(s), Patent file
The R&D Process: Recording Results (Publication)
  Recorded in the CRIS database: Person(s), OrgUnit(s), Project(s), Bibliographic information, Article
The R&D Process: Recording Exploitation
  Recorded in the CRIS database: Person(s), OrgUnit(s), Business plan, Finance data, Marketing data, Production data, Sales data
The R&D Process: Recording Wealth Creation
  Recorded in the CRIS database: Person(s), OrgUnit(s), Annual reports/accounts, Employment records, Dividends records
The R&D Process
  [Diagram: Workprogramme, Proposal, Project, Results, Exploitation, Wealth Creation; label "Nirvana"]
  Note: some CRIS developers limit recording of outputs from the process to the areas indicated
CRIS Features Required
  Entity instance attribute data collected once and stored
  Entity instances related flexibly (n:m)
  Entity instances related by role and temporal limits (semantics)
  Input incremental, flexible, validated (minimum effort)
  System extensible (add new attributes and entities while preserving the previous data structure for interoperation)
  System interoperable with other CRIS (to create a world view)
  System linkable to other systems used in the research process (e.g. finance, HR, project management, to utilise them for CRIS purposes)
CERIF-CRIS
  It is no accident that CERIF (Common European Research Information Format) provides a data model with exactly these desirable properties.
  Linking relations are the key feature
    – temporal and role information
  Critical to answer questions like:
    – "during what time interval was person A project leader of project P?"
    – "to which research group(s) did person A belong when she produced publication X?"
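The two questions above can be sketched against a simple link table. This is a minimal illustration, not the CERIF schema: the `Link` class, the identifiers, the role names and all dates are assumptions invented for the example.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class Link:
    """One linking relation: source --role--> target, valid from start to end."""
    source: str                  # e.g. a person identifier
    target: str                  # e.g. a project, org unit or publication identifier
    role: str                    # semantics of the relationship
    start: date
    end: Optional[date] = None   # None means the link is still valid

    def covers(self, day: date) -> bool:
        return self.start <= day and (self.end is None or day <= self.end)


# Hypothetical sample data, invented for illustration only.
LINKS = [
    Link("person:A", "project:P", "member", date(2001, 1, 1), date(2002, 6, 30)),
    Link("person:A", "project:P", "leader", date(2002, 7, 1), date(2004, 12, 31)),
    Link("person:A", "orgunit:G1", "member", date(2000, 1, 1), date(2003, 3, 31)),
    Link("person:A", "orgunit:G2", "member", date(2003, 4, 1)),
    Link("person:A", "publication:X", "author", date(2003, 9, 15), date(2003, 9, 15)),
]


def role_intervals(links, source, target, role):
    """During what time interval(s) did `source` hold `role` on `target`?"""
    return [(link.start, link.end) for link in links
            if (link.source, link.target, link.role) == (source, target, role)]


def groups_at_publication(links, person, publication):
    """To which org units did `person` belong when the publication was produced?"""
    produced = [link.start for link in links
                if link.source == person and link.target == publication
                and link.role == "author"]
    return sorted({link.target for link in links for day in produced
                   if link.source == person and link.role == "member"
                   and link.target.startswith("orgunit:") and link.covers(day)})


leader_intervals = role_intervals(LINKS, "person:A", "project:P", "leader")
groups = groups_at_publication(LINKS, "person:A", "publication:X")
print(leader_intervals)
print(groups)
```

Because role and validity dates sit on the link rather than on the person or project, the same person can hold several roles on one project over time without any change to the entity records.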
CERIF-CRIS: Further Features
  Inference:
    – in a multidimensional framework,
    – deduction or induction of relationships between entities, e.g. between a grey internal report and a white published paper, and with other research outputs such as datasets or software.
  Fact generation
    – automated generation of facts, e.g. from (1) Person A on Project P produces Paper X and (2) Project P uses Equipment E, generate: Person A uses Equipment E
    – the generated data may be recorded in the CERIF-CRIS or deduced / induced afresh each time it is required.
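A minimal sketch of the fact-generation example, assuming facts are held as (subject, relation, object) triples. The relation names and identifiers are hypothetical, and the single hard-coded rule stands in for whatever inference mechanism a real CERIF-CRIS would use.

```python
# Hypothetical asserted facts as (subject, relation, object) triples.
ASSERTED = {
    ("person:A", "works_on", "project:P"),
    ("person:A", "produces", "paper:X"),
    ("project:P", "uses", "equipment:E"),
}


def generate_facts(facts):
    """Derive 'person uses equipment' from 'person works_on project'
    and 'project uses equipment'. Returns only the newly generated facts."""
    derived = set()
    for person, relation, project in facts:
        if relation != "works_on":
            continue
        for subject, other_relation, equipment in facts:
            if subject == project and other_relation == "uses":
                derived.add((person, "uses", equipment))
    return derived - facts


# Option 1: deduce afresh each time the fact is required.
derived = generate_facts(ASSERTED)

# Option 2: record the generated facts alongside the asserted ones.
stored = ASSERTED | derived

print(derived)
```

Recording the derived facts makes later queries cheaper; deducing them afresh avoids stale facts when the underlying links change.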
CERIF-CRIS: Further Features
  Assertions
    – relationships between entity instances (e.g. documents) can also be expressed explicitly (i.e. asserted), e.g. references and / or citations can be recorded by directly inputting the information into the CERIF-CRIS.
  Metrics
    – role-based temporal relationships between entity instances (e.g. publications)
    – provides detailed research output metrics,
    – increasingly in demand from CRISs as research institutions seek to justify their funding and to improve their relative standing in league tables
    – while funding organisations seek to justify their decisions.
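As a rough sketch of how role-based temporal relationships could yield an output metric, the following counts publications per organisational unit and year, crediting the unit the author belonged to on the publication date. The tuple layouts, identifiers, roles and dates are all assumptions made for the example.

```python
from collections import Counter
from datetime import date

# Hypothetical membership links: (person, org unit, start, end or None if current).
MEMBERSHIPS = [
    ("person:A", "orgunit:G1", date(2000, 1, 1), date(2003, 3, 31)),
    ("person:A", "orgunit:G2", date(2003, 4, 1), None),
    ("person:B", "orgunit:G1", date(2001, 1, 1), None),
]

# Hypothetical publication links: (person, publication, role, publication date).
AUTHORSHIPS = [
    ("person:A", "pub:1", "author", date(2002, 5, 1)),
    ("person:A", "pub:2", "author", date(2003, 9, 15)),
    ("person:B", "pub:2", "author", date(2003, 9, 15)),
    ("person:B", "pub:3", "editor", date(2004, 2, 1)),
]


def publications_per_unit_and_year(memberships, authorships, role="author"):
    """Count publications per (org unit, year), using membership on the publication date."""
    credited = set()  # (unit, year, publication): co-authors in one unit count once
    for person, publication, pub_role, published in authorships:
        if pub_role != role:
            continue
        for member, unit, start, end in memberships:
            if member == person and start <= published and (end is None or published <= end):
                credited.add((unit, published.year, publication))
    return Counter((unit, year) for unit, year, _ in credited)


metrics = publications_per_unit_and_year(MEMBERSHIPS, AUTHORSHIPS)
print(sorted(metrics.items()))
```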
CERIF-CRIS Summary
  Through the flexible and dynamic linking relations between entities,
    – with their role and time-stamped attributes,
  a rich context for understanding the R&D output is provided, including versions, history and provenance.
  This context is particularly important for other users of CRISs such as
    – entrepreneurs engaged in technology transfer and wealth creation
    – the media explaining to the public the importance of the research being done.
CERIF-CRIS at the Centre
  Acting as metadata
  Relating CRIS information to itself
    – Flexible linking relations
  And to information in other systems
    – e.g. publications repository
    – e.g. e-research datasets and software
  And, via the GRIDs environment, to other research process systems
    – e.g. finance, HR, project management
CERIF-CRIS at the Centre
  [Diagram labels: portal with knowledge-assisted user interface; digital curation facility; scientific datasets (data, information, knowledge); publications (data, information, knowledge); metadata; publish; validate; GRIDs; ambient, pervasive access]
Dealing with the Issue: Progressive Recording
  Early research ideas or work in progress: grey document
    – described by appropriate metadata (title, abstract, ...) input at the time of deposit.
    – publication metadata linked to pre-existing research information (such as person, organisational unit, project) in a temporal and role-based context.
Progressive Recording: Grey Document
  [Diagram: grey doc (new) with its publication metadata, linked to Person, Project, OrgUnit]
Dealing with the Issue: Progressive Recording (continued)
  Grey document developed into a white publication
    – additional publication metadata is input at the time of submission.
    – linked through temporal and role-based relationships to the pre-existing grey publication
    – and to the pre-existing contextual information such as persons, organisational units etc.
Progressive Recording: White Document
  [Diagram: white doc (new) with its publication metadata, linked to the grey doc and its publication metadata, and to Person, Project, OrgUnit]
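Progressive recording can be sketched as two small steps: deposit a grey document with its links, then create the white publication by re-using those links and adding only the new metadata. The class, function names, role names and sample values below are hypothetical and are not taken from CERIF.

```python
from dataclasses import dataclass, field
from datetime import date


@dataclass
class Publication:
    pub_id: str
    kind: str                                   # "grey" or "white"
    metadata: dict
    links: list = field(default_factory=list)   # (role, target, valid_from)


def deposit_grey(pub_id, title, abstract, person, project, orgunit, when):
    """Record a grey document and link it to pre-existing research information."""
    doc = Publication(pub_id, "grey", {"title": title, "abstract": abstract})
    doc.links += [
        ("author", person, when),
        ("output_of", project, when),
        ("affiliation", orgunit, when),
    ]
    return doc


def promote_to_white(grey, pub_id, extra_metadata, when):
    """Create the white publication: re-use the grey metadata and context,
    add only what is new at submission time, and link back to the grey document."""
    white = Publication(pub_id, "white", {**grey.metadata, **extra_metadata})
    white.links = [(role, target, when) for role, target, _ in grey.links]
    white.links.append(("developed_from", grey.pub_id, when))
    return white


grey = deposit_grey("pub:grey-1", "Early ideas", "Work in progress.",
                    "person:A", "project:P", "orgunit:U", date(2004, 3, 1))
white = promote_to_white(grey, "pub:white-1",
                         {"journal": "Example Journal", "year": 2005},
                         date(2005, 1, 10))
print(white.metadata)
print(white.links)
```

The grey record is left untouched, so the history from grey to white remains available as provenance.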
Dealing with the Issue: Re-Use for Scalability
  Record (meta)data once: re-use many times
  Record only the metadata available and needed at each process step
    – Automated input assistance (quality)
    – Reduces input required
  Addresses scalability and high user effort threshold, improves quality
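A minimal sketch of "record once, re-use many times", using the Proposal and Project fields listed on the earlier recording slides. The function names, the validation rule and the sample values are assumptions for illustration.

```python
# Fields needed at each process step (taken from the Proposal and Project slides;
# the snake_case names are an assumption of this sketch).
STEP_FIELDS = {
    "proposal": ["title", "abstract", "persons", "orgunits", "proposal_document"],
    "project": ["title", "abstract", "persons", "orgunits", "funding", "project_plan"],
}


def fields_to_ask(step, record):
    """Fields the user still has to enter at this step; the rest are re-used."""
    return [name for name in STEP_FIELDS[step] if name not in record]


def record_step(step, record, new_values):
    """Validate that exactly the outstanding fields were supplied, then merge them."""
    expected = fields_to_ask(step, record)
    unexpected = sorted(set(new_values) - set(expected))
    missing = sorted(set(expected) - set(new_values))
    if unexpected or missing:
        raise ValueError(f"step {step!r}: unexpected={unexpected}, missing={missing}")
    return {**record, **new_values}


# Hypothetical sample values.
after_proposal = record_step("proposal", {}, {
    "title": "Example study",
    "abstract": "A short abstract.",
    "persons": ["person:A"],
    "orgunits": ["orgunit:U"],
    "proposal_document": "proposal.pdf",
})
asked_at_project = fields_to_ask("project", after_proposal)
after_project = record_step("project", after_proposal, {
    "funding": "funding:hypothetical-1",
    "project_plan": "plan.pdf",
})
print(asked_at_project)
```

At the project step only the two new fields are requested; title, abstract, persons and organisational units carry over from the proposal.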
Metadata: Where to Store It
  In the repository (publications or e-research datasets, software)
  In the CERIF-CRIS
Metadata in the Repository
  Advantages
    – Metadata with the object
        Available for retrieval, statistical processing, advanced computation, ...
        Available for harvesting (e.g. OAI-PMH)
  Disadvantages
    – Metadata not available in CERIF-CRIS for management information
    – Most repositories only store poor metadata
        Non-machine-understandable
        Insufficient for bibliographic reference
        No DOI to link to publisher database
Metadata in the CERIF-CRIS
  Advantages
    – Efficient processing of management information queries
  Disadvantages
    – Have to somehow redirect OAI-PMH harvesting to CERIF-CRIS instead of repository
    – Separates metadata from the full hypermedia article, research dataset or software
The Solution: Metadata in CERIF-CRIS and Repository
  Primary metadata source is in the CERIF-CRIS
    – Linked with research process workflow
    – Incremented as generated
    – Provenance and context
    – Validation (quality)
    – Generate bibliographic references
  Copy in the repository
    – For harvesting (articles)
    – With additional detailed metadata for research datasets or software
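One way to picture the split is a single primary record from which both a bibliographic reference and a repository copy are generated. The sketch below assumes a flat dictionary as the primary record and a copy keyed by Dublin Core element names (the baseline metadata format for OAI-PMH harvesting); the record layout, the citation style and all sample values are invented for illustration.

```python
# Hypothetical primary record as it might be held in the CRIS.
PRIMARY = {
    "title": "An Example Title",
    "authors": ["Author, A.", "Writer, B."],
    "year": 2005,
    "venue": "Example Proceedings",
    "identifier": "https://repository.example.org/item/1",
}


def bibliographic_reference(record):
    """Generate a reference string from the primary record (style is an assumption)."""
    authors = ", ".join(record["authors"])
    return f'{authors} ({record["year"]}). {record["title"]}. {record["venue"]}.'


def repository_copy(record):
    """Derive the copy held in the repository for harvesting,
    keyed by Dublin Core element names."""
    return {
        "dc:title": [record["title"]],
        "dc:creator": list(record["authors"]),
        "dc:date": [str(record["year"])],
        "dc:identifier": [record["identifier"]],
    }


reference = bibliographic_reference(PRIMARY)
copy_for_harvest = repository_copy(PRIMARY)
print(reference)
print(copy_for_harvest)
```

Since the copy is always derived from the primary record, the user still inputs each value only once even though parts of it are stored twice.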
The Solution: Metadata in CERIF-CRIS and Repository
  Discussion
    – Parts of (meta)data stored twice, but storage is cheap
        Research process workflow means only input once
    – Improved quality through validation due to context and provenance
    – Management information processing performed in one system and separated from access to the research articles, datasets or software
Conclusion
  The solution presented works in prototype designs:
    – UiB: FRIDA (CERIF-CRIS) linked to DSpace
    – CCLRC: CDR (CERIF-CRIS) linked to ePubs (articles) and e-Research portal (datasets and software)
  And is now being implemented in production