0 APAN 24-01-2008 NorStore national storage infrastructure for Norwegian research Jan Meijer, UNINETT.

Slides:



Advertisements
Similar presentations
DRIV(ER)ing Research Infrastructures Yannis Ioannidis University of Athens, Hellas 1st DRIVER Summit: Towards a Confederation of Digital Repositories,
Advertisements

EU Presidency Conference Effective policies for the development of competencies of youth in Europe Warsaw, November 2011 Improving basic skills in.
OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
Storage Services Let the data flow! NorduNet 2008,.fi, 9 April 2008 Jan Meijer.
OCLC Online Computer Library Center Steering Around the Iceberg: Economic Sustainability for Digital Collections Brian Lavoie Research Scientist OCLC Economics.
Preserving Research Data in Canada: an update DLI/ACCOLEDS 2009 Chuck Humphrey University of Alberta 1.
Challenges and Opportunities in Recruitment and Retention of Rural General Practitioners 16 th March 2013 Wesley Henderson 1.
DRIVER Building a worldwide scientific data repository infrastructure in support of scholarly communication 1 JISC/CNI Conference, Belfast, July.
DRIVER Long Term Preservation for Enhanced Publications in the DRIVER Infrastructure 1 WePreserve Workshop, October 2008 Dale Peters, Scientific Technical.
Research Infrastructures WP 2012 Call 10 e-Infrastructures part Topics: Construction of new infrastructures (or major upgrades) – implementation.
An open source approach for grids Bob Jones CERN EU DataGrid Project Deputy Project Leader EU EGEE Designated Technical Director
1 Building scientific Virtual Research Environments in D4Science Paul Polydoras University of Athens, Greece.
Edinburgh 23 October DSpace: A Platform for Research Repositories Peter Morgan Project Director, Cambridge University Library.
Research Councils ICT Conference Welcome Malcolm Atkinson Director 17 th May 2004.
Philip LordDigital Archiving Consultancy Alison Macdonald Digital Archiving Consultancy Liz LyonDigital Curation Centre David GiarettaDigital Curation.
SWITCH Visit to NeSC Malcolm Atkinson Director 5 th October 2004.
Supporting education and research Repositories in Context Digital repositories as components of an integrated infrastructure for education Leona Carpenter.
Collection-level description & collection management: tool for the trade or information trade-off? Collection Description Focus Workshop 4 Newcastle, 8.
Joint Information Systems Committee Digital Library Services BL/JISC Workshop Rachel Bruce JISC Programme Director The Digital Library and its Services,
Collection-level description & the Information Landscape: users evaluate strategies for resource discovery Collection Description Focus Workshop 5 Cambridge,
A centre of expertise in data curation and preservation DigCCur2007 Symposium, Chapel Hill, N.C., April 18-20, 2007 Co-operation for digital preservation.
A centre of expertise in data curation and preservation DCC Workshop: Curating sApril 24 – 25, 2006 Funded by: This work is licensed under the Creative.
Pulling it all together… with thanks to Sheila Anderson.
E-Science and Open Access, Data, Science, … a Nordic perspective Sverker Holmgren Director The Nordic e-Science Globalisation Initiative.
Preserving and Sharing Digital Data Greg Colati, Director, Archives and Special Collections May 11, 2012.
1 Kentuckys Public Safety Awareness Initiative Program Coordination and Partnerships August 23, 2005.
EMS Checklist (ISO model)
1 Dr. Ashraf El-Farghly SECC. 2 Level 3 focus on the organization - Best practices are gathered across the organization. - Processes are tailored depending.
1 The OneGeology project IC GS Ian Jackson, February 2007.
New Products for © 2009 ANGEL Learning, Inc. Proprietary and Confidential, 2 Update Summary Enrich teaching and learning Meet accountability needs.
How to commence the IT Modernization Process?
Supporting National e-Health Roadmaps WHO-ITU-WB joint effort WSIS C7 e-Health Facilitation Meeting 13 th May 2010 Hani Eskandar ICT Applications, ITU.
A deepening of training needs in digital curation Claudia Engelhardt Framing the digital curation curriculum Florence, 6-7 May 2013.
An introduction to collections and collection-level description Collection-Level Description & NOF-digitise projects NOF-digitise programme seminar, London,
1 Chapter 11: Data Centre Administration Objectives Data Centre Structure Data Centre Structure Data Centre Administration Data Centre Administration Data.
ICDL-Contentra Workshop 29 th November /11/2013 Contentra Technologies Confidential (RajuB)1.
GSDI 6 conference, , Budapest 1 From data harmonisation to data interoperability Presented by
NORMAPME ISO User Guide for European SMEs The essence of.
Joint CASC/CCI Workshop Report Strategic and Tactical Recommendations EDUCAUSE Campus Cyberinfrastructure Working Group Coalition for Academic Scientific.
A centre of expertise in data curation and preservation MIS Seminar :: University of Edinburgh :: 2 October 2006 Funded by: This work is licensed under.
Riding the Wave: a Perspective for Today and the Future APA Conference, November 2011 Monica Marinucci EMEA Director for Research, Oracle.
An Overview of eResearch Activities in Australia Paul Davis, GrangeNet Jane Hunter, Uni of Qld.
Thee-Framework for Education & Research The e-Framework for Education & Research an Overview TEN Competence, Jan 2007 Bill Olivier,
Co-funded by the European Union under FP7-ICT Alliance Permanent Access to the Records of Science in Europe Network Co-ordinated by aparsen.eu #APARSEN.
Integrating Digital Curation in a Digital Library curriculum: the International Master DILL case study Anna Maria Tammaro University of Parma Florence,
EGI-Engage EGI-Engage Engaging the EGI Community towards an Open Science Commons Project Overview 9/14/2015 EGI-Engage: a project.
Co-funded by the European Union under FP7-ICT Co-ordinated by aparsen.eu #APARSEN Why persistent identifiers are crucial in digital preservation.
Towards a European network for digital preservation Ideas for a proposal Mariella Guercio, University of Urbino.
DASISH Final Conference Common Solutions to Common Problems.
Notur: - Grant f.o.m is 16.5 Mkr (was 21.7 Mkr) - No guarantees that funding will increase in Same level of operations maintained.
Libraries, Archives, and Digital Preservation: The Reality of What We Must Do Leslie Johnston Acting Director, National Digital Information Infrastructure.
E.Soundararajan R.Baskaran & M.Sai Baba Indira Gandhi Centre for Atomic Research, Kalpakkam.
SEEK Welcome Malcolm Atkinson Director 12 th May 2004.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
Symposium on Global Scientific Data Infrastructures Panel Two: Stakeholder Communities in the DWF Ann Wolpert, Massachusetts Institute of Technology Board.
Storage Services Collecting the pieces of the puzzle TNC 2008, Brugge, 21 May 2008 Jan Meijer.
Institutional Repositories July 2007 Intellectual property management : the DISA experience Dr D Peters DISA: Digital Innovation South Africa.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI strategy and Grand Vision Ludek Matyska EGI Council Chair EGI InSPIRE.
Storage, an infrastructure component TERENA storage collaboration meeting Amsterdam, Jan Meijer uninett.no.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No EUDAT Aalto Data.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
Accessing the VI-SEEM infrastructure
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Summit 2017 Breakout Group 2: Data Management (DM)
Exploitation of ISS Scientific data - sustainability
EGI Webinar - Introduction -
Brian Matthews STFC EOSCpilot Brian Matthews STFC
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Presentation transcript:

0 APAN NorStore national storage infrastructure for Norwegian research Jan Meijer, UNINETT

1 APAN Norway, geography

2 APAN Trends in Norwegian e-Infrastructure: Need for support of research that is based on access to collections of primary research data and information incl. research that does not have a need for HPC Need to separate long-term storage from HPC facilities Need for strategy, policy and practice regarding the creation, management, and long-term care of data: data curation Data repositories that are actively curated have become a reality. Data is not merely archived anymore Need for tools to assist in discovery, re-exploitation and presentation of data NorStore

3 APAN Trends in Norwegian e-Infrastructure (ctd): Sizes of scientific data collections have increased to the Terabyte scale Information technology tools evolve rapidly and the flexibility in using these tools put the very data they create and transform at risk Survival of digital scientific information depends on a hierarchy of constantly shifting technologies – hardware, storage media, operating systems, applications software and middleware. Overcome the technical obsolescence problems NorStore

4 APAN There are many reasons to keep data: retention of unique observational information which is impossible to recreate retention of expensively generated data which is cheaper to maintain than to recreate reuse data for new or future research purpose validate and account for publicly funded research for compliance with legal requirements for educational and teaching purposes NorStore

5 APAN Survey 2007: several user groups in Norway expressed a need for long-term storage of (large) data: - International Polar Year - collection of 30 projects - Meteorological Institute - Bjerknes Centre for Climate Research (>1 PB/year) - Nansen Environmental and Remote Sensing Centre (NERSC) – data assimilation - CARBOOCEAN – biogeochemistry and CO2 - PGP – study of violent processes in (geo)physics - CMBN – molecular biology and neuroscience NorStore

6 APAN The main objective: Establish and maintain a broad and sustainable infrastructure for the curation, archiving and preservation of data from computational science and the natural sciences. NorStore

7 APAN NorStore will - operate storage resources and peripheral equipment - provide support to researchers need storage capacity, digital repositories and curation services - promote a set of standard services and establish best practices and policies that aim to improve the reuse and the reusability of scientific data - provide easy, secure and transparent access to distributed storage resources, provide large aggregate capacities for storage and data transfer, and optimize the utilization of the overall capacity NorStore

8 APAN The infrastructure will be an integrated part of the national e-Infrastructure (HPC, national network, national AAI and national grid) The infrastructure must prove to be sustainable, cost- efficient, allow efficient utilization of the available resources, services and competencies and shall be attractive to a wide range of sciences. NorStore

9 APAN The infrastructure must enhance the ability of researchers to extract further meaning from masses of data stored in institutional, national, international or community repositories, contribute to the standardization and interoperability of repositories and software interfaces for storage in general increase the pooling of resources and competencies across the participating centres NorStore

10 APAN The data revolution raises non-technical issues wrt: - security - confidentiality and continued privacy - ownership - assured provenance - authenticity and integrity How to guarantee the quality of the primary data and associated metadata? Trust in data can be enhanced by the existence of qualified domain specialists who curate the data Open access: immediate, free, unrestricted on-line access NorStore

11 APAN Concrete tasks for the project: coordination of investments in the infrastructure, in particular in large-scale storage resources and recruitment of skilled personnel operation of the resources, tools and services in the infrastructure provide expert advice and assistance to users of the infrastructure and more generally, to individual and groups that create and maintain data collections ensure cost-efficient and high utilization of the overall infrastructure NorStore

12 APAN Concrete tasks for the project (ctd): maintain a core set of services, tools and protocols, as required by user communities, (international) collaborative partnerships coordination and administration of access to the infrastructure contribute to national policy and establishment of best practices for using the infrastructure and for the curation, archiving, and preservation of data in general increase awareness of the importance of data curation to all stakeholders NorStore

13 APAN NorStore NANO FUGE KLIMA project HAV UiX institution PETRO programme HEP networks, standardized services, interoperability Logically: (repositories)

14 APAN NorStore NANO FUGE KLIMA HAV UiX PETRO HEP networks, services, interoperability tape disk Physically:

15 APAN User access: The infrastructure is open to all sciences that are within the responsibility of the Norwegian e-Science programme (computational sciences, natural sciences, …) Access to the infrastructure will be by application and will be governed by a Committee appointed by the Research Council of Norway. NorStore

16 APAN Application criteria shall include - scientific merit of the research - feasibility of the usage of the infrastructure - duration of usage, known/expected future usage - type of usage (e.g., on-line repository, archive, …) - proper content management of the data collections Also aspects of security, confidentiality, privacy, ownership, provenance and formats of the data, restrictions for third parties to access the data shall be considered. NorStore

17 APAN Initial consortium: Universities in Bergen, Trondheim (NTNU), Oslo, Tromsø, UNINETT (NREN), UNINETT Sigma Collaborations with organizations and international efforts that have an interest in infrastructure for scientific data. E.g., - National Library, Norwegian Social Science Data Services - Similar Nordic initiatives - Coupling to European initiatives: EGI, PRACE, … NorStore

18 APAN In 2007, focus was on the initial specification of the infrastructure, choice of technologies, levels of curation, roadmap for 2008, accumulation of experience. A main activity late 2007 and early 2008 is the investment in hardware for the initial infrastructure. The investments must be such that the initial infrastructure can be expanded and upgraded in a cost-efficient manner in the coming years. NorStore

19 APAN Characteristics and challenges: - Distributed infrastructure (multiple sites, long distances) - Heterogeneous infrastructure (multiple types of resources and storage media) - Heterogeneous data, large data sets - Heterogeneous usage: active repositories, archiving, back-up, buffers, scratch areas, … NorStore

20 APAN NorStore hardware general services Proj A Proj B … NorStore ssh, ftp, scp, gridftp, middlewares, virtualization UiA Inst C Org B … project-specific services

21 APAN Initially: Two storage elements (ca. 600 TB each). Installation January Tape-robot expansion at UiB. Built bottom-up from identical homogeneous independent resources. Core set of services (back-up, mirrors, archives, …) Technology project to investigate software solutions NorStore 10G UiT UiO UiB NTNU SE HPC MS

22 APAN Further information