RSS Data-farm: from local storage to the cloud

Slides:



Advertisements
Similar presentations
Archive Task Team (ATT) Disk Storage Stuart Doescher, USGS (Ken Gacke) WGISS-18 September 2004 Beijing, China.
Advertisements

Slide: 1 ROSA GRAS Meeting February 2009 Matera, Italy User Services EUMETSAT EUMETSAT Data Access & User Support.
Symantec De-Duplication Solutions Complete Protection for your Information Driven Enterprise Richard Hobkirk Sr. Pre-Sales Consultant.
Barracuda Networks Confidential1 Barracuda Backup Service Integrated Local & Offsite Data Backup.
An Introduction to DuraCloud Carissa Smith, Partner Specialist Michele Kimpton, Project Director Bill Branan, Lead Software Developer Andrew Woods, Lead.
CERN IT Department CH-1211 Genève 23 Switzerland t Next generation of virtual infrastructure with Hyper-V Michal Kwiatek, Juraj Sucik, Rafal.
Windows ® Powered NAS. Agenda Windows Powered NAS Windows Powered NAS Key Technologies in Windows Powered NAS Key Technologies in Windows Powered NAS.
STEALTH Content Store for SharePoint using Windows Azure  Boosting your SharePoint to the MAX! "Optimizing your Business behind the scenes"
Disaster Recovery as a Cloud Service Chao Liu SUNY Buffalo Computer Science.
1© Copyright 2013 EMC Corporation. All rights reserved. EMC and Microsoft SharePoint Server Data Protection Name Title Date.
Opensource for Cloud Deployments – Risk – Reward – Reality
Technology Overview. Agenda What’s New and Better in Windows Server 2003? Why Upgrade to Windows Server 2003 ?  From Windows NT 4.0  From Windows 2000.
STEALTH Content Store for SharePoint using Caringo CAStor  Boosting your SharePoint to the MAX! "Optimizing your Business behind the scenes"
IT Infrastructure Chap 1: Definition
Meeting the Data Protection Demands of a 24x7 Economy Steve Morihiro VP, Programs & Technology Quantum Storage Solutions Group
By: Ashish Gohel 8 th sem ISE.. Why Cloud Computing ? Cloud Computing platforms provides easy access to a company’s high-performance computing and storage.
©2006 Merge eMed. All Rights Reserved. Energize Your Workflow 2006 User Group Meeting May 7-9, 2006 Planning for Expansion Steve Nevermann.
Chapter 5 McGraw-Hill/Irwin Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved.
DuraCloud Open technologies and services for managing durable data in the cloud Michele Kimpton, CBO DuraSpace.
2. WP9 – Earth Observation Applications ESA DataGrid Review Frascati, 10 June Welcome and introduction (15m) 2.WP9 – Earth Observation Applications.
Page 1 Federated Earth Observation (FedEO) Status CEOS WGISS Meeting #40 28 Sep – 02 Oct, 2015 Harwell, UK Hosted by UKSA M.Albani, P.Mougnaud, A.Della.
CLOUD COMPUTING WHAT IS CLOUD COMPUTING?  Cloud Computing, also known as ‘on-demand computing’, is a kind of Internet-based computing,
Cloud Archive By: Kimberly Nolan. What it is?  The goal of a cloud archiving service is to provide a data storage (ex. Google drive and SkyDrive) as.
Research and Service Support Resources for EO data exploitation RSS Team, ESRIN, 23/01/2013 Requirements for a Federated Infrastructure.
Dell Software Unified Communications Command Suite (UCCS) Provides Flexible, Cross-Platform Management, Reporting and Data Diagnostics MICROSOFT AZURE.
Commvault and Nutanix October Changing IT landscape Today’s Challenges Datacenter Complexity Building for Scale Managing disparate solutions.
What is Hybrid Cloud Software?. What is Cloud Storage? Before talking about hybrid cloud software we have to know about cloud storage. Cloud Storage means.
A Solution for Maintaining File Integrity within an Online Data Archive Dan Scholes PDS Geosciences Node Washington University 1.
Jordi Farres HMA-WG Meeting ESRIN, 23 Jan 2013
CLOUD ARCHITECTURE Many organizations and researchers have defined the architecture for cloud computing. Basically the whole system can be divided into.
ESA-FAO GEOPortal STATUS & PLANS
Univa Grid Engine Makes Work Management Automatic and Efficient, Accelerates Deployment of Cloud Services with Power of Microsoft Azure MICROSOFT AZURE.
Device Maintenance and Management, Parental Control, and Theft Protection for Home Users Made Easy with Remo MORE and Power of Azure MICROSOFT AZURE APP.
Remaking Secondary Storage
OGC Technical and Planning Committee Meeting Welcome
Network Attached Storage Overview
DREAM High-Level Architecture
New Heights by Guiding Them into the Cloud
Introduction to Data Management in EGI
Presentation on Copernicus Dissemination
Couchbase Server is a NoSQL Database with a SQL-Based Query Language
Informix Red Brick Warehouse 5.1
Introduction.
Veeam Backup Repository
AWS. Introduction AWS launched in 2006 from the internal infrastructure that Amazon.com built to handle its online retail operations. AWS was one of the.
With Help from the Microsoft Azure Cloud,
Real IBM C exam questions and answers
Future Data Architecture Cloud Hosting at USGS
Overview Introduction VPS Understanding VPS Architecture
The VITO Earth Observation LTDA Facility
Disaster happens; don’t be held hostage
Opening Remarks European Commission CEOS 2018 Chair
AKAMAI INTELLIGENT PLATFORM™
On-Premises, or Deployed in a Hybrid Environment
User Monitoring Appliance Secures Microsoft Azure by Auditing Privileged Users in the Cloud “Microsoft Azure provides an easily accessible platform for.
Unitrends Enterprise Backup Solution Offers Backup and Recovery of Data in the Microsoft Azure Cloud for Better Protection of Virtual and Physical Systems.
Druva inSync: A 360° Endpoint and Cloud App Data Protection and Information Management Solution Powered by Azure for the Modern Mobile Workforce MICROSOFT.
Dell Data Protection | Rapid Recovery: Simple, Quick, Configurable, and Affordable Cloud-Based Backup, Retention, and Archiving Powered by Microsoft Azure.
BluVault Provides Secure and Cost-Effective Cloud Endpoint Backup and Recovery Using Power of Microsoft OneDrive Business and Microsoft Azure OFFICE 365.
Barracuda Solutions VMware® vCloud® Air™ Version 1.0 | February 2015.
Keep Your Digital Media Assets Safe and Save Time by Choosing ImageVault to be Your Digital Asset Management Solution, Hosted in Microsoft Azure Partner.
TEMPLATE.
One-Stop Shop Manages All Technical Vendor Data and Documentation and is Globally Deployed Using Microsoft Azure to Support Asset Owners/Operators MICROSOFT.
Services Chaining Alessandro Marin SSE Workshop 2008
Pier Giorgio Marchetti, Philippe Mougnaud European Space Agency
PRESENTER GUIDANCE: These charts provide data points on how IBM BaaS mid-market benefits a client with the ability to utilize a variety of backup software.
Emerging technologies-
BluSync by ParaBlu Offers Secure Enterprise File Collaboration and Synchronization Solution That Uses Azure Blob Storage to Enable Secure Sharing MICROSOFT.
Data Management Components for a Research Data Archive
Presentation transcript:

RSS Data-farm: from local storage to the cloud Julio Carreira ESA-ESRIN, Frascati , Roma (Italy) Storage Solutions for Digital Preservation APARSEN Webinars , 14th April 2014 EOP-GTR Systems

What is RSS RSS architecture and environments have been design and made available by the Research/Service Support and Ground Segment Technology (EOP-GSR) section of ESA Earth Observation Ground Segment Department. Its primary mission is to Provide support to the EO community to exploit EO data and to the researchers and service provider to ease the development of applications to generate value added information Support and promote ground segment harmonization activities through the identification and collection of ground segment technology needs EOP-GSR

Standardised IFs following HMA Guidelines RSS systems Data Farm EO Products, Image Collection UK PAC ESA RSS Storage Standardised IFs following HMA Guidelines MEA Storage OGC GIS Web Service Online Archives Metadata, Auxiliary data G-POD Catalogue OGC Web GIS Services Reference Data Sets DAIL SSE EOP-GSR

RSS storage infrastructure GPOD have 20 storages (NAS/SAN) , the full space is ~ 600 TB ~7 000 000 products storage ~ 350 TB of data Our storages are from 2005 till 2013 Increase of > 50 TB year Use of RAID 5/6 with/without spare disk Monitoring the storage system using OPSView EOP-GSR

The use of data at RSS EOP-GSR

Ancient times (around 2010) The problem All the Storages had different paths to the data sctructure A catalogue was necessary to know were was the data No direct access to a flow of data (ex. A full year of a dataset) EOP-GSR

The answer! The solution was to create a datafarm(distributed file system): POSIX based file system Can be deploy on top of an pre-existent file system Mirroring and replication Load balancing Storage quotas ACLs for user access to data EOP-GSR

RSS datafarm The RSS DataFarm allows much more flexibility than before in accessing data. For example, it is now possible to ingest data directly from the former G-POD dedicated storages into the RSS WebMap Server, with no need to copy data on a local storage. The same applies to SSE, KEO and the other RSS environments. Other benefits brought by the DataFarm are:   Optimized storage space utilization   Easy access control    Easy scalability     At the moment the RSS archive, composed by ENVISAT, ERS and third party missions data has a total volume around 350TB growing by some 50TB/yr. RSS DataFarm move a step towards the Cloud as well. Indeed, this novel RSS infrastructure model is been naturally extended to the Cloud, therefore constituting a robust and scalable basis for providing more and more efficient and flexible support to EO data users in the coming years.  EOP-GSR

Today! All the Storages have a single point of access

In the far future (Dec 2014) EOP-GSR

Main tecnical Problems Main Problems Network Polices Network velocity Security EOP-GSR

Different Elements of Risk Availability Performance Portability Responsiveness Confidentiality Legal Problems Backup Data integrity Disaster recovery Access control Reporting and monitoring Client expectations Responsiveness to users Maintaining corporate values of quality of service High High Medium Medium High Medium High Medium High EOP-GSR

At this moment! At this moment on the development environment, we have 15 Tb on the cloud, and we are: Testing the geo replication of “critical” data, Making stress testing with massive data requests, Monitoring the availability and the performance of the file system, Planning the security and the encryption of the communications, Creating a software for a easy portability between cloud and virtualization providers, Making a cost model of the storage on the cloud comparing to the local storage. EOP-GSR

Conclusions We are happy with our solution of unifed storage using a distributed file system We are testing the portability to the cloud with sucess We are creating policies/rules to mitigate the main risks of having the storage on the cloud EOP-GTR

Thank you EOP-GTR