Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas (Computer Architecture, Communications and Systems Group)

Slides:



Advertisements
Similar presentations
Shared-Memory Model and Threads Intel Software College Introduction to Parallel Programming – Part 2.
Advertisements

Design and Evaluation of an Autonomic Workflow Engine Thomas Heinis, Cesare Pautasso, Gustavo Alsonso Dept. of Computer Science Swiss Federal Institute.
1 Senn, Information Technology, 3 rd Edition © 2004 Pearson Prentice Hall James A. Senns Information Technology, 3 rd Edition Chapter 7 Enterprise Databases.
Distributed Systems Architectures
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Chapter 1 The Study of Body Function Image PowerPoint
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Chapter 3 CPUs.
1 Building a Fast, Virtualized Data Plane with Programmable Hardware Bilal Anwer Nick Feamster.
11 Application of CSF4 in Avian Flu Grid: Meta-scheduler CSF4. Lab of Grid Computing and Network Security Jilin University, Changchun, China Hongliang.
Towards Automating the Configuration of a Distributed Storage System Lauro B. Costa Matei Ripeanu {lauroc, NetSysLab University of British.
18 Copyright © 2005, Oracle. All rights reserved. Distributing Modular Applications: Introduction to Web Services.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Year 6 mental test 10 second questions
OPERATING SYSTEMS Lecturer: Szabolcs Mikulas Office: B38B
Chapter 6 File Systems 6.1 Files 6.2 Directories
1 Chapter 12 File Management Patricia Roy Manatee Community College, Venice, FL ©2008, Prentice Hall Operating Systems: Internals and Design Principles,
Secure Virtual Machine Execution Under an Untrusted Management OS Chunxiao Li Anand Raghunathan Niraj K. Jha.
Database Systems: Design, Implementation, and Management
Tom Hamilton – America’s Channel Database CSE
4.1 © 2004 Pearson Education, Inc. Exam Managing and Maintaining a Microsoft® Windows® Server 2003 Environment Lesson 4: Organizing a Disk for Data.
Mehdi Naghavi Spring 1386 Operating Systems Mehdi Naghavi Spring 1386.
Tanenbaum, Modern Operating Systems 3 e, (c) 2008 Prentice-Hall, Inc
Troubleshooting Startup Problems
© 2003 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Performance Measurements of a User-Space.
Lecture 13 storage management
Chapter 1: Introduction to Scaling Networks
Pharos Uniprint 8.3 Training
Advance Nano Device Lab. Fundamentals of Modern VLSI Devices 2 nd Edition Yuan Taur and Tak H.Ning 0 Ch9. Memory Devices.
INTRODUCTION TO SIMULATION WITH OMNET++ José Daniel García Sánchez ARCOS Group – University Carlos III of Madrid.
Virtual Memory II Chapter 8.
Windows® Deployment Services
Chapter 11: The X Window System Guide To UNIX Using Linux Third Edition.
IP Multicast Information management 2 Groep T Leuven – Information department 2/14 Agenda •Why IP Multicast ? •Multicast fundamentals •Intradomain.
25 July, 2014 Hailiang Mei, TU/e Computer Science, System Architecture and Networking 1 Hailiang Mei Remote Terminal Management.
Operating Systems Operating Systems - Winter 2011 Dr. Melanie Rieback Design and Implementation.
Operating Systems Operating Systems - Winter 2012 Dr. Melanie Rieback Design and Implementation.
VOORBLAD.
Chapter 6 File Systems 6.1 Files 6.2 Directories
Factor P 16 8(8-5ab) 4(d² + 4) 3rs(2r – s) 15cd(1 + 2cd) 8(4a² + 3b²)
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 EN0129 PC AND NETWORK TECHNOLOGY I IP ADDRESSING AND SUBNETS Derived From CCNA Network Fundamentals.
© 2012 National Heart Foundation of Australia. Slide 2.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Introduction to Computer Administration Introduction.
Understanding Generalist Practice, 5e, Kirst-Ashman/Hull
Executional Architecture
Implementation Architecture
Global Analysis and Distributed Systems Software Architecture Lecture # 5-6.
© 2004 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Volume Concepts HP Restricted Module.
25 seconds left…...
Equal or Not. Equal or Not
Slippery Slope
Chapter 10: The Traditional Approach to Design
Systems Analysis and Design in a Changing World, Fifth Edition
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Addressing the Network – IPv4 Network Fundamentals – Chapter 6.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
PSSA Preparation.
VPN AND REMOTE ACCESS Mohammad S. Hasan 1 VPN and Remote Access.
1 Chapter 13 Nuclear Magnetic Resonance Spectroscopy.
Installing Windows XP Professional Using Attended Installation Slide 1 of 30Session 8 Ver. 1.0 CompTIA A+ Certification: A Comprehensive Approach for all.
TCP/IP Protocol Suite 1 Chapter 18 Upon completion you will be able to: Remote Login: Telnet Understand how TELNET works Understand the role of NVT in.
RAID-x: A New Distributed Disk Array for I/O-Centric Cluster Computing Kai Hwang, Hai Jin, and Roy Ho.
Technology Overview. Agenda What’s New and Better in Windows Server 2003? Why Upgrade to Windows Server 2003 ?  From Windows NT 4.0  From Windows 2000.
INSTALLING MICROSOFT EXCHANGE SERVER 2003 CLUSTERS AND FRONT-END AND BACK ‑ END SERVERS Chapter 4.
Module 11: Implementing ISA Server 2004 Enterprise Edition.
1 Introduction to Microsoft Windows 2000 Windows 2000 Overview Windows 2000 Architecture Overview Windows 2000 Directory Services Overview Logging On to.
Storage Virtualization
Chapter 3: Windows7 Part 4.
Presentation transcript:

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas (Computer Architecture, Communications and Systems Group) UNIVERSIDAD CARLOS III DE MADRID A parallel File System for Networks of Windows Workstations José María Pérez Jesús Carretero José Daniel García Félix García Alejandro Calderón

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –2– Outline  Introduction  Goals  Design  Evaluation  Conclusion

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –3– High performance and data storage  Growing need for high performance data storage. Growing capacity of disks. Growing data storage from applications.  I/O becomes in bottleneck.  Typical solution: Parallel I/O Join several storage resources  Large storage. Increased scalability and performance. Load balancing.

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –4– Parallel File Systems  Several nodes with storage devices. Accesses performed in parallel. Data striped among nodes.  Striping allows: Parallel access to different files. Parallel access to the same file. Striping originally used in RAID.

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –5– Current state  Current solutions are neither general nor flexible.  Do not use standard servers. Difficult to integrate in existing networks of workstations. Need to install new difficult servers. Available for specific platforms.  Implementation outside the operating system. A new I/O API is needed. Applications need to be modified or recompiled.

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –6– Outline  Introduction  Goals  Design  Evaluation  Conclusion

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –7– WinPFS: Goal  Build a parallel file system for networks of Windows workstations using standard data sharing services (as Windows Shared Folders). A first prototype has been built using CIFS/SMB servers.

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –8– Detailed goals  Integrate existing storage resources using shared folders rather than installing new servers. Accomplished by using Windows Redirectors.  Simple setup. Implemented as a new Windows File System in the kernel (a new stackable driver in the I/O hierarchy).  Easy to use. No special API’s  Applications work without recompilation.  Enhance performance, scalability and capacity. Request splitting, balanced data allocation, load balancing,...

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –9–

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –10– Outline  Introduction  Goals  Design  Evaluation  Conclusion

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –11– WinPFS design  Design based in a new Windows kernel component: A file system redirector. Implements the basis of the file system. Isolates users from the parallel file system Uses protocols to connect to different network file systems.  Redirector  redirects requests to remote servers with specific protocol (e.g.: CIFS/SMB).  WinPFS is registered as a virtual remote file system, implement the parallel I/O mechanisms and use other remote data services (redirectors).

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –12–

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –13– Remote data access User point of view  Access to remote data through shared folders.  WinPFS creates a new shared folder: \\PFS.  Users can access parallel files through this shared folder. Kernel point of view  Access through CIFS/SMB, …  Capture requests through the usage of Universal naming Convention (UNC).  Special kind of file system: a redirection of redirector drivers.

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –14– File striping and requests

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –15– Layered I/O  Windows NT family has a layered I/O model. Several layers to process a request in the I/O subsystem. Each layer is a driver which can receive a request and pass it to lower layers in the I/O stack. The model allows the insertion of new layers, using new drivers. File systems are implemented as drivers in the I/O model, so new file systems can be added at the kernel level.

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –16– I/O Request Management  IRP (I/O Request Packet) Describes an I/O request. Sent to kernel-mode drivers by I/O Manager (in behalf of the client).  I/O Manager Receives system calls. Creates IRP describing the request. Deliver the IRP to the appropriate driver.  MUP (Multi UNC Provider) Identifies the kernel-mode driver in charge for a network name.

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –17–

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –18– Request management  Create: IRP’s are replicated and sent to each server.  Read/Write: Request split in smaller subrequests.  Create Directory: IRP’s are replicated and set to each server.

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –19– Using WinPFS  Administration / Installation: Install a new driver in client nodes. Share folders in server nodes. Indicate shared folders using registry in client nodes.  User Prefix paths with \\PFS. We plan to map remote names to common driver letters.  WinPFS may be used with any API that is on top of Windows Services. Win32, POSIX, DOS, cygwin, …

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –20– Other Features  Caching. Caching mechanisms performed by redirectors. Limited to Windows caching model. More advanced caching for future work.  Security and Authentication Current model works on a Windows Domain, forests and trusted domains. Standard Windows mechanisms used to managed policies and security in enterprises, labs and departments. Uses standard Windows security model. Changes to be done for workgroup or not trusted domains.  Data consistency between clients. Currently only solved for all servers using CIFS, using the default mechanism used by CIFS redirector  oplocks (oportunistic locks).

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –21– Outline  Introduction  Goals  Design  Evaluation  Conclusion

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –22– Evaluation  Creating a file of 100 MB.  Write sequentially.  Read sequentially.  Static buffer size.  Client cache disabled.  Two clusters with four nodes.  Node BiProcessor Pentium III. 1 GHz. 1 GB main memory. 200 GB disk.  GigaEthernet network  1 Windows 2003 Server  7 Windows XP Professional

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –23– Evaluation infrastructure

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –24– Configurations  CIFS: One server.  PFS88: 8 servers in parallel.  PFS44: 4 servers in parallel.  PFS84: 4 servers in parallel and selected randomly from a set of 8. In all cases 8 clients running (1 client per node)

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –25– Write results

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –26– Read results

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –27– Results  All WinPFS solutions provide better results than CIFS.  PFS88 provides the best performance as its parallelism degree is maximum.  Performance reaches to 250 Mbit/s for write operations. Writes limited by the disks.  Performance reaches to 1200 Mbit/s for read operations. Reads limited by the network.

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –28– Write Speedup PFS88/CIFS

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –29– Read Speedup PFS88/CIFS

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –30– Speedup results  Speedup is higher with more concurrent clients.  Write speedup from 500% to 700% may be achieved.  Read speedup is less 100% because data are obtained from server caches without disk accesses.  WinPFS performance is limited by the striping size buffer.

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –31– Outline  Introduction  Goals  Design  Evaluation  Conclusion

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –32– Conclusions  WinPFS is a parallel file system implemented as a kernel- mode driver.  Integration into the kernel provides higher performance. Uses existing mechanisms at kernel level.  No change or recompilation needed in client applications.  We can run an application that uses parallel I/O taking advantages of the shared folders in our organizations, without affecting users. For example, launch an I/O intensive application in a classroom, and accessing to shared folders.

Grupo de Arquitectura de Computadores, Comunicaciones y Sistemas UNIVERSIDAD CARLOS III DE MADRID Expanding Windows Kernel to Integrate Heterogeneous Resources on Data Grids Page –33– Future Work  Use of Active Directory Service to create metadata repository and give a consistent image of parallel file systems. Objective: No need of manual edition of client registry to provide information about shared folders.  Evaluation with other operating systems. Linux, FreeBsd and Solaris sharing folders with Samba.  Evaluation with other protocols (redirectors). NFS (redirector provided by Services for UNIX 3.5) and WebDAV. So, a WinPFS can connect to more servers, including NAS.  Parallel usage of heterogeneous resources and protocols in networks of workstations.  Dynamically addition and removal of storage nodes.  Data allocation and load balancing for heterogeneous distributed systems.