Big Data Open Source Software and Projects ABDS in Summary XXI: Layer 15B Part 1 Data Science Curriculum March 1 2015 Geoffrey Fox

Slides:



Advertisements
Similar presentations
Big Data Open Source Software and Projects ABDS in Summary XIV: Level 14B I590 Data Science Curriculum August Geoffrey Fox
Advertisements

Big Data Open Source Software and Projects ABDS in Summary I I590 Data Science Curriculum August Geoffrey Fox
Big Data Open Source Software and Projects ABDS in Summary XIX: Layer 14B Data Science Curriculum March Geoffrey Fox
Big Data Open Source Software and Projects ABDS in Summary XVI: Layer 13 Part 1 Data Science Curriculum March Geoffrey Fox
Big Data Open Source Software and Projects ABDS in Summary II: Layers 3 to 4 Data Science Curriculum March Geoffrey Fox
MyCloudIT Removes the Complexity of Moving Cloud Customers’ Entire IT Infrastructures to Microsoft Azure – Including the Desktop MICROSOFT AZURE ISV: MYCLOUDIT.
Compute/Storage/Network Users On-Premises Data Center A VM A A AA A Application VM Virtual Machine.
Big Data Open Source Software and Projects ABDS in Summary XXII: Layer 15B Part 2 Data Science Curriculum March Geoffrey Fox
Big Data Open Source Software and Projects ABDS in Summary XIII: Level 14A I590 Data Science Curriculum August Geoffrey Fox
Big Data Open Source Software and Projects ABDS in Summary VI: Layer 6 Part 2 Data Science Curriculum March Geoffrey Fox
Big Data Open Source Software and Projects ABDS in Summary III: Layer 5-Part 1 Data Science Curriculum March Geoffrey Fox
Modeling Public Pensions with Mathematica and Python II
Cloud: a New Paradigm for Developers Svetlin Nakov Telerik Software Academy academy.telerik.com.
Engineering the Cloud Andrew McCombs March 10th, 2011.
Big Data Open Source Software and Projects ABDS in Summary IX: Level 11C I590 Data Science Curriculum August Geoffrey Fox
Big Data Open Source Software and Projects Unit 0 Part B: Class Introduction Data Science Curriculum March Geoffrey Fox
HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Cloud: a "New" Paradigm for Developers Telerik Software Academy Web Services and Cloud.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
Lecture 8 – Platform as a Service. Introduction We have discussed the SPI model of Cloud Computing – IaaS – PaaS – SaaS.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 7 2/23/2015.
Cloud Operating System Unit 11 Sever Technology II M. C. Chiang Department of Computer Science and Engineering National Sun Yat-sen University Kaohsiung,
Using the Powerful Microsoft Azure Platform, e-SUAP Properly and Securely Manages All Steps for Customizable Business Activities Permissions MICROSOFT.
1 NETE4631 Using Google Web Services and Using Microsoft Cloud Services Lecture Notes #7.
Introduction to Cloud Computing
BIG DATA APPLICATIONS & ANALYTICS LOOKING AT INDIVIDUAL HPCABDS SOFTWARE LAYERS 1/26/2015 Cloud Computing Software 1 Geoffrey Fox January BigDat.
Windows Azure Conference 2014 Deploy your Java workloads on Windows Azure.
Big Data Open Source Software and Projects ABDS in Summary I: Layers 1 to 2 Data Science Curriculum March Geoffrey Fox
Communicate with All Workers Involved in the Process of Delivering High-Quality Health Care by Choosing Dossier365 on the Azure Platform MICROSOFT AZURE.
FutureGrid Connection to Comet Testbed and On Ramp as a Service Geoffrey Fox Indiana University Infra structure.
Built on the Powerful Microsoft Azure Platform, Media Sonar Brings Social Media Investigation to the Public Cloud MICROSOFT AZURE ISV PROFILE: MEDIA SONAR.
Bizfss File Sync and Sharing Solution, Built on Microsoft Azure, Allows Businesses to Sync, Share, Back Up Using Their Own Cloud Storage MICROSOFT AZURE.
Big Data Open Source Software and Projects ABDS in Summary IV: Level 7 I590 Data Science Curriculum August Geoffrey Fox
Datalayer Notebook Allows Data Scientists to Play with Big Data, Build Innovative Models, and Share Results Easily on Microsoft Azure MICROSOFT AZURE ISV.
Windows Azure. Azure Application platform for the public cloud. Windows Azure is an operating system You can: – build a web application that runs.
Introduction To BlueMix By: Ryan
Built on the Powerful Microsoft Azure Platform, Mproof’s Clientele ITSM Provides Companies with a Complete Software Suite to Manage Services MICROSOFT.
Recipes for Success with Big Data using FutureGrid Cloudmesh SDSC Exhibit Booth New Orleans Convention Center November Geoffrey Fox, Gregor von.
Web Development Technologies Advanced Web-based Systems Advanced Web-based Systems | Misbhauddin.
TACTIC | Workflow: Project Management OSS on Microsoft Azure Helps Enterprises to Create Streamline, Manage, and Track Digital Content MICROSOFT AZURE.
MidVision Enables Clients to Rent IBM WebSphere for Development, Test, and Peak Production Workloads in the Cloud on Microsoft Azure MICROSOFT AZURE ISV.
Web Technologies Lecture 13 Introduction to cloud computing.
Microsoft Azure and DataStax: Start Anywhere and Scale to Any Size in the Cloud, On- Premises, or Both with a Leading Distributed Database MICROSOFT AZURE.
Flight is a SaaS Solution that Accelerates the Secure Transfer of Large Files and Data Sets Into and Out of Microsoft Azure Blob Storage MICROSOFT AZURE.
Picturex Secures and Scales Event-Photo Sharing for Enterprise and Private Customers by Relying on the Powerful, Scalable Microsoft Azure Platform MICROSOFT.
Gain High Availability Performance and Scale of Applications Running on Windows Azure with KEMP Technologies’ Virtual LoadMaster COMPANY PROFILE: KEMP.
Directions in eScience Interoperability and Science Clouds June Interoperability in Action – Standards Implementation.
Panel Discussion Software Defined Ecosystems June BigSystem Software-Defined Ecosystems at HPDC Vancouver Canada Geoffrey Fox.
Saasabi’s Analytical Processing Engine in the Cloud Makes Business Intelligence Affordable for Everyone COMPANY PROFILE: Saasabi Saasabi is a BizSpark.
Big Data Open Source Software and Projects ABDS in Summary II: Layer 5 I590 Data Science Curriculum August Geoffrey Fox
From infrastructure to applications Where cloud computing is at and where it’s headed.
Big Data analytics in the Cloud Ahmed Alhanaei. What is Cloud computing?  Cloud computing is Internet-based computing, whereby shared resources, software.
LIMS (Location Information Management System) is the Smart Claim Solution for Motor Insurers, Built on the Powerful Microsoft Azure Platform MICROSOFT.
© 2016 Catalyze, Inc. Go-To-Market Services HIPAA Compliance in the Cloud: Catalyze Provides Microsoft Azure Customers with a HITRUST Certified Platform-as-a-Service.
PaaS solutions evaluation CERN openlab Summer Students Lightning Talks Sessions Sofia Danko › 19/08/2014.
DreamFactory for Microsoft Azure Is an Open Source REST API Platform That Enables Mobilization of Data in Minutes across Frameworks and Storage Methods.
Agenda  What is Cloud Computing?  Milestone of Cloud Computing  Common Attributes of Cloud Computing  Cloud Service Layers  Cloud Implementation.
Univa Grid Engine Makes Work Management Automatic and Efficient, Accelerates Deployment of Cloud Services with Power of Microsoft Azure MICROSOFT AZURE.
Prepared by: Assistant prof. Aslamzai
Cloud Data platform (Cloud Application Development & Deployment)
Platform as a Service.
Couchbase Server is a NoSQL Database with a SQL-Based Query Language
Andrew McCombs March 10th, 2011
Built on the Powerful Microsoft Azure Platform, iSwarm Helps Businesses Analyze Social Media Conversations, then Connect with Individuals MICROSOFT AZURE.
Scalable SoftNAS Cloud Protects Customers’ Mission-Critical Data in the Cloud with a Highly Available, Flexible Solution for Microsoft Azure MICROSOFT.
Data Science Curriculum March
Cloud DIKW based on HPC-ABDS to integrate streaming and batch Big Data
Big Data Open Source Software and Projects ABDS in Summary I
I590 Data Science Curriculum August
Presentation transcript:

Big Data Open Source Software and Projects ABDS in Summary XXI: Layer 15B Part 1 Data Science Curriculum March Geoffrey Fox School of Informatics and Computing Digital Science Center Indiana University Bloomington

Functionality of 21 HPC-ABDS Layers 1)Message Protocols: 2)Distributed Coordination: 3)Security & Privacy: 4)Monitoring: 5)IaaS Management from HPC to hypervisors: 6)DevOps: 7)Interoperability: 8)File systems: 9)Cluster Resource Management: 10)Data Transport: 11)A) File management B) NoSQL C) SQL 12)In-memory databases&caches / Object-relational mapping / Extraction Tools 13)Inter process communication Collectives, point-to-point, publish-subscribe, MPI: 14)A) Basic Programming model and runtime, SPMD, MapReduce: B) Streaming: 15)A) High level Programming: B) Application Hosting Frameworks Part 1 16)Application and Analytics: 17)Workflow-Orchestration: Here are 21 functionalities. (including 11, 14, 15 subparts) 4 Cross cutting at top 17 in order of layered diagram starting at bottom

What is a Framework We use this to cover to Platform as a Service PaaS plus common practice of bundling that with Infrastructure IaaS as a “full solution”. So it does not include Amazon EC2 or Google Compute Engine but does include AWS Elastic Beanstalk and Google App Engine Typically supports a subset of very popular languages and tools Some aim to support a few important technologies very well Others are lower level and support anything with correct scripts for configuration Others just provide “management services” Frameworks span from IaaS to DevOps to specialized support of key technologies and include tool kits as well as management systems

Google App Engine started in 2008 and offers a service aimed at cloud web resources and not for example at running Hadoop (data analytics) at scale. Java and Python main languages with PHP, Go Jinja2 and Django are supported Python frameworks Memcached, SQL, NoSQL, web serving (Jetty) supported Used by Snapchat, Rovio, and Khan Academy. GAE part of Google Cloud platform that also includes GCE Google Compute Enginehttps://cloud.google.com/ Review engine https:// engine

AppScale AppScale is an open- source cloud computing platform that automatically deploys and scales unmodified Google App Engine applications over many public and private cloud systems and on-premises clusters. AppScale like GAE has support for Python, Go, PHP and Java applications Uses Cassandra and ZooKeeper, memcached, RabbitMQ and Celery Can run on Google Compute Engine, Amazon EC2, Softlayer (IBM), Microsoft Azure, RackSpace, OpenStack, CloudStack, Eucalyptus Spin off of UCSB Computer Science under Professor Chandra Krintz

Red Hat OpenShift OpenShift is a cloud computing platform as a service product from Red Hat. It allows the use of arbitrary languages and frameworks running on Red Hat Enterprise Linux. – OpenShift takes care of maintaining the services underlying the application and scaling the application as needed. Supports MongoDB, MySQL, PostgreSQL and web-application frameworks Node.js for JavaScript, PSGI for Perl, Rack for Ruby and WSGI for Python. Django works for Python – Java also supported

Heroku Heroku is a cloud platform as a service (PaaS) supporting several programming languages. Heroku was acquired by Salesforce.com in Heroku, one of the first cloud platforms, has been in development since June 2007, when it supported only the Ruby programming language, but has since added support for Java, Node.js, Scala, Clojure, Python and PHP and (undocumented) Perl. – The base operating system is Debian or, in the newest stack, the Debian-based Ubuntu. – Supports subsystems like Couchbase, MongoDB and Redis, PostgreSQL with detail at

Aerobatic Aerobatic, the cloud platform for front-end developers that makes it fun to build nimble HTML5 web apps in record time. So what is Aerobatic? In a nutshell, it's a platform as a service (PaaS) for single page web apps. You could think of it as Heroku for front-end client apps. However, unlike Heroku, Aerobatic does not require developers to build and maintain a backend with Ruby, Node, Python, etc. Powerful applications can be built using only browser-based technologies, namely Javascript, HTML, and CSS. In this respect Aerobatic provides similar functionality to static hosting with Amazon S3 or GitHub Pages. Aerobatic provides the same ease of deployment and cloud level scale as those platforms, but layers on a management dashboard and a suite of smart hosting services that provide a greater level of functionality, performance, and security than can easily be achieved with with pure static pages.

AWS Elastic Beanstalk AWS Elastic Beanstalk is a PaaS (Platform as a Service) service from Amazon Web Services that allows users to create applications and push them to a set of AWS services, including Amazon EC2, Amazon S3, Amazon Simple Notification Service (SNS), Amazon CloudWatch, auto scaling, and elastic load balancers Supports Java,.NET, PHP, Node.js, Python, Ruby, Go, and Docker on familiar servers such as Apache, Nginx, Passenger, and IIS.

Azure Microsoft Azure has both IaaS (equivalent to AWS EC2 etc.) and a PaaS which was their original 2010 offering. The PaaS is still very powerful but now matched by others – especially Amazon – Azure’s original platform was.NET based and that handicaps it as most technologies are produced in Linux (Java) versions – IaaS offers Linux or Windows Technologies in PaaS include Object and Blob storage, Tables, Queues, Databases, Hadoop (HDInsight)

Cloud Foundry Pivotal Cloud Foundry, used by IBM Bluemix, comes in three flavors Cloud Foundry Open Source Software (OSS) – Available to anyone - see Pivotal Cloud Foundry (Pivotal CF) which is a commercial product available from Pivotal. It provides extra tools for installation and administration not included in the OSS product. Pivotal Web Services PWS which is an instance of Pivotal Cloud Foundry hosted on Amazon Web Services (AWS). The following services are available to Pivotal CF and on PWS: – Data Storage: MySQL, PostgreSQL, MongoDB, Redis, Riak, DataStax (Cassandra), Neo4J, Pivotal HD (Hadoop) – Messaging: Pivotal RabbitMQ

IBM BlueMix IBM Bluemix is a cloud platform as a service (PaaS) developed by IBM. It supports several programming languages and services as well as integrated DevOps to build, run, deploy and manage applications on the cloud. Bluemix is based on Cloud Foundry open technology and runs on SoftLayer infrastructure. Bluemix supports Java, Node.js, Go, PHP, Python, Ruby Sinatra, Ruby on Rails and can be extended to support other languages such as Scala through the use of buildpacks. Uses OpenStack or Docker (IBM Containers) Softlayer was a hosting service that was acquired by IBM to jump start their Cloud Services division Subsystems supported not clear – includes Cloudant (IBM NoSQL) and Hadoop

Ninefold Server company in Australia offering bare metal or Docker set up in 2011