Near Real Time ETLs with Azure Serverless Architecture

Slides:



Advertisements
Similar presentations
Observation Pattern Theory Hypothesis What will happen? How can we make it happen? Predictive Analytics Prescriptive Analytics What happened? Why.
Advertisements

Lecture 8 – Platform as a Service. Introduction We have discussed the SPI model of Cloud Computing – IaaS – PaaS – SaaS.
Andy Roberts Data Architect
AZ PASS User Group Azure Data Factory Overview Josh Sivey, Solution Partner October
 Cloud Computing technology basics Platform Evolution Advantages  Microsoft Windows Azure technology basics Windows Azure – A Lap around the platform.
Microsoft Ignite /28/2017 6:07 PM
Energy Management Solution
Building Azure Mobile Apps
Mobile Application Solution
1/27/2018 5:13 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Connected Infrastructure
Join the Community
TV Broadcasting What to look for Architecture TV Broadcasting Solution
4/19/ :02 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Fan Engagement Solution
5/9/2018 7:28 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS.
Connected Living Connected Living What to look for Architecture
Smart Building Solution
Connected Maintenance Solution
The story of an IoT solution
Parcel Tracking Solution Parcel Tracking What to look for Architecture
Tulika Chaudharie / Harikharan Krishnaraju
Using Azure Functions to Build Nanoservices
Data-driven serverless apps with Azure functions
Data-driven serverless apps with Azure functions
Mobile App Trends: lifecycle, functions, and cognitive
Introducing Azure Functions
Smart Building Solution
Optimizing Edge-Cloud IoT Applications for Performance and Cost
Connected Maintenance Solution
Platform as a Service.
Connected Living Connected Living What to look for Architecture
Connected Infrastructure
Building Analytics At Scale With USQL and C#
Mobile Application Solution
9/6/2018 7:14 PM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS.
The Improvement of PaaS Platform ZENG Shu-Qing, Xu Jie-Bin 2010 First International Conference on Networking and Distributed Computing SQUARE.
Remote Monitoring solution
Nimble Streamer Helps Media Content Providers Create Streaming Networks Cost-Effectively and Easily by Utilizing Azure’s Worldwide Scalability MICROSOFT.
Energy Management Solution
SQL Server Integration Services
Exploring Azure Event Grid
Jeff Hollan Azure Functions – Serverless compute in the cloud
Designed for Big Data Visual Analytics, Zoomdata Allows Business Users to Quickly Connect, Stream, and Visualize Data in the Microsoft Azure Platform MICROSOFT.
Chapter 12: Automated data collection methods
What Azure have to offer for your data
Microsoft Azure P wer Lunch
Azure Event Grid with Custom Events
Outline Virtualization Cloud Computing Microsoft Azure Platform
Lecture 1: Multi-tier Architecture Overview
Learn. Imagine. Build. .NET Conf
XtremeData on the Microsoft Azure Cloud Platform:
Azure Data Lake for First Time Swimmers
Key Features Automated Self Healing
Microsoft Build /14/2019 8:42 AM © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY,
Practical WebJobs SDK and Extensions
Technical Capabilities
Serverless Architecture in the Cloud
2/19/2019 9:06 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Last.Backend is a Continuous Delivery Platform for Developers and Dev Teams, Allowing Them to Manage and Deploy Applications Easier and Faster MICROSOFT.
Developing Microsoft Azure Solutions Jump Start
5 Azure Services Every .NET Developer Needs to Know
Agenda Need of Cloud Computing What is Cloud Computing
Beyond orchestration with Azure Data Factory
Microsoft Azure Services Platform
Matthew Farmer Making Azure Integration Services Real
Alex Karcher 5 tips for production ready Azure Functions
Architecture of modern data warehouse
Presentation transcript:

Near Real Time ETLs with Azure Serverless Architecture Samara Soucy Innovative Architects 9/16/18

Samara Soucy Microsoft Certified Specialist – Programing in C# Software Development Consultant – Innovative Architects samarasoucy@gmail.com @oneangrypenguin Oneangrypenguin.com

What does it mean when a service is serverless? There are still servers (duh) Next iteration after PaaS Consumers don’t have to worry about resource management. Dynamic Scaling Pricing is usage based (Usually) easier to develop against

Cons Pros ETL via Microservices Lose performance because components must communicate rather than having a single process. Complicated system to maintain. Divide and Conquer Easier to scale just the parts that need it Removes the possibility of dependency hell Services are easy to understand and maintain.

Service Roles Source & Destination Event – “We’ve got data to work on.” Manager – Decides when an event will happen or controls process flow. Router – Moves events between services. May transmit data as well. Worker – Moves data between source and destination, may perform transforms. Monitor – Makes sure that the processes are all running and may check data quality. Utility – Performs background maintenance tasks (ex. Moving old data to an archive). ** A service will often fit into multiple roles.

Basic Architecture

Separate Manager and/or Router

ETL via Microservices pt. 2 Entrance to the ETL should be as close to the originating event as possible. If possible, event data should be possible to pass through Azure queue systems. Event Grid is the smallest at 64 KB per message. Each service should preform a single task. Usually. When it doesn’t create unessesary complexity. Balance is important. When chaining services together, consider whether or not you may want multiple things to trigger off a specific processing stage. If there is a posibility that you need to fan out or fan in at a specific point, make sure the data stream makes a stop at at a router. Take advantage of the Visual Studio projects for Azure serverless. Keeping the code for all the services in a single solution makes it easier to pull up all the code for your ETL system than clicking around in the portal.

Azure Serverless Services Meet the toys!

The Workers pt. 1 Logic Apps Functions Similar experience to SSIS (but better) Limited transform capabilities, but it can call Functions to perform that task. Strongest at managing process flow. Integrates with Azure services, Http, timers and many 3rd party APIs out of the box. No code required (minus any Functions integrations). Extends the capabilities of Web Jobs. Small processes built in C# or JS. Experimental support for other languages like Python, PHP, Powershell and others. Integrates natively with many other Azure services, HTTP, and timers triggers. Strongest tool for transforms, most flexible.

The Workers pt. 2 Stream Analytics Cognitive Services SQL-like queries against a data stream. Primarily attatches to Event Hub or Iot Hub, paired with Azure storage for contextual data. Allows for measures to be computed in real time. Strong for finding values out of range, trends, and possible fraud. Machine Learning as an API Lots of tools like speech to text, sentiment analysis, search, natural language processing, and many more. Integration with Stream Analytics- call many of the APIs directly from your SQL Query.

The Routers Event Grid Event Hub HTTPS input and push to subscribers. Offers some basic filtering of which events get pushed to which subscribers. HTTPS (in) or APMQ based message routing. Caches messages for a set period of time so they can be replayed. Default is 24 hours. Handles both batched messages and streams of data. If you are going to use Stram Analytics, you want Event Hub in your pipeline.

The Destinations Cosmos DB Azure Storage No SQL successor to DocumentDB Multiple storage types, multiple API options Scale on demand Globally distributed File and Blob storage Will be used to store code and logs for most of the other serverless offerings Used as a destination and as a way to store data that will be used in multiple services rather than passing it in the routers.

Application Insights (The Monitor) Comprehensive application monitoring and logging. Azure Funtions are closely integrated with App Insights, almost all metrics run through there. When using App Insight with Azure Funtions, consider turning down the sampling rate. You will end up spending significantly more on app insights than on your Functions May also act as a source since it provides a real time feed of the status of whatever application it is monitoring. Setup continuous export to Azure Storage to allow real time ingestion by a real time analytics feed.

Links https://azure.microsoft.com/en-us/overview/serverless- computing/ Presentation Links: https://nrtdemoweb.azurewebsites.net/ https://github.com/serri588/NRT_ETL_Demo