Presentation is loading. Please wait.

Presentation is loading. Please wait.

Building Data Integration Solutions with Integration Services Donald Farmer Group Program Manager Microsoft Corporation.

Similar presentations


Presentation on theme: "Building Data Integration Solutions with Integration Services Donald Farmer Group Program Manager Microsoft Corporation."— Presentation transcript:

1 Building Data Integration Solutions with Integration Services Donald Farmer Group Program Manager Microsoft Corporation

2 Agenda  Integration Services Overview  Building Packages Demo  SSIS Lifecycle  Tools, management, security, deployment  Managing Packages Demo  Troubleshooting  Log data, error flows,  Summary

3  Enterprise ETL platform  High performance  High scale  Best in class usability  Rich development environment  Source control  Visual debugging of control flow and data  Great range of transforms out-of-the-box  Highly extensible  Custom tasks  Custom enumerations  Custom transformations  Custom data sources Integration Services Breakthrough ETL Capabilities

4 Data Integration Architecture Before Integration Services Call centre data: semi structured Legacy data: binary files Application database ETL Warehouse Reports Mobile data Data mining Alerts and escalation  Integration and warehousing require separate, staged, operations.  Preparation of data requires different, often incompatible, tools.  Reporting and escalation is a slow process, delaying smart responses.  Heavy data volumes make this scenario increasingly unworkable. Hand coding Staging Text Mining ETL Staging Cleansing & ETL Staging ETL

5 Data Integration Architecture With Integration Services Call centre: Semi-structured data Legacy data: Binary files Application database Alerts and escalation  Integration and warehousing are a seamless, manageable, operation.  Sourced, prepare and load data in a single, auditable process.  Reporting and escalation can be parallelized with the warehouse load.  Scales to handle heavy and complex data requirements. SQL Server Integration Services Text mining components Custom source Standard sources Data cleansing components Merges Data mining components Warehouse Reports Mobile data Data Integration Architecture With Integration Services

6 How SQL Server Integration Services Works  Data sources can be diverse, including custom or scripted adapters  Transformation components shape and modify data in many ways.  Data is routed by rules or error conditions for cleansing and conforming.  Flows can be as complex as your business rules, but highly concurrent.  And finally data can be loaded in parallel to many varied destinations.

7  Data Cleansing  Provides data mining and AI expertise  Domain-independent data cleansing  Fuzzy lookup  Lookup on approximate matches  Tune for best match  De-duplication  Eliminate approximate duplicates  “Windows XP”, “WinXP”, etc.  Tune for confidence  Managing Slowly Changing Dimensions  E.g. Sales organization changes  E.g. Customer movement  E.g. Product category changes SQL Server Integration Services New Paradigm for the ETL Platform

8 Building Packages

9 SSIS Life Cycle tools Feature summary  Design the SSIS ‘Package’  Business Intelligence Studio (visual Studio)  Migration wizard for pre SQL 2005 packages  Visual Source Safe Integration  Deployment/Execution  Deployment Utility to copy packages  Command Line execution (dtexec.exe and dtexecui.exe)  Flexible Configuration Options  Supportability  Rich per package Logging (Log Providers)  SQL Management Studio for monitoring running packages and organizing stored packages (using SSIS windows service).  Checkpoint - Restart ability

10 Sample Server Layout Destination data Source data SSIS package error rows SSIS package Logging SSIS packages stored in SQL SSIS Package Execution SSIS support Servers Packages on file system

11 Logging and Log Providers  Log entries are a blend of status and result messages.  Can select what ‘details’ per control flow object within each package (e.g. OnError, OnWarning, OnPreExecute)  Can select what fields (e.g. computer, operator, ExecutionID…)  Can define multiple log providers (SQL, text file, Windows Event..) per package  BIDS has a Log Events window to see the log entries that are headed for the log provider

12 SSIS Windows Service  Monitors and manages running as well as stored packages, via SQL Management Studio  Service Installed when you install SSIS  Service is not required to design or execute Packages  Stored Packages tree is based on XML configuration file  You can customize the file contents, name, and location  Windows events for service (service start, service failed to start, package started, package stopped…)

13 Overview of SSIS security  ‘Security’ is comprised of several layers to support both SQL and File System Based scenarios.  Packages can be Encrypted  Packages can be Digitally Signed  Packages can be stored in SQL DB and protected with SQL Roles

14 Overview of SSIS security  (A) A package can be encrypted via the Package Protection level property as well as stored inside MSDB, where access to the package is limited by SQL Database roles OS Permissions File and folder access control for file based packages View/stop running packages Package Protection Level Property Reader and Writer roles for packages stored in MSDB SQL DB Roles A B  (B) A package can be encrypted via the Package Protection level property as well as access to the package file is controlled by folder/file permissions at the operating system level Encrypt or clear sensitive properties Package Signature

15 Deployment Flow  Tools to organize and ‘copy’ packages and supporting files Design Package Add Configurations Add Miscellaneous files Set Project Deployment properties Build Project Installation Wizard Execute manifest file Choose Destination (SQL File System) Modify protection level Choose location of supporting files Change configurations Bi Studio Copy/Move Deployment folder\files You Create desired agent jobs SQL Agent

16 Deploying SSIS packages  Designer can build a deployment file set which includes a project’s package(s), configuration files, and installer files  You move the installer file set to another server\environment and execute, to install packages to SQL or the File system.  Deployment is not a version sync\check tool such as SMS  Installer can ‘copy’ miscellaneous files but will not ‘install’ them, such as custom component.

17 SQL Management Studio  Requires the SSIS service  Allows Monitoring and Stopping of currently Executing packages  Maintain stored package structure and set roles for SQL stored packages  You can connect\view multiple SSIS servers at one time  Adhoc Package execution from folder tree

18 Log Data for Troubleshooting  Logging and Error Flow data are core for troubleshooting  Can save\load logging detail templates  Children packages bubble entries to parent  Package IDs need to be unique. You can re- generate the ID via BIDS or dtutil.exe  Example: This will regenerate package IDs for all packages in a folder…be it 2 or 200:  for %f in (C:\_work\SSISPackages\_quick\Notepad\*.dt sx) do dtutil.exe /i /File %f

19 Error Flows in Data Flow Task  Error flows can be controlled per field per row (fail component, redirect, ignore)  Just another flow…to your destination  Error flows can all be directed to a central location for centralized operations  Includes error code and column id  Can couple with a Row Sampling transform

20 Other Troubleshooting ideas  Performance Counters (SQLServer:SSISPipeline and SQLServer:SSISService:)  Integration with Microsoft ® Operations Manager  SQL Agent has log data as well and proxies to limit package execution  General package design to log row counts, multicast, and save variables  Webcast: Scalability, Performance and Optimization in SSIS

21 Summary  SQL Server Integration Services is an exceptionally high performance integration and transformation tool  Some processes benefit more from parallelism, some from memory  32-bit performance and scale is best increased by parallelism  64-bit enables highly scalable memory operations

22 For More Information  Analysis Services TechCenter  http://www.microsoft.com/technet/prodtechnol/sql/200 5/technologies/ssisvcs.mspx http://www.microsoft.com/technet/prodtechnol/sql/200 5/technologies/ssisvcs.mspx http://www.microsoft.com/technet/prodtechnol/sql/200 5/technologies/ssisvcs.mspx  Developer Center  http://msdn.microsoft.com/sql/bi/integration/default.as px http://msdn.microsoft.com/sql/bi/integration/default.as px http://msdn.microsoft.com/sql/bi/integration/default.as px  Great information available at www.sqlis.com www.sqlis.com  Project Real  http://www.microsoft.com/sql/solutions/bi/projectreal.m spx http://www.microsoft.com/sql/solutions/bi/projectreal.m spx http://www.microsoft.com/sql/solutions/bi/projectreal.m spx  On-demand Webcasts  http://www.microsoft.com/events/series/sqlserverbi.ms px http://www.microsoft.com/events/series/sqlserverbi.ms px http://www.microsoft.com/events/series/sqlserverbi.ms px

23 © 2005 Microsoft Corporation. All rights reserved. This presentation is for informational purposes only. Microsoft makes no warranties, express or implied, in this summary.


Download ppt "Building Data Integration Solutions with Integration Services Donald Farmer Group Program Manager Microsoft Corporation."

Similar presentations


Ads by Google