Know your data source well. Who am I? Nik – Shahriar Nikkhah Microsoft MVP 2010 – SQL Server MCITP SQL 2008 MCTS SQL 2008 and 2005 s:

Slides:



Advertisements
Similar presentations
Little Used, but Powerful Features with GP Cathy Fregelette, CPA, PMP Practice Manager BroadPoint Technologies September 20, 2012.
Advertisements

CC SQL Utilities.
Introduction to OWB(Oracle Warehouse Builder)
Database management system (DBMS)  a DBMS allows users and other software to store and retrieve data in a structured way  controls the organization,
Introduction to ETL Using Microsoft Tools By Dr. Gabriel.
SpreadsheetML Basics.
Week 6: Chapter 6 Agenda Automation of SQL Server tasks using: SQL Server Agent Scheduling Scripting Technologies.
Templates and Styles Excel Advanced. Templates are pre- designed and formatted spreadsheets –They provide consistency of layout/structure –They.
Deep Dive into ETL Implementation with SQL Server Integration Services
SQL Server Integration Services Deep Dive Warren Stevens-Baytopp BI Architect - GijimaAst
2015/6/301 TransCAD Managing Data Tables. 2015/6/302 Create a New Table.
With Microsoft Access 2010 © 2011 Pearson Education, Inc. Publishing as Prentice Hall1 PowerPoint Presentation to Accompany GO! with Microsoft ® Access.
1 Chapter Overview Transferring and Transforming Data Introducing Microsoft Data Transformation Services (DTS) Transferring and Transforming Data with.
Working with Mail Merge
ICOLC Use Data Webinar Episode 2 TANSY MATTHEWS JULY 2010.
ETL By Dr. Gabriel.
Chapter 2 Querying a Database
SQL Server Integration Services (SSIS) Presented by Tarek Ghazali IT Technical Specialist Microsoft SQL Server (MVP) Microsoft Certified Technology Specialist.
What’s New in SSIS with SQL 2008 Bret Stateham Training Manager Vortex Learning Solutions blogs.netconnex.com.
Chapter 2 Querying a Database MICROSOFT ACCESS 2010.
Air Quality System Precision and Accuracy Data Transaction Generator (AQSP&A) Training Session.
Miscellaneous Excel Combining Excel and Access. – Importing, exporting and linking Parsing and manipulating data. 1.
Eurotrace Hands-On The Eurotrace File System. 2 The Eurotrace file system Under MS ACCESS EUROTRACE generates several different files when you create.
Chapter 1 Introduction to SAS ® Enterprise Guide ®
Understanding SSIS Control Flows Bret Stateham Training Manager Vortex Learning Solutions blogs.netconnex.com.
DTS Conversion to SSIS Conversion Best Practices Mike Davis
ISV Innovation Presented by ISV Innovation Presented by Business Intelligence Fundamentals: Data Cleansing Ola Ekdahl IT Mentors 9/12/08.
Data Management Console Synonym Editor
Embarquez les services d'intégration SQL Server 2005 Romelard Fabrice D311.
Advanced ETL: Embedding Integration Services Ashvini Sharma Development Lead DAT411 Microsoft Corporation Sergei Ivanov Technical Lead DAT411 Microsoft.
DAT 360: DTS in SQL Server 2000 Best Practices Euan Garden Group Manager, SQL Server Microsoft Corporation.
DataMAPPER - Applied Database Tech. 이화여대 과학기술대학원 석사 3 학기 992COG08 김지혜.
Automating the Cube Build/Publish Process Using CognosScript Brian Morris, CBIP BI-340 – Cutting Costs in Half by Automating the Cube Build ProcessCognos.
- Joiner Transformation. Introduction ►Transformations help to transform the source data according to the requirements of target system and it ensures.
1 Advanced Topics Using Microsoft SQL Server 2005 Integration Services Allan Mitchell – SQLBits – Oct 2007.
6 Copyright © 2009, Oracle. All rights reserved. Using the Data Transformation Operators.
Gold – Crystal Reports Introductory Course Cortex User Group Meeting New Orleans – 2011.
Access Chapter 1: Intro to Access Objectives Navigate among objects in Access database Difference between working in storage and memory Good database file.
DAY 21: MICROSOFT ACCESS – CHAPTER 5 MICROSOFT ACCESS – CHAPTER 6 MICROSOFT ACCESS – CHAPTER 7 Aliya Farheen October 29,2015.
Scenario Topics. Creating Work Item Lists Use a work item list: To track status for a team or group of tasks. To view or change work items offline. To.
NetTech Solutions Troubleshooting Office Applications Lesson Seven.
Building Data Integration Solutions with Integration Services Donald Farmer Group Program Manager Microsoft Corporation.
Chapter 9 Vocabulary Databases. 1.Table – a collection of information, or data arranged in columns and rows. 2.Record – all of the information about one.
© 2009 Wipro Ltd - Confidential ETL TESTING Handling Heterogeneous Data Formats Rajasimman Selvaraj Simanchal Sahu Tithi Mukherjee.
SSIS – Deep Dive Praveen Srivatsa Director, Asthrasoft Consulting Microsoft Regional Director | MVP.
MIS 451 Building Business Intelligence Systems Data Staging.
INCREMENTAL AGGREGATION After you create a session that includes an Aggregator transformation, you can enable the session option, Incremental Aggregation.
Jemini Joseph. About me Working in Microsoft BI field since Mostly consulting in SSIS Worked as programmer in Visual Basic before moving to BI
1 Record Linkage & Fuzzy Matching (More on "Blocking" for Performance Improvement) Joseph Vertido Melissa Data Fuzzy.
© 2012 Quest Software Inc. All rights reserved. The Undiscovered Country - I have Excel, I need Performance Point, I don’t know Analysis Services NYSPUG.
Lesson 17 Mail Merge. Overview Create a main document. Create a data source. Insert merge fields into a main document. Perform a mail merge. Use data.
Presented By: Jemini Joseph | June 18, Woodland Center Blvd. Tampa, Florida | Phone: | SSIS Factory.
SSIS ETL Data Resource Management. Create an ETL package using a wizard database server to database server The business goal of this ETL package is to.
Practical MSBI(SSIS, SSAS,SSRS) online training. Contact Us: Call: Visit:
Template Package  Presented by G.Nagaraju.  What is Template Package?  Why we use Template Package?  Where we use Template Package?  How we create.
Welcome To: Word Day 1 With Your Instructor: Cara Clifford Class will start at Approximately 8:05 AM.
SQL Server Tasks and Components from CozyRoc
ETL Design - Stage Philip Noakes May 9, 2015.
SQL Server Tasks and Components from CozyRoc
For Letters, Labels or s Mail Merge For Letters, Labels or s.
© 2016, Mike Murach & Associates, Inc.
WOCAT Mapping methodology
SQL Server Integration Services
EXCEL AND SSIS: BETTER TOGETHER
Data Scenario: Header and Details files
Chapter 7 Searching Your Products
Patterns and Best Practices in SSIS
Topics Today Capability Efficiency Troubleshooting
Handling Data Errors in a Dataflow Task
Presentation transcript:

Know your data source well

Who am I? Nik – Shahriar Nikkhah Microsoft MVP 2010 – SQL Server MCITP SQL 2008 MCTS SQL 2008 and s: msdn.microsoft.com (SSIS forum) One chapter on SSIS in MVP Deep dive 2 (Sep 2011)

OVERVIEW Know your data source well / Data cleansing 1. Chronological file order 2. Data cleansing 3. Check a few sample packages Error handling / notification 1. Capture error in a text file 2. error file as notification 3. One package sample A package with the combination of the above.

Know your data source well Analyze you data source from 2 different angles 1- Data point of view Data relations, field mapping, data value PK, FK, Index, Metadata, Dictionary (mapping) tables Good records and bad records (Redirecting) 2- Data source behavior Behavior changes (Table / file renaming and header names changes ) Delivery process, how does the source get made, provided and loaded. (CSV been open by excel and saved) Who is providing it.

Scenario on data behavior Data Point of view

Scenario on data behavior Data Point of view

Scenario on data behavior Data source behavior

Scenario on data behavior Data source behavior

Scenario on data behavior Data source behavior

Scenario on data behavior Files renamed and moved to different folders. Data source behavior Who is providing data source

Daily file load statistics Working days No. of Packages CVS / Excel, Load & Reload Excel Sheets Records per sheet (1,000) Total no. Records, Million Million record per day K, 10K Perfect world

Daily file load statistics Working days No. of Packages CVS / Excel, Load & Reload Excel Sheets Records per sheet (1,000) Total no. Records, Million Million record per day K, 10K Real world Files loaded per monthMonthly extra reload (Population reload) 6,300 – 1o,500 files / month2 – 3 reload a month = 12.6 – 31.5 files / month Loads Forecast Packages for the next yearNew customers Extra 200 (sum of 300 per customer)2 – 3 per year Reloads

Chronological file load Over 99% of the ETLs that have a file as a source don’t use chronological file load in the SSIS package.

Chronological file load Package overview.

Chronological file load Script that provides the files properties and information

Chronological file load Inside the DFT

Chronological file load Sort object

Chronological file load Set flag

Chronological file load Second For EachLoop Display script

Data cleansing Data cleansing and transformation Data flow transformation includes a series of data cleansing tool such as Joins Fuzzy Lookups Character mapping Data type conversion Derived columns Set of Boolean functions for data comparisons and replacement

Data cleansing

Error handling / notification Keep track of your packages when an error occurs Organize your error files Backup in the right folder Display the right Error message. Send a notification message to the right person The subject of the must be clear

Capture error files in a text file

SEE ATTACHED SAMPLE

notification Use SSIS Variables to set your SMTP object SEE ATTACHED SAMPLE