CSV Files and ETL: The Good, Bad, and Ugly

Presentation transcript:

CSV Files and ETL: The Good, Bad, and Ugly. Eric Freeman.

Comma-Separated Values: Overview

CSV Overview: CSV stands for comma-separated values. Plain text. Delimited text file. Each line is a new record. Not fully standardized!

CSV Evolution: 1972. IBM Fortran compiler under OS/360. Input lists: commas or spaces only.

CSV Evolution: 1983. Osborne Executive computer with the SuperCalc spreadsheet. Added quoted field containers.

CSV Evolution: 2005. RFC 4180 (standardization initiative). Common Format and MIME Type for CSV Files.

RFC 4180 Standardization Initiative: Each record is delimited by a line break. The last record may end with a line break. Headers are optional and have the same number of fields as the records. Double quotes may enclose fields: "abc","def","ghi" or abc,def,ghi. A double quote inside a quoted field is escaped by doubling it: "abc","de""f","ghi"
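The quoting rules on this slide can be checked with a short sketch. It is written in Python rather than the PowerShell used in the talk, and it assumes the standard library csv module, whose default dialect follows the same doubled-quote convention. The sample rows are the slide's own examples, written with straight quotes.

```python
import csv
import io

# The slide's three example records, separated by CRLF line breaks.
raw = 'abc,def,ghi\r\n"abc","def","ghi"\r\n"abc","de""f","ghi"\r\n'

# newline="" leaves line-break handling to the csv module.
rows = list(csv.reader(io.StringIO(raw, newline="")))

for row in rows:
    print(row)

# Writing the third record back with every field quoted doubles the
# embedded quote again.
out = io.StringIO(newline="")
csv.writer(out, quoting=csv.QUOTE_ALL).writerow(rows[2])
encoded = out.getvalue()
print(repr(encoded))
```

The quoted and unquoted forms of the first record parse to the same three fields, and the doubled quote in the third record comes back as a single quote character inside the field.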

CSV Overview: The basic concept is clear: line breaks, commas, quotes, and an escape character.

PowerShell CSV Functions: Export-Csv -InputObject <PSObject> [[-Path] <String>] [-LiteralPath <String>] [-Force] [-NoClobber] [-Encoding <String>] [-Append] [[-Delimiter] <Char>] [-IncludeTypeInformation] [-NoTypeInformation] [-WhatIf] [-Confirm]

Demo

PowerShell CSV Functions: Import-Csv [[-Delimiter] <Char>] [[-Path] <String[]>] [-LiteralPath <String[]>] [-Header <String[]>] [-Encoding <String>]
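The talk's demos are not part of this transcript. As a rough illustration of what the -Header and -Delimiter parameters are for, here is a Python analogue, not PowerShell: csv.DictReader takes column names for a file that has no header row, plus a custom delimiter. The sample data and column names are hypothetical.

```python
import csv
import io

# Hypothetical headerless, semicolon-delimited data (not from the talk).
raw = "1;Ada;London\n2;Grace;New York\n"

reader = csv.DictReader(
    io.StringIO(raw, newline=""),
    fieldnames=["id", "name", "city"],  # plays the role of -Header
    delimiter=";",                      # plays the role of -Delimiter
)
records = list(reader)

for record in records:
    print(record["id"], record["name"], record["city"])
```

Each record comes back as a mapping from column name to string value, which is roughly comparable to the objects Import-Csv emits with one property per column.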

Demo

The Good: Simple file, comma delimiters only. BULK INSERT.

Demo

The Bad: Huge CSV file with a consistent format. BULK INSERT with a format file.

Demo

The Ugly: Huge CSV file with a changing format. Embedded quotes. May contain duplicate column names.
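Two of these problems, embedded quotes and duplicate column names, can be sketched in a few lines of Python. This is one possible approach, not the method shown in the talk's demo. The helper unique_headers and the sample data are hypothetical, and the sketch does not address a changing format.

```python
import csv
import io
from collections import Counter


def unique_headers(names):
    """Append _2, _3, ... to repeated column names so each key is distinct."""
    seen = Counter()
    result = []
    for name in names:
        seen[name] += 1
        result.append(name if seen[name] == 1 else f"{name}_{seen[name]}")
    return result


# Hypothetical file: the "name" column appears twice, and one field holds
# both a comma and embedded quotes.
raw = 'id,name,name\r\n1,"Smith, ""Bob""",Robert\r\n'

reader = csv.reader(io.StringIO(raw, newline=""))
header = unique_headers(next(reader))
records = [dict(zip(header, row)) for row in reader]

print(header)
print(records)
```

Renaming the repeats before building dictionaries matters because a plain mapping keyed on the raw header would silently keep only the last "name" value. The suffix scheme is deliberately simple and would collide if the file already contained a column called name_2.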

Demo