2005 Ohio GIS Conference September 21-23, 2005 Marriott North Hotel Columbus, Ohio Geoprocessing for Animal Premises ID Luanne Hendricks State of Ohio.

Slides:



Advertisements
Similar presentations
Concepts of Maintaining Your Data Simple Ways to Edit Your Data By Lorne Woods.
Advertisements

Environmental GIS Nicholas A. Procopio, Ph.D, GISP Some slides from Lyna Wiggins (Rutgers University)
Why python? Automate processes Batch programming Faster Open source Easy recognition of errors Good for data management What is python? Scripting programming.
GIS Level 2 MIT GIS Services
A GIS Approach to Pedestrian Level of Service Natalia Domarad 14th TRB Transportation Planning Applications Conference Columbus, Ohio May 5-9, 2013.
GIS Topics and Applications
Agenda Overview Why TransCAD Challenges/tips Initiatives Applications.
Mobile Technology for Real Property Assessment Tax Assessor’s Office Davie County, North Carolina.
Python & ModelBuilder. Overview Python/ModelBuilder Concepts – The Geoprocessor – Checking some environment variables – Providing feedback from your model/script.
ModelBuilder In ArcGIS 9.x By Tim Weigel GEOG 407/607 April 3 rd, 2006.
Spatial Analysis, Geoprocessing,
Attribute databases. GIS Definition Diagram Output Query Results.
ModelBuilder at ArcGIS 9.2 Lyna Wiggins Rutgers University May 2008.
@ 2007 Austin Troy. Geoprocessing Introduction to GIS Geoprocessing is the processing of geographic information. Perform spatial analysis and modeling.
Querying an Avian Inventory Database and Visualizing the Results GEORGE WASHINGTON BIRTHPLACE NATIONAL MONUMENT NATIONAL PARK SERVICE NR 595D Final Project.
Rebecca Boger Earth and Environmental Sciences Brooklyn College.
Add a File with X, Y coordinates to MapWindow
Resources – MS Access Free Online Training Resources  Using an Access database to store and information (2 min)
From VS C# 2010 Programming, John Allwork 1 VS2010 C# Programming - DB intro 1 Topics – Database Relational - linked tables SQL ADO.NET objects Referencing.
Page 1 ISMT E-120 Desktop Applications for Managers Introduction to Microsoft Access.
MS Access Advanced Instructor: Vicki Weidler Assistant:
Arc: Programming Options Dr Andy Evans. Programming ArcGIS ArcGIS: Most popular commercial GIS. Out of the box functionality good, but occasionally: You.
Enterprise ETL & the Home Again
╬Cory Vardaman Project Manager ╬Joe Clark Assistant Manager ╬Lisa Albanese GIS Technician, Web Master ╬Ethan Roberts GIS Analyst, Graphics Design.
Python & ModelBuilder. Continuing Education Python and ModelBuilder Overview Python/ModelBuilder Concepts –The Geoprocessor –Checking some environment.
Preparing Data for Analysis and Analyzing Spatial Data/ Geoprocessing Class 11 GISG 110.
GIS Concepts ‣ What is a table? What is a table? ‣ Queries on tables Queries on tables ‣ Joining and relating tables Joining and relating tables ‣ Summary.
Attribute Data in GIS Data in GIS are stored as features AND tabular info Tabular information can be associated with features OR Tabular data may NOT be.
Self Guided Tour for Query V8.4 Basic Features. 2 This Self Guided Tour is meant as a review only for Query V8.4 Basic Features and not as a substitute.
1 Overview of Databases. 2 Content Databases Example: Access Structure Query language (SQL)
« Just Do It » A Parcel Data Integration Project WLIA Team Justin Conner, Wood County GIS Specialist Ian Grasshoff, Waupaca County GIS Coordinator/LIO.
Using OCR for Census Data Capture in China National Bureau of Statistics of China.
Winrunner Usage - Best Practices S.A.Christopher.
Introduction to ArcGIS for Environmental Scientists Module 3 – GIS Analysis Address Geocoding.
ITIS 3110 LAB 02 Scripts I. Lab IDs Your Lab ID is the same as your 49er ID The PW is different: a + your 800 id a Note: sometimes the a is not.
Normalization (Codd, 1972) Practical Information For Real World Database Design.
AUTOMATION OF WEB-FORM CREATION - KINNERA ANGADI – MS FINAL DEFENSE GUIDANCE BY – DR. DANIEL ANDRESEN.
Development Of A Stormwater Utility For The City Of Brunswick, Ohio John R. Woodard, MS GIS Specialist Chagrin Valley Engineering, Ltd Ohio GIS Conference.
Exploring an Open Source Automation Framework Implementation.
Introduction of Geoprocessing Topic 7a 4/10/2007.
WyGEO Conference September 17 & 18, About Me Shawn Lanning – GIS Research WyGISC – ModelBuilder Experience About You?
Exploring ArcToolbox Presented by: Isaac Johnson.
Relational Databases (MS Access)
Copyright © 2006 by Maribeth H. Price 8-1 Chapter 8 Geoprocessing.
GIS 1 GIS Lecture 4 Geodatabases Copyright – Kristen S. Kurland, Carnegie Mellon University.
Database and Data File Management Oct 6/7/8, 2010 Fall 2010 | / Recitation 2.
6 th Annual Focus Users’ Conference 6 th Annual Focus Users’ Conference Import Testing Data Presented by: Adrian Ruiz Presented by: Adrian Ruiz.
Map overlays & Geoprocessing Learn about spatial analysis functions overlays, clipping & buffering Use overlays to analyze multiple spatial criteria Understand.
Intro to GIS | Summer 2012 Attribute Tables – Part 1.
Database Management Supplement 1. 2 I. The Hierarchy of Data Database File (Entity, Table) Record (info for a specific entity, Row) Field (Attribute,
Data Coordinators Conference – 2014 Laura Marroquin CASEWORKER/JCMS Specialist Everything New Data Coordinators Should Know.
Introduction to Geographic Information Systems Fall 2013 (INF 385T-28620) Dr. David Arctur Research Fellow, Adjunct Faculty University of Texas at Austin.
Introduction of Geoprocessing Lecture 9 3/24/2008.
Climate-SDM (1) Climate analysis use case –Described by: Marcia Branstetter Use case description –Data obtained from ESG –Using a sequence steps in analysis,
GeoElections and Voter Focus Interoperability 14 th Annual GeoElections User Conference GeoElections Voter Registration System.
QC – User Interface QUALITY CENTER. QC – Testing Process QC testing process includes four phases: Specifying Requirements Specifying Requirements Planning.
William Perry U.S. Geological Survey Western Ecological Research Center Geography 375 Final Project May 22, 2013.
2016 CSO System Training & Networking Conference / Copyright © 2016 #csoconf 2016 CSO System Training & Networking Conference / Copyright © 2016 #csoconf.
Introduction to GIS Programming Final Project Submitted by Todd Lenkin Geography 375 Spring of 2011 American River College.
Key Terms Attribute join Target table Join table Spatial join.
Improving Georeferencing Workflow with Python
GEOG 375 Final Project Robert Abbotts Spring 2013.
Tan Hoang GEOG 362 – Final Project
Attribute Extraction.
Spatial Data Processing
Databases and Information Management
Computer Science Projects Database Theory / Prototypes
Vector Geoprocessing.
Automating Student Yield Data Extraction
Presentation transcript:

2005 Ohio GIS Conference September 21-23, 2005 Marriott North Hotel Columbus, Ohio Geoprocessing for Animal Premises ID Luanne Hendricks State of Ohio OIT/GISSC Intern Columbus State Community College

Overview Objective Source Data & Desired Outputs Timeline Tools and Automation Process Statistics Observations

Objective Geoprocessing Input: Source Data from County Auditors Output: - Normalized Parcel Data - Unique AG Owners

Output - Deliverables Normalized Parcel/Point Geodata – agricultural ( 100 <= LUC <= 199) – dairy (LUC = 103, 113) – residential ( 510 <= LUC <= 520, LUC = 560) Normalized Tabular Data (Access DB) –Table of unique ag owners with owner_id –Table of parcel data with owner_id Time Estimate to regenerate data annually

Example: Locate Residential Parcels of Ag Land Owners

Example: Select Parcels owned by Owner ID = 2894

Owner to Parcel Table Example

Source Data – Quantity/Quality Large volume of data – approx. 5 million source records – some counties had fields of data – approx. 5 GB of data Multiple source files per county Parcel, Point, CAMA data Non-standardized data fields Variable completeness

Example: Non-Normalized Source vs. Normalized Output

Processing – High Level View Data Collection from Counties Normalize Source Data Generate Owner Ids for Parcel Records Generate Owner Table Match Dairy Addresses to Parcel Table Create Project for User

Timeline First Pass EffortSeveral PT HC - Approx. 1 FT HC Tasks Data Collection & Geocoding NormalizingOwner IDsDairy Match Create Project MonthJanuaryFebruaryMarchAprilMay Second Pass Effort1 PT HC1 FT HC Tasks Identify Original Source used Manual Normalizing Automation Normalizing Owner IDs Owner Ids Dairy match Project MonthMayJuneJulyAugustSept.

Need Automation Strategy Need to automate process for: –Repeatability –Ease of modification –Testability –Traceability...As well as speed

ToolsProcessing Tasks ArcToolBox - Model BuilderScript development - Python - VBscript Pre-Normalization - Joining source files, - adding key id, -copying to working directory Pre-Owner ID Generation - Address Standardization - Rejoin Data file to Shapefile MS Access - VBA - Queries - SQL - Form Interface - Normalization - Owner ID & Owner Table - (Dairy Match)

Processing Detail - Example Pre-normalization steps in Model-Builder for a county with 2 source files – shape and CAMA that need to be joined. This county is now ready for normalization in Access. Slightly different steps are needed for point files and counties with a single source parcel shapefile.

Processing Detail - Example Continued Model-Builder has limitations – you can’t loop through these steps for a list of counties. But this model can be converted to script and coded to process a list. Additional field-name mapping steps needed due to “coarse-grained” geoprocessing object. Loop thru cnty list. Delete Temporary table view & layer Get Fields Make Field Map

Example of Geoprocessing Tool Limitations When you join fields in the geoprocessing environment, and create a new Feature Layer shapefile, field names are [original layer name].[field name] truncated to 10 characters. Renaming is not done automatically for you as it is when you join and create a new layer manually in ArcMap.

Python Script Example

Access Form Interface Used for Normalization

Example: Non-Normalized Source vs. Normalized Output

Normalization Mapping Table

Processing – Owner IDs Data Collection from Counties Normalize Source Data Generate Owner Ids for Parcel Records Generate Owner Table Match Dairy Addresses to Parcel Table Create Project for User

Owner ID and Owner Table Generation

Standardized vs. Un-standardized

Owner ID Algorithm Aggregate on Lastname, Firstname Standardize addresses For each Lastname,Firstname group, choose the address - OWNADD1, MAILADD1, or SITEADD, that produces the best set of matches

Statistics ORIG_REC = Total AG + Total Residential NOAD = # Records with no address information ADD_REC = Total # of AG + Total Residential associated with more than 1 parcel FINL_REC = Total # of AG + Total Residential associated with at least one AG pcl OWNR = # of Records in the Owner Table NMD_AG = Aggregate of OWNNAM1/MAILADD1 and OWNADD1/MAILADD1 as a sanity check and to compare how effective the processing was

Testing Use Statistics –Numbers make sense –Numbers add up, e.g.: All records in Parcel table assigned an ownerid # Records in Owner Table = # Aggregated on Owner Id in PCL table Visual Inspection –Visually inspect how Owner Ids were assigned –Create shapefile and view data in project –Spot check source vs. processed data in shapefiles

Status 53 counties normalized 40 counties have owner ids/owner table Dairy matching - to do Final project – to do

Example Project – Work in Progress

Observations and Conclusions (1) After initial development, Automation speeds process For example, using Form Interface to normalize: Data Normalization TimeData Volume Manual 1 st pass 6 day 1X Ag only Auto 2 nd pass 1 day 5X Ag + Res

Observations and Conclusions (2) Automation: –speeds process after initial development investment –enables repeatability of process –makes modification and redo less painful –increases data consistency –reduces errors –accurately documents process –increases future capability to do similar processing – tools are reusable Automation is cost effective

Observations and Conclusions (3) This job would be easier if: –Data was maintained in small standard components: Last Name, First Name, MI as separate fields Address components – SiteNum, SiteDir, SiteStr There was a standard for field names of components