Presentation is loading. Please wait.

Presentation is loading. Please wait.

2005 Ohio GIS Conference September 21-23, 2005 Marriott North Hotel Columbus, Ohio Geoprocessing for Animal Premises ID Luanne Hendricks State of Ohio.

Similar presentations


Presentation on theme: "2005 Ohio GIS Conference September 21-23, 2005 Marriott North Hotel Columbus, Ohio Geoprocessing for Animal Premises ID Luanne Hendricks State of Ohio."— Presentation transcript:

1 2005 Ohio GIS Conference September 21-23, 2005 Marriott North Hotel Columbus, Ohio Geoprocessing for Animal Premises ID Luanne Hendricks State of Ohio OIT/GISSC Intern Columbus State Community College

2 Overview Objective Source Data & Desired Outputs Timeline Tools and Automation Process Statistics Observations

3 Objective Geoprocessing Input: Source Data from County Auditors Output: - Normalized Parcel Data - Unique AG Owners

4 Output - Deliverables Normalized Parcel/Point Geodata – agricultural ( 100 <= LUC <= 199) – dairy (LUC = 103, 113) – residential ( 510 <= LUC <= 520, LUC = 560) Normalized Tabular Data (Access DB) –Table of unique ag owners with owner_id –Table of parcel data with owner_id Time Estimate to regenerate data annually

5 Example: Locate Residential Parcels of Ag Land Owners

6 Example: Select Parcels owned by Owner ID = 2894

7 Owner to Parcel Table Example

8

9 Source Data – Quantity/Quality Large volume of data – approx. 5 million source records – some counties had 40-50 fields of data – approx. 5 GB of data Multiple source files per county Parcel, Point, CAMA data Non-standardized data fields Variable completeness

10 Example: Non-Normalized Source vs. Normalized Output

11 Processing – High Level View Data Collection from Counties Normalize Source Data Generate Owner Ids for Parcel Records Generate Owner Table Match Dairy Addresses to Parcel Table Create Project for User

12 Timeline First Pass EffortSeveral PT HC - Approx. 1 FT HC Tasks Data Collection & Geocoding NormalizingOwner IDsDairy Match Create Project MonthJanuaryFebruaryMarchAprilMay Second Pass Effort1 PT HC1 FT HC Tasks Identify Original Source used Manual Normalizing Automation Normalizing Owner IDs Owner Ids Dairy match Project MonthMayJuneJulyAugustSept.

13 Need Automation Strategy Need to automate process for: –Repeatability –Ease of modification –Testability –Traceability...As well as speed

14 ToolsProcessing Tasks ArcToolBox - Model BuilderScript development - Python - VBscript Pre-Normalization - Joining source files, - adding key id, -copying to working directory Pre-Owner ID Generation - Address Standardization - Rejoin Data file to Shapefile MS Access - VBA - Queries - SQL - Form Interface - Normalization - Owner ID & Owner Table - (Dairy Match)

15 Processing Detail - Example Pre-normalization steps in Model-Builder for a county with 2 source files – shape and CAMA that need to be joined. This county is now ready for normalization in Access. Slightly different steps are needed for point files and counties with a single source parcel shapefile.

16 Processing Detail - Example Continued Model-Builder has limitations – you can’t loop through these steps for a list of counties. But this model can be converted to script and coded to process a list. Additional field-name mapping steps needed due to “coarse-grained” geoprocessing object. Loop thru cnty list. Delete Temporary table view & layer Get Fields Make Field Map

17 Example of Geoprocessing Tool Limitations When you join fields in the geoprocessing environment, and create a new Feature Layer shapefile, field names are [original layer name].[field name] truncated to 10 characters. Renaming is not done automatically for you as it is when you join and create a new layer manually in ArcMap.

18 Python Script Example

19 Access Form Interface Used for Normalization

20 Example: Non-Normalized Source vs. Normalized Output

21 Normalization Mapping Table

22 Processing – Owner IDs Data Collection from Counties Normalize Source Data Generate Owner Ids for Parcel Records Generate Owner Table Match Dairy Addresses to Parcel Table Create Project for User

23 Owner ID and Owner Table Generation

24 Standardized vs. Un-standardized

25 Owner ID Algorithm Aggregate on Lastname, Firstname Standardize addresses For each Lastname,Firstname group, choose the address - OWNADD1, MAILADD1, or SITEADD, that produces the best set of matches

26 Statistics ORIG_REC = Total AG + Total Residential NOAD = # Records with no address information ADD_REC = Total # of AG + Total Residential associated with more than 1 parcel FINL_REC = Total # of AG + Total Residential associated with at least one AG pcl OWNR = # of Records in the Owner Table NMD_AG = Aggregate of OWNNAM1/MAILADD1 and OWNADD1/MAILADD1 as a sanity check and to compare how effective the processing was

27 Testing Use Statistics –Numbers make sense –Numbers add up, e.g.: All records in Parcel table assigned an ownerid # Records in Owner Table = # Aggregated on Owner Id in PCL table Visual Inspection –Visually inspect how Owner Ids were assigned –Create shapefile and view data in project –Spot check source vs. processed data in shapefiles

28 Status 53 counties normalized 40 counties have owner ids/owner table Dairy matching - to do Final project – to do

29 Example Project – Work in Progress

30 Observations and Conclusions (1) After initial development, Automation speeds process For example, using Form Interface to normalize: Data Normalization TimeData Volume Manual 1 st pass 6 day 1X Ag only Auto 2 nd pass 1 day 5X Ag + Res

31 Observations and Conclusions (2) Automation: –speeds process after initial development investment –enables repeatability of process –makes modification and redo less painful –increases data consistency –reduces errors –accurately documents process –increases future capability to do similar processing – tools are reusable Automation is cost effective

32 Observations and Conclusions (3) This job would be easier if: –Data was maintained in small standard components: Last Name, First Name, MI as separate fields Address components – SiteNum, SiteDir, SiteStr There was a standard for field names of components


Download ppt "2005 Ohio GIS Conference September 21-23, 2005 Marriott North Hotel Columbus, Ohio Geoprocessing for Animal Premises ID Luanne Hendricks State of Ohio."

Similar presentations


Ads by Google