Presentation is loading. Please wait.

Presentation is loading. Please wait.

By Serena Coetzee and Magnus Rademeyer presented at the ICC 2009, Santiago, Chile, November 2009 Testing the.

Similar presentations


Presentation on theme: "By Serena Coetzee and Magnus Rademeyer presented at the ICC 2009, Santiago, Chile, November 2009 Testing the."— Presentation transcript:

1 by Serena Coetzee scoetzee@cs.up.ac.za and Magnus Rademeyer magnus@afrigis.co.za presented at the ICC 2009, Santiago, Chile, November 2009 Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names by Serena Coetzee scoetzee@cs.up.ac.za and Magnus Rademeyer magnus@afrigis.co.za presented at the ICC 2009, Santiago, Chile, November 2009scoetzee@cs.up.ac.zamagnus@afrigis.co.zascoetzee@cs.up.ac.zamagnus@afrigis.co.za

2 Overview Problem statement Address matching with a spatial adjacency match Test runs Results Conclusion Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009

3 Problem statement Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Alphanumeric matching 101 Rubida Street, Murrayfield incorrectly matched to 110 Rubida Street, Murrayfield 

4 Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Problem statement Alphanumeric matching only causes errors (previous slide) Potential solution: attribute relaxation (i.e. ignore suburb) Most common cause of errors (Goldberg et al. 2007)

5 Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 With spatial adjacency match Intiendo = alphanumeric matching + spatial adjacency match Improves geocoding results Alphanumeric match: propose matched address from reference dataset Above threshold? Yes, proposed matched address is result No, search for street number in radius around proposed address

6 Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 With spatial adjacency match NO YES

7 With spatial adjacency match Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009

8 Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 With spatial adjacency match 1.Geocode without SpatialAdjacencyMatch (Non-spatial run) 2.Geocode with SpatialAdjacencyMatch enabled (Spatial run) Compare results

9 Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 With spatial adjacency match Sample input address data  14,760 address records  Test for misleading names  Therefore include only addresses for which province, suburb, street name and street number are populated

10 With spatial adjacency match Intiendo hierarchy database Reference dataset: AfriGIS address data Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 With spatial adjacency match

11 Intiendo settings Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Test runs

12 Results Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Results

13 Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Results Spatial runNon-spatial run Customer address records14,670 Matched address records8,905 (61%)8,514 (58%) Non-matched address records5,765 (39%)6,156 (42%) 3% is low but improvement can be significant, e.g. address on different sides of a highway

14 Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Results Specific example

15 Results 35 Voortrekker Road 16 Voortrekker Road Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Results

16 Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Results Misleading suburb names in address Alphanumeric match only causes errors Intiendo = alphanumeric + spatial adjacency match More input addresses are matched more accurately Improves quality of results Sample test runs: 3% improvement

17 Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Conclusion Intiendo address matching = alphanumeric string matching + spatial adjacency match Improves quality of results More addresses matched more accurately This work Specific sample dataset showed improvement Future More tests to understand average percentage improvement

18 Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Acknowledgements Christopher Ueckermann from AfriGIS for running the geocoding tests with Intiendo


Download ppt "By Serena Coetzee and Magnus Rademeyer presented at the ICC 2009, Santiago, Chile, November 2009 Testing the."

Similar presentations


Ads by Google