Presentation is loading. Please wait.

Presentation is loading. Please wait.

Geospatial Data Mining at University of Texas at Dallas Dr. Bhavani Thuraisingham (Computer Science) Dr. Latifur Khan (Computer Science) Dr. Fang Qiu (GIS)

Similar presentations


Presentation on theme: "Geospatial Data Mining at University of Texas at Dallas Dr. Bhavani Thuraisingham (Computer Science) Dr. Latifur Khan (Computer Science) Dr. Fang Qiu (GIS)"— Presentation transcript:

1 Geospatial Data Mining at University of Texas at Dallas Dr. Bhavani Thuraisingham (Computer Science) Dr. Latifur Khan (Computer Science) Dr. Fang Qiu (GIS) Students Shaofei Chen (GIS) Mohammad Farhan (CS) Shantnu Jain (GIS), Lei Wang (CS) Post Doc: Dr. Chuanjun Li This Research is Partly Funded by Raytheon

2 Outline l Case Study - ASTER Dataset - Technical Challenges - Sketches l Process of Our Approach - Pixel classification using SVM Classifiers - Ontology Driven Mining l Pixel Merging l Output l Related Work l Future Work

3 Case Study: Dataset l ASTER (Advanced Spaceborne Thermal Emission and Reflection Radiometer) - To obtain detailed maps of land surface temperature, reflectivity and elevation. l ASTER obtains high-resolution (15 to 90 square meters per pixel) images of the Earth in 14 different wavelengths of the electromagnetic spectrum, ranging from visible to thermal infrared light. l ASTER data is used to create detailed maps of land surface temperature, emissivity, reflectivity, and elevation.

4 Case Study: Dataset & Features l Remote sensing data used in this study is ASTER image acquired on 31 December 2005. - Covers northern part of Dallas with Dallas-Fort Worth International Airport located in southwest of the image. l ASTER data has 14 channels from visible through the thermal infrared regions of the electromagnetic spectrum, providing detailed information on surface temperature, emissive, reflectance, and elevation. l ASTER is comprised of the following three radiometers : - Visible and Near Infrared Radiometer (VNIR --band 1 through band 3) has a wavelength range from 0.56~0.86μm.

5 Case Study: Dataset & Features l Short Wavelength Infrared Radiometer (SWIR-- band 4 through band 9) has a wavelength range from 1.60~2.43μm. - Mid-infrared regions. Used to extract surface features. l Thermal Infrared Radiometer (TIR --band 10 through band 14) covers from 8.125~11.65μm. - Important when research focuses on heat such as identifying mineral resources and observing atmospheric condition by taking advantage of their thermal infrared characteristics.

6 ASTER Dataset: Technical Challenges l Testing will be done based on pixels l Goal: Region-based classification and identify high level concepts l Solution - Grouping adjacent pixels that belong to same class - Identify high level concepts using ontology-based mining

7 Sketches: Process of Our Approach ASTER Image Training Data Features (14/pixel) SVM Classifiers Test Data Features (14/pixel) All Pixel Data Features (14/pixel) Feature Extraction Feature Extraction Feature Extraction Classifier Training Validation Classification High Level Concepts Pixel Grouping

8 Process of Our Approach Testing Image Pixels SVM Classifier Pixel Merging Ontology Driven Mining Classified Pixels Concepts and Classes High Level Concepts Training Image Pixels

9 SVM Classifiers: Atomic Concepts Classes Train set Test set Water 1175 1898 Barren Lands 1005 1617 Grass 952 1331 Trees 887 1479 Buildings 1041 768 Road435 648 House15841364 # of instances70799105 Different Class Distribution of Training and Test Sets

10 Process of Our Approach Testing Image Pixels SVM Classifier Pixel Merging Ontology Driven Mining Classified Pixels Concepts and Classes High Level Concepts Training Image Pixels

11 Ontology-Driven Mining - Ontology will be represented as a directed acyclic graph (DAG). Each node in DAG represents a concept - Interrelationships are represented by labeled arcs/links. Various kinds of interrelationships are used to create an ontology such as specialization (Is-a), instantiation (Instance-of), and component membership (Part-of). Residential Apartment Single Family Home Multi-family Home IS-A Urban Part-of

12 Ontology-Driven Mining l We will develop domain-dependent ontologies - Provide for specification of fine grained concepts - Concept, “Residential Area” can be further categorized into concepts, “House”, “Grass” and “Tree” etc. l Generic ontologies provide concepts in coarser grain

13 Ontology Driven Mining Urban AreaResidential AreaOpen Area Target Area BuildingHouseWater Barren Land RoadGrass Tree

14 Challenges l Region growing - Find out regions of the same class - Find out neighboring regions - Merge neighboring regions - Not scalable l Irregular regions l Of different sizes l Hard to track boundaries or neighboring regions l Pixel merging - Only neighboring pixels considered - Pixels are converted into Concepts - Linear

15 Pixels Merging

16

17 Complexity l There are two iterations: - First iteration converts signature classes into Concepts - Second iteration converts remaining classes and isolated concepts into Dominating classes l Each pixels take O(1) time l Target area takes O(n) time, where n is the number of pixels in the target area l Example (next slide): - Signature classes: c1, c2, c3 - Non-signature class: c4 - Concepts: C1, C2, C3

18 Pixels Merging c1 c2 C2c1c2 C2c1c2 c1c3c2 c1c3c2 C2c3c2 c3c2 c3c2 c3c2 c3 c2c3 c2c3 c2c3 c4c3 c4c3 c4c3 c4 c3 c4 c3 c4 c3 (a)(b)(c) C2c1c2 C2c1c2 C2c1c2 C2c3c2 C2c3c2 C2c3c2 C2c3c2 C2c3c2 C2c3c2 c3 c2c3C3c3c2c3C3c3c2c3 c4c3 c4c3 C3c4c3 c4 c3 c4 c3 c4 c3 (d)(e)(f)

19 Implementation l Software: - ArcGIS 9.1 software. - For programming, we use Visual Basic 6.0 embedded in the software.

20 Output:

21 Output

22

23 Related Work l Classification (SVM) l Farid Melgani, Lorenzo Bruzzone, Classification of hyperspectral remote-sensing images with support vector machines. l Zhu, G. and D.G. Blumberg. (2002). Classification using ASTER data and SVM algorithms - The case study of Beer Sheva, Israel. l Huang C.; Davis L. S.; Townshend J. R. G. (2002) An assessment of support vector machines for land cover classification.

24 Future Work l Develop Full Fledged Prototype (By January 31, 2007) l Generate Rules automatically (By June 30, 2007) - Ripper–Semi-automatically - Association mining


Download ppt "Geospatial Data Mining at University of Texas at Dallas Dr. Bhavani Thuraisingham (Computer Science) Dr. Latifur Khan (Computer Science) Dr. Fang Qiu (GIS)"

Similar presentations


Ads by Google