CHAPTER 10 DATA EXPLORATION 10.1 Data Exploration Box 10.1 Data Visualization 10.1.1 Descriptive Statistics Box 10.2 Descriptive Statistics 10.1.2 Graphs.

Slides:



Advertisements
Similar presentations
RGS-IBG Online CPD course in GIS Analysing Data in ArcGIS Session 6.
Advertisements

Analyzing Bivariate Data With Fathom * CFU Using technology with a set of contextual linear data to examine the line of best fit; determine and.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Concepts of Database Management Sixth Edition
Data Storage and Processing GIS Topics and Applications.
Descriptive Statistics In SAS Exploring Your Data.
Attribute databases. GIS Definition Diagram Output Query Results.
What Geoprocessing? Geoprocessing is the processing of geographic information. Commonly used to describe a process when geographic objects are manipulated.
Chapter Extension 6 Using Microsoft Access © 2008 Pearson Prentice Hall, Experiencing MIS, David Kroenke.
Intro. To GIS Lecture 6 Spatial Analysis April 8th, 2013
SPSS Statistical Package for the Social Sciences is a statistical analysis and data management software package. SPSS can take data from almost any type.
Basic Spatial Analysis
A lesson approach © 2011 The McGraw-Hill Companies, Inc. All rights reserved. a lesson approach Microsoft® Access 2010 © 2011 The McGraw-Hill Companies,
How to Analyze Data? Aravinda Guntupalli. SPSS windows process Data window Variable view window Output window Chart editor window.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Applied Cartography and Introduction to GIS GEOG 2017 EL
Exploring your geospatial data. It’s all about Relationships!
Data Exploration Chapter 9. Introduction  Where to begin?  Data exploration is data-centered query and analysis  Better understand the data and provide.
Introduction to ArcGIS for Environmental Scientists Module 2 – Fundamentals Chapter 7 – Queries.
CHAPTER 12 RASTER DATA ANALYSIS 12.1 Data Analysis Environment
Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
Chapter 8. ATTRIBUTE DATA INPUT AND MANAGEMENT
Concepts of Database Management Seventh Edition
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
ArcGIS: ArcMap Tables. Agenda Opening tables The interface Working with columns Working with records Making selections Advanced table tools ▫Add fields.
Lecture 2: Data Exploration Jianfei Chen School of Geographical Sciences GuangZhou University GunagZhou, China
Data Queries Selecting features in ArcMap Data queries  Important part of a GIS project Can be a part of your data preparation or final analysis  Data.
Queries Select by Attribute Select by Location. What is a Query? A query extracts information from a data table for further use –Once extracted you can:
Copyright © 2006 by Maribeth H. Price 6-1 Chapter 6 Queries.
CHAPTER 11 VECTOR DATA ANALYSIS 11.1 Buffering
Query and Reasoning. Types of Queries Most GIS queries will select spatial features Query by Attribute (Select by Attribute) –Structured Query Language.
School of Geography FACULTY OF ENVIRONMENT Querying with ArcGIS.
NSF DUE ; Wen M. Andrews J. Sargeant Reynolds Community College Richmond, Virginia.
Selecting features in ArcMap
城市空间信息技术 第十章 数据探查 胡嘉骢 不动产学院 博士 副教授 城市规划系主任 手机 : ( ) QQ:
Applied Cartography and Introduction to GIS GEOG 2017 EL Lecture-5 Chapters 9 and 10.
Descriptive Statistics ( )
Elementary Statistics
Elementary Statistics
Re-introduction to GIS
Visual Basic 2010 How to Program
MATH-138 Elementary Statistics
GIS Institute Center for Geographic Analysis
Getting Started with GIS Analysis Module 6
Basic Spatial Analysis
Laugh, and the world laughs with you. Weep and you weep alone
Elementary Statistics
IET 603 Quality Assurance in Science & Technology
Descriptive Statistics:
Spatial Data Processing
Preliminaries: -- vector, raster, shapefiles, feature classes.
David M. Kroenke and David J
Data Queries Raster & Vector Data Models
Linear transformations
Location and Attribute Queries
Research Statistics Objective: Students will acquire knowledge related to research Statistics in order to identify how they are used to develop research.
URBDP 422 Urban and Regional Geo-Spatial Analysis
Martin O'Neill, Wolfram Schultz  Neuron 
Introduction Previous lessons have demonstrated that the normal distribution provides a useful model for many situations in business and industry, as.
GIS Institute Center for Geographic Analysis
Tutorial 7 – Integrating Access With the Web and With Other Programs
Querying your geodata. Tools to improve your search for knowledge.
Grid and Nongrid Cells in Medial Entorhinal Cortex Represent Spatial Location and Environmental Features with Complementary Coding Schemes  Geoffrey W.
GIS Institute Center for Geographic Analysis
Databases and Information Management
ESRM 250/CFR 520 Autumn 2009 Phil Hurvitz
GEO 481 Lab Geographical Information Systems Spring 2019
Essentials of Statistics 4th Edition
Presentation transcript:

CHAPTER 10 DATA EXPLORATION 10.1 Data Exploration Box 10.1 Data Visualization Descriptive Statistics Box 10.2 Descriptive Statistics Graphs Dynamic Graphics 10.2 Map-Based Data Manipulation Box 10.3 Geovisualization Data Classification Spatial Aggregation Map Comparison 10.3 Attribute Data Query Box 10.4 Query Methods in ArcGIS SQL (Structured Query Language) Query Expressions Type of Operation Examples of Query Operations Relational Database Query 10.4 Spatial Data Query Feature Selection by Cursor Feature Selection by Graphic Feature Selection by Spatial Relationship Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.

Box 10.5 Expressions of Spatial Relationship in ArcMap Combining Attribute and Spatial Data Queries 10.5 Raster Data Query Query by Cell Value Query by Select Features Key Concepts and Terms Review Questions Applications: Data Exploration Task 1: Select Feature by Location Task 2: Make Dynamic Chart Task 3: Query Attribute Data from a Joint Attribute Table Task 4: Query Attribute Data from a Relational Database Task 5: Combine Spatial and Attribute Data Queries Task 6: Query Raster Data Challenge Question References

DATA EXPLORATION Centered on the original data, data exploration allows a researcher to examine the general trends in the data, to take a close look at data subsets, and to focus on possible relationships between data sets. Data exploration takes advantage of interactive and dynamically linked visual tools. Maps, graphs, and tables are displayed in multiple windows and dynamically linked so that selecting records from a table will automatically highlight the corresponding features in a graph and a map.

Figure 10.1 A line graph.

Figure 10.2 A histogram.

Figure 10.3 A cumulative distribution graph.

Figure 10.4 A scatterplot plotting % persons 18 years old in 2000 against % population change, 1990–2000. A weak positive relationship, with a correlation coefficient of 0.376, is present between the two variables.

Figure 10.5 A bubbleplot showing % population change, 1990–2000, along the x-axis; % persons under 18 years old in 2000 along the y-axis; and state population in 2000 by the bubble size.

Figure 10.6 A boxplot based on the % population change, 1990–2000, data set.

Figure 10.7 Boxplot (a) suggests that the data values follow a normal distribution. Boxplot (b) shows a positively skewed distribution with a higher concentration of data values near the high end. The x’s in (b) may represent outliers, which are more than 1.5 box lengths from the end of the box. Boxplot (c) shows a negatively skewed distribution with a higher concentration of data values near the low end.

Figure 10.8 A QQ plot plotting % population change, data value against the standardized value from a normal distribution.

Figure 10.9 A 3-D plot showing annual precipitation at 105 weather stations in Idaho. A north to south decreasing trend is apparent in the plot.

Figure The scatterplot on the left is dynamically linked to the map on the right. The “brushing” of two data points in the scatterplot highlights the corresponding states (Washington and New Mexico) on the map.

MAP-BASED DATA MANIPULATION Maps are an important part of GIS operations including data exploration. Map-based data manipulation includes data classification, spatial aggregation, and map comparison.

Figure Two classification schemes: above or below the national average (a), and mean and standard deviation (SD) (b).

Figure Two levels of spatial aggregation: by state (a), and by region (b).

Figure A bivariate map: (1) rate of unemployment in 1997, either above or below the national average, and (2) rate of income change, 1996–1998, either above or below the national average.

ATTRIBUTE DATA QUERY Attribute data query retrieves a data subset by working with attribute data. The selected data subset can be simultaneously examined in the table, displayed in charts, and linked to the highlighted features in the map. The selected data subset can also be saved for further processing.

SQL SQL (structured query language) is a data query language designed for relational databases. The basic syntax of SQL includes the following: select, from, and where. ArcGIS has already prepared the keywords in the dialog for querying a local database. Therefore, we only have to enter the where clause (commonly called the query expression) in the dialog box.

Figure PIN relates the owner and parcel tables and allows use of SQL with both tables.

QUERY EXPRESSIONS Query expressions consist of Boolean expressions and connectors. A simple Boolean expression contains two operands and a logical operator such as Parcel.PIN = ‘P101’. Boolean connectors are AND, OR, XOR, and NOT, which are used to connect two or more expressions in a query statement. Boolean connectors of NOT, AND, and OR are actually keywords used in the operations of Complement, Intersect, and Union on sets in probability.

Figure The shaded portion represents the complement of data subset A (top), the union of data subsets A and B (middle), and the intersection of A and B (bottom).

TABLE 10.1 A Data Set for Query Operation Examples CostSoiltypeAreaCostSoiltypeArea 1Ns15006Tn4300 2Ns15007Tn4200 3Ns14008N3200 4Tn44009N3100 5Tn430010N3100

TYPE OF OPERATION Attribute data query begins with a complete data set. A basic query operation is to select a subset. Given a selected data subset, three types of operations can act on it: add more records to the subset, remove records from the subset, and select a smaller subset

Figure Three types of operation may be performed on the subset of 40 records: add more records to the subset (+2), remove records from the subset (-5), or select a smaller subset (20).

RELATIONAL DATABASE QUERY Relational database query works with a relational database. A query of a table not only selects a data subset in the table but also selects records related to the subset in other tables. To query a relational database, we must be familiar with the overall structure of the database, the designation of keys in relating tables, and a data dictionary listing and describing the fields in each table.

Figure The keys relating three dBASE files in the SSURGO database and the soil map attribute table.

SPATIAL DATA QUERY Spatial data query refers to the process of retrieving a data subset from a layer by working directly with features. We may select features using a cursor, a graphic, or the spatial relationship between features. Spatial relationships used from query include containment, intersect, and proximity.

Figure Select features by a circle centered at Sun Valley.

RASTER DATA QUERY Raster data can be queried by cell value and select features.

Figure Raster data query: slope = 2 and aspect = 1. Selected cells are coded 1 and others 0 in the output raster.

gnuplot U.S. Census Bureau