Geographical Information Systems and Science Longley P A, Goodchild M F, Maguire D J, Rhind D W (2001) John Wiley and Sons Ltd 7. Generalization, Abstraction,

Slides:



Advertisements
Similar presentations
REQUIRING A SPATIAL REFERENCE THE: NEED FOR RECTIFICATION.
Advertisements

Beyond Metadata: Towards User- Centric Description of Data Quality Michael F. Goodchild University of California Santa Barbara.
Data Models There are 3 parts to a GIS: GUI Tools
Geographical Information Systems and Science Longley P A, Goodchild M F, Maguire D J, Rhind D W (2001) John Wiley and Sons Ltd 3. Representing Geography.
Copyright, © Qiming Zhou GEOG1150. Cartography Data Models for Computer Cartography.
Center for Modeling & Simulation.  A Map is the most effective shorthand to show locations of objects with attributes, which can be physical or cultural.
Geographical Information Systems and Science Longley P A, Goodchild M F, Maguire D J, Rhind D W (2001) John Wiley and Sons Ltd 9. Geographic Data Modeling.
WFM 6202: Remote Sensing and GIS in Water Management © Dr. Akm Saiful IslamDr. Akm Saiful Islam WFM 6202: Remote Sensing and GIS in Water Management Akm.
Geographical Information Systems and Science Longley P A, Goodchild M F, Maguire D J, Rhind D W (2001) John Wiley and Sons Ltd 6. Uncertainty © John Wiley.
University of Wisconsin-Milwaukee Geographic Information Science Geography 625 Intermediate Geographic Information Science Instructor: Changshan Wu Department.
3D and Surface/Terrain Analysis
Geog 458: Map Sources and Errors January Representing Geography.
Cartographic and GIS Data Structures
Raster Data. The Raster Data Model The Raster Data Model is used to model spatial phenomena that vary continuously over a surface and that do not have.
Geographic Information Systems and Science SECOND EDITION Paul A. Longley, Michael F. Goodchild, David J. Maguire, David W. Rhind © 2005 John Wiley and.
Geographic Information Systems
GIS Geographic Information System
Geographic Information Systems. What is a Geographic Information System (GIS)? A GIS is a particular form of Information System applied to geographical.
Week 17GEOG2750 – Earth Observation and GIS of the Physical Environment1 Lecture 14 Interpolating environmental datasets Outline – creating surfaces from.
Geographic Information Systems : Data Types, Sources and the ArcView Program.
©2005 Austin Troy. All rights reserved Lecture 3: Introduction to GIS Part 1. Understanding Spatial Data Structures by Austin Troy, University of Vermont.
Week 10. GIS Data structure II
1 Spatial Databases as Models of Reality Geog 495: GIS database design Reading: NCGIA CC ’90 Unit #10.
Lecture 4. Interpolating environmental datasets
Using ESRI ArcGIS 9.3 3D Analyst T I N
Lineage February 13, 2006 Geog 458: Map Sources and Errors.
GI Systems and Science January 23, Points to Cover  What is spatial data modeling?  Entity definition  Topology  Spatial data models Raster.
Data Acquisition Lecture 8. Data Sources  Data Transfer  Getting data from the internet and importing  Data Collection  One of the most expensive.
Spatial Analysis University of Maryland, College Park 2013.
Rebecca Boger Earth and Environmental Sciences Brooklyn College.
The Nature of Geographic Data Based in part on Longley et al. Ch. 3 and Ch. 4 up to 4.4 (Ch. 4 up to 4.6 to be covered in Lab 8) Library Reserve #VR 100.
Spatial Data Model: Basic Data Types 2 basic spatial data models exist vector: based on geometry of points lines Polygons raster: based on geometry of.
©2005 Austin Troy. All rights reserved Lecture 3: Introduction to GIS Understanding Spatial Data Structures by Austin Troy, Leslie Morrissey, & Ernie Buford,
Spatial data models (types)
CORSE '07 Spatial Data Spatial data comes in many forms. So How does a GIS work with the data so that it can put the data in the right place on a map?
GIS 1110 Designing Geodatabases. Representation Q. How will we model our real world data? A. Typically: Features Continuous Surfaces and Imagery Map Graphics.
Map Scale, Resolution and Data Models. Components of a GIS Map Maps can be displayed at various scales –Scale - the relationship between the size of features.
Point to Ponder “I think there is a world market for maybe five computers.” »Thomas Watson, chairman of IBM, 1943.
Applied Cartography and Introduction to GIS GEOG 2017 EL Lecture-2 Chapters 3 and 4.
Basic Geographic Concepts GEOG 370 Instructor: Christine Erlien.
GIS Data Quality.
Spatial Analysis.
Schematic representation of weekend activities of three children in Cheshunt, UK. The horizontal dimensions represent geographic space (rendered using.
Chapter 3 Digital Representation of Geographic Data.
8. Geographic Data Modeling. Outline Definitions Data models / modeling GIS data models – Topology.
How do we represent the world in a GIS database?
Support the spread of “good practice” in generating, managing, analysing and communicating spatial information Introduction to GIS for the Purpose of Practising.
Raster Concepts.
Introduction to Cartographic Modeling
Chapter 8 – Geographic Information Analysis O’Sullivan and Unwin “ Describing and Analyzing Fields” By: Scott Clobes.
Geographic Information Systems in Water Science Unit 4: Module 16, Lecture 3 – Fundamental GIS data types.
1 Spatial Data Models and Structure. 2 Part 1: Basic Geographic Concepts Real world -> Digital Environment –GIS data represent a simplified view of physical.
Current and Potential Uses for GIS in Academic Arctic Research Michael F. Goodchild University of California Santa Barbara.
GIS Data Structures How do we represent the world in a GIS database?
The Nature of Geographic Data Based in part on Longley et al. Chapters 3 and 4.
INTRODUCTION TO GIS  Used to describe computer facilities which are used to handle data referenced to the spatial domain.  Has the ability to inter-
What is GIS? “A powerful set of tools for collecting, storing, retrieving, transforming and displaying spatial data”
Raster Data Models: Data Compression Why? –Save disk space by reducing information content –Methods Run-length codes Raster chain codes Block codes Quadtrees.
Spatial Data Models Geography is concerned with many aspects of our environment. From a GIS perspective, we can identify two aspects which are of particular.
The Nature of Geographic Data Longley et al. Chapters 3 and 4.
Chapter 1: GIS Data Outline Representing the world as a map Coordinate systems Map scale Data quality issues About ArcGIS.
Rayat Shikshan Sanstha’s Chhatrapati Shivaji College Satara
Spatial Data 1-Introduction to GIS 5/9/2018 © J.M. Piwowar
INTRODUCTION TO GEOGRAPHICAL INFORMATION SYSTEM
Statistical surfaces: DEM’s
Geospatial Data models
Take Notes as you view the slides
Spatial interpolation
Schematic representation of weekend activities of three children in Cheshunt, UK. The horizontal dimensions represent geographic space (rendered using.
Generalization Abstraction And Method
Presentation transcript:

Geographical Information Systems and Science Longley P A, Goodchild M F, Maguire D J, Rhind D W (2001) John Wiley and Sons Ltd 7. Generalization, Abstraction, and Metadata © John Wiley & Sons Ltd

Outline Introduction Generalization basics Methods of generalization Measuring the degree of generalization Metadata

Can a Database be Perfect? The real world is infinitely complex a perfect description would have to be infinitely large and complex A geographic database must always approximate, generalize, abstract, or simplify we have many ways of doing this in GIS

How Hilly is Iowa? Iowa is relatively flat compared say to Colorado, or Switzerland, or Nepal people often think of it as flat Suppose the “slope” attribute in a database is given the value 0 for an object representing the state of Iowa this is a crude approximation it is much simpler than recording the slope at 30m intervals across the state it may be good enough for some purposes

GIS Compresses the Real World Representations are almost always “lossy” It is important to know how much loss has occurred by measuring the difference between the data and the real world we term this uncertainty, or the degree to which data leave us uncertain about the real world

Metadata are the Ultimate in Compression They describe the entire contents of a data set metadata are data about data the documentation and handling instructions for data Metadata are what make data useful without documentation and handling instructions data would have no value to a user without metadata it would be impossible to find data in a library or on the WWW

Generalization and Fields Many geographic phenomena are conceptualized as fields exactly one value of the phenomenon exists at every point in space think of elevation and land ownership as convenient examples In principle a field can take a different value everywhere creating an infinite amount of information Tobler’s Law helps by virtually guaranteeing that variation will be smooth and slow over space

Six Ways of Representing a Field All involve some kind of approximation or generalization All reduce the variation of the field to a set of objects and attributes that now look similar to phenomena conceptualized as discrete objects but the conceptualizations are very different

The six approximate representations of a field used in GIS. A. Regularly spaced sample points. B. Irregularly spaced sample points. C. Rectangular cells. D. Irregularly shaped polygons. E. Irregular network of triangles, with linear variation over each triangle (the Triangulated Irregular Network or TIN model; the bounding box is shown dashed in this case because the unshown portions of complete triangles extend outside it). F. Polylines representing contours. ABC DEF

Map Specifications Topographic maps are prepared by mapping agencies using specifications specific to each scale a scale’s specification sets the rules for representing real-world features on the map these rules involve generalization and approximation If a map meets its specification it can be said to be perfectly accurate even though its contents do not match the real world perfectly

Methods of generalization McMaster and Shea (1992) define 10 distinct types of generalization Generalization can affect a database permanently (database generalization) or can be temporary for the purpose of display (cartographic generalization)

Weeding Simplifying the shape of a line or an area by reducing the number of points in its representation The Douglas-Poiker algorithm drops points from a polyline or a polygon using a user-defined tolerance distance

A 2 3 B Tolerance The first two steps of the Douglas-Poiker algorithm. The endpoints of the polyline are first connected (A), and the point lying furthest from this line is found. If it lies further than the user-supplied tolerance distance, it is selected as a member of the simplified line, along with the two endpoints, and a new cycle of the algorithm is started. In the next cycle Points 2 and 3 lie within the tolerance of the line , but Point 7 does not. 7

In the final step 7 points remain (identified with green disks), including 1, 4, 7, and 15. No points are beyond the user- defined tolerance distance from the line.

Merging Another common form of generalization by aggregating adjacent areas Small areas can be generalized by removing any that fall below a user-defined threshold known as the Minimum Mapping Unit or MMU such areas are merged with their most similar neighbors

Measuring the Degree of Generalization Representative Fraction the ratio of distance on the map to distance on the ground also known as the scale e.g., 1:50,000 every 10 cm on the map correspond to 5 km on the ground

Scale for Digital Databases How can a digital database have a representative fraction if there are no distances to be measured in the database? A system of conventions allows digital databases to have scales e.g., use the scale of the map that was digitized or scanned to create the database

Minimum Mapping Unit Area can be a misleading indicator of importance e.g., a riparian zone along a stream

Spatial Resolution The smallest distance over which change is recorded Easily defined for raster data, but not for vector e.g., if census reporting zones vary greatly in area, what is the spatial resolution of census data? Resampling can create false spatial resolution e.g., dividing every pixel into 4 does not necessarily give finer spatial resolution spatial resolution is defined by the process of observation, not by such transformations as resampling

Example of resampling. The original cells outlined in black have been resampled to the cells outlined in red. New attributes of each cell have been assigned using the largest area rule.

Example of resampling an existing DEM to obtain a new DEM with shorter spacing between sample points. The black dots are the new DEM sample points, and the existing DEM provided mean elevations for each red square. The apparent improvement in spatial resolution as a result of resampling may not be justified.

Metadata Needed to automate the process of search for data compare using a library catalog Needed to determine the fitness of a data set for use particularly regarding quality Needed to handle data effectively e.g., format Needed to identify notable data contents e.g., to find images of an interesting hurricane

Metadata can be Expensive to Generate They represent a high level of abstraction and may need an expert to define But the benefits are substantial metadata make it possible to find data sets, and use them effectively they allow the benefits of investments in data to be realized

The U.S. FGDC Standard Content Standards for Digital Geospatial Metadata (CSDGM) Defined by a committee of U.S. Federal agencies Now widely used worldwide The basis of a new international standard Potentially several hundred items for one data set but easily boiled down to a much smaller number

The Dublin Core Standard Devised by the digital library community Suitable for any type of data, geospatial included easily extended to include essential items for geospatial data e.g., the latitude and longitude limits of the data set’s coverage

Geolibraries Repositories of data that can be searched for data covering geographic areas of interest this was very difficult in a conventional library using a card catalog each data set in a geolibrary is identified with a geographical footprint in a search, footprints are matched to the area of interest defined by the user

The Alexandria Digital Library The user picks an area of interest by interacting with a map, specifying latitude and longitude limits, or giving a place name. The library returns all data sets whose footprints match the query area, and which match other criteria also supplied by the user.

Collection-level Metadata Metadata describe each data set and allow users to search geolibraries such as Alexandria But how does the user know which geolibrary to search? collection-level metadata describe the contents of entire collections