unit-2_GIS Data Model
unit-2_GIS Data Model
GIS does not store a map in any conventional sense. Instead GIS stores the data
from which we can draw a desired view to suit a particular purpose.
There are two types of data in GIS
Spatial Data (Location of a particular feature)
Attribute data (information about features. E.g. name of roads, forest type etc.)
GIS integrates the two data types and allows the users to derive new data for
planning.
Spatial Models are important in that way in which information is represented,
affects the type of analysis performed and types of graphic display that can be
obtained.
The two data models common in GIS are Vector data model and Raster data
Model.
Basic Concepts:
1) All geographic data can be represented by three basic entities:
i) Point
ii) Line
iii) Area or Polygon plus a label saying what is it
E.g.
An oil well could be represented by a single point consisting of X, Y coordinates.
Road – represented by a series of X, Y coordinates
1
Forest – represented by a set of X, Y coordinates plus the label forest. The label
could be actual name or a special symbol.
2) Layers and Coverages:
GIS organize spatial data into layers or coverages
Typical layers represent information belonging to particular class. E.g. Roads,
Rivers, Vegetation types are different layers.
All the layers or coverages pertaining to an area are referenced to a common
projection system
The layers can be combined with each other in various ways to create new layers
that are functions of individual layers.
Land use
Settlement
Drainage
Road
3) Data Model:
In order to represent the spatial information and their attributes, a Data Model – a
set of logical definitions or rules for characterizing the geographical data is
adopted
The Data Model represents the linkages between the real world domain of
geographical data and the computer and GIS representation of these features.
2
As a result, Data Model, not only helps in organizing the real world geographical
features into a systematic storage/retrieval mechanism, but also helps in
capturing user’s perception of these features.
2.2 Conceptual Models of Spatial Information:
There are different models – which have influenced the way in which data are
organized and processed within GIS.
They are based on Objects, Network and Fields.
3
Sub-class
Classification
Spatailly
Related to
Equivalent
Aspatailly Real World
Phenomena
Related to
SubPart
Temporally Coordinate
Related to Geometry
Equivalen
t
Network Model:
Network based spatial module share some aspects of the object based module in
that “they often deal with discrete phenomenon”.
But the essential characteristic is the need to consider interaction between
multiple objects, often along discrete path or routes that connect them.
The exact shape of phenomena is not of much importance. What is important is
some measure of distance and impendence (interaction) between specified
phenomena.
4
Network Based Model
E.g. Studies of Traffic on road, analysis of flow of water, flow of electricity etc.
Field Model:
Field Based model is appropriate for modeling phenomena that are regarded as
continuously variable across some region of space.
E.g. concentration of pollutants in the air, temperature of ground surface,
moisture level of soil, elevation of ground etc.
Field model may represent either 2 or 3 dimensions depending upon the
applications.
2.3 Vector Based Model:
A vector based GIS is defined by the vectorial representation of its geographic
data. According with the characteristics of this data model, geographic objects
are explicitly represented and, within the spatial characteristics, the thematic
aspects are associated.
The vector representation of an object is an attempt to represent the object as
exactly as possible.
The geographical phenomena are represented by three basic entities along with
their attributes.
5
Point – City – population, no. of school, no. of houses etc.
Line – Road – Type of road, road name etc.
Area – Landuse – class, soil type etc.
The coordinate space is assumed to be continuous, allowing all positions, lengths
and dimensions to be defined precisely.
The vector data structure represents each geographical feature by a set of
coordinates.
The basic thing is to define a 2D space where coordinates on the two axes
represent features.
Point Features:
A zero-dimensional abstraction of an object represented by a single X,Y co-
ordinate. A point normally represents a geographic feature too small to be
displayed as a line or area; for example, the location of a building location on a
small-scale map, or the location of a service cover on a medium scale map.
Besides the X, Y coordinate, other data must be stored to indicate what kind of
point it is and other information associated with it. Fig. 1 shows a typical point
data stored in GIS.
Fig.1
Line Features:
A set of ordered co-ordinates that represent the shape of geographic features too
narrow to be displayed as an area at the given scale (contours, street centrelines,
6
or streams), or linear features with no area (county boundary lines). A lines is
synonymous with an arc.
Simplest line required the storage of begin point and end point. (Two X, Y
coordinates plus a possible record). An arc, a chain or string is asset of n X, Y
coordinate pairs describing a continuous complex line.
Shorter the line segment and larger the no. of X,Y coordinate pairs, the closer the
chain will approximate a complex curve. Fig.2
Fig: 2
7
I
Fig: 3
Topology:
Topology describes the spatial relationships between the objects and to carryout the
measurements. (Length and Area).
e.g. spatial relationship of connectivity and adjacency are examples of topological
relationships.
Fig: 4 a Fig: 4 b
9
Fig: 4
Since each cell is associated with a value called cell value or pixel value, it is
very easy to carry out overlay operations to compare attributes recorded in
different layers.
Each attribute associated with a grid cell can be combined logically or
arithmetically with attributes in corresponding cells of the other layers to create a
new attribute value for the resulting overlay.
Transitional areas are poorly represented by raster-based model.
The choice between raster and vector based model depend upon the type of data
analysis and other operations to be carried out for a project. However, there is always
scope to convert one form to other. i.e., raster to vector or vector to raster
Raster method for spatial data structure requires large memory space as
compared to vector data.
Certain kinds of data manipulation such as polygon intersection, union, clipping,
merging etc are complex in raster data model as compared to vector.
However, multi-theme overlay operations are more easy in raster data model.
Similarly, representation of surfaces is more common in raster-based model.
Vector Data Model:
Advantages:
Good and real representation of geographic data
Compact data structure
Topology can be completely described
Accurate graphic output
Less storage space.
Disadvantages:
Data structure is complex
Combination of several vector polygons create difficulties in handling
Simulation is difficult because each unit has a different topological form
Display and plotting are expensive.
Advantages:
Simple data structure
The overlay of mapped data with remote sensing data is easy
Simulation is easy because each spatial unit has same size and shape
Good for multiplayer overlay.
Disadvantages:
Data is voluminous and require large storage space
Use of large cell to reduce data volume loses significant information
Crude raster maps have ugly look
Network linkages are difficult to establish.
called child.
Department
Job Employee
Description
12
Education Background Education Job History
Required Required
Example:
This approach is efficient if all desired access paths follow the parent child
linkage.
However, it requires a relatively inflexible structure and hence linkage with other
branch of database is tough or cumbersome. That is why this data base structure
is not very common in flexible GIS.
2. Network Structure:
Network structure exists when child in a data relationship has more than one
parent.
An item in such a structure can be linked to any other item.
It is good for network-based analysis.
Example:
Author 1 Author 2
3. Relational Structure:
In this case data are organized in 2D tables consisting of rows and columns. The
rows are called records and columns are called items or fields.
Such tables are easy to develop and understand.
Different sets of tables are created within database and a relationship is
established between each table.
Because of this, it is easy to create a subset of data fro one user or to join two
tables for other user to form a large table.
The structure can be described mathematically, hence mathematics provide the
basis for extracting some columns from the table and for joining various
columns.
This capability to manipulate relations provides flexibility that is normally not
available in hierarchy and network structures.
Relational Operators:
Retrieval of data sets from relational model involves creation of new relation,
which is a characteristic of permanently stored relations.
There are several relations algebra operators that can be used to search and
manipulate relations.
These operators are implemented by means of Structured Query Language (SQL)
using number of commands.
E.g SELECT Settlement Name, wareda name
FROM Settlement
Will create a new table from Settlement table that consists of only settlement name
and wareda name.
SELECT
FROM Settlement
WHERE wareda name = zero arath
Important Features of Relational database:
i) Primary and Foreign Keys
ii) Relational Joins
Name Id Marks
Table -2
16