Module Notes - GIS
Module Notes - GIS
BASICS OF GIS
Overview
Geography is the study of the Earth and everything found on it. It focuses on where things are
located, why they are found there, and how they relate to each other. Geography studies both
physical features, such as mountains, rivers, climate, and soils, and human features, such as
settlements, roads, population, and land use. It also examines the patterns and interactions between
people and their environment across space.
An Information System (IS) is an organized way of collecting, storing, processing, and managing
data so that it can be transformed into useful information. Information systems use computers,
software, data, and procedures to help users analyze information and make decisions. Examples of
information systems include student record systems, banking systems, and inventory management
systems.
Therefore, GIS combines geography and information systems to manage and analyze information
that is linked to specific locations on the Earth’s surface.
Definition of GIS
1
In simple terms, GIS helps answer fundamental geographic questions such as:
Evolution of GIS
The evolution of Geographic Information Systems (GIS) has unfolded through several distinct
stages, each characterized by advancements in technology, functionality, and application:
GIS emerged in Canada with the establishment of the Canada Geographic Information System
(CGIS), a pioneering effort led by Roger Tomlinson. This system was primarily used for land
inventory and resource management, enabling government agencies to manage natural resources
more effectively. At this time, data processing capabilities were quite limited due to rudimentary
computer hardware, which resulted in slow data handling and analysis.
This period witnessed the commercial development of early GIS software, with software such as
ARC/INFO, created by Environmental Systems Research Institute (ESRI), becoming prominent.
The accuracy and functionality of GIS tools improved significantly, leading to their increased
adoption by government agencies, research institutions, and private companies. A key innovation
during this era was the introduction of digitizing maps, which allowed for the conversion of analog
data into digital formats, thereby enhancing the ability to store, manipulate, and analyze
geographic information.
The 1990s marked a significant transformation in GIS, as software became more user-friendly and
accessible to a broader audience, including planners and engineers. This era saw the integration of
Geographic Positioning System (GPS) technology and remote sensing data, allowing for more
precise location tracking and environmental monitoring. GIS applications expanded into various
2
fields, such as urban planning, environmental impact assessments, and utilities management,
where they helped in decision-making processes and strategic planning.
The advent of the Internet revolutionized GIS, giving rise to web-based GIS applications that
offered dynamic online maps and cloud-based data storage. This shift enabled widespread access
to geographic data and improved spatial analysis capabilities. Mobile GIS emerged as a powerful
tool for real-time data collection and field surveys, allowing users to gather and share data
instantly. Additionally, the integration of GIS with big data analytics, unmanned aerial vehicles
(drones), and artificial intelligence has significantly enhanced the sophistication and utility of GIS
technologies, enabling advanced predictive modeling and immersive spatial analyses in various
sectors, including urban development, disaster response, and environmental conservation.
b) Concepts of GIS
The concepts of GIS are the basic ideas that explain how GIS works and how geographic
information is handled, analyzed, and presented. The main concepts of GIS include the following:
Geographical data, also known as Geodata, is information that describes features on the Earth’s
surface and their locations. It includes:
• Spatial data – shows the position, shape, and size of features such as points (wells), lines
(roads), and polygons (lakes or districts).
• Attribute data – provides descriptive information about those features, such as names,
population, land use type, or road condition.
3
2. Coordinate Systems
A coordinate system is a method used to identify the exact position of a place or feature on the
Earth’s surface. It provides a reference framework that allows locations to be described using
numbers instead of names, making mapping and spatial analysis accurate and consistent in GIS.
In GIS, coordinate systems are essential because they ensure that different maps and datasets align
correctly and represent real-world locations accurately.
A geographic coordinate system uses a spherical or ellipsoidal model of the Earth to define
locations using angles. It is based on:
• Latitude – measures the distance north or south of the Equator, expressed in degrees.
• Longitude – measures the distance east or west of the Prime Meridian, expressed in
degrees.
Geographic coordinate systems are commonly used for global mapping and GPS data.
A projected coordinate system represents the curved surface of the Earth on a flat, two-dimensional
map. It uses:
Projected coordinate systems are usually measured in meters, making them useful for distance,
area, and direction calculations. Examples include UTM (Universal Transverse Mercator) and
national grid systems.
4
• Accurately locate and map geographic features anywhere in the world
• Combine data from different sources without misalignment
• Measure distances, areas, and directions correctly
• Support accurate spatial analysis and decision-making
In summary, coordinate systems provide the foundation for GIS mapping by defining where
features are located on the Earth’s surface. Without coordinate systems, accurate mapping and
spatial analysis would not be possible.
3. Data Analysis
Data analysis in GIS involves examining and processing geographic data to produce new
information and knowledge. GIS analysis can answer questions such as:
Common GIS analysis methods include buffering, overlay analysis, and spatial queries.
4. Data Layers
GIS organizes information into layers, where each layer represents a single type of information or
theme. For example:
• A road layer
• A river layer
• A land parcel layer
By stacking and combining these layers, GIS helps users understand complex geographic
relationships more easily.
5
5. Spatial Relationships
Spatial relationships describe how geographic features are related to each other in space. These
relationships include:
GIS helps identify patterns and trends in geographic data, such as population growth or land use
change. Using these patterns, GIS can also support prediction, helping to estimate future conditions
like urban expansion or environmental changes.
7. Map
A map is a visual representation of geographic data. In GIS, maps are used to:
8. Scale
Scale refers to the relationship between distances on a map and actual distances on the ground. It
affects the level of detail shown:
6
9. Resolution
Resolution refers to the detail and accuracy of spatial data. Higher resolution data shows clearer
and more precise features, while lower resolution data shows less detail. Resolution is especially
important in raster data such as satellite images.
c) Components of GIS
A Geographical Information System (GIS) is made up of five main components. All these
components work together to collect, process, analyze, and display geographic information.
1. Hardware
Hardware refers to the physical equipment used in GIS. These are the tools that allow users to
enter data, process it, and produce outputs.
• Computers and servers – used to run GIS software and store data
• GPS devices – used to collect location data in the field
• Scanners – used to convert paper maps into digital form
• Plotters and printers – used to produce large maps
2. Software
Software consists of computer programs that allow users to work with geographic data. GIS
software helps in:
7
• ArcGIS
• QGIS
• GRASS GIS
The software provides the tools that make GIS analysis and mapping possible.
3. Data
Data is the most important component of GIS. Without data, GIS cannot be used.
• Spatial data – shows location and shape of features such as maps, satellite images, roads,
rivers, and boundaries
• Attribute data – provides descriptive information about features, such as names,
population, land use type, or road condition
4. People
The success of GIS depends greatly on the skills, knowledge, and experience of the people using
it. Even with good software and data, GIS will not be effective without trained users.
5. Methods (Procedures)
Methods are the rules and steps followed when working with GIS. They include:
8
• Standards to ensure accuracy and consistency
Proper methods help ensure that GIS work is organized, accurate, and reliable.
All five components—hardware, software, data, people, and methods—must work together for
GIS to function successfully.
GIS performs several important functions that support mapping and spatial analysis.
1. Data Input
This is the process of entering data into the GIS. Data can be captured using:
• Digitizing maps
• GPS field surveys
• Remote sensing (satellite and aerial images)
• Ground surveys
GIS stores data in an organized way, often in databases. This makes it easy to:
• Retrieve information
• Update data
• Manage large amounts of spatial data efficiently
4. Data Analysis
GIS can analyze spatial data to produce new information. Common GIS analyses include:
9
• Network analysis – finding shortest routes
• Suitability analysis – identifying best locations for specific activities
• Population maps
• Land use maps
• Elevation maps
Maps can be produced in digital format or printed form, making information easy to understand
and share.
6. Decision Support
One of the most important functions of GIS is to support decision-making. GIS helps planners,
managers, and policymakers to:
Through its components and functions, GIS plays a vital role in modern mapping, planning, and
resource management.
Although the terms Geographic Information Science (GI Science) and Geographic Information
Systems (GIS) are closely related, they do not mean the same thing. Understanding the difference
between them helps beginners clearly see what GIS does and the science that supports it.
Geographic Information Science (GI Science) is the scientific field that studies geographic
information itself. It focuses on understanding how geographic data is collected, represented,
analyzed, and interpreted. GI Science is concerned with the theories, principles, and methods
behind spatial data. It asks questions such as: How accurate is geographic data? How can spatial
patterns be modeled? How do errors and uncertainty affect maps and analysis? GI Science is
mainly studied in universities and research institutions, and it helps improve the way geographic
information is handled and understood.
10
On the other hand, Geographic Information Systems (GIS) refers to the computer-based systems
and tools used to work with geographic data. GIS includes hardware, software, data, people, and
procedures that allow users to store, analyze, and display spatial information. GIS is mainly
practical and application-oriented. It is used by planners, engineers, environmentalists, surveyors,
and decision-makers to solve real-world problems such as urban planning, land management,
disaster management, and environmental monitoring.
In simple terms, GI Science provides the scientific foundation, while GIS provides the tools and
technology. GI Science develops the ideas and methods, and GIS applies those ideas using
computers and software.
Key Summary
11
f) Applications of GIS
Geographic Information Systems (GIS) are applied across a wide range of industries to support
decision-making, improve efficiency, and visualize spatial patterns. The following are major
domains where GIS is commonly used:
3. Environmental Management
• Resource Monitoring: Helps monitor forests, water bodies, wildlife, soil health, and other
natural resources.
• Conservation Planning: Supports biodiversity conservation, protected area management,
and renewable energy site assessment.
• Environmental Impact Assessment (EIA): GIS models environmental scenarios and
assesses potential impacts of proposed projects.
12
• Disease Surveillance: Tracks the spread of diseases, identifies hotspots, and supports
planning for medical services.
• Emergency Response: Vital for decision-making during floods, earthquakes, landslides,
fires, and other disasters—supporting evacuation planning, risk mapping, and resource
allocation.
• Location-Based Marketing: Retailers use GIS to analyze consumer behavior and send
targeted promotions based on user location.
• Site Selection: Real estate and business analysts evaluate potential development sites using
demographic, accessibility, and environmental criteria.
• Logistics and Transportation: Enhances route optimization, fleet management, supply
chain efficiency, and traffic analysis.
• Precision Agriculture: Helps optimize crop yields, manage irrigation, assess soil
conditions, and monitor farm activities using spatial data.
• Mining Exploration: Identifies mineral-rich zones, monitors extraction activities, and
supports environmental rehabilitation.
• Forestry: GIS manages forest inventory, tracks deforestation, and supports reforestation
planning.
• Utility Management: Used extensively by gas, water, and electricity companies to map
networks, manage outages, plan expansions, and monitor infrastructure health.
• Asset Management: Supports the monitoring, maintenance, and lifecycle planning of
critical infrastructure assets.
13
8. Criminal Justice and Security
• Crime Mapping: Law enforcement agencies analyze crime patterns, identify high-risk
areas, and plan patrol routes.
• Military Reconnaissance: Supports battlefield analysis, mission planning, terrain
evaluation, and surveillance using spatial intelligence.
14
TOPIC 2: GEOGRAPHIC INFORMATION AND SPATIAL DATA TYPES
Geographical phenomena are the real-world features, events, and processes that exist or occur on
or near the Earth’s surface. These phenomena form the foundation of analysis in a Geographic
Information System (GIS) because GIS is designed to capture, store, analyze, and visualize
information that is linked to specific locations.
In GIS, geographical phenomena are understood through two main categories of information:
Spatial data describes the where of a phenomenon. It represents the geographic position of features
on the Earth’s surface. This location is defined using coordinate systems such as:
Spatial data allows GIS users to map and analyze the distribution of features such as settlements,
rivers, roads, and land parcels. It also enables distance measurements, area calculations, and
pattern analysis.
Non-spatial data, also called attribute data, describes the what, how, and why of a geographical
phenomenon. These are the characteristics that provide detailed information about a feature,
without directly defining its location.
In GIS, attribute data is usually stored in tables and linked to spatial features using unique
identifiers. This connection allows users to perform queries, comparisons, and statistical analyses.
1. Natural Features
These are features that occur naturally in the environment, such as:
15
o Rivers and streams
o Mountains and hills
o Forests and vegetation cover
o Lakes, seas, and wetlands
2. Human-made (Cultural) Features
These are features created or modified by human activities, including:
o Roads, railways, and bridges
o Buildings and settlements
o Administrative boundaries (wards, districts, regions)
o Dams, pipelines, and power lines
3. Dynamic (Temporal) Phenomena
These are phenomena that change over time and often need time-based data for proper
analysis. Examples include:
o Rainfall and temperature variations
o Flooding and drought events
o Urban growth and land-use change
o Migration and population movement patterns
Importance in GIS
In a Geographic Information System (GIS), real-world geographic features are stored and
processed in computers using spatial data models. These models provide structured ways of
converting physical features on the Earth’s surface into digital forms that a computer can store,
display, and analyze. The two fundamental spatial data models are the vector data model and the
raster data model.
The vector data model represents geographic features using precise geometric shapes based on
coordinates.
• Points – represent discrete and well-defined locations such as wells, schools, hospitals, and
tree positions.
• Lines (Polylines) – represent linear features such as roads, rivers, pipelines, and power
transmission lines.
16
• Polygons – represent area features such as lakes, forest reserves, administrative districts,
and land parcels.
Vector data is particularly suitable for representing features with clear boundaries and exact
locations.
Vector data can exist in several digital formats. Important ones include:
a) Shapefiles
A shapefile is one of the most widely used vector data formats in GIS. It was developed by Esri
and is stored as a set of related files:
b) Geodatabases
A geodatabase is a more advanced data storage format that stores multiple datasets in a single
database structure. It is mainly used in ArcGIS.
The raster data model represents geographic information using a grid of cells (pixels) arranged
in rows and columns. Each cell contains a value that represents information about that specific
location on the Earth’s surface.
17
Key characteristics of raster data:
• Best suited for representing continuous phenomena that change gradually over space
• Each cell has a fixed size called spatial resolution
• Smaller cells = higher detail; larger cells = lower detail
• GeoTIFF (.tif)
• JPEG (.jpg)
• PNG (.png)
• GRID formats
• GIS represents real-world features using vector and raster data models.
• Vector data uses points, lines, and polygons to represent discrete features.
• Raster data uses grids of cells to represent continuous data.
• Shapefiles are simple, widely used vector formats.
• Geodatabases are advanced formats capable of storing complex datasets and rules.
Attribute data refers to the descriptive, non-spatial information that provides details about
geographic features in a Geographic Information System (GIS). While spatial data shows where
a feature is located, attribute data explains what that feature is, how it behaves, and why it is
important.
In GIS, attribute data is organized and stored in tables that are linked directly to spatial features
through a unique identification field.
18
• Rows (Records):
Each row represents a single geographic feature, such as a specific road, land parcel, river
section, or settlement.
• Columns (Fields):
Each column represents one characteristic or property of that feature, such as name,
length, area, population, soil type, or ownership status.
This tabular structure allows efficient storage, retrieval, and management of descriptive
information.
Attribute data enables users to perform attribute-based analysis and queries in GIS. Some
typical functions include:
• Selection queries:
Identifying areas with population values above or below a specified threshold.
• Filtering and classification:
Finding roads of a particular type, class, or condition.
• Land management analysis:
Identifying land parcels with specific ownership, tenure status, or land-use restrictions.
• Statistical analysis:
Generating summaries such as totals, averages, and distributions of geographic features.
19
Topology refers to the set of rules and relationships that define how geographic features share
space and interact with one another, independent of their exact coordinates, shape, or size. In a
Geographic Information System (GIS), topology is not just a visual concept; it is a data structure
and logical framework that ensures spatial data behaves like the real world.
While geometry answers the question “Where is the feature and what does it look like?”, topology
answers deeper questions like:
This makes topology essential for data integrity, spatial reasoning, and decision-making.
Adjacency refers to the relationship where two area features share a common boundary but do not
overlap.
Deeper clarification:
Real-world value:
20
Used in:
Connectivity defines how linear features join together to form continuous networks.
Deeper clarification:
Important concepts:
Real-world value:
Crucial in:
Deeper clarification:
21
• Rules can be defined such as:
o “Points must be inside polygons”
o “Lines must be within boundaries”
Real-world value:
Applied in:
Overlap occurs when features share part of the same area, while intersection occurs when features
cross each other at any point.
Deeper clarification:
Real-world value:
Used for:
In modern GIS, topology is implemented as explicit rules stored in the database, not just as visual
relationships.
22
Examples of topology rules include:
These rules are enforced and managed inside software such as ArcGIS and QGIS, making topology
a core data quality mechanism, not just a theoretical idea.
In reality, topology is the logic that makes GIS intelligent, not just visual.
23
• Cadastral and land administration systems
• Topology is the logical framework that defines spatial relationships between geographic
features, independent of shape and size.
• It governs how features touch, connect, contain, and intersect.
• It is implemented in GIS through explicit topology rules.
• It is essential for data quality, accurate analysis, and real-world decision making.
24
TOPIC 3: GIS DATA SOURCES AND DATA ENTRY
Geographical Information Systems (GIS) depend heavily on data. Without data, a GIS cannot
function, no matter how powerful the computer or how advanced the software is. Data is the raw
material that GIS uses to create maps, perform analysis, and support decision-making. Therefore,
understanding where GIS data comes from, how it is collected, and how it is entered into a
GIS system is a very important step for anyone learning GIS.
This topic focuses on GIS data sources and data entry. It explains the different places from
which GIS data is obtained, the difference between collecting new data and using existing data,
and the basic ideas behind entering data into a GIS. It also prepares learners to understand why
data quality is critical in GIS work.
GIS does not create data by itself; it depends on data collected from the real world or
obtained from reliable sources.
A GIS data source refers to the origin or place from which geographic data is obtained for
use in a GIS. In simple terms, it answers the question:
“Where did this GIS data come from?”
1. Spatial data
2. Attribute data
25
Spatial data describes the location, shape, and size of features on the Earth’s surface. It tells us
where something is and what shape it has. For example:
Spatial data is usually represented as points, lines, or polygons, or as raster images such as
satellite images.
Attribute data, on the other hand, provides descriptive information about those spatial
features. It answers questions such as:
In GIS, spatial data and attribute data are always linked. A feature on the map is meaningful only
when it has descriptive information attached to it.
These two approaches lead to primary data sources and secondary data sources.
26
Primary data refers to data that is collected directly from the field by the user or organization
conducting the GIS project. This data is collected first-hand, specifically for a particular
purpose or study.
• More accurate
• More current
• More relevant to the specific project
However, it is also:
• More time-consuming
• More expensive
• More labour-intensive
One of the most common methods of collecting primary GIS data is through GPS surveys.
A Global Positioning System (GPS) device is used to determine the exact geographic
coordinates of features on the Earth’s surface. GPS devices receive signals from satellites and
calculate the location in terms of latitude, longitude, and sometimes elevation.
• Roads
• Buildings
27
• Water points
• Schools and hospitals
• Administrative boundaries
• Utility features such as power lines or pipelines
Example:
A planning officer visits a village and uses a GPS device to record the locations of all water
wells. Each well is recorded as a point with coordinates and additional information such as
depth, condition, and ownership.
Field surveys involve collecting data through direct observation, measurement, or interviews
in the field. This method is often used when attribute data is needed alongside spatial data.
Example:
A GIS student conducts a land use survey by walking through an area and recording whether
each parcel is residential, commercial, agricultural, or industrial.
28
Field surveys are especially useful for collecting:
• Socio-economic data
• Land use and land cover information
• Environmental conditions
• Infrastructure details
Remote sensing is the science of collecting data about the Earth’s surface without direct
contact. It uses sensors mounted on:
• Satellites
• Aircraft
• Drones (Unmanned Aerial Vehicles)
Remote sensing provides data in the form of images, which can be analyzed in GIS to extract
useful information.
Example:
A satellite image is used to analyze changes in forest cover over a period of ten years.
29
• Reduces the need for field visits in difficult areas
Ground observation involves directly observing and recording features in their natural setting.
This method is often used to verify data collected from other sources.
Example:
After analyzing satellite images, a GIS analyst visits the area to confirm whether certain areas are
farms or natural vegetation.
Because primary data is collected specifically for a project, it fits the project’s needs very well.
30
Secondary data refers to data that has already been collected by other organizations or
individuals and is reused for GIS purposes. The GIS user does not collect this data directly but
obtains it from existing sources.
• Faster to obtain
• Cheaper than primary data
• Available for large areas
• Outdated
• Less accurate
• Collected for a different purpose
Despite these limitations, secondary data is widely used in GIS because it saves time and
resources.
Maps are one of the oldest and most common sources of secondary GIS data. These include:
• Topographic maps
• Cadastral maps
• Thematic maps
• Administrative boundary maps
Maps can be converted into digital form through scanning and digitization and then used in GIS.
Example:
A topographic map produced years ago is digitized to extract road and river networks.
31
Population data collected during national censuses is a major source of secondary data. This data
includes:
• Population counts
• Age and gender distribution
• Household information
• Socio-economic indicators
• Urban planning
• Health services planning
• Education planning
• Resource allocation
• Reports
• Tables
• Databases
• Digital map formats
These data sources can often be imported directly into GIS software.
Example:
A spreadsheet containing village population figures is linked to a village boundary map in GIS.
Advantages:
• Saves time
• Reduces cost
32
• Covers large areas
• Often easily available
Limitations:
• May be outdated
• May not match project needs
• May contain errors
• May use different standards
GIS users must always evaluate the quality of secondary data before using it.
After understanding primary and secondary data sources, the next important step for GIS
learners is to understand who actually produces and supplies GIS data, especially at the
national level. In most countries, including Tanzania, a large amount of GIS data is produced by
government institutions, public agencies, and research organizations.
These organizations are known as GIS data providers. They collect, manage, and distribute
geographic data that is used for planning, development, research, and decision-making. Knowing
these providers helps GIS users:
In Tanzania, GIS data providers play a critical role in national development, land management,
infrastructure planning, environmental protection, and disaster management.
33
The National Bureau of Statistics (NBS) is one of the most important GIS data providers in
Tanzania. Its main responsibility is to collect, analyze, and disseminate statistical data related to
population and socio-economic conditions.
NBS provides:
Although NBS data is mainly statistical, it becomes GIS data when it is linked to geographic
locations, such as districts, wards, or villages.
Example:
A GIS analyst uses census data from NBS to create a map showing population density by ward
in a city. This map helps planners identify areas that need more schools or health facilities.
34
The Ministry of Lands, Housing and Human Settlements Development is a key provider of
land-related GIS data in Tanzania. This ministry is responsible for land administration, land
use planning, and housing development.
This data is extremely important for land management and development planning.
Example:
A town planner uses cadastral maps to identify land parcels affected by a proposed road
expansion project.
The Survey and Mapping Division (SMD) is the national authority responsible for mapping
and surveying activities in Tanzania. It produces some of the most fundamental GIS datasets.
35
Types of GIS Data Provided by SMD
SMD provides:
• Topographic maps
• Geodetic control points
• Elevation data
• National base maps
• Coordinate reference frameworks
• Rivers
• Roads
• Hills and mountains
• Settlements
• Administrative boundaries
Example:
A GIS user uses topographic maps from SMD as a base layer for environmental analysis.
The Tanzania Meteorological Authority (TMA) is responsible for collecting and providing
weather and climate data across the country.
36
TMA provides:
• Rainfall data
• Temperature data
• Wind patterns
• Climate zones
• Weather station locations
This data becomes GIS data when it is mapped and analyzed spatially.
Example:
A GIS analyst creates a rainfall distribution map using data from TMA to support agricultural
planning.
The Tanzania National Roads Agency (TANROADS) is responsible for managing the national
road network. It is a major provider of transportation-related GIS data.
TANROADS provides:
37
• Road condition data
• Bridge and culvert locations
• Transport infrastructure data
• Transport planning
• Road maintenance planning
• Traffic analysis
• Infrastructure development
Example:
A GIS planner uses TANROADS data to analyze accessibility between regions.
Local Government Authorities (LGAs) include city councils, municipal councils, town
councils, and district councils. They are important sources of local-level GIS data.
LGAs provide:
38
• It reflects current ground conditions
• It supports community-level planning
Example:
A district council uses GIS data to map health facility coverage.
Universities and research institutions in Tanzania also produce valuable research-based GIS
data.
They produce:
• Environmental datasets
• Biodiversity maps
• Land cover classifications
• Urban growth studies
• Climate and ecological data
This data is often collected through research projects and academic studies.
Use in GIS
• Policy development
• Environmental conservation
• Scientific studies
• Educational purposes
39
• Identify reliable data sources
• Access authoritative datasets
• Avoid collecting data that already exists
• Improve efficiency and accuracy
Introduction
In GIS studies, beginners often hear the terms data collection, data acquisition, and data
capture used as if they mean the same thing. While these terms are closely related, they actually
describe different stages in the life of GIS data. Understanding the difference between them is
very important because each term refers to a specific activity in GIS work.
In simple words:
• Data collection is about gathering new data from the real world.
• Data acquisition is about obtaining existing data from other sources.
• Data capture is about entering or converting data into a digital GIS format.
Together, these processes ensure that geographic information moves from the real world into a
GIS system where it can be stored, analyzed, and displayed.
Data Collection
Data collection refers to the process of gathering new and original data directly from the
field. This data has not been collected before or is collected again because existing data is
outdated or insufficient.
Data collection usually involves physical presence in the field, where GIS users observe,
measure, or record information about geographic features.
40
Data collection is often the first step in a GIS project when:
Data collection in GIS can be carried out using several methods, including:
GPS devices are widely used to collect location data. Using GPS, users can record:
• Coordinates of features
• Paths and routes
• Boundaries and edges
Example:
A GIS technician uses a handheld GPS device to record the locations of all boreholes in a rural
ward.
• Accurate
• Fast
• Suitable for fieldwork
Field surveys involve collecting attribute information through observation, interviews, and
questionnaires.
41
Example:
During a field survey, data is collected about:
This method involves visually observing features and taking measurements using tools such as
measuring tapes or range finders.
Example:
A surveyor measures the width of a road and records its condition.
• Time-consuming
• Costly
• Requires trained personnel
• Difficult in remote or unsafe areas
Data Acquisition
42
Data acquisition refers to the process of obtaining existing data from other sources, rather
than collecting it directly from the field. In this case, the data already exists and is simply
accessed or requested for use in a GIS project.
• Government institutions
• Research organizations
• International agencies
• Online platforms
• Commercial data providers
Example:
Instead of conducting a new population survey, a GIS user acquires census data from NBS and
uses it for analysis.
• Saves time
43
• Reduces cost
• Covers large geographic areas
• Often readily available
Data Capture
Data capture refers to the process of converting data into a digital format that can be used in
GIS software. This step is necessary whether data is collected directly or acquired from other
sources.
In simple terms, data capture is about getting data into the GIS system.
• Digitizing maps
• Scanning paper documents
• Entering coordinates manually
• Importing digital files
44
Example:
After collecting GPS points of health facilities, a GIS analyst enters the coordinates into GIS
software to create a digital map.
1. Data Collection
o Field teams use GPS to record the locations of new roads.
2. Data Acquisition
o Population data is obtained from the National Bureau of Statistics.
3. Data Capture
o Both GPS data and population data are entered into GIS software and digitized.
This example shows how the three processes work together but remain distinct.
45
PART 4: GIS DATA INPUT METHODS, DATA EDITING, DATA QUALITY ISSUES,
AND CONCLUSION
Once GIS data has been collected, acquired, or prepared for use, the next critical step is data
input or data entry. Data input refers to the process of entering geographic data into a GIS
system so that it can be stored, analyzed, and displayed in map form.
In GIS, data input is not just about typing information into a computer. It involves transforming
real-world features such as roads, rivers, buildings, and boundaries into digital representations
that a GIS can understand and process.
The accuracy and reliability of GIS outputs depend heavily on how carefully data input is done.
Poor data input leads to errors that may affect the entire GIS project.
If errors occur during data input, they may continue through all stages of GIS analysis, leading to
incorrect maps and poor decisions.
1. Scanning
2. Digitization
3. Coordinate Entry
46
Each method has its own purpose, advantages, and limitations.
1. Scanning
Meaning of Scanning
Scanning is the process of converting paper maps, drawings, or documents into digital image
files using a scanner. The output of scanning is usually a raster image, which is made up of
pixels.
• Topographic maps
• Cadastral maps
• Land use maps
• Engineering drawings
• Historical maps
47
A scanned map does not automatically have geographic coordinates. Therefore, it must be
georeferenced before use.
Georeferencing involves:
Example:
Aligning a scanned topographic map with known latitude and longitude values.
Advantages of Scanning
Limitations of Scanning
2. Digitization
Meaning of Digitization
Digitization is the process of converting map features into vector data by tracing them. Vector
data represents features as:
• Points
• Lines
• Polygons
48
Types of Digitization
a) Manual Digitization
• A mouse
• A digitizing tablet
b) On-Screen Digitization
On-screen digitization involves tracing features directly on a digital image displayed on the
computer screen.
Example:
Tracing roads from a satellite image displayed in GIS software.
• Roads → Lines
• Rivers → Lines
• Buildings → Polygons
• Boreholes → Points
Errors in Digitization
49
• Incorrect feature shapes
Advantages of Digitization
Limitations of Digitization
• Time-consuming
• Requires skilled operators
• Errors can occur if not careful
3. Coordinate Entry
Coordinate entry involves manually entering known coordinates into GIS software to create
spatial features.
50
Advantages of Coordinate Entry
Data editing is the process of correcting, modifying, and improving GIS data after it has been
entered into the system.
• Accurate
• Consistent
• Complete
51
• Correcting spelling mistakes
• Filling missing values
• Updating outdated information
Data quality refers to how reliable, accurate, and suitable GIS data is for a specific purpose.
Good-quality data leads to reliable GIS results, while poor-quality data leads to misleading
outcomes.
1. Accuracy
2. Completeness
3. Consistency
Consistency ensures:
52
4. Timeliness
5. Resolution
• Level of detail
• Size of the smallest mapped feature
• Incorrect maps
• Faulty analysis
• Poor planning decisions
• Waste of resources
• Accurate mapping
• Reliable analysis
• Informed decision-making
• Effective planning
Conclusion
53
This topic has explained the entire journey of GIS data, from its source to its final use in a GIS
system. We have learned that GIS data comes from many sources, including field surveys,
satellite images, and government institutions. We also explored the important differences
between data collection, data acquisition, and data capture, and how these processes work
together.
Furthermore, we examined the main GIS data input methods—scanning, digitization, and
coordinate entry—and discussed how data editing and quality control ensure that GIS outputs are
accurate and reliable.
Understanding GIS data sources and data entry is essential for anyone studying or working with
GIS, because the quality of GIS outputs depends directly on the quality of GIS inputs.
GIS technology enables users not only to store and display maps, but also to ask questions, analyze
spatial relationships, and support decision-making. Spatial analysis is the core strength of a
Geographic Information System, allowing users to understand patterns, relationships, and trends
based on location.
Spatial analysis refers to a set of techniques used in GIS to examine the location, distribution, and
relationships of geographic features in space. It goes beyond simple map display by transforming
raw spatial data into meaningful information.
54
Key Components of Spatial Analysis
1. Spatial Data
o Vector data (points, lines, polygons)
o Raster data (cells, pixels)
2. Spatial Relationships
o Distance
o Direction
o Adjacency
o Connectivity
o Containment
3. Spatial Operations
o Overlay
o Buffering
o Proximity analysis
o Network analysis
o Surface analysis
55
A spatial query is a request to select features based on their spatial location or relationship with
other features.
1. Location-Based Queries
o Find schools within a district
o Identify wells inside a buffer zone
2. Proximity Queries
o Find hospitals within 5 km of major roads
o Identify settlements near rivers
3. Intersection Queries
o Find land parcels that intersect flood zones
o Identify roads that cross protected areas
4. Containment Queries
o Identify points inside polygons
o Find villages within administrative boundaries
• Select by Location
• Buffer
• Intersect
• Clip
• Union
• Spatial Join
56
• Near
1. Buffer Analysis
Example:
2. Overlay Analysis
Example:
• Overlay land use map with soil map to identify suitable agricultural land.
3. Proximity Analysis
Example:
57
• Find the nearest hospital to each village.
• Identify settlements closest to water sources.
4. Network Analysis
Example:
5. Spatial Join
Example:
Steps:
58
• Ignoring topology (leading to wrong results)
• Assuming maps = analysis (visualization ≠ analysis)
Geospatial data visualization is the process of displaying geographic data in visual form—
mainly maps—so that spatial patterns, relationships, and trends can be easily understood.
Visualization is not only about making maps look nice; it is about making spatial information
meaningful.
What Is a Map?
A map is a visual representation of geographic features on the Earth’s surface, drawn at a certain
scale.
What Is GIS?
A Geographic Information System (GIS) is a computer system used to capture, store, analyze,
and visualize spatial data.
59
• GIS creates maps
• Maps are outputs of GIS
• GIS allows maps to be:
o Updated easily
o Viewed at different scales
o Generated after analysis
Key idea:
A map shows information; GIS helps you analyze information before showing it.
Visualization in GIS follows a clear process. Skipping steps often leads to poor or misleading
maps.
What to do:
60
Line Roads, rivers Line styles
Polygon Districts, wards Colors / patterns
Raster Satellite image Color ramps
Questions to ask:
Examples:
61
5. Apply changes
Common methods:
✔ Title
✔ Legend
✔ Scale bar
✔ North arrow
✔ Labels
✔ Neat layout
In GIS:
62
4. How to Visualize Different Types of Data in GIS
Examples:
• Schools
• Health facilities
How to do it:
Examples:
• Roads
• Rivers
How to do it:
Examples:
• Wards
• Districts
How to do it:
63
• Use patterns for land use
Examples:
• Satellite images
• Elevation models
How to do it:
64
(v). Overcrowded maps
Visualization:
8. Exam-Focused Summary
Answer:
Geospatial data visualization is the process of representing geographic data visually, mainly
through maps, to help users understand spatial patterns, relationships, and trends.
Answer:
GIS is a system used to collect, store, analyze, and visualize spatial data, while maps are visual
outputs produced by GIS to communicate geographic information.
Q3. List four elements that must appear on a good GIS map.
Answer:
• Title
65
• Legend
• Scale bar
• North arrow
Answer:
Symbolization is the process of assigning symbols, colors, and sizes to geographic features in GIS
in order to represent different types of data visually.
Answer:
Categorical data represents distinct classes or types (e.g., land use types), while numerical data
represents measurable quantities (e.g., population, rainfall).
Answer:
The visualization process in GIS involves data preparation, understanding data type, selecting
visualization methods, symbolization, classification of data, map layout design, and interpretation
of the final map.
Answer:
Polygon data can be visualized using graduated colors, categorized colors, or pattern fills
depending on the nature of the data, such as population density or land use types.
Answer:
Classification groups numerical data into classes to make patterns easier to interpret. Proper
classification ensures accurate and meaningful representation of data, while poor classification can
mislead map users.
Answer:
66
SECTION C: Practical-Oriented Questions
Q10. Outline steps to visualize population data by district using GIS software.
Answer:
Q11. A student wants to show road hierarchy on a map. Which visualization method should
be used and why?
Answer:
Different line thicknesses and colors should be used to represent road hierarchy because line data
is best visualized by varying line width and style to show importance or classification.
Q12. How can raster data such as satellite imagery be visualized in GIS?
Answer:
Raster data can be visualized using color ramps, contrast adjustments, classification, transparency,
and hillshade to enhance visibility and interpretation.
Answer:
GIS visualization presents complex spatial data in an easy-to-understand visual form, helping
decision-makers identify patterns, compare locations, assess risks, and make informed planning
decisions.
Q14. Why is GIS visualization considered more powerful than traditional mapping?
Answer:
GIS visualization is dynamic, allows multiple layers and scales, supports analysis before mapping,
and can be updated easily, unlike traditional static maps.
67
Road network Line thickness and color
Land use Categorized colors
Elevation (DEM) Color ramps / hillshade
68