Overview of Database Systems
Lecture 1
Learning Outcomes
In this chapter, you will learn:
Data VS information
Evolution: from file system to database system
What a database is, the various types of
databases, and why they are valuable assets for
decision making
The main components of the database system
The roles and main functions of a database
management system (DBMS)
The importance of database design
2
Introduction
Good decisions require good information
derived from raw facts
Data is managed most efficiently when stored
in a database
Correct information?
Wrong information?
3
Why Databases?
Databases solve many of the problems
encountered in data management
Used in almost all modern settings involving data
management:
Business
Research
Administration
4
Data vs. Information
Data are raw facts
Information is the result of processing raw
data to reveal meaning
Information requires context to reveal
meaning
Raw data must be formatted for storage,
processing, and presentation
Data are the foundation of information, which
is the bedrock of knowledge
5
File System Data Processing
File systems typically composed of collection
of file folders, each tagged and kept in cabinet
They are Organized by expected use
6
File System Data Processing
Contents of each file folder are logically
related
Manual systems (paper-and-pencil systems)
Served as a data repository for small data
collections
Cumbersome for large collections
7
File System Data Processing
Computerized file systems
Data processing (DP) specialist is hired to convert
computer file structure from manual system
Wrote software that managed the data
Designed the application programs
8
File System Data Processing
Initially, computer file systems resembled
manual systems
As number of files increased, file systems
evolved
Each file used its own application program to
store, retrieve, and modify data
Each file was owned by individual or department
that commissioned its creation
9
10
11
12
Problems with File System Data Processing
File systems were an improvement over
manual system
File systems used for more than two decades
File systems store data files into hard disk
Understanding the shortcomings of file systems
aids in development of modern databases
Many problems not unique to file systems
Even simple file system retrieval task required
extensive programming
Ad hoc queries impossible
Changing existing structure difficult
13
Problems with File System Data Processing
Security features difficult to program
Often omitted in file system environments
Summary of file system limitations:
Requires extensive programming: example, to apply update, must
make sure all records have been considered in the programming;
may cause a lot of if/else statements and long execution time
Cannot perform ad hoc queries: example, to search or change a
specific record of a student, is tough
System administration is complex and difficult: example, when
update occurs, administration must check and make sure all
related records are updated. This means that data inconsistency is
unsolved.
Difficult to make changes to existing structures: example, add in
new columns, cause the change of data access method
Security features are likely to be inadequate: example, anyone
can make changes to the files
14
15
Data independence and Data
Dependence
Structural dependence: access to a file is
dependent on its own structure
All file system programs must be modified to
conform to a new file structure
Structural independence: change file structure
without affecting data access
Data dependence: data access changes when
data storage characteristics change
Data independence: data storage
characteristics do not affect data access
16
Data independence and Data Dependence
(cont'd.)
Practical significance of data dependence is
difference between logical and physical format
Logical data format: how human views the
data
Physical data format: how computer must
work with data
Each program must contain:
Lines specifying opening of specific file type
Record specification
Field definitions
17
Data independence and Data
Dependence
File system structure makes it difficult to
combine data from multiple sources
Vulnerable to security breaches
Organizational structure promotes storage of
same data in different locations- Islands of
information
Data stored in different locations is unlikely to
be updated consistently
Data redundancy: same data stored
unnecessarily in different places
18
Data Redundancy
Data inconsistency: different and conflicting
versions of same data occur at different
places
Data anomalies: abnormalities when all
changes in redundant data are not made
correctly
Update anomalies
Insertion anomalies
Deletion anomalies
Sources: https://www.ictlounge.com/html/types_of_databases.htm
19
Introducing the Database
Database: shared, integrated computer
structure that stores a collection of:
End-user data: raw facts of interest to end user
Metadata: data about data
Provides description of data characteristics and
relationships in data
Complements and expands value of data
Database management system (DBMS):
collection of programs that manages structure
and controls access to data
20
Role and Advantages of the DBMS
DBMS is the intermediary between the user
and the database
DBMS enables data to be shared
DBMS integrates many users’ views of the
data
21
22
Role and Advantages of the DBMS (cont’d.)
Advantages of a DBMS:
Improved data sharing
Improved data security
Better data integration
Minimized data inconsistency
Improved data access
Improved decision making
Increased end-user productivity
23
Types of Databases
Databases can be classified according to:
Number of users
Database location(s)
Expected type and extent of use
Single-user database supports only one user
at a time
Desktop database: single-user; runs on PC
Multiuser database supports multiple users at
the same time
Workgroup and enterprise databases
24
Types of Databases
Centralized database: data located at a single
site
Distributed database: data distributed across
several different sites
Operational database: supports a company’s
day-to-day operations
Transactional or production database
Data warehouse: stores data used for tactical
or strategic decisions
25
Types of Databases
Three types of data
Unstructured - data exist in their original state
Structured - data result from formatting
Semi-structured - data have been processed to
some extent
Extensible Markup Language (XML) represents
data elements in textual format
XML database supports semi-structured XML data
26
27
Why Database Design Is Important
Database design focuses on design of
database structure used for end-user data
Designer must identify database’s expected use
Well-designed database:
Facilitates data management
Generates accurate and valuable information
Consistent (store useful and required data)
Efficient execution of queries
Poorly designed database:
Causes difficult-to-trace errors
Generate wrong information
28
Database Systems
Database system consists of logically related
data stored in a single logical data repository
May be physically distributed among multiple
storage facilities
DBMS eliminates most of file system’s problems
Current generation stores data structures,
relationships between structures, and access
paths
29
The Database System Environment
Database system: defines and regulates the
collection, storage, management, use of data
Five major parts of a database system:
Hardware
Software
People
Procedures
Data
30
31
The Database System Environment
Hardware: all the system’s physical devices
Software: three types of software required:
Operating system software
DBMS software
Application programs and utility software
32
The Database System Environment
People: all users of the database system
System and database administrators
Database designers
Systems analysts and programmers
End users
Procedures: instructions and rules that govern
the design and use of the database system
Data: the collection of facts stored in the
database
33
The Database System Environment
Database systems are created and managed
at different levels of complexity
Database solutions must be cost-effective as
well as tactically and strategically effective
Database technology already in use affects
selection of a database system
34
DBMS Functions
Most functions are transparent to end users
Data dictionary management
DBMS stores definitions of data elements and
relationships (metadata) in a data dictionary
DBMS looks up required data component
structures and relationships
Changes automatically recorded in the dictionary
DBMS provides data abstraction and removes
structural and data dependency
35
36
37
DBMS Functions
Data storage management
DBMS creates and manages complex structures
required for data storage
Also stores related data entry forms, screen
definitions, report definitions, etc.
Performance tuning: activities that make the
database perform more efficiently
DBMS stores the database in multiple physical
data files
38
DBMS Functions
Data transformation and presentation
DBMS transforms data entered to conform to
required data structures
DBMS transforms physically retrieved data to
conform to user’s logical expectations
Security management
DBMS creates a security system that enforces
user security and data privacy
Security rules determine which users can access
the database, which items can be accessed, etc.
39
DBMS Functions
Multiuser access control
DBMS uses sophisticated algorithms to ensure
concurrent access does not affect integrity
Backup and recovery management
DBMS provides backup and data recovery to
ensure data safety and integrity
Recovery management deals with recovery of
database after a failure
40
DBMS Functions
Data integrity management
DBMS promotes and enforces integrity rules
Minimizes redundancy
Maximizes consistency
Data relationships stored in data dictionary used
to enforce data integrity
Integrity is especially important in transaction-
oriented database systems
41
DBMS Functions
Database access languages and application
programming interfaces
DBMS provides access through a query language
Query language is a nonprocedural language
Structured Query Language (SQL) is the de facto
query language
42
DBMS Functions
Database communication interfaces
Current DBMSs accept end-user requests via
multiple different network environments
Communications accomplished via:
filling in screen forms through Web browser
automatically publishes predefined reports on a Web
site
connects to third-party systems to distribute
information via e-mail
43
Disadvantages of Database Systems
Increased costs: need sophisticated hardware and
software, cost of maintaining and upgrading, cost of
training
Management complexity: interface with a lot of
technologies and keep updating according to the
latest technologies
Maintaining currency: have to follow the latest
database trend to make sure the system is always
efficient
Vendor dependence: performance of database is
depend on the features which offered by vendor
Frequent upgrade/replacement cycles: need to
upgrade the features and functionalities always
44