Chapter 7: Databases and Data
Warehouses
Management Information Systems, Fifth
Edition
Objectives
• Explain the difference between traditional file
organization and the database approach to
managing digital data
• Explain how relational and object oriented
database management systems are used to
construct databases, populate them with data, and
manipulate the data to produce information
• Enumerate the most important features and
operations of a relational database, the most
popular database model
Management Information Systems, Fifth Edition 2
Objectives (continued)
• Understand how data modeling and design
creates a conceptual blueprint of a database
• Discuss how databases are used on the Web
• List the operations involved in transferring data
from transactional databases to data
warehouses
Management Information Systems, Fifth Edition 3
Managing Digital Data
• Businesses collect and dissect data
• Data can be stored in powerful database format
– Easy access and manipulation
• Databases have profound impact on business
• Database technology integrated with Internet
Management Information Systems, Fifth Edition 4
The Traditional File Approach
• Traditional file approach: no mechanism for
manipulating data
• Database approach: has mechanism for
manipulating data
• Traditional approach inconvenient
– High data redundancy
– Low data integrity
• Data redundancy: duplication of data
• Data integrity: accuracy of data
Management Information Systems, Fifth Edition 5
The Traditional File Approach
(continued)
Figure 7.1: The layout of a human resource file in traditional file organization
Management Information Systems, Fifth Edition 6
The Database Approach
• Database approach: data organized as entities
• Entity: object that has data
– People
– Events
– Products
• Character: smallest piece of data
• Field: single piece of information about entity
• Record: collection of fields
Management Information Systems, Fifth Edition 7
The Database Approach (continued)
• File: collection of related records
• Database management system (DBMS):
program used to build databases
– Populates with data
– Manipulates data
– Query: message requesting access to data
Management Information Systems, Fifth Edition 8
The Database Approach (continued)
• Database has security issues
• Database administrator (DBA): limits user
access to database
– Requires users to enter codes
• DBMS bundled with fourth-generation languages
Management Information Systems, Fifth Edition 9
The Database Approach (continued)
Figure 7.2: Data hierarchy
Management Information Systems, Fifth Edition 10
The Database Approach (continued)
Databases include more than just text and numbers: for instance, a database used by real
Estate agents may show property pictures in addition to addresses, prices, and sale status
Management Information Systems, Fifth Edition 11
The Database Approach (continued)
Figure 7.3: Different database views reveal different combinations of data
Management Information Systems, Fifth Edition 12
The Database Approach (continued)
Figure 7.4: Different views from the same database
Management Information Systems, Fifth Edition 13
Database Models
• Database model: general logical structure
– How records stored in database
– Records linked differently in different models
– Models constantly changing
Management Information Systems, Fifth Edition 14
The Relational Model
• Relational Model: consists of tables
• Based on relational algebra
– Tuple: record
– Attribute: field
– Relation: table
– Key: identifier field
• Used to retrieve records
Management Information Systems, Fifth Edition 15
Relational Model (continued)
Figure 7.5: A relational database
Management Information Systems, Fifth Edition 16
The Relational Model (continued)
• Primary key: unique key
– Uniquely identifies record
– Required in table
• Composite key: combination of fields
– Serves as primary key
• Foreign key: shared field
– Links tables
• Join table: composite of tables
Management Information Systems, Fifth Edition 17
The Relational Model (continued)
Figure 7.6: A join table
Management Information Systems, Fifth Edition 18
The Relational Model (continued)
• Table relationships with other tables
• One-to-many relationship: one item in table
linked to many items in other table
• Many-to-many relationship: many items in
table linked to many items of other table
Management Information Systems, Fifth Edition 19
The Object-Oriented Model
• Object-Oriented model: uses object-oriented
approach
• Encapsulation: combined storage of data and
relevant procedures
– Allows object to be planted in different data sets
• Inheritance: creates new object by replicating
characteristics of existing (parent) object
Management Information Systems, Fifth Edition 20
The Object-Oriented Model (continued)
Figure 7.7: An object-oriented database
Management Information Systems, Fifth Edition 21
Relational Operations
• Relational operation: create temporary subset
of table
• Create limited list or joined table list
– Select records based on conditions
– Project columns
– Join tables to create temporary table
Management Information Systems, Fifth Edition 22
Structured Query Language
• Structured query language: language of
choice for DBMSs
• Advantages
– Standardized language
– Used in many host languages
• Portable
Management Information Systems, Fifth Edition 23
The Schema and Metadata
• Schema: plan
– Describes structure of database
– Names and sizes of fields
– Identifies primary keys
• Data dictionary: repository of information about
data
Management Information Systems, Fifth Edition 24
The Schema and Metadata (continued)
• Metadata: data about data
– Source of data
– Tables related to data
– Field information
– Usage of data
– Population rules
Management Information Systems, Fifth Edition 25
The Schema and Metadata (continued)
Figure 7.8: Schema of the Employee table in an Access database
Management Information Systems, Fifth Edition 26
Data Modeling
• Databases must be carefully planned
• Data modeling: analysis and organization of
data
– Proactive process
– Develop conceptual blueprint
• Entity relationship diagram: graphical
representation of relationships
Management Information Systems, Fifth Edition 27
Data Modeling (continued)
• Entity relationship diagram
– Boxes identify entities
– Lines indicate relationship
– Crossbars indicate mandatory fields
– Circles indicate optional
– Crows feet identify “many”
Management Information Systems, Fifth Edition 28
Data Modeling (continued)
Figure 7.9: An entity relationship diagram (ERD)
Management Information Systems, Fifth Edition 29
Data Modeling (continued)
Figure 7.10: Fields of the Professor entity
Management Information Systems, Fifth Edition 30
Databases on the Web
• Web dependent on databases
• Interface between Web and database required
– CGI
– ASP
– API
Management Information Systems, Fifth Edition 31
Databases on the Web (continued)
Figure 7.11: Active server pages and similar software enable data queries and
entry via the Web
Management Information Systems, Fifth Edition 32
Data Warehousing
• Data collections used for transactions
• Accumulation of transaction data useful
• Data warehouse: large database
– Typically relational
– Supports decision making
– Data copied from transactional database
• Data mart: collection of data focusing on
particular subject
Management Information Systems, Fifth Edition 33
From Database to Data Warehouse
• Transactional database not suitable for business
analysis
– Only current data
– Not historic
• Data warehouse requires large storage capacity
– Mainframe computers used
– Scalability issue
Management Information Systems, Fifth Edition 34
Phases in Building a Data Warehouse
• Begin building data warehouse after equipment
secured
– Extraction phase
• Create files from transactional database
– Transformation phase
• Cleanse and modify data
• Loading phase
• Transfer files to data warehouse
Management Information Systems, Fifth Edition 35
Phases in Building a Data Warehouse
(continued)
Figure 7.12: Phases in preparing and using a data warehouse
Management Information Systems, Fifth Edition 36
Summary
• Organizations collect vast amounts of data
• Database approach has advantages over
traditional approach
• Character: smallest piece of data
• File: collection of records
• Designer must construct schema to construct
database
Management Information Systems, Fifth Edition 37
Summary (continued)
• Database management system enables database
construction and manipulation
• Relational and object-oriented database models have
different advantages
• Keys used to form links among entities
• Object-oriented database maintains links differently
• SQL adopted as international standard
Management Information Systems, Fifth Edition 38
Summary (continued)
• Designers conduct data modeling to show
required tables
• Databases often linked to Web
• Data warehouses contain huge collections of
historical data
• Data warehouse allows data extraction,
transformation, and loading
• Invasion of privacy is exacerbated by database
technology
Management Information Systems, Fifth Edition 39