MANAGING DATA RESOURCES
TERMS
Bit - represents the smallest unit of data a
computer can handle
Byte – group of bits, represent a single character (
number, letter, symbol etc)
Field – grouping of characters into a word, group
of words, numbers etc..
Record – group of related fields, such as student’s
name,course taken, date etc..
File – group of records of the same type
Database – group of related files
TERMS (CONTD.)
Entity – a person, place, thing or event on which
we store and maintain information
Attribute – the characteristic or quality describing
a particular entity
PROBLEMS WITH TRADITIONAL FILE ENVIRONMENT
Data redundancy & inconsistency
Program data dependence
Refers to the coupling of data stored in files
and the specific programs required to update &
maintain those files such that changes in
programs require changes to the data
Lack of flexibility
Poor security
Lack of data sharing & availability
DBMS
The software that permits an organisation to
centralize data, manage them efficiently &
provide access to the stored data by application
programs
Acts as an interface between application programs
& physical data files
VIEWS
Logical view
Presents data as they would be perceived by end
users/ business specialists
Physical view
Shows how data are actually organized and structured
on physical storage media
Employees :Name
Address Personal
ID Application
programs Personnel
Designation
Dept
Marital status
Payroll Payroll
Hours worked Application
Payroll
Pay rate DBMS programs
Dept
Gross pay
Tax
Net pay
Benefits
Benefits Application Benefits
Life insurance programs Dept
Pension plan
Health care
Retirement
COMPONENTS
DDL
Formal language programmers use to specify the
structure or the content of the database
DML
To manipulate data. Contains commands that
permit end users and programming specialists to
extract data from database to satisfy information
requests and develop applications
Data dictionary
A file that stores definitions of data elements &
data characteristics such as usage, physical
representation, security etc.
DATA DEFINITION LANGUAGE (DDL)
Specification notation for defining the database
schema
E.g.
create table account (
account-number char(10),
balance integer)
DROP
DDL compiler generates a set of tables stored in a
data dictionary
DATA MANIPULATION LANGUAGE (DML)
Language for accessing and manipulating the data
organized by the appropriate data model
Used to add, change, delete and retrieve data in the
database
SQL is the most widely used query language
Eg: Select, insert, update, delete
Sample data dictionary
TYPES OF DATA MODELS
Hierarchical and Network DBMS
Relational DBMS (RDBMS)
Object oriented Databases
HIERARCHICAL DBMS
In the Hierarchical Model, different records are
inter-related through hierarchical or tree-like
structures.
The relationship between the files or records form
a hierarchy
Typical for representing one–to-many relationships
NETWORK DBMS
In the Network Model, a parent can have several
children and a child can also have many parent
records.
Records are physically linked through linked-lists
Sales
Customer
Representative
Product Invoice Payment
Invoice-line
Example of implementing network model for a
sales organisation
RELATIONAL DBMS
Represent data as two dimensional tables ( called
relations)
Tables may be referred to as files
Rows are commonly referred to as records or tuples
A key field called as Primary Key is the unique
identifier for all the information in any row of the
table
Primary key cannot be duplicated
When the primary key of a table appears in another
table, it is called a Foreign key.
Attributes
RELATIONAL MODEL
customer- customer- customer- account-
Customer-
name street city number
id
192-83-7465 Johnson
Alma Palo Alto A-101
019-28-3746 Smith
North Rye A-215
192-83-7465 Johnson
Alma Palo Alto A-201
321-12-3123 Jones
Main Harrison A-217
019-28-3746 Smith
North Rye A-201
A SAMPLE RELATIONAL DATABASE
OBJECT ORIENTED DBMS
Represent an entity and a class
Class represents both object attributes and
behavior of entity
Stores the data and procedures that act on those
data as objects that can be automatically
retrieved and shared
Can be used to manage the various multimedia
components, in web applications etc.
DISTRIBUTING DATABASES
A distributed database is a database in which
not all storage devices are attached to a
common processor
It may be stored in multiple computers, located in
the same physical location; or may be dispersed
over a network of interconnected computers.
2 methods
Partitioned database
Replicate database
STEPS IN DESIGNING A DATABASE
Determine the purpose of the database
Determine the tables needed by the organisation
in the database
Determine the fields needed in the tables
Identify the primary unique keys
Determine the relationships between tables
E-R MODEL
The entity-relationship (E-R) model
perceives the real world as consisting of
basic objects, called entities, and
relationships among these objects.
A relationship is an association among
several entities
E-R DIAGRAMS
Rectangles represent entity sets.
Diamonds represent relationship sets.
Lines link attributes to entity sets and entity sets to relationship sets.
Ellipses represent attributes
Underline indicates primary key attributes
ACTORS ON THE DBMS SCENE
Data administrator (DA)
Database administrator (DBA)
Database designers
Casual end users
Application programmers
DATA WAREHOUSE
A database that stores current & historic data of
potential interest to managers throughout the
company
A data warehouse is the main repository of an
organization's historical data, its corporate
memory. It contains the raw material for
management's decision support system.
Operational
data
Historical
data
Operational Extract
data Data
& Warehouse
Transform
External
data •Queries
•Reports
External
•OLAP
data
•Data mining
Internal OLAP
SCM
Data queries
Data
Marts CRM
Data
External warehouse
Data
Strategy
Data
Metadata Marts Data
Personal Mining
Others
Data
DATA MART
A subset of data warehouse in which a summarized
or highly focused portion of the organization’s
data is placed in a separate database for a
specific population of users.
DATA MINING
Discovering hidden value in your data
warehouse
Data mining, the extraction of hidden
predictive information from large
databases, is a powerful new technology
with great potential to help companies
focus on the most important information
in their data warehouses.
Data mining tools predict future trends
and behaviors, allowing businesses to
make proactive, knowledge-driven
decisions.