0% found this document useful (0 votes)

2 views

lesson10 Normalization

Database normalization is the process of organizing relational database fields and tables to minimize redundancy and improve data integrity. It involves several stages, including First Normal Form (1NF), Second Normal Form (2NF), Third Normal Form (3NF), Fourth Normal Form (4NF), and Fifth Normal Form (5NF), each addressing specific types of data dependency and redundancy. The importance of normalization lies in its ability to enhance data retrieval, reduce storage requirements, and maintain data consistency.

Uploaded by

erickchugu

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

lesson10 Normalization

Uploaded by

erickchugu

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

NORMALIZATION

Introduction and meaning of Database normalization

Database normalization is the process of organizing the fields and tables of a relational database to
minimize redundancy. Normalization usually involves dividing large tables into smaller (and less
redundant) tables and defining relationships between them.

Normalization of data can be defined as a process during which the existing tables of a database
are tested to find certain data dependency between the column and the rows or normalizing of
data can be referred to a formal technique of making preliminary data structures into an easy to
maintain and make efficient data structure

With data normalization any table dependency detected, the table is restructured into multiple
tables (two tables) which eliminate any column dependency. Incase data dependency is still
exhibited the process is repeated till such dependency are eliminated. The process of eliminating
data redundancy is based upon a theory called functional dependency

Importance of normalization
 It highlights constraints and dependency in the data and hence aid the understanding the nature of
the data
 Normalization controls data redundancy to reduce storage requirement and standard
maintenance
 Normalization provide unique identification for records in a database
 Each stage of normalization process eliminate a particular type of undesirable dependency
 Normalization permits simple data retrieval in response to reports and queries
 The third normalization form produces well designed database which provides a higher degree
of independency
 Normalization helps define efficient data structures
 Normalized data structures are used for file and database design
 Normalization eliminate unnecessary dependency relationship within a database file

Forms of normalization/Normalization rules

First normal form (1NF)

Refers to the first step where preliminary data structures are transforming into the first normal
form by eliminating any repeating sets of data elements. A relation table is said to be on the first
normal form, if and only if it contains no repeating groups that is it has no repeated value for a
particular attribute with a single record. Any repeated group of attribute is isolated to form a new
relation. In other words first normal form (1nf) means that a table has no multiple value attribute
or composite attribute, In the 1nf, each column holds one attribute and each row holds a single
occurrence of the entity
Second normal form (2NF)
2nf concentrated on records with concatenated keys, they check the non key attribute for
dependency on the entire key, and any data element that dependent only on part of the key is
moved to a new entity
Third normal form (3NF)
All data element in the third normal form must be a function of the key. To reach the 3nf, you
need to review the structure‘s non-key data elements and identify any data element dependent on
an attribute other than the key, if there is all these data elements should be moved to a new entity
Fourth normal form (4NF)
In data normalization, the fourth normal form deals with data element with issues of multi-value
dependency (when one attributes determine another attribute sets). A relation is said to be in the
4nf formal form if and if only all existing multi-value dependency is converted into functional
dependency
Fifth normal form (5NF)
Here is where the join dependency is removed, the 5nf is also known as the projection join
normal form(PJNF), and refers to the separation of one relation into any sub-relations or having
sub-relations into one relation and can produce join dependencies

Performing Normalization by example

While designing a database out of an entity–relationship model, the main problem existing in
that ―raw‖ database is redundancy. Redundancy is storing the same data item in more one place.
A redundancy creates several problems like the following:

1. Extra storage space: storing the same data in many places takes large amount of disk space.
2. Entering same data more than once during data insertion.
3. Deleting data from more than one place during deletion.
4. Modifying data in more than one place.
5. Anomalies may occur in the database if insertion, deletion, modification etc are no done
properly. It creates inconsistency and unreliability in the database.

To solve this problem, the ―raw‖ database needs to be normalized. This is a step by step process
of removing different kinds of redundancy and anomaly at each step. At each step a specific rule
is followed to remove specific kind of impurity in order to give the database a slim and clean
look.

Un-Normalized Form (UNF)

If a table contains non-atomic values at each row, it is said to be in UNF. An atomic value is
something that can not be further decomposed. A non-atomic value, as the name suggests, can be
further decomposed and simplified. Consider the following table:
Emp-Id Emp-Name Month Sales Bank-Id Bank-Name
E01 AA Jan 1000 B01 SBI
Feb 1200
Mar 850
E02 BB Jan 2200 B02 UTI
Feb 2500
E03 CC Jan 1700 B01 SBI
Feb 1800
Mar 1850
Apr 1725
In the sample table above, there are multiple occurrences of rows under each key Emp-Id.
Although considered to be the primary key, Emp-Id cannot give us the unique identification
facility for any single row. Further, each primary key points to a variable length record (3 for
E01, 2 for E02 and 4 for E03).

First Normal Form (1NF)

A relation is said to be in 1NF if it contains no non-atomic values and each row can provide a
unique combination of values. The above table in UNF can be processed to create the following
table in 1NF.
Emp-Id Emp-Name Month Sales Bank-Id Bank-Name
E01 AA Jan 1000 B01 SBI
E01 AA Feb 1200 B01 SBI
E01 AA Mar 850 B01 SBI
E02 BB Jan 2200 B02 UTI
E02 BB Feb 2500 B02 UTI
E03 CC Jan 1700 B01 SBI
E03 CC Feb 1800 B01 SBI
E03 CC Mar 1850 B01 SBI
E03 CC Apr 1725 B01 SBI

As you can see now, each row contains unique combination of values. Unlike in UNF, this
relation contains only atomic values, i.e. the rows can not be further decomposed, so the relation
is now in 1NF.

Second Normal Form (2NF)

A relation is said to be in 2NF f if it is already in 1NF and each and every attribute fully depends
on the primary key of the relation. Speaking inversely, if a table has some attributes which is not
dependant on the primary key of that table, then it is not in 2NF.

Let us explain. Emp-Id is the primary key of the above relation. Emp-Name, Month, Sales and
Bank-Name all depend upon Emp-Id. But the attribute Bank-Name depends on Bank-Id, which
is not the primary key of the table. So the table is in 1NF, but not in 2NF. If this position can be
removed into another related relation, it would come to 2NF.

Emp-Id Emp-Name Month Sales Bank-Id

E01 AA JAN 1000 B01
E01 AA FEB 1200 B01
E01 AA MAR 850 B01
E02 BB JAN 2200 B02
E02 BB FEB 2500 B02
E03 CC JAN 1700 B01
E03 CC FEB 1800 B01
E03 CC MAR 1850 B01
E03 CC APR 1726 B01
Bank-Id Bank-Name
B01 SBI
B02 UTI

After removing the portion into another relation we store lesser amount of data in two relations
without any loss information. There is also a significant reduction in redundancy.

Third Normal Form (3NF)

A relation is said to be in 3NF, if it is already in 2NF and there exists no transitive dependency in
that relation. Speaking inversely, if a table contains transitive dependency, then it is not in 3NF,
and the table must be split to bring it into 3NF.

What is a transitive dependency? Within a relation if we see

A → B [B depends on A]
And
B → C [C depends on B]
Then we may derive
A → C[C depends on A]

Such derived dependencies hold well in most of the situations. For example if we have
Roll → Marks
And
Marks → Grade
Then we may safely derive
Roll → Grade.

This third dependency was not originally specified but we have derived it.

The derived dependency is called a transitive dependency when such dependency becomes
improbable. For example we have been given
Roll → City
And
City → STDCode

If we try to derive Roll → STDCode it becomes a transitive dependency, because obviously the
STDCode of a city cannot depend on the roll number issued by a school or college. In such a
case the relation should be broken into two, each containing one of these two dependencies:
Roll → City
And
City → STD code

Boyce-Code Normal Form (BCNF)

A relationship is said to be in BCNF if it is already in 3NF and the left hand side of every
dependency is a candidate key. A relation which is in 3NF is almost always in BCNF. These
could be same situation when a 3NF relation may not be in BCNF the following conditions are
found true.

1. The candidate keys are composite.

2. There are more than one candidate keys in the relation.
3. There are some common attributes in the relation.

Professor Code Department Head of Dept. Percent Time

P1 Physics Ghosh 50
P1 Mathematics Krishnan 50
P2 Chemistry Rao 25
P2 Physics Ghosh 75
P3 Mathematics Krishnan 100

Consider, as an example, the above relation. It is assumed that:

1. A professor can work in more than one department

2. The percentage of the time he spends in each department is given.
3. Each department has only one Head of Department.

The relation diagram for the above relation is given as the following:

The given relation is in 3NF. Observe, however, that the names of Dept. and Head of Dept. are
duplicated. Further, if Professor P2 resigns, rows 3 and 4 are deleted. We lose the information
that Rao is the Head of Department of Chemistry.
The normalization of the relation is done by creating a new relation for Dept. and Head of Dept.
and deleting Head of Dept. form the given relation. The normalized relations are shown in the
following.

Professor Code Department Percent Time

P1 Physics 50
P1 Mathematics 50
P2 Chemistry 25
P2 Physics 75
P3 Mathematics 100

Department
Head of Dept.
Physics Ghosh
Mathematics Krishnan
Chemistry Rao
See the dependency diagrams for these new relations.

Fourth Normal Form (4NF)

When attributes in a relation have multi-valued dependency, further Normalization to 4NF and
5NF are required. Let us first find out what multi-valued dependency is.

A multi-valued dependency is a typical kind of dependency in which each and every attribute
within a relation depends upon the other, yet none of them is a unique primary key.

We will illustrate this with an example. Consider a vendor supplying many items to many
projects in an organization. The following are the assumptions:

1. A vendor is capable of supplying many items.

2. A project uses many items.
3. A vendor supplies to many projects.
4. An item may be supplied by many vendors.
A multi valued dependency exists here because all the attributes depend upon the other and yet
none of them is a primary key having unique value.

Vendor Code Item Code Project No.

V1 I1 P1
V1 I2 P1
V1 I1 P3
V1 I2 P3
V2 I2 P1
V2 I3 P1
V3 I1 P2
V3 I1 P3

The given relation has a number of problems. For example:

1. If vendor V1 has to supply to project P2, but the item is not yet decided, then a row with a blank
for item code has to be introduced.
2. The information about item I1 is stored twice for vendor V3.

Observe that the relation given is in 3NF and also in BCNF. It still has the problem mentioned
above. The problem is reduced by expressing this relation as two relations in the Fourth Normal
Form (4NF). A relation is in 4NF if it has no more than one independent multi valued
dependency or one independent multi valued dependency with a functional dependency.

The table can be expressed as the two 4NF relations given as following. The fact that vendors are
capable of supplying certain items and that they are assigned to supply for some projects in
independently specified in the 4NF relation.

Vendor-Supply Vendor-Project

Vendor Code Item Code Vendor Code Project No.

V1 I1 V1 P1
V1 I2 V1 P3
V2 I2 V2 P1
V2 I3 V3 P2
V3 I1
Fifth Normal Form (5NF)
These relations still have a problem. While defining the 4NF we mentioned that all
the attributes depend upon each other. While creating the two tables in the 4NF,
although we have preserved the dependencies between Vendor Code and Item code
in the first table and Vendor Code and Item code in the second table, we have lost the
relationship between Item Code and Project No. If there were a primary key then this
loss of dependency would not have occurred. In order to revive this relationship we
must add a new table like the following. Please note that during the entire process of
normalization, this is the only step where a new table is created by joining two
attributes, rather than splitting them into separate tables.

Project No. Item Code

P1 11
P1 12
P2 11
P3 11
P3 13

Let us finally summarize the normalization steps we have discussed so far.

Input Transformation Output

Relation Relation
All Eliminate variable length record. Remove multi-attribute lines in table. 1NF
Relations
1NF Remove dependency of non-key attributes on part of a multi-attribute 2NF
Relation key.
2NF Remove dependency of non-key attributes on other non-key attributes. 3NF
3NF Remove dependency of an attribute of a multi attribute key on an BCNF
attribute of another (overlapping) multi-attribute key.
BCNF Remove more than one independent multi-valued dependency from 4NF
relation by splitting relation.
4NF Add one relation relating attributes with multi-valued dependency. 5NF

Introduction to Applied Econometrics Analysis Using Stata
From Everand
Introduction to Applied Econometrics Analysis Using Stata
Justin Doran
5/5 (3)
Chit Fund Management System
67% (6)
Chit Fund Management System
79 pages
Topic 6- Normalization
No ratings yet
Topic 6- Normalization
13 pages
Normalization and Normal Form
No ratings yet
Normalization and Normal Form
11 pages
Unit3-Part2-Normalization-Normal Forms
No ratings yet
Unit3-Part2-Normalization-Normal Forms
20 pages
Normalization and Denormalization
No ratings yet
Normalization and Denormalization
44 pages
Normalization
No ratings yet
Normalization
57 pages
Normalization
No ratings yet
Normalization
19 pages
Normalization
No ratings yet
Normalization
17 pages
DBMS Unit-3
No ratings yet
DBMS Unit-3
28 pages
Dbms Normalization
No ratings yet
Dbms Normalization
5 pages
Normal Forms
No ratings yet
Normal Forms
30 pages
Unit 4
No ratings yet
Unit 4
19 pages
Normalization
No ratings yet
Normalization
11 pages
2nd and 3rd Unit
No ratings yet
2nd and 3rd Unit
87 pages
DBMS Module 3.2 PDF
No ratings yet
DBMS Module 3.2 PDF
22 pages
Fundamental of Database CH-5
No ratings yet
Fundamental of Database CH-5
34 pages
Unit II-Normalization
No ratings yet
Unit II-Normalization
23 pages
unit 4 rdbms s
No ratings yet
unit 4 rdbms s
8 pages
DBMS Unit-III (1)
No ratings yet
DBMS Unit-III (1)
42 pages
Normalization
No ratings yet
Normalization
17 pages
Database Normalization
No ratings yet
Database Normalization
9 pages
RDBMS Unit 4
No ratings yet
RDBMS Unit 4
15 pages
3169module 3 (Normalization) - 5th Semester - Computer Science and Engineering
No ratings yet
3169module 3 (Normalization) - 5th Semester - Computer Science and Engineering
12 pages
Unit 3 1
No ratings yet
Unit 3 1
11 pages
Normalization
No ratings yet
Normalization
17 pages
Normalization
No ratings yet
Normalization
23 pages
Normalization AND KEYS
No ratings yet
Normalization AND KEYS
19 pages
DBMS 20 Mark Questions
No ratings yet
DBMS 20 Mark Questions
12 pages
DBMS Normalization Normalization: Types of Normal Forms
No ratings yet
DBMS Normalization Normalization: Types of Normal Forms
17 pages
Dbms Assignment ON Normalization: Submitted By, R.Kiruba Sankar
No ratings yet
Dbms Assignment ON Normalization: Submitted By, R.Kiruba Sankar
10 pages
Normalization
No ratings yet
Normalization
42 pages
DBMS, unit-5
No ratings yet
DBMS, unit-5
9 pages
Normalization
No ratings yet
Normalization
48 pages
Unit-2-Normalization
No ratings yet
Unit-2-Normalization
11 pages
Normalization in DBMS11
No ratings yet
Normalization in DBMS11
12 pages
Normalization
No ratings yet
Normalization
30 pages
NORMALISATION
No ratings yet
NORMALISATION
15 pages
Normalization
No ratings yet
Normalization
18 pages
SQL Normalization
No ratings yet
SQL Normalization
18 pages
Normalization - AdBase - SUAREZ
No ratings yet
Normalization - AdBase - SUAREZ
27 pages
Normal Forms in DBMS
No ratings yet
Normal Forms in DBMS
3 pages
Unit 3 Updated FG
No ratings yet
Unit 3 Updated FG
16 pages
Research Activity
No ratings yet
Research Activity
9 pages
Normalization: Normalization Is A Systematic Way of Ensuring That A Database Structure Is Suitable For
No ratings yet
Normalization: Normalization Is A Systematic Way of Ensuring That A Database Structure Is Suitable For
6 pages
Unit 3 (KCS501)
No ratings yet
Unit 3 (KCS501)
13 pages
Normalization of Database Tables
No ratings yet
Normalization of Database Tables
30 pages
What Is Normalization
No ratings yet
What Is Normalization
4 pages
Normalization: Types of Normal Forms
No ratings yet
Normalization: Types of Normal Forms
12 pages
Normalization
No ratings yet
Normalization
23 pages
Noormalization 10
No ratings yet
Noormalization 10
26 pages
Normalization
No ratings yet
Normalization
13 pages
Normalization 1
No ratings yet
Normalization 1
26 pages
Unit - 3
No ratings yet
Unit - 3
22 pages
Database Normalization Tutorial
No ratings yet
Database Normalization Tutorial
14 pages
Database Normalization - New
No ratings yet
Database Normalization - New
8 pages
DBMS Unit-Iv
No ratings yet
DBMS Unit-Iv
51 pages
Unit 5: Data Normalization
No ratings yet
Unit 5: Data Normalization
27 pages
Normalization of Database Tables
No ratings yet
Normalization of Database Tables
44 pages
Normalization Lec4
No ratings yet
Normalization Lec4
29 pages
Unloved Bull Markets: Getting Rich the Easy Way by Riding Bull Markets
From Everand
Unloved Bull Markets: Getting Rich the Easy Way by Riding Bull Markets
Craig Callahan
2/5 (1)
circuit-analysis-1nbsped-9781617287039-9781617281068_compress
No ratings yet
circuit-analysis-1nbsped-9781617287039-9781617281068_compress
196 pages
ECU 105 ENGINEERING MATHEMATICS II (1)
No ratings yet
ECU 105 ENGINEERING MATHEMATICS II (1)
3 pages
EEE_227_Assigment[1]
No ratings yet
EEE_227_Assigment[1]
1 page
SCO206SIT250EEE207SST203 DATABASE SYSTEMS
No ratings yet
SCO206SIT250EEE207SST203 DATABASE SYSTEMS
3 pages
unit-2-notes-unit-2-note
No ratings yet
unit-2-notes-unit-2-note
29 pages
SCO206 DATABASE SYSTEMS 3
No ratings yet
SCO206 DATABASE SYSTEMS 3
2 pages
EEE 207 DATA BASE MANAGEMENT SYSTEMS
No ratings yet
EEE 207 DATA BASE MANAGEMENT SYSTEMS
3 pages
EEE 209 FLUID MECHANICS
No ratings yet
EEE 209 FLUID MECHANICS
4 pages
EEE 206 Electrical machines exam - supplementary
No ratings yet
EEE 206 Electrical machines exam - supplementary
3 pages
EEE 204 CIRCUIT THEORY II_ASSIGN I
No ratings yet
EEE 204 CIRCUIT THEORY II_ASSIGN I
3 pages
Hdcse4005 42 38
100% (1)
Hdcse4005 42 38
35 pages
Database 1 RJC 1
No ratings yet
Database 1 RJC 1
20 pages
An Introduction To Relational Database Management System
No ratings yet
An Introduction To Relational Database Management System
8 pages
Lec 5 Normalization
No ratings yet
Lec 5 Normalization
25 pages
UNIT 03-P3 Logical Data Modeling Using The Relational Model-1
No ratings yet
UNIT 03-P3 Logical Data Modeling Using The Relational Model-1
66 pages
Functional Dependency Notes
No ratings yet
Functional Dependency Notes
52 pages
A Guide To SQL 8th Edition Pratt Test Bank
No ratings yet
A Guide To SQL 8th Edition Pratt Test Bank
9 pages
Topic 2 - Normalization Notes
No ratings yet
Topic 2 - Normalization Notes
5 pages
Hibernate Notes by Sriman
50% (2)
Hibernate Notes by Sriman
206 pages
Online Admission System Project Report
No ratings yet
Online Admission System Project Report
40 pages
Normalization: Normalization Is A Method For Organizing Data Elements in A Database Into Tables
No ratings yet
Normalization: Normalization Is A Method For Organizing Data Elements in A Database Into Tables
4 pages
Normalization of Database-Ass-2
No ratings yet
Normalization of Database-Ass-2
31 pages
Commonly asked DBMS interview questionsSet 1
No ratings yet
Commonly asked DBMS interview questionsSet 1
3 pages
Chapter 4 - Normalization
No ratings yet
Chapter 4 - Normalization
65 pages
DBMS Interview Questions (2021) - Javatpoint
No ratings yet
DBMS Interview Questions (2021) - Javatpoint
17 pages
Datagu
No ratings yet
Datagu
20 pages
Unit - 3
No ratings yet
Unit - 3
40 pages
DMS (22319) - Chapter 2 Notes
No ratings yet
DMS (22319) - Chapter 2 Notes
133 pages
Niis Project SIP
No ratings yet
Niis Project SIP
61 pages
Normalization
No ratings yet
Normalization
8 pages
Prof. Sin-Min Lee Grade: - /10
No ratings yet
Prof. Sin-Min Lee Grade: - /10
4 pages
First Normal Form: First Normal Form (1NF) Is A Property of A Relation in A Relational Database. A
No ratings yet
First Normal Form: First Normal Form (1NF) Is A Property of A Relation in A Relational Database. A
4 pages
Normalization
No ratings yet
Normalization
41 pages
Mini Project Mca
No ratings yet
Mini Project Mca
49 pages
Final A Thesis Report E
No ratings yet
Final A Thesis Report E
47 pages
DBMS Unit-3 Notes
No ratings yet
DBMS Unit-3 Notes
23 pages
Online Banking Management System.
No ratings yet
Online Banking Management System.
50 pages
Steps On Normalization
No ratings yet
Steps On Normalization
2 pages
Bike&Scooter PDF
No ratings yet
Bike&Scooter PDF
27 pages

lesson10 Normalization

Uploaded by

lesson10 Normalization

Uploaded by

NORMALIZATION

Introduction and meaning of Database normalization

Forms of normalization/Normalization rules

First normal form (1NF)

Performing Normalization by example

Un-Normalized Form (UNF)

First Normal Form (1NF)

Second Normal Form (2NF)

Emp-Id Emp-Name Month Sales Bank-Id

Third Normal Form (3NF)

What is a transitive dependency? Within a relation if we see

Boyce-Code Normal Form (BCNF)

1. The candidate keys are composite.

Professor Code Department Head of Dept. Percent Time

Consider, as an example, the above relation. It is assumed that:

1. A professor can work in more than one department

Professor Code Department Percent Time

Fourth Normal Form (4NF)

1. A vendor is capable of supplying many items.

Vendor Code Item Code Project No.

The given relation has a number of problems. For example:

Vendor Code Item Code Vendor Code Project No.

Project No. Item Code

Let us finally summarize the normalization steps we have discussed so far.

Input Transformation Output

You might also like