DBMS Unit-III (1)

Normalization is a process used in database design to minimize data redundancy and eliminate anomalies such as insertion, deletion, and update anomalies by decomposing larger tables into smaller, well-structured relations. It involves several normal forms, including 1NF, 2NF, 3NF, BCNF, and others, each with specific criteria to ensure data integrity and efficiency. While normalization offers advantages like reduced redundancy and improved organization, it can also lead to performance issues and requires careful planning to avoid poor database design.

Uploaded by

helper bisht

Normalization

A large database defined as a single relation may result in data duplication. This repetition of data
may result in:

• Relations that grow very large.
• Data that is hard to maintain and update, because a change involves searching many records in the relation.
• Wastage and poor utilization of disk space and resources.
• A greater likelihood of errors and inconsistencies.

To handle these problems, we analyze and decompose relations with redundant data
into smaller, simpler, well-structured relations that satisfy desirable properties.
Normalization is the process of decomposing relations into relations with fewer attributes.

What is Normalization?
• Normalization is the process of organizing the data in the database.
• Normalization is used to minimize the redundancy in a relation or set of relations. It is also
used to eliminate undesirable characteristics like insertion, update, and deletion anomalies.
• Normalization divides a larger table into smaller tables and links them using relationships.
• Normal forms are used to reduce redundancy in database tables.
Why do we need Normalization?

The main reason for normalizing relations is to remove insertion, update, and deletion anomalies. Failure to eliminate
these anomalies leads to data redundancy and can cause data integrity and other problems as the database
grows. Normalization consists of a series of guidelines that help you create a good
database structure.

Data modification anomalies can be categorized into three types:

• Insertion Anomaly: An insertion anomaly occurs when a new tuple cannot be inserted into a
relation because some other data is missing.
• Deletion Anomaly: A deletion anomaly occurs when the deletion of data results
in the unintended loss of some other important data.
• Update Anomaly: An update anomaly occurs when an update of a single data value requires
multiple rows of data to be updated.

Normal Form   Description
1NF    A relation is in 1NF if every attribute contains only atomic values.
2NF    A relation is in 2NF if it is in 1NF and all non-key attributes are fully functionally dependent on
       the primary key.
3NF    A relation is in 3NF if it is in 2NF and no transitive dependency exists.
BCNF   A stronger definition of 3NF is known as Boyce-Codd normal form.
4NF    A relation is in 4NF if it is in Boyce-Codd normal form and has no multi-valued dependency.
5NF    A relation is in 5NF if it is in 4NF, does not contain any join dependency, and joining is
       lossless.
Advantages of Normalization

 Normalization helps to minimize data redundancy.

 Greater overall database organization.
 Data consistency within the database.
 Much more flexible database design.
 Enforces the concept of relational integrity.

Disadvantages of Normalization

 You cannot start building the database before knowing what the user needs.
 The performance degrades when normalizing the relations to higher normal forms, i.e., 4NF, 5NF.
 It is very time-consuming and difficult to normalize relations of a higher degree.
 Careless decomposition may lead to a bad database design, leading to serious problems.
First Normal Form (1NF)
• A relation is in 1NF if every attribute contains only atomic values.
• An attribute of a table cannot hold multiple values. It must hold only a single
value.
• First normal form disallows multi-valued attributes, composite attributes, and their combinations.

Example: Relation EMPLOYEE is not in 1NF because of multi-valued attribute EMP_PHONE.

EMPLOYEE table:

EMP_ID  EMP_NAME  EMP_PHONE                 EMP_STATE
14      John      7272826385, 9064738238    UP
20      Harry     8574783832                Delhi
12      Sam       7390372389, 8589830302    Punjab

The decomposition of the EMPLOYEE table into 1NF is shown below:

EMP_ID EMP_NAME EMP_PHONE EMP_STATE

14 John 7272826385 UP
14 John 9064738238 UP
20 Harry 8574783832 Delhi
12 Sam 7390372389 Punjab
12 Sam 8589830302 Punjab
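
The same conversion can be sketched in a few lines of Python. This is only an illustration, assuming the multi-valued EMP_PHONE cell arrives as a comma-separated string; the function name `to_1nf` is made up for this example.

```python
# EMPLOYEE rows from the example above. EMP_PHONE holds a comma-separated
# list, which is the 1NF violation.
employee = [
    (14, "John", "7272826385, 9064738238", "UP"),
    (20, "Harry", "8574783832", "Delhi"),
    (12, "Sam", "7390372389, 8589830302", "Punjab"),
]

def to_1nf(rows):
    """Emit one row per phone number so that every cell holds a single value."""
    result = []
    for emp_id, name, phones, state in rows:
        for phone in phones.split(","):
            result.append((emp_id, name, phone.strip(), state))
    return result

employee_1nf = to_1nf(employee)
for row in employee_1nf:
    print(row)  # five rows, one phone number each
```

Note that the 1NF table repeats EMP_NAME and EMP_STATE for each phone number. Removing that repetition is the job of the higher normal forms.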
Second Normal Form (2NF)
Before we learn about the second normal form, we need to understand the following −

• Prime attribute − An attribute that is part of some candidate key is known as a prime
attribute.
• Non-prime attribute − An attribute that is not part of any candidate key is said to be a non-prime
attribute.

In second normal form, every non-prime attribute must be fully functionally
dependent on each candidate key. That is, if X → A holds for a candidate key X and a non-prime attribute A, then there should not be any proper
subset Y of X for which Y → A also holds.

• In 2NF, the relation must be in 1NF.
• In the second normal form, all non-key attributes are fully functionally dependent on the primary
key.

TEACHER table:

TEACHER_ID SUBJECT TEACHER_AGE
25 Chemistry 30
25 Biology 30
47 English 35
83 Math 38
83 Computer 38

In the given table, the non-prime attribute TEACHER_AGE depends on TEACHER_ID, which is a
proper subset of the candidate key {TEACHER_ID, SUBJECT}. That is why the table violates the rule for 2NF.
To convert the given table into 2NF, we decompose it into two tables:

TEACHER_DETAIL table:

TEACHER_ID TEACHER_AGE
25 30
47 35
83 38

TEACHER_SUBJECT table:

TEACHER_ID SUBJECT
25 Chemistry
25 Biology
47 English
83 Math
83 Computer
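
As a rough check of this decomposition, the sketch below projects the TEACHER rows onto the two new tables with Python sets and then joins them back on TEACHER_ID. The variable names are illustrative only.

```python
teacher = [
    (25, "Chemistry", 30),
    (25, "Biology", 30),
    (47, "English", 35),
    (83, "Math", 38),
    (83, "Computer", 38),
]

# Project onto (TEACHER_ID, TEACHER_AGE) and (TEACHER_ID, SUBJECT).
# Using sets drops the repeated (id, age) pairs, which is where the
# redundancy disappears.
teacher_detail = {(tid, age) for tid, _subject, age in teacher}
teacher_subject = {(tid, subject) for tid, subject, _age in teacher}

# Natural join on TEACHER_ID to confirm that no information was lost.
rejoined = {
    (tid, subject, age)
    for tid, subject in teacher_subject
    for detail_id, age in teacher_detail
    if tid == detail_id
}

print(sorted(teacher_detail))    # [(25, 30), (47, 35), (83, 38)]
print(rejoined == set(teacher))  # True
```

Each teacher's age is now stored once, and the join reproduces exactly the original rows, so the decomposition is lossless for this data.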
Third Normal Form (3NF)
• A relation is in 3NF if it is in 2NF and does not contain any transitive dependency of a non-prime attribute on a candidate key.
• 3NF is used to reduce data duplication. It is also used to achieve data integrity.
• If a relation in 2NF has no transitive dependency for non-prime attributes, then the relation is in third
normal form.

A relation is in third normal form if at least one of the following conditions holds for every non-trivial
functional dependency X → Y:

1. X is a super key.
2. Y is a prime attribute, i.e., each element of Y is part of some candidate key.

EMPLOYEE_DETAIL table:

EMP_ID EMP_NAME EMP_ZIP EMP_STATE EMP_CITY
222 Harry 201010 UP Noida
333 Stephan 02228 US Boston
444 Lan 60007 US Chicago
555 Katharine 06389 UK Norwich
666 John 462007 MP Bhopal
Super key in the table above:
{EMP_ID}, {EMP_ID, EMP_NAME}, {EMP_ID, EMP_NAME, EMP_ZIP}....so on

Candidate key: {EMP_ID}

Non-prime attributes: In the given table, all attributes except EMP_ID are non-prime.

Here, EMP_STATE and EMP_CITY depend on EMP_ZIP, and EMP_ZIP depends on EMP_ID. The
non-prime attributes (EMP_STATE, EMP_CITY) are therefore transitively dependent on the super key (EMP_ID). This
violates the rule of third normal form.

That is why we move EMP_CITY and EMP_STATE to a new EMPLOYEE_ZIP table,
with EMP_ZIP as its primary key.

EMPLOYEE table:

EMP_ID EMP_NAME EMP_ZIP
222 Harry 201010
333 Stephan 02228
444 Lan 60007
555 Katharine 06389
666 John 462007

EMPLOYEE_ZIP table:

EMP_ZIP EMP_STATE EMP_CITY
201010 UP Noida
02228 US Boston
60007 US Chicago
06389 UK Norwich
462007 MP Bhopal
Boyce Codd normal form (BCNF)
• BCNF is the advanced version of 3NF. It is stricter than 3NF.
• A table is in BCNF if, for every non-trivial functional dependency X → Y, X is a super key of the table.
• For BCNF, the table should be in 3NF, and for every FD, the LHS is a super key.

Example: Let's assume there is a company where employees work in more than one department.
EMPLOYEE table:
EMP_ID EMP_COUNTRY EMP_DEPT DEPT_TYPE EMP_DEPT_NO
264 India Designing D394 283
264 India Testing D394 300
364 UK Stores D283 232
364 UK Developing D283 549

In the above table, the functional dependencies are as follows:

EMP_ID → EMP_COUNTRY
EMP_DEPT → {DEPT_TYPE, EMP_DEPT_NO}

Candidate key: {EMP_ID, EMP_DEPT}

The table is not in BCNF because neither EMP_DEPT nor EMP_ID alone is a key.
To convert the given table into BCNF, we decompose it into three tables:

EMP_COUNTRY table:

EMP_ID EMP_COUNTRY
264 India
364 UK

EMP_DEPT table:

EMP_DEPT DEPT_TYPE EMP_DEPT_NO
Designing D394 283
Testing D394 300
Stores D283 232
Developing D283 549

EMP_DEPT_MAPPING table:

EMP_ID EMP_DEPT
264 Designing
264 Testing
364 Stores
364 Developing

Functional dependencies:

EMP_ID → EMP_COUNTRY
EMP_DEPT → {DEPT_TYPE, EMP_DEPT_NO}

Candidate keys:

For the first table: EMP_ID
For the second table: EMP_DEPT
For the third table: {EMP_ID, EMP_DEPT}

Now the design is in BCNF because the left side of each functional dependency is a
key of its table.
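
One way to test the BCNF condition mechanically is to compute the attribute closure of each left-hand side and compare it with the full attribute set. The sketch below does this for the EMPLOYEE example; the helper names `closure` and `bcnf_violations` are assumptions made for illustration, not part of any library.

```python
def closure(attrs, fds):
    """Return every attribute determined by attrs under the given FDs."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if lhs <= result and not rhs <= result:
                result |= rhs
                changed = True
    return result

def bcnf_violations(attributes, fds):
    """List the non-trivial FDs whose left side is not a super key."""
    return [
        (lhs, rhs)
        for lhs, rhs in fds
        if not rhs <= lhs and closure(lhs, fds) != attributes
    ]

fds = [
    (frozenset({"EMP_ID"}), frozenset({"EMP_COUNTRY"})),
    (frozenset({"EMP_DEPT"}), frozenset({"DEPT_TYPE", "EMP_DEPT_NO"})),
]
employee = frozenset(
    {"EMP_ID", "EMP_COUNTRY", "EMP_DEPT", "DEPT_TYPE", "EMP_DEPT_NO"}
)

# In the original table both FDs have a left side that is not a super key.
print(len(bcnf_violations(employee, fds)))  # 2

# {EMP_ID, EMP_DEPT} determines everything, so it is a key.
print(closure({"EMP_ID", "EMP_DEPT"}, fds) == employee)  # True

# After decomposition, EMP_COUNTRY(EMP_ID, EMP_COUNTRY) has no violation.
emp_country = frozenset({"EMP_ID", "EMP_COUNTRY"})
print(bcnf_violations(emp_country, fds[:1]))  # []
```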
Functional Dependency
A functional dependency is a relationship between two sets of attributes of a relation, in which one determines the other. It typically exists between the primary key and
the non-key attributes within a table. Introduced by E. F. Codd, it helps in
preventing data redundancy and in detecting bad designs.

To understand the concept, consider a relation P with attributes X and Y.
A functional dependency is represented by → (arrow sign).

The following represents the functional dependency between the attributes −

X → Y

The left side of an FD is known as the determinant, and the right side is known as the
dependent.
Assume we have an employee table with attributes Emp_Id, Emp_Name, and Emp_Address.
Here the Emp_Id attribute uniquely identifies the Emp_Name attribute of the employee table, because if we
know the Emp_Id, we can tell the employee name associated with it.

This functional dependency can be written as:

Emp_Id → Emp_Name

We can say that Emp_Name is functionally dependent on Emp_Id.
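
Whether a set of rows is consistent with a functional dependency can be checked directly: any two rows that agree on the determinant must also agree on the dependent. The sketch below is a minimal illustration; the sample rows and the name `fd_holds` are invented for this example.

```python
def fd_holds(rows, lhs, rhs):
    """Return True if rows that agree on lhs always agree on rhs."""
    seen = {}
    for row in rows:
        key = tuple(row[a] for a in lhs)
        value = tuple(row[a] for a in rhs)
        # Remember the first dependent value seen for each determinant value.
        if seen.setdefault(key, value) != value:
            return False
    return True

# Hypothetical sample data for the employee table.
employees = [
    {"Emp_Id": 1, "Emp_Name": "Asha", "Emp_Address": "Delhi"},
    {"Emp_Id": 2, "Emp_Name": "Ravi", "Emp_Address": "Pune"},
    {"Emp_Id": 3, "Emp_Name": "Asha", "Emp_Address": "Noida"},
]

print(fd_holds(employees, ["Emp_Id"], ["Emp_Name"]))       # True
print(fd_holds(employees, ["Emp_Name"], ["Emp_Address"]))  # False
```

A check like this can only refute a dependency. A functional dependency is a statement about every valid state of the relation, so passing on one sample does not prove that it holds in general.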

Types of Functional dependency

Trivial Functional Dependency

It occurs when B is a subset of A in A → B.

Example
Consider a <Department> table with two attributes, DeptId and DeptName, to understand the
concept of trivial dependency.
The following is a trivial functional dependency, since DeptId is a subset
of {DeptId, DeptName}:

{DeptId, DeptName} → DeptId

Non-Trivial Functional Dependency

It occurs when B is not a subset of A in A → B.

Example

DeptId → DeptName

The above is a non-trivial functional dependency, since DeptName is not a subset of DeptId.

When the intersection of A and B is empty, A → B is called completely non-trivial.

Armstrong's Inference Rules (IR):
• Armstrong's axioms are the basic inference rules for functional dependencies.
• Armstrong's axioms are used to infer functional dependencies on a relational database.
• An inference rule is a type of assertion. It can be applied to a set of FDs (functional dependencies) to
derive other FDs.
• Using the inference rules, we can derive additional functional dependencies from the initial set.

There are six inference rules for functional dependencies. The first three are Armstrong's axioms proper; the remaining three can be derived from them.

1. Reflexive Rule (IR1)
In the reflexive rule, if Y is a subset of X, then X determines Y.
If X ⊇ Y then X → Y
Example:

X = {a, b, c, d, e}
Y = {a, b, c}

Since Y is a subset of X, X → Y holds.

2. Augmentation Rule (IR2)

In augmentation, if X determines Y, then XZ
determines YZ for any Z.

If X → Y then XZ → YZ
Example

For R(ABCD), if A → B then AC → BC

3. Transitive Rule (IR3)

In the transitive rule, if X determines Y and Y determines Z, then X must also determine Z.

If X → Y and Y → Z then X → Z

4. Union Rule (IR4)

Union rule says, if X determines Y and X determines Z, then X must also determine Y and Z.

If X → Y and X → Z then X → YZ

5. Decomposition Rule (IR5)

The decomposition rule is also known as the projective rule. It is the reverse of the union rule.
This rule says that if X determines Y and Z, then X determines Y and X determines Z separately.

If X → YZ then X → Y and X → Z

6. Pseudo transitive Rule (IR6)

In Pseudo transitive Rule, if X determines Y and YZ determines W, then XZ determines W.

If X → Y and YZ → W then XZ → W
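
Rules IR4 to IR6 need not be taken on faith, since each follows from IR1 to IR3. The derivations below are a worked sketch, using the same symbols as the rules above and writing XY for the union of X and Y.

```latex
\begin{align*}
\intertext{Union (IR4): from $X \to Y$ and $X \to Z$, derive $X \to YZ$.}
1.\quad & X \to Y   && \text{given} \\
2.\quad & X \to XY  && \text{IR2 on (1), augmenting with } X \\
3.\quad & X \to Z   && \text{given} \\
4.\quad & XY \to YZ && \text{IR2 on (3), augmenting with } Y \\
5.\quad & X \to YZ  && \text{IR3 on (2) and (4)} \\
\intertext{Decomposition (IR5): from $X \to YZ$, derive $X \to Y$.}
1.\quad & X \to YZ  && \text{given} \\
2.\quad & YZ \to Y  && \text{IR1, since } Y \subseteq YZ \\
3.\quad & X \to Y   && \text{IR3 on (1) and (2)} \\
\intertext{Pseudo transitivity (IR6): from $X \to Y$ and $YZ \to W$, derive $XZ \to W$.}
1.\quad & X \to Y   && \text{given} \\
2.\quad & XZ \to YZ && \text{IR2 on (1), augmenting with } Z \\
3.\quad & YZ \to W  && \text{given} \\
4.\quad & XZ \to W  && \text{IR3 on (2) and (3)}
\end{align*}
```

The derivation of X → Z in the decomposition rule is the same as for X → Y, using YZ → Z in step 2.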
Relational Data Model

The relational data model is the primary data model, used widely around the world for data
storage and processing. The model is simple, and it has all the properties and capabilities required to
process data with storage efficiency.

Concepts
Tables − In the relational data model, relations are saved in the format of tables. This format stores the relation among entities. A
table has rows and columns, where rows represent records and columns represent attributes.
Tuple − A single row of a table, which contains a single record for that relation, is called a tuple.
Relation instance − A finite set of tuples in the relational database system represents a relation instance. Relation instances do
not have duplicate tuples.
Relation schema − A relation schema describes the relation name (table name), its attributes, and their names.
Relation key − Each row has one or more attributes, known as the relation key, which can identify the row in the relation (table)
uniquely.
Attribute domain − Every attribute has a pre-defined set of permitted values, known as its attribute domain.

Constraints
Every relation has some conditions that must hold for it to be a valid relation. These conditions are called Relational Integrity
Constraints. There are three main integrity constraints −
•Key constraints
•Domain constraints
•Referential integrity constraints
Key Constraints

There must be at least one minimal subset of attributes in the relation that can identify a tuple uniquely. This minimal subset
of attributes is called a key for that relation. If there is more than one such minimal subset, these are called candidate keys.

Key constraints enforce that −

•in a relation with a key attribute, no two tuples can have identical values for the key attributes.
•a key attribute cannot have NULL values.

Key constraints are also referred to as Entity Constraints.

Domain Constraints

Attributes have specific values in real-world scenarios. For example, age can only be a positive integer. The same kind of constraint
is applied to the attributes of a relation. Every attribute is bound to a specific range of values. For
example, age cannot be less than zero and telephone numbers cannot contain a digit outside 0-9.

Referential integrity Constraints

Referential integrity constraints work on the concept of foreign keys. A foreign key is an attribute of a relation that refers to a key attribute of
another relation (or of the same relation).
A referential integrity constraint states that if a relation refers to a key attribute of a different or the same relation, then that key
value must exist.
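
The three kinds of constraints can be seen in action with Python's built-in sqlite3 module. The sketch below is illustrative only: the department and employee tables and the `rejected` helper are made up for the example, and SQLite is just one convenient engine to try it on.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite enforces foreign keys only when this pragma is switched on.
conn.execute("PRAGMA foreign_keys = ON")

conn.execute("""
    CREATE TABLE department (
        dept_id INTEGER PRIMARY KEY,
        name    TEXT NOT NULL
    )
""")
conn.execute("""
    CREATE TABLE employee (
        emp_id  INTEGER PRIMARY KEY,                               -- key constraint
        age     INTEGER NOT NULL CHECK (age >= 0),                 -- domain constraint
        dept_id INTEGER NOT NULL REFERENCES department(dept_id)    -- referential integrity
    )
""")
conn.execute("INSERT INTO department VALUES (1, 'Design')")
conn.execute("INSERT INTO employee VALUES (100, 30, 1)")

def rejected(sql):
    """Run one statement and report whether the database refused it."""
    try:
        conn.execute(sql)
    except sqlite3.IntegrityError:
        return True
    return False

print(rejected("INSERT INTO employee VALUES (100, 41, 1)"))  # True: duplicate key
print(rejected("INSERT INTO employee VALUES (101, -5, 1)"))  # True: age outside its domain
print(rejected("INSERT INTO employee VALUES (102, 25, 9)"))  # True: department 9 does not exist
```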
Features of a relational database
Relational databases are expected to provide ACID characteristics.
ACID refers to four essential properties: Atomicity, Consistency, Isolation, and Durability.
These properties are one of the key differences between a relational database and many non-relational
databases.

Atomicity
Atomicity treats all the elements of a database transaction as a single unit of work.
It requires all tasks in the transaction to succeed; otherwise the transaction is rolled back
and none of its changes are kept.
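
A small sketch of the all-or-nothing behaviour, using Python's built-in sqlite3 module. The account table and the `transfer` function are assumptions made for this example; the CHECK constraint stands in for a business rule that balances may not go negative.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE account (id INTEGER PRIMARY KEY, balance INTEGER CHECK (balance >= 0))"
)
conn.execute("INSERT INTO account VALUES (1, 100), (2, 50)")
conn.commit()

def transfer(src, dst, amount):
    """Move money between accounts; either both updates apply or neither does."""
    try:
        with conn:  # commits on success, rolls back if an exception escapes
            conn.execute(
                "UPDATE account SET balance = balance + ? WHERE id = ?", (amount, dst)
            )
            conn.execute(
                "UPDATE account SET balance = balance - ? WHERE id = ?", (amount, src)
            )
        return True
    except sqlite3.IntegrityError:
        return False

def balances():
    return dict(conn.execute("SELECT id, balance FROM account"))

print(transfer(1, 2, 30), balances())   # True {1: 70, 2: 80}
print(transfer(1, 2, 500), balances())  # False {1: 70, 2: 80}
```

In the second call the credit to account 2 has already been applied when the debit fails, yet the rollback removes it, so no half-finished transfer is ever left behind.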

Consistency
The state of the database must remain consistent throughout the transaction.
Consistency defines the rules for maintaining data points. This ensures they remain in a correct
state after a transaction.
Relational databases have data consistency because the information is updated across
applications and database copies (also known as ‘instances’). This means multiple instances
always have the same data.

Isolation
With a relational database, each transaction is separate and not dependent on others. This is
made possible by isolation.
Isolation keeps the effect of a transaction invisible to other transactions until it is committed.
Uses and benefits of a relational database

Relational databases are often the backbone of a customer relationship management (CRM) system —
such as Salesforce.

But tracking customer transactions is just one use case for a relational database. There are many
others. We even use some in everyday life. For example, when you withdraw money from an ATM, your
bank balance may instantly update on your mobile app if it’s using a relational database. This is
because this scenario’s data point (“Account Balance”) is consistently updated across all platforms.

There are multiple benefits of using a relational database over a non-relational database. And many
of these affect other systems, including Salesforce.

Some of the main advantages of a relational database are:

Data consistency
As mentioned when we outlined ACID, a core part of a relational database is consistency.

A relational database model ensures that all users always see the same data.

This improves understanding across a business because everyone sees the same information. This
ensures that nobody makes business decisions based on out-of-date information.
Data working together

All the data in a relational database has a ‘relationship’ with other data. Columns are built in a way that
makes it easy to establish relationships among data points.

Data working together gives a more holistic view of all your data — including your customers.

Data flexibility

Relational databases allow for flexibility. Users can change what they see. And it’s easy to add
additional data at a later time.

A relational database also allows for a subset of data to be viewed. This means you can hide certain
data if some users only need access to a specific set of columns or rows.
Codd's Rules
Not every database that has tables and constraints can be called a relational database system. And a
database that only uses the relational data model is not automatically a Relational Database Management System (RDBMS).
So, some rules define what a database must satisfy to be a true RDBMS. These rules were proposed in 1985 by Dr.
Edgar F. Codd (E.F. Codd), who had done extensive research on the relational model of
database systems. Codd presented thirteen rules, numbered 0 to 12, to test a DBMS against his
relational model; a database that follows the rules is called a true relational database
(RDBMS). These thirteen rules are popularly known as Codd's 12 rules.
Rule 0: The Foundation Rule

The database must be in relational form. So that the system can handle the database through its relational
capabilities.

Rule 1: Information Rule

A database contains various information, and this information must be stored in each cell of a table in the form of
rows and columns.

Rule 2: Guaranteed Access Rule

Every single or precise data (atomic value) may be accessed logically from a relational database using the
combination of primary key value, table name, and column name.

Rule 3: Systematic Treatment of Null Values

This rule defines the systematic treatment of null values in database records. A null value can have various meanings
in the database, such as missing data, no value in a cell, inapplicable information, or unknown data. The primary
key should not be null.

Rule 4: Active/Dynamic Online Catalog based on the relational model

The entire logical structure of the database must be described in an online catalog, known as the
database dictionary. Authorized users can access this catalog using the same query language that they use to access
the database itself.

Rule 5: Comprehensive Data Sub Language Rule

Rule 6: View Updating Rule

All views that are theoretically updatable must also be updatable in practice by the database system.

Rule 7: Relational Level Operation (High-Level Insert, Update and delete) Rule

A database system should support high-level relational operations such as insert, update, and delete on a set of rows as well as on
a single row. It should also support union, intersection, and minus operations.

Rule 8: Physical Data Independence Rule

Applications that access the database must be independent of how the data is physically stored.
If the physical structure of the database is
changed, it should have no effect on external applications that access the data in the database.

Rule 9: Logical Data Independence Rule

It is similar to physical data independence. If any change occurs at the logical level (table structures),
it should not affect the user's view (application). For example, if a table is split into two tables, or two
tables are joined to create a single table, these changes should not affect the user's view of the application.

Rule 10: Integrity Independence Rule

A database must maintain integrity independently of the applications that use it. Integrity constraints should be
definable in the database's query language (such as SQL) and stored in the database, rather than relying on any external factor or application to maintain
integrity. This also helps in making the database independent of each front-end application.

Rule 11: Distribution Independence Rule

The distribution independence rule states that a database must work properly even if it is stored
in different locations and used by different end users. When a user accesses the database through
an application, they should not be aware that the data is distributed; to them, the
data should always appear to be located at one site. End users should be able to run
their SQL queries in the same way regardless of where the data is stored.

Rule 12: Non Subversion Rule

The non-subversion rule concerns an RDBMS that uses SQL to store and manipulate the data in the
database. If a system also has a low-level or separate language other than SQL to access the database,
that language should not be able to subvert or bypass the integrity rules.
Database Schema
A database schema is a structure that represents the logical storage of the data in a
database. It represents the organization of data and provides information about the relationships
between the tables in a given database. In this topic, we will understand more about database schema
and its types. Before understanding database schema, let's first understand what a database is.

What is a Database?
A database is a place to store information. It can store the simplest data, such as a list of people as
well as the most complex data. The database stores the information in a well-structured format.

What is a Database Schema?

• A database schema is the logical representation of a database, which shows how the data is stored
logically in the entire database. It contains a list of attributes and instructions that inform the
database engine how the data is organized and how the elements are related to each other.
• A database schema contains schema objects that may include tables, fields, packages, views,
relationships, primary keys, and foreign keys.
• In practice, the data is physically stored in files that may be in unstructured form, but to retrieve it
and use it, we need to put it in a structured form. A database schema is used to do this. It provides
knowledge about how the data is organized in a database and how it is associated with other data.
• The schema does not physically contain the data itself; instead, it gives information
about the shape of the data and how it can be related to other tables or models.
• A database schema includes the following:
  • Consistent formatting for all data entries.
  • Database objects and unique keys for all data entries.
  • Tables with multiple columns, where each column has a name and a datatype.
• The complexity and size of the schema vary with the size of the project. It helps developers to
easily manage and structure the database before coding it.
• The given diagram is an example of a database schema. It contains three tables and their data types.
It also represents the relationships between the tables, as well as the primary keys and foreign keys.

Types of Database Schema

The database schema is divided into three types, which are:

1.Physical Schema
2.View Schema
3.Logical Schema
1. Physical Database Schema

A physical database schema specifies how the data is stored physically on a storage system or disk
storage in the form of Files and Indices. Designing a database at the physical level is called
a physical schema.
2. View Schema

The view level design of a database is known as view schema. This schema generally describes the
end-user interaction with the database systems.
3. Logical Database Schema

The logical database schema specifies all the logical constraints that need to be applied to the stored
data. It defines the views, integrity constraints, and tables. Here the term integrity
constraints refers to the set of rules used by the DBMS (Database Management System) to
maintain the quality of the data on insertion and update. The logical schema represents how the data is
stored in the form of tables and how the attributes of a table are linked together.

Programmers and administrators work at this level, and the implementation of the data structures is
hidden at this level.

Various tools are used to create a logical database schema, and these tools demonstrate the
relationships between the component of your data; this process is called ER modelling.

The ER modelling stands for entity-relationship modelling, which specifies the relationships between
In the given example, the Ids are given in each circle, and these Ids are primary key & foreign keys.

The primary key is used to uniquely identify the entry in a document or record. The Ids of the upper
three circles are the primary keys.

A foreign key refers to the primary key of another table. FK represents the foreign key in the
diagram. It relates one table to another table.
Relational Algebra

Relational database systems are expected to be equipped with a query language that can assist its
users to query the database instances. There are two kinds of query languages − relational algebra
and relational calculus.

Relational Algebra

Relational algebra is a procedural query language, which takes instances of relations as input and
yields instances of relations as output. It uses operators to perform queries. An operator can be
either unary or binary. They accept relations as their input and yield relations as their output.
Relational algebra is performed recursively on a relation and intermediate results are also considered
relations.

The fundamental operations of relational algebra are as follows −

• Select
• Project
• Union
• Set difference
• Cartesian product
• Rename
Select Operation (σ)
It selects tuples that satisfy the given predicate from a relation.

Notation − σ_p(r)

Where σ is the selection operator, p is the selection predicate, and r is the relation. p is a propositional logic formula which
may use connectors like and, or, and not. Its terms may use relational operators like
=, ≠, ≥, <, >, ≤.

For example − σ_{subject = "database"}(Books)

Output − Selects tuples from Books where subject is 'database'.

σ_{subject = "database" and price = "450"}(Books)

Output − Selects tuples from Books where subject is 'database' and price is 450.

σ_{subject = "database" and price = "450" or year > "2010"}(Books)

Output − Selects tuples from Books where subject is 'database' and price is 450, or those books
published after 2010.
Project Operation (∏)
It projects the listed column(s) of a relation.

Notation − ∏_{A1, A2, ..., An}(r)

Where A1, A2, ..., An are attribute names of relation r.

Duplicate rows are automatically eliminated, as a relation is a set.

For example − ∏_{subject, author}(Books)

Selects and projects columns named as subject and author from the relation Books.
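
The select and project operations can be imitated on a list of Python dictionaries. This is only a sketch: the sample Books rows and the function names `select` and `project` are invented for illustration.

```python
# Hypothetical contents of the Books relation.
books = [
    {"title": "DB Concepts", "subject": "database", "author": "Rao", "price": 450, "year": 2008},
    {"title": "SQL Basics", "subject": "database", "author": "Rao", "price": 300, "year": 2012},
    {"title": "Networks", "subject": "networking", "author": "Sen", "price": 450, "year": 2015},
]

def select(relation, predicate):
    """σ: keep the tuples for which the predicate is true."""
    return [row for row in relation if predicate(row)]

def project(relation, attributes):
    """∏: keep only the named attributes and drop duplicate rows."""
    result = []
    for row in relation:
        reduced = {a: row[a] for a in attributes}
        if reduced not in result:
            result.append(reduced)
    return result

# σ with predicate: subject = "database" and price = 450
matches = select(books, lambda b: b["subject"] == "database" and b["price"] == 450)
print([b["title"] for b in matches])  # ['DB Concepts']

# ∏ over subject, author: the two 'database'/'Rao' rows collapse into one.
print(project(books, ["subject", "author"]))
```

Because both functions take a relation and return a relation, they compose, for example `project(select(books, ...), ["title"])`. This mirrors the statement that intermediate results in relational algebra are themselves relations.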
Relational Calculus
There is an alternate way of formulating queries known as relational calculus. Relational calculus is a
non-procedural query language. In a non-procedural query language, the user is not concerned with the
details of how to obtain the end results. Relational calculus tells what to do but never explains
how to do it. Most commercial relational languages are based on aspects of relational calculus, including
SQL, QBE, and QUEL.

Why it is called Relational Calculus?

It is based on predicate calculus, a branch of symbolic logic. A predicate is a
truth-valued function with arguments. On substituting values for the arguments, the function results in
an expression called a proposition, which can be either true or false. Relational calculus is a tailored version of a subset of
predicate calculus used to communicate with the relational database.

Many calculus expressions involve the use of quantifiers. There are two types of
quantifiers:

•Universal Quantifier: The universal quantifier, denoted by ∀, is read as "for all", which means that
all tuples in a given set satisfy a given condition.
•Existential Quantifier: The existential quantifier, denoted by ∃, is read as "there exists", which means that
in a given set of tuples there is at least one tuple whose values satisfy a given condition.

Before using quantifiers in formulas, we need to know the concept of free and bound
variables.
A tuple variable t is bound if it is quantified, that is, if it appears within the scope of a ∀ or ∃ quantifier; otherwise it is free.
Types of Relational calculus:

1. Tuple Relational Calculus (TRC)

It is a non-procedural query language based on finding the tuple variables (also known
as range variables) for which a predicate holds true. It describes the desired information without giving a
specific procedure for obtaining that information. Tuple relational calculus is used to select
tuples in a relation. In TRC, the filtering variable ranges over the tuples of a relation. The result can
have one or more tuples.
Notation:
A query in the tuple relational calculus is expressed in the following notation:

{T | P (T)} or {T | Condition (T)}

Where
T is the resulting tuple and P(T) is the condition that T must satisfy.
Example

{ T.name | Author(T) AND T.article = 'database' }

Output: This query selects the tuples from the AUTHOR relation. It returns a tuple with 'name' from
Author who has written an article on 'database’.

TRC (tuple relation calculus) can be quantified. In TRC, we can use Existential (∃) and Universal
Quantifiers (∀).

Example

{ R| ∃T ∈ Authors(T.article='database' AND R.name=T.name)}

Output: This query will yield the same result as the previous one.
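
Python set comprehensions read much like TRC queries, which makes them a handy way to experiment. In the sketch below the author tuples are invented sample data, and `any(...)` plays the role of the existential quantifier.

```python
from collections import namedtuple

Author = namedtuple("Author", ["name", "article"])

# Hypothetical contents of the author relation.
authors = {
    Author("Meera", "database"),
    Author("Karan", "networks"),
    Author("Lila", "database"),
}

# First query: the names of tuples T in the relation with T.article = 'database'.
names = {t.name for t in authors if t.article == "database"}

# Second query: R.name is in the result if there exists a tuple T
# with T.article = 'database' and R.name = T.name.
candidates = {t.name for t in authors}
names_exists = {
    r for r in candidates
    if any(t.article == "database" and t.name == r for t in authors)
}

print(sorted(names))          # ['Lila', 'Meera']
print(names == names_exists)  # True
```

Neither comprehension says how to scan the relation; each only states the condition a result must satisfy, which is the non-procedural flavour of the calculus.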
2. Domain Relational Calculus (DRC)

The second form of relational calculus is known as domain relational calculus. In domain relational calculus,
the filtering variables range over the domains of attributes. Domain relational calculus uses the same operators as
tuple calculus. It uses logical connectives ∧ (and), ∨ (or) and ¬ (not). It uses Existential (∃) and
Universal Quantifiers (∀) to bind the variable. QBE, or Query by Example, is a query language
related to domain relational calculus.

Notation: { a1, a2, a3, ..., an | P (a1, a2, a3, ... ,an)}

Where
a1, a2, ..., an are attributes
P stands for a formula built from these attributes

Example

{<article, page, subject> | <article, page, subject> ∈ javatpoint ∧ subject = 'database'}

Output: This query yields the article, page, and subject from the relation javatpoint, where the
subject is 'database'.
A Well-Formed Formula (WFF) is an expression consisting of variables (capital letters), parentheses, and connective
symbols. An expression is basically a combination of operands and operators; here the operands are the statement variables and the operators are
the connective symbols.

Below are the possible Connective Symbols:

1.¬ (Negation)
2.∧ (Conjunction)
3.∨ (Disjunction)
4.⇒ (Rightwards Arrow)
5.⇔ (Left-Right Arrow)

Rules of the Well-Formed Formulas

1. A statement variable standing alone is a Well-Formed Formula (WFF).

For example – statement variables like P and Q are themselves Well-Formed Formulas.

2. If ‘P’ is a WFF, then ¬P is a WFF as well.

3. If P and Q are WFFs, then (P∨Q), (P∧Q), (P⇒Q), and (P⇔Q) are also WFFs.

Examples of Well-Formed Formulas:

WFF                  Explanation
¬¬P                  By Rule 1, P is a WFF; by Rule 2, ¬P is a WFF; applying Rule 2 again to ¬P gives ¬¬P.
((P⇒Q)⇒Q)            By Rule 3, joining '(P⇒Q)' and 'Q' with the connective symbol '⇒'.
(¬Q ∧ P)             By Rule 3, joining '¬Q' and 'P' with the connective symbol '∧'.
((¬P∨Q) ∧ ¬¬Q)       By Rule 3, joining '(¬P∨Q)' and '¬¬Q' with the connective symbol '∧'.
¬((¬P∨Q) ∧ ¬¬Q)      By Rule 3, joining '(¬P∨Q)' and '¬¬Q' with the connective symbol '∧', and then using Rule 2.
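
The three rules can be turned into a small recursive checker. The sketch below is illustrative only: it assumes statement variables are single capital letters and that every binary connective is wrapped in parentheses exactly as Rule 3 writes it. The function names `parse` and `is_wff` are hypothetical.

```python
BINARY = {"∧", "∨", "⇒", "⇔"}


def parse(s: str, i: int = 0) -> int:
    """Return the index just past one WFF starting at s[i], or -1 if none."""
    if i >= len(s):
        return -1
    ch = s[i]
    if ch.isalpha() and ch.isupper():  # Rule 1: a statement variable
        return i + 1
    if ch == "¬":  # Rule 2: negation of a WFF
        return parse(s, i + 1)
    if ch == "(":  # Rule 3: (WFF connective WFF)
        j = parse(s, i + 1)
        if j == -1 or j >= len(s) or s[j] not in BINARY:
            return -1
        k = parse(s, j + 1)
        if k == -1 or k >= len(s) or s[k] != ")":
            return -1
        return k + 1
    return -1


def is_wff(formula: str) -> bool:
    """True when the whole string, ignoring spaces, is exactly one WFF."""
    s = formula.replace(" ", "")
    return parse(s) == len(s)


for f in ["¬¬P", "((P⇒Q)⇒Q)", "(¬Q ∧ P)", "¬((¬P∨Q) ∧ ¬¬Q)", "(P∧)"]:
    print(f, is_wff(f))
```

Under these assumptions a string such as P∧Q without its outer parentheses is rejected, because Rule 3 only produces the parenthesized form.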
ER Model to Relational Model Mapping

The ER model, when conceptualized into diagrams, gives a good overview of entities and their
relationships that is easy to understand. ER diagrams can be mapped to a relational schema; that is, it
is possible to create a relational schema from an ER diagram. Not all ER constraints can be carried
over into the relational model, but an approximate schema can be generated.

There are several processes and algorithms available to convert ER diagrams into a relational schema.
Some of them are automated and some are manual. Here we focus on mapping the contents of a
diagram to relational basics.

ER diagrams mainly consist of −

 Entities and their attributes
 Relationships, which are associations among entities.

Mapping Entity

An entity is a real-world object with some attributes.

Mapping Process (Algorithm)

 Create a table for each entity.
 The entity's attributes become fields of the table, with their respective data types.
 Declare the primary key.
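
The steps above can be sketched with Python's built-in sqlite3 module. The STUDENT entity and its attributes are hypothetical, chosen only to illustrate the mapping.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# One table per entity; each attribute becomes a typed column,
# and the key attribute is declared as the primary key.
conn.execute(
    """
    CREATE TABLE student (
        student_id INTEGER PRIMARY KEY,
        name       TEXT NOT NULL,
        birth_date TEXT
    )
    """
)

# PRAGMA table_info rows are (cid, name, type, notnull, dflt_value, pk).
info = conn.execute("PRAGMA table_info(student)").fetchall()
columns = [row[1] for row in info]
pk_columns = [row[1] for row in info if row[5]]

print(columns, pk_columns)
```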

Mapping Relationship

A relationship is an association among entities.

Mapping Process

 Create a table for the relationship.
 Add the primary keys of all participating entities as fields of the table, with their respective data types.
 If the relationship has any attributes, add each attribute as a field of the table.
 Declare a primary key composed of the primary keys of all participating entities.
 Declare all foreign key constraints.
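
A minimal sqlite3 sketch of these steps, assuming a hypothetical ENROLLS relationship between STUDENT and COURSE with one relationship attribute, grade:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite checks foreign keys only when enabled

conn.executescript(
    """
    CREATE TABLE student (student_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE course  (course_id  INTEGER PRIMARY KEY, title TEXT);

    -- Relationship table: keys of both participants plus the relationship attribute.
    CREATE TABLE enrolls (
        student_id INTEGER NOT NULL,
        course_id  INTEGER NOT NULL,
        grade      TEXT,
        PRIMARY KEY (student_id, course_id),
        FOREIGN KEY (student_id) REFERENCES student(student_id),
        FOREIGN KEY (course_id)  REFERENCES course(course_id)
    );

    INSERT INTO student VALUES (1, 'Asha');
    INSERT INTO course  VALUES (10, 'DBMS');
    INSERT INTO enrolls VALUES (1, 10, 'A');
    """
)

# Student 2 does not exist, so the foreign key constraint should reject this row.
try:
    conn.execute("INSERT INTO enrolls VALUES (2, 10, 'B')")
    fk_enforced = False
except sqlite3.IntegrityError:
    fk_enforced = True

print(fk_enforced)
```

The composite primary key means the same student cannot be enrolled in the same course twice.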

Mapping Weak Entity Sets

A weak entity set is one that does not have a primary key of its own.

Mapping Process

 Create a table for the weak entity set.
 Add all its attributes to the table as fields.
 Add the primary key of the identifying entity set.
 Declare all foreign key constraints.
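
A sketch of the weak entity mapping with sqlite3, assuming a hypothetical DEPENDENT weak entity identified by EMPLOYEE. Two choices here go beyond the steps listed above and are assumptions: the composite primary key (owner key plus the dependent's name) and the ON DELETE CASCADE rule.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # needed for the cascade to take effect

conn.executescript(
    """
    CREATE TABLE employee (emp_id INTEGER PRIMARY KEY, emp_name TEXT);

    -- Weak entity: its own attributes plus the key of the identifying entity.
    CREATE TABLE dependent (
        emp_id       INTEGER NOT NULL,
        dep_name     TEXT NOT NULL,
        relationship TEXT,
        PRIMARY KEY (emp_id, dep_name),
        FOREIGN KEY (emp_id) REFERENCES employee(emp_id) ON DELETE CASCADE
    );

    INSERT INTO employee  VALUES (1, 'Asha');
    INSERT INTO dependent VALUES (1, 'Kiran', 'child');
    """
)

# Removing the identifying employee also removes its dependents.
conn.execute("DELETE FROM employee WHERE emp_id = 1")
remaining = conn.execute("SELECT COUNT(*) FROM dependent").fetchone()[0]

print(remaining)
```

The cascade reflects the idea that a weak entity cannot exist without its identifying entity.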
Mapping Hierarchical Entities

ER specialization or generalization comes in the form of hierarchical entity sets.

Mapping Process

 Create tables for all higher-level entities.
 Create tables for the lower-level entities.
 Add the primary keys of the higher-level entities to the tables of the lower-level entities.
 In the lower-level tables, add all other attributes of the lower-level entities.
 Declare the primary key of the higher-level table and the primary key of each lower-level table.
 Declare foreign key constraints.
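
A sketch of the hierarchy mapping with sqlite3, assuming a hypothetical PERSON higher-level entity and a STUDENT lower-level entity. The lower-level table reuses the higher-level key as both its primary key and a foreign key.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")

conn.executescript(
    """
    -- Higher-level entity.
    CREATE TABLE person (person_id INTEGER PRIMARY KEY, name TEXT NOT NULL);

    -- Lower-level entity: inherits the key and adds its own attributes.
    CREATE TABLE student (
        person_id INTEGER PRIMARY KEY,
        roll_no   TEXT,
        FOREIGN KEY (person_id) REFERENCES person(person_id)
    );

    INSERT INTO person  VALUES (1, 'Asha');
    INSERT INTO student VALUES (1, 'CS-101');
    """
)

# A join on the shared key recovers the full view of a student.
rows = conn.execute(
    """
    SELECT p.name, s.roll_no
    FROM student AS s
    JOIN person AS p ON p.person_id = s.person_id
    """
).fetchall()

print(rows)
```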
