DAA Course
Course Outcomes:
Argue the correctness of algorithms using inductive proofs and invariants.
Analyse worst-case running times of algorithms using asymptotic analysis.
Describe the divide-and-conquer paradigm and explain when an algorithmic design situation
calls for it. Recite algorithms that employ this paradigm. Synthesize divide-and-conquer
algorithms. Derive and solve recurrences describing the performance of divide-and-conquer
algorithms.
Explain the different ways to analyse randomized algorithms (expected running time,
probability of error). Recite algorithms that employ randomization. Explain the difference
between a randomized algorithm and an algorithm with probabilistic inputs.
Analyse randomized algorithms. Employ indicator random variables and linearity of
expectation to perform the analyses. Recite analyses of algorithms that employ this method of
analysis.
Explain what competitive analysis is and to which situations it applies. Perform competitive
analysis.
Compare different data structures. Pick an appropriate data structure for a design
situation.
Explain what an approximation algorithm is, and the benefit of using approximation
algorithms. Be familiar with some approximation algorithms.
Analyse the approximation factor of an algorithm.
Prerequisites:
Discrete Mathematics, Data Structures
Goals and Objectives:
The main goal of this course is to study the fundamental techniques for designing
efficient algorithms and analysing their running time, after a brief review of prerequisite
material (searching, sorting, asymptotic notation).
Required Knowledge:
1. Computer programming skills
2. Knowledge of probability
3. Understanding of basic data structures and algorithms
4. Basic knowledge in discrete mathematics
UNIT - I
Algorithm:
Introduction:
What is an Algorithm?
An algorithm is a sequence of unambiguous instructions for solving a problem, i.e., for
obtaining a required output for any legitimate input in a finite amount of time.
The unambiguity requirement for each step of an algorithm cannot be compromised.
The range of inputs for which an algorithm works has to be specified carefully. The same
algorithm can be represented in several different ways. There may exist several algorithms
for solving the same problem. The reference to “instructions” in the definition implies that
there is something or someone capable of understanding and following the instructions given.
We call this a “computer,” keeping in mind that before the electronic computer was invented,
the word “computer” meant a human being involved in performing numeric calculations.
Nowadays, of course, “computers” are those ubiquitous electronic devices that have become
indispensable in almost everything we do. Note, however, that although the majority of
algorithms are indeed intended for eventual computer implementation, the notion of
algorithm does not depend on such an assumption.
How to make an Algorithm:
A person well-trained in computer science knows how to deal with algorithms: how to
construct them, manipulate them, understand them, analyse them. This knowledge is
preparation for much more than writing good computer programs; it is a general-purpose
mental tool that will be a definite aid to the understanding of other subjects, whether
chemistry, linguistics, or music. The reason for this may be understood in the following
way: It has often been said that a person does not really understand something until after
teaching it to someone else. Actually, a person does not really understand something until
after teaching it to a computer, i.e., expressing it as an algorithm . . . An attempt to formalize
things as algorithms leads to a much deeper understanding than if we simply try to
comprehend things in the traditional way.
Algorithms for the same problem can be based on very different ideas and can solve the
problem with dramatically different speeds.
Example 1:
Making Tea:
In order to prepare tea, we need to follow these steps.
1. Boil the water
2. Add Tea powder in boiling water
3. Boil it for 5 minutes
4. Filter the tea extract
5. After that add sugar and milk
6. Ready to serve
This step-by-step procedure is an algorithm.
Algorithm to find the area of a rectangle:
Step 1: Start
Step 2: get l, b values
Step 3: Calculate A=l*b
Step 4: Display A
Step 5: Stop
Example 2:
The greatest common divisor of two nonnegative, not-both-zero integers m and n,
denoted gcd(m, n), is defined as the largest integer that divides both m and n evenly, i.e., with
a remainder of zero. Euclid of Alexandria (third century B.C.) outlined an algorithm for
solving this problem in one of the volumes of his Elements, most famous for its systematic
exposition of geometry. In modern terms, Euclid's algorithm is based on applying repeatedly
the equality
gcd(m, n) = gcd(n, m mod n),
where m mod n is the remainder of the division of m by n, until m mod n is equal to 0. Since
gcd(m, 0) = m (why?), the last value of m is also the greatest common divisor of the
initial m and n.
Step 1 If n = 0, return the value of m as the answer and stop; otherwise, proceed to Step 2.
Step 2 Divide m by n and assign the value of the remainder to r.
Step 3 Assign the value of n to m and the value of r to n. Go to Step 1.
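As an illustration (not part of the original outline), the same procedure translates into a few
lines of Python; the function name euclid_gcd is our own choice:

# A minimal Python sketch of Euclid's algorithm as described above.
def euclid_gcd(m, n):
    # Repeat until the second number becomes 0; then the first is the gcd.
    while n != 0:
        m, n = n, m % n   # gcd(m, n) = gcd(n, m mod n)
    return m

print(euclid_gcd(60, 24))  # prints 12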
Why algorithm?
Why do you need to study algorithms? If you are going to be a computer professional,
there are both practical and theoretical reasons to study algorithms. From a practical
standpoint, you have to know a standard set of important algorithms from different areas of
computing; in addition, you should be able to design new algorithms and analyse their
efficiency. From the theoretical standpoint, the study of algorithms, sometimes
called algorithmics, has come to be recognized as the cornerstone of computer science.
Important Problem Types:
Sorting
Searching
String processing
Graph problems
Sorting
The sorting problem is to rearrange the items of a given list in non-decreasing order.
Of course, for this problem to be meaningful, the nature of the list items must allow such an
ordering. As a practical matter, we usually need to sort lists of numbers, characters from an
alphabet, character strings, and, most important, records similar to those maintained by
schools about their students, libraries about their holdings, and companies about their
employees. In the case of records, we need to choose a piece of information to guide sorting.
For example, we can choose to sort student records in alphabetical order of names or by
student number or by student grade-point average. Such a specially chosen piece of
information is called a key.
Searching
The searching problem deals with finding a given value, called a search key, in a
given set. There are plenty of searching algorithms to choose from. They range from the
straightforward sequential search to a spectacularly efficient but limited binary search and
algorithms based on representing the underlying set in a different form more conducive to
searching. The latter algorithms are of particular importance for real-world applications
because they are indispensable for storing and retrieving information from large databases.
String Processing
In recent decades, the rapid proliferation of applications dealing with non-numerical
data has intensified the interest of researchers and computing practitioners in string-handling
algorithms. A string is a sequence of characters from an alphabet. Strings of particular
interest are text strings, which comprise letters, numbers, and special characters; bit strings,
which comprise zeros and ones; and gene sequences, which can be modelled by strings of
characters from the four-character alphabet {A, C, G, T}. It should be pointed out, however,
that string-processing algorithms have been important for computer science for a long time in
conjunction with computer languages and compiling issues.
One particular problem—that of searching for a given word in a text—has attracted special
attention from researchers. They call it string matching.
Graph Problems
One of the oldest and most interesting areas in algorithmics is graph algorithms.
Informally, a graph can be thought of as a collection of points called vertices, some of which
are connected by line segments called edges. Graphs are an interesting subject to study, for
both theoretical and practical reasons. Graphs can be used for modelling a wide variety of
applications, including transportation, communication, social and economic networks, project
scheduling, and games. Studying different technical and social aspects of the Internet in
particular is one of the active areas of current research involving computer scientists,
economists, and social scientists.
Asymptotic Notations and Basic Efficiency Classes
The efficiency analysis framework concentrates on the order of growth of an
algorithm’s basic operation count as the principal indicator of the algorithm’s efficiency. To
compare and rank such orders of growth, computer scientists use three notations: O (big oh),
Ω (big omega), and Θ (big theta). First, we introduce these notations informally, and then, after
several examples, formal definitions are given. In the following discussion, t(n) and g(n) can
be any non-negative functions defined on the set of natural numbers. In the context we are
interested in, t(n) will be an algorithm's running time.
O(g(n)) is the set of all functions with a lower or same order of growth as g(n) (to within a
constant multiple, as n goes to infinity). Thus, to give a few examples, the following
assertions are all true:
n ∈ O(n^2), 100n + 5 ∈ O(n^2), (1/2)n(n − 1) ∈ O(n^2).
Indeed, the first two functions are linear and hence have a lower order of growth
than g(n) = n^2, while the last one is quadratic and hence has the same order of growth as n^2.
O-notation
A function t(n) is said to be in O(g(n)), denoted t(n) ∈ O(g(n)), if t(n) is bounded above by
some constant multiple of g(n) for all large n, i.e., if there exist some positive constant c and
some nonnegative integer n0 such that
t(n) ≤ c·g(n) for all n ≥ n0.
For example, 100n + 5 ≤ 101n ≤ 101n^2 for all n ≥ 5, so 100n + 5 ∈ O(n^2) with c = 101 and n0 = 5.
Ω-notation
A function t(n) is said to be in Ω(g(n)), denoted t(n) ∈ Ω(g(n)), if t(n) is bounded below by
some positive constant multiple of g(n) for all large n, i.e., if there exist some positive
constant c and some nonnegative integer n0 such that
t(n) ≥ c·g(n) for all n ≥ n0.
Θ-notation:
A function t(n) is said to be in Θ(g(n)), denoted t(n) ∈ Θ(g(n)), if t(n) is bounded both above
and below by some positive constant multiples of g(n) for all large n, i.e., if there exist some
positive constants c1 and c2 and some nonnegative integer n0 such that
c2·g(n) ≤ t(n) ≤ c1·g(n) for all n ≥ n0.
We now extend the general framework for the analysis of algorithms to recursive algorithms.
We start with an example often used to introduce novices to the idea of a recursive algorithm.
Example 3:
Recursion is the process of defining a problem (or the solution to a problem) in terms of (a
simpler version of) itself.
For example, we can define the operation "find your way home" as:
1. If you are at home, stop moving.
2. Take one step toward home.
3. "find your way home".
Here the solution to finding your way home is expressed in three steps. First, we stop if
we are already home. Second, we take a very simple action (one step toward home) that makes
our situation simpler to solve. Finally, we redo the entire algorithm.
Basic steps of recursive programs
Every recursive program follows the same basic sequence of steps:
1. Initialize the algorithm. Recursive programs often need a seed value to start with. This
is accomplished either by using a parameter passed to the function or by providing a
gateway function that is non-recursive but that sets up the seed values for the
recursive calculation.
2. Check to see whether the current value(s) being processed match the base case. If so,
process and return the value.
3. Redefine the answer in terms of a smaller or simpler sub-problem or sub-problems.
4. Run the algorithm on the sub-problem.
5. Combine the results in the formulation of the answer.
6. Return the results.
Properties
A recursive function can run forever, like an infinite loop. To avoid this, there are two
properties that a recursive function must have −
Base criterion − There must be at least one base criterion or condition such that, when
this condition is met, the function stops calling itself recursively.
Progressive approach − The recursive calls should progress in such a way that each
time a recursive call is made, it comes closer to the base criterion.
EXAMPLE Compute the factorial function F(n) = n! for an arbitrary nonnegative
integer n. Since
n! = 1 · 2 · … · (n − 1) · n = (n − 1)! · n for n ≥ 1
and 0! = 1 by definition, we can compute F(n) = F(n − 1) · n with the following recursive algorithm.
ALGORITHM F(n)
//Computes n! Recursively
//Input: A nonnegative integer n
//Output: The value of n!
if n = 0 return 1
else return F (n − 1) ∗ n
For simplicity, we consider n itself as an indicator of this algorithm's input size (rather than
the number of bits in its binary expansion). The basic operation of the algorithm is
multiplication, whose number of executions we denote M(n). Since the function F(n) is
computed according to the formula
F(n) = F(n − 1) · n for n > 0,
the number of multiplications needed to compute it satisfies the recurrence
M(n) = M(n − 1) + 1 for n > 0, with M(0) = 0.
Indeed, M(n − 1) multiplications are spent to compute F(n − 1), and one more multiplication
is needed to multiply the result by n.
The last equation defines the sequence M(n) that we need to find. This equation
defines M(n) not explicitly, i.e., as a function of n, but implicitly as a function of its value at
another point, namely n − 1. Such equations are called recurrence relations
Our goal now is to solve the recurrence relation M(n) = M(n − 1) + 1, i.e., to find an explicit
formula for M(n) in terms of n only. By backward substitution,
M(n) = M(n − 1) + 1 = M(n − 2) + 2 = … = M(0) + n = n.
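A small Python sketch (our own illustration) that computes F(n) recursively and counts
multiplications, confirming the closed form M(n) = n:

# Recursive factorial that also reports the number of multiplications M(n).
def factorial(n):
    if n == 0:
        return 1, 0                # 0! = 1, no multiplications: M(0) = 0
    f, m = factorial(n - 1)        # M(n - 1) multiplications so far
    return f * n, m + 1            # one more multiplication: M(n) = M(n - 1) + 1

for n in range(6):
    f, m = factorial(n)
    print(n, f, m)                 # M(n) equals n, as the recurrence predicts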
Applications of Recursive Algorithm:
Recursion has many, many applications. In this module, we'll see how to use recursion to
compute the factorial function, to determine whether a word is a palindrome, to compute
powers of a number, to draw a type of fractal, and to solve the ancient Towers of Hanoi
problem. Later modules will use recursion to solve other problems, including sorting.
Consider the problem of finding the value of the largest element in a list of n numbers. For
simplicity, we assume that the list is implemented as an array. The following is pseudocode
of a standard algorithm for solving the problem.
ALGORITHM MaxElement(A[0..n − 1])
//Determines the value of the largest element in a given array
//Input: An array A[0..n − 1] of real numbers
//Output: The value of the largest element in A
maxval ← A[0]
for i ← 1 to n − 1 do
if A[i] > maxval
maxval ← A[i]
return maxval
The obvious measure of an input's size here is the number of elements in the array,
i.e., n. The operations that are going to be executed most often are in the algorithm's for loop.
Brute Force:
For example, imagine you have a small padlock with 4 digits, each from 0-9. You
forgot your combination, but you don't want to buy another padlock. Since you can't
remember any of the digits, you have to use a brute force method to open the lock.
So you set all the numbers back to 0 and try them one by one: 0001, 0002, 0003, and
so on until it opens. In the worst case scenario, it would take 10^4, or 10,000 tries to
find your combination.
A classic example in computer science is the traveling salesman problem (TSP).
Suppose a salesman needs to visit 10 cities across the country. How does one
determine the order in which those cities should be visited such that the total distance
traveled is minimized?
The brute force solution is simply to calculate the total distance for every possible
route and then select the shortest one. This is not particularly efficient because it is
possible to eliminate many possible routes through clever algorithms.
The time complexity of brute force string matching is O(mn), which is sometimes written
as O(n*m). So, if we were to search for a string of "n" characters in a string of "m"
characters using brute force, it would take on the order of n * m tries.
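A minimal Python sketch of this brute-force matching idea (our own illustration; the names
are arbitrary): slide the pattern over the text and compare character by character.

# Brute-force string matching: return the first index of pattern in text, or -1.
def brute_force_match(text, pattern):
    n, m = len(pattern), len(text)
    for start in range(m - n + 1):            # each possible alignment of the pattern
        if text[start:start + n] == pattern:
            return start                      # match found at this shift
    return -1                                 # no alignment matched

print(brute_force_match("abracadabra", "cad"))  # prints 4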
UNIT - II
Greedy Algorithms
Structure of a Greedy Algorithm
Greedy algorithms take all of the data in a particular problem, and then set a rule for
which elements to add to the solution at each step of the algorithm. For instance, if the
set of data is all of the numbers in a layered graph and the rule is to select the
largest number available at each level of the graph, the solution that the algorithm
builds is the sum of all of those choices.
If both of the properties below are true, a greedy algorithm can be used to solve the
problem.
Greedy choice property: A global (overall) optimal solution can be reached by
choosing the optimal choice at each step.
Optimal substructure: A problem has an optimal substructure if an optimal solution
to the entire problem contains the optimal solutions to the sub-problems.
In other words, greedy algorithms work on problems for which it is true that, at every
step, there is a choice that is optimal for the problem up to that step, and after the last
step, the algorithm produces the optimal solution of the complete problem.
To make a greedy algorithm, identify an optimal substructure or subproblem in the
problem. Then, determine what the solution will include (for example, the largest
sum, the shortest path, etc.). Create some sort of iterative way to go through all of the
subproblems and build a solution.
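As a concrete illustration (our own example, not from the text above), greedy coin change
with the denominations 25, 10, 5, 1 makes the locally optimal choice at each step, namely
the largest coin that still fits, and for this particular coin system that choice is also
globally optimal:

# Greedy coin change: at each step take the largest coin that still fits.
def greedy_change(amount, coins=(25, 10, 5, 1)):
    result = []
    for coin in coins:                  # coins assumed sorted in decreasing order
        while amount >= coin:           # greedy choice: take this coin again
            result.append(coin)
            amount -= coin
    return result

print(greedy_change(67))  # [25, 25, 10, 5, 1, 1]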
Dynamic Programming
Dynamic Programming (DP) is an algorithmic technique for solving an optimization
problem by breaking it down into simpler subproblems and utilizing the fact that the
optimal solution to the overall problem depends upon the optimal solution to its
subproblems.
Let’s take the example of the Fibonacci numbers. As we all know, Fibonacci
numbers are a series of numbers in which each number is the sum of the two
preceding numbers. The first few Fibonacci numbers are 0, 1, 1, 2, 3, 5, and 8, and
they continue on from there.
If we are asked to calculate the nth Fibonacci number, we can do that with the
following equation,
Fib(n) = Fib(n-1) + Fib(n-2), for n > 1
As we can clearly see here, to solve the overall problem (i.e. Fib(n)), we broke it
down into two smaller subproblems (which are Fib(n-1) and Fib(n-2)). This shows
that we can use DP to solve this problem.
Characteristics of Dynamic Programming
Before moving on to understand different methods of solving a DP problem, let’s first
take a look at what are the characteristics of a problem that tells us that we can apply
DP to solve it.
1. Overlapping Subproblems
Subproblems are smaller versions of the original problem. Any problem has
overlapping sub-problems if finding its solution involves solving the same
subproblem multiple times. Take the example of the Fibonacci numbers; to find
the fib(4), we need to break it down into the following sub-problems:
[Recursion tree for calculating Fibonacci numbers: fib(4) calls fib(3) and fib(2); fib(3)
calls fib(2) and fib(1); each fib(2) calls fib(1) and fib(0).]
We can clearly see the overlapping subproblem pattern here, as fib(2) has been
evaluated twice and fib(1) has been evaluated three times.
2. Optimal Substructure Property
Any problem has optimal substructure property if its overall optimal solution can be
constructed from the optimal solutions of its subproblems. For Fibonacci numbers, as
we know,
Fib(n) = Fib(n-1) + Fib(n-2)
This clearly shows that a problem of size ‘n’ has been reduced to subproblems of size
‘n-1’ and ‘n-2’. Therefore, Fibonacci numbers have optimal substructure property.
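A minimal Python sketch of the DP idea for Fibonacci: memoize each subproblem so that
fib(2), fib(1), etc. are evaluated only once (the lru_cache decorator is one standard way
to do this):

from functools import lru_cache

@lru_cache(maxsize=None)            # cache every subproblem result
def fib(n):
    if n <= 1:
        return n                    # base cases: Fib(0) = 0, Fib(1) = 1
    return fib(n - 1) + fib(n - 2)  # Fib(n) = Fib(n-1) + Fib(n-2)

print([fib(i) for i in range(8)])   # [0, 1, 1, 2, 3, 5, 8, 13]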
Backtracking
Backtracking is an algorithmic technique for solving problems recursively by trying
to build a solution incrementally, one piece at a time, removing those solutions that
fail to satisfy the constraints of the problem at any point in time (here, "time" refers
to the time elapsed until any level of the search tree is reached).
According to the Wikipedia definition,
Backtracking can be defined as a general algorithmic technique that considers
searching every possible combination in order to solve a computational problem.
There are three types of problems in backtracking –
1. Decision Problem – In this, we search for a feasible solution.
2. Optimization Problem – In this, we search for the best solution.
3. Enumeration Problem – In this, we find all feasible solutions.
Consider a situation that you have three boxes in front of you and only one of them has a
gold coin in it but you do not know which one. So, in order to get the coin, you will have to
open all of the boxes one by one. You will first check the first box, if it does not contain the
coin, you will have to close it and check the second box and so on until you find the coin.
This is what backtracking is, that is solving all sub-problems one by one in order to reach
the best possible solution.
Consider the below example to understand the Backtracking approach more formally.
Given an instance of a computational problem and data D corresponding to the instance, let
C represent all the constraints that need to be satisfied for solving the problem.
A backtracking algorithm will then work as follows:
The algorithm begins to build up a solution, starting with an empty solution set S = {}.
1. Add to S the first move that is still left (all possible moves are added to S one by one).
This now creates a new sub-tree s in the search tree of the algorithm.
2. Check if S + s satisfies each of the constraints in C.
If yes, then the sub-tree s is "eligible" to add more "children".
Else, the entire sub-tree s is useless, so recur back to step 1 using argument S.
3. In the event of "eligibility" of the newly formed sub-tree s, recur back to step 1, using
argument S + s.
4. If the check for S + s returns that it is a solution for the entire data D, output it and
terminate the program.
If not, then return that no solution is possible with the current s and hence discard it.
Difference between Recursion and Backtracking:
In recursion, the function calls itself until it reaches a base case. In backtracking, we use
recursion to explore all the possibilities until we get the best result for the problem.
Pseudo Code for Backtracking (N-Queens):
Consider the 4-Queens problem: place 4 queens on a 4×4 chessboard so that no two queens
attack each other. The expected output is a binary matrix which has 1s for the blocks where
queens are placed. For example, the following is the output matrix for one 4-Queens solution.
{ 0, 1, 0, 0}
{ 0, 0, 0, 1}
{ 1, 0, 0, 0}
{ 0, 0, 1, 0}
Backtracking Algorithm: The idea is to place queens one by one in different columns,
starting from the leftmost column. When we place a queen in a column, we check for
clashes with already placed queens. In the current column, if we find a row for which there
is no clash, we mark this row and column as part of the solution. If we do not find such a
row due to clashes then we backtrack and return false.
1) Start in the leftmost column
2) If all queens are placed
return true
3) Try all rows in the current column. Do the following for every tried row.
a) If the queen can be placed safely in this row then mark this [row,
column] as part of the solution and recursively check if placing
the queen here leads to a solution.
b) If placing the queen in [row, column] leads to a solution then return
true.
c) If placing the queen doesn't lead to a solution then unmark this [row,
column] (backtrack) and go to step (a) to try other rows.
4) If all rows have been tried and nothing worked, return false to trigger
backtracking.
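A compact Python sketch of this backtracking scheme for the 4-Queens instance above (our
own illustration; it places one queen per column and records the chosen row for each):

# Backtracking N-Queens: place one queen per column, trying rows in order.
def solve_queens(n, cols=()):
    col = len(cols)                     # next column to fill
    if col == n:
        return cols                     # all queens placed: a solution
    for row in range(n):
        # Safe if no earlier queen shares this row or a diagonal.
        if all(row != r and abs(row - r) != col - c
               for c, r in enumerate(cols)):
            solution = solve_queens(n, cols + (row,))
            if solution is not None:
                return solution         # propagate the first solution found
            # otherwise: implicit backtrack, try the next row
    return None                         # no row worked: trigger backtracking

print(solve_queens(4))                  # (1, 3, 0, 2), matching the matrix above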
Knapsack Problem
Given a set of items, each with a weight and a value, determine a subset of items to include
in a collection so that the total weight is less than or equal to a given limit and the total value
is as large as possible.
The knapsack problem is a combinatorial optimization problem. It appears as a subproblem
in many, more complex mathematical models of real-world problems. One general approach
to difficult problems is to identify the most restrictive constraint, ignore the others, solve a
knapsack problem, and somehow adjust the solution to satisfy the ignored constraints.
Applications
In many cases of resource allocation along with some constraint, the problem can be derived
in a similar way to the Knapsack problem. Following is a set of examples.
Finding the least wasteful way to cut raw materials
Portfolio optimization
Cutting stock problems
Problem Scenario
A thief is robbing a store and can carry a maximal weight of W into his knapsack. There are
n items available in the store and weight of ith item is wi and its profit is pi. What items
should the thief take?
In this context, the items should be selected in such a way that the thief will carry those
items for which he will gain maximum profit. Hence, the objective of the thief is to
maximize the profit.
Based on the nature of the items, Knapsack problems are categorized as
Fractional Knapsack
0-1 Knapsack
In 0-1 Knapsack, items cannot be broken, which means the thief should take an item as a
whole or leave it. This is the reason behind calling it 0-1 Knapsack.
Hence, in case of 0-1 Knapsack, the value of x_i can be either 0 or 1, where the other
constraints remain the same.
The 0-1 Knapsack problem cannot be solved reliably by the Greedy approach: in some
instances the Greedy approach may happen to give an optimal solution, but it does not
ensure one. The following examples will establish our statement.
Example-1
Let us consider that the capacity of the knapsack is W = 25 and the items are as shown in the
following table.
Item A B C D
Profit 24 18 18 10
Weight 24 10 10 7
Without considering the profit per unit weight (pi/wi), if we apply Greedy approach to solve
this problem, first item A will be selected as it will contribute maximum profit among all the
elements.
After selecting item A, no more item will be selected. Hence, for this given set of items total
profit is 24. Whereas, the optimal solution can be achieved by selecting items, B and C,
where the total profit is 18 + 18 = 36.
Example-2
Instead of selecting the items based on the overall benefit, in this example the items are
selected based on ratio pi/wi. Let us consider that the capacity of the knapsack is W = 60 and
the items are as shown in the following table.
Item A B C
Profit 100 280 120
Weight 10 40 20
Ratio 10 7 6
Using the Greedy approach, first item A is selected. Then, the next item B is chosen. Hence,
the total profit is 100 + 280 = 380. However, the optimal solution of this instance can be
achieved by selecting items, B and C, where the total profit is 280 + 120 = 400.
Hence, it can be concluded that Greedy approach may not give an optimal solution.
To solve 0-1 Knapsack, Dynamic Programming approach is required.
Dynamic-Programming Approach
Let i be the highest-numbered item in an optimal solution S for knapsack capacity W. Then
S' = S - {i} is an optimal solution for capacity W - w_i, and the value of the solution S is
v_i plus the value of the sub-problem.
We can express this fact in the following formula: define c[i, w] to be the solution for
items 1, 2, …, i and the maximum weight w. Then
c[i, w] = 0 if i = 0 or w = 0
c[i, w] = c[i-1, w] if w_i > w
c[i, w] = max(v_i + c[i-1, w-w_i], c[i-1, w]) if i > 0 and w_i ≤ w
The algorithm takes the following inputs
The maximum weight W
The number of items n
The two sequences v = <v1, v2, …, vn> and w = <w1, w2, …, wn>
Dynamic-0-1-knapsack (v, w, n, W)
for w = 0 to W do
    c[0, w] = 0
for i = 1 to n do
    c[i, 0] = 0
    for w = 1 to W do
        if w[i] ≤ w then
            if v[i] + c[i-1, w-w[i]] > c[i-1, w] then
                c[i, w] = v[i] + c[i-1, w-w[i]]
            else c[i, w] = c[i-1, w]
        else
            c[i, w] = c[i-1, w]
The set of items to take can be deduced from the table, starting at c[n, w] and tracing
backwards where the optimal values came from.
If c[i, w] = c[i-1, w], then item i is not part of the solution, and we continue tracing with
c[i-1, w]. Otherwise, item i is part of the solution, and we continue tracing with c[i-1, w-w[i]].
Analysis
This algorithm takes θ(nW) time, as table c has (n + 1)·(W + 1) entries, where each entry
requires θ(1) time to compute.
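A runnable Python version of the table-filling procedure above, shown on the data of
Example-1 (W = 25); it recovers the optimal profit 36 that the greedy approach missed:

# Bottom-up 0-1 knapsack: c[i][w] = best profit using items 1..i with capacity w.
def knapsack(values, weights, W):
    n = len(values)
    c = [[0] * (W + 1) for _ in range(n + 1)]   # row 0 and column 0 stay 0
    for i in range(1, n + 1):
        for w in range(1, W + 1):
            best_without = c[i - 1][w]          # skip item i
            best_with = 0
            if weights[i - 1] <= w:             # item i fits: try taking it
                best_with = values[i - 1] + c[i - 1][w - weights[i - 1]]
            c[i][w] = max(best_without, best_with)
    return c[n][W]

# Example-1 data: profits and weights of items A, B, C, D; capacity W = 25.
print(knapsack([24, 18, 18, 10], [24, 10, 10, 7], 25))   # prints 36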
Fractional Knapsack
In this case, items can be broken into smaller pieces, hence the thief can select fractions of
items.
According to the problem statement,
There are n items in the store
Weight of the ith item is w_i > 0
Profit for the ith item is p_i > 0 and
Capacity of the Knapsack is W
In this version of the Knapsack problem, items can be broken into smaller pieces. So, the thief
may take only a fraction x_i of the ith item, where
0 ≤ x_i ≤ 1
The ith item contributes the weight x_i·w_i to the total weight in the knapsack and
profit x_i·p_i to the total profit.
Hence, the objective of this algorithm is to
maximize Σ (x_i·p_i), summed over i = 1 to n,
subject to the constraint
Σ (x_i·w_i) ≤ W
It is clear that an optimal solution must fill the knapsack exactly, for otherwise we could add a
fraction of one of the remaining items and increase the overall profit.
Thus, an optimal solution can be obtained by
Σ (x_i·w_i) = W
In this context, first we need to sort the items according to the value of p_i/w_i, so
that p_(i+1)/w_(i+1) ≤ p_i/w_i. Here, x is an array to store the fraction of items.
Algorithm: Greedy-Fractional-Knapsack (w[1..n], p[1..n], W)
// items assumed already sorted by decreasing p[i]/w[i]
for i = 1 to n
    do x[i] = 0
weight = 0
for i = 1 to n
    if weight + w[i] ≤ W then
        x[i] = 1
        weight = weight + w[i]
    else
        x[i] = (W - weight) / w[i]
        weight = W
        break
return x
Analysis
If the provided items are already sorted into a decreasing order of piwipiwi, then the
whileloop takes a time in O(n); Therefore, the total time including the sort is in O(n logn).
Example
Let us consider that the capacity of the knapsack W = 60 and the list of provided items are
shown in the following table −
Item A B C D
Weight 40 10 20 24
Ratio (p_i/w_i) 7 10 6 5
As the provided items are not sorted based on p_i/w_i, we sort them first. After sorting, the
items are as shown in the following table.
Item B A C D
Weight 10 40 20 24
Ratio (p_i/w_i) 10 7 6 5
Solution
After sorting all the items according to p_i/w_i, first all of B is chosen, as the weight of B is
less than the capacity of the knapsack. Next, item A is chosen, as the available capacity of
the knapsack is greater than the weight of A. Now, C is chosen as the next item. However,
the whole item cannot be chosen as the remaining capacity of the knapsack is less than the
weight of C.
Hence, fraction of C (i.e. (60 − 50)/20) is chosen.
Now, the capacity of the knapsack is exactly consumed by the selected items. Hence, no more
items can be selected.
The total weight of the selected items is 10 + 40 + 20·(10/20) = 60.
And the total profit is 100 + 280 + 120·(10/20) = 380 + 60 = 440.
This is the optimal solution. We cannot gain more profit selecting any different combination
of items.
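A Python sketch of Greedy-Fractional-Knapsack run on this example (profits reconstructed
from the ratio table as ratio × weight: A = 280, B = 100, C = 120, D = 120); it reproduces
the total profit of 440:

# Greedy fractional knapsack: sort by profit/weight, then fill greedily.
def fractional_knapsack(profits, weights, W):
    items = sorted(zip(profits, weights),
                   key=lambda pw: pw[0] / pw[1], reverse=True)
    total, capacity = 0.0, W
    for p, w in items:
        if w <= capacity:              # whole item fits
            total += p
            capacity -= w
        else:                          # take only the fraction that fits, then stop
            total += p * capacity / w
            break
    return total

# Items A, B, C, D with profits 280, 100, 120, 120 and weights 40, 10, 20, 24.
print(fractional_knapsack([280, 100, 120, 120], [40, 10, 20, 24], 60))  # 440.0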
Travelling Salesman Problem (Dynamic Programming):
Given a set of cities and the distance d(i, j) between every pair, the problem is to find the
shortest tour that starts at city 1, visits every city exactly once, and returns to city 1. For a
subset of cities S containing 1, let C(S, j) denote the length of the shortest path that visits
every city in S exactly once, starting at 1 and ending at j. Then
C(S, j) = min {C(S − {j}, i) + d(i, j)}, where i ∈ S and i ≠ j.
Algorithm: Traveling-Salesman-Problem
C({1}, 1) = 0
for s = 2 to n do
    for all subsets S ⊆ {1, 2, 3, …, n} of size s and containing 1
        C(S, 1) = ∞
        for all j ∈ S and j ≠ 1
            C(S, j) = min {C(S − {j}, i) + d(i, j) for i ∈ S and i ≠ j}
return min over j of C({1, 2, 3, …, n}, j) + d(j, 1)
Analysis
There are at most 2^n · n subproblems, and each one takes linear time to solve.
Therefore, the total running time is O(2^n · n^2).
Example
In the following example, we will illustrate the steps to solve the travelling salesman
problem.
From the given graph, the following distance table d(i, j) is prepared.
     1   2   3   4
1    0  10  15  20
2    5   0   9  10
3    6  13   0  12
4    8   8   9   0
S = Φ
Cost(2, Φ, 1) = d(2, 1) = 5
Cost(3, Φ, 1) = d(3, 1) = 6
Cost(4, Φ, 1) = d(4, 1) = 8
|S| = 1
Cost(i, S, 1) = min {Cost(j, S − {j}, 1) + d[i, j]}
Cost(2, {3}, 1) = d[2, 3] + Cost(3, Φ, 1) = 9 + 6 = 15
Cost(2, {4}, 1) = d[2, 4] + Cost(4, Φ, 1) = 10 + 8 = 18
Cost(3, {2}, 1) = d[3, 2] + Cost(2, Φ, 1) = 13 + 5 = 18
Cost(3, {4}, 1) = d[3, 4] + Cost(4, Φ, 1) = 12 + 8 = 20
Cost(4, {3}, 1) = d[4, 3] + Cost(3, Φ, 1) = 9 + 6 = 15
Cost(4, {2}, 1) = d[4, 2] + Cost(2, Φ, 1) = 8 + 5 = 13
|S| = 2
Cost(2, {3, 4}, 1) = min { d[2, 3] + Cost(3, {4}, 1) = 9 + 20 = 29,
                           d[2, 4] + Cost(4, {3}, 1) = 10 + 15 = 25 } = 25
Cost(3, {2, 4}, 1) = min { d[3, 2] + Cost(2, {4}, 1) = 13 + 18 = 31,
                           d[3, 4] + Cost(4, {2}, 1) = 12 + 13 = 25 } = 25
Cost(4, {2, 3}, 1) = min { d[4, 2] + Cost(2, {3}, 1) = 8 + 15 = 23,
                           d[4, 3] + Cost(3, {2}, 1) = 9 + 18 = 27 } = 23
|S| = 3
Cost(1, {2, 3, 4}, 1) = min { d[1, 2] + Cost(2, {3, 4}, 1) = 10 + 25 = 35,
                              d[1, 3] + Cost(3, {2, 4}, 1) = 15 + 25 = 40,
                              d[1, 4] + Cost(4, {2, 3}, 1) = 20 + 23 = 43 } = 35
Start from Cost(1, {2, 3, 4}, 1): the minimum value comes from d[1, 2]. When s = 3, select
the path from 1 to 2 (cost is 10), then go backwards. When s = 2, the minimum value comes
from d[2, 4]. Select the path from 2 to 4 (cost is 10), then go backwards.
When s = 1, the minimum value comes from d[4, 3]. Selecting the path from 4 to 3 (cost is 9),
we then go to the s = Φ step, where the value is d[3, 1] (cost is 6). The minimum-cost tour
is therefore 1 → 2 → 4 → 3 → 1 with total cost 10 + 10 + 9 + 6 = 35.
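A compact Python sketch of this dynamic-programming (Held-Karp) computation on the 4-city
distance table above; it reports the minimum tour cost 35 (cities renumbered 0-3, so city 1
becomes 0):

from itertools import combinations

# Held-Karp: C[(S, j)] = cheapest path from city 0 through all cities in S, ending at j.
def tsp(d):
    n = len(d)
    C = {(frozenset([j]), j): d[0][j] for j in range(1, n)}   # direct paths 0 -> j
    for size in range(2, n):
        for S in combinations(range(1, n), size):
            fs = frozenset(S)
            for j in S:
                C[(fs, j)] = min(C[(fs - {j}, i)] + d[i][j]
                                 for i in S if i != j)
    full = frozenset(range(1, n))
    return min(C[(full, j)] + d[j][0] for j in range(1, n))   # close the tour

d = [[0, 10, 15, 20],
     [5,  0,  9, 10],
     [6, 13,  0, 12],
     [8,  8,  9,  0]]
print(tsp(d))   # prints 35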
UNIT – III
Graph traversals
Graph traversal means visiting every vertex and edge exactly once in a well-defined order.
While using certain graph algorithms, you must ensure that each vertex of the graph is visited
exactly once. The order in which the vertices are visited is important and may depend upon
the algorithm or question that you are solving.
During a traversal, it is important that you track which vertices have been visited. The most
common way of tracking vertices is to mark them.
Breadth First Search (BFS)
There are many ways to traverse graphs. BFS is the most commonly used approach.
BFS is a traversing algorithm where you should start traversing from a selected node (source
or starting node) and traverse the graph layerwise thus exploring the neighbour nodes (nodes
which are directly connected to source node). You must then move towards the next-level
neighbour nodes.
As the name BFS suggests, you are required to traverse the graph breadthwise as follows:
1. First move horizontally and visit all the nodes of the current layer
2. Move to the next layer
Pseudo Code
BFS(G, s):
let Q be a queue
Q.enqueue(s)
mark s as visited
while (Q is not empty)
    //Removing that vertex from the queue, whose neighbours will be visited now
    v = Q.dequeue()
    //Processing all the neighbours of v
    for all neighbours w of v in Graph G
        if w is not visited
            Q.enqueue(w)
            mark w as visited
Algorithm
The traversing will start from the source node and push s in the queue. s will be marked as
'visited'.
First iteration
● s will be popped from the queue
● Neighbors of s i.e. 1 and 2 will be traversed
● 1 and 2, which have not been traversed earlier, are traversed. They will be:
○ Pushed in the queue
○ 1 and 2 will be marked as visited
Second iteration
● 1 is popped from the queue
● Neighbors of 1 i.e. s and 3 are traversed
● s is ignored because it is marked as 'visited'
● 3, which has not been traversed earlier, is traversed. It is:
○ Pushed in the queue
○ Marked as visited
Third iteration
● 2 is popped from the queue
● Neighbors of 2 i.e. s, 3, and 4 are traversed
● 3 and s are ignored because they are marked as 'visited'
● 4, which has not been traversed earlier, is traversed. It is:
○ Pushed in the queue
○ Marked as visited
Fourth iteration
● 3 is popped from the queue
● Neighbors of 3 i.e. 1, 2, and 5 are traversed
● 1 and 2 are ignored because they are marked as 'visited'
● 5, which has not been traversed earlier, is traversed. It is:
○ Pushed in the queue
○ Marked as visited
Fifth iteration
● 4 will be popped from the queue
● The neighbour of 4, i.e. 2, is traversed
● 2 is ignored because it is already marked as 'visited'
Sixth iteration
● 5 is popped from the queue
● The neighbour of 5, i.e. 3, is traversed
● 3 is ignored because it is already marked as 'visited'
The queue is empty and it comes out of the loop. All the nodes have been traversed by using
BFS.
If all the edges in a graph are of the same weight, then BFS can also be used to find the
minimum distance between the nodes in a graph.
[Figure: BFS traversing process]
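The same traversal in runnable Python (our own sketch; the adjacency lists below
reconstruct the worked example with source s and nodes 1-5):

from collections import deque

# BFS from source s; returns vertices in the order they are visited.
def bfs(graph, s):
    visited, order = {s}, []
    Q = deque([s])
    while Q:
        v = Q.popleft()                 # dequeue the next frontier vertex
        order.append(v)
        for w in graph[v]:              # explore v's neighbours layer by layer
            if w not in visited:
                visited.add(w)          # mark when enqueued, not when dequeued
                Q.append(w)
    return order

graph = {'s': [1, 2], 1: ['s', 3], 2: ['s', 3, 4], 3: [1, 2, 5], 4: [2], 5: [3]}
print(bfs(graph, 's'))   # ['s', 1, 2, 3, 4, 5]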
Depth First Search (DFS)
The DFS algorithm is a recursive algorithm that uses the idea of backtracking. It involves
exhaustive searches of all the nodes by going ahead, if possible, else by backtracking.
Here, the word backtrack means that when you are moving forward and there are no more
nodes along the current path, you move backwards on the same path to find nodes to traverse.
All the nodes on the current path will be visited until all the unvisited nodes have been
traversed, after which the next path will be selected.
This recursive nature of DFS can be implemented using stacks. The basic idea is as follows:
Pick a starting node and push all its adjacent nodes into a stack.
Pop a node from stack to select the next node to visit and push all its adjacent nodes into a
stack.
Repeat this process until the stack is empty. However, ensure that the nodes that are visited
are marked. This will prevent you from visiting the same node more than once. If you do not
mark the nodes that are visited and you visit the same node more than once, you may end up
in an infinite loop.
Pseudocode
DFS-recursive(G, s):
    mark s as visited
    for all neighbours w of s in Graph G:
        if w is not visited:
            DFS-recursive(G, w)
A graph is said to be disconnected if it is not connected, i.e. if two nodes exist in the graph
such that there is no edge in between those nodes. In an undirected graph, a connected
component is a set of vertices in a graph that are linked to each other by paths.
Consider the following example. Graph G is a disconnected graph and has the
following 3 connected components.
● First connected component is 1 -> 2 -> 3 as they are linked to each other
● Second connected component 4 -> 5
● Third connected component is vertex 6
In DFS, if we start from a start node it will mark all the nodes connected to the start node as
visited. Therefore, if we choose any node in a connected component and run DFS on that
node it will mark the whole connected component as visited.
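A short Python sketch (our own illustration) that lists the connected components by running
DFS from every still-unvisited vertex, using the 6-vertex example above:

# Connected components: run DFS from every vertex not yet visited.
def connected_components(graph):
    visited, components = set(), []
    def dfs(v, comp):
        visited.add(v)
        comp.append(v)
        for w in graph[v]:
            if w not in visited:
                dfs(w, comp)
    for v in graph:
        if v not in visited:            # a new component starts here
            comp = []
            dfs(v, comp)
            components.append(comp)
    return components

g = {1: [2], 2: [1, 3], 3: [2], 4: [5], 5: [4], 6: []}
print(connected_components(g))   # [[1, 2, 3], [4, 5], [6]]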
SHORTEST PATH:
Shortest path problem is a problem of finding the shortest path(s) between vertices of
a given graph.
Shortest path between two vertices is a path that has the least cost as compared to all
other existing paths.
BELLMAN-FORD ALGORITHM:
The single source shortest path algorithm (for arbitrary edge weights, positive or negative),
also known as the Bellman-Ford algorithm, is used to find the minimum distance from the
source vertex to any other vertex. The main difference between this algorithm and Dijkstra's
algorithm is that Dijkstra's algorithm cannot handle negative weights, but here we can handle
them easily.
The Bellman-Ford algorithm finds the distances in a bottom-up manner. At first it finds those
distances which have only one edge in the path. After that, it increases the path length to
find all possible solutions.
For the single-source shortest path problem, you can use Dijkstra's algorithm. With
a normal binary heap, this gives you a time complexity of O((E + V) log V). With a
Fibonacci heap, this can be improved to O(E + V log V), which is faster for dense
graphs.
For the all-pairs shortest path problem, the Floyd-Warshall algorithm can be used. Its time
complexity is O(V^3), where V is the number of vertices in the graph. Input − The cost
matrix of the graph. Output − Matrix of all-pairs shortest paths.
MINIMUM SPANNING TREE:
A spanning tree of a connected graph is a subgraph that includes all the vertices and is a tree;
a minimum cost spanning tree (MST) of a weighted graph is a spanning tree whose total edge
weight is as small as possible. Even the simplest of graphs can contain many spanning trees.
PRIM'S ALGORITHM:
Prim's algorithm to find the minimum cost spanning tree, like Kruskal's algorithm, uses the
greedy approach. Prim's algorithm shares a similarity with the shortest path first algorithms.
Prim's algorithm, in contrast with Kruskal's algorithm, treats the nodes as a single tree and
keeps on adding new nodes to the spanning tree from the given graph.
To contrast with Kruskal's algorithm and to understand Prim's algorithm better, we shall use
the same example −
Remove all loops and parallel edges from the given graph. In case of parallel edges, keep the
one which has the least cost associated and remove all others.
After this step, S-7-A-3-C tree is formed. Now we'll again treat it as a node and will check
all the edges again. However, we will choose only the least cost edge. In this case, C-3-D is
the new edge, which is less than other edges' cost 8, 6, 4, etc.
After adding node D to the spanning tree, we now have two edges going out of it having the
same cost, i.e. D-2-T and D-2-B. Thus, we can add either one. But the next step will again
yield edge 2 as the least cost. Hence, we are showing a spanning tree with both edges
included.
We may find that the output spanning tree of the same graph produced by the two different
algorithms is the same.
KRUSKAL'S ALGORITHM:
Kruskal's algorithm to find the minimum cost spanning tree uses the greedy approach. This
algorithm treats the graph as a forest and every node it has as an individual tree. A tree
connects to another only if it has the least cost among all available options and does not
violate MST properties.
To understand Kruskal's algorithm let us consider the following example −
In case of parallel edges, keep the one which has the least cost associated and remove all
others.
The least cost is 2 and edges involved are B,D and D,T. We add them. Adding them does
not violate spanning tree properties, so we continue to our next edge selection.
Next cost is 3, and associated edges are A,C and C,D. We add them again −
Next cost in the table is 4, and we observe that adding it will create a circuit in the graph. −
We ignore it. In the process we shall ignore/avoid all edges that create a circuit.
We observe that edges with cost 5 and 6 also create circuits. We ignore them and move on.
Now we are left with only one node to be added. Between the two least cost edges available
7 and 8, we shall add the edge with cost 7.
By adding edge S,A we have included all the nodes of the graph and we now have minimum
cost spanning tree.
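A Python sketch of Kruskal's algorithm with union-find; since the original figure is missing,
the edges and weights below are a hypothetical example rather than the graph from the
walk-through above:

# Kruskal's MST: sort edges by weight, add an edge unless it forms a cycle.
def kruskal(n, edges):
    parent = list(range(n))
    def find(x):                        # union-find with path compression
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x
    mst = []
    for w, u, v in sorted(edges):       # consider edges in increasing weight
        ru, rv = find(u), find(v)
        if ru != rv:                    # different trees: no cycle is created
            parent[ru] = rv
            mst.append((u, v, w))
    return mst

# Hypothetical weighted graph on vertices 0..4: (weight, u, v) triples.
edges = [(2, 0, 1), (3, 0, 2), (1, 1, 2), (4, 1, 3), (5, 2, 3), (2, 3, 4)]
print(kruskal(5, edges))   # [(1, 2, 1), (0, 1, 2), (3, 4, 2), (1, 3, 4)]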
DIFFERENCE BETWEEN PRIM'S AND KRUSKAL'S ALGORITHM:
Prim's algorithm grows a single tree, at each step adding the cheapest edge that connects the
tree to a new vertex. Kruskal's algorithm maintains a forest, at each step adding the globally
cheapest edge that does not create a cycle. With a binary heap, Prim's algorithm runs in
O(E log V) time; Kruskal's algorithm runs in O(E log E) time, dominated by sorting the edges.
TOPOLOGICAL SORT:
Topological sorting for a Directed Acyclic Graph (DAG) is a linear ordering of vertices such
that for every directed edge u → v, vertex u comes before v in the ordering. Topological
sorting for a graph is not possible if the graph is not a DAG.
For example, a topological sorting of the following graph is “5 4 2 3 1 0”. There can be
more than one topological sorting for a graph. For example, another topological sorting of
the following graph is “4 5 2 3 1 0”. The first vertex in topological sorting is always a
vertex with in-degree as 0 (a vertex with no incoming edges).
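One standard way to compute a topological order is Kahn's algorithm, which repeatedly
removes a vertex of in-degree 0. The sketch below uses the edge set behind the orderings
quoted above (5→2, 5→0, 4→0, 4→1, 2→3, 3→1); this edge set is our reconstruction, since
the figure itself is missing:

from collections import deque

# Kahn's algorithm: repeatedly output a vertex whose in-degree has dropped to 0.
def topological_sort(graph):
    indegree = {v: 0 for v in graph}
    for v in graph:
        for w in graph[v]:
            indegree[w] += 1
    Q = deque(v for v in graph if indegree[v] == 0)
    order = []
    while Q:
        v = Q.popleft()
        order.append(v)
        for w in graph[v]:              # removing v lowers its successors' in-degree
            indegree[w] -= 1
            if indegree[w] == 0:
                Q.append(w)
    return order                        # a valid linear order if the graph is a DAG

g = {5: [2, 0], 4: [0, 1], 2: [3], 3: [1], 0: [], 1: []}
print(topological_sort(g))   # [5, 4, 2, 0, 3, 1] -- one valid topological order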
MAXIMUM FLOW:
A flow network G = (V, E) is a directed graph in which each edge (u, v) has a nonnegative
capacity c(u, v), together with a distinguished source vertex s and sink vertex t. The quantity
f(u, v), which can be positive or negative, is known as the net flow from vertex u to vertex v.
In the maximum-flow problem, we are given a flow network G with source s and sink t, and
we wish to find a flow of maximum value from s to t.
The three properties can be described as follows:
1. Capacity Constraint makes sure that the flow through each edge is not greater than
the capacity.
2. Skew Symmetry means that the flow from u to v is the negative of the flow from v to
u.
3. The flow-conservation property says that the total net flow out of a vertex other than
the source or sink is 0. In other words, the amount of flow into a vertex v is the same as the
amount of flow out of v for every vertex v ∈ V - {s, t}.
The value of the flow is the net flow from the source: |f| = Σ f(s, v), summed over all v ∈ V.
The positive net flow leaving a vertex is described symmetrically. One interpretation of the
Flow-Conservation Property is that the positive net flow entering a vertex other than the
source or sink must equal the positive net flow leaving the vertex.
A flow f is said to be integer-valued if f(u, v) is an integer for all (u, v) ∈ E. Clearly, the
value of the flow is an integer if f is an integer-valued flow.
UNIT – IV
Course objective:
After completing this Unit, you will be able to:
Classify problems as tractable or intractable
Define decision problems
Define the class P
Define nondeterministic algorithms
Define the class NP
Define polynomial transformations
Define the class of NP-Complete
Intractability:
Dictionary Definition of intractable:
“difficult to treat or work.”
In computer science, a problem is intractable if a computer has difficulty solving it.
A problem is intractable if it is not tractable
Any algorithm with a growth rate not bounded by a polynomial: c^n, c^(0.01n), n^(log n), n!, etc.
Intractability is a property of the problem, not of the algorithm.
Some problems are intractable because their very definition demands an unrealistically large
output (e.g., list all permutations of n numbers); the Towers of Hanoi is another example.
Undecidable problems: the Halting Problem (proven undecidable by Alan Turing).
Decidable intractable problems: researchers have shown some problems from automata theory
and mathematical logic to be intractable.
Tractable
A problem is tractable if there exists a polynomial-bound algorithm that solves it, i.e., its
worst-case growth rate can be bounded by a polynomial function of its input size.
p(n) = a_k·n^k + … + a_1·n + a_0, where k is a constant
p(n) is θ(n^k)
n lg n is not a polynomial,
but n lg n < n^2, so it is bounded by a polynomial
Decision problem:
Problem where the output is a simple “yes” or “no”
Theory of NP-completeness is developed by restricting problems to decision problems
Optimization problems can be transformed into decision problems
Optimization problems are at least as hard as the associated decision problem
If a polynomial-time algorithm for the optimization problem is found, we would have a
polynomial-time algorithm for the corresponding decision problem
Traveling Salesperson - For a given positive number d, is there a tour having length <= d?
0-1 Knapsack - For a given profit P, is it possible to load the knapsack such that the total
profit is at least P and the total weight <= W?
Class P:
The set of all decision problems that can be solved by polynomial-time algorithms
Decision versions of searching, shortest path, spanning tree, etc. belong to P
Do problems such as Traveling Salesperson and 0-1 Knapsack (for which no polynomial-time
algorithm has been found), etc., belong to P?