Module 1
Module 1
What is Algorithm?
a clearly specified set of simple instructions to be followed
to solve a problem
○ Takes a set of values, as input and
○ produces a value, or set of values, as output
May be specified
○ In English
○ As a computer program
○ As a pseudo-code
Data structures
Methods of organizing data
Program = algorithms + data structures
Introduction
An algorithm is a sequence of computational steps that transform input into output.
These resources should be used wisely, and algorithms that are efficient in terms
of time or space will help you do so.
Properties of Algorithm
1. It should take 0 or more input.
2. It should produce at least one output.
3. It should terminate with in finite time.
4. It should be deterministic in nature.
(Every step should be unambiguous
and clear)
5. It should give correct output.
6. It is programming independent (It is
generic to all programming languages).
Steps to construct Algorithm
Problem definition (understand problem
clearly)
Design algorithm – divide and conquer,
greedy, dynamic
Flow chart
Verification – by flowchart
Coding or implementation by language
Analysis – CPU time, memory
Analysis
Time and memory (Space)
CPU always more costly than memory
So CPU time more important than
memory
In Analysis – firstly check time & then
space
Check space only when time of two
algorithm is same
Time complexity
After giving input and before getting
output the time taken by the algorithm
T(A) = C(A) + R(A)
Analyzing framework:
Measuring input size
Measuring running time
Order of growth
Worst-case, best-case, Average case
Example: Selection Problem
Given a list of N numbers, determine the
kth largest, where k N.
Algorithm 1:
(1) Read N numbers into an array
(2) Sort the array in decreasing order by
some simple algorithm
(3) Return the element in position k
Example: Selection Problem…
Algorithm 2:
(1) Read the first k elements into an array
and sort them in decreasing order
(2) Each remaining element is read one by
one
○ If smaller than the kth element, then it is
ignored
○ Otherwise, it is placed in its correct spot in the
array, bumping one element out of the array.
(3) The element in the kth position is returned
as the answer.
Example: Selection Problem…
Which algorithm is better when
N =100 and k = 100?
N =100 and k = 1?
What happens when N = 1,000,000 and
k = 500,000?
There exist better algorithms
Algorithm Analysis
We only analyze correct algorithms
An algorithm is correct
If, for every input instance, it halts with the correct output
Incorrect algorithms
Might not halt at all on some input instances
Might halt with other than the desired answer
Analyzing an algorithm
Predicting the resources that the algorithm requires
Resources include
○ Memory
○ Communication bandwidth
○ Computational time (usually most important)
Algorithm Analysis…
Factors affecting the running time
computer
compiler
algorithm used
input to the algorithm
○ The content of the input affects the running time
○ typically, the input size (number of items in the input) is the
main consideration
E.g. sorting problem the number of items to be sorted
E.g. multiply two matrices together the total number of
elements in the two matrices
Machine model assumed
Instructions are executed one after another, with no
concurrent operations Not parallel computers
Example
N
Calculate
i
i 1
3
1
1
2 2N+2
3 4N
4 1
N
i 1
i N N O(N ) 2
i 1
N
i 2
N N 2
O ( N 3
)
log N + N = O(N)
logk N = O(N) for any constant k
N = O(2N), but 2N is not O(N)
210N is not O(2N)
Math Review: logarithmic functions
x b iff
a
log x b a
log ab log a log b
log m b
log a b
log m a
log a b b log a
a log n n log a
log b a (log a)b log a b
d log e x 1
dx x
Some rules
When considering the growth rate of a function
using Big-Oh
Ignore the lower order terms and the coefficients
of the highest-order term
No need to specify the base of logarithm
Changing the base from one constant to another changes
the value of the logarithm by only a constant factor
f(N) = (g(N))
There are positive constants c and n0
such that
f(N) c g(N) when N n0
f (N ) f ( N )
then lim = lim
n g ( N ) n g ( N )
A
k i
k
O(N3)
Algorithm 2: Divide-and-
conquer
Divide-and-conquer
split the problem into two roughly equal subproblems, which
are then solved recursively
patch together the two solutions of the subproblems to arrive
at a solution for the whole problem
O(1)
T(m/2)
T(m/2)
O(m)
O(1)
Algorithm 2 (cont’d)
Recurrence equation
T (1) 1
N
T (N ) 2T ( ) N
2
• Input:
• Output:
2
Structure of data
3
Why Study Sorting Algorithms?
• There are a variety of situations that we can
encounter
– Do we have randomly ordered keys?
– Are all keys distinct?
– How large is the set of keys to be ordered?
– Need guaranteed performance?
4
Some Definitions
• Internal Sort
– The data to be sorted is all stored in the computer’s
main memory.
• External Sort
– Some of the data to be sorted might be stored in
some external, slower, device.
• In Place Sort
– The amount of extra space required to sort the data is
constant with the input size.
5
Insertion Sort
• Idea: like sorting a hand of playing cards
– Start with an empty left hand and the cards facing
down on the table.
– Remove one card at a time from the table, and insert
it into the correct position in the left hand
• compare it with each of the cards already in the hand, from
right to left
– The cards held in the left hand are sorted
• these cards were originally the top cards of the pile on the
table
6
Insertion Sort
7
Insertion Sort
8
Insertion Sort
9
Insertion Sort
input array
5 2 4 6 1 3
sorted unsorted
10
Insertion Sort
11
INSERTION-SORT
Alg.: INSERTION-SORT(A) 1 2 3 4 5 6 7 8
for j ← 2 to n a1 a2 a3 a4 a5 a6 a7 a8
do key ← A[ j ] key
Insert A[ j ] into the sorted sequence A[1 . . j -1]
i←j-1
while i > 0 and A[i] > key
do A[i + 1] ← A[i]
i←i–1
A[i + 1] ← key
• Insertion sort – sorts the elements in place
12
Loop Invariant for Insertion Sort
Alg.: INSERTION-SORT(A)
for j ← 2 to n
do key ← A[ j ]
Insert A[ j ] into the sorted sequence A[1 . . j -1]
i←j-1
while i > 0 and A[i] > key
do A[i + 1] ← A[i]
i←i–1
A[i + 1] ← key
Invariant: at the start of the for loop the elements in A[1 . . j-1]
are in sorted order
13
Proving Loop Invariants
• Proving loop invariants works like induction
• Initialization (base case):
– It is true prior to the first iteration of the loop
• Maintenance (inductive step):
– If it is true before an iteration of the loop, it remains true before
the next iteration
• Termination:
– When the loop terminates, the invariant gives us a useful
property that helps show that the algorithm is correct
– Stop the induction when the loop terminates
14
Loop Invariant for Insertion Sort
• Initialization:
– Just before the first iteration, j = 2:
the subarray A[1 . . j-1] = A[1],
(the element originally in A[1]) – is
sorted
15
Loop Invariant for Insertion Sort
• Maintenance:
– the while inner loop moves A[j -1], A[j -2], A[j -3],
and so on, by one position to the right until the proper
position for key (which has the value that started out in
A[j]) is found
– At that point, the value of key is placed into this
position.
16
Loop Invariant for Insertion Sort
• Termination:
– The outer for loop ends when j = n + 1 j-1 = n
– Replace n with j-1 in the loop invariant:
• the subarray A[1 . . n] consists of the elements originally in
A[1 . . n], but in sorted order
j-1 j
Invariant: at the start of the for loop the elements in A[1 . . j-1]
are in sorted order
17
Analysis of Insertion Sort
INSERTION-SORT(A) cost times
for j ← 2 to n c1 n
do key ← A[ j ] c2 n-1
Insert A[ j ] into the sorted sequence A[1 . . j -1] 0 n-1
i←j-1 c4 n-1
n
while i > 0 and A[i] > key c5 j 2 j
t
c6
n
do A[i + 1] ← A[i] j 2
(t j 1)
c7
n
i←i–1 j 2
(t j 1)
T (n) c1n c2 (n 1) c4 (n 1) c5 t j c6 t j 1 c7 t j 1 c8 (n 1)
n n n
j 2 j 2 j 2
18
Best Case Analysis
• The array is already sorted “while i > 0 and A[i] > key”
– A[i] ≤ key upon the first time the while loop test is run
(when i = j -1)
– tj = 1
= an + b = (n)
T (n) c1n c2 (n 1) c4 (n 1) c5 t j c6 t j 1 c7 t j 1 c8 (n 1)
n n n
j 2 j 2 j 2
19
Worst Case Analysis
• The array is in reverse sorted order “while i > 0 and A[i] > key”
– Always A[i] > key in while loop test
– Have to compare key with all elements to the left of the j-th
position compare with j-1 elements tj = j
n
n(n 1) n
n(n 1) n
n(n 1)
using j 1
j
2
j
j 2 2
1 ( j 1)
j 2 2
we have:
j 2 j 2 j 2 20
Comparisons and Exchanges in
Insertion Sort
INSERTION-SORT(A) cost times
for j ← 2 to n c1 n
do key ← A[ j ] c2 n-1
Insert A[ j ] into the sorted sequence A[1 . . j -1] 0 n-1
i←j-1 n2/2 comparisons c4 n-1
c5
n
while i > 0 and A[i] > key j 2 j
t
c6
n
do A[i + 1] ← A[i] j 2
(t j 1)
n2/2 exchanges c7
n
i←i–1 j 2
(t j 1)
22
Selection Sort
• Idea:
– Find the smallest element in the array
– Exchange it with the element in the first position
– Find the second smallest element and exchange it with
the element in the second position
– Continue until the array is sorted
• Disadvantage:
– Running time depends only slightly on the amount of
order in the file
23
Example
8 4 6 9 2 3 1 1 2 3 4 9 6 8
1 4 6 9 2 3 8 1 2 3 4 6 9 8
1 2 6 9 4 3 8 1 2 3 4 6 8 9
1 2 3 9 4 6 8 1 2 3 4 6 8 9
24
Selection Sort
Alg.: SELECTION-SORT(A)
n ← length[A] 8 4 6 9 2 3 1
for j ← 1 to n - 1
do smallest ← j
for i ← j + 1 to n
do if A[i] < A[smallest]
then smallest ← i
exchange A[j] ↔ A[smallest]
25
Analysis of Selection Sort
Alg.: SELECTION-SORT(A) cost times
n ← length[A] c1 1
for j ← 1 to n - 1 c2 n
do smallest ← j c3 n-1
n2/2 for i ← j + 1 to n c4 nj11 (n j 1)
comparisons
do if A[i] < A[smallest] c5
n 1
j 1
(n j )
n
exchanges then smallest ← i c6
n 1
j 1
(n j )
i
1 2 3 n
8 4 6 9 2 3 1
j
27
Example
8 4 6 9 2 3 1 1 8 4 6 9 2 3
i=1 j i=2 j
8 4 6 9 2 1 3 1 2 8 4 6 9 3
i=1 j i=3 j
8 4 6 9 1 2 3 1 2 3 8 4 6 9
i=1 j i=4 j
8 4 6 1 9 2 3 1 2 3 4 8 6 9
i=1 j i=5 j
8 4 1 6 9 2 3 1 2 3 4 6 8 9
i=1 j i=6 j
8 1 4 6 9 2 3 1 2 3 4 6 8 9
i=1 j i=7
j
1 8 4 6 9 2 3
28
i=1 j
Bubble Sort
Alg.: BUBBLESORT(A)
for i 1 to length[A]
do for j length[A] downto i + 1
do if A[j] < A[j -1]
then exchange A[j] A[j-1]
i
8 4 6 9 2 3 1
i=1 j
29
Bubble-Sort Running Time
Alg.: BUBBLESORT(A)
for i 1 to length[A] c1
do for j length[A] downto i + 1 c2
Comparisons: n2/2 do if A[j] < A[j -1] c3
Exchanges: n2/2
then exchange A[j] A[j-1] c4
n
(n i)
n n
T(n) = c1(n+1) + c2 (n i 1) c3 (n i) c4
i 1 i 1 i 1
n
= (n) + (c2 + c2 + c4) (n i)
i 1
n n n
n ( n 1) n 2
n
where (n i ) n i n 2
i 1 i 1 i 1 2 2 2
Thus,T(n) = (n2)
30
Order of growth
• INSERTION SORT : worst case time complexity
an2+bn+c
• Consider only the leading term of formula, since
the lower-order terms are relatively insignificant
for large values of n.
• Also ignore the leading term constant coefficient.