COMP2007 + COMP2907 Notes

- Algorithms are efficient if their running time is polynomial. Worst-case and average-case running times provide bounds on an algorithm's performance based on input size. Graph problems like connectivity and shortest paths can be solved using breadth-first search or depth-first search in polynomial time. Greedy algorithms make locally optimal choices at each step to find global optima for problems like interval scheduling and minimum spanning trees.

Uploaded by Vincent Nguyen

COMP2007 Notes

TRY BRUTE FORCE FIRST TO UNDERSTAND THE PROBLEM, THEN TRY REDUCTION.
Definitions
- Worst-case running time: obtain a bound on the largest possible running time of the algorithm on any input of a given size N.
- Average-case running time: obtain a bound on the running time of the algorithm on a random input, as a function of the input size N.
- An algorithm is efficient if its running time is polynomial.
- Lower bound: the time complexity is not lower than this bound; vice versa for an upper bound. A tight bound is both a lower and an upper bound. Bounds have transitive and additive properties.
- Polynomial time: O(n^d) for some constant d independent of the input.
- Logs generally grow slower than all polynomials.
- Linear time: the running time is at most a constant factor times the size of the input.

Graphs
- Adjacency matrix: an n-by-n matrix with A_uv = 1 if (u,v) is an edge.
  o Time: O(1) to check an edge.
  o Space: O(n^2) for the n-by-n matrix.
- Adjacency list: a node-indexed array of lists.
  o That is, you have an array, and if vertex 1 is connected to 2 and 3, then index 1 of the array holds a linked list of 2 and 3.
  o Saves space: O(m+n), and identifying all edges takes O(m+n); no wasted space, since the 0s are gone.
Simple path: sequence of paired nodes connected by edges (path) and
all nodes are distinct (simple).
Connected graph: All nodes have paths to each other (graph must be
undirected).
Trees: An undirected graph that is connected and acyclic.
o Property: n-1 edges.
o Rooted: Pick a root node and orient edges away from root;
hierarchical.
s-t connectivity problem: Given two nodes s and t, is there a path
between s and t.
s-t shortest path problem: given two nodes s and t, what is the
shortest path between s and t.
- Breadth-first search (BFS): explores layer by layer. Layers are determined (intuition) by picking a root: nodes 1 edge away from the root form the first layer, nodes 2 edges away the second, and so on.
  o Theorem: for each i, L_i consists of all nodes at distance exactly i from s. There is a path from s to t iff t appears in some layer.

  o Running time: O(V+E). Each vertex is enqueued and dequeued at most once, O(V), and scanning all adjacent vertices takes O(E).
  o Works with either an adjacency matrix or an adjacency list; the space complexity will differ, however.
  o Uses a queue.
Connected component: All nodes reachable from s.
o Theorem: Upon termination, R is the connected component
containing s.

Shortest Paths: Compute the shortest path from a given node to all
other nodes.
o BFS: Computes the hop distance from s to u.
Initialise all dist[u] as infinity in BFS. Dist[s] = 0.
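The BFS hop-distance computation above can be sketched as follows (a minimal sketch; the name `bfs_distances` and the dict-of-lists adjacency representation are illustrative choices, not from the notes):

```python
from collections import deque

def bfs_distances(adj, s):
    """Hop distance from s to every node, computed layer by layer."""
    dist = {u: float("inf") for u in adj}  # initialise all dist[u] as infinity
    dist[s] = 0
    q = deque([s])
    while q:
        u = q.popleft()                    # each vertex dequeued at most once
        for v in adj[u]:
            if dist[v] == float("inf"):    # first time seen: v is in layer dist[u]+1
                dist[v] = dist[u] + 1
                q.append(v)
    return dist
```

Unreachable nodes keep distance infinity, matching the initialisation described above.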
- Transitive closure: given a graph G, compute G* such that:
  o G* has the same vertices,
  o with an edge between every pair of nodes that are connected by a path in G.
  o Use BFS, modified to add an edge (s,v) once v has been seen.
- Depth-first search (DFS): pick a starting vertex s, follow outgoing edges that lead to undiscovered vertices, and backtrack (pop the stack) whenever stuck.
  o Assume that G is connected.
  o Uses a stack.
  o Running time: O(V+E).
  o Grows a forest if the graph is not connected; each tree will be a connected component.
- Bipartite graphs: an undirected graph G = (V,E) is bipartite if the nodes can be coloured red or blue such that every edge has one blue end and one red end.
  o Formally, we can divide V into two subsets such that every edge goes between the two subsets and not within them.
  o Lemma: if graph G is bipartite, it cannot contain an odd-length cycle. Proof: can be shown through a diagram.
  o This can be used to create a poly-time certifier that runs in O(m+n).
  o Lemma 2: let G be a connected graph and let L0, ..., Lk be the layers produced by BFS starting at node s. Exactly one of the following holds:
    - No edge of G joins two nodes of the same layer, and G is bipartite.
    - An edge joins two nodes in the same layer (meaning that G contains an odd-length cycle), and G is not bipartite.
- Cut edge: in a connected graph, the removal of a cut edge disconnects the graph.
  o G = (V,E) is connected.
  o G' = (V, E\{e}) is not connected.
  o Algorithm:
    - Run DFS on the graph.
    - For each edge in the DFS tree: remove that edge from G, and check if G is now disconnected (using DFS).
    - Running time: O(mn).
  o Better algorithm: to test edge (u,v), check whether there is a back edge from v's subtree; if there is not, then edge (u,v) is a cut edge. O(m+n).
- Directed reachability: in a directed graph, find all nodes reachable from s.
  o Directed s-t shortest path problem.
  o Graph search: BFS extends to directed graphs.
  o Web crawler: start from web page s; find all pages linked from s, either directly or indirectly.
- Strongly connected: nodes u and v are mutually reachable if there is a path from u to v and also a path from v to u. A directed graph is strongly connected if every pair of nodes is mutually reachable.
  o Lemma: G is strongly connected iff every node is reachable from s and s is reachable from every node. The proof follows from the definition.
  o Running time: O(m+n). Run BFS from s in G, and run BFS from s in G reversed; return true iff every node is reached in both executions.
  o To compute the strongly connected components, use DFS twice: once to compute finish[u], then run DFS on the reversed graph in the main loop in decreasing order of finish[u]. Each tree of the resulting forest is a strongly connected component.
- Topological order: an ordering of the nodes v_1, ..., v_n such that for every edge (v_i, v_j) we have i < j.
  o Lemma: if G has a topological ordering, then G is a DAG. Proof by contradiction: a cycle would require both i < j and j < i; the precedence cannot be settled.
  o Every DAG has a topological ordering: there must be at least one node with no incoming edge, otherwise there would be a cycle.
  o Running time: O(m+n). Maintain an incoming-edge counter for each node; if a node v's counter hits 0, remove v and update all counts.
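The counter-based topological ordering above can be sketched like this (a minimal sketch, often called Kahn's algorithm; the function name and 0-indexed node labels are my own choices):

```python
from collections import deque

def topological_order(n, edges):
    """Topological order of nodes 0..n-1, or None if G has a cycle."""
    adj = {v: [] for v in range(n)}
    indeg = {v: 0 for v in range(n)}        # incoming-edge counter per node
    for u, v in edges:
        adj[u].append(v)
        indeg[v] += 1
    q = deque(v for v in range(n) if indeg[v] == 0)  # nodes with no incoming edge
    order = []
    while q:
        v = q.popleft()
        order.append(v)                      # "remove" v from the graph
        for w in adj[v]:
            indeg[w] -= 1                    # update counts
            if indeg[w] == 0:
                q.append(w)
    return order if len(order) == n else None
```

If some nodes are never removed, they lie on a cycle, so no topological ordering exists and `None` is returned.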
Greedy Algorithms
- Definition: a greedy algorithm follows the problem-solving heuristic of making the locally optimal choice at each stage, with the hope of finding a global optimum.

- Interval scheduling:
  o Input: a set of n jobs; each job i starts at s_i and finishes at f_i.
  o Two jobs are compatible if they don't overlap in time.
  o Goal: find a maximum-cardinality subset of mutually compatible jobs.
  o Templates:
    - Earliest start time: breaks if there is one long block that eclipses small intervals, where the first interval starts later.
    - Shortest interval: consider two long intervals, and one short interval that is eclipsed by both intervals on either side; the optimum is to ignore the shortest.
    - Fewest conflicts: also fails on counterexamples.
  o Earliest-finish-time greedy algorithm: consider jobs in increasing order of finish time; take each job provided it is compatible with the ones already taken.
    - Running time: O(n log n).
    - Compatibility check: job i is compatible if s_i >= f_i*, where * denotes the last job added.


  o Proof (by contradiction):
    - Assume greedy is not optimal.
    - Let i_1, i_2, ..., i_k denote the set of jobs selected by greedy.
    - Let j_1, j_2, ..., j_m denote the set of jobs in the optimal solution.
    - We only need to prove |A| = |O|, as proving A = O is too hard: OPT could simply have chosen a different interval in the same time frame while the overall number of intervals remains the same. Use induction to show greedy "stays ahead": f(i_r) <= f(j_r) for all r.
    - Since f(i_{r-1}) <= f(j_{r-1}) <= s(j_r), job j_r is available when the greedy algorithm makes its r-th choice. Hence f(i_r) <= f(j_r).
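The earliest-finish-time rule above can be sketched as follows (a minimal sketch; jobs as `(start, finish)` tuples and the function name are illustrative choices):

```python
def interval_scheduling(jobs):
    """Earliest-finish-time greedy: return a max-cardinality compatible subset."""
    selected = []
    last_finish = float("-inf")                    # finish time of the last job added
    for s, f in sorted(jobs, key=lambda j: j[1]):  # increasing order of finish time
        if s >= last_finish:                       # compatible with jobs already taken
            selected.append((s, f))
            last_finish = f
    return selected
```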
- Interval partitioning: given intervals (s_i, f_i), find the minimum number of classrooms needed so that all the intervals within each classroom are compatible.
  o Number of classrooms needed >= depth, the maximum number of intervals passing over a single point in time.
  o Algorithm: sort the intervals by start time; assign each lecture to any compatible classroom, or open a new classroom. Maintain the finish time of the last job added to each classroom.
  o Proof: let d = the number of classrooms that greedy allocates. Classroom d is opened because some lecture is incompatible with all d-1 other classrooms. Since we sorted by start time, all of these incompatibilities are caused by lectures that start no later than s_i. Thus we have d lectures overlapping at time s_i + e, where e is a very small number. Hence every schedule uses >= d classrooms.
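The partitioning greedy above can be sketched with a min-heap of classroom finish times (a minimal sketch; the heap is my own choice for finding a compatible classroom quickly):

```python
import heapq

def interval_partitioning(lectures):
    """Minimum number of classrooms: sort by start time, reuse or open a classroom."""
    rooms = []                            # min-heap of finish times, one per classroom
    for s, f in sorted(lectures):         # increasing order of start time
        if rooms and rooms[0] <= s:       # earliest-finishing classroom is compatible
            heapq.heapreplace(rooms, f)
        else:
            heapq.heappush(rooms, f)      # open a new classroom
    return len(rooms)
```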

- Scheduling to minimise lateness: schedule all jobs to minimise the maximum lateness. Lateness arises because jobs have due times and can only be processed one at a time.
  o Algorithm (earliest deadline first): sort by deadline and assign in that order.
  o Swap argument: an inversion in a schedule is a pair (i,k) such that i < k (by deadline) but k is scheduled before i. Swapping an adjacent inversion does not increase the maximum lateness:
    - l'_x = l_x for all x != i, k;
    - l'_k = f'_k - d_k = f_i - d_k <= f_i - d_i = l_i (since d_i <= d_k, and after the swap k finishes at time f_i).
  o Proof: define S* to be an optimal schedule that has the fewest number of inversions.
    - S* has no idle time.
    - If S* has no inversions, S = S*, done.
    - If S* has an inversion, let (i,k) be an adjacent inversion.
      o Swapping i and k does not increase the maximum lateness and strictly decreases the number of inversions.
      o This contradicts the definition of S*.
  o Running time: O(n log n).
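The earliest-deadline-first rule can be sketched as follows (a minimal sketch; jobs as `(processing_time, deadline)` tuples are my own representation):

```python
def min_max_lateness(jobs):
    """EDF schedule: return the maximum lateness (0 if nothing is late)."""
    t = 0                                           # current finish time, no idle time
    max_late = 0
    for p, d in sorted(jobs, key=lambda j: j[1]):   # sort by deadline
        t += p                                      # this job finishes at time t
        max_late = max(max_late, t - d)             # lateness = finish - deadline
    return max_late
```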

- Minimum spanning tree (MST): given a connected graph G = (V,E) with real-valued edge weights, an MST is a subset T of the edges such that T is a spanning tree whose sum of edge weights is minimised.
  o Cayley's theorem: there are n^(n-2) spanning trees of K_n (so brute force is hopeless).
  o Cut property: for any cut, the minimum-weight edge crossing the cut must be in the MST.
  o Cycle property: the maximum-weight edge in a cycle cannot be in the MST.
  o Prim's algorithm (cut): grow a "cloud" S from a start node (whereas Kruskal's chooses from ordered weights).
    - Initialise S = {any node}.
    - Apply the cut property to S: add the min-cost edge in the cutset corresponding to S to T, and add the one new explored node u to S.
    - Keep adding the cheapest crossing edge.
  o Kruskal's algorithm (cycle): consider edges in ascending order of weight. If adding e creates a cycle, discard it according to the cycle property; otherwise insert e = (u,v) into T according to the cut property, where S = the set of nodes in u's connected component.
  o Tie-breaking: by adding small perturbations to break ties, we can drop the assumption that edge costs are distinct; break ties according to index.
  o Union-find data structure: O(m log n + m + n log n).
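Kruskal's algorithm with a union-find structure can be sketched like this (a minimal sketch; edges as `(weight, u, v)` tuples and the simple path-compression union-find are illustrative choices):

```python
def kruskal(n, edges):
    """MST of nodes 0..n-1; edges as (weight, u, v). Returns (total_weight, tree)."""
    parent = list(range(n))                # union-find forest

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    mst, total = [], 0
    for w, u, v in sorted(edges):          # ascending order of weight
        ru, rv = find(u), find(v)
        if ru != rv:                       # adding (u,v) does not create a cycle
            parent[ru] = rv
            mst.append((u, v, w))
            total += w
    return total, mst
```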
- Shortest path (Dijkstra's algorithm):
  o Maintain a set of explored nodes for which we have determined the shortest-path distance.
  o Repeatedly choose the unexplored node which minimises the tentative distance, and add it to the explored set.
  o Proof via induction.
  o Invariant: for each explored node u, d(u) is the length of the shortest s-u path.
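A heap-based sketch of Dijkstra's algorithm (minimal sketch; the lazy-deletion heap and the `(neighbour, weight)` adjacency-list format are my own choices, and edge weights are assumed non-negative):

```python
import heapq

def dijkstra(adj, s):
    """Shortest-path distances from s; adj[u] = list of (v, weight)."""
    dist = {s: 0}
    pq = [(0, s)]
    while pq:
        d, u = heapq.heappop(pq)            # unexplored node with min tentative distance
        if d > dist.get(u, float("inf")):
            continue                        # stale heap entry: u was already settled
        for v, w in adj[u]:
            if d + w < dist.get(v, float("inf")):
                dist[v] = d + w             # found a shorter s-v path through u
                heapq.heappush(pq, (dist[v], v))
    return dist
```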

- Clustering: given a set U of n objects labelled p_1, ..., p_n, classify them into coherent groups.
- Distance function: a numeric value specifying the closeness of two objects.
  o Algorithm: form a graph on the vertex set U, corresponding to n singleton clusters.
    - Find the closest pair of objects such that each is in a different cluster, and add an edge between them.
    - Repeat n-k times, until there are exactly k clusters.
    - This is the same as finding an MST (Kruskal's) and deleting the k-1 most expensive edges.

Divide and conquer:
- Break up the problem into several parts.
- Solve each part recursively.
- Combine the solutions to the sub-problems into an overall solution.
- Examples: binary search, mergesort.
- Closest pair of points:
  o Given n points in the plane, find a pair with the smallest Euclidean distance between them.
  o Algorithm (assume no two points have the same x-coordinate):
    - Divide: draw a vertical line L so that roughly n/2 points lie on each side.
    - Conquer: find the closest pair on each side recursively.
    - Combine: find the closest pair with one point on each side; return the best of the 3 solutions.
    - Let d be the best distance found by the two recursive calls; we only need to consider points within distance d of the line.
    - Sort the points in this 2d-wide strip by their y-coordinate, and check the distances only of nearby points in the sorted list.
    - Proof: no two points on the same side lie in the same d/2-by-d/2 box, so two points at least 2 rows apart have distance >= 2(d/2) = d.
  o Running time: O(n log^2 n); can achieve O(n log n) if we maintain pre-sorted lists.

- Master method: applies to recurrences of the form
  T(n) = a T(n/b) + f(n), where a >= 1, b > 1, and f is asymptotically positive.
  o Case 1: if f(n) = O(n^(log_b a - e)) for some constant e > 0, then f grows polynomially slower than n^(log_b a). Hence T(n) = Θ(n^(log_b a)).
  o Case 2: if f(n) = Θ(n^(log_b a) log^k n) for some constant k >= 0, then f(n) and n^(log_b a) grow at similar rates. Thus T(n) = Θ(n^(log_b a) log^(k+1) n).
  o Case 3: if f(n) = Ω(n^(log_b a + e)) for some constant e > 0, then f(n) dominates the recurrence and hence T(n) = Θ(f(n)).
  o Quick and dirty: if n^(log_b a) ≈ f(n), add a factor of log n; if f(n) is polynomially smaller, take n^(log_b a); if polynomially greater, take f(n). E.g. mergesort, T(n) = 2T(n/2) + Θ(n), falls into case 2 with k = 0, giving T(n) = Θ(n log n).
- Sweepline technique (and computational geometry): the study of algorithms to solve problems stated in terms of geometry.
- Depth of intervals: given a set S of n intervals, compute the depth of S, i.e. the maximum number of intervals passing over a single point.
  o Algorithm: sweep a line from left to right while maintaining the current depth. Data structure: a balanced binary search tree holding the event points.
  o Event points: the endpoints of the intervals.
  o The current depth is stored with the sweepline.
  o Running time: O(n log n).
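The depth-of-intervals sweep can be sketched by sorting the event points directly (a minimal sketch; sorted-list events stand in for the BST described above, and intervals are treated as half-open so a shared endpoint does not count as overlap):

```python
def interval_depth(intervals):
    """Max number of intervals over any point, via a left-to-right event sweep."""
    events = []
    for s, f in intervals:
        events.append((s, +1))   # event point: interval opens
        events.append((f, -1))   # event point: interval closes
    # at equal coordinates, process closes before opens (half-open intervals)
    events.sort()
    depth = best = 0
    for _, delta in events:
        depth += delta           # current depth stored with the sweepline
        best = max(best, depth)
    return best
```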

- Segment intersection: given arbitrarily oriented segments in the plane, report all pairs of segments that intersect.
  o Simulate sweeping a vertical line from left to right across the plane.
  o Events: discrete points in time when the sweep-line status needs to be updated.
  o Sweep-line status: the information stored along the sweep line.
  o Cleanliness property: at any point in time, everything to the left of the sweep line is clean, i.e. properly processed (this is the invariant).
  o Algorithm:
    - Store the segments crossing the sweep line in a balanced binary search tree T, ordered by y-coordinate.
    - When deleting a segment from T, two segments become adjacent: check whether they intersect.
    - When inserting a segment into T, it becomes adjacent to two segments; we can find them in O(log n) and check whether they intersect.
    - If we find an intersection, we are done (decision version).
  o Running time: O(n log n) for the decision version; O(n log n + h log n) to report all h intersections.

- Convex hulls: a subset S of the plane is convex if for every pair of points (p,q) in S, the straight line segment pq is completely contained in S. The convex hull of a point set is the smallest convex set containing S.
  o Divide and conquer approach:
    - FindHull(S, A, B): if S is not empty, then:
      - Find the farthest point C in S from the line AB.
      - Add C to the convex hull between A and B.
      - S0 = {points inside triangle ABC} (discarded).
      - S1 = {points to the right of line AC}.
      - S2 = {points to the right of line CB}.
      - FindHull(S1, A, C); FindHull(S2, C, B).
  o Sweepline approach: maintain the hull while adding the points one by one, sweeping from left to right. O(n log n).
- Closest pair (sweepline): use two parallel vertical sweep-lines, the front L_f and the back L_b, kept horizontal distance d apart.
  o Invariant: the closest pair among the points to the left of L_f, and the distance d between this pair.
  o Data structure: a BST T storing all points of S between the two sweeplines, ordered from top to bottom.
  o On reaching an event point s with L_f:
    - Sweep L_b forward and update T.
    - Find the point s' closest to s in T within vertical distance d of s.
    - If |ss'| < d, then set d = |ss'| and CP = (s, s').
    - Insert s into T.
  o Running time: O(n log n).
- Visibility: let S be a set of n disjoint line segments in the plane, and let p be a point not on any line segment of S; determine all the line segments that p can see.
  o Start the sweep by sorting all segments intersecting the starting ray.
  o Event points: the endpoints of the segments.
  o Keep track of: the segment that p sees in the current direction, and the order of the segments along the ray (in a structure D).
  o Invariant: the segments seen so far, and the order of the segments intersecting the ray.
  o Event handling, two cases:
    - First endpoint: insert the new segment s into D; if s is the first segment hit by the ray, report s.
    - Second endpoint: remove s from D; if s was the first segment in D, report the new first segment in D.
  o Complexity: 2n endpoints, each event handled in O(log n) (insert/delete in a BST) = O(n log n).

- The median problem: given a sequence of n numbers, find the median in linear time.
- The selection problem: given an unsorted array A with n numbers and a number k, find the k-th smallest number in A.
  o Algorithm solving both: divide and conquer; partition into groups of 5, recursively find the median of the group medians, and partition around it. Running time: O(n).

Dynamic programming
1) Define the sub-problems: define what OPT(i) is and what it computes, and the invariant.
2) Find the recurrence: name all the cases and write the formula.
3) Solve the base cases.
4) Transform the recurrence into an efficient algorithm.

- Definition: break up a problem into a series of overlapping sub-problems, and build up solutions to larger and larger sub-problems.
- Weighted interval scheduling:
  o Goal: find the maximum-weight subset of mutually compatible jobs.
  o The greedy algorithm fails once weights are allowed: e.g. one job with weight 999 against one with weight 1.
  o Sort by finish time, then define p(j) = the largest index i < j such that job i is compatible with job j.
  o Cases:
    - OPT selects job j: it can't use incompatible jobs, and must include the optimal solution to the problem consisting of the remaining compatible jobs 1, 2, ..., p(j).
    - OPT does not select job j: it must include the optimal solution consisting of the remaining compatible jobs 1, 2, ..., j-1.
  o Recurrence:
    - OPT(j) = 0 if j = 0
    - OPT(j) = max{ v_j + OPT(p(j)), OPT(j-1) } otherwise
  o Dynamic programming solutions are bottom-up.
  o Memoization: store the results of each sub-problem; look them up when needed.
  o Running time: O(n log n), or O(n) if the jobs are pre-sorted.
  o Invariant: everything to the left of i is optimal.
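The recurrence above can be sketched bottom-up (a minimal sketch; jobs as `(start, finish, value)` tuples, with p(j) computed by binary search over the sorted finish times):

```python
import bisect

def weighted_interval_scheduling(jobs):
    """Max total value of mutually compatible jobs; jobs as (start, finish, value)."""
    jobs = sorted(jobs, key=lambda j: j[1])    # sort by finish time
    finishes = [f for _, f, _ in jobs]
    n = len(jobs)
    # p[j] = number of jobs finishing no later than job j's start time
    p = [bisect.bisect_right(finishes, jobs[j][0]) for j in range(n)]
    opt = [0] * (n + 1)                        # bottom-up table; OPT(0) = 0
    for j in range(1, n + 1):
        _, _, v = jobs[j - 1]
        opt[j] = max(v + opt[p[j - 1]],        # take job j, then jobs 1..p(j)
                     opt[j - 1])               # skip job j
    return opt[n]
```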


- Maximum-sum contiguous subarray:
  o OPT[i] = maximum sum of a subarray ending at i.
  o OPT[i] = max{ OPT[i-1] + A[i], 0 }.
- Knapsack:
  o OPT(i,w) = max-profit subset of items 1, ..., i with weight limit w.
  o OPT(i,w) = 0 if i = 0; OPT(i-1,w) if w_i > w; max{ OPT(i-1,w), v_i + OPT(i-1, w-w_i) } otherwise.
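The knapsack recurrence can be sketched as a bottom-up table (a minimal sketch; items as `(weight, value)` tuples are my own representation):

```python
def knapsack(items, W):
    """0/1 knapsack: max value of a subset of (weight, value) items within capacity W."""
    opt = [[0] * (W + 1) for _ in range(len(items) + 1)]   # OPT(0, w) = 0
    for i, (wi, vi) in enumerate(items, start=1):
        for w in range(W + 1):
            if wi > w:
                opt[i][w] = opt[i - 1][w]                  # item i cannot fit
            else:
                opt[i][w] = max(opt[i - 1][w],             # skip item i
                                vi + opt[i - 1][w - wi])   # take item i
    return opt[len(items)][W]
```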
- RNA: a string B = b_1 b_2 ... b_n over the alphabet {A, C, G, U}.
  o Secondary structure: RNA is single-stranded, so it tends to loop back and form base pairs with itself.
  o Free energy: the usual hypothesis is that an RNA molecule will form the secondary structure with optimum total free energy.
  o Goal: given an RNA molecule B, find a secondary structure S that maximizes the number of base pairs.
  o Constraints: no crossing pairs, no sharp turns; A pairs with U, and C pairs with G.
  o Sub-problems: OPT(i,j) = the maximum number of base pairs in a secondary structure of the substring b_i b_{i+1} ... b_j.
  o Recurrence:
    - Case 1: base b_j is not involved in a pair. OPT(i,j) = OPT(i, j-1).
    - Case 2: base b_j pairs with b_t for some i <= t < j-4. The non-crossing constraint decouples the resulting sub-problems:
      OPT(i,j) = 1 + max{ OPT(i, t-1) + OPT(t+1, j-1) } over i <= t < j-4.
  o Base case: if i >= j-4, then OPT(i,j) = 0, by the no-sharp-turns condition.
  o Running time: O(n^3).

- Shortest path problem: given a directed graph G = (V,E) with edge weights, find the shortest path from s to t.
  o Motivation: negative edge weights and cycles.
  o Adding a constant to every edge weight can fail.
  o A graph with a negative cycle cannot have a shortest path: you could loop forever.
  o Sub-problems: OPT(i,v) = the length of the shortest v-t path P using at most i edges.
  o Recurrence:
    - Case 1: P uses at most i-1 edges. OPT(i,v) = OPT(i-1, v).
    - Case 2: P uses exactly i edges. If (v,w) is the first edge, then OPT uses (v,w) and then selects the best w-t path using at most i-1 edges.
    - OPT(i,v) = min{ OPT(i-1,v), min over (v,w) in E of [ OPT(i-1,w) + c_vw ] }
  o Base cases: OPT(0,t) = 0, and OPT(0,v) = infinity for v != t.
  o Bellman-Ford:
    - For each node v in V, set M[v] = infinity and successor[v] = ∅; set M[t] = 0.
    - For i = 1 to n-1:
      - For each node w in V: if M[w] was updated in the previous iteration, then for each edge (v,w) in E, if M[v] > M[w] + c_vw, update M[v] (with successor[v] = w).
      - If no M[w] value changed in iteration i, stop.
    - Improvements:
      - Maintain only one array M[v] = the shortest path we have found so far.
      - No need to check edges of the form (v,w) unless M[w] changed in the previous iteration.
    - Theorem: throughout the algorithm, M[v] is the length of some v-t path, and after i rounds of updates, the value M[v] is no larger than the length of the shortest v-t path using <= i edges.
    - Memory: O(m+n). Running time: O(mn) worst case, but faster on average.
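The Bellman-Ford loop above can be sketched with the single-array improvement (a minimal sketch; edges as `(v, w, cost)` tuples meaning a directed edge v -> w, distances computed to the target t):

```python
def bellman_ford(n, edges, t):
    """M[v] = shortest distance from v to t using at most n-1 edges."""
    INF = float("inf")
    M = [INF] * n
    M[t] = 0
    for _ in range(n - 1):             # at most n-1 rounds of updates
        updated = False
        for v, w, c in edges:
            if M[w] + c < M[v]:        # relax edge (v, w)
                M[v] = M[w] + c
                updated = True
        if not updated:                # no value changed this iteration: stop
            break
    return M
```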
- Segmented least squares:
  o Find a line (or set of lines) y = ax + b that minimises the sum of the squared errors.
  o Sub-problems: OPT(j) = the minimum cost for points p_1, p_2, ..., p_j.
  o Recurrence: OPT(j) = min over 1 <= i <= j of { e(i,j) + c + OPT(i-1) }, where e(i,j) is the minimum sum of squares for points p_i, ..., p_j and c is the cost per line.
  o Base case: OPT(0) = 0.
  o Running time: O(n^3), space O(n^2).

Flow Networks
- An s-t flow is a function f that satisfies:
  o Capacity restriction: for each e in E: 0 <= f(e) <= c(e).
  o Conservation: for each v in V - {s,t}: the flow into v equals the flow out of v.
- v(f) = the sum of f(e) over edges e out of s: the total value of the flow is the flow leaving s.
- Max-flow problem:
  o Find an s-t flow of maximum value.
- Cuts:
  o An s-t cut is a partition (A,B) of V with s in A and t in B.
  o The capacity of a cut (A,B) is: cap(A,B) = the sum of c(e) over edges e out of A.
  o The net flow sent across a cut is equal to the amount leaving s: v(f) = f_out(A) - f_in(A). (This is the flow across a cut, not the capacity described above.)
  o Weak duality: the value of any flow is at most the capacity of any cut: v(f) <= cap(A,B).
  o Optimality: if v(f) = cap(A,B), then f is a max flow and (A,B) is a min cut.

- Greedy algorithm:
  o Start with f(e) = 0 for all edges e in E.
  o Find an s-t path P where each edge has f(e) < c(e).
  o Augment the flow along path P.
  o Repeat until stuck.
- Residual graph G_f:
  o For each edge carrying flow, put a backward edge with the amount of flow in G; this lets us undo flow already sent.
  o The residual capacity of an edge in G_f is c(e) - f(e) for the forward edge and f(e) for the backward edge; it tells us how much flow we can send, given the current flow.
- Bottleneck: the minimum residual capacity of any edge on P with respect to the current flow f. This is the maximum flow we can send along this simple s-t path.
- Ford-Fulkerson:
  o Build the residual graph; initialise all flows to 0.
  o While there exists an augmenting path P in G_f: f = Augment(f, P) [by the bottleneck]; update G_f.
  o Return f.
  o Proof (assume the initial capacities are integers): at every intermediate stage of the Ford-Fulkerson algorithm, the flow values and the residual capacities in G_f are integers.
    - Induction.
      o Base case: initially the statements are correct.
      o Induction hypothesis: true after j iterations.
      o Induction step: since all residual capacities in G_f are integers, the bottleneck must also be an integer. Thus the flow will have integer values, and hence so will the capacities in the new residual graph.
  o Integrality theorem: if all capacities are integers, then there exists a max flow f for which every flow value is an integer.
  o Augmenting path theorem: a flow f is a max flow if and only if there are no augmenting paths.
  o Max-flow min-cut theorem: the value of the max flow is equal to the capacity of the min cut.
  o Running time: O(C(m+n)), where C is the value of the max flow.
- Scaling max-flow algorithm:
  o Use a delta scaling factor on the residual capacities considered: O(m^2 log C).
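The Ford-Fulkerson loop can be sketched as follows (a minimal sketch; BFS is used to find augmenting paths, and the dict-of-dicts capacity format is my own choice):

```python
from collections import deque

def max_flow(capacity, s, t):
    """Ford-Fulkerson with BFS augmenting paths; capacity: {u: {v: c(u,v)}}."""
    residual = {u: dict(nbrs) for u, nbrs in capacity.items()}
    for u in capacity:                        # ensure backward edges exist, capacity 0
        for v in capacity[u]:
            residual.setdefault(v, {}).setdefault(u, 0)
    flow = 0
    while True:
        parent = {s: None}                    # BFS for an augmenting path in G_f
        q = deque([s])
        while q and t not in parent:
            u = q.popleft()
            for v, c in residual[u].items():
                if c > 0 and v not in parent:
                    parent[v] = u
                    q.append(v)
        if t not in parent:
            return flow                       # no augmenting path: f is a max flow
        path, v = [], t                       # reconstruct the s-t path
        while parent[v] is not None:
            path.append((parent[v], v))
            v = parent[v]
        b = min(residual[u][v] for u, v in path)   # bottleneck along P
        for u, v in path:
            residual[u][v] -= b               # use forward residual capacity
            residual[v][u] += b               # backward edge can undo flow sent
        flow += b
```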

- Bipartite matching:
  o A subset M of E is a matching if each node appears in at most one edge of M.
  o Max matching: find a maximum-cardinality matching.
  o Algorithm: use max flow.
    - Create a digraph G'.
    - Direct all edges from L to R, and assign unit capacity.
    - Attach a source s, with unit-capacity edges from s to each node in L.
    - Likewise, unit-capacity edges from each node in R to t.
  o Proof:
    - Each of the k paths defined by a matching M carries 1 unit of flow, so there is a flow f of value k.
    - Conversely, by the integrality theorem an integral max flow of value k exists, so each f(e) is 0 or 1.
    - The unit capacities ensure each node in L and R participates in at most one matched edge; the cardinality of the matching is therefore k.
  o Running time: O(mn), since the max-flow value C is at most n.
  o Perfect matching: all nodes appear in the matching.
  o Marriage theorem (Hall): G has a perfect matching iff |N(S)| >= |S| for all subsets S of L. E.g. if the neighbourhood of 3 nodes has only 2 nodes, there is no perfect matching.
- Disjoint paths: two paths are edge-disjoint if they have no edge in common.
  o Assign unit capacity to every edge.
  o The max-flow value is the maximum number of edge-disjoint paths.
  o Network connectivity: the maximum number of edge-disjoint s-t paths is equal to the minimum number of edges whose removal disconnects t from s.
- Circulation with demands:
  o Necessary condition: sum of supplies = sum of demands.
  o Reduce to max flow:
    - Attach a new source s and sink t.
    - For each v with d(v) < 0, add edge (s,v) with capacity -d(v); for each v with d(v) > 0, add edge (v,t) with capacity d(v).
    - G has a circulation iff G' has a max flow of value D (the total demand).
- Survey design:
  o Design a survey asking n_1 consumers about n_2 products.
  o Can only survey consumer i about product j if they own it.
  o Ask consumer i between c_i and c'_i questions.
  o Ask between p_j and p'_j consumers about product j.
  o Model as a circulation problem:
    - Include edge (i,j) if consumer i owns product j.
    - If the circulation problem is feasible, then the survey problem is feasible, and vice versa.
- Image segmentation.
- Baseball elimination:
  o A team is eliminated iff the max flow is less than the total number of games left.

Polynomial-Time Reduction
- Problem X polynomial-time reduces to problem Y, denoted X <=_p Y, if arbitrary instances of problem X can be solved using a polynomial number of standard computational steps plus a polynomial number of calls to a black box that solves Y.
- Reduction strategies:
  o Reduction by simple equivalence: vertex cover and independent set reduce to each other; S is an independent set iff V\S is a vertex cover.
  o Reduction from a special case to the general case: vertex cover to set cover (basically the same problem).
  o Reduction by encoding with gadgets: the most used; transform the input so that the problem can be solved using another problem (like assignment 5).

Class NP-hard
- NP-complete: a problem in NP such that every problem in NP polynomially reduces to it.
- NP-hard: a decision problem such that every problem in NP polynomially reduces to it (membership in NP is not required).
- P: decision problems for which there is a poly-time algorithm.
- NP: decision problems for which there is a poly-time certifier.

Solving NP-complete problems
- Unlikely to find a poly-time algorithm; sacrifice one of the three desired features:
  o Solving the problem to optimality:
    - Approximation
    - Randomization
  o Solving the problem in polynomial time:
    - Exponential algorithms
  o Solving arbitrary instances of the problem:
    - Solve restricted classes of instances
    - Parameterized algorithms
    - E.g. independent set on trees can be solved in O(n) time.
- Approximation ratio = (cost of approximate solution) / (cost of optimal solution).
- An approximation algorithm for a minimization problem requires an approximation guarantee: the cost of the approximate solution is at most c times the value of the optimal solution.

- 3-SAT: SAT where each clause contains 3 literals; is there a satisfying truth assignment?
- Clique:
  o A clique of a graph G is a complete subgraph of G; is there a clique of size k?
  o The graph G constructed from a 3-SAT formula E has a k-clique iff E is satisfiable.
- Be careful with the direction of the reduction:
  o To prove a problem NP-hard, we reduce a known NP-hard problem to it. E.g. the constructed independent-set instance is a yes-instance iff the formula is satisfiable.
- Hamiltonian cycle: does there exist a simple cycle C that visits every node?
- Longest path.
- TSP: given a set of n cities and a pairwise distance function d(u,v), is there a tour of length <= D?

COMP2907
- MST-based approximation algorithm for TSP:
  o A 2-approximation for metric TSP.
  o Proof: assume H_opt is the optimal tour and H_A is the tour returned by the approximation algorithm.
    - cost(T) <= cost(H_opt), since deleting an edge from the optimal tour gives a spanning tree, and T is a minimum spanning tree.
    - cost(W) <= 2 cost(T) <= 2 cost(H_opt), where W is the walk around the MST, which revisits each edge at most twice; shortcutting W (triangle inequality) gives H_A with cost(H_A) <= cost(W).


- Set Cover: a finite set X and a family F of subsets of X such that X = the union of S over S in F.
  o Find a subset C of F of minimal size which covers X.
  o That is, find the minimum number of sets that cover the universe X.
  o Algorithm:
    - C <- empty set; U <- X.
    - While U is not empty:
      - Select a set S in F that maximises |S ∩ U|.
      - C <- C ∪ {S}; U <- U - S.
    - Return C.
- Loose ratio bound:
  o Claim: if there is a set cover of size k, then after k iterations the algorithm has covered at least half of the remaining elements.
  o Proof: the remaining elements can be halved O(log n) times, and each halving takes at most k iterations; therefore, after k log n iterations all n elements must be covered, giving an O(log n)-approximation.
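The greedy loop above can be sketched directly with Python sets (a minimal sketch; it assumes F really covers X, otherwise the loop would not terminate):

```python
def greedy_set_cover(X, F):
    """Greedy set cover: repeatedly take the set covering the most uncovered elements."""
    U = set(X)                                  # uncovered elements
    C = []
    while U:
        S = max(F, key=lambda s: len(s & U))    # maximises |S ∩ U|
        C.append(S)
        U -= S                                  # U <- U - S
    return C
```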
- Fast matrix multiplication:
  o Divide: partition A and B into ½n-by-½n blocks.
  o Compute: 14 ½n-by-½n matrices via 10 matrix additions.
  o Multiply: 7 ½n-by-½n matrices recursively.
  o Combine: the 7 products into 4 terms using 8 matrix additions.

- TSP by dynamic programming:
  o Regard a tour as a simple path that starts and ends at vertex 1.
  o Every tour consists of an edge (1,k) for some k in V - {1} and a path from k back to vertex 1; the path from vertex k to vertex 1 goes through each vertex in V - {1,k} exactly once.
  o Let OPT[U,t] be the length of the shortest path starting at vertex 1, going through all vertices in U, and terminating at vertex t.
  o OPT[V,1] is the length of an optimal salesperson tour.
  o Compute all solutions to the sub-problems in order of increasing cardinality of U.
  o There are 2^n sets U and fewer than n choices of the terminating vertex t; total time: O(n^2 2^n).
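The subset DP above can be sketched as follows (a minimal sketch, commonly known as the Held-Karp algorithm; bitmask encoding of U and 0-indexed vertices with vertex 0 playing the role of vertex 1 are my own choices):

```python
from itertools import combinations

def held_karp(dist):
    """Optimal TSP tour length for an n x n distance matrix, in O(n^2 2^n) time."""
    n = len(dist)
    # OPT[(mask, t)]: shortest path from vertex 0 through the set encoded by
    # mask (over vertices 1..n-1), terminating at vertex t.
    OPT = {(1 << (t - 1), t): dist[0][t] for t in range(1, n)}
    for size in range(2, n):                   # increasing cardinality of U
        for subset in combinations(range(1, n), size):
            mask = sum(1 << (v - 1) for v in subset)
            for t in subset:
                prev = mask ^ (1 << (t - 1))   # U without the terminating vertex
                OPT[(mask, t)] = min(OPT[(prev, u)] + dist[u][t]
                                     for u in subset if u != t)
    full = (1 << (n - 1)) - 1                  # U = all vertices except 0
    return min(OPT[(full, t)] + dist[t][0] for t in range(1, n))
```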

- Karger's algorithm:
  o Global minimum cut: find a cut (S, S') of minimum cardinality.
  o While |V| > 2: contract a uniformly random edge (u,v) in G; return the cut S given by the two remaining super-nodes.
  o This is a randomized (Monte Carlo) algorithm: to amplify the probability of success, run the contraction algorithm many times.
    - Repeating the contraction algorithm rn^2 times with independent random choices, the probability that all runs fail is at most n^(-c).
  o Running time:
    - n-2 iterations, each iteration O(n): O(n^2) per run.
    - The algorithm is iterated O(n^2 log n) times, hence O(n^4 log n) overall.

- PSPACE: decision problems solvable in polynomial space; this can include exponential-time algorithms.
- PSPACE-complete: Y is in PSPACE and every problem X in PSPACE can be reduced to Y.
- Quantified SAT (QSAT): given a Boolean formula, is there a way to pick the first variable such that, for any choice of the second, there is a choice of the third, and so on (assume an odd number of variables), so that the formula is satisfied? QSAT is PSPACE-complete.

- Planning problems: is it possible to apply a (legal) sequence of operations to get from the initial configuration to the goal configuration? Solvable in EXPTIME.

The art gallery problem:
- How many guards are needed to guard an art gallery?
- Input: a simple polygon with n line segments.
  o Divide the polygon into triangles; #guards <= #triangles.
  o Proof: every pair of points within a triangle can see each other, hence a guard placed in the centre of a triangle can see every point in that triangle.
  o Every simple polygon has a triangulation, because every polygon with more than 3 vertices has a diagonal.
  o To place the guards: assign one of 3 colours to each vertex such that no two adjacent vertices have the same colour; place the guards on the colour class with the fewest vertices.
  o A 3-colouring exists because every polygon with n > 3 vertices has at least two non-overlapping ears. Proof: connect the midpoints of adjacent triangles in the triangulation; this forms a tree, in which each node is connected to at most two others, and the ears are the leaves.

k-path
-

Find a simple path in G on k vertices.


Improved algorithm:
o Colour the vertices of G with k colours uniformally at random.
o Find a colourful k path in G if one exists.
o Probability is e^k.
o Use dynamic programming for all paths.
- Parameterised problem:
  o Given an instance of the problem and a parameter k, decide whether it is a yes- or no-instance.

- Kernelisation: a polynomial-time transformation that maps an instance (I, k) to an instance (I', k') such that:
  o (I', k') is a yes-instance if and only if (I, k) is.
  o k' <= k.
  o |I'| <= f(k) for some function f.
- E.g. for vertex cover: delete every vertex of degree > k (it must be in any cover of size <= k) and decrease k accordingly.
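The vertex cover rule above can be sketched as follows (a minimal sketch of this style of kernelisation; the function name, the k² edge bound, and the None return for a detected no-instance are my own choices):

```python
def vc_kernel(edges, k):
    """Kernelise Vertex Cover: a vertex of degree > k must be in every
    cover of size <= k, so take it, delete its edges, and decrease k."""
    edges = set(map(frozenset, edges))
    changed = True
    while changed and k >= 0:
        changed = False
        deg = {}
        for e in edges:
            for v in e:
                deg[v] = deg.get(v, 0) + 1
        for v, d in deg.items():
            if d > k:
                edges = {e for e in edges if v not in e}
                k -= 1
                changed = True
                break
    if k < 0:
        return None  # more than k forced vertices: no-instance
    # Once every degree is <= k, a yes-instance has at most k*k edges.
    if len(edges) > k * k:
        return None
    return edges, k
```

The surviving instance has size bounded by a function of k alone, which is exactly the |I'| <= f(k) condition.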

Tutorials
1) If we have a blackbox, we can only make assumptions about the overall lower bound. We can't make any assumptions about the upper bound.
2) Greedy 2-approximation for vertex cover: pick both endpoints of the edges in a maximal matching. If we pick k edges, we use 2k vertices, while any cover needs at least k.
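This greedy rule can be sketched as follows (a minimal sketch; the function name is my own):

```python
def matching_vertex_cover(edges):
    """Greedily build a maximal matching; taking both endpoints of each
    matched edge gives a vertex cover at most twice the optimum."""
    cover = set()
    for u, v in edges:
        if u not in cover and v not in cover:  # edge not yet covered
            cover.update((u, v))               # match it, take both ends
    return cover
```

Every edge of the matching needs a distinct cover vertex, which is where the factor-2 bound comes from.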
3) If we add a constant to every edge weight, the shortest path may change (paths with more edges are penalised more); likewise when squaring weights < 1.
4) The optimal spanning tree does not change, since the order of the edge weights doesn't change.
5) Greedy algorithm for sorting by time/weight; exchange argument.
6) s-t path with vertex weights: split each vertex into two vertices, with the vertex weight on the edge between them.
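The splitting transformation can be sketched as follows (a minimal sketch; the (v, 'in')/(v, 'out') node labels are illustrative):

```python
def split_vertices(adj, weight):
    """Turn vertex weights into edge weights: vertex v becomes
    (v,'in') -> (v,'out') with weight[v]; each original edge (u,w)
    becomes (u,'out') -> (w,'in') with weight 0."""
    new_adj = {}
    for v in adj:
        new_adj[(v, 'in')] = [((v, 'out'), weight[v])]
        new_adj.setdefault((v, 'out'), [])
        for w in adj[v]:
            new_adj[(v, 'out')].append(((w, 'in'), 0))
    return new_adj
```

Any s-t path in the new graph pays exactly the weights of the vertices it passes through, so an ordinary shortest-path algorithm now solves the vertex-weighted problem.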

7) Master theorem, T(n) = a·T(n/b) + f(n): a is the number of subproblems and n/b is the size of each subproblem.
8) Pigeon-hole principle: If n items are put into m containers such that n>m,
then at least one container must contain more than one item.
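A tiny illustration of the principle (my own example): with more items than distinct labels, a single scan must find a repeated label.

```python
def find_repeat(items):
    """By pigeonhole, if len(items) exceeds the number of distinct
    labels, some label repeats; return the first repeat seen."""
    seen = set()
    for x in items:
        if x in seen:
            return x
        seen.add(x)
    return None  # all items distinct
```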
9) Locally optimal index: start at the centre and follow the lowest branch; if you hit an up branch, take the lower point.
10) K-th smallest element over two sorted arrays: compare k to the ranks to narrow the search. For an element at index i in one array (the other having m elements), i <= rank(i) <= i + m, so we can safely ignore items with rank greater than k in the larger array.
11) Interval stabbing (report every interval covering a point in S): sort the endpoints of the intervals and the query points from left to right. Set a counter c to zero and sweep the event points from left to right: on a left endpoint increment c, on a right endpoint decrement c, and on a query point p report p if c > 0. O((n+m) log(n+m)).
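The sweep above can be sketched as follows (a minimal sketch; the event-kind encoding that opens intervals before and closes them after coincident query points is my own choice):

```python
def stabbed_points(intervals, points):
    """Sweep left to right; c counts intervals currently open.
    Report each query point covered by at least one interval."""
    events = []
    for l, r in intervals:
        events.append((l, 0, None))  # kind 0: open (sorts first on ties)
        events.append((r, 2, None))  # kind 2: close (sorts last on ties)
    for p in points:
        events.append((p, 1, p))     # kind 1: query, between open and close
    events.sort()
    c, hit = 0, []
    for x, kind, p in events:
        if kind == 0:
            c += 1
        elif kind == 2:
            c -= 1
        elif c > 0:
            hit.append(p)
    return hit
```

Sorting dominates, giving the O((n+m) log(n+m)) bound stated above.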
12) Square intersection: maintain a BST; the problem reduces to 1-D intersection in the y-direction. Sweep in x: insert a point when it lies inside a square's x-range; when the right side of a square is reached, remove it.
13) Visibility between intervals (ray intersection): sweep a ray r counterclockwise from the position; initialise a BST containing all segments intersecting r, ordered by a distance function on the intersection point; then process the events.
14) To merge two hulls into one hull: compute the upper and lower tangents and discard all points lying between them. Find the rightmost point of one hull and the leftmost point of the other; while the connecting line is not a tangent, advance clockwise/anticlockwise.
15) Reverse-delete MST works: sort edges by decreasing weight and, if the graph remains connected after removing an edge, remove it and keep going.
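Reverse-delete can be sketched as follows (a minimal sketch with a naive O(m) connectivity check per edge, so it is far from the best possible running time; function names are my own):

```python
def connected(n, edges):
    """Union-find connectivity check over vertices 0..n-1."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    comps = n
    for u, v, _ in edges:
        ru, rv = find(u), find(v)
        if ru != rv:
            parent[ru] = rv
            comps -= 1
    return comps == 1

def reverse_delete_mst(n, edges):
    """Scan edges (u, v, w) from heaviest to lightest; drop an edge
    whenever the graph stays connected without it."""
    kept = sorted(edges, key=lambda e: -e[2])
    i = 0
    while i < len(kept):
        trial = kept[:i] + kept[i + 1:]
        if connected(n, trial):
            kept = trial   # edge lay on a cycle: safe to remove
        else:
            i += 1         # bridge: must stay
    return kept
```

Removing the heaviest edge of a cycle can never hurt, which is the cycle-rule argument used in the next item.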
16) Another variation: for every cycle in T, remove the heaviest edge. Proofs for MSTs go via contradiction: assume the tree is an MST, then show this contradicts the MST definition.
17) Naive recursive Fibonacci takes time proportional to the Fibonacci number itself, i.e. exponential. An alternative is dynamic programming:
- Base cases: M[0] = 0, M[1] = 1; then M[i] = M[i-1] + M[i-2].
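The bottom-up table described above can be sketched as:

```python
def fib(n):
    """Bottom-up DP: O(n) time instead of exponential naive recursion."""
    if n < 2:
        return n          # base cases M[0] = 0, M[1] = 1
    m = [0] * (n + 1)
    m[1] = 1
    for i in range(2, n + 1):
        m[i] = m[i - 1] + m[i - 2]
    return m[n]
```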
18) For some cases, like coins with denominations 1, 7 and 10, you must initialise base cases for the first values, and then use min(1 + M[j-1], 1 + M[j-7], 1 + M[j-10]).
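A sketch of this DP (the denominations (1, 7, 10) follow the tutorial example; the `c <= j` guard stands in for explicit base cases, and feasibility is guaranteed by the 1-coin):

```python
def min_coins(amount, coins=(1, 7, 10)):
    """dp[j] = fewest coins summing to j; dp[j] = min over usable
    coins c of 1 + dp[j - c]."""
    INF = float('inf')
    dp = [0] + [INF] * amount
    for j in range(1, amount + 1):
        dp[j] = min(1 + dp[j - c] for c in coins if c <= j)
    return dp[amount]
```

Note that the greedy choice fails here (e.g. 14 = 7 + 7 beats 10 + 1 + 1 + 1 + 1), which is why the DP is needed.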
19) Dynamic programming: make the table.
20) To test for valid words: compute whether each prefix [1, i] can be split into valid words.
21)
22) In the residual graph, any node reachable from s is on the s-side of the min cut.
23) Vertex capacities can be accounted for as before (split each vertex into an in- and out-node).
24) Effective radius problem: precompute edge capacities; add an edge only if the pair is in range.
25) Flight problem: assign unit capacities; add an extra edge if the next flight is reachable (pre-computed).
26) For an undirected graph, create two antiparallel directed edges; the max flow is k for the capture-the-flag problem. Use Ford-Fulkerson.
27) For even edge capacities, the max flow must be even (the bottleneck must be even; use induction).
28) Every d-regular bipartite graph has a perfect matching because the marriage theorem (Hall's condition) holds.
29) To reduce vertex cover to set cover: U is the edge set of G; for each vertex u, make a set containing the edges incident on u; set t = k. Vertex cover is a special case of set cover.
30) D-interval scheduling: reduce from independent set.
31) Degree at least delta: use base case n = 2, then add dummy nodes to increase the degree.
32) The value of the max flow always equals the capacity of the min cut.
33) Augment is called at most C times: every iteration the flow value must increase by at least 1.