
Analysis of Algorithms: The Non-recursive Case*

Key topics:

* Introduction 
* Generalizing Running Time
* Doing a Timing Analysis
* Big-Oh Notation
* Big-Oh Operations
* Analyzing Some Simple Programs - no subprogram calls
* Worst-Case and Average-Case Analysis
* Analyzing Programs with Non-Recursive Subprogram Calls
* Classes of Problems

Why Analyze Algorithms?
There may be several different ways to solve a particular problem.  For example, there are 
several methods for sorting numbers.  How can you decide which method is the best in a 
certain situation?  How would you define "best" – is it the fastest method or the one that 
takes up the least amount of memory space?

Understanding the relative efficiencies of algorithms designed to do the same task is very important in every area of computing. This is how computer scientists decide which algorithm to use for a particular application. In the 1950's and 60's, many mathematicians and computer scientists developed the field of algorithm analysis. One researcher in particular, Donald Knuth, wrote a three-volume text called The Art of Computer Programming that provided a foundation for the subject. Interestingly enough, Prof. Knuth also used to teach at Stanford until his retirement several years ago. He is now in the process of writing more volumes of his seminal book series, and still occasionally gives talks on campus.

As mentioned earlier, an algorithm can be analyzed in terms of time efficiency or space utilization. We will consider only the former right now. The running time of an algorithm is influenced by several factors:

1) Speed of the machine running the program
2) Language in which the program was written. For example, programs written in assembly language generally run faster than those written in C or C++, which in turn tend to run faster than those written in Java.
3) Efficiency of the compiler that created the program
4) The size of the input: processing 1000 records will take more time than processing 10 records.
5) Organization of the input: if the item we are searching for is at the top of the list, it will take less time to find it than if it is at the bottom.

The first three items in the list are problematic. We don't want to use an exact measurement of running time: to say that a particular algorithm written in Java and running on a Pentium IV takes some number of milliseconds to run tells us nothing about the general time efficiency of the algorithm, because the measurement is specific to a given environment. The measurement will be of no use to someone in a different environment. We need a general metric for the time efficiency of an algorithm, one that is independent of processor or language speeds, or compiler efficiency.

The fourth item in the list is not environment-specific, but it is an important consideration. An algorithm will run slower if it must process more data, but this decrease in speed is not because of the construction of the algorithm. It's simply because there is more work to do. As a result of this consideration, we usually express the running time of an algorithm as a function of the size of the input. Thus, if the input size is n, we express the running time as T(n). This way we take into account the input size but not as a defining element of the algorithm.

Finally, the last item in the list requires us to consider another aspect of the input, which again is not part of the actual algorithm. To account for this, we express timing analyses in terms of "worst case", "average case" or "best case" based on the organization of the data, or the probability of finding an element quickly. For our purposes in the following sections, we will assume a "worst case" organization (i.e., we will not worry about the organization of the input for now).

Generalizing Running Time
The problem of generalizing a timing analysis is handled by not dealing with exact numbers but instead with order of magnitude or rate of growth. In other words:

How does the execution time of an algorithm grow as the input size grows?
Do they grow together? Does one grow more quickly than the other, and if so, how much more? The ideal situation is one where the running time grows very slowly as you add more input. So, rather than deal with exact values, we keep it general by comparing the growth of the running time, as the input grows, to the growth of known functions. The following functions are the ones typically used:

input size
     n       (1)   log n       n    n log n      n^2      n^3       2^n
     5        1      3         5       15         25      125        32
    10        1      4        10       33        100     10^3      10^3
   100        1      7       100      664       10^4     10^6     10^30
  1000        1     10      1000     10^4       10^6     10^9    10^300
 10000        1     13     10000     10^5       10^8    10^12   10^3000

For small inputs, the difference in running times would hardly be noticeable on a fast computer. For inputs of 100,000, however (if we assume one microsecond per instruction), an algorithm comparable to the n log n function will take about 1.7 CPU seconds, an n^2 algorithm will take about 2.8 CPU hours, which is unacceptable, and an n^3 algorithm would take 31.7 CPU years. There is no way we would ever use such an algorithm to deal with large inputs.
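As a rough back-of-the-envelope check on those figures, here is a small C++ sketch (not part of the original handout) that recomputes them under the same assumption of one microsecond per instruction:

#include <cmath>
#include <cstdio>

int main() {
    const double n  = 100000.0;   // input size used in the text
    const double us = 1e-6;       // assumed cost per instruction: one microsecond

    double secondsNLogN = n * std::log2(n) * us;                  // ~1.7 seconds
    double hoursN2      = n * n * us / 3600.0;                    // ~2.8 hours
    double yearsN3      = n * n * n * us / (3600.0 * 24 * 365);   // ~31.7 years

    std::printf("n log n: about %.1f seconds\n", secondsNLogN);
    std::printf("n^2:     about %.1f hours\n",   hoursN2);
    std::printf("n^3:     about %.1f years\n",   yearsN3);
    return 0;
}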

Analyzing Running Time


T(n), the running time of a particular algorithm on input of size n, is taken to be the number of times the instructions in the algorithm are executed. As an illustration, consider a pseudocode algorithm that calculates the mean (average) of a set of n numbers:

1. n = read input from user
2. sum = 0
3. i = 0
4. while i < n
5.     number = read input from user
6.     sum = sum + number
7.     i = i + 1
8. mean = sum / n

Statements 1, 2, and 3 are each executed once. Statements 5, 6, and 7 are each executed
n times. Statement 4 (which controls the loop) is executed n + 1 times (one additional
check is required – why?), and statement 8 is executed once. This is summarized below:

Statement    Number of times executed
    1                 1
    2                 1
    3                 1
    4               n + 1
    5                 n
    6                 n
    7                 n
    8                 1

Thus, the computing time for this algorithm in terms of input size n is: T(n) = 4n + 5. We can see intuitively that as n grows, the value of this expression grows linearly. We say T(n) has an "order of magnitude (or rate of growth) of n". We usually denote this using big-Oh notation: T(n) = O(n), and we say that the algorithm has a complexity of O(n). In some cases, we might also say the algorithm has a "time complexity" of O(n), to distinguish the growth rate for the running time of the algorithm from the amount of memory, or space, that the algorithm would use during its execution. Of course, intuition is not enough for us skeptical computer science types – we must be able to show mathematically which of the standard functions given in the table above indicates the correct rate of growth. We more formally present the meaning of big-Oh notation below.
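For concreteness, here is one possible C++ rendering of the pseudocode above (a sketch only; the statement numbers in the comments refer to the pseudocode, and error checking is omitted):

#include <iostream>

int main() {
    int n, i;
    double number, sum, mean;

    std::cin >> n;              // statement 1
    sum = 0;                    // statement 2
    i = 0;                      // statement 3
    while (i < n) {             // statement 4: tested n + 1 times
        std::cin >> number;     // statement 5
        sum = sum + number;     // statement 6
        i = i + 1;              // statement 7
    }
    mean = sum / n;             // statement 8
    std::cout << "mean = " << mean << std::endl;
    return 0;
}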

Big-Oh Notation
Definition 1: Let f(n) and g(n) be two functions. We write:
f(n) = O(g(n)) or f = O(g)
(read "f of n is big oh of g of n" or "f is big oh of g") if there is a
positive integer C such that f(n) <= C * g(n) for all positive integers n.

The basic idea of big-Oh notation is this: Suppose f and g are both real-valued functions of a real variable x. If, for large values of x, the graph of f lies closer to the horizontal axis than the graph of some multiple of g, then f is of order g, i.e., f(x) = O(g(x)). So, g(x) represents an upper bound on f(x).

Example 1
Suppose f(n) = 5n and g(n) = n. To show that f = O(g), we have to show the existence of a constant C as given in Definition 1. Clearly 5 is such a constant, since f(n) = 5 * g(n). We could choose a larger C such as 6, because the definition states that f(n) must be less than or equal to C * g(n), but we usually try to find the smallest one. Therefore, a constant C exists (we only need one) and f = O(g).

Example 2
In the previous timing analysis, we ended up with T(n) = 4n + 5, and we concluded intuitively that T(n) = O(n) because the running time grows linearly as n grows. Now, however, we can prove it mathematically.

To show that f(n) = 4n + 5 = O(n), we need to produce a constant C such that:

f(n) <= C * n for all n.

If we try C = 4, this doesn't work, because 4n + 5 is not less than or equal to 4n. We need C to be at least 9 to cover all n. If n = 1, C has to be 9; C could be smaller for greater values of n (if n = 100, C could be 5), but since the chosen C must work for all n, we must use 9:

4n + 5 <= 4n + 5n = 9n

Since we have produced a constant C that works for all n, we can conclude:

T(n) = 4n + 5 = O(n).

Example 3
Say f(n) = n^2. We will prove that f(n) ≠ O(n). To do this, we must show that there cannot exist a constant C that satisfies the big-Oh definition. We will prove this by contradiction.

Suppose there is a constant C that works; then, by the definition of big-Oh, n^2 <= C * n for all n. Now suppose n is any positive integer greater than C. Then n * n > C * n, i.e., n^2 > C * n. So there exists a positive integer n such that n^2 > C * n. This contradicts the supposition, so the supposition is false: there is no C that can work for all n, and f(n) ≠ O(n) when f(n) = n^2.

Example 4
Suppose f(n) = n^2 + 3n - 1. We want to show that f(n) = O(n^2).

f(n) = n^2 + 3n - 1
     < n^2 + 3n          (dropping the -1 only makes the value larger)
    <= n^2 + 3n^2        (since n <= n^2 for all integers n >= 1)
     = 4n^2

Therefore, if C = 4, we have shown that f(n) = O(n^2). Notice that all we are doing is finding a simple function that is an upper bound on the original function. Because of this, we could also say that f(n) = O(n^3), since n^3 is an upper bound on n^2. This would be a much weaker description, but it is still valid.

Example 5
Show: f(n) = 2n^7 - 6n^5 + 10n^2 - 5 = O(n^7)

f(n) < 2n^7 + 6n^5 + 10n^2
    <= 2n^7 + 6n^7 + 10n^7
     = 18n^7

Thus, with C = 18, we have shown that f(n) = O(n^7).

You are probably noticing a pattern here. Any polynomial is big-Oh of its term of highest degree, and we ignore constant coefficients. A proof of this is very straightforward. Any polynomial (including a general one) can be manipulated to satisfy the big-Oh definition by doing what we did in the last example: take the absolute value of each coefficient (this can only increase the function); then, since

n^j <= n^d whenever j <= d (for n >= 1),

we can raise the exponent of every term to the highest degree d (the original function must be less than this too). Finally, adding the resulting coefficients together gives the constant C, and C * n^d is an upper bound on the original function.
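Written out symbolically (in LaTeX notation, assuming n >= 1 and a polynomial of degree d), the argument above is just:

\begin{align*}
f(n) &= a_d n^d + a_{d-1} n^{d-1} + \cdots + a_1 n + a_0 \\
     &\le |a_d|\,n^d + |a_{d-1}|\,n^{d-1} + \cdots + |a_1|\,n + |a_0|
        && \text{(take absolute values of the coefficients)} \\
     &\le \bigl(|a_d| + |a_{d-1}| + \cdots + |a_1| + |a_0|\bigr)\,n^d
        && \text{(raise every exponent to } d\text{, valid for } n \ge 1\text{)} \\
     &= C\,n^d, \qquad\text{so } f(n) = O(n^d) \text{ with } C = |a_d| + \cdots + |a_0|.
\end{align*}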

Big-Oh, therefore, is a useful method for characterizing an approximation of the upper-bound running time of an algorithm. By ignoring constants and lower-degree terms, we end up with an approximate upper bound. In many cases, this is sufficient. Sometimes, however, it is insufficient for comparing the running times of two algorithms. For example, if one algorithm runs in O(n) time and another runs in O(n^2) time, you cannot be sure which one is fastest for large n. Presumably the first is, but perhaps the second algorithm was not analyzed very carefully. It is important to remember that big-Oh notation gives no tight information on how good an algorithm is; it just gives an upper bound on how bad it can be.

There is another notation that is sometimes useful in the analysis of algorithms. To specify a lower bound on the growth rate of T(n), we can use big-Omega (Ω) notation. If the lower bound on the running time of an algorithm is g(n), then we say T(n) = Ω(g(n)). As an example, any algorithm with m inputs and n outputs that uses all the inputs to generate the output would require at least Ω(m + n) work. In a rough sense, big-Omega notation tells us that an algorithm requires at least this much time to run; hence it is a lower bound on the running time, or alternatively can be thought of as the best-case running time for the algorithm.

One last variation on this theme is big-Theta (Θ) notation. Big-Θ bounds a function from both above and below, so two constants must be provided rather than one if we are doing formal proofs of the bounds of a given function.

We'll discuss big-Omega and big-Theta notation a bit more below. In the meantime, there is one adjustment that we must make to our definition of big-Oh. You may have noticed that we have been avoiding logarithms in the discussion so far. We cannot avoid them for long, however, because many algorithms have a rate of growth that matches logarithmic functions (a few of these include binary search, MergeSort, and QuickSort). As a brief review, recall that log2 n is the number of times we have to divide n by 2 to get 1; or alternatively, the number of 2's we must multiply together to get n:

n = 2^k  ⇔  log2 n = k

Many "Divide and Conquer" algorithms solve a problem by dividing it into two smaller problems, each of which is divided again into still smaller problems, and so on. You keep dividing until you get to the point where solving the problem is trivial. This repeated division by 2 suggests a logarithmic running time.
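Binary search is the standard illustration of this idea: each comparison discards half of the remaining range, so at most roughly log2(n) + 1 probes are needed. Here is a minimal C++ sketch (not from the handout; it assumes the array is already sorted):

#include <vector>

// Returns the index of x in the sorted vector a, or -1 if x is not present.
// Each iteration halves the search range, so the loop runs O(log n) times.
int binarySearch(const std::vector<int>& a, int x) {
    int low = 0;
    int high = (int)a.size() - 1;
    while (low <= high) {
        int mid = low + (high - low) / 2;   // avoids overflow of low + high
        if (a[mid] == x) return mid;
        if (a[mid] < x)  low = mid + 1;     // discard the left half
        else             high = mid - 1;    // discard the right half
    }
    return -1;
}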

Now consider the constant function f(n) = 1 and g(n) = log(n). Since log(1) = 0, there is no constant C such that 1 <= C * log(n) for all n. Note, however, that for n >= 2, it is the case that 1 <= log(n), and so the constant C = 1 works for sufficiently large n (larger than 1). This suggests that we need a more general definition of big-Oh than the one given previously.
Definition 2: Let f(n) and g(n) be two functions. We write:
f(n) = O(g(n)) or f = O(g)
if there are positive integers C and N such that f(n) <= C * g(n) for all
integers n >= N.

Using this more general definition for big-Oh, we can now say that if we have f(n) = 1, then f(n) = O(log(n)), since C = 1 and N = 2 will work.

With this definition, we can clearly see the difference between the three types of notation:

[Figures A, B, and C: graphs illustrating the big-Theta, big-Oh, and big-Omega bounds, respectively.]

In all three graphs, n0 is the minimal possible value that gives valid bounds, but any greater value will also work.

Figure A shows big-Θ, which bounds a function between two constant factors (i.e., giving a tight lower and upper bound). We can write f(n) = Θ(g(n)) if there exist positive constants n0, c1 and c2 such that to the right of n0, the value of f(n) always lies between c1 * g(n) and c2 * g(n) (inclusive). Thus, in order to prove that a function is Θ(g(n)), we must prove that the function is both O(g(n)) and Ω(g(n)).

Figure B shows big-Oh, which gives an upper bound for a function to within a constant factor. We can write f(n) = O(g(n)) if there exist positive constants n0 and c such that to the right of n0, f(n) always lies on or below c * g(n).

Finally, Figure C shows big-Ω, which gives a lower bound for a function to within a constant factor. We can write f(n) = Ω(g(n)) if there exist positive constants n0 and c such that to the right of n0, f(n) always lies on or above c * g(n).

There is a handy theorem that relates these notations:

Theorem: For any two functions f(n) and g(n), f(n) = Θ(g(n)) if and only if f(n) = O(g(n)) and f(n) = Ω(g(n)).

Example 6
Show: f(n) = 3n^3 + 3n - 1 = Θ(n^3)

As implied by the theorem above, to show this result, we must show two properties:

(i) f(n) = O(n^3)
(ii) f(n) = Ω(n^3)

First, we show (i), using the same techniques we've already seen for big-Oh. We consider N = 1, and thus we only consider n >= 1 to show the big-Oh result.

f(n) = 3n^3 + 3n - 1
     < 3n^3 + 3n + 1
    <= 3n^3 + 3n^3 + 1n^3      (since n <= n^3 and 1 <= n^3 for n >= 1)
     = 7n^3

Thus, with C = 7 and N = 1, we have shown that f(n) = O(n^3).

Next, we show (ii). Here we must provide a lower bound for f(n). We choose a value for N such that the highest-order term in f(n) will always dominate (be greater than) the lower-order terms. We choose N = 2, since for n >= 2, we have n^3 >= 8. This will allow n^3 to be larger than the remainder of the polynomial (3n - 1) for all n >= 2. So, by subtracting an extra n^3 term, we will form a polynomial that will always be less than f(n) for n >= 2.

f(n) = 3n^3 + 3n - 1
     > 3n^3 - n^3      since n^3 > 3n - 1 for any n >= 2
     = 2n^3

Thus, with C = 2 and N = 2, we have shown that f(n) = Ω(n^3), since f(n) is always greater than 2n^3 for n >= 2.

Big-Oh Operations
For much of the remainder of our discussion on computational complexity, we focus on big-Oh bounds, since they are the most commonly used way of characterizing algorithm running times (especially in the worst case). In our use of big-Oh, we are generally looking for a tight bound (analogous to big-Θ, but without having to prove the lower bound (big-Ω) explicitly). Thus, we usually try to find the smallest valid function to characterize the running time of an algorithm. So, while, technically, a function that is O(n) is also O(n^2), we will always characterize such a function as O(n) and would consider it inappropriate to characterize the function as O(n^2). It's just not as useful as a tight bound.
Summation Rule
Suppose T1(n) = O(f1(n)) and T2(n) = O(f2(n)). Further, suppose that f2 grows no faster than f1, i.e., f2(n) = O(f1(n)). Then, we can conclude that T1(n) + T2(n) = O(f1(n)). More generally, the summation rule tells us O(f1(n) + f2(n)) = O(max(f1(n), f2(n))).

Proof:

Suppose that C and C' are constants such that T1(n) <= C * f1(n) and T2(n) <= C' * f2(n). Let D be the larger of C and C'. Then,

T1(n) + T2(n) <= C * f1(n) + C' * f2(n)
              <= D * f1(n) + D * f2(n)
               = D * (f1(n) + f2(n))

so T1(n) + T2(n) = O(f1(n) + f2(n)).
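As a concrete illustration of the summation rule, consider two loops executed one after the other (a sketch, not from the original text). The first phase is O(n) and the second is O(n^2), so the whole fragment is O(max(n, n^2)) = O(n^2):

// Two consecutive phases: O(n) + O(n^2) = O(n^2) by the summation rule.
long work(int n) {
    long sum = 0;
    for (int i = 0; i < n; i++)           // phase 1: n iterations -> O(n)
        sum += i;
    for (int i = 0; i < n; i++)           // phase 2: n * n iterations -> O(n^2)
        for (int j = 0; j < n; j++)
            sum += (long)i * j;
    return sum;
}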

Product Rule
Suppose T1(n) = O(f1(n)) and T2(n) = O(f2(n)). Then, we can conclude that T1(n) * T2(n) = O(f1(n) * f2(n)).

The Product Rule can be proven using a similar strategy as the Summation Rule proof.
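The product rule is what justifies the usual analysis of nested loops (again a sketch, not from the handout): the inner loop does O(n) work each time it is started, and the outer loop starts it n times, giving O(n) * O(n) = O(n^2).

// Nested loops: n outer iterations, each doing O(n) inner work -> O(n^2).
long pairCount(int n) {
    long count = 0;
    for (int i = 0; i < n; i++)        // outer loop: n iterations
        for (int j = 0; j < n; j++)    // inner loop: n iterations per outer pass
            count++;                   // O(1) body
    return count;
}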

Analyzing Some Simple Programs (with No Sub-program Calls)

General Rules:

1) All basic statements (assignments, reads, writes, conditional testing, library calls) run in constant time: O(1).

2) The time to execute a loop is the sum, over all times around the loop, of the time to execute all the statements in the loop, plus the time to evaluate the condition for termination. Evaluation of basic termination conditions is O(1) in each iteration of the loop.

3) The complexity of an algorithm is determined by the complexity of the most frequently executed statements. If one set of statements has a running time of O(n^3) and the rest are O(n), then the complexity of the algorithm is O(n^3). This is a result of the Summation Rule.

Example 7
Compute the big-Oh running time of the following C++ code segment:

for (i = 2; i < n; i++) {
    sum += i;
}

The number of times the conditional test of a for loop is executed is equal to the top index of the loop minus the bottom index, plus one more to account for the final test that terminates the loop. Note: if the for loop terminating condition is i <= n, rather than i < n, then the number of times the conditional test is performed is:

((top_index + 1) – bottom_index) + 1

In this case, we have n - 2 + 1 = n - 1. The assignment in the loop is executed n - 2 times. So, we have (n - 1) + (n - 2) = 2n - 3 instructions executed = O(n).

Complexity problems may ask for the "number of instructions executed", which means you need to provide an equation in terms of n for the precise number of instructions executed. Or, we may just ask for the complexity, in which case you need only provide a big-Oh (or big-Theta) expression.

Example 8
Consider the sorting algorithm shown below.  Find the number of instructions executed 
and the complexity of this algorithm.

1) for (i = 1; i < n; i++) {
2)     SmallPos = i;
3)     Smallest = Array[SmallPos];
4)     for (j = i + 1; j <= n; j++)
5)         if (Array[j] < Smallest) {
6)             SmallPos = j;
7)             Smallest = Array[SmallPos];
           }
8)     Array[SmallPos] = Array[i];
9)     Array[i] = Smallest;
   }

Statement 1 is executed n times (n - 1 + 1); statements 2, 3, 8, and 9 (each representing O(1) time) are executed n - 1 times each, once on each pass through the outer loop. On the first pass through this loop with i = 1, statement 4 is executed n times; statement 5 is executed n - 1 times, and assuming a worst case where the elements of the array are in descending order, statements 6 and 7 (each O(1) time) are executed n - 1 times.

On the second pass through the outer loop with i = 2, statement 4 is executed n - 1 times and statements 5, 6, and 7 are executed n - 2 times, etc. Thus, statement 4 is executed (n) + (n-1) + ... + 2 times, and statements 5, 6, and 7 are executed (n-1) + (n-2) + ... + 2 + 1 times. The first sum is equal to n(n+1)/2 - 1, and the second is equal to n(n-1)/2.

Thus, the total computing time is:

T(n) = (n) + 4(n - 1) + [n(n+1)/2 - 1] + 3[n(n-1)/2]
     = n + 4n - 4 + (n^2 + n)/2 - 1 + (3n^2 - 3n)/2
     = 5n - 5 + (4n^2 - 2n)/2
     = 5n - 5 + 2n^2 - n
     = 2n^2 + 4n - 5
     = O(n^2)
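For reference, here is a compilable, zero-indexed C++ version of the same selection sort (a sketch; the handout's version above uses 1-based indexing to keep the statement counting simple):

#include <vector>

// Selection sort: repeatedly find the smallest remaining element and
// move it into position i. The nested loops give the O(n^2) behavior
// derived above.
void selectionSort(std::vector<int>& a) {
    int n = (int)a.size();
    for (int i = 0; i < n - 1; i++) {
        int smallPos = i;
        int smallest = a[smallPos];
        for (int j = i + 1; j < n; j++) {
            if (a[j] < smallest) {
                smallPos = j;
                smallest = a[smallPos];
            }
        }
        a[smallPos] = a[i];
        a[i] = smallest;
    }
}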

Example 9
The following program segment initializes a two-dimensional array A (which has n rows and n columns) to be an n x n identity matrix – that is, a matrix with 1's on the diagonal and 0's everywhere else. More formally, if A is an n x n identity matrix, then:

A x M = M x A = M, for any n x n matrix M.

What is the complexity of this C++ code?

1) cin >> n;   // Same as: n = GetInteger();
2) for (i = 1; i <= n; i++)
3)     for (j = 1; j <= n; j++)
4)         A[i][j] = 0;
5) for (i = 1; i <= n; i++)
6)     A[i][i] = 1;

A program such as this can be analyzed in parts, and then we can use the summation rule to find a total running time for the entire program. Line 1 takes O(1) time. The instructions in lines 5 and 6 are executed O(n) times. The instructions in lines 3 and 4 are executed n times every time we execute the inner loop, and the outer loop of line 2 is executed n times, yielding a time complexity of O(n^2) because the inner loop is executed O(n) times for each of the n outer iterations. Thus the running time of the segment is O(1) + O(n^2) + O(n). We apply the summation rule to conclude that the running time of the segment is O(n^2).

Worst-Case and Average-Case Analysis
Thus far, we have been analyzing programs considering an input of size n, i.e., the maximum number of input items. This is worst-case analysis; we are looking at the largest number of execution steps required and determining the running time from that. There may be situations where you want to know how long the algorithm runs in the average case, not necessarily the worst case. Average-case analysis is much more involved than worst-case.

What you have to do is determine the probability of requiring a particular number of steps (given a particular input) to solve a given problem. You also have to determine the distributions of the various data values. For example, if you are writing a searching algorithm, how often will the value you are looking for be the 1st, 10th or 1000th element of the array you are searching? Using these probabilities, you can compute an average-case analysis, but it is totally dependent on the assumptions that you have made to set the probabilities. That is always the hardest part of average-case analysis. Because of these complications, worst-case analysis is far more common (but less accurate).

Just to give you a sense of how average-case analysis works, we will look at an example below.

Example 10
Here is a simple linear search algorithm that returns the index location of a value in an 
array.

/* a is the array of size n we are searching through */
i = 0;
while ((i < n) && (x != a[i]))
    i++;
if (i < n)
    location = i;
else
    location = -1;

We will do an average-case analysis with the assumption that an element x is in the array. There are n types of possible inputs when x is known to be in the array. (We'll consider just the execution of the while loop.) If x is the first element of the array, the while loop condition will execute once and fail, so only one line of the while loop is executed. If x is the second element of the array, the while loop condition will execute, as will the increment statement. Then we have to check the while loop condition again, for a total of 3 lines. In general, if x is the ith term of the array, 2i - 1 lines are executed. So, the average number of lines executed equals:

(1 + 3 + 5 + ... + (2n - 1)) / n
= (2 (1 + 2 + 3 + ... + n) - n) / n

We know that 1 + 2 + 3 + ... + n = n (n + 1) / 2, so the average number of lines executed 
is:

 [2 [n (n + 1) / 2] – n] / n 
= n
= O(n)

Notice we are assuming that x is equally likely to be at any position in the array.

Analyzing Programs with Non-Recursive Subprogram Calls
The first step is to analyze each individual function to determine its running time. We begin with the functions that do not call any other functions. Then, we evaluate the running times of the functions that call previously evaluated functions. Suppose we have determined the running time of function P to be O(f(n)). Then any statement anywhere in the program that calls P also has a running time of O(f(n)). We just include this time in our analysis of any section of the program where P is called.

For an assignment or printf with function calls, we determine the running time by using the summation rule to add the running times of all the function calls. If a function call with running time O(f(n)) appears in a condition or for/while loop, its running time is accounted for as described in the following rules.

1) while/repeat: add f(n) to the running time for each iteration. We then multiply that time by the number of iterations. For a while loop, we must add one additional f(n) for the final loop test.
2) for loop: if the function call is in the initialization of a for loop, add f(n) to the total running time of the loop. If the function call is in the termination condition of the for loop, add f(n) for each iteration.
3) if statement: add f(n) to the running time of the statement.

To illustrate these points, consider the following program:

int a, n, x;

int bar(int x, int n) {
    int i;
1)  for (i = 1; i < n; i++)
2)      x = x + i;
3)  return x;
}

int foo(int x, int n) {
    int i;
4)  for (i = 1; i <= n; i++)
5)      x = x + bar(i, n);
6)  return x;
}

void main(void) {
7)  n = GetInteger();
8)  a = 0;
9)  x = foo(a, n);
10) printf("%d", bar(a, n));
}

We begin by analyzing bar, since it does not call any other subprograms. The for loop adds each of the integers 1 through n - 1 to x, so the value returned by bar(x, n) is x + n(n - 1)/2. The running time of bar is analyzed as follows: line 1 iterates n times, line 2 iterates n - 1 times, and line 3 is executed once. Therefore, the running time for bar is O(n) + O(1) = O(n).

Next, we look at foo. Line 5 would normally take O(1) time, but it contains a call to bar, so the time for line 5 is O(1) + O(n) = O(n). The for loop of lines 4 and 5 iterates n times, so we multiply the O(n) for the body of the loop by the n iterations of the loop to get O(n^2), which is the running time for a call to procedure foo.

Finally, we look at main. Lines 7 and 8 each take O(1). The call to foo on line 9 takes O(n^2). The printf of line 10 takes O(1) plus a call to bar, which gives us O(1) + O(n). Thus, the total running time for main is O(1) + O(1) + O(n^2) + O(n) = O(n^2).

Here is the body of a function:

sum = 0;
for (i = 1; i <= f(n); i++)
    sum += i;

where f(n) is a function call. Give a big-Oh upper bound on this function if the running time of f(n) is O(n) and the value of f(n) is n!:

Classes of Problems
Imagine that you have decided to ski across the Sierras over winter break.  Since there are 
no McDonalds between Yosemite and Lee Vining, you will need to bring all your food 
with you in your pack.  On this trip, taste is a secondary consideration.  What matters is 
how much your food weighs and how many calories it contains.  After raiding your 
kitchen, you discover that you have roughly 200 different food items.  Each item has a 
certain weight and contains a certain number of calories:

1.   Snickers Bar     200 calories    100 grams
2.   Diet Coke          1 calorie     200 grams
...
200. Dry Spaghetti    500 calories    450 grams

Given this selection of food, your task is to find the best subset that maximizes the number of calories but falls within the weight limit you can carry in your pack (say 15 kg). One algorithm that definitely produces a solution is to simply try every possible combination. Unfortunately, there are 2^200 such combinations, and the Sierras would erode away before that algorithm terminates.

Is there another algorithm that runs in reasonable (i.e., polynomial) time? If there is, no one has found it yet. This might make you suspect that no polynomial-time solution exists, yet no one has ever been able to prove that either. This problem, known as the Knapsack Problem, is an example of a general class of problems called the NP-complete problems. These are problems for which no polynomial-time solution is known, but it has not been proven that they necessarily require exponential time.

Problems, Problems, Problems...

In studying algorithms, we have seen functions and big-Oh notation used to characterize the running time of algorithms. There are some important terms used to classify problems, based on the running time of the algorithms that solve those problems. First, we review the different categories of functions for big-Oh notation:

Functions that involve n in an exponent or grow even faster (such as 2^n, n! or n^n) are called exponential functions. Functions whose growth is <= n^c for some relatively small constant c are polynomial functions. Linear functions have growth proportional to n. Sub-linear functions have growth proportional to log n. A constant-time function has growth independent of n (do you know of any algorithms with running time of constant order?).

These descriptions of functions are frequently used to characterize different sets of 
problems.  The general categories are:

Intractable/Exponential: These are problems that require exponential time. Moreover, it has been proven that a solution in less than exponential time is impossible. For example, consider the problem of listing all possible committees of size one or more that could be formed from a group of n people. There are 2^n - 1 such committees, so any algorithm that solves this problem must have at least 2^n - 1 steps, and thus a running time proportional to 2^n. This problem cannot possibly have a polynomial solution.

Polynomial: These are problems for which sub-linear, linear or polynomial solutions exist. Insertion sort, quicksort, mergesort and other sorting algorithms are good examples, as is finding the smallest or largest value in a list. Any algorithm whose running time grows proportionally to a polynomial function can solve reasonable-size problems if the exponent is a reasonably small constant.

NP-Complete: No polynomial solution has been found, although exponential solutions exist. However, it has not been proven that a polynomial solution could not be found.

Some famous examples:

a) Generalized Instant Insanity: Given n cubes, where each face of each cube is painted in one of n colors, is it possible to stack the cubes so that each of the n colors appears exactly once on each side of the column?

b) Traveling Salesperson (TSP): Given a map of cities and a cost of travel for each pair of cities, is it possible to visit each city exactly once and return home for less than k dollars?

c) n-bookshelf packing: You are given k books to put on n shelves. Will they fit? (The books are all of different sizes.) This is another way of stating the Knapsack problem.

This group of problems is very interesting for a number of reasons. First of all, the "NP" means "nondeterministic polynomial". Nondeterministic does not mean that we have not yet been able to determine if the problem is polynomial. It means that the algorithm that we might define to solve one of these problems is non-deterministic, i.e., there is an element of chance in the algorithm. For example, we might write a solution as follows for the Traveling Salesperson problem:

    Choose one of the possible paths, and compute its total cost.
    If this cost is no greater than the allowable cost,
        then declare a success
    else declare nothing.

Someone must choose one of the possible paths (this is the nondeterministic part). If we choose the right one, the problem is solved quickly; if not, we learn nothing. We define the running time of an NP algorithm as the time required to execute the algorithm if we make the "right" choice, meaning the choice that would lead to a solution in the optimal amount of time. So we are trying to measure how good the algorithm is, not how good our choices are.

According to this definition, the Traveling Salesperson problem has a polynomial 
solution.  If we make the right choice, we just have to compute the distance for n 
cities which is proportional to n.  

Of course, the class of NP problems includes the truly polynomial problems.  If 
we can design a deterministic (no choices) polynomial solution for a problem, 
then we surely can design a nondeterministic one.  In addition, the Traveling 
Salesperson problem suggests that there might be NP problems that do not have 
polynomial solutions.  In other words, there appear to be problems that can be 
solved in polynomial time by nondeterministic algorithms, but that might require 
exponential time by a deterministic algorithm.  But no one has been able to prove 
this.  In particular, no one has been able to show that there does not exist a 
polynomial solution to the Traveling Salesperson problem.  This is still one of the 
open questions in theoretical computer science (but most computer scientists are 
of the opinion that a polynomial solution does not exist for TSP).

So we have the following problem classification:

[Diagram: problem classes ranging from Polynomial to Exponential to Undecidable, with the position of the NP-Complete problems marked by a question mark.]

Let's look at some important NP-complete problems in more detail to see if we can better understand these problems.

Two Famous Problems

1. Satisfiability: The problem is to determine whether any given logical formula is satisfiable, i.e., whether there exists a way of assigning TRUE and FALSE to its variables so that the result is TRUE.

Given a formula
• composed of variables a, b, c, ... and their logical complements, ~a, ~b, ~c...
• represented as a series of clauses, in which each clause is the logical OR (v) of 
variables and their logical complements
• expressed as the logical AND (^) of the clauses

Is there a way to assign values to the variables so that the value of the formula is TRUE? 
If there exists such an assignment, the formula is said to be satisfiable.

Is: (a) ^ (b v c) ^ (~c v ~a) satisfiable? ________________________________
Is: (a) ^ (b v c) ^ (~c v ~a) ^ (~b) satisfiable? ___________________________ 
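To get a feel for why brute-force checking of satisfiability is exponential, here is a small C++ sketch (not part of the handout) that tries every truth assignment of a formula in the clause form described above; with n variables there are 2^n assignments to test:

#include <cstdlib>
#include <vector>

// A literal encodes a variable or its complement: variable i (0-based) is
// stored as +(i + 1), and ~variable i as -(i + 1). A clause is the OR of its
// literals, and a formula is the AND of its clauses.
typedef std::vector<int> Clause;

bool satisfiable(const std::vector<Clause>& formula, int numVars) {
    // Try all 2^numVars truth assignments; bit i of mask is the value of variable i.
    for (long mask = 0; mask < (1L << numVars); mask++) {
        bool allClausesTrue = true;
        for (size_t c = 0; c < formula.size() && allClausesTrue; c++) {
            bool clauseTrue = false;
            for (size_t k = 0; k < formula[c].size(); k++) {
                int lit = formula[c][k];
                int var = std::abs(lit) - 1;
                bool value = (mask >> var) & 1;
                if ((lit > 0 && value) || (lit < 0 && !value))
                    clauseTrue = true;
            }
            allClausesTrue = clauseTrue;
        }
        if (allClausesTrue) return true;   // found a satisfying assignment
    }
    return false;                          // no assignment makes the formula TRUE
}

With this encoding, the first formula above would be passed as the three clauses {1}, {2, 3}, and {-3, -1}.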

2. Knapsack: The name of the problem refers to packing items into a knapsack. Is there a way to select some of the items to be packed such that their "sum" (the amount of space they take up) exactly equals the knapsack capacity? We can express the problem as a case of adding integers. Given a set of nonnegative integers and a target value, is there a subset of integers whose sum equals the target value?

Formally, given a set {a1, a2, ..., an} and a target sum T, where ai >= 0, is there a selection vector V = [v1, v2, ..., vn], each of whose elements is 0 or 1, such that

     n
     Σ (ai * vi) = T
    i = 1

For example, the set might be {4, 7, 1, 12, 10}. A solution for target sum T = 17 exists when V = [1, 0, 1, 1, 0], since 4 + 1 + 12 = 17. No solution is possible if T = 25.
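A direct implementation of the enumeration idea is easy to write but exponential (a sketch, not part of the handout): it tries every selection vector V, and there are 2^n of them.

#include <vector>

// Returns true if some subset of 'a' sums exactly to the target T.
// Tries all 2^n selection vectors, so the running time is O(n * 2^n).
bool subsetSum(const std::vector<int>& a, int T) {
    int n = (int)a.size();
    for (long mask = 0; mask < (1L << n); mask++) {   // each mask encodes one vector V
        long sum = 0;
        for (int i = 0; i < n; i++)
            if ((mask >> i) & 1)                      // vi == 1: item i is selected
                sum += a[i];
        if (sum == T) return true;
    }
    return false;
}

With the example set above, subsetSum({4, 7, 1, 12, 10}, 17) returns true, while subsetSum({4, 7, 1, 12, 10}, 25) returns false.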

Characteristics of NP-Complete Problems

The preceding examples are reasonable representatives of the class of NP-Complete problems. They have the following characteristics:

• Each problem is solvable, and a relatively simple approach solves it (although the approach may be very time-consuming). For each of them, we can simply enumerate all the possibilities: all ways of assigning the logical values of n variables, all subsets of the set S. If there is a solution, it will appear in the enumeration; if not, we will discover this as well.
• There are 2^n cases to consider if we use the enumeration approach. Each possibility can be tested in a relatively small amount of time, so the time to test all possibilities and answer "yes" or "no" is proportional to 2^n.
• The problems are apparently unrelated, coming from logic, number theory, graph theory, etc.
• If it were possible to guess perfectly, we could solve each problem in a very small amount of time. The verification process could be done in polynomial time, given a good guess.

How Can You Tell If a Problem is NP-Complete?

Suppose you are working on some problem p, and you just can't seem to find a solution that runs in polynomial time. In addition, you have tried to prove that the problem requires exponential time and have failed at that as well. Does that mean that p is an NP-Complete problem? Unfortunately, it's not that easy. It could very well be that there is a simple solution that you haven't found yet, or it could be that the problem is intractable but you just can't see the proof.

To prove a problem NP-complete, we need to introduce the concept of a polynomial transformation of a decision problem. Intuitively, a polynomial transformation of a problem P1 to a problem P2 means that we can rewrite the problem P1 as a different problem P2, and the answer to P1 will be yes if and only if the answer to P2 is yes. If this is true, we say P1 reduces to P2, and write P1 ∝ P2. This definition is rather confusing, so it is sometimes easier to think of transformations in different terms. Informally, if P1 ∝ P2, then we can think of a solution to P1 being obtained from a solution to P2 in polynomial time. The code below gives some idea of what is implied when we say P1 ∝ P2:

Convert_To_P2(p1) = ...   /* Takes an instance of P1 and converts it
                             to an instance of P2 in polynomial time. */
Solve_P2(p2) = ...        /* Solves problem P2. */

Solve_P1(p1) = Solve_P2(Convert_To_P2(p1));

Given the above definition of a transformation, these theorems should not be very
surprising:

If Π1 ∝ Π2 then
    Π2 ∈ P  →  Π1 ∈ P

If Π1 ∝ Π2 then
    Π1 ∉ P  →  Π2 ∉ P

These theorems suggest a technique for proving that a given problem Π is NP-complete.

To prove that Π is NP-complete (having first shown that Π ∈ NP):

1) Find a known NP-complete problem ΠNP.
2) Find a transformation such that ΠNP ∝ Π.
3) Prove that the transformation is polynomial.

If we look at the proof technique outlined above, it is obvious that there must be a "first" NP-complete problem to get this process started. There is such a problem: the Satisfiability Problem.

The Meaning of NP-Completeness

An important researcher named Cook (1971) showed that the satisfiability problem (call it SAT for short) is NP-Complete, meaning it can represent the entire class of NP problems. His important conclusion was that if there is a deterministic, polynomial time algorithm for SAT, then there is a deterministic, polynomial time algorithm for every problem in the NP set, that is, P = NP. Thus, if SAT can be solved in polynomial time, then everything in NP can be solved in polynomial time, and if any problem in NP is intractable, then SAT must be intractable as well. Thus SAT is the "core" of the NP-complete class of problems. In more formal terms, for any problem Π ∈ NP, Π ∝ SAT.

Another researcher named Karp (1972) extended Cook's result by identifying several other problems, all of which share the property that if any one of them could be solved using a deterministic, polynomial time algorithm, then all of them can. The knapsack and clique problems were two of the problems identified by Karp.

The results of Cook and Karp include the converse: if for even one of
these problems (or any NP-Complete problem) it could be shown that
there is no deterministic, polynomial time algorithm to solve it, then
no deterministic, polynomial time algorithm can exist for any of them.

It is important to distinguish between a problem and an instance of a problem. An instance is a specific case: one logical formula, one particular set S of integers, one graph. Certain simple graphs or simple formulas may have solutions that are very easy and fast to identify. A problem, however, is more general. It is the description of all instances of a given type. For example, the formal statement of SAT given above is a statement of a problem, because it tells what each specific instance of the problem must look like. Solving a problem means finding one algorithm that will solve all instances of the problem.

Bibliography

A. Aho, J. Hopcroft, J. D. Ullman, The Design and Analysis of Computer Algorithms, Reading, MA: Addison-Wesley, 1974.

G. Brassard, P. Bratley, Algorithmics: Theory and Practice, Englewood Cliffs, NJ: Prentice Hall, 1988.

T. Cormen, C. Leiserson, R. Rivest, Introduction to Algorithms, New York: McGraw-Hill, 1991.

J. Hartmanis, R. Stearns, "On the Computational Complexity of Algorithms," Transactions of the American Mathematical Society, Vol. 117, 1965.

D. Knuth, Fundamental Algorithms, volume 1 of The Art of Computer Programming, 2nd ed., Menlo Park, CA: Addison-Wesley, 1973.

D. Knuth, Seminumerical Algorithms, volume 2 of The Art of Computer Programming, 2nd ed., Menlo Park, CA: Addison-Wesley, 1981.

D. Knuth, Sorting and Searching, volume 3 of The Art of Computer Programming, Menlo Park, CA: Addison-Wesley, 1973.

U. Manber, Introduction to Algorithms: A Creative Approach, Reading, MA: Addison-Wesley, 1989.

R. Sedgewick, Algorithms, 2nd ed., Reading, MA: Addison-Wesley, 1989.

Historical Notes
The origin of Big-Oh notation is attributed to Paul Bachmann (1837-1920) in his
number theory text written in 1892. The actual O symbol is sometimes called a
Landau symbol after Edmund Landau (1877-1938) who used this notation
throughout his work. The study of the running time of programs and the
computational complexity of problems was pioneered by Hartmanis and Stearns
(see above). Donald Knuth's series of books established the study of running time
of algorithms as an essential ingredient of computer science. The class of
polynomial problems was introduced in 1964 by Alan Cobham ("The Intrinsic
Computational Difficulty of Functions" in Proceedings of the 1964 Congress for
Logic, Methodology, and the Philosophy of Science), and, independently by Jack Edmonds in
1965 ("Paths, Trees and Flowers” in Canadian Journal of Mathematics, 17).

* Except as otherwise noted, the content of this presentation is licensed under the Creative Commons Attribution 2.5 License.