
Simulated Annealing

What is simulated annealing?
Simulated annealing and probabilistic inversion
Examples
The problem: we need to efficiently search a possibly multi-modal function in order to either sample the function or find the maximum-likelihood point of that function.

We draw on an analogy from solid state physics: the annealing process.

Annealing is the process of heating a solid until thermal stresses are released, and then cooling it very slowly to the ambient temperature until perfect crystals emerge. The quality of the result strongly depends on the cooling process. The final state can be interpreted as an energy state (crystalline potential energy), which is lowest if a perfect crystal has emerged.

But where is the connection to inverse problems?

Our goal is to sample a multi-modal function efficiently. We use an analogy between the physical process of annealing and the mathematical problem of obtaining a global minimum of a function.

It may also be turned around so that we try to find the maximum of, for example, a probability density.

Simulated annealing really aims at finding the maximum-likelihood point m_ml, i.e. the point where M(m) attains its maximum.

We first define an energy function:

S(m) = -T_0 \log \frac{M(m)}{M(m_{ml})}

T_0 is a fixed positive number termed the ambient temperature (e.g. T_0 = 1). We obtain the probability density function

M(m, T) = M(m_{ml}) \exp\left(-\frac{S(m)}{T}\right)

Written out:

M(m, T) = M(m_{ml}) \exp\left(\frac{T_0 \log \frac{M(m)}{M(m_{ml})}}{T}\right)

The probability thus defined has interesting properties:

M(m, T \to \infty) = M(m_{ml}) = \text{const}
M(m, T = T_0) = M(m)
M(m, T \to 0) = \text{const} \cdot \delta(m - m_{ml})
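
These properties follow directly once the exponential is rewritten as a power of the original density (a short check using only the definitions above):

M(m, T) = M(m_{ml}) \exp\left(\frac{T_0}{T} \log \frac{M(m)}{M(m_{ml})}\right) = M(m_{ml}) \left(\frac{M(m)}{M(m_{ml})}\right)^{T_0/T}

For T \to \infty the exponent T_0/T tends to zero and the density becomes flat; for T = T_0 the exponent equals one and the original density M(m) is recovered; for T \to 0 the exponent grows without bound, so all probability mass concentrates at the point where M(m)/M(m_{ml}) = 1, i.e. at the maximum-likelihood point m_{ml}.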

The Heat Bath

[Figure: the density M(m) (the "peaks" function used as a pdf), the tempered density M(m, T), and the energy S(m), shown for decreasing temperatures T = 1000, 100, 10, 1, 0.1, 0.01 with T_0 = 1. At high temperature M(m, T) is nearly constant; as the temperature decreases it first recovers M(m) (at T = T_0) and then concentrates around the maximum-likelihood point m_ml.]
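
The behaviour in the figure can be reproduced with a few lines of code. A minimal sketch in Python (assuming, as the figure suggests, that M(m) is MATLAB's "peaks" function shifted to be non-negative so it can serve as an unnormalized pdf, and T_0 = 1 as on the slide):

import numpy as np

def peaks(x, y):
    # MATLAB-style "peaks" function (assumed to be the M(m) of the figure)
    return (3 * (1 - x)**2 * np.exp(-x**2 - (y + 1)**2)
            - 10 * (x / 5 - x**3 - y**5) * np.exp(-x**2 - y**2)
            - 1.0 / 3.0 * np.exp(-(x + 1)**2 - y**2))

x, y = np.meshgrid(np.linspace(-3, 3, 201), np.linspace(-3, 3, 201))
M = peaks(x, y)
M = M - M.min() + 1e-6        # shift so that M >= 0 (usable as an unnormalized pdf)
M_ml = M.max()                # density value at the maximum-likelihood point

T0 = 1.0
for T in [1000.0, 100.0, 10.0, 1.0, 0.1, 0.01]:
    # tempered density M(m, T) = M(m_ml) * (M(m)/M(m_ml))**(T0/T)
    M_T = M_ml * (M / M_ml)**(T0 / T)
    print(f"T = {T:7g}:  max(M_T)/mean(M_T) = {M_T.max() / M_T.mean():.3g}")

At T = 1000 the printed ratio is close to 1 (the density is essentially flat), while at T = 0.01 it is enormous (almost all mass sits at m_ml), which is the behaviour the figure illustrates.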

For a constant prior density this function resembles the Gibbs distribution, giving the probability of a state m with energy S(m) in a statistical system at temperature T.

The procedure would be:

1. Take a high temperature T (heat the system) and generate random models; this means we are effectively sampling the prior distribution.
2. Cool the system slowly while continuing to generate random models until T = 0; you should now be in the global minimum.

The efficiency strongly depends on the cooling procedure. If it is too fast, you may end up in secondary minima; if it is too slow, you will waste a lot of forward calculations.
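
To see why, note that escaping a secondary minimum requires an uphill move, which (as in the Metropolis-type rule on the next slide) is accepted with probability e^{-\Delta S / T}. For an illustrative energy increase of \Delta S = 1 (a number chosen here for illustration, not taken from the slides):

e^{-1/100} \approx 0.99 \quad (T = 100), \qquad e^{-1} \approx 0.37 \quad (T = 1), \qquad e^{-1/0.01} \approx 4 \cdot 10^{-44} \quad (T = 0.01).

At high temperature almost every move is accepted (we sample the prior), while at low temperature uphill moves become essentially impossible and the walk freezes wherever it happens to be.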

Here is a pseudo-code. It is only a slight modification of the Metropolis algorithm:

Simulated annealing:

Define a high temperature T
Define a cooling schedule T(it), e.g. T = alpha * T
Define an energy function S
Define current_model = initial state

While (not converged)
    new_model = random
    Delta_S = S(new_model) - S(current_model)
    If (Delta_S < 0): current_model = new_model
    Else, with probability P = exp(-Delta_S / T): current_model = new_model
    T = alpha * T
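
A minimal runnable sketch of this loop in Python (the energy function, proposal step, starting temperature and cooling factor below are illustrative choices, not taken from the slides):

import numpy as np

rng = np.random.default_rng(0)

def S(m):
    # Himmelblau's function: a standard multi-modal test energy with
    # four global minima, all with S = 0.
    x, y = m
    return (x**2 + y - 11)**2 + (x + y**2 - 7)**2

def simulated_annealing(T=100.0, alpha=0.99, n_iter=5000, step=0.5):
    current = rng.uniform(-5, 5, size=2)                 # initial state
    S_current = S(current)
    for _ in range(n_iter):
        new = current + step * rng.standard_normal(2)    # random new model
        delta_S = S(new) - S_current
        # Metropolis rule: always accept downhill moves, accept uphill
        # moves with probability exp(-delta_S / T)
        if delta_S < 0 or rng.random() < np.exp(-delta_S / T):
            current, S_current = new, S_current + delta_S
        T *= alpha                                       # cooling schedule T = alpha * T
    return current, S_current

model, energy = simulated_annealing()
print(model, energy)   # energy should end up close to 0 (a global minimum)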

Cooling schedules could be:

alpha(T) = a * T, with 0.8 < a < 0.995
alpha(T) = T + b * T, with b close to 0
T(k) = c / log(1 + k), where k is the iteration number and c is a constant
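
A small sketch comparing how these schedules decay (the values a = 0.9, b = -0.05 and c = 100 are hypothetical choices within the ranges above; for the second schedule b must be negative for the temperature to decrease):

import numpy as np

T0, n_iter = 100.0, 50
a, b, c = 0.9, -0.05, 100.0          # illustrative parameters, not from the slides

T_geom, T_add = T0, T0
for k in range(1, n_iter + 1):
    T_geom = a * T_geom               # alpha(T) = a * T
    T_add = T_add + b * T_add         # alpha(T) = T + b * T
    T_log = c / np.log(1 + k)         # T(k) = c / log(1 + k)
    if k % 10 == 0:
        print(f"it {k:3d}:  geometric {T_geom:8.3f}   additive {T_add:8.3f}   logarithmic {T_log:8.3f}")

The first two schedules are geometric (the second is simply T -> (1 + b) T) and cool quickly, while the logarithmic schedule decays much more slowly.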

We will adopt a special approach (Rothman, 1986):

Define a high temperature T
Define a cooling schedule T(it), e.g. T = alpha * T
Define an energy function S and the associated pdf
Define current_model = initial state

While (not converged)
    new_model = random
    calculate P(new_model)
    generate random number x in (0,1)
    accept with the Metropolis rule
    update T
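
A sketch of the acceptance step in this variant (assuming that "accept with the Metropolis rule" means comparing the tempered probabilities of the two models directly):

import numpy as np

rng = np.random.default_rng(1)

def metropolis_accept(P_new, P_current):
    # Accept the new model if it is more probable, otherwise accept it
    # with probability P_new / P_current (Metropolis rule).
    if P_new >= P_current:
        return True
    x = rng.random()                  # random number in (0, 1)
    return x < P_new / P_current

This is the same rule as in the previous pseudo-code, only expressed in terms of probabilities P = M(m, T) instead of energies, since M(new, T) / M(current, T) = exp(-Delta_S / T).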

Example with the peaks function: T0 = 1, Ta = 50, 1000 iterations (ca. 340 accepted model updates).

[Figure: the a posteriori probability in the (x, y) model plane, and the probability of the accepted models plotted against the number of accepted models.]
Summary

Simulated annealing is a mathematical analogy to a cooling physical system which can be used to sample highly nonlinear, multidimensional functions.

There are many flavors around, and the efficiency strongly depends on the particular function to be sampled. It is therefore extremely difficult to make general statements about which parameters work best.
