Professional Documents
Culture Documents
Statistics
N. Bert Loosmore
nhl@u.washington.edu
QERM 550
University of Washington
May 11 & 13, 2005
100
50
0
Easting
Easting(m)
Inference for Point Pattern Spatial Statistics – p.3/49
Point Pattern Statistics in Ecology
Spatial processes Ecological processes
200
150
Northing(m)
Northing
100
Easting
Easting(m)
Inference for Point Pattern Spatial Statistics – p.3/49
Point Pattern Statistics in Ecology
Spatial processes Ecological processes
200
150
Northing(m)
Northing
100
Easting
Easting(m)
Inference for Point Pattern Spatial Statistics – p.3/49
Point Pattern Statistics in Ecology
Spatial processes Ecological processes
200
150
Northing(m)
Northing
100
Easting
Easting(m)
Inference for Point Pattern Spatial Statistics – p.3/49
Point Pattern Spatial Stats: How?
Evaluate observed pattern against ideas of
of sample points as:
F statistic uses the distances ( ) from each of sample
points (typically located on a grid) to their nearest event as:
Under CSR, K statistic can be approximated by
L statistic used to set mean
and (supposedly) stabilize
variance as:
1.0
0.8
0.6
G(t)
0.4
0.2
0.0
distance
Distance
1.0
0.8
0.6
G(t)
0.4
0.2
0.0
distance
Distance
Plot after subtracting
0.3
0.2
rSSI(r=0.03, n=100)
0.1
hat G−bar G
0.0
−0.1
−0.2
−0.3
Distance
Distance
Inference for Point Pattern Spatial Statistics – p.8/49
Perceived Level Performance
Using all results from 19 simulations yields , or
Throwing out upper and lower 2 simulations at each
distance ( ) from 99 simulations also yields
locations (CSR) and ii) random mortality.
But does the simulation envelope comprise a univariate
distribution?
55 patterns comprising
the simulation envelope
0.1
G−G
^
0.0
−0.1
−0.2
−0.3
distance
Distance
1. G, F, and/or K statistics, and
2. spatial patterns
1. At a single a priori distance - use upper and lower
simulated values
for
for
but use
to reduce bias
p-value, calculated as
for . So, if (the largest), then
proper MC methods
Inference for Point Pattern Spatial Statistics – p.22/49
Outline
Use of Point Pattern Statistics in Ecology
How to:
choose limits , distance list
But how accurate is this approximation?
boundary
Remove point from calculation at distance where
Recommended default for , but application
dependent!
Recommended default for , but application
dependent!
, are discrete, change where
Recommended default for , but application
dependent!
,
are discrete, change where
Recommended default for , but application
dependent!
,
are discrete, change where
Recommended default for , but application
dependent!
,
are discrete, change where
Recommended default for , but application
dependent!
,
are discrete, change where
Because of calculation, especially , for exact
Inference for Point Pattern Spatial Statistics – p.28/49
Resolution of Simulated Patterns
Complexity? - Number of distances grows with ,
Resolution (i.e. vs ) of simulated
patterns should be equivalent to that of observed
pattern
Resolution (i.e. vs ) of simulated
patterns should be equivalent to that of observed
pattern
Limiting resolution helps constrain complexity
Resolution (i.e. vs ) of simulated
patterns should be equivalent to that of observed
pattern
Limiting resolution helps constrain complexity
Resolution (i.e. vs ) of simulated
patterns should be equivalent to that of observed
pattern
Limiting resolution helps constrain complexity
provides an exact solution
Number of simulations ( )
Number of simulations ( )
Choice of , including choice of
Uncertainly in realized p-value ( ) results from the use of
MC simulations
actual simulated patterns against which observed
pattern tested, and
number of those patterns
Let and for . The p-value for
the test is then:
Let and for . The p-value for
the test is then:
The expected value of P is:
Assuming Y comes from , then . So,
each of the
Looking at the variance of we have
Looking at the variance of we have
Hence we can model the theoretical distribution of as
from a binomial(p,s) distribution.
Inference for Point Pattern Spatial Statistics – p.33/49
Managing Uncertainty in
Rem that binomial quickly converges to Normal
Rem that binomial quickly converges to Normal
Create 95% CI on (true p-value) near as
Rem that binomial quickly converges to Normal
Create 95% CI on (true p-value) near as
95% of CI created this way should contain the true
value of , and so set decision rule: e.g. reject if
Rem that binomial quickly converges to Normal
Create 95% CI on (true p-value) near as
95% of CI created this way should contain the true
value of , and so set decision rule: e.g. reject if
Inference for Point Pattern Spatial Statistics – p.34/49
Managing Uncertainty in
Rem that binomial quickly converges to Normal
Create 95% CI on (true p-value) near as
95% of CI created this way should contain the true
value of , and so set decision rule: e.g. reject if
example if is ok, use
Inference for Point Pattern Spatial Statistics – p.34/49
Managing Uncertainty in
Rem that binomial quickly converges to Normal
Create 95% CI on (true p-value) near as
95% of CI created this way should contain the true
value of , and so set decision rule: e.g. reject if
example if is ok, use
Use relationship between and to find value of
0.07
0.06
0.05
0.04
σp
0.03
0.02
0.01
Used Poisson Clustered model, with represents the
number of parents and represents the expected
number of children per parent, and where clustering of
‘children’ around each parent are described as
Used Poisson Clustered model, with represents the
number of parents and represents the expected
number of children per parent, and where clustering of
‘children’ around each parent are described as
Used Poisson Clustered model, with represents the
number of parents and represents the expected
number of children per parent, and where clustering of
‘children’ around each parent are described as
model where
0.4
0.4
0.3
0.3
σ σ
0.2
0.2
0.1
0.1
0 20 40 60 80 100 0 20 40 60 80 100
ρ ρ
Inference for Point Pattern Spatial Statistics – p.38/49
Example of model fitting - 3
Inference? For the observed data, if this model fits, then
larger suggests lower (i.e. few parents) and so more
children/parent.
should be chosen before the test, and based on
should be chosen before the test, and based on
0.05
0.00
K(t)
−0.05
−0.10
Distance
distance
Inference for Point Pattern Spatial Statistics – p.40/49
Outline
Use of Point Pattern Statistics in Ecology
intensities (
)
Results within LRT boundaries
a) Type I error rates for G b) Type I error rates for K
0.15
0.15
0.10
0.10
^
α ^
α
0.05
0.05
0.00
0.00
λ λ
# points ( ) # points ( )
Inference for Point Pattern Spatial Statistics – p.43/49
Type II Error Rate (1-Power)
Type II error rate is the prob of accepting given that
is really true.
is really true.
Requires definition of .
is really true.
Requires definition of .
Power will be a function of ‘how far’ is from .
(‘Easy’ to think of this distance when using Normal
distribution, but more difficult to conceptualize here.)
is really true.
Requires definition of .
Power will be a function of ‘how far’ is from .
(‘Easy’ to think of this distance when using Normal
distribution, but more difficult to conceptualize here.)
Often overlooked for spatial point process analysis, but
can be simulated.
Type II error rate tells us the ability to distinguish the
pattern from CSR.
As increases, larger clusters are more like CSR.
a)ρ=20 b)ρ=40
1.0
1.0
0.8
0.8
0.6
0.6
Power
Power
0.4
0.4
0.2
0.2
0.0
0.0
σ σ
0.3
0.2
0.1
G−G
^
0.0
−0.1
−0.2
rSSI(r=0.02)
rSSI(r=0.03)
−0.3
distance
Distance
For my calculations,
For ,
(Diggle, 2003)
other?
previous knowledge
previous knowledge
Evaluate the Power of your test
previous knowledge
Evaluate the Power of your test
R software availability:
http://students.washington.edu/nhl/masters.html