Professional Documents
Culture Documents
510
Determine Neighbor
Index (Local Density of
Data Distribution)
575 520
Determine Kernel
Determine KNN
Representative Index
Attraction (Form natural
small groups) (Determine Optimal
Cluster Number)
530
Perform Merging to
Form Final Clusters
US 6,295,504 B1
Page 2
Nphi
-0.1 0 0.1 0.2 0.3 0.4 0.5
1.7
Q
0
1.9
2.1 a
2.3
RhoB
2.5
2.7
2.9 Electrofacies1
I Electrofacies2
3.1 I
Fig. 3
' 5lO
Flg- 4 Determine Neighbor
Index (Local Density of
Data Distribution)
575 520
Determine Kernel
Determine KNN
Representative Index
Attraction (Form natural
small groups) (Determine Optimal
Cluster Number)
550
Perform Merging to
Form Final Clusters
U.S. Patent Sep. 25, 2001 Sheet 3 0f 7 US 6,295,504 B1
670/
@
Initialize NxN array A to
contain indexes of N-1 nearest
neighbors for all points.
i
Sort vector A[i] for each point Xi,
620/ i=1 to N, in the data set.
Results of sorted index order of
vector A, j, stored in NxN array B.
l
Calculate value of m of point Xi
and of its jth neighbor y,
Determine on(x)=exp(-mlot)
640/ where on is user de?ned
parameter
l
N-1
650/
Determine s(x)= Z 6n(x)
Have all
points been
processed
?
660
\
Determine
Smin= Min{s(xi)}
i=1,N
, l
Have all
Calculate NlF l(x): Determine
points been
processed I(X) : :(X) ' 5min =
max- 3 mm
Smax= Max{s(xi)}
|=1,N
680) 670)
Fig.
U.S. Patent Sep. 25,2001 Sheet 4 0f 7 US 6,295,504 B1
Fig. 6 @
For each nearest neighbor q of a point p, determine
attraction function Attrp(q)=l(q)Vp(q), where Vp(q)=1, \ 720
if q is in the KNN of p. Vp(q)=O othen/vise.
l
Determine Attraction of point p by
Attfp=(I7\/1?/X(Attfp(q))
p
\ 730
Have all
points in the
dataset been
processed
?
740 K750
P is free attractor,
Is p not attracted Yes and kernel of a
by any other point? mode, local
maximum of PDF
/ 770
P is a related
2:83 p fatttrfact aphy Yes attractor. P is
o wioFrom fem 7 e
e data set.
a ofslope point
a mode
Have we
unsetlect a.n d categorized each Yes F
.68 egonze point in total num. row.
point from total of data points N as attractlon
Set of. po?ts of boundary or mode Sets
3'29 ' point?
U.S. Patent Sep. 25,2001 Sheet 5 0f 7 US 6,295,504 B1
Fig. 7 Cg
For each point x in the set of all points of size N, determine
y, ?rst neighbor of x, satisfying l(y)>l(x). l(x) and l(y) are \8 7 O
the Neighbor Indexes of point x and y respectively.
l
Determine Kernel Representative Index F(x):
F(X)=Ia(><)*Mb(X,y)*D(X,y) \820
M(x,y)=m in which y is the mth neighbor of x, and
D(x,y) is the distance between x and y.
0.8 -
0.7 -
0.6 -
KRI
5 Clusters
0.5
0.4
8 Clusters
0.3 -
12 Clusters
0.2 '
13 5 7 911131517192123252729
Kernels in decreasing order of their KRI
U.S. Patent Sep. 25,2001 Sheet 6 0f 7 US 6,295,504 B1
Have all
possible pairs
g) [7007
Select a pair of attraction sets S1 and S2 among
of attraction
sets been all attraction sets determined previously.
processed l
'7
Select a pair of points p1 and p2 \ 7 005
' belonging to S1 and 82 respectively.
707O
No Is p1 a
boundary point
in S1?
Yes
7075
No Is p2 a
boundary point
in S2?
Yes
7 020
ls p1 in the
KNN of p2? Yes
No
7 025
No ls p2 in the
KNN of p1?
Yes
A l 7030
Is the distance
Yes /\ No between p1 and p2,
Have all pairs No D(p2,p1), among all Yes
of points been pairs of points,
processed? satisfying all the
above conditions,
minimum?
Save the values
(p1,p2,L) where
- |-=Min(|(P1),l(P2))
Flg. 7032f intoa list.
U.S. Patent Sep. 25,2001 Sheet 7 0f 7 US 6,295,504 B1
Q)
Sort the list in decreasing \ 7035
order with respect to L.
i
In decreasing order of L, for all pair of
points (p1, p2) of the list determine:
7035
Does the set S1
corresponding to
point p1 contain a
previously selected
cluster kernel?
1040
No
7
7050
Have all (p1,p2)
in the list been
processed?
Fig. 9B
US 6,295,504 B1
1 2
MULTI-RESOLUTION GRAPH-BASED injected near the drilling tool. It is not normally cost
CLUSTERING effective to identify facies using these methods. Information
on geological formations traversed by a bore hole is more
This application claims bene?t to Us. Provisional commonly gathered by a measurement sonde passing
60/161,335 ?led Oct. 25, 1999. through the bore hole. The gathered information as a func
tion of the sondes position along the bore hole is then stored
BACKGROUND OF THE INVENTION or logged.
Many doWnhole measurement techniques have been used
1. Field of the Invention in the past, including passive measurements such as mea
The present invention generally relates to the geological suring the natural emission of gamma rays; and active
study of earth formations for the location and exploitation of measurements such as emitting some form of energy into the
mineral deposits using electrofacies analysis. More formation and measuring the response. Common active
particularly, the present invention relates to a neW system measurements include using acoustic Waves, electromag
and method for identifying formations of mineral deposits netic Waves, electric currents, and nuclear particles. The
using a user-friendly and reliable clustering technique that 15
sonde measurements are designed to re?ect the distinguish
can extract natural clusters from sets of logged data points ing characteristics of the rock facies. Multiple logs and
for improved electrofacies analysis of the formation. sondes may be used to gather the measurements, Which are
then correlated and standardiZed so as to furnish measure
2. Description of the Related Art
ments at discrete levels separated by equal depth intervals.
Mineral and hydrocarbon prospecting is based upon the The measurement standardiZation alloWs the automation of
geological study and observation of formations of the earths data interpretation in order to obtain estimates of the poros
crust. Correlations have long been established betWeen ity of the rocks encountered, the pore volume occupied by
geological phenomena and the formation of mineral and hydrocarbons, and the ease of How of hydrocarbons out of
hydrocarbon deposits that are suf?ciently dense to make the reservoirs in the case of petroleum prospecting. The set
their exploitation economically pro?table. of measured formation characteristics values that distinguish
25
The study of rock and soil facies encountered While the strata in a given bore hole is herein termed the electro
prospecting for minerals takes on particular importance. As facies.
used herein, a facies is an assemblage of characteristics that Interpretation studies have demonstrated a strong corre
distinguish a rock or strati?ed body from others. A facies lation betWeen the electrofacies and lithofacies, thereby
results from the physical, chemical and biological conditions making it possible to identify With con?dence the compo
involved in the formation of a rock With respect to other sitional characteristics of the rocks traversed by bore holes
rocks or soil. This set of characteristics provides information based on the sonde measurements. It has been established
on the origin of the deposits, their distribution channels and that the sets of log measurements (i.e. sample points) Which
the environment Within Which they Were produced. For correspond to a given lithofacies form a cluster in data
example, sedimentary deposits can be classi?ed according to space. That is, When the measured characteristic values of
35
their location (continental, shoreline or marine), according a formation are graphed, the points generally fall into a
to their origin (?uviatile, lacustrine, eolian) and according to continuous region Which is distinguishable from the regions
the environment Within Which they occurred (estuaries, Where points for other formations Would fall.
deltas, marshes, etc.). This information in turn makes it Various systems and method that use the correlation or the
possible to detect, for example, Zones in Which the prob observed clusters to identify lithofacies from electrofacies
ability of hydrocarbon accumulation is high. have been created. These systems take the logged measure
The set of characteristics used to de?ne a facies depends ments and convert them to a graph that furnishes, as a
on the situation. For example, a lithofacies may be de?ned function of position along the bore hole, an image of a
by the rocks petrographic and petrophysical characteristics. succession of facies. The graph typically also provides some
These are the composition, texture and structure of the rock. 45 indication of the measured formation characteristic values
Examples of mineral composition are silicate, carbonate, alongside the image. An example of one such system and its
evaporite, and so on. A rocks texture is determined by its output is described in US. patent application No. 4646240,
grain siZe, sorting, morphology, degree of compaction, and Which is hereby incorporated herein by reference. HoWever,
degree of cementation. The rock structure includes the before these systems can do the conversion, they must be
thickness of beds, their alternation, presence of stones, tailored to the drilling region.
lenses, fractures, degree of parallelism of laminations, etc. The most accurate existing systems and methods require
All of these parameters are related to the macroscopic a substantial amount of user participation to set up, and
appearance of the rock. conversely, those existing systems Which are highly auto
For extraction of hydrocarbons from geologic formations, mated tend to perform poorly. One proven approach uses a
the particularly desirable characteristics of the lithofacies are 55 tWo-step methodology to correlate different measured char
the porosity of the reservoir rocks and their permeability, as acteristic values into generaliZed electrofacies charts for
Well as the fraction of the pore volume occupied by these analysis. In the ?rst step, the number of clusters is speci?ed
hydrocarbons. These aid in estimating the nature, quantity, to an automatic clustering algorithm such as maximum
and producibility of the hydrocarbons contained in such likelihood algorithm, hierarchical clustering method,
strata. dynamic clustering or neural netWork. The number of clus
There are various sources of information on formation ters speci?ed is large, creating clusters containing small
lithofacies. Information may be gathered from subsurface numbers of points. Apetrophysicist or geologist then manu
observations such as, for example, by the study of core ally assigns geological characteristics from the facies to each
samples taken from rock formations during the drilling of a cluster and simultaneously merges similar small clusters into
bore hole for an oil Well. Such information can also be 65 electrofacies.
provided by drill cuttings sent up to the surface from the Another approach for creation of electrofacies charts
bottom of a Well by means of a ?uid (generally drilling mud) requires that the number of clusters speci?ed to the auto
US 6,295,504 B1
3 4
matic clustering algorithm be relatively small. In this BRIEF DESCRIPTION OF THE DRAWINGS
approach, the geologist often has a problem assigning spe
ci?c geological facies to the clusters, Which tend to be much For a more detailed description of the preferred embodi
larger than the clusters in the previous approach. The geolo ments of the present invention, reference Will noW be made
gist may also be required to lump together geological to the accompanying draWings, Wherein:
facies at a coarser level of distinction than might be appro FIG. 1 illustrates logging equipment in operation in a bore
priate. A large number of clusters require much Work by the hole;
geologist to match clusters to geology; too feW clusters FIG. 2 illustrates a computer system used to process the
cause the geologist problems in making meaningful linkages logged data and determine mineral compositions of the earth
betWeen clusters and geology. 10 formation;
The electrofacies analysis systems described above suffer FIG. 3 illustrates the scattering of points representative of
from various limitations and draWbacks. The automatic the values of tWo characteristic parameters of the formations
clustering methods require the user to provide an initial measured Within a given depth interval;
number of clusters before processing. This is a limitation FIG. 4 is a How diagram shoWing the steps for Multi
because the results are very sensitive to this parameter. 15
Resolution Graph Based automatic clustering;
Furthermore, unless the number is large, the identi?ed
clusters may have shapes that are not geologically mean FIG. 5 is a How diagram shoWing the steps to determine
ingful. This prevents them from being directly used for Neighboring IndeX Function;
facies analysis. On the other hand, manual merging of a FIG. 6 is a How diagram shoWing the steps to determine
large number of small clusters based on similar geological K-Nearest-Neighbor Attraction;
characteristics by hand makes this process sloW and subjec FIG. 7 is a How diagram shoWing the steps for determin
tive. Furthermore, because electrofacies analysis occurs in ing the Kernel Representative IndeX;
N-dimensional space it is still dif?cult even for a trained FIG. 8 is a curve of the Kernel Representative IndeX in
individual With good visualiZation tools to identify clusters decreasing order Which may be used to determine the
manually. Thus, it is desirable to develop a system and 25 optimal number of clusters at different resolutions; and
method that, in a relatively constant, reliable, and systematic FIGS. 9A and 9B shoW the steps for performing merging
manner, permits automatic clustering of logged data to to form ?nal clusters.
eXtract information about the geological facies of the data.
While the invention is susceptible to various modi?ca
SUMMARY OF THE INVENTION tions and alternative forms, speci?c embodiments thereof
Accordingly, there is disclosed herein a method for iden are shoWn by Way of eXample in the draWings and Will
ti?ng formations of mineral deposits. In one embodiment of herein be described in detail. It should be understood,
this method, logs are made over multiple levels Within an hoWever, that the draWings and detailed description thereto
interval along the bore hole in order to obtain a group of are not intended to limit the invention to the particular form
several measurements for each of these levels. With each 35
disclosed, but on the contrary, the intention is to cover all
such level of the bore hole interval is associated a sample modi?cations, equivalents and alternatives falling Within the
point Within a multidimensional space de?ned by the dif spirit and scope of the present invention as de?ned by the
ferent logs. The sample point coordinates are a function of appended claims.
the logging values measured at this level. The sample points
DETAILED DESCRIPTION OF PREFERRED
thus obtained Will form a scatter diagram Within this mul
EMBODIMENT
tidimensional space.
The sample points of this scatter diagram are used to FIG. 1 shoWs logging equipment in a bore hole 10 going
determine a set of characteristic modes, each corresponding through earth formations 12. The equipment includes a
to a Zone of maXimum density in the distribution of these sonde 16 is suspended in the bore hole 10 at the end of a
samples; each mode is regarded as a characteristic of a 45 cable 18. Cable 18 connects sonde 16 both mechanically and
respective cluster and the surrounding samples of this cluster electrically, by means of a pulley 19 on the surface, to a
are related to it. Afacies is designated for each of the modes control installation 20 equipped With a Winch 21 around
thus characteriZed and a graphic representation is produced Which the cable 18 is Wound. The control installation
as a function of the depth of the succession of facies thus includes recording and processing equipment knoWn in the
obtained. The characteristic modes of each cluster are made art that make it possible to produce graphic representations
up of sample points coming from the measurements them called logs of the measurements obtained by the sonde 16
selves. according to the depth of the sonde in the bore hole or Well
To identify the clusters, a neighboring indeX of each log 10.
measurement point in the data set is calculated. NeXt, small The bore hole 10 passes through a series of earth forma
natural groups of points (called attraction sets) are formed 55 tions (not speci?cally shoWn) that is typically composed of
based on the use of the neighboring indeX to determine the a series of Zones or beds. The Zones are identi?ed by the
K-Nearest-Neighbor (KNN) attraction for each point. Inde rock facies they contain, e.g. clay, limestone, etc. From the
pendently of the natural group formation, the optimal num geological vieWpoint, each of these successive Zones is
ber of clusters is calculated based on the Kernel Represen characteriZed by a relative homogeneity that is revealed by
tative IndeX (KRI) and a user-speci?ed resolution. Lastly, a set of characteristic data values (facies). These values vary
based on the data calculated from the prior steps, ?nal from one Zone to another, but have a relatively limited range
clusters are formed by merging the attraction sets. of variation Within a given Zone. These data, Which depend
Experimentation con?rms that the above method alloWs in particular on the mineralogical composition, the teXture
an accurate determination of the geological facies derived and the structure of the rocks making up these Zones,
from the logging measurements obtained Within an interval 65 identify respective facies.
of geological formations traversed by a sonde traveling in a It is possible to establish a correspondence betWeen, on
bore hole. the one hand, different facies characteriZed by mineralogical
US 6,295,504 B1
5 6
factors, texture and structure and, on the other hand, elec tools, among others. FIG. 3 demonstrates in tWo dimensions
trofacies Which can be obtained directly from a suitable the results of the automatic clustering technique described
quantitative analysis of a set of logs measured by the sonde beloW. Although only 2 dimensions are shoWn, the technique
as it traverses the bore hole. The possibility of establishing is generally applied to N-dimensions for N log measurement
such a correspondence betWeen electrofacies and facies is properties (4-dimensions Were used to obtain the clusters
capable of providing a valuable aid in the geological knoWl shoWn in FIG. 3).
edge of a Zone of the earths crust Within a given region,
Once a set of measurement points has been obtained, it is
such knoWledge being useful in completing the information
usually available to geologists and, in certain cases, helping desired to partition the points. Initially, this partitioning is
them in the interpretation of the facies encountered to obtain achieved by gathering the points into clusters using a cluster
information on the history of the formations and for deter
10 identi?cation algorithm. Many such algorithms exist, and
mining the concentrations of mineral deposits. they almost universally require that the data be normaliZed.
An approximate image of the facies may be produced by There are several Ways to normaliZe the data. One clas
?rst obtaining a number of logs over a bore hole interval sical method, frequently used, is to limit the data in an unit
H1H2. The measured values are discretiZed and correlated hypercube [0,1]i, d being the number of features, Which
15 corresponds to the number of dimensions. In each
in depth so as to have, for each level of the interval
considered, a set of distinct log values. In one embodiment,
dimension, that dimensions minimal value is subtracted
the measurements are discretiZed into levels at 15 cm from the data, and the difference is divided by the total range
intervals. The set of log values thus obtained is analyZed to of the data in that dimension. In another method, the average
determine groups of consecutive levels that have log values value of each feature is subtracted from the data, and the
Within a given range. The upper and loWer values of the difference divided by the standard deviation. The normal
range are based on the potential variations that may be iZation changes the distance betWeen data points and affect
caused by, eg bore hole conditions such as roughness or the natural separation of data points, but it is necessary to
caving of the bore hole Walls. Those consecutive levels prevent an improper choice of scale in one dimension from
having log values in the given range may be considered to dominating the measurements in other dimensions.
25
share a true physical characteristic value that is substan The clustering algorithms can be divided into parametric
tially constant over those levels. In this manner, those Zones algorithms, and non-parametric algorithms. Parametric algo
having relatively constant values may be portrayed as facies rithms are generally regarded as being less desirable than
having the indicated measurement values. non-parametric algorithms because parametric algorithms
To generate an image of the facies using the logged data are based on some model of the data, Whereas non
points, processing of the logged data measured by the sonde parametric algorithms make no assumptions about the data
as it traverses the formation may be handled at the Well site pattern. One consequent advantage of non-parametric algo
by a computer system such as that shoWn in FIG. 2. The rithms is that they are capable of recogniZing clusters of
computer system consists of a keyboard 308 and a monitor varied shapes.
306 to permit user interaction With the computer toWer 302 35
One example of a non-parametric approach is to divide
containing the CPU and peripheral hardWare. Logs gathered the observations space (eg the graph of FIG. 3) into regular
at the Well site may be graphically displayed on the monitor hypercubes of a ?xed siZe and estimating the probability
306 and clustering (described beloW) may be performed to density function (PDF) of the data based on the number of
determine the petrophysical characteristics of the formation. measurement points in each hypercube. One draWback of
Alternatively in a second embodiment, because of cost, this approach is that the number of hypercubes increases
space, poWer, and transportation restrictions, one may prefer exponentially as the number of data dimensions.
storage of the logged data gathered by the sonde in a storage Furthermore, this approach encounters critical dif?culties
medium such as a 3.5 diskette, tape or recordable compact When the data includes clusters that are closely spaced
disc 310. Processing of the data for determination of for and/or clusters of very different densities or siZes.
mation characteristics may then occur at the of?ce or labo 45 Another example of a non-parametric approach is the
ratory using more poWerful CPUs at a substantially loWer K-Nearest-Neighbor (KNN) approach. Rather than estimat
cost than at the Well site. ing the PDF by determining hoW many points are in a given
As shoWn in FIG. 3, if one analyZes a scattering of points data volume (eg a hypercube), the PDF is estimated by
representative of the logs carried out on a succession of measuring the data volume occupied by a given number of
levels in a bore hole interval, it is noted that the distribution points. The K-Nearest-Neighbor approach estimates the
density of these points in the scatter varies. This ?gure is the PDF around a point by determining the radius from the point
result of Multi-Resolution Graph-based Clustering (MRGC) to its Kth nearest neighbor.
using four logs (Nphi, RhoB, GR and DT), although only the Another example of a non-parametric approach is
tWo log measurements RhoB and Nphi are shoWn in the tWo described by C. T. Zahn in Graph theoretical methods for
dimensional graph. RhoB corresponds to a density measure 55 detecting and describing Gestalt Clusters, IEEE Trans. On
ment log and Nphi is a porosity measurement log measured Computers, v. C-20, n. 1, pp. 6886, 1971, incorporated
by a Compensated Neutron Log (CNL) tool. Examples of herein by reference. This approach relies on graph theory. A
other log measurement properties include natural gamma connected graph is constructed by linking the data points xi
radiation measurements (GR), temperature measurement by arcs according to their proximity relationship.
(HRT), inverse of square root of resistivity measurement In a graph representation of a data set, each observation
near the Wall of the Well in the invaded Zone (HRXO), xi is represented by a node, and an arc is established betWeen
acoustic Wave transit time measurement (DT), measurement tWo distinct nodes if they are linked by a relationship de?ned
of resistivity of formation far from Well or bore hole (RT), in the data set S. The graph is denoted by G=(S, A), Where
and measurement of resistivity near bore hole Wall (RXO). A is the set of arcs. Various graph structures have been used
These measurements may be made by logging tools such as 65 for clustering. Four particular structures are the Minimum
resistivity tools, induction tools, nuclear magnetic resonance Spanning Tree (MST), Relative Neighborhood Graph
tools, thermal neutron decay tools, and gamma radiation (RNG), Gabriel Graph (GG), and Delaunay Triangulation
US 6,295,504 B1
7 8
Graph (DTG). The MST is the graph that connects every then it is b thereafter. In practice, b is set equal to b=K+1,
node together With the shortest overall arc length. The RNG, thereby providing all points outside the neighborhood of
GG, and DTG are graphs With nodes coupled based on interest an equal ranking.
increasingly relaxed distance requirements. For a given set The limited rank On(x) is de?ned above for n=1, 2, . . . ,
of measurement points, these graphs are related in the K, ie for xs K nearest neighbors. The sum of the limited
folloWing manner: ranks for each point x is expressed as: