Professional Documents
Culture Documents
190;~
"2o S e n i ~ , s t r ~
\['Ct\'l)
N.
('I\IFVF()
('i~,t,i,.~
~, l, eh'a.~ - I , i c e r . ~ i d c t d e
dc ,~,'~io t ' a , i o
('4~xrt.:x'r.. 1. lntroductiov_.
2. Chance and Statistical Law. - 3. The
~'on~-epi of Probabilily.
4. The Irregular Collectives of von Mises. 5. The Modern Axiomati(. School. 6. On the General Statistical Problem
ax It 3,ri,~es in Statistical Mechanics.
7. A Further Example of the G-eneral Statistical Problem. The Deduction of a Probabilistie Law from
a I)eterminate lmw. - 8. The General Applicability of the Calculus of
Probability. - 9. A Criterion for Randomness.
19. Further Illustration
in Term~ of Random Pha,aes. -- 11. ('onelusion.
I.
Introduction.
th(, q u e s t i o n of w h e t h e r t h e t h e o r y of p r o b a b i l i t y r e a l l y r e p r e s e n t s t h e m o s t
a d e q u a t e possible m a t h e m a t i c a l i n s t r u m e n t in all of those e~ses to which it
is c u s t o m a r i l y applied. I n d e e d , as we shall see in this p a p e r , t h e r e is a wide
ran~'e of s~atisiieal a p p l i c a t i o n s in physics, in which a more reliable a n d a clearer
t r e a t m e n t can he o b t a i n e d b y using' m e t h o d s o t h e r t h a n those of the t h e o r y
T H E G E N E R A L S T A T I S T I C A L P R O B L E M I N P H Y S I C S A N D T H E T H E O R Y OF P R O B A B I L I T Y
1005
of probability. I n the study of such applications, one sees t h a t w h a t is relevant is not so much the theory of probability, b u t r a t h e r w h a t we cull (~the
general statistical problem ~>, of which the problem of probability is only a
special case.
The general statistical problem m a y be stated thus: To find those relationships which hold in a statistical aggregate independently of a wide range
of variations of the detailed properties of the individual members of the aggregate, the size of this range of variation depending on the physical conditions
and the context of interest.
I t is evident t h a t if relationships of the t y p e described above hold, then
w h a t e v e r m a y be the character of these relationships, we shall be able to make
statistical predictions concerning the aggregate without having to t r e a t the
problem of just what the individual members are doing. I n those cases to which
the t h e o r y of probability applies, such statistical predictions can be made with
the aid of the well k n o w n probability calculus; b u t as we shall show in this
paper, there are a n u m b e r of r a t h e r simple problems, in which it is highly
advantageous to go outside the scheme of expressing the statistical relationship
in terms of nothing b u t the concept of probability. These examples will also
provide further insight into the meaning of the concept of probability; for
t h e y enable us to see this concept inside of the broader setting of statistical
laws in general.
After formulating the general statistical problem with the aid of certain
examples drawn from the field of statistical mechanics, we shall focus our
a t t e n t i o n on a few cases to which the theory of probability applies. Here,
we shall discuss the problem of how probabilities are related to determinate
laws, and even to a more general class of laws lying between the determinate
and the probabilistic types. Such a process of relating probabilistic laws to
other types of laws will also enable us to see more clearly w h a t is the n a t u r e
of randomness. Moreover, as we shall see, studies of this kind lead us into
new kinds of problems t h a t cannot even be f o r m u l a t e d in terms of the t h e o r y
of probability itself, such as t h a t of estimating in specific cases the error and
d o m a i n of applicability of various aspects of the t h e o r y of probability.
The general plan followed in this paper will be first to review some of the
basic quantitative ideas on the theory of probability t h a t have already been
current, t h e n to go on to present the (, general statistical problem ~, and t h e n
to specialize to the problem of probability. W e wish to m a k e it clear a t the
outset, however, t h a t an appreciable n u m b e r of points appearing in this p a p e r
will be a repetition, in our own terms of course, of ideas t h a t have already
been published elsewhere. This repetition is, however, essential for the clarity
of presentation of the material, because our principal conclusion is really t h a t
it is just the combination of these various ideas, which h a v e appeared mainly
in a scattered form until now, combined with our own results, which indicates
66 - Supplemento al Nuovo Ui~'~tenlo.
](ill6
It.
I'>I)IIM
a l i ( l W. 5('Ili:'I'ZLP
2.
- Chance
and
Statistical
Law.
(*) .\fler x~t, had lildsll4.d lhi.~ work. Prof. 31Al;lo 5cltOxl;Hr
called our attenti~m
lo lhe facl lhal ~oll c of lhe ideas ou chance and probabilily appearing in this paper
had already I>e(ql ~ngge~led by I'OI'RNIV]"(Erpositio]t ilO 1r ~]ts
de8 ('lt(o~(;(t6" et den pr,hal~ilih;.% Paris, I s43).
(-) E.o.: one of the molorists could have slarled lea seconds earlier or later, or
slowed down lo avoid a cat I]iat happened Io eros,- lh(, road, or he could have o'iwqt
lh(' wheel a slightly different tin'n, etc..
T H E G E N E R A L S T A T I S T I C A L P R O B L E M IN P H Y S I C S AND T H E T H E O R Y OF P R O B A B I L I T Y
I007
each of these essentially i n d e p e n d e n t sets of causes will in general act a t different and hot s y s t e m a t i c a l l y related times a n d places, a n d in different a n d
not s y s t e m a t i c a l l y related w a y s in all other respects, the net result will be
j u s t the complicated fluctuations in the details of different accidents which
we h a v e described above, fluctuations which will be unpredictable in the c o n t e x t
u n d e r discussion (although t h e y are in principle predictable in a sufficiently
b r o a d context).
As the n u m b e r of accidents under consideration becomes larger a n d larger,
however, new properties begin to a p p e a r ; for one finds t h a t individual variations t e n d to cancel out a n d regular statistical trends begin to show t h e m selves. Thus, the t o t a l n u m b e r of accidents in a p a r t i c u l a r region generally
does not change m u c h f r o m y e a r to y e a r ; a n d the changes t h a t do t a k e place
often show a regular s y s t e m a t i c trend. Moreover, this t r e n d can be altered
in a s y s t e m a t i c w a y b y the a l t e r a t i o n of specific factors on which accidents
depend. Thus, when laws punishing careless driving are enforced, or when
regular inspection of mechanical p a r t s is required, the m e a n accident r a t e in
a given region always undergoes a downward trend. I n the case of an indiv i d u a l m o t o r i s t taki~g a p a r t i c u l a r trip, no v e r y definite predictions c a n i~
general be m a d e concerning the effects of such measures, since there are still
a n enormous n u m b e r of causes of accidents t h a t h a v e not y e t been eliminated;
y e t statistically, as we h a v e seen, variations in a p a r t i c u l a r cause produce a
regular a n d predictable t r e n d in the effect.
The b e h a v i o r described a b o v e is f o u n d in a v e r y wide r a n g e of fields including social, economic, medical a n d scientific statistics, a n d a whole host of
o t h e r applications. I n all these fields, the following are the m o s t i m p o r t a n t
characteristics:
1) E a c h e v e n t or result depends contingently on a v e r y large n u m b e r
of essentially i n d e p e n d e n t causal factors lying outside the c o n t e x t u n d e r discussion.
2) These causal factors all h a v e c o m p a r a b l e orders of i m p o r t a n c e (*~)
{although t h e y will not in general be e x a c t l y equal in their effects).
3) These causal factors v a r y sufficiently so t h a t in a series of similar
events~ the events of interest fluctuate in a w a y the details of which c a n n o t
be predicted f r o m a n y d a t a t h a t can be t a k e n solely in the n a r r o w e r c o n t e x t
(*) If the effects of a few of the factors are many orders of magnitude more important than the effects of all the others, combined, then these factors would be regarded as operating systematically. The event in question would still be contingent
on these causal factors, but it would be inappropriate to describe their effects as
being those of chance.
1008
D. BOIt:~! a n d ~V.SCItI'TZER
u n d e r d i s c u s s i o n . It celt be p r e d i c t e d , ho~vever, t h a t in a l o n g e l t o u g h s e r i e s
of e v e n t s , e v e r y p o s s i b i l i t y will be b r o u g h t a b o u t in t h e s e f l u c t u a t i o n s . M o r e o v e r , t h e m o r e s e n s i t i v e l y d e p e n d e n t t h e e v e n t of i n t e r e s t is on t h e e x t e r n a l
c a u s a l f a c t o r s t h e m o r e f a v o r a b l e a r e t h e c o n d i t i o n s for t h e c r e a t i o n of e h a n e e
f l u c t u a t i o n s of a p p r e c i a b l e i m p o r t a n c e .
4) i n a n a g g r e g a t e c o n t a i n i n g m a n y m e m b e r s , t h e c h a n c e f l u c t u a t i o n s
t e n d to cancel o u t , so t h a t t h e a v e r a g e or l o n g - r u n p r o p e r t i e s c a n b e p r e d i c t e d
a p p r o x i m a t e l y on t h e b a s i s of statistical l~~cs. S u c h l a w s c a n b e a p p l i e d w i t h o u t
t h e n e e d to ~'o ~o t h e b r o a d e r c o n t e x t in w h i c h one w o u l d t a k e i n t o a c c o u n t
the a d d i t i o n a l c a u s a l f a c t o r s t h a t .a'overn t h e d e t a i l s of t h e f l u c t u a t i o n s of t h e
i n d i v i d u a l m e m b e r s of t h e afz~'regate.
3. -
The Concept
of P r o b a b i l i t y
(*) The earliest mathematical discussions of probability seem to have been g~ven
b y PASCAL, FER~,IAT, HUYGHENS, BERNOULLI a n d I,APLACIL
TH]E G E N E R A L S T A T I S T I C A L P R O B L E M IX P H Y S I C S AND T H E T H E O R Y OF P R O B A B I L I T Y
1009
(*) The importance of the relative frequency in connection with probability was
first recognized by ELLIS (On the Foundations o] the Theory o] Probabilities, in Trans.
o] Ca~b. Phil. Soc., 8, 1-6 (1844)), (paper read in 1842), and then by CouR~-o'r ( ~ p o sition de la tbdorie des chances et des p~vbabiltd s, Paris, 1843). However, neither of
these authors fully eliminated the subjective point of view, VEx~ (~l'hc Logic o/Chance,
London and New York, 1866) advocated systematically a pure frequency interpretation.
lo10
1). B o i i ; i
alld
w . SCllliTZI.;P,
l*) I]oth ~',~CJ:XOT and \h,:N~ recognized t;hi~ fact explicitly. Indeed, the exposilimt in this se(.tiou follows closely atone" the ~eneral lines laid dowtt by {'OV,Rxo'r.
q ' n E G ] ~ N E R A L S T A T I S T I C A L P R O B L E M I N P H Y S I C S A N D T H E T H E O R Y OF P R O B A B I L I T Y
1011
4. -
of y o n M i s e s .
1()]')
1). B()II'r
alld
vc. S C t l i ~ T Z E R
ide~ttified with
(, i ' a l l d O l l l l l e S s
~) e l '
<~ lawlessness
,.
1,2vi(lently, i n a v e r y regula,"
Nevertheless,
nol o n e - s i x t h ,
(*) Actually, ~(~x- MlsJ,:~ developed his delinition ori~imdly il~ cmmeetiou with lhe
proldcm of wim~illg a -ambling g'ame by t,eltin~ on ~ubsequences of plays ehoseli front
the main sequence l)y some predetermined kind of rule or ,, systeIn ,,. The fact tha,t in
real /ames, such efforis to beat the ~ame, have always failed is then taken to
demonstrate ~he randomness of the sequence of plays. It is evident, however, that
the possibility of Iindina" a , winninR" system ,, in a g'ambling game depends on whether
or nol ,~amples drawn from the entire sequence of plays will have the same distribution
of relati~ e frequencie,, a~ are present in the sequence as a whole. Thus, the fonnulatio~t
of the definition of randomness in terms el the possibility of drawing representative
salli|)|(~y; t ' O l ! l a i l l s lilt* ffamblinR' R ' a l l i e a ~ a speeial e a s e .
T H E G E N E R A L S T A T I S T I C A L P R O B L E M I N P H Y S I C S AND T I I E T H E O R Y OF P R O B A B I L I T Y
]013
]0]4
T h e s e c o n d i m p o r t a n t c r i t i c i s m of t h e w m Mises t h e o r y of p r o b a b i l i t y is
c o n n e c t e d with t h e p r o p o s e d c r i t e r i o n for r a n d o m n e s s . T h e e s s e n t i a l p r o b h ' m
here is t o define lnoFe p r e c i s e l y w h a t is m e a n t b y <~i r r e g ' u l a r i t y ,> or <<lawlesshess ,>. T h u s . \ o N MtSFS a l r e a d y r e a l i z e d t h a t he cmfid n o t r e q u i r e t h a t t h e
r e l a t i v e f r e q u e n c y in ccery ims.~ibb, s u b s e q u e n e e b e t h e s a m e as i n t h e m a i n
s e q u e n c e , since one could, for e x a m p l e in t h e c~se of t h r o w s of a coin, a l w a y s
choose t h a t s u b s e q u e n e e in whieh t h e r e s u l t s c a m e o u t h e a d s , so t h a i t h e
r e l a t i v e frequen<'y of h e a d s w o u h l be u n i t y , a n d n o t o n e - h a l f , as i t is for t h e
sequen<.e as a whole, l i e d e a l t w i t h l h i s p r o b l e m 1)3- requiring', t h a t t h e sul>s e q u e n c e be ('hosen b y a metho(1 of sampling" t h a t is defined belore all t h e
results of a ,,,'iven sequen<.e a r e a v a i l a b l e : or m o r e 1)reeisely, t h a t t h e s u b s~*quenee be chosen by a m e t h o d t h a t d e p e n d s o n l y on t h e posit.ion of a g i v e n
e w , n t in t h e s e q u e n c e a n d on t h e l>roperties of t h e e v e n t s b e f o r e it. E v e n so,
h o w e v e r , t h e c r i t e r i o n is v e r y v a / a e , a n d e a n n o l be a p p l i e d in a n y l n ' a e t i e a l
p r o b l e m (e./. one e o u l d h a r d l y <.ounl t h e r e l a t i v e f r e q u e n c i e s in a l l p o s s i b h '
s u h s e q u e n c e s fittino/ i n t o t h e a b o v e c r i l e r i o n ) . M o r e o v e r , it is b y no m e a n s
vleav t h a i such a c r i t e r i o n is s e l f - c o n s i s t e n t , since it h a r d l y s e e m s l i k e l y t h a i
,',','r?l m e t h o d of sampling' fallin~z w i t h i n its scope will give t h e s a m e r e l a t i v e
f r e q | | e n e i e s as in t h e m a i n s e q u e n c e .
T h e lwoblem of d e f i n i n / w h a t one m e a n s b y <<irrela'ularity >> or <<lawlesshess ,> is a p a r t i c u l a r l y t h o r n y one. since in t h e v e r y a c t of d e f i n i n g it, one is
i n e v i t a b l y as<.ril>ing to it s o m e k i n d of reg'ularity a n d law. I n d e e d , v a r i o u s
m e m b e r s (*) of t h e school of \(>x 31[SES s e e m to h a v e Iziven u p t h e a t t e m p t
to d e l i n e r a n d o n m e s s as lqwlessness. T a k i n / as a s t a r t i n / p o i n t a s e q u e n c e
of p l a y s in a g'amblin,g" / a m e . t h e y p r o p o s e to define r a n d o m n e s s in t e r m s of
a specitie a n d well-defined l w o p e r t y of such a s e q u e n e e ; viz, t h a t if a p l a y e r
s y s t e m a t i c a l l y follows a n y one of a c o u n t a b l e i n f i n i t y of rules of p l a y w h i e h
/ h e y detine, he <.an n e v e r <, h e a l ,, t h e g'ame. T h e s e efforts to d e l i n e r a n d o m n e s s
a r e in t h e i r t u r n , h o w e v e r , s u b j e e t to i m p o r t a n t c r i t i c i s m s (e). Aside f|'o|n
sharing' in t h e defect of t h e approa<.h of vo.\ 3[tSES t h a t it is n o t <,lear h o w
t h e y c o u l d be a p p l i e d in p r a c t i c e to a wide rang'e of p r o b l e m s , t h o s e c r i t e r i a
H.
(.~OPEL,,NI~, e{3C..
T H E G E N L R A L S T A T I S T I C A L P R O B L E M LN P H Y S I C S A N D T H E T H E O R Y OF P R O B A B I L I T Y
1015
5. -
The
Modern
Axiomatic
School.
(*) As we shall see in Section 9, there are strong reasons to suppose that the criteria
for randomness may have to possess a nmeh wider range of adjustability to the specific
characteristics of the various statistical problems to which they are applied, than is
possessed by the criteria that we arc discussing here.
(a) A. KOLMOGOROFF:Grundbegri]je der Wahvsc]~einlichl~eitsrechnang, in Erg. d. Math.,
2, m 3.
lO16
i).
B~IIlM itli(|
W. ~CII[~'I'ZEII
perhaps under other conditions alld ill. connection witll new kinds of statistieaI
problems, new kinds of m a t h e m a t i e a l abstractions m a y be needed (Just as
the abstractions of Euclidean ~'eometry m a y have to be replaced b y the more
~'eneral ones of non-Euclidean ~'eometry, to deal with astronomical d o m a i n s
of space), llox~ever tile l)osition taken by m a n y members of the modern
axiomatic school seems to imply that the calculus of probability already represents the most ~'eneral possible 1)urely m a t h e m a t i c a l means of treating'
statistical problems. Such a point of view tends to confine tile horizo,| of
our thinking about statistical problems to nothing' move t h a n the calculation
of one kind of probability in terms of another kind. On the other hand, as
we shall see in the later sections of this paper there exists a wide variety of
aetnal eases in which arise nlu(,h more ~'eneval statistical problems t h a t can
a.dvanlag'eously be treated by methods going' beyond those of the theory of
probability and that in m a n y cases must be treated by such methods. Thus.
it is our opinion t h a t while tile era'rent theory of probability m a y consistently
be re~'a|,ded as a self-eontained logical diseipli|w, it m a y also be studied both
m a t h e m a t i c a l l y and in more ~'eneral ways, within tile context of a broader
point of view. in which the meaning" of ils basic concepts becomes clearer, as
one comes to see their setti||~' in the ~'eneral domain of statistical problems,
as well as tile processes by which tile laws of probability <,ome to be valid,
and the factors limiting" the domains of validity of these laws.
6.
- On t h e
General
Statistical
Problem
as It A r i s e s in S t a t i s t i c a l
Mechanics.
T I l E G E N E R A L S T A T I S T I C A L P R O B L E M I N P H Y S I C S A N D TH)] T H E O R Y OF P R O B A B I L I T Y
i017
Q [n~]
nl ! n~ ------T.
! ... n~ A ~ '
where A~2 is the Volume in phase space of the cell corresponding to a n y one
a.rr~ngement leading to the s a m e [nr].
W e now consider, as is usually done in stutisticul mechanics, a region of
phase space lying b e t w e e n two hypersurfaces of c o n s t a n t energy, E, a n d E + A E ,
where A E is quite smull c o m p a r e d with E. As is well known, a n isoluted
s y s t e m moves along a surface of c o n s t a n t energy, so t h a t if we s t u d y t h e properties of such a shell of thickness A E a n d let A E a p p r o a c h zero, we shall
o b t a i n a n a c c u r a t e r e p r e s e n t a t i o n of the properties of the h y p e r s u r f a c e on
which such a s y s t e m moves.
The n e x t p r o b l e m is to o b t a i n a n a p p r o x i m a t i o n to t h e expression (6.1),
which is valid when the n, ~re large. This is conveniently done in t e r m s of
the expression for In/2In,]. W e use Stirling's a p p r o x i m a t i o n for the n,, obtuining
(6.2)
In Q[nr] ~ n in n + In A~2 -- ~ n i In ni .
i
(~) This problem has been treated in various ways by many authors. See, for
example, P. and T. E r m ~ S T :
Encyelopedie des Sciences Mathdmatiques, 4, 2, Sup~pl6ment II.
I{tlS
I).
ll(fll'~l
~l)ld
W.
SCII{:TZItl
the
i s o l a t i o n of t h e
system,
that
the
total
ener~'y is a
constant.
number
of
particles is a
constant
(~,,E,--E
= 0) a n d
(~ ,, -- ~
~,, ~-)'
(6.3)
that
the
h, ~-2 - - :.( 2
total
,,, - - ,, ) - - : , ( 2 , , , = ,
- - ~:).
(6.1)
2,~,
""
1)I'
((;.5)
,;, -
;;,,
,1 ex 1) 1-- [,/',', I 9
=-0.
(i~.i;)
.,
~1 e x p l - -
.~v2,'2/,T],
E, = 89
,o
--
~,
--
~,,, ol)tainin~"
(t;..)
(|;.s)
T H E GENJERAL S T A T I S T I C A L P R O B L E M I N
P H Y S I C S A N D T H E T H E O R Y OE P R O B A B I L I T Y
1019
(*) A much more elegant and precise method is available for deducing this result.
See, for example, the proof of the central hmit theorem in A. I. KuI~Cm~: Statistical
Mechanics {New York, 1949). The method that we have chosen here is, however, based
on essentially the same ideas, and has the advantage of bringing out more clearly what
is important for our purposes.
(+) This fact has long been weli'known. See, for example, reference (b).
(6) See for example, A. I. K n i f e , I N : Statistical Mechanics (I~ew York, 1949).
(-) We shall discuss a few such cases in Section 9.
(*) In tile problem t h a t ue are considering here, there is not enough time for this
to h~ppen. Thus, as ~ consequence of a theorem proved by PoI~ca~s (.Iota Matt~.,
13, 07 (1890)), in a typical mechanical system, eont.aining something like l02a molecules, it would take a fantastically long time, of the order of trillions of trillions of
years for the orbit to come reasonably near to every point of phase space.
(+) This point seems to have been for a long time known to a number of writers
in the field (see, for example, _~. 1. Kmxc~Hx: Matt~e~natical Fo,t:ndations o/ Statistical
.IIeclmnics, ~ee. 13). Itowever, as far as we can tell its full significance with regard to
the lack of neeessit.y of using a prohabilistic treatment of stat.istieal mechanics does
not seem to have been realized.
TII]~5 G E N E R A L S T A T I S T I C A L P R O B L E M I N P H Y S I C S A N D T H E T H E O R Y
OF PROBABILITY
I021
systems. We may, however, consider the possibility of deducing the statistical properties from intermediate types of laws, which are neither completely
probabilistic nor completely deterministic. The need for such intermedi~.te
~ypes of laws can be seen b y considering the fact t h a t no system can be described completely in terms of a n y finite n u m b e r of properties. First of all,
a n y system is always interacting with other systems outside of it. In m a n y
cases, these interactions seem to be small. B u t because the detailed motions
of individual systems are unstable, t h e y can in the long run, be affected a
g r e a t deal even b y small disturbances arising outside. Secondly, laws operating at a particular level are generally subject to disturbances arising a t
other levels, not included in the laws in question. F o r example, the laws of
t h e macroscopic level are subject to fluctuations of the t y p e of Browni~n motion,
arising f r o m the motions a t the atomic level. B u t the laws of classical mechanics operating at the atomic level are subject to further fluctuations arising
~rom the quantum-mechanical properties of matter. And because of the instability of motion, these fluctuations can have i m p o r t a n t effects on the detailed
behavior of the orbits even a t a level where classical mechanics would otherwise be a good approximation. Thus, one comes to the conclusion t h a t in a n y
real problem, in which one necessarily leaves out of consideration an unlimited
n u m b e r of effects coming from outside from the system and from other levels,
a precisely determinate and complete causal prediction c a n n o t be obtained.
I n general, therefore, we expect to obtain f r o m our laws predictions only of
the m e a n behavior of a system, and of the range and general c h a r a c t e r of the
fluctuations away from this mean, arising from the infinity of factors t h a t
have not been t a k e n into account. Because of the insensitivity of large scale
averages in statistical mechanics, however, a v e r y wide range of laws of the
t y p e described above is evidently possible, which lead to a p p r o x i m a t e l y the
same large scale statistical properties (such as the Maxwellian distribution).
On the basis of considerations of the t y p e discussed above, one is led to
consider the question of whether one ought not to seek a more general sta ~
tistical formalism (*) and not always to t r y to f r a m e all statistical problems
in terms of the culculus of probabilities. Of course, the t h e o r y of probability
has demonstrated its applicability in a wide range of fields, starting with
gambling games, and going on to the fields of medical, social, industrial, economic, and scientific statistics. Nevertheless, as we have already seen, in our
discussion of the foundations of statistical mechanics {certainly a s t u d y t h a t
falls within the scope of statistical problems), the theory of probability already
(*) Thu.-. t)ecausc of the (~xlremely great length of lhe Poincard cycles, there is
11o ~illle for a system to come reasonably ('lose to every point on a surface of constant
onelg 3 ill phase space.
(7) p. ttERTZ: lieber-(;(r,~, in ]iepertori.um d. 1)t~ysil,. Bd. 1, 2 (Leipzig, 1916);
T. 1,FW-('IW'rA: Drei I'orle.sttttfte~ iiber adiabali.~('l~e l~raria~de,, in tbh. d. 31al],. .~'e.m,i~ar.~ Ilamb.~fl, 6. 323 (192~).
T H E G E N E R A L S T A T I S T I C A L P R O B L E M I N P H Y S I C S A N D T H E T H E O R Y OF P R O B A B I L I T Y
1023
in a certain domain. Rather, the way should be left open for new statistical
concepts, and associated m a t h e m a t i c a l methods, which m a y shed new light
on the meaning of entropy.
Another i m p o r t a n t application in which the concept of probability m a y
have only a r a t h e r limited validity is the s t u d y of non-eqnilibrium problems
in statistical mechanics. If we are interested in how a system approaches
statistical eqlfilibrium, then a certain difficulty of principle arises in connection
with the application of the concept of probability. F o r the statistical properties of a system, (e.g., pressure, temperature, velocity, etc.), are changing
with the time. The usual m e t h o d of defining probability as the average over
an unlimited period of time t h e n will not work. Of course, if the statistical
averages are changing slowly, one m a y define the probability a p p r o x i m a t e l y
in terms of the average over a period of time during which these properties
do not change very much (s). F o r rapid changes, however, even this m e t h o d
does not work. The other possible m e t h o d of defining probability is through
a statistical ensemble. B u t it seems unsatisfactory to have to t r e a t w h a t is
a f t e r all the approach of a single system to equilibrium in terms of an average
over a non-existent ensemble. Suck a t r e a t m e n t m a y give correct results in
certain contexts, b u t again it is r a t h e r like ~studying the motion of a thing in
terms of the motion of its shadow. Perhaps new statistical methods and
concepts would here m a k e possible a deeper anderst~.nding of the problem of
the approach to equilibrium in such systems.
E v e n if we restrict ourselves to statistical laws t h a t are expressible in terms
of the laws of probability, our point of view shows t h a t one can still t r e a t a
broader range of problems t h a n just the calculation of one kind of probability
in terms of another. F o r within the setting in which statistical law is regarded
as more general t h a n probability, it has meaning to d e m o n s t r a t e the validity
of the laws of probability in certain contexts, a n d to calculate the numerical
values of the probabilities, on the basis of laws t h a t are more general t h a n
the laws of probability. Thus, as we h a v e seen, we can calculate a probability
f r o m a d e t e r m i n a t e law, or f r o m a n intermediate t y p e of law, which is p a r t i a l l y
determinate. Vice-versa, a partially determinate law can be obtained from a
probabilistic law, when the n u m b e r of elements N, in the distribution is large.
F o r example, according to BernouUi's theorem, one c~n for practical purposes
predict t h a t the error in a certain average will decrease as h r-89 As N approaches infinity, the partially d e t e r m i n a t e law will become more a n d more n e a r l y
a completely determinate law. Thus, we see how the adoption of a b r o a d e r
point of view toward statistical law permits us to a t t a c k the problem of the
interconnection of various kinds and levels of law.
(8) See, for example, M. SCH/kNB~G: Nuovo CimenIo, 10, 419 (1953) where a discussion is given of the problem of taking averages in a non equilibrium process.
1024
7. -
i).
A Further
Example
of a P r o b a b i l i s t i e
I'OIIM alld
of t h e
w.
General
,~CHI?'I'ZI.iR
Statistical
Problem
- The
Deduction
Law.
TI"IE GENERAL STATISTICAL :PROBL]~31~IIN PHYSICS AND THE THEORY OF PROBABILITY 1025
Fig. I.
Ax~ =
v~
-- h),
1026
0,~+1- KO,,.
(7.1)
iu KO,,.
We see also that our simplified model gives a schematic representation of
tile significant aspects of tire actual process of collision. First we note t h a t
*he angle 0,_~ changes by 2~r when tile initial position of tlre particle just before
it, strikes the (~ 1)-th obstacle changes b y LdO
2~L/K. Now if K islarge
it is clear that a small change of position will lead to a change of angle of 2a.
Thus we e m b o d y in our model something sinfilar to tire extreme sensitivity
of the angle of collision to the collision parameter, t h a t is so i m p o r t a n t in
real collision processes for bringing a b o u t a uniform distribution. In fact,
one could regard this model as more or less equivalent to having relative to
each collision point a uniformly distributed set of obstacles on a circle of
radius L. each of diameter 2.~L/K. Of course, in a n y given collision, tile locations of the effective obstaeles will depend ml where the previous collision
took place: wherea,~ in the real process it does nol. In this sense, our model
is a further simplification: but in the present work, the essential aspe('l of instability of motion is the principal feature of the real process t h a t we are interested in retaining in the model.
Let us now consider a sequence of collisions, starting with 0 :-: 0o. We
have
[ O,
liO,, - [ KOo l,
[ o . . . . . K"-o. -
(7.2)
tK%t
t 0 , := h ' v 0 , , - - t K % l
o,,}.
102~
(~) See, for example, H. ~'~,YL: Math. A~n., 77, 313 (1916).
|()~8
I). B l l l | ' q
;llld
V~. S('III~'I'ZE|{
Henee, t h e m e a s u r e of t h o s e t r a j e c t o r i e s passinK
can lead
to a ~'iven d i s t r i b u t i o n , N,
9(N, 0o. 0,) I0,.
Now, as we h a v e seen, a n y gi~',>+ s e q u e n c e has a m e a s u r e of /10.( ,10/2x)".
W e n m s t no~ c o u n t all s e q u e n e e s t h a t ean lead to a ~iven o(N, 0o, 0,) a n d
therefore l o a
-V'. NI'.
X>!
... N , ~
(7.3)
.11=
.V~
'..\.,.
='
,- , ( = 1 0 2 7 ) u
l0.
(7.4)
.V
c o n s t a n t - - : \ ~ = N 10 2.~.
(*) llere, we make use ~f lhe h , ' t thai if any ~iven ~equence. /(0 o, =Y) is po.~.~ible
lh#ll ~|lly other .-e~luence involvin~ an exchange of any rwo of the magles is also possible. This follows from the lar~e value of the inte~er, E, which has as a consequence
thai ,_,'iven any ~ alues ,}f 0,,. ~e ,'an always obtain any desired ~ alue of 0,~ ~1 simply bs:
a small change of q, thai doe,~ not take il out of the region :10,~ of width. 2.~m/K. :
T H E G E N E R A L S T A T I S T I C A L PROBL]dM IN P f t ' i - S I C S A N D T H E T H E O R Y OF P R O B A B I L I T Y
109~}
In M ~ In Mo -- ~ (~N~)~/27e = In Mo -- ~2~(6N~)2/2NAO.
/
we get
M ~ M~ e x p [ - - ~ 89
i
We see t h e n t h a t the measure of those distributions in which 6]~ is a p p r e ciable becomes negligible as N - + c~. Thus, fixing a given AO and a given
6]io, however small, we can always choose N so large t h a t the measure of
those sequences in which ~]~ > ~]~0 is less t h a n a n y specified number.
W h a t is the meaning of this result? We first remind ourselves t h a t all of
the possible sets of initial angles, t h a t can lead to a given distribution,
N ~ = O(0o, 0~, N) arrange themselves into small elements of width, AO/K ~.
Thus as N - + c~, the width of each element approaches zero v e r y r a p i d l y .
This behavior corresponds to the e x t r e m e instability of motion, since a change
of A O o - - A O / K ~ would change 0~ b y AO and therefore alter the distributio n.
Thus, to get any given t y p e of sequence, as N - + ~ we would need a
perfect fit for the b o u n d a r y conditions. B u t as we have seen, the overwhelming
majority of sequences correspond to distributions t h a t are v e r y nearly uniform.
Only a comparatively small n u m b e r of regions of width flO/K ~ will lead t o
a n y t h i n g different (such special regions will occur, for example, where 0o is
rational). Although the n u m b e r of such special regions approaches infinity
as N--> c~, this approach is much less rapid t h a n is the decrease in width
of each region, so t h a t the total measure of such special regions ~pproaehes
zero,
aS
2V--+ c~.
/o30
D.
BOI[M
alld
w.
S('II[YTZER
it far from this special state and lead eventually to a unifornl distribution.
S o w , it is true t h a t such a disturbance might accidentally throw the system
into a n o t h e r of the special angles, 00. of the set of measure zero, not leading
to a uniform distribution. This would require however a very special exact
coordination between the external disturbance and the motion of the system.
I t is evident t h a t for practically all possible motions, this kind of eoordinat,ion
would not exist. Thus, it is consistent with the t h e o r y to suppose t h a t the
motions are such l h a t there are no such coordinations. Moreover, this supposition is also consistent with all of our practical experience. For example,
if we consider a cone standing on its point, this position is one of unstable equilibriunl. It would be conceivable that the fluctuations of the surrounding air
and of the table on which it is supported would be just, such as to maintain
it in its position of unstable equilibrium for all tinle but evidently this never
happens in reality (*).
We conclude then that the t r e a t m e n t of the sequence given here is a special
ease of the general statistical problem, ill the sense t h a t certain long run or
average properties can be predicted without taking into account the precise
factors needed for a prediction of the details of the behavior of the system.
For as we have sPell, once we exclude tile special set of O0 of lneasure zero
described above, then the mean distributiou of angles is independent of 00,
and is therefore independent of the b o u n d a r y eonditious t h a t determine the
precise motion. We e<)llelude then t h a t the average value of any int.egrable
x
be equal to
(7.s)
" ~
l(0)
--
~ ;t/
tf(~176 9
o
(*) it i.~ interesting to note that ('OURXOTill his ~J'l)ositiot~ de la Thdorie des (~i~atu'e.~
et des P.robabilit~ already realized tlle importance of this point, and in fact used the
example of the cone standing on its point as all argument in favor of the the~is that
in chance phenomena, no special properties depending on the infinitely precise ~peei~ication of any particular quantity could be of physical importan(.e.
(lo) H. POIXCARs ('ale,l de.~" t'roh,,hilitd.~ (Paris, 1912).
T I I E G E N E R A L S T A T I S T I C A L P R O B L E M I N P H Y S I C S A N D T H E T H E O R Y OF P R O B A B I L I T Y
1031
i m p o r t a n c e of i n s t a b i l i t y in leading to a n equidistribution. Indeed, he considered t h e case of a p a c k e t of asteroids, each in neighboring, b u t slightly different circular orbits. Because the p h a s e angle of a n y p a r t i c u l a r asteroid relative to t h a t of a n o t h e r is u n s t a b l e {i.e., it increases indefinitely with the time),
one expects t h a t in t i m e the asteroids will h a v e a uniform a n d more or less
r a n d o m distribution of p h a s e angles (which is in f a c t found in a s t r o n o m i c a l
o b s e r v a t i o n s (*)).
I n order to t r e a t this p r o b l e m , POI~CAR~ considered a n a r b i t r a r y continuous
initial p r o b a b i l i t y distribution (e.g., one h a v i n g a v e r y n a r r o w b u t continuous
p e a k n e a r a certain point). H e t h e n showed t h a t i n d e p e n d e n t l y of the precise
initial f o r m of such a distribution, the p r o b a b i l i t y would e v e n t u a l l y b e c o m e
a c o n s t a n t t h a t is i n d e p e n d e n t of the angle. F o r discontinuous initial distributions (e.g., a delta function) this would not in general occur. B u t here
POINCA]~ used a n a r g u m e n t similar to the one b y which we h a v e excluded
sets of m e a s u r e zero as not r e p r e s e n t i n g real physical possibilities, l i e did
this b y pointing out t h a t in all real problems, there is a b a c k g r o u n d of fluct u a t i o n s coming f r o m influences outside the scope of w h a t is t r e a t e d in the
p r o b l e m (e.g., the motions of other planets, a n d even the molecular motions)
so t h a t no discontinuous p r o b a b i l i t y function (such as a delta function) can
represent a n y real physical situation.
POINCAR]~S conclusion was thus essentially the same as ours. B u t the
weak point in his a r g u m e n t is t h a t his m a t h e m a t i c a l methods do not correspond to the real context of his physical ideas. Indeed, f r o m a strictly
logical point of view, the a s s u m p t i o n of the initial n a r r o w l y p e a k e d a n d continuous initial distribution is not f u n d a m e n t a l l y different f r o m m a k i n g assumptions of the u l t i m a t e u n i f o r m p r o b a b i l i t y distribution to begin with. I t
would thus b e concluded t h a t while P o I ~ C A ~ has demi)nstr~ted a n interesting
a n d useful relationship b e t w e e n two p r o b a b i l i t y distributions t ~ k e n a t different times, he has still r e m a i n e d within the basic f r a m e - w o r k of lust calculating one p r o b a b i l i t y in t e r m s of another.
On the other hand~ the significance of our result is t h a t the d e v e l o p m e n t
of a n eqnidistribution .can be u n d e r s t o o d w i t h o u t a n y reference to the concept
of p r o b a b i l i t y a t all. This we h a v e done b y recasting WEYL'S results on the
d e v e l o p m e n t of a n equidistribution in a f o r m in which the physical significance of the sets of measure zero could easily be seen, so t h a t we can unders t a n d w h y t h e y m u s t be excluded (+).
1032
D. B()IIM
ltIld
w.
S('ttl}TZEtR
~ I I E GEN~ERAL S T A T I S T I C A L P R O B L E M I N P H Y S I C S A N D T H E T H E O R Y OF P R O B A B I L I T Y
1033
t h a t could lead to the same statistical results. Indeed, even ~ law intermediate
between determinate and probabilistie could h a v e the same statistical effects,
provided t h a t as h r increases without limit, it is quite rare t h a t the point in
sequence space finds itself inside the minute volume corresponding to distributions deviating appreciably from a uniform one.
The above result must, however, be interpreted with care. I t does not b y
any means signify, for example, t h a t all distributions must be uniform. Laws
are evidently quite possible t h a t would lead to non-uniform distributions, or
even to distributions t h a t approached no p a r t i c u l a r limiting behavior at all.
All t h a t we h a v e shown is t h a t a wide range of laws exist t h a t could le~d to
uniform distribution, so t h a t the appearance of such a distribution need
not be sensitively dependent on having a certa.in special kind of law determining
the sequence. Moreover, we can also u n d e r s t a n d why a wide range of perturbations of the system need not disturb the uniform character of the distribution significantly.
1034
I).
B~)tl'd
dO,.
dP,,
;llLd
~,.
t:('III:'I'ZFt~
p , "l~,,, dOqdO, .
<l P i]'
]},' P'_,"dOdlO' -
p;~,.(10,, d0,~.
Moreover. we m a y l)e interested in c o m b i n a t i o n s involvinR' m o r e events.
Thus. we wish to define p r o b a b i l i t y t h a t if a f t e r a ~iven collision t h e an~'le
is b e t w e e n O, a n d O+ dO, a n d a f t e r x~ a d d i t i o n a l collisions b e t w e e n O.q and
OqC-dO,: the ano'le a f t e r still a n o t h e r y.., collisions will be b e t w e e n Oq aitd
Oq dOq. I t is e v i d e n t t h a t a r b i t r a r i l y hio/h order conditional probabilities
can be defined in this wax-. The p r o b a b i l i t y of a coml)omld e v e n t (i,i2 ... it-)
is the|~
(s.3)
d P' .....~"-' = =
'~
)" i'"'~'-'dO,,
dO,x
_\ case of .areal interest is one ill which t)q, is i n d e p e n d e n t of i ,, P q s q indep e n d e n t of i t . i.,, etc.. It is readi]ly verified t h a t for this ease
(8.4)
t),,,.
P,
t I tff,~
P,~ .....
P,,,, .... x
= t),x
and
(8.5)
<1t',,
...,+
TIlE
G]~NF~I~AL S T A T I S T I C A L
PROBLEM
IN PHYSICS
AND
TH~
THEORY
OF PROBABILITY
~03~
(8.6)
l(O) =
2Yg
(0) t(O) dO
0
(0) dO = .,Vlira
.-..-).co
1(0.)
which would hold even if tile distribution were not uniform. I n order to prove
t h a t a probability, P(0), really exists for a given t y p e of sequence of angles 0~,
it would be necessary to show t h a t it is possible to find a function such t h a t
eq. (8.6) holds independently of 0o, except possibly for a set of measure zero.
In Sec. 7 we have already done this for the sequence, 0" ---- K'Oo, in which
case it turns out t h a t P(O) is a constant. Evidently, however, a more general
problem can be a t t a c k e d in a similar way. I t is not difficult to find a simple
determinate sequence leading to a non-mfiform P(0). I n this paper, however, we restrict ourselves to merely pointing out t h a t the m a t h e m a t i c a l framework is already broad enough to p e r m i t t h e t r e a t m e n t of problems involving
non-uniform probabilities.
The probabilities of compound events can be t r e a t e d similarly. To do
this, we consider a n y integrable function of two variables ](0~, 02). L e t us
recall t h a t in our sequence b o t h 0~ a n d 0.+~, are determined in terms of 00.
Thus, once 0~ is given, 0, is certainly determined. In general, such a determination would lead to correlations (e.g., 0,+~ 1 could be equal to 0 . - ~ ) .
W e shall see, however, t h a t in our problem, the relation between 0,+~, and 0.
fluctuate sufficiently as n changes so t h a t no such correlations exist.
If a probability P~,(0~02) of the compound event exists, t h e n we m u s t have
(8.7) lira
/(0~, 0,+6,) =
10;{(j
D.
BOIIM
alld
w.
SCtlI;TZEtl
(~.8)
.v
x,,~/(o,,,
o,,
--
,,/-- 2
n !..,
',,,,, "~p
[iifo,,
(s.o)
~ I(0,,, 0 ..... /
exp
[iO,,(~
,,~ So',)]
W e t h e n w r i t e 0,,
h'"O, a n d ( / - ) ~ t K " ) = C,,, ( p r o v i d e d t h a t C,,,,~-0).
replacino,. 0, b y 0,,,, :- ~'~,,,0~ we ~et t h e s u m in eq. (8.9)
2"',,,v
Tlten
1 ~ e x p [iK"O,,,,]
(~.io)
hm
N ](O,,, O~ ,,)
ao,,.
j)
Tim,s, we h a v e p r o v e d t h a t i n d e p e n d e i i t l y of 0o, t h e a v e r a - e s of a r b i t r a r y
funot.ions, ](010.,_) t h a t a r e e x p a n s i b l e in a finite n u m b e r of t e r m s in a t~'oul'ier
' ~ H E G ] ~ N E R A L S T A T I S T I C A L P R O B L E M I ~ ~ P H Y S I C S A N D TH]d T H E O R Y O F P R O B A B I L I T Y
1037
SappZemento al N u o v o Ci'me~do.
1038
D. B()tI?,I a l l d
w.
~CIII-TZEIr
T I t E G~;N]~RAL S T A T I S T I C A L P R O B L E M I N P14YSICS A N D T t I E T H E O R Y O F l a R O B A B I L I T Y
1039
9. -
A Criterion
for Randomness.
10~1
10. -
Further
Illustration
in Terms
of Random
Phases.
In this section, we shall apply some of the ideas suggested in the previous
sections to a problem t h a t has widespread application in statistical mechanics;
namely t h a t of the development of r a n d o m phases in a system of oscillators
or rotators.
The simplest example is the problem of r a n d o m phases with a system of
harmonic oscillators. L e t x~ be the coordinate of a given oscillator, y~ the
1047
H:
~ ~
(aid/dl;
-'- b ~ / ' d L
- c~.r,,/. ) .
i.i
As is well known, there exists a linear iransfornmtion, to the normal coordinates (~), q~ and to the eonjuffate momenta, pC, such t h a t the hamiltonian
reduces to a sum of squares, eaoh equivalent to a separate harmonic oscillator.
not coupled to the others.
~, - -
; H I l l > ,. [,,
(B,/A,)q,
o,
;H,;q,)
~,, -
(lt,/A,)p,
then lead to
()
q, -- ip, -
The soluiions
K , e x p [io>,ll,
where
(10.5)
(I0.6)
q, -- ip,
-~ ( J , ) i e x p [i,),l -~ q~.,J .
(10.7)
(f,
--- (fo,
(o,t
10~3
in statistical mechanics, e.g., t h e r m a l v i b r a t i o n of a solid, the t h e r m a l excit a t i o n of r o t a t i n g molecules, motions which can be solved in t e r m s of angle
~nd action variables a n d m o s t i m p o r t a n t of all, in the foundations of q u a n t u m
statistical mechanics (~). The a p p l i c a t i o n of the notion of r a n d o m phases
in the f o u n d a t i o n s of q u a n t u m statistical mechanics, we shall discuss in more
detail a t the end of this section.
I n order to illustrate in a simple w a y just how the phases b e c o m e r a n d o m (*),
let us begin b y considering the case of two variables. Now, in the p r o b l e m
t h a t we are t r e a t i n g here, we h a v e the new feature, not present in the t r e a t m e n t
given in sections 7 a n d 8, t h a t instead of a sequence of discrete values, we
h a v e to deal with a continuous function of the t i m e t. F o r this reason, we shall
replace the s u m over the N t e r m s of the sequence a p p e a r i n g in eq. (8.7) b y
an integral over the time, representing the m e a n value of an arbitra~T function,
] ( ~ ( t ) , f2(t)). (The use of the continuous t i m e a v e r a g e in this case corresponds
to the a c t u a l practice in the definition of physical quantities in statistical
mechanics (~3)). We t h e n obtain as ~ criterion for a uniform distribution t h a t
the following relationship m u s t hold i n d e p e n d e n t l y of initial conditions (except
possibly for a set of m e a s u r e zero):
ff
(10.8)
2~ 2:z
n~ a n d n~ being integers.
Now, writing ~ = 9~o~ w~t, ~0~ = 9~o.2 w2t we obtain
(10,10)
I n s e r t i n g (10.10) in (10.8), we note t h a t if ~ol a n d co2 are not e o m m e n s u r (~) See, for example, R. (3. TOLMA~,~: The Principles o] Statistival Mech,,~aic~, (Oxford,
19~0), Chap. IX.
(*) The development of random phases is already well known (see, for example,
reference (~)). However, we shall give a brief sketch of a proof here which is convenient
~or illustrating the concepts involved.
(la) See, for example, reference (~).
]044
I).
BOII'~I
;111(|
XV.
>('lti:'l'Zt-.l~
(lO. l l )
l..
2.'r 2,7
at
T >co
ti
i[
-_
ii
~1
(*) Thi~ re~uli i, esuenrially a l)roof of the qua,~i-ergodie theorem for t;he specia, t
TH]~ G E N E R A L
S T A T I S T I C A L P R O B L E M 'iN P H Y S I C S A N D T H E T H E O R Y
OF PROBABILITY
1045
(lO.12)
obtaining
(10.14)
~f = ~ anyJB~exp [i(p~],
~t
with
(10.15)
~f. ~ q~o. - - E . t / h .
1046
speaking, the wave function acts like an infinite collection of tmrmonic oscillators, one for each eiffenfunction of the hamiltonian. This is not surprising,
since the Sehr6dinger equation is linear ill V, and as is well known a wave
field satisfying a linear partial differential equation can be expressed in a
series of terms each oseillating with its own characteristic freqnency.
Now we note t h a t in general, the angular frequencies (,),~ = E,~t/D of oscillation of the wave function in differeni s t a t i o n a r y slates, will not be ('oremensurable (*). I n the h)ng run. we shall therefore have a uniform distribution in the eonliguration spa(.e of the phases, and the l)hases will be random
relative to each other.
Finally. we remark t h a t the problem of r a n d o m phases, is formally of the
same lype as the one treated by POIXCAR~ in connection with the distribution
(ff the phase an~les of a swarm of asteroids in neighbourin~" orbits (see Section 8).
11.
Conclusion.
(*) They at(' comiuensura|)le for th( ~ ol~e-(limen,~ional h;irm(,ni(' os~.illator, but. nor
Jn mosl other ~'enerM problenl~.
T H E [TcEN~3RAL S T A T I S T I C A L P R O B L E } [ IN P H Y S I C S A N D T I I E T H E O R Y
OF P R O B A B I L I T Y
104,7