In I three different types of theories have been defined which differ from each other with regard to the role of probability. We give a short review of the defining properties and supply some additional comments characterizing these theories.
The dogma underlying theories of type 1 is determinism with regard to single events; probability does not play
any role. If nature behaves according to this dogma, then measurements on identically prepared individual
systems yield identical results. Classical mechanics is obviously such a deterministic type 1 theory. We shall use
below (as a ’template’ for the dynamics of our statistical theories) the following version of Newton’s law, where
the particle momentum plays the role of a second dynamic variable besides the spatial coordinate
:
![]() | (2) |
In classical mechanics there is no restriction as regards the admissible forces. Thus, is an arbitrary
function of
; it is, in particular, not required that it be derivable from a potential. Note that
Eqs. (2) do not hold in the present theory; these relations are just used to establish a correspondence between
classical mechanics and associated statistical theories.
Experimental data from atomic systems, recorded since the beginning of the last century, indicate that nature does not behave according to this single-event deterministic dogma. A simple but somewhat unfamiliar idea is, to construct a theory which is deterministic only in a statistical sense. This means that measurements on identically prepared individual systems do not yield identical results (no determinism with regard to single events) but repeated measurements on ensembles [consisting each time of a large (infinite) number of measurements on individual systems] yield identical results. In this case we have ’determinism’ with regard to ensembles (expectation values, or probabilities).
Note that such a theory is far from chaotic even if our macroscopic anticipation of (single-event) determinism is
not satisfied. Note also that there is no reason to assume that such a statistical theory for microscopic events is
incompatible with macroscopic determinism. It is a frequently observed (but not always completely understood)
phenomenon in nature that systems with many (microscopic) degrees of freedom can be described by a much
smaller number of variables. During this process of elimination of variables the details of the corresponding
microscopic theory for the individual constituents are generally lost. In other words, there is no reason to
assume that a fundamental statistical law for individual atoms and a deterministic law for a piece of
matter consisting of, say, atoms should not be compatible with each other. This way of
characterizing the relation between two physical theories is completely different from the common
reductionistic point of view. Convincing arguments in favor of the former may, however, be found
in [4], [39].
As discussed in I two types (referred to as type 2 and type 3) of indeterministic theories may be identified. In type 2 theories laws for individual particles exist (roughly speaking the individuality of particles remains intact) but the initial values are unknown and are described by probabilities only. An example for such a (classical-statistical) type 2 theory is statistical thermodynamics. On the other hand, in type 3 theories the amount of uncertainty is still greater, insofar as no dynamic laws for individual particles exist any more. A possible candidate for this ’extreme’ type of indeterministic theory is quantum mechanics.
The method used in I to construct statistical theories was based on the following three assumptions,
These (properly generalized) assumptions represent also the formal basis of the present work. The first and second of these cover type 2 as well as type 3 theories, while it will be shown that the third - the requirement of maximal disorder - does only hold for a single type 3 theory, namely quantum mechanics. In this sense quantum mechanics may be considered as the most reasonable theory among all statistical theories defined by the first two assumptions. There is obviously an analogy between quantum mechanics and the principle of minimal Fisher information on the one hand and classical statistical mechanics and the principle of maximal entropy on the other hand; both theories are realizations of the principle of maximal disorder.
Let us now generalize the basic equations of I (see section 3 of I) with respect to the number of spatial dimensions and with respect to gauge freedom. The continuity equation takes the form
![]() | (3) |
We use the summation convention, indices run from
to
and are omitted if the
corresponding variable occurs in the argument of a function. The existence of a local conservation law for the
probability density
is a necessity for a probabilistic theory. The same is true for the fact
that the probability current takes the form
,
where
is the
th component of the momentum probability density. The only
non-trivial assumption contained in (3), is the fact that
takes the form of the
gradient,
![]() | (4) |
of a function . In order to gain a feeling for the physical meaning of (4) we could refer
to the fact that a similar relation may be found in the Hamilton-Jacobi formulation of classical
mechanics [63]; alternatively we could also refer to the fact that this condition characterizes ’irrotational flow’
in fluid mechanics. Relation (4) could also be justified by means of the principle of simplicity; a
gradient is the simplest way to represent a vector field, because it can be derived from a single scalar
function.
In contrast to I we allow now for multi-valued functions . At first sight this seems strange
since a multi-valued quantity cannot be an observable and should, consequently, not appear in
equations bearing a physical meaning. However, only derivatives of
occur in our basic
equations. Thus, this freedom is possible without any additional postulate; we just have to require
that
![]() | (5) |
(the quantity defined in (4) is not multi-valued; this notation is used to indicate that this quantity has been
defined with the help of a multi-valued
). As discussed in more detail in section 3 this new ’degree of
freedom’ is intimately related to the existence of gauge fields. In contrast to
, the second dynamic
variable
is a physical observable (in the statistical sense) and is treated as a single-valued
function.
The necessary and sufficient condition for single-valuedness of a function (in a subspace
) is that all second order derivatives of
with respect to
and
commute
with each other (in
) [see e.g. [28]]. As a consequence, the order of two derivatives of
with
respect to anyone of the variables
must not be changed. We introduce the (single-valued)
quantities
![]() | (6) |
in order to describe the non-commuting derivatives in the following calculations.
The second of the assumptions listed above has been referred to in I as ’statistical conditions’. For the present
three-dimensional theory these are obtained in the same way as in I, by replacing the observables
and the force field
of the type 1 theory (2) by averages
and
. This leads to the relations
where the averages are given by the following integrals over the random variables (which should be
clearly distinguished from the type I observables
which will not be used any more):
![]() | (12) |
The densities and
may be derived from the fundamental probability density
by means of the
relations
![]() | (13) |
The present construction of the statistical conditions (7) and (8) from the type 1 theory (2) shows two
differences as compared to the treatment in I. The first is that we allow now for a dependent external
force. This leads to a more complicated probability density
as compared to the two
decoupled densities
and
of I. The second difference, which is in fact related to the
first, is the use of a multi-valued
.
Note, that the dependent probability densities
and
have been
introduced in the above relations in a purely formal way. We defined an expectation value
[via Eq. (7)]
and assumed [in Eq. (10) ] that a random variable
and a corresponding probability density
exist. But the validity of this assumption is not guaranteed . There is no compelling conceptual basis for the
existence of these quantities in a pure configuration-space theory. If they exist, they must be defined with the
help of additional considerations (see section 6 of I). The deeper reason for this problem is that the concept of
measurement of momentum (which is proportional to the time derivative of position) is ill-defined in a theory
whose observables are defined in terms of a large number of experiments at one and the same instant of time
(measurement of a derivative requires measurements at different times). Fortunately, these considerations,
which have been discussed in more detail in I, play not a prominent role [apart from the choice of
discussed in section 4], for the derivation of Schrödinger’s equation reported in the present
paper3 .
Using the continuity equation (3) and the statistical conditions (7) and (8) the present generalization of the
integral equation Eq. (24) of I may be derived. The steps leading to this result are very similar to the
corresponding steps in I and may be skipped. The essential difference to the one-dimensional treatment is - apart
from the number of space dimensions - the non-commutativity of the second order derivatives of
leading to non-vanishing quantities
defined in Eq. (6). The result takes the
form
![]() | (14) |
In the course of the calculation leading to (14) it has been assumed that the macroscopic force
entering the second statistical condition (8) may be written as a sum of two contributions,
and
,
![]() | (15) |
where takes the form of a negative gradient of a scalar function
(mechanical potential) and
is the remaining
dependent part.
Comparing Eq. (14) with the corresponding formula obtained in I [see Eq. (24) of I] we see that two
new terms appear now, the expectation value of the dependent force on the r.h.s., and the
second term on the l.h.s. of Eq. (14). The latter is a direct consequence of our assumption of a
multi-valued variable
. In section 4 it will be shown that for vanishing multi-valuedness Eq. (14)
has to agree with the three-dimensional generalization of the corresponding result [Eq. (24) of I]
obtained in I. This means that the
dependent term on the r.h.s. has to vanish too in this limit
and indicates a relation between multi-valuedness of
and
dependence of the external
force.