Professional Documents
Culture Documents
ISBN
Contents
Preface 1
Contributors 139
Preface
João Queiroz & Priscila Farias
T
here seems to be a consensus that many of the classic problems in cognitive
science are strongly connected to the fundamental issues of information,
meaning and representation. There is, indeed, no domain of research,
interested in cognitive processes, that has not been concerned, at some point,
with these notions. At the same time, as many authors have noted, these seminal
elements of investigation are frequently obscured by terminological conflicts,
uncertainties and vagueness. Problematic as they may be, notions of information
and representation, or its models, even if only implicitly, are always present in
studies on cognitive systems, urging for reliable and sound theoretical basis.
North-American pragmatist Charles Sanders Peirce, founder of the modern
theory of signs, defined semiotics as a science of the essential and fundamental
nature of all possible varieties of meaning processes (semiosis). Peirce’s concept of
semiotics as the ‘formal science of signs’, and the pragmatic notion of meaning
as the ‘action of signs’, have had a deep impact in philosophy, psychology, 1
theoretical biology, and cognitive science.
The reader will find here a collection of papers that present, from different
perspectives, an attempt to relate semiotics and cognitive science with linguistics,
logic, and philosophy of biology. As a first broad account of those subjects, it does
not specifically focus on or privilege any of the different approaches that have
been proposed up to now, but instead gives the reader the opportunity to consider
the various directions and topics of research that emerge from such relations.
Several chapters focus on the logic of semiosis, and propose a range of
different analysis of the nature of mediation as an action of interpretation. Emmeche
and Hoffmeyer explore the explanatory power of the biosemiotic approach as an
alternative (or complementary) theoretical frame to the physicalist point of view.
Emmeche’s chapter comments upon some of the ‘open problems’ in artificial life
from the perspective of qualitative organicism and the emergent field of Peircean-
oriented biosemiotics. Hoffmeyer is interested in the rise and evolution of qualia.
According to the author, a scientifically consistent theory might be developed on
the basis of what he called semiotic materialism. Semiotic materialism considers
qualia as an evolved instantiation of a semiotic freedom that was latently present
in our universe from the beginning. It claims that our universe has a built-in
tendency to produce organized systems possessing increasingly more semiotic
freedom in the sense that the semiotic aspect of the system’s activity becomes
more and more autonomous relative to its material basis.
Brier proposes a transdisciplinary approach able to postulate a unified theory
of information, cognition and communication. His proposal involves the setting
of an epistemological framework that takes into account recent developments
from ethology, second order cybernetics, cognitive semantics and pragmatic
linguistics, and that builds on basic concepts from biosemiotics. According to
his view, all living organisms are immersed in a web of signs, participating in
what he defines as sign games, which eventually lead to wittgenteinian human
language games.
Chemero challenges the notion of information in Gibsonian-oriented
approaches to perception. He proposes an understanding of direct perception
and information that differs from the ecological psychology orthodoxy, the
Turvey-Shaw-Mace view. According to his view, perception is direct when the
perceiver and perceived are coupled and their relationship is unmediated by
mental representations.
Dynamical hypothesis on cognition, by rejecting the idea that cognition
is to be explained exclusively in terms of internal representations, are close to
Chemero’s approach. Two papers discuss the use of dynamical system theory
(DST) as a strategy for modeling cognition. Van Gelder is interested in exploring
the “essence of dynamical cognitive science”, and to show how it differs from
2 traditional computational cognitive science. His strategy is to oppose the
traditionalist slogan that cognitive agents are digital computers to the dynamicist
claim that cognitive agents are dynamical systems. Port’s argument is about
an essentially temporal (continuous, historical) feature of verbal language
generation and processing on a phonological level of description. He suggests
that, to constitute a general dynamic theory of language, from phonology to
semantics, we should understand linguistic events and structures as temporal
processes.
One of Peirce’s most original contributions to the studies on cognitive
inference is his development of the concept of abduction. However, Peirce’s
abductive inference received, so far, relatively little attention. In the last chapter
of this book, Bourgine proposes axiomatic and geometric models for abduction,
close to the framework of belief revision, in an effort to formally capture what
differentiates it from induction and deduction, while preserving a coherent
relation between the three types of reasoning envisaged by Peirce. Bourgine’s
models are exemplified in the context of three perspectives from the field of
cognitive science: cognitivism, connectionism and constructivism.
The various strategies presented here may be considered non-standard, and
therefore remain peripheral in their fields. It is still too early to properly evaluate
all the perspectives opened up by the research frontiers presented in this book.
Indeed, it is premature to assert that they may one day constitute new scientific
paradigms. What all these perspectives suggest, however, are alternative
approaches to basic principles of cognition, information and representation. By
offering innovative and consistent propositions, we hope that the ideas presented
in this book may constitute a fresh breath, and point out important new directions
to be followed in the future.
Acknowledgments
The authors would like to acknowledge the support received, in the form of research grants,
from FAPESP - The State of São Paulo Research Foundation.
3
4
A-life, organism and body:
the semiotics of emergent levels
Claus Emmeche
Biosemiotics. The study of living systems from the point of view of semiotics,
the theory of signs (and their production, transfer, and interpretation), mainly
in the tradition of the philosopher and scientist Charles Sanders Peirce (1839-
1914) but also inspired by the ethologist Jakob von Uexküll (1864-1944), has
a long and partly neglected history in 20th Century science (Kull 1999 for the
history, Hoffmeyer 1996 for an introduction). It re-emerged in the 1990s and is
establishing itself as a cross-disciplinary field attempting to offer alternatives
to a gene-focused reductionist biology (much like one of the aims of Artificial
Life, and indeed inspired by it), by gathering researchers for a new approaches
to biology, or a new philosophy of biology, or ultimately with the hope to bridge
the gab between science and the humanities. The semiotic approach means that
cells and organisms are not seen primarily as complex assemblies of molecules,
as far as these molecules — rightly described by chemistry and molecular biology
— are sign vehicles for information and interpretation processes, briefly, sign
action or semiosis.
A sign is anything that can stand for something (an object) to some
interpreting system (e.g., a cell, an animal, a legal court), where ‘standing for’
means ‘mediating a significant effect’ (called the interpretant) upon that system.
Thus, semiosis always involves an irreducibly triadic process between sign, object
and interpretant. Just as in chemistry we see the world from the perspective of
molecules, in semiotics (as a general logic of sign action) we see the world from
the perspective of sign action, process, mediation, purposefulness, interpretation,
generality. Those are not reducible to a dyadic mode of mechanical action-
reaction, or merely efficient causality. The form of causality governing triadic
processes is final causation. Organisms are certainly composed of molecules,
but these should be seen as sign vehicles having functional roles in mediating
sign action across several levels of complexity, e.g., between single signs in the
genotype, the environment, and the emerging phenotype.
Biosemiotics is a species of qualitative organicism for these reasons: (i) It
holds a realist position regarding sign processes of living systems, i.e., signs and
interpretation processes are not merely epistemological properties of a human
observer but exists as well in nature, e.g., in the genetic information system (El- 7
Hani et al. 2006). (ii) Biosemiotics interprets the teleology of sign action as related
to final causation (Hulswit 2002). (iii) The qualitative and species-specific ‘subject’
of an animal (i.e., its Umwelt understood here as a dynamic ‘functional circle’ of an
internal representation system interactively cohering in action-perception cycles
with an environmental niche) can to some extent be studied scientifically by
the methods of cognitive ethology, neurobiology and experimental psychology,
even though the experiential feeling of the animal is closed to the human
Umwelt (on Umwelt research, see papers in Kull 2001). (iv) Signs have extrinsic
publicly observable as well as intrinsic phenomenal aspects. We can only access
the meaning of a sign from its observable effects, a good pragmaticist principle
indeed, but observation of the phenomenal experiences of another organism may
be either impossible or highly mediated. However, reality exceeds what exists
actually as observable. (v) Even though sign activity generally can be approached
by formal and logic methods, sign action has a qualitative aspect as well. Due to
the principle of inclusion (Liszka 1996) every sign of a higher category (such as a
legisign, i.e., a sign of a type) includes a sign of a lower category (e.g., a sinsign,
i.e., a sign of a token. A type has somehow to be instantiated by a token of it,
just like any sign must be embodied). A symbol is not an index, but includes
an indexical aspect, which again involves an icon. All signs must ultimately
include (even though this might not be pertinent for their phenomenology)
qualisigns, which are of the simplest possible sign category, hardly functioning as
mediating any definite information, yet being signs of quality and thus having a
phenomenal character of feeling (Peirce preferred a type-token-tone trichotomy
for the type-token dichotomy; a tone is like a simple feeling). The argument (v)
may strike a reader not acquainted with Peirce as obscure, but it is a logical
implication of the ontological-phenomenological basis of Peirce’s semiotics and
points to an interesting continuity between matter, life and mind, or, to phrase it
more precise, between sign vehicles as material possibilities for life, sign action as
actual information processing, and the experiential nature of any interpretant of a
sign, i.e., the effects of the sign upon a wider mind-like system (Emmeche 2004).
To recapitulate, the biosemiotic notion of life is a notion of a complex web
of sign and interpretation processes, typically with the single cell seen as the
simplest possible autonomous semiotic system.
Biological life: The so-called life sciences are not interested in the life of the
Lebenswelt as a normative phenomenon, but in the general physical, chemical
and biological properties of life processes, as conceived of within separate
paradigms of biology. This leads to several distinct ‘ontodefinitions’ of life
(Emmeche 1997) such as life as evolutionary replicators, life as autopoiesis, or life
as sign systems (and probably many more). However, advances in biotechnology
and biomedicine will tend to mix up, ‘hybridize’ or create new boundary objects
(sensu Star & Griesemer 1989) between the domains of bioscience and a life-world
deeply embedded in technoscience.
Language, culture-specific
Complex dissipative, self-
Self-moving, action-
organic codes
kinesthetics
Lebenswelt
Table 1. Ordering relations between forms of embodiment. The epistemic dimension (top row) is shown
by organizing those forms according to different domains of science each constituting its own objects;
the ontic dimension (bottom row) is implied by an underlying ontology of levels of organization in
Nature. Increasing specificity from left to right; for each new level the previous one is presupposed.
Models of life
Pattee (1989) was emphatic about the distinction between a model of life and
14 a realization of some life process. In the early phase of Artificial Life research,
focus was put on the possibility of ‘life in computers’, and thus the question of
computational simulations vs. realizations was crucial. Considering the possibility
of a ‘wet’ bottom-up synthesis of other forms of life, we need to expand the kind
of analysis given by Pattee to include not only the role of computational models
in science in general and Artificial Life in particular, but also the very notion of a
model in all its variety, and especially the notion of model organisms in biology.
It is beyond the scope of this note to make any detailed analysis here, so in this
final section only some hints will be given.
Let us make a preliminary, almost Borgesian classification of models in
biology like the following.
Formal models and simulations. Highly relevant for ‘software’ Artificial Life. Such
models are, for their theoretically relevant features, computational and medium-
independent, and thus disembodied, and would hardly qualify as candidates
for ‘true’ or ‘genuine’ life, from the point of view of organicism or biosemiotics.
Semiotically, the map is not the territory; a model is not the real beast.
Evolutionary models. This label collects a large class of dynamic models not only
across the previous two categories (because they may be either computational
or mechanical, cf. also the field of ‘evolutionary robotics’) but also combining
evolutionary methods with real chemistries. Many sessions of previous Artificial
Life conferences have been devoted to these models.
Bottom-up models. The term ‘bottom-up’ may be used for all three major areas of
Artificial Life, in relation to ‘software’, ‘hardware’ and ‘wet’ models. Considering
only the later here (e.g., Szostac et al. 2001), the crucial question is to distinguish
between, on the one hand, a process aimed at by the researches which is truly
bottom-up emergent, creating a new autonomous level of processes such as
growth and self-reproduction pertaining to biofunctionality and biobodies,
and on the other hand, something more similar to engineering a robot from
pre-fabricated parts, that is, designing a functioning protocell but under such
special conditions that one might question its exemplifying a genuine agent or
organism. Just as exciting as they are as examples of advances in wet Artificial
Life research, just as perplexing are they as possible candidates for synthetic
true organisms, because their process of construction is highly designed by the
research team. In this way, they are similar to the ‘model organisms’ in classical
experimental biology, but with the crucial difference that no one doubts the later
to be organisms, while it is question begging to proclaim the former to be.
Conclusion
From an organicist perspective, real biological life involves complex part-whole
relationships, not only regarding the structured network of organism-organs-
cells relationships, but also regarding the environment-(Umwelt)-organism
16 relations. The biosemiotic trend in organicism is needed to understand natural
life (the plants and animals we already know) from a scientific perspective, but
is not enough to evaluate the complex question of “what is life?” as recently
raised by synthetic chemistry approaches to wet artificial life. Here, also more
ontological, metaphysical, and philosophy of science (and scientific models)
inspired considerations are needed. Some of these have been presented, other
just hinted at.
Acknowledgements
I thank Frederik Stjernfelt, Simo Køppe, Charbel Niño El-Hani, João Queiroz, Jesper
Hoffmeyer, Mia Trolle Borup and Tom Ziemke for stimulating discussions. The work was
supported by the Faculty of Science, University of Copenhagen, and the Danish Research
Foundation for the Humanities.
References
Barbieri, M. 2003. The Organic Codes. An introduction to semantic biology. Cambridge:
Cambridge University Press.
Bedau, M., McCaskill, J., Packard, N., Rasmussen, S., Adami, C., Green, D., Ikegami, T.,
Kaneko, K. and Ray, T. 2000. “Open problems in artificial life”. Artificial Life 6: 363-376.
Buss, L.W. 1987. The Evolution of Individuality. Princeton: Princeton University Press.
El-Hani, C. N. and Emmeche, C. 2000. “On some theoretical grounds for an organism-
centered biology: Property emergence, supervenience, and downward causation”.
Theory in Biosciences 119 (3/4): 234-275.
El-Hani, C. N., Queiroz, J., and Emmeche, C. 2004. “A Semiotic Analysis of the Genetic
Information System”. Semiotica 160 (1/4): 1-68.
Emmeche, C. 1994. The garden in the machine. The emerging science of artificial life. Princeton:
Princeton University Press.
__1997. “Defining life, explaining emergence”. Online paper at: http://www.nbi.dk/
~emmeche/
__2000. “Closure, Function, Emergence, Semiosis and Life: The Same Idea? Reflections on
the Concrete and the Abstract in Theoretical Biology”. In: J. L. R. Chandler & G. Van
de Vijver, eds.: Closure: Emergent Organizations and Their Dynamics. Annals of the New
York Academy of Sciences 901: 187-197.
__2001. “Does a robot have an Umwelt? Reflections on the qualitative biosemiotics of Jakob
von Uexküll”. Semiotica 134 (1/4): 653-693.
__2004. “Causal processes, semiosis, and consciousness”. In: J. Seibt (ed.), Process Theories:
Crossdisciplinary Studies in Dynamic Categories. p. 313-336. Dordrecht: Kluwer.
Gilbert, S. F. & Sarkar, S. 2000. “Embracing complexity: Organicism for the 21st Century”.
Developmental Dynamics 219: 1-9.
Haraway, D. 1991. Simians, Cyborgs and Women: The Reinvention of Nature. New York:
Routledge.
Hoffmeyer, J. 1996. Signs of Meaning in the Universe. Bloomington: Indiana University Press.
Hofstadter, D.R. 1979. Gödel, Escher, Bach: an Eternal Golden Braid. London: The Harvester
Press.
Hulswit, M. 2002. From Cause to Causation. A Peircean Perspective. Dordrecht: Kluwer.
Kauffman, S. 2000. Investigations. Oxford: Oxford University Press.
Kember, S. 2003. Cyberfeminism and Artificial Life. London & New York: Routledge.
Kohler, R.E. 1994. Lords of the Fly. Drosophilae Genetics and the Experimental Life. Chicago:
The university of Chicago Press. 17
Kull, K. (ed.) 2001. Jakob von Uexküll: A paradigm for biology and semiotics. Berlin & New York:
Mouton de Gruyter (= Semiotica 127(1/4): 1–828).
__2000. “An introduction to phytosemiotics: Semiotic botany and vegetative sign systems”.
Sign Systems Studies 28: 326-350.
__1999. “Biosemiotics in the twentieth century: A view from biology”. Semiotica 127(1/4):
385–414.
Latour, B. 1991. We have never been Modern. New York: Harvester Wheatsheaf.
Liszka, J. J. 1996. A general introduction to the semeiotic of Charles Sanders Peirce. Bloomington:
Indiana University Press.
Mayr, E. 1997. This is Biology. The science of the Living World. Cambridge: Harvard University
Press.
Nöth, W. 2003. “Semiotic machines”. S.E.E.D. Journal (Semiotics, Evolution, Energy, and
Development) 3 (3): 81-99.
Pattee, H.H., 1989. “Simulations, realizations, and theories of life”. In: Artificial Life (Santa Fe
Institute Studies in the Sciences of Complexity, Vol. VI), C.G. Langton (ed.). pp. 63-77.
Redwood City: Addison-Wesley.
Rasmussen, S., Chen, L., Deamer, D., Krakauer, D., Packard, N., Stadler, P., Bedau M. 2004.
“Transitions from nonliving and living matter”. Science 303: 963-965.
Reynolds, A., 2002. Peirce’s Scientific Metaphysics. The Philosophy of Chance, Law, and Evolution.
Nashville: Vanderbilt University Press.
Rosen, R. 1991. Life Itself. A Comprehensive Inquiry Into the Nature, Origin, and Fabrication of
Life. New York: Columbia University Press.
Searle, J. 1992. The Rediscovery of the Mind. Cambridge, Mass.: MIT Press.
Sheets-Johnstone, M. 1999. The Primacy of Movement. Amsterdam & Philadelphia: John
Benjamins.
J. Szostak, J., Bartel, D., Luisi, P. 2001. Synthesizing life. Nature 409: 383-390.
Star, S.L. & Griesemer J.R. 1989. “Institutional ecology, ‘translations’, and Boundary Objects:
amateurs and professionals in Berkeley’s Museum of Vertebrate Zoology, 1907-39”.
Social Studies of Science 19: 387-420.
Stjernfelt, F. 2002. “Tractatus Hoffmeyerensis: Biosemiotics expressed in 22 basic
hypothesis”. Sign Systems Studies 30(1): 337-345.
Van de Vijver, G., S. Salthe, S. and Delpos, M. (eds.) 1998. Evolutionary Systems: Biological and
Epistemological Perspectives on Selection and Self-Organization. Dordrecht: Kluwer.
von Uexküll, T. 1986. Medicine and semiotics. Semiotica 61 (3/4): 201-217.
Wicken, J.S. 1987. Evolution, Thermodynamics, and Information. Extending the Darwinian
Program. Oxford: Oxford University Press.
Ziemke, T. 2003. “What’s that thing called embodiment?”. In: R. Alterman and D. Kirsh
(eds.), Proceedings of the 25th Annual Meeting of the Cognitive Science Society. p. 1305-
1310. Mahwah, NJ: Lawrence Erlbaum.
Ziemke, T. and Sharkey, N. E. 2001. “A stroll through the worlds of robots and animals:
Applying Jakob von Uexküll’s theory of meaning to adaptive robots and artificial
life”. Semiotica 134(1-4): 701-746.
18
Semiosis and living membranes
Jesper Hoffmeyer
23
ORIGIN OF LIFE
Five necessary steps
1. Autocatalytic closure (Kauffman)
2. Inside-outside asymetry (closed surface)
3. Proto-communication (a comunity of surfaces)
4. Digital redescription (code duality)
5. Formation of an interface (inside-outside loops)
Figure 2. Five necessary steps on the road to the origin of life. See text for discussion.
Figure 3. Lypid bilayers. The hydrophilic “heads” and “tails” of phospholipids cause them to form
bilayers when dispersed in water. Bilayers tend to form closed vesicles.
join with each other by fusion and to divide by fission, while always maintaining
a closed, vesicular shape.” (de Duve 1991).
Now, while such closed membrane systems may have formed rather easily
in the pre-biotic world, they would probably also die out relatively quickly
unless they had acquired a capacity to canalize a selective flow of chemicals
(“nutrients” and “waste”) across the membrane. But in themselves lipid bilayers
are very impenetrable and in cell membranes as known today the transport of
chemicals in and out of the cell is only possible because of the inclusion into the
membrane itself of thousands of rather complex protein molecules (figure 4). For
this reason de Duve in his scenario for the origin of life does not favor membrane
encapsulation as an early step.
Here however we are concerned with the chemistry of the process only in
the trivial sense that what is suggested should conform to the chemical evidence
we have, and that evidence is certainly very speculative. Whatever did happen
at the chemical level and whatever the order of succession actually was what I
claim here only is that from a conceptual point of view membrane formation was
THE decisive step in the process. Before membrane closing occurred there could be
no inside-outside asymmetry and thus no communication between systems. And
my guess is that it is only because pre-biotic systems reciprocally dragged each
26 other into a communicative network that they could muster the creativity needed
for the gradual construction of a true cell. The continuous interaction between
processes of “invention” and “interpretation”, which I have termed SEMETIC
INTERACTION, would then have been put in play (Hoffmeyer 1997b).
If for instance at some locality conditions allowed for the production
of swarms of such closed membrane systems one might eventually obtain a
higher level autocatalytic closure so that the outputs from one entity served as
inputs to other entities and vice versa. In such a swarm one might say that the
closed membrane systems had acquired a germ form of other-reference or proto-
communication (step 3 in the figure 2).
Still, in my terminology this would not qualify for Umwelt possession
since at this stage the system still has no self-referential dynamics. For self-
referential dynamics to occur the system needs what Kauffman has called a
“written record”, i.e. the spatially organized components of the system should
somehow become re-described in the digital alphabet of DNA or RNA thereby
forming what Claus Emmeche and I have called a code-duality (step 4 in figure
2). Code-duality refers to the idea that organisms and their DNA are both carriers
of a message sent down through generations: Living systems form a unity of two
coded and interacting messages, the analogically coded message of the organism
itself and its re-description in the digital code of DNA. As analog codes, the
organisms recognize and interact with each other in the ecological space, giving
rise to a horizontal semiotic system, while as digital codes they (after eventual
recombination through meiosis and fertilization in sexually reproducing species)
are passively carried forward in time between generations (Hoffmeyer and
Emmeche 1991).
So far a system has been established which —seen from the observer’s point
of view— has an obvious interest in maintaining the needed flow of chemicals
across the surface. But the system still has no way to assist the fulfillment of its
own “interest”, it has no mechanism for goal-oriented modification or action.
Thus the system is not an agent in its own interests. It doesn’t matter to the
system whether it can distinguish features of its environment or, in other words,
it has not yet acquired the capacity for making distinctions.
What is needed in addition to the DNA-record is the formation of a
feedback link between DNA and environment, so that events outside the system
become translated into appropriate events inside the system (Weber et al. 1989).
The membrane in other words must turn into an interface linking the interior
and the exterior (step 5 in figure 1). Only then does the system’s understanding
of its environment matter to the system: relevant parts of the environment
becomes internalized as an “inside exterior”, the Umwelt, and in the same time
the interior becomes externalized as an “outside interior” in the form of “the
semiotic niche”, i.e. the diffuse segment of the semiosphere which the lineage
has learned to master in order to control organism survival in the semiosphere 27
(Hoffmeyer 1996). I see this as the decisive step in the evolutionary process of
attaining true semiotic competence, i.e. the competence to make distinctions in
space-time where formerly there were only differences. The semiotic looping of
organism and environment into each other through the activity of their interface,
the closed membrane, also lies at the root of the strange future-directedness or
Figure 4. Model of membrane structure showing the integration of specialized proteins into the lipid
bilayer. Membrane bound proteins may perform a variety of functions such as recognition of molecular
messages from other cells or transport of specific molecules across the membrane. A hypothetical
channel for water and certain ions between several protein subunits is shown.
“intentionality” of life, its “striving” towards growth and multiplication. The
spatial asymmetry between the “inside interior” and the “outside exterior” is
coupled to the time asymmetry implicit in the self-referential mechanism of DNA
re-description followed by cell division.
Dynamic boundaries
If the prototype tool is a hammer the prototype organism is probably a dog (in
the western world at least). Like ourselves dogs have relatively well defined
boundaries, they are mortal individuals and they cannot be at two places at the
same time, but have to move in order to get food. Such organisms have been
called determinate organisms. Most organisms in the world however are not at all
like this. The life of fungi for instance is a constantly changing interplay between
dissociation and association generating varied patterns in the interconnected,
protoplasm-filled tubes (hyphea) that spread through and absorb sources of
nutrients. The hyphea branch away from one another (i.e. dissociate) most
prolifically when nutrients are freely available, but re-associate to form such
structures as mushrooms when supplies are depleted. Fungi exemplifies what
might been called indeterminate organisms, and according to the British biologist
Alan Rayner “The fungi are an entire kingdom of organisms: their total weight
may well exceed the total weight of animals by several times and there are many
more species of them than there are of plants! In many natural environments fungi
provide the hidden energy-distributing infrastructure —like the communication
pipelines and cables beneath a city— that connects the lives of plants and animals
in countless and often surprising ways” (Rayner 1997: vii).
Indeterminate organisms possess expandable or “open” boundaries that
enable them to continue to grow and alter their patterns indefinitely. Such
organisms are potentially immortal. Thus, in a Canadian forest one individual
rhizomorphic fungus, Armillaria bulbosa, is reported to have produced a network
some 1500 ha in area and estimated to weigh 100 tones and to be 1500 years old
(Rayner 1997: 130). Only extreme conditions would kill this exemplar.
The extreme emphasis on the regenerative aspect of life, proliferation
and reproduction, which is the essence of modern gene centered evolutionary
thinking does not work well in the world of indeterminate creatures. Here
processes of boundary-fusion, boundary-sealing and boundary-redistribution
all provides means for reducing dissipation, allowing energy to be maintained
within the system rather than lost to the outside. Such processes lead to more
persistent organizations in which individuality is blurred. For illustration let us
consider a few of the many cases offered in Alan Rayner’s book (Rayner 1997):
Heartrot is intuitively conceived as a disease but in actual fact it is part
of a quite normal recycling process, i.e. a process where internal partitioning
allows resources to be redistributed from locations that no longer participate
in energy gathering or exploration to sites where these processes are being
sustained. Heartrot is caused by the degradation by fungi of the predominantly
30 dead wood of mature trees. Heartrot thus results in the hollowing of tree trunks
which provides a huge variety of habitats for animals as well as allowing the
tree to recycle itself by proliferating roots within its own internal ‘compost’ of
decomposing remains that accumulate within the cavity (Rayner 1997:179).
Another illustrative example is the partnership formed by the majority
of higher plants with fungi, so-called mychorrhizas. Here the funguses not
only provide their plant partners with improved access to mineral nutrients
and water in exchange for organic compounds produced by photosynthesis.
The mycorrhizal mycelia are also thought to provide communication channels
between plants enabling adult plants to “nurse” seedlings through fungal
“umbilical cords” to reduce competition and to enhance efficient usage and
distribution of soil nutrients. There is a risk to this invention,, however, because
“pirating plants”, by tapping into mycorrhizal networks, may indirectly divert
resources from the participating plants (Rayner 1997: 63).
These are both cases of mutual symbiosis, of course, and symbiosis in general
is probably the most underestimated aspect of evolutionary creativity (cf. Sapp
1994). The one-eyed focus on the reproductive aspect of life, the proliferation of
successful genes, systematically weeds out the heterogeneous contexts in which
organisms always live. These contexts do not only consist in material interactions
but also always include subtle semiotic interactions mediated by environmental
cues of all kinds. Thus, traditional symbiosis should be seen as just a particular
kind of a much more widespread eco-semiotic integration (Hoffmeyer 1997c).
The absurdity of conflating all of this into one simple measure of genetic fitness
becomes especially obvious when considering the world of indeterminate
organisms where individuality and mortality are only loosely connected, and
where the dynamic boundaries in space and time are not defined by their genetic
set-up. The evolution of boundaries and the evolution of the contexts in which
they put themselves are assisted by, not caused by, genetic inventions.
Membrains
The distinction between determinate and indeterminate organisms is itself
indeterminate of course, no organism is completely determinate or completely
indeterminate. Complicated functions such as photosynthesis, capture of prey,
ingestion of food and reproduction requires a high degree of adaptive refinement
and in order to handle these tasks even indeterminate organisms regularly
produce highly prescribed determinate offshoots, e.g. flowers, fruits, leaves,
and fungal fruit bodies. While these offshoots often attract the interest of the
human observer it is nevertheless the indeterminate part of the system “that
generates the offshoot and regulates the interrelationships between offshoots by
providing interconnected pathways whose variably deformable and penetrable
boundaries outline the channels of communication within the system. Determinate
superstructure is integrated by indeterminate infrastructure” (Rayner 1997: 80, my
italics). 31
This pattern can be generalized since even clearly determinate organisms
such as dogs or human beings are dependent on indeterminate infrastructure
in the form of the vascular system and the nervous system. The pattern of
development of nervous and vascular infrastructures resembles that of a
foraging mycelium says Rayner: “Like the hyphal tubes of a mycelium they may
be variably partitioned and cross-linked, and their lateral boundaries are variably
insulated so as to receive and distribute input with minimal dissipation” (Rayner
1997: 141).
There seems to be a deep logic in this arrangement. Nervous systems and
brains never developed in plants or fungi. From the beginning these structures
were connected to the need of animals to move for purposes of flight or foraging.
Nerve cells became specialized for the “long distance” communication needed in
order to co-ordinate the activities of body parts too far apart for quick interaction
through traditional cell to cell communication. Thus, determinacy of the body
structure, was compensated through the indeterminacy of their movements. As
Rayner observes: “Animals...follow and create ‘trajectories’ in their surroundings
as they change their position and behavior over time and interact with one
another. If these trajectories are mapped, they often exhibit a clear but irregular
structure, similar to the body boundaries of indeterminate organisms like plants
and fungi” (Rayner 1997: 70).
The plasticity of the sensorimotor system of determinate organisms, in
other words, assures the capacity for “capitalizing on opportunities” which
in indeterminate organisms are assured simply through the plasticity of their
boundaries. It follows that the brains, which gradually evolved to guide the
activity of animals, could not themselves be determinate but would have to
retain a plasticity, which could match the unpredictable semiotic heterogeneity
of the environment.
While morphogenesis in general is guided by local cell to cell interactions
neurons can be in direct contact with many cells that are located quite far apart
from one another, by virtue of their long output (axons) and input (dendrite)
branches. This allows populations of cells distant apart from one another in
the brain directly to interact and influence one another so that a non-local
developmental logic is superimposed on top of the local regional differentiation
that preceded it. And this is the key to the adaptive plasticity of the developing
brain since “it makes it possible for the nervous system as a whole to participate
actively in its own construction” (Deacon 1997: 195).
Loosely sketched what happens is a kind of neuronal selection process.
The developing brain produces a surplus of nerve cells and each of these
nerve cells produces far more branches of their growing axons than will finally
become functional synaptic connections. Only a fraction of the newly produced
32 connections will happen to get involved in persistent coordinated activity while
the remainders are eliminated in a competition between axons from different
neurons over the same synaptic targets. Nerve cells, which do not succeed in
making any synaptic contributions, are persuaded to commit suicide, so that
only some 60% of the totality of nerve cells originally produced will survive into
adulthood.
That experience from the outer world influences the pattern of neuronal
cell death and synaptic strengthening was illustrated by some rather harsh
experiments done to newborn kittens. If the right eye of kittens were sewed shut
during a critical period of early life they would develop functional blindness on
that eye for the rest of their lives. It could be shown that this was not due to lack
of experience as such. What happened was that synaptic competition eliminated
all those connections to the visual cortex that were derived from the passive right
eye leaving all the available synaptic space for left eye connections to capture. The
patterns of impulses emitted through the optical nerve from the right eye never
reached the visual cortex (Gilbert 1991: 644).
The indeterminacy of the brain is its strength. In Terrence Deacon’s words:
“Cells in different areas of the brain are not their own masters, and have not been
given their connection orders beforehand. They have some crude directional
information about the general class of structures that make appropriate targets,
but apparently little information about exactly where they should end up in a
target structure or group of target structures. In a very literal sense, then, each
developing brain region adapts to the body it finds itself in....There need be no
‘pre-established harmony’ of brain mutations to match body mutations, because
the developing brain can develop a corresponding organization ‘on line’, during
development” (Deacon 1997: 205).
For decades neuroscientists have tried to model the brain as a computational
organ. Brain cells were seen as logical gates, adding and subtracting input spikes
until some threshold level of charge is breached, at which point they convulse to
produce a spike of their own. This all-or-nothing nature of a nerve cell’s firing was
thought to overcome the usual soupy sloppiness of cellular processes, i.e. to bring
it into an area of digital calculation. But 30 years of research along these simplistic
ideas has not been able to give any real answers to how the brain works. And it
has now become clear that the output of any individual neuron depends on what
the brain happens to be thinking at the time. Not the single firing nerve cell but
the self-organizing and ever-changing global pattern of activity all over the brain
seem to hold the key to an understanding of its capacity to produce the mental
phenomena we all experience.
Thus on top of the indeterminacy of the growing brain comes an
indeterminacy at the level of synaptic connections, which furthermore makes up
for an indeterminacy of global activity pattern formation. And finally on the top of
all this plasticity comes yet another level of indeterminacy: Language. “Language 33
ran its hyphae far into the nervous system allowing, today, no hope of excision
—not even in theory. Language does not think through us but it has become a
part of us. And yet language is common property and, hence, extraneous to us”
(Hoffmeyer 1996: 112).
Terrence Deacon in his recent book “The Symbolic Species” suggests that
languages have adapted to children’s brains much more than the brains have
evolved to become linguistic. This would explain the mystery of how children
can learn to talk in spite of the often postulated unlearnability of language:
“Human children appear preadapted to guess the rules of syntax correctly,
precisely because languages evolves so as to embody in their syntax the most
frequently guessed patterns. The brain has co-evolved with respect to language,
but languages have done most of the adapting.” (Deacon 1997: 122). Thus, there
is no need for postulating innate linguistic knowledge.
With the invention of speech the Umwelts of individual organisms gradually
gave way to the idea of the one and only world. The searching membranes of
life had at length found the means for a productive non-local association with
other membranes thereby creating what the American neurophysiologist Walter
Freeman has called “societies of brains” (Freeman 1995). And this is where the
idea of objectivity is rooted, i.e. in the social nature of human knowledge. Our
ingenious membrains communicate directly with other membrains attempting to
construct the world in the image of the collective, i.e. in a view from nowhere.
But membrains are mortal, they have a beginning and an ending. In between
is a life story, which cannot be thought away as nowhere business.
References
Churchland, Paul M. 1991. “Folk Psychology and the Explanation of Human Behavior”.
In J.D. Greenwood (eds.). The Future of Folk Psychology. Cambridge: Cambridge
University Press,
Churchland, Patricia S. 1986. Neurophilosophy: Towards a Unified Theory of Mind/Brain.
Cambridge: MIT Press/A Bradford Book.
Deacon, Terrence 1997. The Symbolic Species. New York: Norton.
de Duve, Christian 1991. Blueprint for a Cell. The Nature and Origin of Life. Burlington: Neil
Patterson Publishers.
Dennet, Daniel C. 1991. Consciousness Explained. London: Allan Lane, Penguin Press.
Freeman, Walter J. 1995. Societies of Brains. A study in the Neuroscience of Love and Hate. New
Jersey: Lawrence Erlbaum.
Gilbert, Scott F. 1991. Developmental Biology. Sunderland: Sinauer.
Goodwin, Brian 1995. How the Leopard Changed its Spots. London: Phoenix Giants.
Hoffmeyer, Jesper 1992. “Some Semiotic Aspects of the Psycho-Physical Relation: The
Endo-Exosemiotic Boundary”. In Thomas A. Sebeok and Jean Umiker-Sebeok (eds.).
Biosemiotics: The Semiotic Web 1991. Berlin: Mouton de Gruyter, 101-123.
__1995. “The Swarming Cyberspace of the Body”. Cybernetics & Human Knowing 3(1): 1-10.
__1996. Signs of Meaning in the Universe. Bloomington: Indiana University Press.
__1997a. “Semiotic Emergence”. Revue de la Pense d’aujourd’hui 25-7(6): 105-17 (in japanese
34 language).
__1997b. “The Swarming Body”. In Irmengard Rauch and Gerald F Carr (eds.) Proceedings
of Semiotics Around the World. Proceedings of the Fifth Congress of the International
Association for Semiotic Studies, 937-940. Berkeley 1994: Berlin: Mouton de Gruyter.
__1997c. “Biosemiotics: Towards a New Synthesis in Biology”. European Journal for Semiotic
Studies, 9(2): 355-376.
__1998a. “The Unfolding Semiosphere”. In Gertrudis van de Vijver, Manuela Delpos and
Stanley Salthe (eds.) Evolutionary Systems. Dordrecht: Klüwer.
__1998b. “Anticipatory Surfaces”, Cybernetics and Human Knowing, 5 (1), 33-42.
Hoffmeyer, Jesper and Claus Emmeche (1991). “Code-Duality and the Semiotics of Nature”.
In Myrdene Anderson and Floyd Merrell (eds.). On Semiotic Modeling. New York:
Mouton de Gruyter, 117-166.
Kauffman, Stuart (1995). At Home in the Universe. New York: Oxford University Press.
Matsuno, Koichiro (1989). Protobiology: Physical Basis of Biology. Boca Raton: CRC Press.
__(1996). “Internalist Stance and the Physics of Information”. Biosystems 38: 111-118.
Matsuno, Koichiro and Stanley Salthe (1995). “Global Idealism/Local Materialism”. Biology
and Philosophy 10: 309-337.
Meystel, Alex (1998). “Multiresolutional Umwelt: Toward Semiotics of Neurocontrol”. In
Jean Umiker-Sebeok (ed.) Semiotics in the Biosphere: Reviews and a Rejoinder. Special
Issue of Semiotica, 120 (3/4).
Monod, Jacques (1971). Chance and Necessity. New York: Knopf.
Nagel, Thomas (1986). The View from Nowhere. Oxford: Oxford University.
Rayner, Allan D. M. (1997). Degrees of Freedom. Living in Dynamic Boundaries. London:
Imperial College Press.
Salthe, Stanley 1998. “Naturalizing semiotics”. In Jean Umiker-Sebeok (ed.) Semiotics in the
Biosphere: Reviews and a Rejoinder. Special Issue of Semiotica 120 (3/4).
Sapp, Jan 1994. Evolution by Association. A History of Symbiosis. New York: Oxford University
Press.
Swenson, Rod 1989. “Emergent Attractors and the Law of Maximum Entropy Production”.
Systems Research 6: 187-197.
Swenson, Rod and M. T. Turvey 1991. “Thermodynamic Reasons for Perception-Action
Cycles”. Ecological Psychology 3(4): 317-348.
Uexküll, Jakob von 1982 [1940). “The Theory of Meaning”. Semiotica 42(1): 25-87.
35
36
Developing biosemiotics into cybersemiotics
Søren Brier
In the last decade there has been a growing interest in making a new
transdisciplinary science that could grasp, understand and manipulate the
informational aspect of nature, culture and technology. Especially a better
understanding of the semantics of cognition and the representation of knowledge
in texts compared to the way a computer represents and manipulates linguistic
knowledge has been looked for to improve the technological evolution of the
international computer network’s ability to handle the semantic aspects of text
and speech. But at the same time the debate about how to define information
has lead deep into the question of the nature and limits of scientific knowledge
and what kinds of reality we are dealing with. To formulate cognitive and
informational science frameworks is to go deep into the philosophical foundation
of science especially the epistemological aspect.
In science the universe has generally been considered to consist of matter
and energy, but following Norbert Wiener’s declaration that ‘information is
information and not energy or matter’ the question has been raised weather 37
information is a third true constituent of our basic reality as science sees it (Hayles
1999). One of the most clear and outspoken promoters of this view is Stonier
(1990, 1992, 1997). I have discussed this idea of a unified theory based on the idea
of objective information in Brier (1992 and 1996c, d). The main problem perhaps
is that it has very little to say about semantics and signification in living systems
as also Hayles (1999) points out.
An international program on the Foundation of Information Science (FIS) is
now evolving. In Brier (1997) I argue for my own approach to this project, which
shortly stated is the following: A truly scientific theory of information, cognition
and communication has to encompass the area covered by social sciences and
humanities as well as biology and the physio-chemical sciences. A genuine
transdisciplinarity is necessary if we want to understand information, cognition
and communication in natural, living, artificial and social systems in a broadly
based scientific theory. A way to connect the phenomenological view from within
with a theory of behavior and language is crucial for such an enterprise —in short
a theory of signification.
In my opinion we have to look for a theory, which is on the one hand not
mechanistic —because our scientific results so far do not support the belief
that semantics is mechanistically explainable. The evolutionary view of reality
seems to be well supported by empirical data and therefore a necessary basis
of a worldview. As Prigogine and Stengers (1985) argues and demonstrate then
evolution cannot be understood on a mechanistic foundation, but must be based
on a complexity foundation at least as the thermodynamic, with a probability
function and the irreversibility coming from the second law of thermodynamics of
the irreversibly growth of entropy. But this is still not enough to make a foundation
for understanding the emergence of life and mind (as inner life, emotion, will
and qualia). On the other hand idealistic, transcendental phenomenological
views like Husserl’s, social constructivism, or as an alternative, a radical social
constructivist epistemology seems unable to account for the genuine aspect of
reality that science has uncovered and the connection and continuity between
nature and culture (discussed in Brier 1993b and 1996b). Based on our personal
and intersubjective experiences with forces like electromagnetism and gravity, I
find it necessary to hold on to some kind of realism. As we also find it problematic
and unnecessarily complicated to establish a non-empirical qualitative different
mental world, and still have to explain how that can have causal influence on
matter, I want to adhere also to a monism. A monism viewed as the general
scientific idea that everything in the whole Universe has —at least from the
beginning— been developed from the same “stuff.” But within a traditional
physicalistic understanding, may it be deterministic as in classical physics or
probabilistic as in thermodynamics and cybernetics, this “stuff of the world”
38 is too reductionistic to explain life and mind. But I also find the attempt to
explain life and mind away in an eliminative materialism paradoxical and self-
refuting (Churchland 1986, Churchland 1995). With Peirce, I therefore prefer the
Aristotelian concept of ‘hyle’ that through its continuity or field idea of matter
first brings us closer to the quantum field framework, and second opens for the
present science the obscure idea that matter can have an internal aspect of life and
mind. This is what Peirce calls ‘pure feeling’.
To make a realistic, evolutionary and non-mechanistic cognitive science was
actually what Lorenz and Tinbergen set out to do when they created the science
of “Ethology”. From the beginning in the 1930’es ethology was based on the three
theoretical foundations of modern biology: The theory of evolution, the ecological
theory and modern population genetics, plus the method of comparative anatomy
transferred to instinctive movements (Lorenz 1970-1971). From this foundation,
Lorenz and Tinbergen, especially, developed a theory of innate release response
mechanisms fueled by specific motivational energy and released by innate sign
stimuli. Although the theory is very compatible with Freud’s psychoanalytical
theory, it was never able to deal with the phenomenological aspect in a theoretical
consistent and constructive way (Brier 1993b, 1998a).
In the present paper I want to further develop the epistemological framework
of ethology and evolutionary epistemology through a biosemiotic founding of
basic concepts in the light of the problem of establishing the reality of qualia in
a materialistic evolutionary cognitive biology and from there to the semantic
level of meaning in human language communication. I see the development of a
non-reductionistic biosemiotics as essential for making a general framework for
information, cognitive and communication studies.
What is interesting and fruitful about Lorenz’s biological theory of animal
behavior is the attempt to make a cognitive science based on biological theory
surpassing on the one hand the reductionism based on the mechanisms of
physics and chemistry and on the other hand the vitalism of Drietsch and
others. Both Lorenz and Tinbergen were aware of the fact that animal instinctive
behavior is largely inherited. A good theory of genes was not available at the
time, but heredity was well known and supposed to have a material basis in the
chromosomes, and population genetics was under development. Morphology
was well studied, and, according to the Darwinian paradigm, it was studied
from the angle of survival value of animal behavior. One of the puzzles was how
animal instinctive behavior and learning could at the same time be hereditary
and purposeful. There was no doubt that animals had a selective perception,
and related to certain events as biologically meaningful to their survival, when
they appeared in certain situations, depending on the animals mood. But
neither Lorenz nor Tinbergen managed, in my opinion, to formulate the needed
integrative evolutionary-ecological theory for cognitive science that could be an
alternative to the objectivism of modern cognitive science and its information- 39
processing paradigm (Lakoff 1987, Brier 1992, 1996 b).
They did manage to make a theory of genetic preprogrammed behavior
and learning, showing how perception was dependent on specific kinds of
partly self-energizing specific motivations that were also regulated by age, sex,
physiological needs and time of the year. But especially the foundation of the
concept of motivation and its relation to emotions and consciousness has not
found a broadly accepted form (Hinde 1970, Reventlow 1970, Brier 1980). This
has limited its usefulness in the human sphere and most ethologists do not
use the concepts anymore. But, at least, a new ‘cognitive ethology’ has been
developed by Mark Bekoff (Colin and Bekoff, 1997; Bekoff, Mark, Allen, Colin,
and Burghardt, Gordon M. 2002.). It recognizes the inner life of animals as a
causal factor, but unfortunately only argue for its existence from the common
traits of humans and animals and observations of animal behavior, but like
Lorenz fails to develop a new theoretical framework to sustain and develop its
scientific concepts. Among other things, this is what I hope to accomplish with
my cybersemiotic framework.
On the other hand a very important conclusion in Lakoff (1987) is that our
biology is decisive for the way we formulate concepts and make categorizations.
He points out that linguistics lacks a broader theory of motivation based on
embodiment to understand how we extend metaphors from the concrete to
the abstract in a meaningful way and to explain how we organize concepts into
different types of categories. He points out that cognitive models are embodied,
or based on an abstraction of bodily experiences, so that many concepts, contents
or other properties are motivated by bodily or social experience in a way that goes
beyond the usual linguistic idea of motivation. Only in this way do they make
sense thereby providing a non-arbitrary link between cognition and experience
that is not logical in the way we usually understand it. This means that human
language is based on human concepts that are motivated by human experience. It
is easier to learn something that is motivated than something that is arbitrary or
logically arranged. So one of Lakoff’s conclusions is that motivation is a central
phenomenon in human cognition especially in categorization. Lakoff also points
out that motivational categorization in humans is based on idealized cognitive
models (ICMs) that are the result of accumulated embodied social experience and
gives rise to certain anticipations.
This fits very well with the ethological thinking around concepts like
sign, stimuli and, imprinting (Brier 1995 and 1998), but unfortunately its
very physiological and energy-oriented models of motivation, cognition and
communication are not developed enough to encompass the area from animal
instinctive communication to human linguistic behavior. A further development
is needed that focus more on signification and communication. From the other
40 end, it is a problem that Lakoff only develops a rather simplistic model of bodily
kinetic-image schemata as the source of metaphor a metonymy. I tend to think that
one could develop the theory much further if one also draws on a combination
of ethological and psychoanalytical knowledge of the connection between
motivational states and the cognition of phenomena as meaningful signs.
Anticipatory behavior in animals
Based on the results of ethology I propose that all perceptual cognition is
anticipatory (Brier 1993b). According to my ethological model of instinctive
reaction, the fixed action patterns that are the behavioral part of the animal
instinct are only released by the innate release mechanism if motivationally
borne pattern recognition appears. The perceptions that release the more or less
hereditary innate release response mechanism, if it is properly motivated, are
called sign stimuli. Ethology enumerates many different motivations and some
of them species specific (Brier 1993b). Most living systems have great problems
in perceiving something not biological, psychological or socially anticipated. So
from an ethological point of view one should include the action of the subject
and its motivational value into a model of the dynamics of behavior. This brings
ethology’s theoretical framework close to the crucial concept of intentionality
in Husserl’s Phenomenology. It therefore makes sense to view animal instincts
with their specific motivation for fight, mating, hunting etc., as bio-psychological
expectations or anticipations.
Some of these anticipations are completely innate, such as the hunting and
mating behavior of the digger wasp, which have no contact whatsoever with its
parents. It gets out of its egg buried in a little cave under the surface of the ground
and eats the prey put there by its mother. It does not encounter other species
members in its larval stage and nobody teaches it anything. Still the wasp is able
to hunt and mate when it meets the relevant other living systems. So this is a
rather closed inheritance based system.
Some behavior systems in other species are partly open for variations in
what is anticipated as e.g. in the imprinting of ducklings who will follow the
first big moving object expressing sound they see within a certain time span after
they have hatched and later on will choose a mate like it. Many birds have a basic
song but have to listen to other members of the species to learn the full song, but
they will not learn the song of another species. But again some bird species can
include the song of other species and natural sounds and make different kinds of
variations some even as ongoing improvisations.
It is clear to Lorenz that emotions have functions and survival value.
Wimmer (1995) gives a further development of this kind of science about
emotion, which one can also find in a cybernetic version in Bateson’s (1972) work
where it is viewed as inter-individual relational logic. But the problem is that it
is still a purely functionalistic description not really able to explain how certain
things and events becomes significant for the living system in such a way that 41
they see them as a sign for something emotional, existential and vital for their
self-organized system (Brier 1992).
The crux of the matter is the problem of the relation of motivation,
intentionality and feelings to the experience of anything as being meaningful.
So far, no functionalistic model of explanation of behavior, perception or
communication has been able to, alone, account, in a sufficient way, for the wills
and emotions of the minds of animals and man. This problem is also crucial in
the discussion of what information is and what the foundation of information
science should be. The foundation of meaningful experience, categorization and
communication is the crucial question that cognitive science should solve, as for
instance pointed strongly out by Lakoff (1987).
On one side we have the mathematical information theory, which in
Wiener’s cybernetics is connected to thermodynamics. Cybernetics integrates
the ideas of computing —formalized among others by Turing— and the idea
of artificial intelligence and the functionalist information concept and this
mixture is often used in cognitive sciences. Today these trends are united in “The
Information Processing Paradigm” (See Brier 1997 for further argumentation and
documentation).
On the other side we have phenomenology, hermeneutics and semiotics. They
are the traditional humanistic disciplines of meaning, signification, mediation,
interpretation and cultural consciousness and their conceptual foundations do
not allow them to encompass the areas of science, not even biology.
Cognitive information sciences, partly based on first order cybernetics,
have run into a powerlessness situation in their attempts to find the algorithms
of intelligence, informational meaning and language (Winograd & Flores 1986,
Dreyfus & Dreyfus 1996, Searle 1989 and Penrose 1995). This structural approach
has great problems with the phenomenon of context and signification and how
they interact. They want to understand everything, including consciousness and
meaning, algorithmically (Lakoff 1987) based on the old belief that both the world
and the mind is structured mathematically and that mathematics in the end is the
Logos, be it in the One, the unmovable mover, the pure nature of the vacuum field
or the Christian God.
When such foundational limitations are realized it is surely time to make
a further development in your conceptual foundation, as Bohr pointed out
(Bohr 1954). I believe that Peirce’s semiotics and its ability to be the foundation
of a biosemiotics (Sebeok 1979, Hoffmeyer 1996) can establish such a new
transdisciplinary foundation integrating the new results from other more specific
disciplines such as second order cybernetics, cognitive semantics and pragmatic
language philosophy.
I will show that Bateson (1972), and later on the new second order cybernetics
42 of von Foerster, have developed some fruitful concepts on the self-organization
of cognition. Furthermore the concepts of autopoiesis and structural couplings
of Maturana and Varela, which were developed in the same tradition, bring us
important steps forward. But I also want to show that these are not sufficient to
explain how meaningful communication is possible (Fogh Kirkeby 1997). To this
end I turn to integrate concepts from Peirce’s semiotics which at the same time
offer an alternative philosophical foundation to mechanistic materialism, on one
hand, and pure constructivism on the other, in the form of an objective idealistic,
realistic, evolutionary and, pragmatic philosophy based on the papers of mine
you can find in the reference list. I refer to the most relevant of them as I unfold
the argument, for those who want to examine a more detailed argumentation.
References
Allen, Colin and Bekoff, Marc 1997. Species of Mind: The Philosophy and Biology of Cognitive
Ethology. Cambridge: MIT Press.
Bateson, G. 1972. Steps to an Ecology of Mind. Paladin, Frogmore, St. Albans.
Bekoff, Mark, Allen, Colin, and Burghardt, Gordon M. 2002. The Cognitive Animal: Empirical
and Theoretical Perspectives on Animal Cognition. Cambridge: MIT Press
Bohr, N. 1954. ”Kundskabens Enhed”. in Bohr, N.. Atomfysik og menneskelig erkendelse, pp.
83-99. Copenhagen: J.H. Schultz Forlag.
Brier, S. 1980. Prize Essay in psychology about the fruitfulness of hierarchy - and probability -
deliberations in constructing models of motivation from behavioral analysis. Awarded with
the Gold Medal in psychology of Copenhagen University.
__1992. “Information and consciousness: A critique of the mechanistic concept of
information”. Cybernetics & Human Knowing, Vol.1, (2/3): 71-94.
__1993a. “Cyber-Semiotics: Second-order cybernetics and the semiotics of C.S. Peirce”.
Proceedings from the Second European Congress on Systemic Science, Vol. II: 427-436.
__1993b. “A Cybernetic and Semiotic view on a Galilean Theory of Psychology”. Cybernetics
& Human Knowing, Vol. 2, (2): 31-45.
__1995. “Cyber-Semiotics: On autopoiesis, code-duality and sign games in bio-semiotics”.
Cybernetics & Human Knowing, Vol. 3, (1): 3-14
__1996a. “From Second-order Cybernetics to Cybersemiotics: A Semiotic Re-entry into the
Second-order Cybernetics of Heinz von Foerster”. Systems Research, Vol. 13, (3): 229-
244
__1996b. “The Necessity of a Theory of Signification and Meaning in Cybernetics and
Systems Science”. Proceedings of the Third European Congress on Systems Science, Rome,
1-4 October 1996, pp. 693-697. Rome: Edizioni Kappa.
__1996c. “Cybersemiotics: a new interdisciplinary development applied to the problems of
knowledge organization and document retrieval in information science”. Journal of
Documentation, 52 (3), September 1996: 296-344.
__1996d. “The Usefulness of Cybersemiotics in dealing with Problems of Knowledge
Organization and Document Mediating Systems”. Cybernetica: Quarterly Review of the
International Association for Cybernetics, Vol. 39 (4): 273-299.
__1996e. “Trust in the order of things: an extended review of Freya Mathew’s book The
Ecological Self”. System Practice 9 (4): 377-385.
__1997. “What is a Possible Ontological and Epistemological Framework for a True
Universal ‘Information Science’? The suggestion of a Cybersemiotics”. World Futures,
Vol. 49:297-308.
__1998a. “The Cybersemiotic Explanation of the Emergence of Cognition: The Explanation
of Cognition, Signification and Communication in a non-Cartesian Cognitive
biology”. Cognition and Evolution, Vol. 4, no.1, pp. 90-102. 55
__1998b. “Cybersemiotics: A suggestion for a Transdisciplinary Framework for Description
of Observing, Anticipatory and Meaning Producing Systems”. Proceedings form
CASYS97: conference on anticipatory systems. Liege, Belgium: Chaos foundation. American
Institute of Physics Conference Proceedings 437, pp. 182-193. Berlin: Springer Verlag
__1999. “Biosemiotics and the foundation of cybersemiotics. Reconceptualizing the insights
of Ethology, second order cybernetics and Peirce’s semiotics in biosemiotics to create
a non-Cartesian information science”. Semiotica, Special issue on Biosemiotics, 127-
1/4, 1999, 169-198.
__2000a. “Konstruktion und Information. Ein semiotisches re-entry in Heinz von Foersters
metaphysische Konstruktion der Kybernetik zweiter Ordnung”. in Jahraus, Oliver,
Nina Ort und Benjamin Marius Schmidt (eds.). Beobactungen des Unbeobacthtbaren, pp.
254-295. Weilerswist: Velbrück Wissenschaft.
__2000b. “Trans-Scientific Frameworks of Knowing: Complementarity Views of the
Different Types of Human Knowledge”. Yearbook Edition of Systems Research and
Behavioral Science, V.17, No. 5, pp. 433-458.
__2001a. “Cybersemiotics and Umweltslehre”. Semiotica, 134-1/4 (2001), 779-814.
__2001b “Cybersemiotics: A reconceptualization of the foundation for Information Science”.
Systems Research and Behavioral Science, 18, 421-427.
__2001c. “Ecosemiotics and Cybersemiotic”. Sign Systems Studies 29.1, 107-120.
__2002. “Intrasemiotics and Cybersemiotics”. Sign System Studies 30.1, 113-127.
__2002a. “Varela’s contribution to the creation of Cybersemiotics: The calculus of self-
reference”. ASC-column, Cybernetics & Human Knowing, 9,2:77-82.
__2002a. “The five-leveled Cybersemiotic Model of FIS”. Trappl, R. (ed.): “Cybernetics and
Systems vol. 1, 2002”, Austrian Society for Cybernetic Studies. 1:197-202.
__2003. “The Cybersemiotic model of communication: An evolutionary view on the
threshold between semiosis and informational exchange”. TrippleC 1(1): 71-94.
Churchland, Patricia Smith. 1986. Neurophilosophy: Toward a Unified Science of the Mind-Brain.
Cambridge: MIT Press.
Churchland, Paul M. 1995. The Engine of Reason, the Seat of the Soul: A Philosophical Journey
into the Brain. Cambridge: MIT Press.
Foerster, Heinz von 1984. Observing systems. California: Intersystems Publication
__1993. “Für Niklas Luhmann: Wie rekursiv ist Kommunikation“. Teoria Sociologica 1(2):
61-88.
__1996. “From Stimulus to Symbol”. in McCabe, V. and Balzano, G.J. (eds.). Event and
Cognition: An Ecological Perspective. New Jersey: Lawrence Erlbaum.
Hayles, N.K. 1999. How We Became Posthuman: Virtual Bodies and Cybernetics, Literature and
Informatics. Chicago: The University of Chicago Press.
Heidegger. M. 1962. Being and Time: A Translation of Sein und Zeit by John Macquarrie and
Edward Robinson. San Francisco: Harper.
Hinde, R. 1970. Animal Behavior: A synthesis of Ethology and Comparative behavior. Tokyo:
McGraw-Hill.
Hoffmeyer, J. 1996. Sign of Meaning in the Universe. Bloomington: Indiana University Press.
Kirkeby, O.F. 1997. Event and body-mind. An outline of a Post-postmodern Approach to
Phenomenology, Cybernetics & Human Knowing, Vol. 4, No. 2/3, pp. 3-34.
Kull, K. 1998. “Semiotic ecology: different natures in the semiosphere”. Sign System Studies,
Vol. 26, pp. 344-364.
Lakoff, G. 1987. Women, Fire and Dangerous Things: What categories reveal a bout the Mind.
Chicago: The University of Chicago Press.
Lorenz, K. 1935. “Der Kumpan in der Umwelt des Vogels”. J. F. Ornith., 83, 137-213, 289-
56 413.
__1950. “The comparative method in studying innate behavior patterns”. Sym. Soc. Exp.
Biol., 4, 221-268
__1970-71. Studies in animal and human behavior I and II. Cambridge: Harvard Univ. Press.
__1973. Die Rückseite des Spiegels. München: Pieper and Co.
__1966. On Aggression. London: Methuen
Luhmann, N. 1990. Essays on self-reference. New York: Columbia University Press.
Luhmann, N. 1992. “What is communication?”. Communication Theory, Vol. 2, no.3, pp.
251-258.
Maturana, H. R. 1983. “What is it to see?”. Arch. Biol. Med. Exp. 16: 255-269.
__(1988). “Ontology of observing: The Biological Foundation of Self Consciousness and the
Physical Domain of Existence”. The Irish Journal of Psychology, Vol. 9, (1. 25-82).
Maturana, H.R. & Varela, F.J. 1980. Autopoiesis and Cognition: The Realization of the Living.
Boston: Reidel.
Nöth, W. 2001. “Introduction to Ecosemiosis,” Tarasti, ISI Congress papers, Nordic Baltic
Summer Institute for Semiotic and Structural Studies Part IV, June 12-21 2001 in
Imatra, Finland: Ecosemiotics: Studies in Environmental Semiosis, Semiotics of the
Biocybernetic Bodies, Human/too Human/ Post Human, pp. 107-123.
Peirce, C.S. 1931-58. Collected Papers vol. I-VIII, Eds. Hartshorne and Weiss, Harvard
University Press.
Penrose, R. 1995. Shadows of the Mind: A Search for the Missing Science of Consciousness.
London: Oxford University Press.
Popper, K. R. 2004. The Poverty of Historicism. London: Rutledge
Prigogine, I. & Stengers, I. 1986. Order out of Chaos: Man’s new Dialogue with Nature. Toronto:
Bantam Books
Reventlow, I. 1970. Studier af komplicerede psykobiologiske fænomener; Munksgaard,
Copenhagen. Doctoral thesis. University of Copenhagen.
__1977. ”Om ‘dyrepsykologien’ i dansk psykologi og om dens betydning for
begrebsdannelsen i psykologien, I”. Dansk filosofi og psykologi, bind 2. Filosofisk
Institut, University of Copenhagen: 127-137.
Searle, J. 1989. Minds, Brains and Science. London: Penguin Books.
Sebeok, T. 1976. Contributions to the Doctrine of Signs. Bloomington: Indiana University
__1979. The Sign & Its Masters. University of Texas Press.
Stonier, T. 1990. Information and the Internal Structure of the Universe. London: Springer
Verlag.
__1992. Beyond Information: The Natural History of Intelligence, London: Springer Verlag.
__1997. Information and Meaning: An Evolutionary Perspective, Berlin: Springer Verlag.
Tinbergen, N. 1973. The Animal in Its World, London: George Allan & Unwin.
Uexküll, J. von 1973. “The Theory of Meaning”. in Thure von Uexküll 1982. Jakob von Uexküll’s
“The Theory of Meaning”. Special issue of Semiotica, 42-1.
__1986. “Environment (Umwelt) and Inner World of Animals”. in Burghardt, G.M. (ed.).
Foundations of Comparative Ethology. New York : Van Nostrand Reinhold.
Wimmer, M. 1995. Evolutionary Roots of Emotions, Evolution and Cognition, Vol. 1. (1): 8-50,
Vienna: Vienna University Press.
Winograd, T. & Flores F. 1987. Understanding Computers and Cognition. Norwood, New
Jersey: Alex Publishing Corporation
Wittgenstein, L. 1958. Philosophical Investigations. New York: MacMillian Publishing.
57
58
Information and direct perception: a new approach
Anthony Chemero
Introduction
Since the 1970s, Michael Turvey, Robert Shaw, and William Mace have worked
on the formulation of a philosophically-sound and empirically-tractable version
of James Gibson’s ecological psychology. It is surely no exaggeration to say that
without their theoretical work ecological psychology would have died on the vine
because of the high-profile attacks from establishment cognitive scientists (Fodor
and Pylyshyn 1981, Ullman 1981). But thanks to Turvey, Shaw and Mace’s work
as theorists and, perhaps more importantly, as teachers, ecological psychology
is currently flourishing. A generation of students, having been trained by
Turvey, Shaw and Mace at Trinity College and/or the University of Connecticut,
ecological psychology, are now distinguished experimental psychologists who
train their own students in Turvey-Shaw-Mace ecological psychology. Despite
the undeniable and lasting importance of Turvey, Shaw and Mace’s theoretical
contributions for psychology and the other cognitive science, their work has not
received much attention from philosophers. It will get some of that that attention 59
in this paper. I will point to shortcomings in the Turvey-Shaw-Mace approach to
ecological psychology, and will offer what I take to be improved versions of two
important aspects of it. In particular, I will describe theories of information and of
direct perception that differ from the Turvey-Shaw-Mace account.
Given the debt that those of us who wish to pursue ecological psychology
owe to Turvey, Shaw and Mace, this, no doubt, seems ungrateful.2 Perhaps it is.
But I would argue that because of the success of the Turvey-Shaw-Mace approach
to ecological psychology, the field has become a true contender in psychology,
cognitive science and artificial intelligence. Given the stability of ecological
psychology and its standing as a research program, it can withstand some
questioning of the assumptions on which its current practice is founded. This is
especially the case if the questioning is aimed at firming up foundations rather
than tearing down the house.
On individuals
Because Turvey, Shaw and Mace take direct perception to be infallible, they insist
that it be underwritten by information, which is, in turn, underwritten by natural
law. They are careful to maintain that the laws in question are ecological laws, laws
that hold only in particular niches. Thus, laws need not be universal in order to
allow information to be carried in the environment. But, of course, ecological laws
must still be general in that they apply to a variety of individuals. For example,
there would be an ecological law that connects a particular optical structure, a
visible texture, to the bark of a particular kind of tree: in the environment of
gray squirrels, say, optical structure O is present only when light has reflected
off a silver maple. Note that making the ecological law niche-specific makes it
so that the presence of optical pattern O in other environments, where lighting
conditions or tree species differ, doesn’t affect O’s information carrying in the
squirrel’s environment. So far so good, but in each gray squirrel’s environment
there are a few trees that have special affordances in that, unlike most trees in the
environment, they contain nests. There are no ecological laws relating these trees,
as individuals, to properties of the optic array, so there is no information about
these trees, as individuals, available to the squirrels. This, of course, does not
apply only to trees. If information depends on laws, there is also no information
about individual people available for perception. So although a human infant
might have information available about humans, she has none about her mother.
So, on the Turvey-Shaw-Mace view, either babies do not perceive their mothers 63
(because the information for direct perception is unavailable) or they do not
perceive them directly. I take it that either alternative is unacceptable to the
ecological psychologists.
67
Figure 1. Situation token s1 carries information about s2 while there is a constraint linking
type S1 to type S2.
Figure 3. The top part of the diagram is analogous to Shaw’s E > I > P; the bottom is analogous to his
P > I > E.
to environment direction of fit is due to an informational relationship among
tokens. On this view, Shaw and MacIntyre were right that there is a two-way
informational relationship between perception and the environment, but they
were wrong in thinking that both directions of the relationship are the same.
Final words
In this paper, I have offered understandings of direct perception and information
that differ from the ecological psychology orthodoxy, the Turvey-Shaw-Mace
view. While their view takes perception to be direct when it is necessarily
correct, on the view I have outlined, perception is direct when the perceiver
and perceived are coupled and their relationship is unmediated by mental
representations. While their view takes information to depend upon ecological
laws and fully-specifying variables, my view takes information to depend upon
constraints that may be only partly-specifying. I hope that I have said enough to
make it clear that my alternative views comprise a coherent and attractive option
for those interested in the ecological approach to psychology and those interested
in embodied cognitive science. Indeed, I have argued elsewhere that the theory of
information found in situation semantics ought to be appropriate for everyone in
the cognitive and computing sciences. I have, of course, said nothing that makes
the Turvey-Shaw-Mace orthodox view incoherent, though some of my arguments
should make it less attractive. 71
Notes
1. For a representative sample, see (Port & van Gelder, 1995).
2. I should also point out that I owe them a personal debt. Though I was never
formally a student of Shaw, Turvey, or Mace, each has been patient corrector of my
misinterpretations and has even encouraged me in the development of my competing
views. They still think that I’m wrong.
3. A quick note on Edward Reed: Although Reed was an author on the paper on
cognition and spent his career working on a philosophically-sound version of Gibson’s
ecological psychology, I think it makes more sense to speak of the Turvey-Shaw-Mace
view and not the ‘Turvey-Shaw-Reed-Mace view’. This is because after working on the
1981 paper, Reed developed views that diverged both from that presented in the 1981
paper and from the one I’m presenting here.
4. Furthermore, Michael Turvey and I have recently come to think that the differences
between our views of affordances are actually more similar than meets the eye. See
Chemero and Turvey (forthcoming).
5. I should point out that there are some who would argue that there are mental
representations involved, even in effective tracking. I have written about this at length
elsewhere (Chemero 2000, 2001) and will not repeat myself here other than to say
that during tracking claiming that some part of an animal represents some part of
the environment provides no explanatory purchase. That is, it is only possible to pick
out the part of the animal that is the representation once one already understands the
system as a causally connected whole.
6. Note that everything said here about Turvey-Shaw-Mace is also true of Dretske’s
classic probability-based theory of information (1981).
References
Barwise, J. and Perry, J. 1981. “Situations and attitudes”. Journal of Philosophy 77: 668-91.
__1983. Situations and Attitudes. Cambridge: MIT Press.
Barwise, J. and Seligman, J. 1994. “The rights and wrongs of natural regularity”. Philosophical
Perspectives, 8, 331-364.
__1997. Information Flow. Cambridge: Cambridge University Press.
Chemero, A. 2003a. “An outline of a theory of affordances”. Ecological Psychology15: 181-195.
__2003b. “Information for perception and information processing”. Minds and Machines 13:
577-588.
Chemero, A. & Turvey, Michael T. (forthcoming). “Complexity and “Closure to Efficient
Cause”. in K. Ruiz-Mirazo and R. Barandiaran (eds.). Proceedings of AlifeX 2006:
Workshop on Artificial Autonomy.
Devlin, K. 1991. Logic and Information. Cambridge: Cambridge University Press.
Dretske, F. 1981. Knowledge and the Flow of Information. Cambridge: MIT Press.
Fodor, J. and Pyslyshyn, Z. 1981. “How direct is visual perception? Some reflections on
Gibson’s ‘ecological approach’”. Cognition, 9, 139-196.
Gibson, J. 1979. The Ecological Approach to Visual Perception. Boston: Houghton-Mifflin.
Israel, D. and Perry J. 1990. “What is Information?”. in P. Hanson (ed.). Information, Language
and Cognition. Vancouver: University of British Columbia Press.
Mace, W. 1977. “James Gibson’s strategy for perceiving: Ask not what’s inside your head,
but what your head’s inside of”. In Shaw and Bransford (eds.). Perceiving, Acting and
Knowing. Hillsdale: Erlbaum.
Millikan, R. 2000. On Clear and Confused Ideas. Cambridge: Cambridge University Press.
72 Norman, D. 1986. The Psychology of Everyday Things. New York: Basic Books.
Reed, E. 1996. Encountering the World. New York: Oxford University Press.
Scarantino, A. 2003. “Affordances explained”. Philosophy of Science 70: 949-961.
Shaw, R. and McIntyre, M. 1974. “Algoristic foundations to cognitive psychology”. In
Weimer and Palermo (eds.). Cognition and Symbolic Processes. Hillsdale: Erlbaum.
Shaw, R., Turvey, M. and Mace, W. 1982. “Ecology psychology: The consequence of a
commitment to realism”. In Weimer and Palermo (eds.). Cognition and the Symbolic
Processes II. Hillsdale: Erlbaum.
Smith, B. C. 1996. On the Origin of Objects. Cambridge: MIT Press.
Stoffregen T. 2003. “Affordances as properties of the animal-environment system”. Ecological
Psychology15: 115-134.
Turvey, M. 1977. “Preliminaries to a theory of action with reference to vision”. In Shaw and
Bransford (eds.). Perceiving, Acting and Knowing. Hillsdale: Erlbaum.
__1990. “The challenge of a physical account of action: A personal view”. In H. T. A.
Whiting, 0. G. Meijer, & P. C. W. van Wieringen (eds.). The natural physical approach to
movement control. Amsterdam: Free University Press.
__1992. “Affordances and prospective control: An outline of the ontology”. Ecological
Psychology, 4, 173-187.
Turvey, M. and Shaw, R. 1979. “The primacy of perceiving: An ecological reformulation of
perception for understanding memory”. In. L.G. Nillson (ed.). Perspectives on Memory
Research. Hillsdale: Erlbaum.
Turvey, M., Shaw, R., Reed, E., and Mace, W. 1981. “Ecological laws of perceiving and
acting: In reply to Fodor and Pylyshyn”. Cognition 9: 237-304.
Ullman, S. 1981. “Against direct perception”. Behavioral and Brain Sciences 3: 373-381.
Withagen, R. 2004. “The pickup of nonspecifying variables does not entail indirect
perception”. Ecological Psychology 16: 237-253.
Revisiting the dynamical hypothesis
Tim van Gelder
Dynamical systems
Dynamical systems are, obviously, systems of a particular sort. A system, in
the sense most useful for current purposes, is a set of variables changing
interdependently over time. For example the solar system of classical mechanics is
the set of positions and momentums of the sun and planets (and their moons, and
the asteroids…). The question then is: when is a system dynamical? Interestingly,
there is no established answer within cognitive science or even outside it. A search
of the literature reveals a wide range of definitions of dynamical systems, ranging
from the very specific (“a set of bodies behaving under the influence of forces”)
80 to the hopelessly broad (“a system which changes in time”). Somewhere on this
spectrum lies the definition that is the most useful in articulating the dynamical
hypothesis in cognitive science. Which is that?
The clue, I think, is found in the fact that in all those systems standardly
counted as dynamical systems in the practice of cognitive science, the variables
are numerical, in the sense that we can use numbers to specify their values. Why
is this? Well, one thing about numerical quantities is that it makes sense to talk
about how far apart any two values are, and indeed we have an easy way of telling
what that distance is. And when a system’s variables are numerical, we can also
tell how far apart any two overall states of the system are. And the key point
is this. For some systems, it is possible to describe how they change over time
—their behavior— by specifying how much or far they change in any given time
step or period. The rule capturing this description is a difference or differential
equation.
In my opinion the best way to articulate the dynamical hypothesis is to take
dynamical systems to be systems with this property, i.e., quantitative systems.
There are various reasons for this. First, it reflects pretty well the actual practice of
cognitive scientists in classifying systems as dynamical or not, or as more or less
dynamical. Second, it is cast in terms of deep, theoretically significant properties
of systems. For example, a system that is quantitative in state is one whose states
form a space, in a more than merely metaphorical sense; states are positions in
that space, and behaviors are paths or trajectories. Thus quantitative systems
support a geometric perspective on system behavior, one of the hallmarks of a
dynamical orientation. Other fundamental features of dynamical systems, such
as stability and attractors, also depend on distances. Third, the definition sets up
a solid contrast between dynamical systems and digital computers, essential if we
want to understand dynamical cognitive science as distinctively different from
orthodox computational cognitive science.
Now we have a fix on dynamical systems; what does it mean to say that
cognitive agents are those things? Here things will be clearer if we make a
distinction between what I call the Nature and Knowledge hypotheses.
Nature hypothesis
The nature hypothesis tells us something about reality itself, i.e., that things
of one kind (cognitive agents) are things of another kind (dynamical systems).
The truth or falsity of the nature hypothesis is completely independent of what
we happen to think or know about reality; it concerns the way the world is.
And to say that cognitive agents are dynamical systems is to make a somewhat
complicated claim. Notice first of all that it is not a straightforward identification.
Jean is a cognitive agent, and one thing for sure, Jean is not simply a set of
interdependent variables, just as the Watt governor is not simply the arm angle
variable. The simple slogan is really saying that for any cognitive performances
of mine you might be interested in, there is some set of variables associated with 81
me (and the relevant environment) which constitute a dynamical system of a
particular sort, and the cognitive performances are behaviors of that system.
So for example Jean’s “deciding” to reach for one box or another is a kind of
cognitive performance, and Thelen et al’s account suggests that associated with
Jean there are various variables tied together and changing in the way specified
by their grand equation, such that his reaching behavior (including his A not
B error) is the behavior of that set of variables. And the nature aspect of the
dynamical hypothesis says that all cognitive performances are like that. Note
that on this analysis each cognitive agent “is” no one dynamical system; different
kinds of cognitive performances would be the behavior of different systems
associated with the same agent.
Knowledge hypothesis
While the nature hypothesis is a claim about reality, the knowledge hypothesis
is a claim about cognitive science. It says that cognition (at least in the case of
natural cognitive agents, such as humans and other animals) is best understood
dynamically. This is of course because cognitive agents are in fact dynamical
systems (the nature hypothesis), and your intellectual tools really ought to fit the
subject matter at hand. Conversely, our best evidence for the nature hypothesis
would be discovering that the best way to study cognition is to use dynamics.
However we should not allow the undeniable fact that the nature and knowledge
hypotheses are intimately related to cloud this important distinction.
What is it to understand natural cognition dynamically? I said above that
the easiest way to pick out a dynamicist in cognitive science is to see whether
they use differential or difference equations, and while this is not the whole story
it is certainly a key part of it. A thoroughly dynamical perspective on cognition
has three major components: a dynamical model, use of the intellectual tools of
dynamics, and adopting a broadly dynamical perspective.
A dynamical model is an abstract dynamical (i.e., quantitative) system
whose behavior is defined by the scientist’s equations. The behavior of the
model is compared with empirical data on the cognitive performances of human
subjects. If the match is good, we infer that the cognitive performances simply
are the behavior of relevantly similar dynamical systems associated with the
subjects. So for example Thelen et al define an abstract dynamical model by
specifying a set of variables and a grand differential equation governing their
interdependent change. They then show that if the parameters are set right, the
model system behaves just the way Jean does; from which they infer that Jean’s
cognitive performances are, in reality, the behavior of a very similar system
whose variables are aspects of Jean himself and his environment.
One problem with this whole approach to cognitive science is that the
82 behavior of even simple nonlinear dynamical systems can be rather hard to
understand. So while defining an abstract dynamical model might be easy
enough, understanding what it does —and so whether it is a good model— can
be pretty challenging. One handy tool here is the digital computer, used to
simulate the dynamical model. Note that in such cases the digital computer is
not itself a model of cognition; it is just a tool for exploring models. But much
more important than the computer is the repertoire of concepts and techniques
loosely gathered under the general heading of “dynamics.” This includes
dynamical modeling, that traditional branch of applied mathematics which aims
to understand some natural phenomenon (e.g., the solar system) via abstract
dynamical models; and also dynamical systems theory, the much newer branch
of pure mathematics that aims to understand systems in general and nonlinear
dynamical systems in particular. To use the intellectual tools of dynamics is to
apply this body of theory (suitably modified and supplemented for the purposes
at hand) to the study of natural cognition.
The third component of dynamical understanding is a broadly dynamical
perspective. The best way to convey this somewhat nebulous idea is to describe
it as the difference between the two ways of conceiving the steam-governing
problem. From a broadly dynamical perspective, cognition is seen as the emergent
outcome of the ongoing interaction of sets of coupled quantitative variables rather
than as sequential discrete transformations from one data structure to another.
Cognitive performances are conceived as continual movement in a geometric
space, where the interesting structure is found over time rather than statically
encoded at a time. Interaction with the world is a matter of simultaneous mutual
shaping rather than occasional inputs and outputs. Dynamicists are certainly
interested in “within-the-head” structures and processes, and usually even allow
that some of these count as representations, but they reject the idea that cognition
is to be explained exclusively in terms of internal representations and their
algorithmic transformations.
It is hard to overemphasize how different dynamical cognitive science is in
practice from its orthodox computational counterpart, and also hard to convey the
nature of the dynamical approach in a few short paragraphs. In my opinion the
Dynamical Hypothesis, as formulated above, comes pretty close to encapsulating
the theoretical essence of the dynamical approach; further, the contrast between
the DH and the computational hypothesis is the most significant theoretical
division in contemporary cognitive science. However these are contentious
philosophical claims about the nature of cognitive science. How have they been
received by other cognitive scientists?
Conclusion
I conclude that the DH still stands as the proper way to articulate the essence of
contemporary dynamical approaches to cognition. But what about the question
I keep deferring: is it actually true? To answer this is, in effect, to predict the
course of cognitive science; and, as a pundit once pointed out, it is hard to make
predictions, especially about the future. Moreover, as a philosopher somewhat
90 removed from the front lines I have certainly have no special insight. However,
putting qualifications aside, recent broad trends in cognitive science, as well
as some very general considerations, indicate that the Dynamical Hypothesis
will turn out to be true of a considerable portion of natural cognition; that
where computation is relevant, it will be analog computation implemented in
dynamical systems; and insofar as the DH is false, it will be superseded by some
form of theoretical framework whose elements are being pieced together by
unheard-of mathematicians laboring under the illusion that their ideas couldn’t
possibly have any application to reality.
References
Beer, R. 1998. “Framing the debate between computational and dynamical approaches to cognition”.
Behavioral and Brain Sciences, 21, 630.
Bernstein, D., & van de Wetering, S. (forthcoming). “More boulders of confusion”. Behavioral and Brain
Sciences, 21.
Dennett, D. 1995. Darwin’s Dangerous Idea. New York: Touchstone.
Garson, J. 1996. “Cognition poised at the edge of chaos: A complex alternative to a symbolic mind”.
Philosophical Psychology, 9, 301-321.
Heath, R. 1998. “Cognitive dynamics: a psychological perspective”. Behavioral and Brain Sciences, 21, 642.
Kuhn, D. 1962. The Structure of Scientific Revolutions. Chicago: University of Chicago Press.
Newell, A., & Simon, H. 1976. “Computer science as empirical enquiry: Symbols and search”.
Communications of the Association for Computing Machinery, 19, 113-126.
Quartz, S. 1998. “Distinguishing between the computational and dynamical hypotheses: What difference
makes the difference?”. Behavioral and Brain Sciences, 21, 649-650.
Thelen, E., G. Schöner, et al. 2001. “The Dynamics of Embodiment: A Field Theory of Infant Perseverative
Reaching.” Behavioral and Brain Sciences 24: 1-86.
van Gelder, T. J. 1998a. “Disentangling dynamics, computation, and cognition” Behavioral and Brain
Sciences, 21, 40-7.
van Gelder, T. J. 1998b. “The dynamical hypothesis in cognitive science”. Behavioral and Brain Sciences,
21, 1-14.
91
92
The dynamical approach to cognition:
inferences from language
Robert F. Port
Figure 1. A frequency histogram of the phase of onset for the final stressed syllable, “bone”, in “Give the
dog a bone” as the metronome rate was increased from very slow to very fast. A strong preference for
phases near 1/2 is evident plus a possible attractor near 1/3. (From Port, Tajima and Cummins, 1998).
Figure 2 shows histograms of the observed median phase of each trial of 10-
12 repetitions of the test phrase for target phases drawn from a uniform random
distribution of target phases between 0.3 and 0.75 for four of the 8 subjects (the
other four closely resembled these). It can be seen that subjects found certain
ranges of phase easy and natural to reproduce while other values of target phase
(which had been uniformly distributed along the X axis) could not be reproduced
accurately. It is clear that all these subjects found 1/2 and 1/3 to be a phase lag
that could be easily reproduced and that 3 of the subjects exhibit attractive phases
also near 2/3. When target phases occurred at other phase angles, they tended to
be reproduced with values close to these three values, 1/2, 1/3 and 2/3.
This very strong constraint on the timing of speech production is persuasive
evidence that Ss are somehow subdividing the long cycles into integer fractions
(that is, the harmonic fractions – halves and thirds). Apparently they actually
have attractor phase angles at these harmonic fractions of the phase circle. These
attractors are similar in certain ways to the attractors of relative phase observed
in the finger-wagging task employed by Haken, Kelso and Bunz, thus suggesting
a similar model, but one with attractors at 1/3 and 2/3 as well.
Conclusions.
Evidence from these experiments as well as in various folk-art of genres human
speech shows that humans have a strong tendency to produce speech in
rhythmical ways. Periodic structure in speech is so ubiquitous that we must have 111
an account of speech that will both render such entrainment of speech possible
and even to render it likely and natural.
Meter system. It seems that a natural way to account for these 2-beat and 3-beat
patterns during speech cycling is to propose that when Ss repeat the text, they
activate a meter system (see Port 2003). A meter system is hypothesized to be
a pair (or more) of oscillators that are phase locked (that is, coupled in such a
way that the pair of oscillators fire simultaneously when possible). They fire at
frequency ratios of 1:2 or 1:3 (or conceivably m:n) (cf. Large and Kolen 1996, Large
Figure 2. A frequency histogram of the phase of onset of the phrase final stressed syllable in the
Cummins and Port experiment (1998). Target phases were specified uniformly along the phase
continuum from 0.3 to 0.7. These histograms show the median phase for the ‘beat’ of the final
stressed syllable on each trial of 10-12 repetitions. Only a few intervals along the phase circle seem
to attract the onset of the stressed syllable.
and Jones 1999). A reasonable guess is that some region of the brain must exhibit
a pattern of firing that cycles for each of these oscillators, eg. for both the 1 and
the 2. The pulses of these oscillators apparently attract the stressed syllable ‘beats’
of the repeated phrases.
The balance of this paper will develop aspects of a mathematical model that
will begin to provide an explanation of some of the phenomena just described.
Comment on ‘universals of language’. This research program began with the
question “What are the dynamical properties of language? And what properties
are ignored by the conventional search for an inventory of the symbolic apriori
units of language.” We had discarded as misguided the conventional search for
linguistic universals. But surprisingly, almost as soon as a methodology for this
study reached workable form, we began to realize that we were finding very
strong and easily recognizable universal features of speech. For example, it is
likely that periodic alternations of strong and weak elements in speech production
are widespread across languages, possibly even universal (as proposed for the
“metrical grid” by Liberman 1978). And it appears likely that periodic attractors
at 1/2, 1/3 and 2/3 that we have discovered are universals of all human behavior,
including language production. They can probably be observed in most languages
given an appropriate linguistic task. The difference now, is that the universals we
have discovered are not symbolic in form, but rather reflect dynamical systems
112 that are very general and widespread.
It seems possible that there is a fairly small number of distinct rhythmical
styles, that is, styles for imposing rhythmic constraints on speech, across the
languages of the world, as claimed by Abercrombie (1967) and hinted at by Pike
(1945). It is not likely that such a typology will be as restricted and simplistic as the
“stress-timed” vs. “syllable-timed” typology claims. But, just as there are only a
small set of widely observed musical meters (eg. 2-beat, 3-beat and 4-beat meters,
etc), there may be a fairly limited set of possible linguistic rhythmical styles
too. Our approach seeks general dynamic mechanisms for rhythm and meter,
suitable for any language (and probably for music as well). This work makes a
small start toward uncovering a subdiscipline within phonology that might be
called the temporal phonology of language. Of course, other aspects of phonology
(like the production of consonants and vowels) also require dynamical models,
probably along the lines suggested by Browman and Goldstein (1986 1995) and
by Saltzman and Byrd (Saltzman 1995, Byrd and Saltzman 2003).
Do the rhythms that we have observed in speech have any relationship
to other kinds of motor behavior? In the next section, we shall review a simple
motor task having nothing to do with speech that exhibits simple rhythmicity.
The task is simple enough that an easily understood mathematical model has
been proposed. From this simple model, we will derive a variant that may be
more suitable for the speech results.
Dynamical Model for Limb Motion and Speech Timing
One question to ask is whether there might be any similarity between these
rhythmical phenomena and other dynamical effects observed in human
behavior even if they have no obvious relationship to speech at all. The Haken-
Kelso-Bunz model (Haken et al. 1985) is a very simple model for some very
simple motor phenomena.
The H-K-B model.
Scott Kelso tried wagging his left and right index fingers in various ways. He
observed (1981) that he (and his experimental subjects), could stably reproduce
just two specific patterns of bimanual coordination of the index fingers. In one
the fingers approach each other at the midline of the body (such that homologous
muscle groups contract simultaneously, defining zero relative phase), and in
the other the fingers alternate in motion toward the midline, but both move
simultaneously to the left and then both right (defined as a relative phase of 0.5).
He asked his subjects to move their index fingers back and forth in the 0.5 phase
relation (it doesn’t seem tomatter much what particular hand orientation you
employ). Each trial was begun with a slow rate and sped up gradually. At some
rate each subject became unable to keep the alternating coordination and found
themselves unable to prevent shifting to move the fingers symmetrically (in 0
relative phase).
Haken, Kelso and Bunz proposed a model for this robust behavioral
phenomenon (1985, Kelso 1995). The model proposes a vector field applying to
the relative phase of the two fingers. This vector field always has an attractor at
0 relative phase (where both go toward midline and then both away). By saying 113
0 phase is an attractor, they are claiming that the state of the system will tend to
move toward a relative phase of zero if it finds itself in the neighborhood. Thus it
implies that moderate perturbations of instantaneous phase (like those that occur
in the nervous sytem when you try to produce a semiskilled act like wagging
your index fingers according to some silly instructions) will tend to result in
automatic correction back to the region near 0 relative phase. At slower rates,
the model asserts the existence of a second attractor at phase 1/2. Although this
attractor is not as ‘strong’ or ‘deep’ etc. as the attractor at phase 0, it is nevertheless
fairly effective, at least at slow rates. When the rate is increased, however, the
attractor at 1/2 typically becomes weaker (modeled by reducing the amplitude
of the 2d harmonic component for faster rates). Eventually, at even faster rates,
the attractor at 1/2 will disappear (at the “critical point”) and at faster rates 1/2
becomes a maximally repelling phase angle.
The HKB-model of coordination is an example of a very high-level
descriptive idea: Observe macroscopic patterns of a complex dynamical system
and seek collective variables (eg. relative phase) and control parameters (eg. rate)
of the underlying self-organizing processes that coordinate a large number of
subsystems (including not just the two fingers, but also at least the various
muscles groups that control them and various relevant parts of the central motor
system). Because complex systems have a propensity for turning themselves into
simpler devices that may exhibit various functional requirements, the behavior
of complex systems can sometimes be modeled by means of a few macroscopic
order parameters, or dimensions, on which pattern changes are reflected (Haken
1983, Kelso 1995). It is possible that there may be certain dynamical characteristics
that hold for many kinds of complex systems, from simple mechanical devices all
the way to the motor-cognitive apparatus responsible for language.
The HKB model puts the finger wagging task into a broader framework
of phase transitions occurring when systems with two stable states turn into
systems with only one stable state – as change in the control parameter causes
changes in the attractor landscape. One way to think of attractors is as basins
in a potential function, V, whose slopes represent the vector field that changes
the state of the system. Thus in a potential function, a ‘valley’ is an attractor and
a ‘peak’ is a repellor. The steeper the sides, the faster the change for any value
of relative phase. This would lead a magic marble to settle toward local energy
minima (imagining motion without any momentum in the basin).
Phi stands for the collective variable, the relative phase of oscillators 1 and
2. Since the data seem to exhibit attractors at phi = 0 and phi = 0.5, the HKB-model
describes the situation in terms of a potential function, V (made from negative
cosines) and a control parameter, rate of finger oscillation. Increases in the rate
cause the relative amplitude of a negative cosine with a trough at 1/2 to become
smaller. V thus gives a range of attractor landscapes for different values of the
control parameter, rate. The two attractors at the slowest rate correspond to phase
lockings of antiphase and inphase. Figure 3 shows the attractor landscape for the
slow rate with a moderately strong attractor at 1/2.
114
Figure 3. The H-K-B model of attractors on the relative phase of the two wagging fingers at slow rate.
It shows attractors at a relative phase of 0 and 0.5. The potential function is simply the sum of the two
negative cosines shown in the equation. As rate increased, the amplitude b of the cos 2phi terms is
assumed to decrease. And the phase angle of 0.5 becomes a repelling phase angle as b approaches 0.
As b gets smaller (assuming a>b), the attractor at 0.5 gets shallower and flatter, thus predicting slower
return to the attractor after any perturbation, and also predicting more variation in phase due to the
constant internal noise than would be predicted for the slower rate.
At a slow rate, as shown in Figure 3, there is a deep basin in V which
corresponds to the stronger inphase attractor and a shallower basin showing
where the weaker, antiphase attractor is located. As the rate is accelerated, the
weaker attractor basin is hypothesized to become more shallow, gradually losing
its effectiveness as an attractor. The system will gradually lose its stability, and
eventually reach the critical point where the attractor disappears and the system
state must slip over to the deeper inphase basin.
Model Predictions. This set of potential functions makes several experimentally
testable predictions (using stochastic methods, Schöner, Haken and Kelso 1986)
when the control parameter (cycling rate) is manipulated so as to approach the
critical rate.
1. When the rate is slowly increased while Ss maintain a target phase of
1/2, Ss will eventually slip over to the “inphase” pattern.
2. Even at slow rates, no phase other than 0 and 1/2 will be stable. (With
practice, 1/3 and 2/3 and perhaps other phases might be trained up.)
3. “Critical slowing down.” Because the slopes of the attractor basin became
more gentle at rates near the critical point, there will be less tendency
to draw the phase back to the basin bottom. Therefore, given a single
perturbation, such as mechanical hindering of the motor performance
or a delayed stimulus cycle, when the rate is near the critical rate, the
recovery of phase back to the pre-perturbation target phase should take 115
longer than at slower rates.
4. “Critical fluctuations.” Given some amount of intrinsic noise, the weakening
stability of the antiphase attractor should manifest itself as an increase of
fluctuations of relative phase as the rate approaches the critical rate.
These prediction have been verified for the finger wagging task as well as for
several other related tasks involving other limbs (See Kelso 1995 for review).
General Model of Meter
The phenomena reported here suggest a speculative new model of meter. This
notion is more general than just a linguistic structure, and may include the HKB
finger-wagging task as a special case. Only the most general properties are clear
at the moment. A more explicit mathematical treatment is required to spell out its
implications explicitly. This theory depends on the notion of adaptive oscillation
(McAuley and Kidd 1998, Large and Kolen 1996) and closely resembles the Large
and Jones (1998) notion of musical meter as coupled oscillators whose phase
zeros specify temporal attractors in perception as well as production. See Port
(2003) for further discussion.
Notice that HKB had only one attractor on the relative phase circle in addition
to 0, an attractor at 1/2. But the speech cycling tasks revealed three attractors in
addition to 0, at 1/2, 1/3 and 2/3. The two new attractors can be accounted for by
adding one more harmonic to the HKB model, a term of [-c cos 3phi ] (where a, b,
c are positive and c is normally smaller than b and b is normally smaller than a.)
With inclusion of this term, then, if c is sufficiently large, attractors will appear (in
the order of their attractiveness) at 0, 1/2 and then equally 1/3 and 2/3.
Figure 4 shows the architypal structure of such a system displayed as a
potential function. It seems likely that there is some region of tissue in the brain
that cycles for each of these coupled oscillators. So, if one is producing (or listening
to) a waltz rhythm, then something is oscillating on each beat as well as on each
measure of 3 beats. If this is a metrical structure, then the two oscillations are
phase locked so that they fire simultaneously at 0 phase of the slower oscillator.
This system of a single or several coupled oscillators should generate regularly
spaced and nested, hierarchical structures of pulses.
The way in which meter makes its presence felt, then, is that if a subject
repeats a motor gesture (whether speech or nonspeech) or hears an auditory
pattern that contains something perceptually salient (eg. a loud onset) that
repeats at an appropriate rate, then those events may become coupled to one
of the pulses of the meter (either to a faster one or a slower one). Because many
other kinds of behavior can exhibit meter aside from speech, we cannot call it
essentially a linguistic structure. But apparently repeated text phrases (or the
lines of poem or chant) can become entrained to one – along with wagging or
tapping fingers as well.
Obviously, the relative depth of these attractors may vary with the task
and with the subject, as well as over time within a subject (if certain patterns are
practiced). It is only the general form of the potential function that I suggest may
be a universal (on the assumption that all cultures can produce both 2-beat and
116 3-beat patterns in at least some genre or another). The details of attractor shapes
may turn out to be quite different from negative cosines. These issues require
more data and more theory. The proposal made here is that a set of attractors
on the relative phase (shown in Figure 4) are what explains results like those in
Figures 1 and 2.
Figure 4. A summary image representing the attractors of a 3-oscillator system with frequence ratios
of 1:2:3 and a>b>c. It is proposed as a representation of the most likely attractors of the basic metrical
patterns for most people (but may vary depending on cultural experience). The attractor at 1/2 is
shown as less stable than at 0 but as more stable than at 1/3 and 2/3 because this is what seems
generally to be observed.
What I have presented here is only a sketch of some of the properties that
a general theory of meter should have: (1) both 2-beat and 3-beat attractors with
2-beat being easier, (2) restriction to a certain range of rates between 2-3 sec at the
long end down to about a quarter second at the shorter end, and (3) attraction
of prominent events, especially tapping sounds or rapid motions – whatever
can serve to define a unique repeating event, toward the 0 phase locations of
all the participating oscillators. This theory implies many testable hypotheses.
Obviously both more modeling work and more experiments are required to
clarify these ideas.
Concluding discussion
This paper has raced over a broad range of issues. It is time to wrap them all up,
beginning with the data and working backwards to more general topics.
Meters as attractor systems. In recent experiments, we modified a familiar
dynamical research method and asked subjects to repeat a piece of text regularly.
During regular repetition of the utterance (especially when stabilized by a
metronome), we found that harmonics of the repetition cycle appeared as
attractors at very predictable points in time that happened to be small integer
fractions of the basic text cycle. We call them attractors because the onsets of
stressed syllables tend to be biased toward these specific phase angles. From these
and other experiments we have observed some similarities between these speech
patterns and other nonspeech periodic motor patterns. Eventually, we leaped 117
to hypothesis that there may be a general metrical structure that is widespread
in human behavior (perhaps what linguists would like to call a ‘universal’).
This system can be easily characterized (in the most distinctive cases) as an
‘underlying’ (in some sense) pair of coupled, phase-locked oscillators. The phase
zeros of these oscillators appear to be rather powerful attractors for prominent
motor or perceptual events.
We might say that the simplest possible meter is a single oscillator ticking at
some frequency. Let’s call this Level 0 (modeled by a single adaptive oscillator). A
meter system at Level 1 (the simplest structure that one would usually call meter)
has two oscillators at frequency ratio 1:2 or 1:3 (most likely). These are the simplest
hierarchical structures in time. A nested system at Level 2, might have 1:2 nested
within 1:2 (thus resembling 1:4), or 1:2 nested within 1:3 (for one kind of 6/8 pattern).
In musical contexts many other meters are possible (See Tajima and Port 1998, Gasser,
Eck and Port 1998, Large and Kolen 1996 and Port 2003 for further discussion).
Why is it so difficult to observe these meters in spontaneous activity
or natural speech? It is probably because in spontaneous action (including
conversational talk) there are a great many factors affecting timing. At the very
least there is the dynamics of the individual speech gestures, the dynamics of
meter and the dynamics of ‘deciding what to do or say’. The metrical attractors
are only one source of constraint. Only when most other variation is discarded,
as when repeating some well-known text, do the meters spontaneously make
themselves felt in human behavior. Presumably these attractors exist at all times,
just waiting for an opportunity to entrain a speech pattern to itself.
Linguistic theory. Some may argue that these results showing how speech
is easily entrained to periodic patterns and the demonstration of nondiscrete
categories may be of some interest, but that, no matter how regular they are, these
results are not about language at all, but only about linguistic performance. In response,
I would repeat my argument that the distinction of Competence and Performance
is a great mistake. The mistake is to assume in advance that Mind and Body are
clearly distinct, and that we know enough about each of them that a line can be
drawn. I don’t think we know enough yet. I would point out that the very fact
that language is so easily entrained to meters provides us with some kind of
vivid and potentially useful information about what words are really like. They
do have temporal properties, and abstract [+stress] syllables are attracted strongly
to the phase 0s of one meter or another. So it seems foolhardy to insist that this
attractiveness and these rhythmic behaviors simply cannot be ‘The Language
Itself.’ Surely if they show us critical facts about the ‘execution’ of language, they
are also showing us something important about the language itself. Only words
that are temporal, dynamical objects could be coupled to a metrical system. No
matter how one may prefer to think of language, if one also seeks to study language
in a way that might be useful for clinical purposes or for high performance speech
recognition, then dealing with the temporal properties of language will prove
more that simply “relevant,” they are essential. The fact that linguistics has
always restricted itself to static descriptions in the past is not a good rationale
118 for continuing to ignore temporal structure. Approaching linguistics from this
perspective implies that new metaphors will be needed for interpretation as well
as new methods of research. Sooner or later, linguistics must deal with time, because
sooner or later language is always spoken, listened to and read in time.
The Speech Cycling tasks developed here may turn out to be revealing
for many kinds of linguistic phenomena. Of course, these tasks are subject to
the accusation of being ‘artificial’, but this is a small price to pay. It is almost
impossible to see what various temporal factors do to speech ‘in the wild’ and
in unconstrained contexts. When speakers repeat something over and over, it
gets polished and symmetries that are intrinsic to the behavior of the system can
begin to assert themselves. It is rather like studying the angles and the refractive
properties of a large crystal of salt in order to learn about the micro-structure of
individual salt molecules. When a small form is repeated over and over, some of
its intrinsic regularities may become observable. Speech cycling employs massive
repetition to render visible the information we need about subtle linguistic
patterns. Although phonology and phonetics is the first subdiscipline of
linguistics to exploit speech cycling as a tool to study language, it’s possible that
variants of the method can be employed for research on syntax and other issues.
Cognitive Science. Does all of this discussion about how language should be
analyzed have implications for cognitive science in general? First, we have
argued in this paper that natural language does not necessarily provide the prima
facia evidence for cognitive symbols that it is often assumed to provide. The fact
that people talk using fairly discrete words does not tell us a lot about how we
manage to think in general. Languages are, at best, only partially symbol-like.
Nondiscrete ‘categories’ exist and non-serial order features seem to differentiate
one language from another (and thus must be learned rather than being universal
and apriori). Therefore one cannot assume that linguistic units are authentic
examples of symbolic structures. Instead of linguistics providing the model for
general cognition, linguistics too needs an account of what its structures are
and how they could exist. Hopefully dynamical systems theory will continue to
provide improved understanding of such phenomena.
Second, just as a new unexplored set of dynamical phenomena presented
themselves for investigation when linguistic behavior was examined from the
dynamical orientation (eg. with speech cycling tasks), similarly, other aspects of
cognition may find new and productive research methods when the possibility of
entrainment in time is explored.
Finally, language seems to have always provided the basic metaphor for the
rest of cognition. Thinking is a little like talking. Aristotle based his propositional
logic on simple sentence structures, and 19th century predicate calculus enlarged
the linguistic coverage of logic considerably. The discreteness of language has
had the effect of encouraging us humans (or at least Westerners) to believe that
the powers of language could be captured by mechanical automata. But thus
far machines have been able to emulate humans only when we leave time out
— when the inputs are assumed to be static. Even for language, where the
phenomena look at first glance to be discrete and representable quite naturally in 119
the static medium of writing, it turns out that understanding its temporal (non-
serial) structure plays a critical role in theories of cognition. Writing, no matter
how practical it may be, leaves out something central to a practical understanding
of language, the dimension of time.
In the same way, cognitive science too must take responsibility for the
temporal coordination of cognitive acts. Ordinary thinking and reasoning also take
place in real, continuous time. Coupling of all these perceptual and motor activities
with predictable external events is surely quite natural and unavoidable.
Finally, looking even beyond cognitive science, it is reassuring to note that the
simple temporal structures we observe in speech and song are very reminiscent
of many kinds of periodic structures in time and space. For example, tubes
and strings resonate at discrete harmonically related frequencies (if they have
uniform diameter), crickets chirp at a regular rates, clouds and sand dunes
often arrange themselves, we might say, in spatially periodic ‘streets.’ Tigers,
zebras and butterflies grow regular periodicities of coloring. Fireflies and
cicadas sometimes entrain themselves to each other in time. Mathematical
accounts of all these phenomena appeal to the behavior of relatively simple
dynamic systems, whether in the atmosphere, in embryonic zebra skin, in
the firefly brain, or wherever (Haken 1983, Murray 1993). It seems likely that
morphogenetic processes that structure the mechanical world and biological
systems are very similar to ones that contribute to the creation of linguistic and
other cognitive structures.
Acknowledgments
The author is grateful for helpful discussions and comments to Tony Chemero, Fred Cummins, Ken de
Jong, Mafuyu Kitahara and Keiichi Tajima.
Bibliography
Abercrombie, D. 1967. Elements of General Phonetics. Chicago: Aldine Publishing Company.
Abraham, Ralph & Shaw, Christopher 1983. Dynamics, The Geometry of Behavior, Part 1. Santa Cruz: Aerial
Press.
Bloomfield, Leonard 1933. Language. London: Allen-Unwin.
Boomsliter, P. C. & Creel ,w. 1977. ‘The secret springs: Housman’s outline on metrical rhythm and
language’. Language and Style 10, 296-323.
Browman, Catherine, & Goldstein, Louis 1986. ‘Towards an articulatory phonology’. Phonology Yearbook
3, 1986, 219-252.
__1995. ‘Dynamics and articulatory phonology’. In R. Port and T. van Gelder (1995) Mind as Motion:
Explorations in the Dynamics of Cognition. Cambridge: Bradford Books /MIT Press.
__1992. ‘Articulatory phonology: An overview’. Phonetica, 49, 155-180.
Byrd, D., & Saltzman, E. 2003. ‘Task dynamics of gestural timing: Phase windows and multifrequency
rhythms’. Human Movement Science, 19, 499-526.
Chomsky, Noam 1965. Aspects of the Theory of Syntax. Cambridge: MIT Press.
Chomsky, N. & Halle , M. 1968. Sound Pattern of English. New York: Harper-Row.
Cummins, Fred 1996. Rhythmic Coordination in English Speech: An Experimental Study. PhD thesis, Indiana
University, Bloomington, IN. Also Technical Report 198, Indiana University Cognitive Science
Program.
__1997. ‘Synergetic organization in speech rhythm’. Proceedings of the Joint Conference on Complex Systems
in Psychology, Gstaad, Switzerland.
Cummins, Fred & Port, Robert 1996. ‘Rhythmic constraints on stress timing’. In Proceedings of the Fourth
120 International Conference on Spoken Language Processing. Alfred duPont Institute, Wilmington,
Delawapre, pp. 2036 - 2039.
__1998. ‘Rhythmic constraints on stress timing in English’. Journal of Phonetics, 26, 145-171.
Fodor, Jerry A. 1975. The Language of Thought. New York: T. Y. Crowell.
Fodor, Jerry & Pylyshyn, Z. 1988. ‘Connectionism and cognitive architecture: a critical analysis’.
Cognition 28, 3-71.
Fowler, C. P. Rubin, R. Remez And M.t. Turvey 1981. ‘Implications for speech production of a general
theory of action’. In B. Butterworth Language Production, pp. 373-420. New York: Academic Press.
Freeman, Walter 1975. Mass Action in the Nervous System. New York:Academic Press.
Gasser, Michael, Eck, Douglas & Port, Robert 1998. ‘Meter as mechanism: A neural network that learns
metrical patterns’. Connection Science (under review).
Glass, L. & Mackey, M. 1988. From Clocks to Chaos. Princeton: Princeton Univ. Press.
Haken, H. 1983. Synergetics: An Introduction. Berlin: Springerverlag.
Haken, H., Kelso, J. A. S. & Bunz, H. 1985. ‘A theoretical model of phase transitions in human hand
movements’. Biological Cybernetics 51, 347-356
Haugeland, John 1985. Artificial Intelligence: The Very Idea. Cambridge: Bradford Books/MIT Press.
Hayes, Bruce 1995. Metrical stress theory: Principles and case studies. Chicago: Univ. of Chicago.
Hockett, Charles 1955. Manual of Phonology. Baltimore: Waverly Press.
James, Jamie 1993. The Music of the Spheres: Music, Science and the Natural Order of the Universe. Berlin:
Grove Press/Springer-Verlag.
Jeka, J. J., Kelso, J. A. S. & Kiemel, L. 1993. ‘Pattern switching in human multilimb coordination
dynamics’. Bulletin of Mathematical Biology 55, 829-845.
Jones, Daniel 1932. An Outline of English Phonetics, 3d Ed.. Cambridge: Cambridge Univ Press.
Jones, Mari & Boltz, Marilyn 1989. ‘Dynamic attending and responses to time’. Psychological Review 96,
459-491.
Keating, Patricia 1985. ‘Universal phonetics and the organization of grammars’. In V. Fromkin (ed.)
Phonetic Linguistics: Essays in Honor of Peter Ladefoged, 115-132. New York: Academic Press.
Kelso, J.a.s 1981. ‘On the Oscillatory Basis of Movement’ (conference abstract). Bulletin of the Psychonomic
Society 18, 63.
__1995. Dynamic Patterns: The Self-Organization of Brain and Behavior. Cambridge: Bradford Books/MIT
Press.
Kelso, J.a.s., Delcolle, J. & Schîner, G. 1990. Action-perception as a pattern formation process. M.
Jeannerod (ed.). Attention and Performance XIII. Hillsdale: Erlbaum.
Kelso, J.a.s. & Jeka, J.j. 1992. ‘Symmetry breaking dynamics of human multilimb coordination’. Journal of
Experimental Psychology: Human Perception and Performance, 18, 645-668.
Kelso, J.a.s., Buchanan, J.j. & Wallace, S.a. 1991. ‘Order parameters for the neural organization of single
multipoint limb movement patterns’. Experimental Brain Research 85, 432-444.
Kewley-port, D., Watson, C. & Foyle, D. 1988. ‘Auditory temporal acuity in relation to category
boundaries: Speech and nonspeech stimuli’. J. Acous. Soc. Amer.83, 1133-1145.
Klatt, Dennis 1976. ‘Linguistic uses of segmental duration in English: Acoustic and perceptual evidence’.
J. Acous. Soc. Amer. 59, 1208-21.
Ladefoged, Peter & Maddieson , I. 1996. The Sounds of the World’s Languages. Oxford: Blackwell.
Large, E. W. & Jones, M. R. 1999. ‘The dynamics of attending: How we track time-varying events’.
Psychological Review, 106, 119-159.
Large, Ed & Kolen, John 1995. ‘Resonance and the perception of musical meter’. Connection Science 6,
177-208.
Liberman, A. M & Mattingly, I. 1985. ‘The motor theory reconsidered’. Cognition 21,
LIBERMAN, Mark 1978. The intonational system of English. MIT Doctoral Dissertation, Department of
Linguistics. Published by IU Linguistics Club, Bloomington, Indiana.
Liberman, Mark & Prince, Alan 1977. ‘On stress and linguistic rhythm’. Linguistic Inquiry 8, 249-336.
Marcus, S. 1981. ‘Acoustic determinants of perceptual center (P-center) location’. Perception and
Psychophysics 30 :247-256
Martin, James G. 1972. Rhythmic (hierarchical) versus serial structure in speech and other bahavior.
Psychological Review 79, 487-509.
Mcauley, J. Devin & Kidd, Gary 1998. ‘Effect of deviations from temporal expectations on tempo
discrimination of isochronous tone sequences’. J Experimental Psychology: Hum. Perc. Perf, in 121
press.
Murray, J. D 1993. Mathematical Biology, 2d Ed. Berlin: Springer-Verlag.
Newell, Allen & Simon, Herbert 1975. ‘Computer science as empirical enquiry’. Communications of the
Association for Computing Machinery 6, 113-126.
Norton, Alec 1995. ‘Dynamics: An introduction’. In R. Port and T. van Gelder (eds) Mind as Motion:
Explorations in the Dynamics of Cognition, 45-68. Cambridge: Bradford Books/MIT Press.
Pike, Kenneth 1945. The Intonation System of American English. Ann Arbor: Univ. of Michigan Press.
Port, Robert F. 1996. ‘Phonetic discreteness and formal linguistics: Reply to A. Manaster-Ramer’. Journal
of Phonetics 24, 491-511.
__2003. ‘Meter and speech’. Journal of Phonetics, 31, 599-611.
Port, R. F. & Leary, A. 2000. ‘Against formal phonology’. Language, 81, 927-964.
Port, Robert, Cummins, Fred & McAuley, Devin 1995. ‘Naive time, temporal patterns and human
audition’. In R. Port and T. van Gelder (eds) Mind as Motion: Explorations in the Dynamics of
Cognition. Cambridge: Bradford Books/MIT Press.
Port, Robert & Crawford, Penny 1989. ‘Pragmatic effects on neutralization rules’. J. Phonetics 16, 257-
282.
Port, Robert & O’Dell , Michael 1986. ‘Neutralization of syllable-final voicing in German’. Journal of
Phonetics 13, 455-471
Port, R., Dalby, J. & O’Dell , M. 1987. ‘Evidence for mora timing in Japanese’. Journal of the Acoustical
Society of America, 81 :1574-1585.
Port, R. & van Gelder , T. 1995. Mind as Motion: Explorations in the Dynamics of Cognition. Cambridge:
Bradford Books/MIT Press.
Port, R.f., Tajima , K. & Cummins, F. 1996. ‘Self-entrainment in animal behavior and human speech’.
Online Proceedings of the 1996 Midwest Artificial Intelligence and Cognitive Science Conference, http:
//www.cs.indiana.edu/event/maics96/proceedings.html.
__1998. ‘Speech and rhythmic behavior’. in G. J. P. Savelsburgh, H. van der Maas and P. C. L. van Geert
(eds) The Non-linear Analysis of Developmental Processes, 53-78. Amsterdam: Royal Dutch Academy
of Arts and Sciences.
Port, Robert, Al-ani, Salman & Maeda , Shosaku 1980. ‘Temporal compensation and universal phonetics’.
Phonetica 37, 235-266.
Saltzman, E. 1995. ‘Dynamics and coordinate systems in skilled sensorimotor activity’. In R. Port & T.
van Gelder (Eds.), Mind as Motion: Explorations in the Dynamics of Cognition, 150-173. Cambridge:
Bradford Books/MIT Press.
Saussure, Ferdinand 1915/1959. Course in General Linguistics. New York: Philosophical Library.
Schöner, Gregor, Haken, H. & Kelso, J. A. S. 1986. ‘A stochastic theory of phase transitions in human hand
movement’. Biological Cybernetics 53, 442-452.
__1986. ‚A stochastic theory of phase transitions in human hand movement’. Biological Cybernetics; 53,
442-452.
Scholz, J.p., Kelso, J.a.s. & Schöner, G. 1987. ‚Non-equilibrium phase transitions in coordinated biological
motion: Critical slowing down and switching time’. Physics Letters A; 123, 390-394.
Scott, S. K. 1993. P-centers in Speech: An Acoustic Analysis. PhD thesis, University College London.
Stevens, K. N. & Blumstein, Sheila 1981. ‘The search for invariant acoustic correlates of phonetic features’.
In P. Eimas and J. L. Miller Perspectives on the Study of Speech, 1-38. Hillsdale: Erlbaum.
Strogatz, Stephen 1994. Nonlinear Dynamics and Chaos: With Applications to Physics, Biology, Chemistry, and
Engineering. New York: Addison-Wesley.
Tajima, Keiichi & Port, Robert 2003. ‘Speech rhythm in English and Japanese’. In J. Local, R. Ogden & R.
Temple (Eds.), Phonetic Interpretation: Papers in Laboratory Phonology Vol. 6, 317-334. Cambridge:
Cambridge University Press.
Tajima, Keiichi, Port, Robert & Dalby, Jonathan. ‘Effects of speech timing on intelligibility of foreign-
accented English’. Journal of Phonetics 25, 1-24.
Thelen, E. & Smith, L. 1996. The Dynamic Systems Approach to the Development of Cognition and Action.
Cambridge: MIT Press.
Treffner, Paul J. & Turvey, Michael T. 1993. ‘Resonance constraints on rhythmic movement’. J. Experimental
Psychology: Human Perception and Performance 19, 1221-1237.
122 Turvey, M. T. 1990. ‘Coordination’. American Psychology, 45 :938-953
Tuller, B., Kelso, J.a.s. 1990. ‘Phase transitions in speech production and their perceptual consequences’.
M. Jeannerod (ed.) Attention and Performance XIII. Hillsdale: Erlbaum.
van Gelder, Timothy & Port, Robert 1994. ‘Beyond symbolic: Toward a Kama-Sutra of compositionality’.
In Vasant Honavar and Leonard Uhr (eds.) Artificial Intelligence and Neural Networks: Steps Toward
Principled Integration, 107-125. New York: Academic Press.
__1995. ‘It’s about time: Overview of the dynamical approach to cognition’. In Robert Port and Timothy
van Gelder (1995) Mind as Motion: Explorations in the Dynamics of Cognition, 1-43. Cambridge:
Bradford Books/MIT Press.
Van Gelder, Tim 1998. ‘The dynamical hypothesis in cognitive science’. Brain and Behavioral Science, in
press.
Warner, N., Jongman, A., Sereno, J., & Kemps, R. R. 2004. ‘Incomplete neutralization and other sub-
phonemic durational differences in production and perception: Evidence from Dutch’. Journal of
Phonetics, 32, 251-276.
Models of abduction
Paul Bourgine
Introduction
C.S. Peirce had the view that reasoning involves three kinds of inference,
abduction, deduction and induction. His reflections on this question developed
over forty years, at a time when logic in its modern guise started emerging.
Peirce’s thoughts changed substantially during this period, as was shown by
Burks, Fann, Thagard and Anderson. These analysts distinguished two main
phases in Peirce’s reflections about the kinds of reasoning. In the first phase,
before 1900, Peirce offers a syllogistic approach of the three kinds of reasoning;
in the second phase, after 1900, he favors an inferential approach that is closer to
scientific inquiry.
In the syllogistic approach, exemplified by the celebrated Barbara syllogism,
deduction is the type of reasoning which allows deriving, from a major premise
123
(the rule) and a minor premise (the case), a conclusion (the result). Induction
derives by generalization a rule from a collection of observations of case-result
pairs. Abduction starts with the conclusion and the major premise to derive the
minor premise.
In the inferential approach, the function of abduction is to emit a hypothesis,
which can be either a premise, a rule or a theory. The function of deduction is to
draw from a hypothesis its necessary or probable consequences. The function of
induction consists in comparing the predicted consequences with the observed
results. These three kinds of reasoning are considered necessary in scientific
inquiry, abduction being the main instrument in a logic of discovery. When a new
surprising fact is encountered, the first stage in reasoning consists in abducing an
explanatory hypothesis, the second in deducing the testable consequences and
the third in testing these consequences to either confirm or falsify the explanatory
hypothesis. In its inferential approach, the second Peirce distinguishes clearly
these three kinds of inferences and makes their relations explicit.
The aim of this paper is to sketch a theory of abduction with its relations with
deduction and induction in the sense of the second Peirce. Abduction is seen as a
relation reciprocal to deduction, not directly as done by Flach (1996), but in a more
general and sophisticated sense very close to the framework of belief revision
(Alchourron & al. 1985, Gärdenfors 1988, Katsuno & al. 1991). One supplementary
advantage beyond generality is that this conception of abduction is directly
compatible with the conception of induction underlying belief revision.
This theory of abduction will be expressed within models belonging to
different paradigms of cognition, the purpose being to check its adequacy for
the whole field of cognitive science. Three main paradigms are considered: the
cognitivist paradigm which takes knowledge to be symbolic, with validity as a
criterion of success; the connectionist paradigm which substitutes sub-symbolic
states of a neural network to symbols, while retaining the same criterion of
success; the constructivist paradigm which replaces the criterion of validity
with an evolutionary criterion of viability and claims that the symbolic level
to be grounded in the sub-symbolic level. The three parts of the paper are
discussing models of abduction corresponding to these three paradigms. For
the sake of clarity, inferences are not considered to be probabilistic and are only
analyzed in a set-theoretic framework, which supposes that Nature’s responses
are deterministic.
Basic schema
Let us consider a special type of expert, a physician. There is a set of diseases
that are not directly observable; only signs are available to the physician. A
causal relation develops from a hidden disease to a set of observable signs. The
124 physician’s reasoning moves in the converse sense from the observable signs
towards the hidden diseases: she has to “abduct” the hidden hypothesis (disease)
from the observable facts (signs).
What are first to be understood are the constraints that apply to this kind of
reasoning. The most convenient way to express these constraints, as is the case
in other kinds of reasoning, is to spell out the axioms which abduction follows.
A great advantage of such a specification of abductive reasoning is that it can be
falsified by psychological experiments. More precisely, a set of axioms defines
a class of abductive reasonings that can be enlarged by weakening the axioms
or restricted by strengthening them. The above set of axioms seems to be both
interesting for it global properties and plausible:
- A1-Consistency: nothing can be abducted from a contradiction. It is
the dual of the well-known property of deduction: everything can be
deduced from a contradiction.
- A2-Success: every hypothesis can be abducted from itself. Again, this
is the dual of the property that every hypothesis can be deduced from
itself.
- A3-Cautious monotony: if one can abduct a first hypothesis from a fact
and if this hypothesis can be abduced from another hypothesis, one can
abduct both hypotheses from the fact. This principle can be reformulated
by saying that the abductive inference for a hypothesis can be extended to
other underlying facts or hypotheses that might confirm the hypothesis.
- A4-Or: if one can abduct a first hypothesis from a fact and if a second
hypothesis is incompatible with the first one then the conjunction of the
two hypotheses can be abduced from the conjunction of the fact and the
second hypothesis. This principle can be reformulated by saying that, if
many incompatible hypotheses are possible, they can be preserved within
a particular scheme of abduction.
- A5-And: If one can abduct a hypothesis from two facts then one can
abduct it from the conjunction of these two facts.
We suppose that the facts and the hypotheses are represented as propositions
constructed with the help of the classical connectors {¬,∧,∨, →,↔} from a finite
(for the sake of simplicity) set of atomic propositions P={π1,…,πn}. The set W
of possible worlds is the set of interpretations of P, i.e. the set of functions from
P to {T=true, ⊥=false}. To each proposition “a” it is possible to associate the
set of possible worlds ‘A’ that makes the proposition true. Then the syntactic
implication α→β holds if and only if the semantic inclusion Α⊆Β holds between
their corresponding sets of worlds “A” and “B”. And the syntactic equivalence
α↔β holds if and only if the semantic equality A=B holds.
In this framework, we now collect together the previous axioms for
abductive inferences, where “α |< β” means “from a it possible to abduct b or
simpler “from α one abducts β”: 125
A1-Consistency: ¬ (⊥ |< α)
A2-Success: α |< α
A3-Cautious monotony: if α |< β and γ |< β then α |< β∧γ
A4- Weak Or: α |< β and β ∧ γ → ⊥ then α ∧ γ |< β / γ
A5-And: α |< γ and β |< γ then a ∧ β |< γ
Given a preorder relation “≤” on the worlds set W, the meaning of min(β) is clear:
it consists in selecting the set B of worlds where β is true, keeping its minimal
worlds min(B)={w∈B :w’≤w for all w’∈B} and returning to the corresponding
proposition min(β).In this theorem, the concept of a parsimonious hypothesis is
crucial: it is simply an hypothesis which contains only its minimal worlds. Now,
point (i) means that the only relevant hypotheses in abductive reasoning are the
parsimonious hypotheses, and point (ii) means that abduction is the relation
reciprocal to deduction for parsimonious hypotheses.
It is easy to verify that an inference relation which satisfies points (i) and
(ii) when there is a preorder relation in the set of possible worlds is an abductive
inference in the meaning of axioms A1-A5. What the theorem of representation
expresses is that all the abductive inference satisfying A1-A5 have necessarily this
canonical form.
The total preorder “≤” defines a system of spheres on the finite set W of
possible worlds. The inner sphere is the set K of minimal worlds. Then comes the
sphere K1 of minimal worlds of W-K, and K2 of the minimal worlds of W-K-K2,…
These spheres are also defined as the equivalent classes of W by the equivalence
relation “≈” defined by the preorder (w≈w’ iff w≤w’ and w’≤w). Thus it is possible
to define a distance from a set to the set K: the worlds in K1 are at distance “1”
from K, in K2 at “2”, ... And min(B), which is the set of minimal worlds of B, is
also the set of worlds of B with minimal distance from K, which is written as K*B
in the belief revision literature. And α |< β iff K*B ⊆ A.
It is easy to see that A1’ implies A1 (by taking α=β=γ) and A2’ implies A2 (by
taking α=β). And the converse is true if the abductive reasoning follows the
axioms A1-A5. These two properties have interesting interpretations, which
can be also falsified (and in this case the whole set of axioms also falls, by meta-
applying A1’ to the present theory of abduction!):
- The first property, as usual, explains how to falsify a hypothesis: if one
can abduct an hypothesis from a fact, and if a second fact which is a
consequence of both the first fact and the hypothesis is false, then one can
no longer abduct the hypothesis.
- The second one expresses a usual property of abduction: if a hypothesis
implies some consequence, one can abduct this hypothesis when
observing this consequence; but the converse is not generally true: if
one can abduct a hypothesis from a fact, it does not follow that one can
deduce the fact from the hypothesis.
The following representation theorem generalizes a little the one of Flach (Flach
1996), because his axioms are weakened. As already announced, the set K is
trivialized as the whole set W, and thus min(B) is nothing else than B.
Theorem 2 (representation theorem): the abductive reasoning ”|<” follows
the axioms A1-A6 if and only α |< β iff B⊆A , i.e. iff β |− α.
Let us remark that, by adding this new axiom, only one abduction relation remains
which is exactly reciprocal to the deduction relation. But, this presupposes that
beliefs are true, i.e. are knowledge.
Let us come back to the physician’s abductive reasoning. Let I be the set
of the observable signs and S={si} for i∈I the set of the elementary propositions
corresponding to the presence/absence of the possible signs of possible diseases.
Let J the set of possible diseases and H={hj} for j∈J the set of the elementary
propositions corresponding to the presence/absence of the possible diseases (no
matter whether the diseases are pure or combined: if it is a combined disease,
it will be considered as a different disease). Let WS the set of possible worlds
of signs, WH the set of possible worlds of diseases and W=WS × WH the set of
possible worlds, constructed from the elementary propositions: an elementary
world is an interpretation from W into {true, false}. Then the causal relation from
hidden diseases towards observable signs can be expressed as logical rules
hj→bj where “bj” is a proposition in WS which means that if it is the case that the
disease is “j” then necessarily the observable signs satisfy bj or, equivalently, that
the possible world is in Bj.
Now, let us suppose that the physician’s abductive reasoning follows A1-
A6, i.e. her beliefs are true. Then, following the second representation theorem,
when she has the information that the world wS ∈ A, she can abduct the diseases
JA={j ∈ J : Bj ⊆ A}. More precisely, let {At} be the information process through
time “t” of the physician during an interview with a patient: because information
is generally increasing, this sequence of sets {At} is decreasing. At each stage of
this information process, we can define the hypotheses still possible: Jt ={j ∈ J : Bj
⊆ At}. Because the information is increasing, the sequence At is decreasing and
the sequence Jt is also decreasing: thus the sequence converges towards a set of
possible diagnoses (with zero, one, two or more elements). If finally it is true that
all the diseases are known and that there are enough signs (using complementary
analyses if necessary) to discriminate each disease from the others (Bj ∩ Bj’=∅),
then there is an information sequence which converges to the right diagnosis for
this perfectly competent physician.
Let us next suppose that the physician’s abductive reasoning follows
only A1-A5. By the first representation theorem, her belief are structured into
128 successive spheres of beliefs with less and less competence from the inner sphere
K0 towards spheres K1, K2,..., Kn… Thus, for this particular physician, her set
of diseases J is separated into the sets Jn={j ∈ J : K0*Bj ⊆ Kn} and J= ∪n Jn. Then,
with information At at time “t” of the appointment, her set of diagnoses Jt ={j ∈ J :
K0*Bj ⊆ At} has to be separated into the sets of strategies Jtn={j ∈ Jn : K0*Bj m At}.
If Jtn becomes empty or the possibility of serious diseases remains true outside
Jtn then she will probably send the patient to another competent physician.
(i) the admissible spaces are S(Z)=min(S’Z) where S’Z={ s∈S with HsZ≠∅}
(ii) the admissible hypothesis are ∑(Z)=∑ s∈S(Z) HsZ
Conclusion
As stated in the introduction, the purpose was to give to the concept of
abduction a status of the same importance as to deduction. The representation
theorem is the main proposition providing a unification of the meanings of
abduction. This theorem has two equivalent formulations. The first delivers a
set of axioms; it is convenient for an expert or a scientist. The second offers a 135
geometrical understanding of how abduction and deduction are linked through
a system of spheres defined by a total preorder on the possible worlds, which
implies a preference relation on the set of beliefs: any hypothesis contains a
preferred part, its corresponding parsimonious hypothesis; all happens as if the
hypothesis interacted with the system of beliefs only through its parsimonious
hypothesis during abductive inferences; and for parsimonious hypotheses,
abduction is exactly the relation reciprocal to deduction. In other words,
abductive inferences can be performed only with parsimonious hypotheses as
reciprocal to deductive inferences.
Like deduction, abduction is a synchronic relation; there is no change in
the system of beliefs through deduction or abduction. Change occurs only when
a new message is issued by Nature: this moment of induction is followed by a
revision of beliefs that consists in selecting the preferred part of the message.
Thus, if beliefs are structured by a relation of preference, abduction, deduction
and induction have together a simple and natural meaning.
The axioms can be seen as normative or as descriptive. They can have a
normative sense for some human activity like scientific research and other ones
(like in the research of a culprit, when they become more or less explicit conventions
on what should be acceptable abductive reasoning. They can be descriptive and
then can be falsified by psychological experiments: if it were the case, it would be
necessary to weaken some axioms, i.e. to enlarge the theory of abduction.
If the axiomatic approach of abduction is the right approach within the
cognitivist paradigm, the geometric approach is the right approach of abduction
for the connectionist and constructivist paradigms of cognition. Indeed the
geometric approach allows to understand how hypotheses of higher levels
—prototheories— can emerge from sensory experience. This geometric approach
has been discussed for non-parametric models associated to a hierarchy of
spaces of larger and larger networks. A natural hierarchy of hypotheses appears
with the same structure as a system of spheres. Parsimonious and consistent
hypotheses are exactly those obtained by selecting in the hierarchy of spaces the
minimal spaces having at least one model consistent with the observed facts. The
geometrical approach seems to be truly promising for the connectionist and the
constructivist paradigm, even if a few modes of learning have been discussed.
Enlarging the discussion to any kind of learning is a very important challenge for
a better understanding of abduction, as the “mode of reasoning of the living”, as
coined by Peirce.
Notes
1. See the forthcoming CREA-report of Bourgine & Walliser to appear.
References
Alchourron C.E., P. Gärdenfors & D. Makinson 1985. “On the logic of theory of change:
136 partial meet contraction and revision functions”. Journal of Symbolic Logic, 50, 510-
530.
Anderson, Douglas R. 1986. “The Evolution of Peirce’s Concept of Abduction”. Transactions
of the Charles S. Peirce Society 22, no 2, 145-164.
Aubin J.P. 1991. Viability Theory. Boston: Birkhäuser.
Bochereau L. and Bourgine P. 1990. “Extraction of semantic features and logical rules from a
multilayer neural network”. Proceedings IJCNN 90, Washington, DC.
Bochereau L., Bourgine P. and Deffuant G. 1990 “Equivalence between Connectionist
Classifiers and Logical Classifiers”. Lecture Notes in Physics 368, pp. 351-363.
Bourgine P. 1991. Heuristique et Abduction, PhD. Cahiers du LIUC n° 95, Université de
Caen.
Bourgine P. and F. Varela 1992. “Towards a practice of autonomous system”. in F. Varela &
P. Bourgine (eds.). Towards a practice of autonomous system, pp 3-10. Cambridge: MIT
Press.
Bourgine P. and Walliser B. 1992. Economics and Cognitive Science. New York: Pergamon
Press.
Burks, A. 1946. “Peirce Theory of Abduction”. Philosophy of Science 13, pp.301-306.
Cortes C., V. Vapnik 1995. “Support-Vector Networks”. Machine Learning, 20, 273-297.
Fann, K.T. 1970. Peirce’s theory of Abduction. the Hague: Martinus Nijhoff.
Deffuant, G., Fuchs, T., Monneret, E., Bourgine, P., and Varela, F. 1996. “Semi-algebraic
organisms: the morphodynamical network perspective”. Artificial Life 2:157-179.
Flach P.A. 1996. “Abduction and Induction: Syllogistic and Inferential Perspectives”.
Workshop on abduction and induction, Budapest.
Funahashi K.I. 1989. “On the approximate realization of continuous mappings by neural
networks”. Neural Networks, Vol.2, pp. 183-192.
Gärdenfors Peter, 1988. Knowledge in flux. Cambridge: MIT Press.
Katsuno A.&A. Mendelzon, 1991. “Propositional knowledge base revision and minimal
change”. Artificial Intelligence, 52, 263-294.
Kohonen T., 1984. Self-Organization and Associative Memory. New York: Springer Verlag.
Kraus S., D. Lehmann & M. Magidor 1990. “Nonmonotonic reasoning, preferential models
and cumulative logics”. Art. Intel. 44: 167-207.
McClelland James L., David E. Rumelhart 1986 Parallel Distributed Processing: Exploration in
the Microstructure of Cognition. Cambridge: MIT Press
Peirce, C.S. The Collected Papers. Hartshorne, Weiss & Burks (eds.). Cambridge: Harward
University Press.
Rosh, E. 1978. “Principles of Categorization”. in E. Rosh and B.B. Lloyd (eds.). Cognition and
Categorization, pp. 27-48. Hillsdalle: Lawrence Erlbaum.
Santaella, L. 1997. “The development of Peirce’s three types of reasoning: Abduction,
Deduction, and Induction”. IASS International Congress, Mexico, June 1997.
Thagard, Paul R. 1977. “The Unity of Peirce’s Theory of Hypothesis”. Transactions of the
Charles S. Peirce Society 13, no2, 112-123.
Thagard, Paul R. 1981. “Peirce on Hypothesis and Abduction”. Proceedings of the C.S. Peirce
Bicentennial International Congress, K.L. Ketner & al. (eds.), 271-274.
Walliser B., D. Zwirn 1998. “Probabilistic Belief Revision Principles” ENPC Technical Report,
Paris.
137
138
Contributors
João Queiroz
Research Group on History, Philosophy, and Biology Teaching, Institute of
Biology, Universidade Federal da Bahia; Department of Computer Engineering
and Industrial Automation, FEEC, UNICAMP, Brazil.
<queirozj@dca.fee.unicamp.com> <queirozj@gmail.com>
Priscila Farias
Department of Design, National Service for Commerce
Education University (SENAC-SP) and Universidade Federal de Pernambuco;
co-editor of InfoDesign – Brazilian Journal of Information Design, Brazil.
<priscila.lfarias@sp.senac.br> <priscilafarias@uol.com.br>
Claus Emmeche
Center for the Philosophy of Nature and
Science Studies, Niels Bohr Institute, Denmark.
<emmeche@nbi.dk>
Jesper Hoffmeyer
Institute Of Molecular Biology, University Of Copenhagen, Denmark.
<hoffmeyer@mermaid.molbio.ku.dk> 139
Søren Brier
Department of Management, Politics and Philosophy,
Copenhagen Business School; editor and publisher of the interdisciplinary
quarterly journal Cybernetics & Human Knowing, Denmark.
<sbr.lpf@cbs.dk>
Anthony Chemero
Scientific and Philosophical Studies
of Mind Program, Franklin & Marshall College, USA.
<tony.chemero@fandm.edu>
Tim van Gelder
Department of Philosophy, University of Melbourne, Australia.
<tgelder@ariel.unimelb.edu.au>
Robert F. Port
Department of Linguistics and
Department of Cognitive Science, Indiana University, USA.
<port@cs.indiana.edu>
Paul Bourgine
CREA-Ecole Polytechnique, France.
<bourgine@poly.polytechnique.fr>
140