Professional Documents
Culture Documents
footnotes) make it difficult to identify a text as a novel in the traditional sense at all
and to regard the novel as ‘a text.’ Still, the systematic and recurrent integration of
non-verbal and non-narrative elements in novelistic narration makes it necessary to
expand the notion of ‘the novel’ and ‘the literary text’ beyond word-based forms of
representation and meaning-making. On the one hand, the kind of novel described
above obviously constitutes a sub-genre, categorized as ‘the multimodal novel.’ On
the other hand, the integration of non-linguistic signs and even whole ‘texts’ calls
for a re-conceptualization of ‘diegesis’ in literary studies in general and of novelistic
narration and the theory of the novel in particular (cf. Hallet 2009, 149–152; Hallet
2014, 168–169).
It is important to note that in the kind of novel that is labelled as ‘multimodal,’
these non-linguistic elements are not extra-textual, additional illustrations (like, e.g.,
illustrated editions of novels) or complementary editorial elements, and that they are
substantially different from paratexts. Rather, there are two defining features that
make a novel ‘multimodal.’ Narratologically speaking, these other, non-linguistic or
non-narrative elements
– are distinct visual or textual entities that form an integral part of the narrative dis-
course; they are at the narrator’s disposal and displayed as the narrative unfolds.
Often, the verbal narrative refers to them or addresses them more or less explicitly;
– are an intrinsic part of the fictional world at different diegetic levels. They are
artifacts that are produced, used and located in the fictional world of the novel
and are thus related to the characters’ actions and perceptions or to the narrator’s
ways of thinking, communicating and making sense of the world.
In other words: Unlike in the traditional (‘monomodal’) novel, the fictional world is
not represented and constituted in verbal form only. The integration of different types
of symbolization and semiotic forms leads to a multisemiotic, more comprehensive, at
times also more authentic representation of the fictional world that tries to imitate or
resembles the multifarious ways in which the non-fictional reality is perceived by the
reader and in which knowledge and experiences are represented and communicated
in the reader’s lifeworld. Therefore, it is appropriate to categorize this type of multi-
semiotic narrative as a literary subgenre and designate it as the multimodal novel (cf.
Hallet 2009a, 2011a, 2011b; Gibbons 2010a, 99; Gibbons 2010b, 287; Nørgaard 2010b).
Michael Ondaatje’s Running in the Family (1982; Hallet 2015), Mark Z. Dan-
ielewski’s House of Leaves (2000; Gehring 2009; Gibbons 2012, 46–85), W. G. Sebald’s
Austerlitz (2002; cf. sections 2 and 3 in Denham and McCulloh 2006; Hallet 2011a,
2014), Mark Haddon’s The Curious Incident of the Dog in the Night-Time (2003; cf.
Hallet 2014) or Jonathan Safran Foer’s Extremely Loud & Incredibly Close (2005; cf.
Hoth 2006; Glorig 2007; Hallet 2009, 2014; Nørgaard 2010a; Gibbons 2012, 127–166)
are among the most renowned and popular multimodal novels. However, Young Adult
Novels and fictional autobiographies in diary form in particular appear to be the glob-
ally most successful type of multimodal novel. Sherman Alexie’s The Absolutely True
Diary of a Part-Time Indian and Jeff Kinney’s Diary of a Wimpy Kid engage in obviously
highly recognizable and popular communicative and notational practices and styles
of self-narration, imitating the handwritten diary, scrapbook practices, occasional
lifeworld-related graphics (such as, for instance, teacher or peer caricatures) and inte-
grating more ambitious forms of hand-drawn portraits or comics.
Fig. 1: Multimodal sample page from The Selected Works of T. S. Spivet.
Larsen 2009, 3.
A brief description and analysis of a prototypical page from Reif Larsen’s novel The
Selected Works of T. S. Spivet (2009) may serve to demonstrate how a multimodal
novel is composed and narrates (fig. 1). The main body of the page consists of verbal
narrative discourse in the traditional style, presenting the autodiegetic narrator and
his family; the first sentence also introduces the situation (the phone call) that trig-
gers the young narrator’s adventurous journey to the East. The second paragraph
further specifies the narrator’s character and simultaneously serves as a backdrop
to the visual elements on the page since the narrator describes his habit of mapping
everything he experiences: “maps of people doing things” in blue, “zoological, geo-
logical and topographical maps” in green and anatomic sketches of insects in red
notebooks. The verbal text shares the main body of the page with a topographical
map of the narrator’s home region in Montana in which his hometown Divide and
his parents’ Coppertop Ranch is located. Later in the novel, U.S. America’s east-west
divide is thematized as an existential experience and a determining factor in the col-
onisation of the American West (cf. Hallet 2014, 158–159).
A detailed plan of the narrator’s bedroom with the color-coded bookshelves
authenticates his claim that his life is organized around his mapping practices and
the art of drawing maps and scientific sketches. This room plan is placed in the
margin, which the narrator continuously uses for remarks, additional notes, reflec-
tions and excerpts from his notebooks, as in the example in fig. 1, where more details
about the young topographer’s equipment that he stores in his room are disclosed.
Sometimes the margin is also an important space to present graphic or topograph-
ical details, genealogical or personal information about other characters or about
the narrator’s (Spivet’s) mapping practices. The sketch of a sparrow skeleton at the
bottom of the margin alludes to and explains the background to the narrator’s second
name, ‘Sparrow,’ and is a demonstration of the narrator’s skill in the art of scientific
drawing. Often, as in the line between the topographical map and the map of the
bedroom at the top of the page, a particular item in the main text is assigned to a mar-
ginal detail through a dotted line; in other cases, particularly in the middle part of the
novel containing the journey, the margin is used as a navigation device that provides
the reader with geographical information about the narrator’s east-west itinerary and
the locations and places he visits or passes.
The sample page from Larsen’s novel demonstrates to what extent the way the
storyworld is represented and constituted differs from the monomodal novel; but it
also becomes evident at first sight that what is commonly regarded as ‘narrative dis-
course’ takes on a completely different shape. It is not only delinearized, but impor-
tant details concerning the setting, the characters or their actions and even the nar-
rator’s thoughts, reflections and knowledge are now, through the layout, relegated
to specific places on the page and presented in various symbolic forms, urging the
reader to decode other ‘languages’ and transforming reading into a hypertextual
activity. This is why the shift from monomodal to multimodal storytelling in the novel
affects both story and discourse, the representation of the storyworld and the (delin-
earized) presentation of information on the page, the creation of a narrative and the
act of reading (cf. sections 3 and 4 for details).
image / text problem is not just something constructed ‘between’ the arts, the media, or differ-
ent forms of representation, but an unavoidable issue within the individual arts and media. In
short, all arts are ‘composite’ arts (both text and image); all media are mixed media, combining
different codes, discursive conventions, channels, sensory and cognitive modes. (Mitchell 1995,
94–95)
The social-semiotic theory of multimodality has taken up this notion of the intrinsic
combination of ‘different codes’ and ‘modes’ involved in a single act of signification
and communication.
On the other hand, the emergence of new multimedia technologies, and the elec-
tronic hypertext in particular, has led to the insight that theories of symbolic rep-
resentation and communication need to account for the combination of different
media and symbolic forms in displays and environments in which ‘meaning’ can no
longer be explained as resulting solely from the natural human language alone. In
electronic multimedial environments, the contribution of other codes and symbolic
languages such as, e.g., sound and music, maps and diagrams, photographs and
moving images (as in videos, for example), needs to be considered, too: “Multimodal
production is now a ubiquitous fact of representation and communication. That
forces us urgently to develop precise tools requisite for the description and analysis of
texts and semiotic entities of contemporary communication.” (Kress 2010, 102) There-
fore, any theory of cultural semiosis and communication must explain and describe
how meaning is made across (and simultaneously through) a variety of different
semiotic systems, medial and generic modes, and how a combination of all of these
modes and media is able to produce one whole, integrated meaning. This applies to
both, single successful and efficient (multimodal) acts of communication, as, e.g. in
a newspaper article that combines verbal text, a diagram and a photograph, and to
the broader production of cultural meaning in discourses which “may be realised in
different ways. The ‘ethnic conflict’ discourse of war, for instance, may be realised as
(part of) a dinner-table conversation, a television documentary, a newspaper feature,
an airport thriller, and so on.” (Kress and van Leeuwen 2001, 5)
In social semiotics, the concept of multimodality is closely tied to a functional
definition of the semiotic mode. It is regarded as “a socially shaped and culturally
given resource for making meaning in representation and communication” (Kress
2010, 53), which is “used in recognisably stable ways as a means of articulating
discourse” (Kress and van Leeuwen 2001, 25). Although the status of some of these
resources (like, e.g., color or typography) remains rather unclear (cf. Bucher 2011,
131; Nørgaard 2010b), the vast majority of modes, particularly in the context of her-
meneutic approaches, can be seen as textual or medial (generic) entities. By contrast,
media are defined as merely physical and material resources “used in the production
of semiotic products and events, including both the tools and the materials used (e.g.
the musical instrument and the air; the chisel and the block of wood)” (Kress and
van Leeuwen 2001, 22), as is the case for an oil painting on canvas, a black and white
paper-print photograph, a printed newspaper article, a video blog in the electronic
environment of the Internet and so forth. Thus, a semiotic mode is always tied to a
specific material or medial carrier, but media in themselves do not produce meaning.
This is a substantial conceptual difference between intermediality theories and mul-
timodality theories (↗0 Introduction). Whereas in the former the verbal text and a
visual image are regarded and described as different, interrelated media, text-image
relations in the multimodal novel (as in multimodal texts in general) are not concep-
tualized as intermedial relations, but as an interplay of two distinct semiotic modes
(textual entities) in the same ‘medium,’ i.e. the printed book, which jointly contribute
to the production of one whole meaning in a single act of communication (cf. Bucher
2011, 125). In multimodality theories, therefore, the emphasis is on the meaning that
the single mode (textual entity) produces, and on the combination of various modes
that result in one (transmodal) meaning of a multimodal text. An analysis of the mul-
timodal novel has to account for the contribution of these single semiotic modes to
the constitution and characterization of signifying, communicative and socio-cultural
practices in the fictional world as well as to the kind of meaning it produces there, and
the role of a specific mode in the text-reader interaction (cf. sections 3 and 4).
Because of all of these observations, the multimodal novel can, on the one hand,
be regarded as just one instance of multimodality, participating in a general cul-
tural practice of signifying and making meaning and as cultural and communicative
normality. On the other hand, because of its specificity as a literary text, it can be
expected to not only mirror such social and cultural practices in fictional form, but
also to reflect and comment upon them critically and in a self-reflexive manner. The
narratological implications are multifarious, since the multimodality of novelistic
narration affects the way readers imagine the storyworld and its various constituents,
while also altering the reader’s pathways of making sense of signs. These are now not
only arranged in a hypertextual manner, as opposed to the linearity of signs in the
conventional, word-based novel, but they also belong to different sign systems which
the reader must be able to decode.
keep in touch with him after leaving his father; furthermore, these letters also trigger
his father’s confession that he killed the dog (Haddon 2004, 150). Finally, these letters
also lead to Christopher’s decision to travel to London in order to live with his mother
(Haddon 2004, 161–163). Once again, this journey is not only related in words, but also
represented through a large number of visual elements, maps in particular, but also
other modes like traffic or Underground railway signs, electronic timetables and so
forth. Therefore it can be contended that in this multimodal novel (as in many others)
non-verbal semiotic forms represent actions, key-stages of and central objects in the
story, whereas in the traditional novel actions and the story are exclusively rendered
in verbal form.
(2) Character: Apart from representing key elements and stages of the murder
mystery case and of the subsequent family story, the non-verbal semiotic modes are
of paramount importance for the narrator’s and protagonist’s ways of making sense
of the world. Since Christopher shows deviant forms of world apprehension and signs
of autism, resorting to non-linguistic ways of understanding, thinking, communi-
cating and expressing himself is this character’s cognitive and communicative key
strategy. For instance, a set of smileys helps him to typify people’s emotions, which
are otherwise difficult or almost impossible for him to decode (Haddon 2004, 2–3).
Since problem solving is a constant challenge in his everyday life, he has developed
cognitive excellence in reasoning, logical, mathematical and algorithmic thinking.
All of these cognitive activities are conducted in formal languages, which he also pre-
sents in the course of and as an intrinsic part of the narrative discourse. This is why
a key-decision in his life – whether to stay with his father or with his mother in the
future – is presented in the form of an algorithm (Haddon 2004, 162–163). This way,
readers have direct access to the narrator’s mind and the specific epistemological
tools Christopher applies to solve all the puzzles and mysteries in his life; readers
are able to observe ‘how the mind works’ (another of Christopher’s major areas of
interest; cf. Haddon 2004, 146). Thus the semiotic modes in this novel represent the
narrator’s (often solipsistic) cognition and one of this character’s important (or even
pre-dominant) features. Since he frequently also reflects upon his own cognitive abil-
ities and thinking strategies, they also form a dominant dimension in his personality
and identity development – one of the reasons why this novel can also be regarded as
a Bildungsroman.
(3) Social and Communicative Practices: Whereas in the traditional novel readers
access the fictional world via linguistic signs only, the multimodal novel makes it
possible to study all sorts of semiotic and communicative practices and artifacts
employed or produced in the textual world in a direct, unmediated way. Thus readers
are able to observe and recognize a multitude of semiotic and social practices in the
storyworld and relate them to those in their own lifeworld. For instance, the mul-
timodal youth novel in particular imitates, mirrors or popularizes youth cultural
practices like keeping diaries, creating scrapbooks or designing comics. However,
the integration of non-verbal semiotic forms in the novel not only serves to represent
fictional world
reader,
reading as a multiliterate act
reader has to study and interconnect to make meaning (cf. Gehring 2009; Gibbons
2010b; Gibbons 2012, 46–85).
A more recent and most advanced type of multimodal novel even expands nar-
ration and the act of reading beyond the medium of the printed book (i.e. ‘medium’
in terms of the physical and material quality of the carrier of a mode) and refers the
reader to the Internet in order to involve them in electronic interactive formats like
websites or blogs. Nota bene, these Internet formats are not non-diegetic additional
components (like, e.g., an interview with the author or some kind of background
information); instead, they are existents in the fictional world and part of the diegesis
so that, for instance, the reader is enabled to communicate with characters from the
novel or with the narrator, or to look at artifacts and documents from the fictional
world. In such novels, like Jeffery Deaver’s Road Side Crosses or Jennifer Cowan’s
Earthgirl, narration and reading transgress the medial boundary of the printed book
and become truly transmedial.
Therefore, ‘reading’ multimodal novels not only requires multiple literacies, i.e.
the reader’s ability to decipher a large variety of codes and symbolic languages, but
it also transforms the act of ‘reading’ into a hypertextual activity and the reader into
a ‘user’: The transmodal construction of coherent narrative discourse and meaning
now depends on the reader’s reading paths and decision-making. The reader is con-
stantly challenged to proceed from one semiotic element on the page to the other,
to identify or define the kind of interrelation between these different elements (cf.
section 2) and to assign meaning to the single semiotic element as well as to the com-
bination of all of these modes on a page and in the book as a whole.
However, a lot of these novels also develop a considerable semiotic and episte-
mological skepticism. While, in Larsen’s novel, Spivet is completely obsessed with
mapping practices, he is, at the same time, aware of their short-comings. He accuses
George Washington of producing and “imagining all sorts of false geographies”
(Larsen 2009, 33) and generalizes his observations into an epistemological and polit-
ical critique of cartography as a cultural practice in a historical perspective: “[T]hese
early cartographers of the Corps of Topographical Engineers […] were conquerors in
the most basic sense of the word, for over the course of the nineteenth century, they
slowly transferred the vast unknown continent piece by piece into the great machine
of the known, of the mapped, of the witnessed – out of the mythological realm of
empirical science.” (Larsen 2009, 16)
In the same vein, the protagonist of Sebald’s novel not only gives up his photo-
graphic project (handing over his huge collection of photographs to the anonymous
narrator), but his confidence in languages and signs in general is completely lost
when his writing project fails too: “[T]he exposition of an idea by means of a certain
stylistic facility […] now seemed to me nothing but an entirely arbitrary or deluded
enterprise. I could see no connections any more, the sentences resolved themselves
into a series of separate words, the words into random sets of letters, the letters into
disjointed signs.” (Sebald 2002, 175–176) This way, multimodal novels are often also
meta-semiotic and meta-cultural narratives that reflect upon the semiotic and com-
municative practices in which they engage. Multimodality in the novel is therefore
not only a way of representation and storytelling, but also of epistemological and
meta-semiotic reflection and critique.
6 Bibliography
6.1 Works Cited
Alexie, Sherman. The Absolutely True Diary of a Part-Time Indian. New York: Little, Brown Books for
Young Readers, 2007.
Bucher, Hans-Jürgen. “Multimodales Verstehen oder Rezeption als Interaktion: Theoretische und
empirische Grundlagen einer systematischen Analyse der Multimodalität.” Bildlinguistik:
Theorie – Methoden – Fallbeispiele. Ed. Hajo Diekmannshenke, Michael Klemm, and Hartmut
Stöckl. Berlin: Erich Schmidt, 2011. 123–156.
Chan, Eveline. “Integrating Visual and Verbal Meaning in Multimodal Text Comprehension: Towards
a Model of Intermodal Relations.” Semiotic Margins. Meaning in Multimodalities. Ed. Shoshana
Dreyfus, Susan Hood, and Maree Stenglin. London: Continuum, 2011. 144–167.
Cowan, Jennifer. Earthgirl. Toronto: Groundwood, 2009.
Danielewski, Mark Z. House of Leaves. A Novel. New York: Pantheon, 2000.
Deaver, Jeffery. Roadside Crosses. London: Hodder & Stoughton, 2009.
Denham, Scott, and Mark McCullough, eds. W. G. Sebald: History, Memory, Trauma. Berlin and New
York: De Gruyter, 2006.
Foer, Jonathan Safran. Extremely Loud & Incredibly Close. London: Penguin, 2005.
Mitchell, W. J. T. Picture Theory: Essays on Verbal and Visual Representation. Chicago: University of
Chicago Press, 1995.
Nørgaard, Nina. “Multimodality and the Literary Text: Making Sense of Safran Foer’s Extremely Loud
and Incredibly Close.” New Perspectives on Narrative and Multimodality. Ed. Ruth Page. London
and New York: Routledge, 2010a. 115–126.
Nørgaard, Nina. “Modality: Commitment, Truth Value and Reality Claims Across Modes in Multimodal
Novels.” Journal for Literary Theory 4.1 (2010b): 63–80.
Norris, Sigrid. “Multimodal Discourse Analysis: A Conceptual Framework.” Discourse and
Technology: Multimodal Discourse Analysis. Ed. Philip LeVine and Ron Scollon. Washington:
Georgetown University Press, 2004. 101–115.
Ondaatje, Michael. Running in the Family. 1982. London: Vintage, 1993.
Page, Ruth, ed. New Perspectives on Narrative and Multimodality. London and New York: Routledge,
2010.
Ryan, Marie-Laure. Narrative as Virtual Reality. Immersion and Interactivity in Literature and
Electronic Media. Baltimore and London: Johns Hopkins University Press, 2001.
Ryan, Marie-Laure. “Cognitive Maps and the Construction of Narrative Space.” Narrative Theory and
the Cognitive Sciences. Ed. David Herman. Stanford: CSLI, 2003. 214–242.
Ryan, Marie-Laure. “Fiction, Cognition and Non-Verbal Media.” Intermediality and Storytelling. Ed.
Marie-Laure and Marina Grishakova. Berlin and New York: De Gruyter, 2010. 8–26.
Ryan, Marie-Laure, and Marina Grishakova, eds. Intermediality and Storytelling. Berlin and New
York: De Gruyter, 2010.
Sebald, W. G. Austerlitz. London: Penguin, 2002. [German original 2001].
Scollon, Ron, and Philip LeVine. “Multimodal Discourse Analysis as the Confluence of Discourse and
Technology.” Discourse and Technology: Multimodal Discourse Analysis. Ed. Philip LeVine and
Ron Scollon. Washington: Georgetown University Press, 2004. 1–6.
Van Leeuwen, Theo. Introducing Social Semiotics. London and New York: Routledge, 2005.
Ventola, Eija, Cassily Charles, and Martin Kaltenbacher, eds. Perspectives on Multimodality.
Amsterdam and Philadelphia: John Benjamins, 2004.
6.2 Further Reading
Baldry, Anthony, and Paul J. Thibault. Multimodal Transcription and Text Analysis: A Multimedia
Toolkit and Coursebook with Associated On-line Course. London: Equinox, 2006.
Bateman, John. Multimodality and Genre: A Foundation for the Systematic Analysis of Multimodal
Documents. Houndmills and New York: Palgrave Macmillan, 2008.
Herman, David, and Ruth Page. “Coda/Prelude: Eighteen Questions for the Study of Narrative and
Multimodality.” New Perspectives on Narrative and Multimodality. Ed. Ruth Page. London and
New York: Routledge, 2010. 217–220.
O’Halloran, Kay L., and Bradley A. Smith. “Multimodal Studies.” Multimodal Studies: Exploring
Issues and Domains. Ed. Kay L. O’Halloran and Bradley A. Smith. London and New York:
Routledge, 2004. 1–13.