Professional Documents
Culture Documents
1 Introduction
Oral interviews have been the preferred method for the assessment of
foreign or second oral language ability since the inception of modern
Address for correspondence: Ana Maria Ducasse, La Trobe University, Spanish Program, School
of Historical and European Studies, Victoria 3086, Australia; email: A.Ducasse@latrobe.edu.au
V Methodology
A method that is increasingly being used in language testing studies
to gain insight into the rating activity is the use of verbal protocols
1 The raters were dispersed internationally and around Australia making it impossible to provide
face-to-face training.
Ana Maria Ducasse and Annie Brown 431
2 The full set of data was used to develop the rating scale, see Ducasse
(2008).
432 Assessing paired orals
3 Storch (2001) indicates that in discourse studies, the level of agreement is often in the vicinity of
coding of questions under both turn taking and topic cohesion. This
problem is acknowledged by Galaczi (2004, p. 97), who noted that
the multi-functionality of questions as both topic management and
conversation management devices caused some discrepancy in the
coding. Ultimately, in the present study the two categories were
joined to become interactional management, but including turn tak-
ing and topic cohesion, as subcategories.
As the level of inter-coder agreement was considered adequate,
the remaining two thirds of the data were recoded only where the
categories had been revised.
VIII Results
The final coding was carried out with three main categories of inter-
actional features. Each of these is defined and discussed below with
examples taken from the rater verbal-protocol transcriptions.
Each of the three categories is listed with its subcategories
before being defined. The first category is non-verbal interpersonal
communication. It consists of two subcategories: gaze and body lan-
guage. The second category is interactive listening with two subcate-
gories: supportive listening and comprehension. The third category
is interactional management with two subcategories: horizontal
and vertical management. The final category encompassed all the
comments that were irrelevant to interaction. These are not elabo-
rated further here, but included comments on linguistic resources or
vocabulary, appraisals of the performance, or the candidates level
of acquaintanceship. They also included comments on the exam or
the task.
2 Interactive listening
The second category was drawn from the comments made by raters
on the candidates manner of displaying attention or engagement
while listening during the interaction. Listening as part of successful
interaction was divided into two subcategories: comprehension, and
a different type of listening termed supportive listening.
The first subcategory, comprehension, is a means of showing
engagement, of giving encouragement for the speaker to continue
or demonstrating comprehension via verbal support. Comments fell
into this category when raters noticed that candidates are filling a
silence, asking for clarification or comprehension.
In the case of filling a silence the raters noticed that a candidate
provided the word the other partner was searching for, as in the
example: She sometimes filled in with a missing word to help
(Rater 1, Pair 3). This showed that the partner has been attending and
comprehended sufficiently to predict a missing word, thus enabling
Ana Maria Ducasse and Annie Brown 435
3 Interactional management
The third feature of successful interaction identified by the raters
emerged from comments on the management of the topics and turns.
This can be theorized from different perspectives. Between adjacent
turns it could be viewed as horizontal management that makes
the conversation flow. However, across topics it could be termed
vertical management that exhibits a flexibility that allows switch-
ing between topics. The raters comments on turn taking show how
interaction is managed horizontally. Elements connected to speaker
change such as speed of response, turn length or domination come
under this category, for example:
They are incapable of comprehending and replying quickly. He takes a long
while to answer. (Rater 10, Pair 2)
It is important to listen and to allow time to respond, to be sensitive to taking
turns and not dominating. (Rater 10, Pair 17)
These comments refer to two aspects of turn taking. First, there is
the need to reply within a reasonable time. Second, it is necessary to
leave time for the other to respond.
A second type of interactional management connected topics
vertically down the complete oral text. Raters commented on candi-
dates ability to connect topics:
She is following the conversation and as a consequence it is all interrelated.
(Rater 1, Pair 2)
They commented on candidates ability to develop the conversation
by extending the topic:
He finds something related to what has gone before. (Rater 1, Pair 2)
And they commented on whether candidates facilitated interaction:
Topics are connected and he is following the conversation he tries to make
sense with the person on the right. (Rater 6, Pair 15)
In this conversation they are asking the right kinds of questions and it
gives coherence to what they are talking about; quite good for beginners.
(Rater 9, Pair 2)
Ana Maria Ducasse and Annie Brown 437
VI Discussion
In this section the results from the final reduction of the data to
the three categories are discussed: first interpersonal non-verbal
communication, then interactive listening and finally interactional
management skills.
IX Conclusion
Egyud and Glover (2001) posed the question that if paired interaction
is, in fact, different from interview interaction, how should it be
accounted for in rating scales? This study has provided an initial
response to this question. It has done so by drawing on verbal reports
440 Assessing paired orals
X References
Berry, V. (1997). Ethical considerations when assessing oral proficiency in pairs.
In A. K. Huhta, V. Kurki-Suonio, & S. Luoma (Eds.), Current develop-
ments in language testing. Jyvaskyla: Jyvaskyla University Press.
Brown, A. (2000). An investigation of the rating process in the IELTS speaking
module. In Tulloch, R. (Ed.), IELTS research reports, Vol. 3, ELICOS,
Sydney.
Brown, A. (2003). Interviewer variation and the co-construction of speaking
proficiency. Language Testing, 20(1), 125.
Brown, A. (2005). Interviewer variability in oral proficiency interviews.
Frankfurt am Main: Peter Lang.
Brown, A., Iwashita, N., & McNamara, T. (2005). An examination of rater ori-
entations and test-taker performance on English-for-academic-purposes
speaking tasks (TOEFL Monograph No. TOEFL-MS-29), Educational
Testing Service, Princeton, NJ.
Csepes, I. (2002). Measuring oral proficiency through paired performance.
Unpublished PhD dissertation, Etvs Lornd University, Budapest.
Cumming, A., Kantor, R. & Powers, D.E. (2002). Decision making while rat-
ing ESL/EFL writing tasks: A descriptive framework. Modern Language
Journal, 86, 6796.
Ducasse, A. (2007). How do candidates view interaction in a paired oral?
In C. Gitsaki (Ed.), Language and Languages: Global and Local Tensions
(pp. 184200). Newcastle, UK: Cambridge Scholars Publishing.
Ducasse, A. (2009). An empirically based rating scale for interaction in a
Paired Test. In Brown, A., & Hill, K. (Eds.), Tasks and criteria in per-
formance assessment: Proceedings of the 28th Annual Language Testing
Research Colloquium (pp. 122). Frankfurt: Peter Lang.
Egyud, G., & Glover, P. (2001). Oral testing in pairs: A secondary school per-
spective. ELT Journal, 55(1), 7076.
Ericsson, K., & Simon, H. (1993). Protocol analysis: Verbal reports as data
(revised edition). Cambridge, MA: MIT Press.
ffrench, A. (1999). Study of qualitative differences between CPE individuals
and paired test formats. Internal UCLES EFL Report.
Folland, D., & Robertson, D. (1976). Towards objectivity in group oral testing.
ELT Journal, 30, 156167.
Fulcher, G. (1996). Does thick description lead to smarter tests? Language
Testing, 13(2), 2351.
Fulcher, G. (2003). Testing second language speaking. London: Pearson
Education.
Galaczi, E. (2004). Peer-peer interaction in a paired speaking test: The case of
the First Certificate in English. Unpublished PhD dissertation, Teachers
College, Columbia University, New York.
Gardner, R. (2001). When listeners talk: Response tokens and listener stance.
Amsterdam: John Benjamins.
Gass, S., & Varonis, E. (1985). Task variation and non-native/non-native
negotiation of meaning. In S. Gass & C. Madden (Eds.), Input in second
language acquisition (pp. 149161). Rowley, MA: Newbury House.
442 Assessing paired orals