Another paper that I am reading as part of a meta-analysis of second language vocabulary learning. I had started to read this and then paused for three weeks while I read three background theoretical papers (Laufer & Hulstijn, 2001; Hulstijn, 2001; Hulstijn, 2003) that made this one much easier to understand.
This paper is an experimental study in two parts designed to test L&H's involvement load hypothesis. One concern is control of time on task, since this varied in L&H's experimental attempt to assess the involvement load hypothesis. Knight (1994) apparently raises this issue in general for things like dictionary look-up tasks. Throughout, I was concerned with precisely how vocabulary knowledge was being measured. Like Folse (2006), Kim used the Vocabulary Knowledge Scale (VKS; Paribakht & Wesche, 1993), but I still wonder what L&H used; later on it is described as providing L1 translations or English explanations. Laufer's (2003) experiment gave support for different performance based on different levels of involvement load; however, another experiment in the set gave varying performance for three tasks that were supposed to have the same involvement load (distribution was different?). I am keen to know Laufer's explanation of that. Is that paper also on our reading list?
Laufer (2001) apparently indicates that the involvement load construct should generalise from textual to face-to-face audio situations, which I had assumed, but it is good to be able to reference that assertion given the wide range of studies we are applying the concept to. I was unsure of the meaning of interactionally modified input versus interactionally modified output, and in particular the concept of premodified input, although this is in the context of L&H (2001), which I guess I should be reading.
I was concerned about the random assignment implications of the split between the two experiments. One of the experimental groups from the first experiment is compared with a group constructed for the second experiment, which I think was run subsequently and which, although similar, had a slightly different mix of ages and nationalities.
Another concern is that it seems we could explain the results independently of involvement load. In the reading condition the learners' attention is only drawn to the target words through emphasis and glossing. In the gap-fill condition the learners' attention is drawn to 15 words, and in the composition and sentence-writing conditions the learners' attention is drawn to the 10 words they will be tested on. Purely in terms of attention one might expect to see the results that were achieved. In the experiment that tested the three different involvement load levels, the immediate post-test only distinguished the composition group as significantly higher, while the delayed post-test distinguished all three; there was no interaction or main effect for proficiency level. The second experiment made no distinction between the composition and sentence-writing tasks. I had been wondering earlier if the results could all be explained in terms of receptive/productive or active/passive differences, although the significant difference between reading and gap-fill at post-test could not be. Now that I realise there were 15 words being brought to attention in the gap-fill task, it seems that the results can all be explained in terms of attentional resources. Another question is whether the comprehension questions needed understanding of the target words in order to be answered (looking at Appendix B, I would say not really).
I am concerned about the bias of using the VKS tests, and the author expresses some concerns as well. The alleged pedagogical implications sit uneasily with me, since I am not sure that showing a benefit on a VKS test necessarily indicates that the learner has gained something of importance. The key problem here is that the VKS sentence generation task could represent various sorts of ability on the part of the learner, e.g. that they memorized a sentence containing the word versus actually generating a novel sentence. In particular, it seems that if a learner were specifically practicing sentence generation or doing essay composition for a particular set of vocabulary, this would increase performance on the test through a practice effect. It seems obvious that practicing a productive skill would lead to higher performance on productive tests, whereas practicing a receptive skill would lead to benefits on receptive tests. The question I would like answered is what kind of transfer we get across tasks and thus, motivational concerns aside, what the most efficient approach is to maximise ability on both receptive and productive tasks.
Reading proofs of our soon-to-be published paper on vocabulary study (Joseph et al., 2009), I am struck that as we discuss how to make tests more and more challenging, we are not addressing the goal of the language learner. We are arguing that gradually more challenging tasks maintain motivation and boost long-term retention, but the real question should be what long-term task the learner wants to succeed at. Clearly looking up a word in a dictionary can help a learner understand a sentence they are reading. The question is then whether other activity related to that word should be undertaken. The usual argument in L2 is that if nothing else is done then exposure to low-frequency words will be insufficient for the learner to avoid having to look the word up again in future. I guess the real question is whether some sort of "artificial" re-exposure to the word will be a more efficient way of increasing the likelihood of future sentence comprehension, versus using that same time to just do more reading ... and what kind of experiment could actually test which approach was more efficient? I guess one could have learners perform a reading comprehension task, and then have one group perform another reading comprehension task while a second group did vocabulary review, and then both groups would be tested on another reading comprehension task that was of comparable level and contained similar words. So for this kind of experiment we would need three different texts of comparable length, involving the same "target" vocabulary?
Depending on the results of such an experiment, an argument could be made that, although explicit vocabulary study was not recommended, subsequent texts for additional comprehension practice could be selected based on which words a learner looked up, in order to increase the chances of a rewarding experience. That is linked to the overall motivation issue, i.e. should the learner be reading anything other than texts they specifically select themselves?
[A great deal of research has shown that when learners study definitions alone their ability to comprehend text containing the target words does not improve (Graves, 1986; Stahl & Fairbanks, 1986)] from Joseph et al. (2009), so I wonder if doing essay composition or gap filling leads to improvements in text comprehension.
[N.B. The Kim paper also references some more studies showing the importance of negotiation that I was previously associating with Newton (1995), i.e. de la Fuente (2002) and Joe (1995, 1998), although the latter focused on generative rather than negotiated tasks?]
Kim, Y. (2008). The Role of Task-Induced Involvement and Learner Proficiency in L2 Vocabulary Acquisition. Language Learning, 58(2), 285-325. DOI: 10.1111/j.1467-9922.2008.00442.x
My References
Joseph S.R.H., Watanabe Y., Shiung Y.-J., Choi B. & Robbins C. (2009) Key Aspects of Computer Assisted Vocabulary Learning (CAVL): Combined Effects of Media, Sequencing and Task Type. Research and Practice in Technology Enhanced Learning. 4(2) 1-36.
Kim's References
Arlov, P. (2000). Wordsmith: A guide to college writing. Upper Saddle River, NJ: Prentice Hall.
Barcroft, J. (2002). Semantic and structural elaboration in L2 lexical acquisition. Language Learning, 52(2), 323–363.
Baddeley, A. D. (1978). The trouble with levels: A reexamination of Craik and Lockhart’s framework for memory research. Psychological Review, 85, 139–152.
Brown, T. S., & Perry, F. L., Jr. (1991). A comparison of three learning strategies for ESL vocabulary acquisition. TESOL Quarterly, 25, 655–671.
Cho, K-S., & Krashen, S. (1994). Acquisition of vocabulary from the Sweet Valley Kids Series: Adult ESL acquisition. Journal of Reading, 37, 662–667.
Craik, F. I. M., & Lockhart, R. S. (1972). Levels of processing: A framework for memory research. Journal of Verbal Learning and Verbal Behavior, 11, 671–684.
Craik, F. I. M., & Tulving, E. (1975). Depth of processing and the retention of words in episodic memory. Journal of Experimental Psychology: General, 104, 268–294.
de la Fuente, M. J. (2002). Negotiation and oral acquisition of L2 vocabulary: The roles of input and output in the receptive and productive acquisition of words. Studies in Second Language Acquisition, 24, 81–112.
Ellis, N. C. (2001). Memory for language. In P. Robinson (Ed.), Cognition and second language instruction (pp. 33–68). Cambridge: Cambridge University Press.
Ellis, R., & He, X. (1999). The role of modified input and output in the incidental acquisition of word meaning. Studies in Second Language Acquisition, 21, 285–301.
Ellis, R., Tanaka, Y., & Yamazaki, A. (1994). Classroom interaction, comprehension, and L2 vocabulary acquisition. Language Learning, 44, 449–491.
Howell, D. C. (2002). Statistical methods for psychology (5th ed.). Pacific Grove, CA: Duxbury.
Hulstijn, J. H., Hollander, M., & Greidanus, T. (1996). Incidental vocabulary learning by advanced foreign language students: The influence of marginal glosses, dictionary use, and reoccurrence of unknown words. The Modern Language Journal, 80, 327–339.
Hulstijn, J. H., & Laufer, B. (2001). Some empirical evidence for the involvement load hypothesis in vocabulary acquisition. Language Learning, 51, 539–558.
Joe, A. (1995). Text-based tasks and incidental vocabulary learning. Second Language Research, 11, 149–158.
Joe, A. (1998). What effects do text-based tasks promoting generation have on incidental vocabulary acquisition? Applied Linguistics, 19, 357–377.
Knight, S. M. (1994). Dictionary use while reading: The effects on comprehension and vocabulary acquisition for students of different verbal abilities. Modern Language Journal, 78, 285–299.
Laufer, B. (2000). Electronic dictionaries and incidental vocabulary acquisition: Does technology make a difference? In U. Heid, S. Evert, E. Lehmann, & C. Rohrer (Eds.), EURALEX (pp. 849–854). Stuttgart: Stuttgart University Press.
Laufer, B. (2001). Reading, word-focused activities and incidental vocabulary acquisition in a second language. Prospect, 16(3), 44–54.
Laufer, B. (2003). Vocabulary acquisition in a second language: Do learners really acquire most vocabulary by reading? Some empirical evidence. Canadian Modern Language Review, 59, 567–587.
Laufer, B., & Hulstijn, J. H. (2001). Incidental vocabulary acquisition in a second language: The construct of task-induced involvement. Applied Linguistics, 22, 1–26.
Luppescu, S., & Day, R. R. (1993). Reading, dictionaries and vocabulary learning. Language Learning, 43, 263–287.
Nassaji, H. (2002). Schema theory and knowledge-based processes in second language reading comprehension: A need for alternative perspectives. Language Learning, 52(2), 439–482.
Nation, P. (2001). Learning vocabulary in another language. Cambridge: Cambridge University Press.
Newton, J. (1995). Task-based interaction and incidental vocabulary learning: A case study. Second Language Research, 11, 159–177.
Paribakht, T. S., & Wesche, M. (1993). The relationship between reading comprehension and second language development in a comprehension-based ESL program. TESL Canada Journal, 11, 9–29.
Paribakht, T. S., & Wesche, M. (1997). Vocabulary enhancement activities and reading for meaning in second language vocabulary acquisition. In J. Coady & T. Huckin (Eds.), Second language vocabulary acquisition: A rationale for pedagogy (pp. 174–200). Cambridge: Cambridge University Press.
Pulido, D. (2003). Modeling the role of second language proficiency and topic familiarity in second language incidental vocabulary acquisition through reading. Language Learning, 53(2), 233–284.
Read, J. (2000). Assessing vocabulary. Cambridge: Cambridge University Press.
Rott, S. (2004). A comparison of output interventions and un-enhanced reading conditions on vocabulary acquisition and text comprehension. The Canadian Modern Language Review, 61(2), 169–202.
Rott, S., Williams, J., & Cameron, R. (2002). The effect of multiple-choice L1 glosses and input-output cycles on lexical acquisition and retention. Language Teaching Research, 6, 183–222.
Stahl, S. A., & Clark, C. H. (1987). The effects of participatory expectations in classroom discussion on the learning of science vocabulary. American Educational Research Journal, 24(1), 541–555.
Waring, R., & Takaki, M. (2003). At what rate do learners learn and retain new vocabulary from reading a graded reader? Reading in a Foreign Language, 15(2), 130–163.
Wesche, M., & Paribakht, T. S. (1996). Assessing second language vocabulary knowledge: Depth vs. breadth. Canadian Modern Language Review, 53, 13–39.