
The Relationship between Depth of Vocabulary Knowledge and L2 Learners' Lexical Inferencing Strategy Use and Success
Hossein Nassaji
This study examines the relationship between ESL learners' depth of vocabulary knowledge, their lexical inferencing strategy use, and their success in deriving word meaning from context. Participants read a passage containing 10 unknown words and attempted to derive the meanings of the unknown words from context. Introspective think-aloud protocols were used to discover the degree and types of inferencing strategies learners used. The Word-Associate Test (WAT) (Read, 1993) was used to measure the learner's depth of vocabulary knowledge. Results indicate a significant relationship between depth of vocabulary knowledge and the degree and type of strategy use and success. They reveal that (a) those who had stronger depth of vocabulary knowledge used certain strategies more frequently than those who had weaker depth of vocabulary knowledge; (b) the stronger students made more effective use of certain types of lexical inferencing strategies than their weaker counterparts; and (c) depth of vocabulary knowledge made a significant contribution to inferential success over and above the contribution made by the learner's degree of strategy use. These findings provide empirical support for the centrality of depth of vocabulary knowledge in lexical inferencing and the hypothesis that lexical inferencing is a meaning construction process that is significantly influenced by the richness of the learner's pre-existing semantic system.
Résumé
Cette étude se penche sur le rapport entre le niveau de connaissance du vocabulaire d'apprenants en anglais langue seconde, l'utilisation de stratégies d'inférences lexicales et le succès des apprenants à déchiffrer la signification de mots à partir du contexte. Les participants ont lu un extrait renfermant 10 mots inconnus et ont cherché à en découvrir le sens en se servant du contexte. On a eu recours aux protocoles d'introspection verbaux afin de déterminer la quantité et le genre de stratégies d'inférence utilisées par les apprenants. Le test d'association de mots WAT (Read, 1993) a été utilisé pour mesurer le niveau de profondeur du vocabulaire. Les résultats
[End Page 107]
ont révélé une corrélation significative entre ce dernier ainsi que la quantité et la variété de stratégies utilisées avec succès. On a trouvé, en particulier, que (a) les individus qui étaient plus compétents en matière de vocabulaire (profondeur) se servaient plus fréquemment de certaines stratégies que les moins compétents; (b) les étudiants plus forts utilisaient plus efficacement certains types de stratégies d'inférences lexicales que les plus faibles; (c) le niveau de connaissance du vocabulaire (profondeur) contribuait largement au succès des inférences, et ce, de façon plus significative que le degré d'utilisation des stratégies par l'apprenant. Ces conclusions confirment le rôle primordial joué par le vocabulaire (profondeur) dans la mise en oeuvre d'inférences lexicales nettement influencées par la richesse du système sémantique préexistant de l'apprenant.
Introduction
One of the central cognitive processes in reading comprehension is inferencing (R.C. Anderson & Pearson, 1984; Graesser & Bower, 1990; Kintsch, 1988, 1998; Monzo & Calvo, 2002; Nassaji, 2002, 2003a, 2003b; Whitney, 1987). Inferencing has been defined as the connections that people establish when they try to interpret texts (G. Brown & Yule, 1983). Inferencing occurs at all levels of the reading comprehension process, ranging from integrating the text with background knowledge (Kintsch, 1988), to connecting the different parts of the text together (Garrod & Sanford, 1990; Kintsch, 1988, 1998), to linking known to unknown elements in the text in order to arrive at a coherent structure of the information in the text (Garrod & Sanford, 1990; Graesser & Bower, 1990; Graesser & Zwaan, 1995). Such processes are assumed to involve prediction and interpretation of the text for meaning; hence, they are considered important processes by theories in cognitive psychology that conceptualize reading as an active meaning-construction process and a creation of a mental representation of the text (Kintsch, 1988, 1998).
The present study focused on lexical inferencing, that is, making 'informed guesses' about the meaning of unknown words based on the available linguistic and non-linguistic cues in the text (Haastrup, 1991, p. 40). Lexical inferencing has been found to be widely used by second language (L2) learners when dealing with unknown words in their reading (de Bot, Paribakht, & Wesche, 1997; Frantzen, 2003; Fraser, 1999; Huckin & Bloch, 1993; Morrison, 1996; Paribakht & Wesche, 1999; Parry, 1993). Lexical inferencing has also been found to be closely associated with incidental vocabulary learning, that is, learning vocabulary through reading natural texts (Huckin & Coady, 1999; Nagy, 1997). [End Page 108] Thus, Wesche and Paribakht (1999) argue that 'much - if not most - lexical development in both L1 and L2 appears to occur as learners attempt to comprehend new words they hear or read in context' (p. 176).
Although researchers have attributed an important role to lexical inferencing, the nature of this process has not been well understood in second language acquisition (SLA) (Paribakht & Wesche, 1999). To this end, and given the important role currently attributed to L2 learners' lexical knowledge in L2 reading comprehension (Laufer, 1996, 1997; Nassaji, 2002, 2003a; Nation, 2001; Qian, 2002; Read, 1993, 1997, 2000), the present study examined the particular role learners' depth of vocabulary knowledge plays in lexical inferencing. The question addressed was, How does L2 learners' depth of vocabulary knowledge relate to the degree and the type of the lexical inferencing strategies they use, and how does this relationship affect the learners' success in deriving word meaning from context?
Factors affecting learners' success in lexical inferencing
Many factors have been shown to affect success in lexical inferencing, including the nature of the word and the text that contains the word (Paribakht & Wesche, 1999; Parry, 1993); the degree of textual information available in the surrounding context (Dubin & Olshtain, 1993), the learner's ability to make use of extra-textual cues (de Bot et al., 1997; Haastrup, 1991); the importance of the word to comprehension of the text (C.M. Brown, 1993); the degree of cognitive and mental effort involved in the task (Fraser, 1999; Joe, 1995); and the learner's attention to the details in the text as well as his or her preconceptions about the possible meaning of the word (Frantzen, 2003). In a discussion of the factors involved in lexical inferencing, Nagy (1997) considers the role of learners' pre-existing knowledge bases and how these knowledge bases influence learners' strategy use and success. Nagy groups learners' knowledge bases into three main categories: linguistic knowledge, world knowledge, and strategic knowledge. The linguistic knowledge category covers all knowledge that learners possess about the linguistic context in which the word has occurred, including their syntactic knowledge, lexical knowledge, and knowledge of word schema (i.e., knowledge of the possible meanings of the word). World knowledge is the learner's understanding and use of the relevant domains of knowledge. Strategic knowledge is knowledge of the actual strategies learners employ during the act of inferencing and attempting to deduce the meaning of the unknown word from context. Nagy suggests that strategic knowledge may not be necessary for acquiring word meaning from context, but that [End Page 109] sometimes, when the learner is aware of the existence of new words in the text, he or she may make deliberate attempts to derive the meanings of these words from context. Nagy then reviews several studies that have highlighted the importance of such strategic attempts in deriving word meaning from context (Buikema & Graves, 1993; Huckin & Jin, 1987; Jenkins, Matlock, & Slocum, 1989).
Similarly, in a discussion of what is involved in successful strategy use, Pressley, Borkowski, and Schneider (1987) propose a framework that distinguishes between a cognitive strategic component, which includes a repertoire of general as well as domain-specific strategies learners have, and a knowledge-base component, including various knowledge bases constructed from learners' various experiences with the world. These knowledge bases range from well-established and integrated pieces of information about particular phenomena or situations, to knowledge about specific strategies and skills, to knowledge about when and how to apply these in a particular situation. Knowledge the strategy user needs to evaluate the causes of his or her failure and the relationship between his efforts and achievement also fall into this category.
In the context of L2 lexical inferencing, and based on an exploratory study with intermediate ESL learners, Huckin and Bloch (1993) propose a lexical inferencing model that incorporates similar components. These components include a knowledge module component (e.g., a vocabulary knowledge module, a text schema module, a syntax and morphology module, and a text representation module) and a metalinguistic strategic component. The metalinguistic strategic component includes a sequence of cognitive and decision-making strategies that the learner uses when trying to generate and test word meanings and hypotheses. According to Huckin and Bloch, these strategies play an important role in lexical inferencing in that they help the learner decide when and how to proceed and seek help from context and various sources of knowledge available.
The above frameworks underscore the multidimensionality of strategy use. In particular, they highlight the fact that multiple knowledge sources and strategies (i.e., the various cognitive and metacognitive activities learners use when identifying and constructing word meaning from context) are involved in inferencing word meanings from context. A number of recent studies have documented the range of knowledge sources and strategies learners employ during lexical inferencing (Chern, 1993; de Bot et al, 1997; Haynes, 1993, Huckin & Bloch, 1993; Morrison, 1996; Paribakht & Wesche, 1999). For example, in a study with university students, de Bot et al. (1997) found that when attempting to [End Page 110] infer word meaning from context, L2 readers used knowledge sources ranging from knowledge of grammar, morphology, phonology, and knowledge of the world, to knowledge of punctuation, word association, and cognates. Analyzing the lexical inferencing strategies of Danish learners of English, Haastrup (1991) found that learners used different strategies ranging from those related to the internal structure of the word (such as analysis of the phonological and orthographic structure of the word) to those involving the use of top-down contextual and sentence-level clues.
Among the knowledge sources, one type of knowledge source that has been found to be strongly related to the learner's ability to read and understand texts is vocabulary knowledge. A number of studies in both L1 and L2 have demonstrated that vocabulary knowledge is one the best predictors of reading ability and the ability to acquire new information from texts (J.C. Anderson, 2000; Nation, 2001; Qian, 2002; Read, 2000). Several studies have also demonstrated a relationship between learners' vocabulary knowledge and their subsequent learning of vocabulary through reading (Haynes & Baker, 1993; Parry 1997; Pulido, 2003; Sternberg & Powell, 1983; Verspoor & Lowie, 2003). In a study with L1 learners, for example, Sternberg and Powell (1983) found a positive relationship between learners' performance on a vocabulary test and their ability to define word meanings. In a recent study with Dutch learners of English, Verspoor and Lowie (2003) found a similar relationship. The researchers found that learners' ability to infer and retain the meanings of polysemous words from context was related significantly to their knowledge of the core sense of the word. In another study with L2 learners, Pulido (2003) found that knowledge of sight vocabulary was significantly correlated with measures of incidental vocabulary gains from reading.
Depth versus breadth of vocabulary knowledge
In research on vocabulary learning, a distinction has often been made between two dimensions of vocabulary knowledge: depth of knowledge and size, or breadth, of knowledge (Haastrup & Henriksen, 2000; Meara, 1996; Read, 2000). Breadth of vocabulary knowledge has been taken to refer to the quantity or number of words learners know at a particular level of language proficiency (Nation, 2001). Researchers have used various types of assessment tools with different formats to measure this dimension of vocabulary knowledge, including tests that require the learner to identify a synonym for a word in a multiple-choice test, match words with definitions, translate a word into L1, or use checklists (see [End Page 111] Wesche & Paribakht, 1996, for a discussion of these various assessment types). One measure that has been widely used to assess size of vocabulary knowledge in the literature, for example, is Nation's Vocabulary Level test (1990, 1993), which has a word-meaning matching format and is composed of words representing different word frequency levels, ranging from high-frequency (2,000 word level) to low-frequency words (10,000 word level).
Depth of vocabulary knowledge, on the other hand, has been used to refer to the quality of lexical knowledge, or how well the learner knows a word (Meara, 1996; Read, 1993, 2000). Researchers have noted the complexity and multidimensionality of word knowledge and have suggested that knowing a word well should mean more than knowing its individual meanings in particular contexts. Various kinds of knowledge are associated with a word that a learner must know, ranging from knowledge related to its pronunciation, spelling, register, stylistic, and morphological features (Haastrup & Henriksen, 2000; Meara, 1996; Nation, 1990; Richards, 1976) to knowledge of the word's syntactic and semantic relationships with other words in the language, including collocational meanings and knowledge of antonymy, synonymy, and hyponymy (Chapelle, 1994; Henriksen, 1999; Read, 2000).
Studies investigating the role of vocabulary knowledge in reading have found that while measures of size of vocabulary knowledge are strongly related to the reader's understanding of texts (Laufer, 1997; Qian, 1998, 1999), measures examining aspects of depth of vocabulary knowledge make a stronger contribution to reading performance than those that simply measure a single definition of a word. Qian (1999), for example, found that depth of vocabulary knowledge, conceptualized as receptive knowledge of word meanings and collocations, was not only a better predictor of L2 reading comprehension but also made a unique contribution to L2 reading comprehension, over and above the contribution made by size of vocabulary knowledge.
In the present study, expanding on this line of research, I attempted to discover the role of L2 learners' depth of vocabulary knowledge in L2 lexical inferencing. In particular, I examined how L2 learners' depth of vocabulary knowledge relates to the degree and type of lexical inferencing strategy use and how this relationship mediates learners' success in deriving word meaning from context. I also investigated the degree to which depth of vocabulary knowledge and degree of lexical inferencing strategy use contribute, collectively and independently, to learners' inferential success. If it could be shown that lexical inferencing strategies make any contribution to lexical inferencing success this could have important implications for models of L2 vocabulary instruction that [End Page 112] advocate the use of lexical inferencing strategy in L2 vocabulary acquisition.
Method
The data examined in the present study come from a larger project on lexical inferencing strategies and knowledge sources. The study was carried out in two phases. In Phase 1, data were gathered and analyzed for the relationship between learners' lexical inferencing strategies and knowledge sources and their lexical inferencing success (Nassaji, 2003b). Phase 2 gathered data about learners' depth of vocabulary knowledge and analyzed them to discover its relationship with lexical inferencing strategy use and success.
Participants
Participants were 21 adult intermediate ESL learners from different language backgrounds, including Chinese, Spanish, Persian, Portuguese, and Arabic. All had recently arrived in Canada and were taking ESL courses to improve their English. They had met Level 4 of the Canadian Language Benchmark in listening and reading (Citizenship and Immigration Canada, 1996).
Depth of vocabulary knowledge test
As discussed earlier, depth of vocabulary knowledge is complex, and it is therefore very difficult to assess all the different components that constitute the full range of meanings and meaning relationships of a word. For the same reason, most vocabulary tests in the literature deal with breadth of vocabulary knowledge. In recent years, however, there have been some attempts to develop measures to assess aspects of depth of vocabulary knowledge (Greidanus & Nienhuis, 2001; Paribakht & Wesche, 1993; Read, 1993; Wesche & Paribakht, 1996). For example, in a university ESL context, Paribakht and Wesche (1993, 1996) developed a test called the Vocabulary Knowledge Scale (VKS), which was designed to measure the different levels of lexical knowledge of specific target words learners were learning in a comprehension-based ESL program. The test involves a self-report format in which learners are presented with individual words and asked to indicate their degree of knowledge on a scale of 1 to 5, ranging from no familiarity with the target word to the ability to use it accurately in a sentence. [End Page 113]
The VKS test has an easy format that can be used as a practical tool to measure the initial stages of word knowledge for certain words. However, as Wesche and Paribakht (1996) point out, the test was not designed to 'tap sophisticated knowledge of given words or describe the mental lexical networks to which the word belongs' (p. 29). Thus, the measure does not assess the various meaning relationships a word has with other words, though these have been suggested to be an important aspect of depth of vocabulary knowledge (Read, 2000). One measure that attempts to capture these lexical meaning relationships, and which was used in the present study, is the Word-Associate Test (WAT), developed by Read (1993). The WAT purports to measure the learner's depth of vocabulary knowledge through word associations, that is, the various semantic and collocational relationships that a word has with other words in the language. The test has been found to be closely correlated with L2 reading comprehension ability and has also been shown to have a high degree of internal reliability (Qian, 1999, 2002). The test used in the present study was version 3.1 (Form B)1 of the WAT. The test is composed of 50 target words, each followed by a list of eight words, four of which are semantically related to the target word while the other four are not. The four related words have been selected to represent three main types of semantic relationship with the target word: paradigmatic relationships (i.e., the word and its associate have similar meanings, such as 'enable' and 'allow'); syntagmatic relationships (i.e., the two words are collocates and co-occur in similar contexts, such as 'income' and 'tax'); and analytic relationships (i.e., the associate represents the meaning of part of the word, such as 'team' and 'together'). The reliability of the test (KR-20), as reported by Read (1993), is 0.92. (Further information about the test and how it was developed can be found in Read, 1993, 1997, 2000.)
The test was administered during a class period. Before learners took the test, they were notified of the general purpose of the study and were informed that their performance on the test would not affect their course outcome. They were instructed to read each of the target words and then circle the four words closely related to the target word. The time allocated to the test was 30 minutes. The split-half reliability of the test in the current study was 0.89.
Lexical inferencing strategies
To gather data about learners' lexical inferencing strategies, the students were presented with a reading passage and asked to read the text for comprehension and to try to infer the meanings of the unknown words. [End Page 114] Research suggests that successful inferencing depends heavily on the ability to comprehend the text as a whole and most of the words in it (Hirsh & Nation, 1992; Laufer, 1988; Liu & Nation, 1985). To meet these requirements, several passages, including those used in previous research, were examined. The reading passage selected for use in this study was the one developed by Haastrup (1991) in a study on lexical inferencing with Danish learners of English. The passage contained 374 words, with 10 target words highlighted (see Appendix). The passage had been designed to elicit the use of a variety of inferencing strategies and processes, ranging from those involved in the use of non-linguistic global comprehension processes to those involved in the use and integration of word-level cues such as prefixes and affixes (Haastrup, 1991). Before being used in the present study, the passage was pilot-tested with a group of ESL students assumed to have similar language proficiency to the participants in the main study. The pilot study revealed that the students had a good overall comprehension of the text (mean of comprehension: 7.6/10). It also showed that the percentage of unknown words in the passage ranged from 4.27% to 2.67%, derived by dividing the total number of the words reported as unknown by the total number of words in the passage and multiplying the results by 100.
An introspective think-aloud technique was used to discover the lexical inferencing strategies learners used; in this procedure, learners are asked to verbalize the content of their thoughts while attempting to infer the meaning of an unknown word from context. Data were collected in individual sessions lasting about 45-60 minutes. In each session, the students were first trained as to how to think aloud: they were given a set of pictures and asked to report what they thought was happening in the pictures. They were then presented with an English text and were asked to practise verbalizing their thinking while trying to infer the meanings of the unfamiliar words they encountered. After this practice session, the students were presented with the text intended for the study and were asked to read the text for comprehension and try to verbalize their thoughts when attempting to infer the meaning of the new words in the text.
Analysis and results
Identification of lexical inferencing strategies
In order to identify the lexical inferencing strategies used by these learners, all the introspective think-aloud protocols were initially transcribed and then carefully examined for any observable inferencing strategies. Lexical inferencing strategies were defined as any cognitive [End Page 115] or metacognitive activity that the learner turned to for help while trying to derive the meaning of the unknown word from context. Strategies were identified using an inductive procedure involving reading and rereading the protocols. The strategies identified derive mainly from the data and reflect the thinking of the learners participating in the study. Initially, three main categories of strategy types were identified. Following Pressley and Afferbach (1995), these were characterized as identifying, evaluating, and monitoring strategies. Identifying strategies were defined as those that the learners used to identify the meaning of the new word in the text. Learners were found to use different procedures to do so (e.g., repeating the word, repeating the section that contained the word, conducting word analysis, or word-form analogy); each was coded separately. A strategy was coded as word repeating when the learner repeated the word alone; as section repeating when the learner repeated a bigger section, including the clause or the sentence, in which the word had occurred; and as word-form analogy when the learner tried to identify the word meaning based on the sound or form similarity of the word to other words. It was coded as word analysis when the learner attempted to analyze the word into its different components to figure out its meaning.
Evaluating strategies were those that learners used to evaluate and check the accuracy of their initial inferences. Two types of sub-strategies were identified and coded in this category: self-inquiry, coded when learners questioned their initial inferences, and verifying, coded when learners reread a section of the text to revise or re-evaluate their initial inferencing hypotheses or the accuracy of their choices. A strategy was coded as monitoring when the learner showed an awareness of the nature of the problem by making an explicit judgement about the ease or difficulty of the word based on the available cues in the text. While both identifying and evaluating are cognitive strategies, monitoring is a metacognitive strategy.
The reliability of the coding was established by calculating an inter-coder agreement on a sample of 20% of the data, selected from every fifth participant and coded by the researcher and a second coder, a colleague of the researcher. The inter-coder agreement for that 20% of the data was 89%. Table 1 presents a taxonomy of these strategies, along with their definitions and examples from the transcripts.
Relationship between depth of vocabulary knowledge and successful inferencing
In order to determine the relationship between depth of vocabulary knowledge and lexical inferencing success, a variety of statistical procedures [End Page 116]
![]() Click for larger view |
Table 1
Types of strategies used2 |
were employed. As a first step in the analysis, the degree of success in lexical inferencing was determined using a three-point scale (0 to 2) representing unsuccessful, partially successful, and successful attempts. Successful inferences were defined as those that were semantically, syntactically, and contextually appropriate. Responses that were semantically appropriate but syntactically deviant, or vice versa, were classified as partially successful. In order not to underestimate the learner's success, if the meaning or the definition provided made sense in the context but, when judged out of context, was not the meaning of the word, the attempt was still classified as partially successful. In cases where the response did not meet any of the above conditions, it was considered unsuccessful. All responses were rated and judged by two independent raters (the researcher and an ESL teacher who is a native English speaker), and an inter-rater reliability of 94% was established. The total number of responses to all target lexical items was 210 (10 target words (21 participants). However, during the reading and the [End Page 117] retrospective interviews, some participants reported that they knew some of the target words. This reduced the number of responses to 199.
![]() Click for larger view |
Table 2
Relationship between depth of vocabulary knowledge and inferential success |
As a second step in the analysis, the participants were divided into two groups: lexically skilled and lexically less skilled students. Grouping was done based on the participants' percentile rank for the scores they obtained on the depth of vocabulary knowledge test (WAT). A learner's percentile rank shows how well he or she performs on a test in comparison to others. Those learners whose scores fell at or below the 50th percentile were classified as lexically less skilled (LLS) readers, while those whose scores fell above the 50th percentile were classified as lexically skilled (LS) readers.There were 10 students in the LS group and 11 students in the LLS group.In the next step, the frequencies and percentages of successful, partially successful, and unsuccessful inferences were tallied. These frequencies were calculated for each of the students in each of the two groups of LS and LLS readers.
The results of these analyses are presented in Table 2. As the table shows, of a total of 51 successful inferences, 35 (68.6%) were made by LS readers and only 16 (31.4%) by LLS readers. Of the total number of unsuccessful inferences (111), a great proportion (60.4%) was made by LLS readers, and only 39.6% by LS readers. As for the partially successful inferences, the LLS group seems to have produced comparable numbers of such inferences, although the LS readers tended to produce slightly more. A two-way chi-square test conducted on the proportions of unsuccessful, partially successful, and successful inferences for the two groups of LS and LLS readers showed that the proportions of successful, partially successful, and unsuccessful inferences were significantly different across the two groups, indicating that the degree to which readers were able to infer word meaning successfully was related to their group membership, that is, whether they were lexically skilled or less skilled readers (x2 = 11.85, df = 2, p < 0.01). [End Page 118]
Relationship between depth of vocabulary knowledge and type of strategy use and success
The next stage of analysis examined the relationships between depth of vocabulary knowledge and strategy types and success. To this end, the proportion and the mean of inferencing success for each strategy across the two groups of LS and LLS readers were calculated. Means of success for each strategy were obtained by dividing the sum of each learner's scores (on the scale of 0 to 2) by the frequency of the strategy used. The resulting data were then compared across the two groups. T-tests and chi-square statistics were used to determine whether there were any significant differences between the two groups. Table 3 displays the results of these analyses.
![]() Click for larger view |
Table 3
Types of strategy use and success in lexically skilled versus lexically less skilled readers |