Cognitive aspects of lexical availability (2006)
Transcript of Cognitive aspects of lexical availability (2006)
This article was downloaded by: [Universidad De Salamanca]On: 23 January 2015, At: 01:41Publisher: RoutledgeInforma Ltd Registered in England and Wales Registered Number: 1072954Registered office: Mortimer House, 37-41 Mortimer Street, London W1T 3JH,UK
European Journal of CognitivePsychologyPublication details, including instructions for authorsand subscription information:http://www.tandfonline.com/loi/pecp20
Cognitive aspects of lexicalavailabilityNatividad Hernández-Muñoz a , Cristina Izura b &Andrew W. Ellis ca University of Salamanca , Salamanca, Spainb University of Wales Swansea , Swansea, UKc University of York , York, UKPublished online: 17 Feb 2007.
To cite this article: Natividad Hernández-Muñoz , Cristina Izura & Andrew W.Ellis (2006) Cognitive aspects of lexical availability, European Journal of CognitivePsychology, 18:5, 730-755, DOI: 10.1080/09541440500339119
To link to this article: http://dx.doi.org/10.1080/09541440500339119
PLEASE SCROLL DOWN FOR ARTICLE
Taylor & Francis makes every effort to ensure the accuracy of all theinformation (the “Content”) contained in the publications on our platform.However, Taylor & Francis, our agents, and our licensors make norepresentations or warranties whatsoever as to the accuracy, completeness, orsuitability for any purpose of the Content. Any opinions and views expressedin this publication are the opinions and views of the authors, and are not theviews of or endorsed by Taylor & Francis. The accuracy of the Content shouldnot be relied upon and should be independently verified with primary sourcesof information. Taylor and Francis shall not be liable for any losses, actions,claims, proceedings, demands, costs, expenses, damages, and other liabilitieswhatsoever or howsoever caused arising directly or indirectly in connectionwith, in relation to or arising out of the use of the Content.
This article may be used for research, teaching, and private study purposes.Any substantial or systematic reproduction, redistribution, reselling, loan, sub-
licensing, systematic supply, or distribution in any form to anyone is expresslyforbidden. Terms & Conditions of access and use can be found at http://www.tandfonline.com/page/terms-and-conditions
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
Cognitive aspects of lexical availability
Natividad Hernandez-Munoz
University of Salamanca, Salamanca, Spain
Cristina Izura
University of Wales Swansea, Swansea, UK
Andrew W. Ellis
University of York, York, UK
Lexical availability measures the ease with which a word can be generated as amember of a given category. It has been developed by linguistic studies aimed,among other things, at devising a rational basis for selecting words for inclusion indictionaries. The measure accounts for the number of people who generated a givenword as a member of a designated semantic category and the position in which theyproduce the word. We present an analysis of lexical availability from a cognitiveperspective. Data were analysed for Spanish speakers generating words from fivesemantic categories*clothes, furniture, body parts, animals, and intelligence. Sixproperties of words were investigated as potential predictors of lexical availability.Predictors were concept familiarity, typicality, imageability, age of acquisition, wordfrequency, and word length. Categories differed on these variables, and regressionanalysis found concept familiarity, typicality, and age of acquisition to besignificant predictors of lexical availability. The cognitive basis of these findingsand the practical consequences of selecting words on the basis of lexical availabilityare considered.
Languages often contain many more words than are used in everyday
discourse. They also contain many more words than need to be contained
within concise dictionaries. Some principled methods need to be found if
suitable words are to be selected for inclusion in concise dictionaries,
teaching materials for children, and courses for second language learners.
Correspondence should be addressed to Cristina Izura, Department of Psychology, University
of Wales Swansea, Singleton Park, Swansea SA2 8PP, UK. E-mail: [email protected]
We thank Alberto Carcedo Gonzalez for his help and Professors Julio Borrego Nieto, Jose
Antonio Bartol, Rosario Llorente Pinto, Emilio Prieto de los Mozes, and Javier de Santiago
Guervos of the University of Salamanca for their invaluable assistance with this study which forms
part of the project El lexico disponsible del hablante hispano: aportacion de datos y replanteamiento
teorico and was supported by the Grant Ministerio de Ciencia y Tecnologıa (BFF2001-1005).
EUROPEAN JOURNAL OF COGNITIVE PSYCHOLOGY
2006, 18 (5), 730�755
# 2006 Psychology Press Ltd
http://www.psypress.com/ecp DOI: 10.1080/09541440500339119
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
One approach that has been adopted is to select words on the basis of
frequency of use in the language. Thorndike’s (1921) compilation of the10,000 words occurring most commonly in written text was a pioneer in this
area and was followed by similar studies in other languages such as Spanish
(Buchanan, 1927) and French (Aristazabal, 1938). Frequency counts
continue to be used as the basis for dictionary creation (e.g., Sinclair,
1987) but they have their problems. Counts based on adult written language
tend to overrepresent words from some areas of discourse (e.g., politics and
finance) while underrepresenting the words of everyday spoken language
(e.g., clothing, food, household objects). Even when attempts have beenmade to include spoken language samples (Baayen, Piepenbrock, & van
Rijn, 1993), it has proved difficult to record and incorporate ordinary,
mundane domestic language. Educational programmes for teaching second
language vocabulary face a similar problem. High frequency words tend to
be considered the suitable vocabulary to teach at initial stages of second
language learning. However, functional language proficiency requires
mastery of a considerably larger number of words and there are no other
criteria applicable to the creation of word lists for use in advanced courses ofsecond language acquisition (Groot, 2000).
An alternative approach to vocabulary selection has been based on the
concept of lexical availability (Carcedo, 1998; Dimitrijevic, 1969; Mackey,
1971; Samper-Padilla, Bellon, & Samper-Hernandez, 2003). This approach
involves asking language users from specific communities to generate words
from different domains or categories. Spanish language studies have mostly
employed a set of 16 semantic categories developed by Michea (1953) and
Gougenheim, Michea, Rivence, and Sauvageot (1964) to capture thevocabulary of everyday life (e.g., Echeverrıa, Herrera, Moreno, & Pradenas,
1987; Lopez Morales, 1996; Mena Osorio, 1986). The 16 categories are body
parts, clothes, parts of the house, furniture, food and drink, objects found on
the table at dinner time, the kitchen and its utensils, the school, heating and
lighting, the city, the countryside, transport, gardening and countryside jobs,
animals, games and hobbies, and professions. Participants are presented with
the name of a category and are usually asked to write down as many words
from that category as they can within a given time period. Techniques havebeen developed to derive an availability measure for different words based
on the number of participants who generate a word and the position it
occupies in their lists (Echeverrıa et al., 1987; Lopez Morales, 1999; Mena
Osorio, 1986). At the time of writing, a major Panhispanic project is under
way aimed at producing a dictionary founded on lexical availability that will
cover most of the Spanish-speaking world (Alba, 1995; Galloso, 2003; Lopez
Morales, 1996; Mateo, 1998).
The task of generating words from predefined semantic categories hasbeen termed ‘‘semantic fluency’’ or ‘‘category instance generation’’. It has
COGNITIVE ASPECTS OF LEXICAL AVAILABILITY 731
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
been used extensively in neuropsychological studies of word retrieval in
patients with various forms of brain damage such as aphasia and dementia(e.g., Moulin et al., 2002; Troster, Salmon, McCullogh, & Butters, 1989).
Within cognitive psychology there have been many studies in which
participants are required to judge whether or not stimulus words come
from designated semantic categories (category instance verification), but few
studies involving word generation. Building on a study by Loftus and
Suppes (1972), Catling and Johnson (2005) presented category labels
and initial letters to participants who were asked to generate appropriate
words as quickly as possible (e.g., FRUIT�/O0/ ‘‘orange’’). They foundproduction speed to be predicted by age of acquisition and not by word
frequency.
The aim of the present study was to investigate which cognitive factors
(semantic, lexical, or both) influence the availability of words when
generated in response to category labels. Hernandez-Munoz (2002) collected
lexical availability data from young Spanish people for the 16 categories
mentioned above plus an additional category, intelligence, chosen to act as a
source of more abstract vocabulary. Due to the exploratory nature of thisfirst study, 5 out of the 16 original categories were chosen. As a consequence,
four ratings for 500 instead of 1600 words were collected. Animals, body
parts, clothes, and furniture were selected corresponding to two living and
two nonliving categories and because they were also used commonly in past
studies (Catling & Johnson, 2005; Chertkow, Bub, & Caplan, 1992; Loftus &
Suppes, 1972; Warrington & McCarthy, 1987). Intelligence was selected as
an abstract category likely to elicit words of a less concrete nature. The 100
items from each category with the highest availability scores were chosen,excluding compound words and phrases. Six properties of words were
employed as potential predictors of lexical availability. These were taken
from studies of normal and impaired word retrieval, including studies of
object naming and from studies of category instance verification, which can
be thought of as being to some extent the reverse of category instance
generation.
The first factor was the familiarity of a concept or thing, which can be
defined as the frequency with which people come into contact with, or thinkabout the concept or thing. Concept familiarity was included by Snodgrass
and Vanderwart (1980) among their measures of object qualities. Cuetos,
Ellis, and Alvarez (1999) reported an effect of concept familiarity on Spanish
object naming speed, with more familiar objects being named more rapidly
than less familiar ones. Ellis and Morrison (1998) reported effects of
familiarity in some of their analyses of object naming speed in English.
Neuropsychological studies have indicated that naming accuracy may be
influenced by familiarity in patients with aphasia (Cuetos, Aguado, Izura, &Ellis, 2002) and semantic dementia (Lambon Ralph, Graham, Ellis, &
732 HERNANDEZ-MUNOZ, IZURA, ELLIS
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
Hodges, 1998). The familiarity of words and their meanings has also been
held to affect the speed of deciding that words belong in particular semanticcategories (e.g., Larochelle & Pineau, 1994; Malt & Smith, 1982). Familiarity
effects have been assigned to the activation processes of semantic repre-
sentations (Hirsh & Funnell, 1995).
Typicality is a measure of how close a concept lies to the centre of a
particular category. For example, dog and horse are very typical animals
while bat and whale are atypical. It has long been recognised that the
typicality of items with respect to their category is an important determinant
of category verification speed (McFarland, Duncan, & Kellas, 1978; Rosch& Mervis, 1975; Smith, Shoben, & Rips, 1974; Southgate & Meints, 2000).
Jolicœur, Gluck, and Kosslyn (1984) reported that typical items tended to be
named more often using a basic level term (e.g., bird) while atypical items
tended to be named using their subordinate name (e.g., penguin). Barsalou
(1983) found typicality effects for ad hoc categories created for the purposes
of the experiment that were comparable to those of natural categories that
participants might be expected to be previously familiar with. Holmes and
Ellis (in press) found an effect of typicality on object naming speed, withobjects judged typical of their categories being named faster than objects
judged to be atypical.
The imageability of a word is a rating of the ease with which the word can
evoke a mental image of the concept represented. It is considered to be a
semantic variable and it is closely related to concreteness so that words
representing concrete objects tend to be given high imageability ratings while
words representing abstract concepts tend to be given low imageability
ratings. Imageability effects have been most extensively explored in memorytasks where high imageability words are easier to recall than low imageability
words (Coltheart & Winograd, 1986; Mulligan, 1998; Williams, Healy, &
Ellis, 1999) but neuropsychological studies have suggested that the ability of
some brain-injured patients to access words from the lexicon may be affected
by their imageability (e.g., Franklin, Howard, & Patterson, 1995). Paivio,
Clark, Digdon, and Bons (1989) reported a close relationship between the
speed of naming an object picture and the speed with which participants
reported forming mental images to words while de Groot (1992) reported aneffect of imageability on the speed with which bilingual people can translate
words from one language into another.
Word frequency is a measure of how often a word occurs in samples of
spoken and written language. We have noted above that it has been used in
the past as a basis for selecting words for inclusion in concise dictionaries
and in vocabulary lists of second language courses. Oldfield and Wingfield
(1965) proposed that the frequency of object names affected the speed with
which those names can be accessed and produced. Evidence has beensomewhat equivocal (Barry, Hirsh, Johnston, & Williams, 2001; Bonin,
COGNITIVE ASPECTS OF LEXICAL AVAILABILITY 733
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
Chalard, Meot, & Fayol, 2002), but several recent studies have reported
independent effects of word frequency on object naming (Barry, Morrison,
& Ellis, 1997; Ellis & Morrison, 1998), including object naming in Spanish
(Cuetos et al., 1999). Cognitive models of word retrieval have generally
proposed that word frequency affects the process of accessing spoken word-
forms. For example, in Levelt’s influential model of speech production, word
frequency is held to affect the process of accessing phonological representa-
tions of spoken words (‘‘lexemes’’*see Levelt, Roelofs, & Meyer, 1999).
The age of acquisition1 (AoA) of a word is defined as the age at which a
word is typically learned. Age of acquisition has been reported to affect
word recognition and production speed in a variety of lexical processing
tasks including spoken object naming, lexical decision, and word naming or
reading aloud (Carroll & White, 1973; Ellis & Morrison, 1998; Izura & Ellis,
2002; Juhasz, 2005; Monaghan & Ellis, 2002; Morrison & Ellis, 1995, 2000).
In all of these tasks, early learned words have been reported to be processed
faster than later learned words. Of particular interest, given that participants
in lexical availability studies usually produce written lists of items from
categories, are the reports by Bonin, Fayol, and Chalard (2001) and Bonin
et al. (2002) of an effect of age of acquisition on the speed of initiating
1 The suitability of the AoA measure has been a constant matter of concern. Zevin and
Seidenberg (2002, 2004) have recently argued that the high correlations between age of acquisition
and so many other factors (word frequency, familiarity, imageability, word length, concreteness,
and number of neighbours) causes great difficulty when investigating the unique/singular influence
of AoA in lexical tasks. They proposed a new operationalisation of AoA related to the frequency
with which words are experienced through different ages. In their view, early acquired words are
those whose frequency of trajectory through life starts very high during infancy to steadily decrease
through the years. The opposite, words of low frequency in childhood that increase their frequency
through time, constitutes late acquired words. This AoA measure, called frequency of trajectory,
has been found to influence object naming and lexical decision times (Bonin, Barry, Meot, &
Chalard, 2004; Zevin & Seidenberg, 2004). Frequency of trajectory correlates highly with AoA
only, representing (at least in part) the AoA variable independently of other factors. Despite this,
frequency of trajectory is not free from problems. Firstly, it reduces age of word learning to written
frequency of exposure knowing that other factors (concept familiarity, imageability, word length,
number of neighbours, and spoken word frequency) also contribute when a word is learnt (Bonin et
al., 2004; Zevin & Seidenberg, 2002). Secondly, the way in which early and late acquired words are
understood (having different frequency of trajectories through time) excludes the great majority of
the words in the vocabulary, that is, those words experienced with an equal or similar frequency
over the years. Finally, although the operationalisation of late acquired words as having low to high
frequency of trajectory might be appropriate, most early acquired words do not have high to low
trajectories. Only words related to the fantastic world such as giant, ogre, and fairy have higher
frequencies in a child’s language than in adults. Generally, early learned words retain into
adulthood the frequencies they had in childhood. Nevertheless, and despite its problems, it would
have been interesting to see the predictor power of frequency of trajectory on the category
generation task. Unfortunately, there is not a reliable database providing word frequencies for the
different schooling years of Spanish children. For this reason we were unable to include frequency
of trajectory as another predictor in the analysis.
734 HERNANDEZ-MUNOZ, IZURA, ELLIS
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
responses in a task requiring participants to write the names of pictured
objects. Theoretical accounts of age of acquisition differ in where exactly
within the cognitive system this variable is thought to exert its influence. A
number of authors have proposed that age of acquisition might affect the
ease of accessing spoken word forms (e.g., Brown & Watson, 1987; Gilhooly
& Watson, 1981), and Levelt et al. (1999) have suggested that age of
acquisition, like word frequency, might affect the retrieval of the phonolo-
gical representations of words (lexemes). Other authors, however, have
favoured a semantic locus for age of acquisition effects, with early meanings
being better represented than later acquired meanings (Brysbaert, van
Wijnendaele, & de Deyne, 2000; Lyons, Teer, & Rubenstein, 1978; van Loon-
Vervoorn, 1989; van Loon-Vervoorn, van Ham, & van der Koppen, 1988). A
third account of how age of acquisition could come to affect lexical
processing was offered by Ellis and Lambon Ralph (2000) who proposed
that the age or order of acquisition of words might affect the strength of
connections between different lexical representations (e.g., semantic and
phonological for naming; orthographic and phonological for reading aloud:
see also Monaghan & Ellis, 2002). Izura and Ellis (2002, 2004) found age (or
order) of acquisition effects in second language learners which reflected the
point at which different words were learned in the second language rather
than the age at which the word with equivalent meaning was learned in the
first language. This finding is problematic for the semantic theory of age of
acquisition effects, which would predict that second language words would
inherit the semantic age of acquisition properties of the equivalent first
language word, and is more compatible with a lexical access or mapping
account of age of acquisition. All these three theories would, however,
predict that early learned words would be easier to access than later acquired
words in the category generation task that underlies studies of lexical
availability. We have noted that Catling and Johnson (2005) found that the
frequency with which words occur in child language samples predicted
response times when participants were given category labels and initial
letters, and argued that child language frequency was probably an indirect
measure of age of acquisition.
Finally, it is unclear whether word length exerts an independent effect on
response speed in naming or verification tasks (e.g., Bachoud-Levi, Dupoux,
Cohen, & Mehler, 1998; Marmurek & Rinaldo, 1992) but Cuetos et al.
(1999) found an effect of length on Spanish object naming speed with age
of acquisition, word frequency and other factors statistically controlled.
They suggested that the greater variability in length of Spanish object names
may make length effects easier to detect in the Spanish language than in
English.
COGNITIVE ASPECTS OF LEXICAL AVAILABILITY 735
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
METHOD
Lexical availability data
The lexical availability data analysed in this paper were taken from a study
by Hernandez-Munoz (2002) who collected written category generation data
from 117 native speakers of Spanish (42 male, 75 female) aged 17�18 years.
All the participants were students in private and public schools with mixed
socioeconomic backgrounds living in the Cuenca region of Spain. Partici-
pants were asked to write down as many items they could think of from the
standard 16 categories used in lexical availability studies (body parts, clothes,
parts of the house, furniture, food and drink, objects found on the table at
dinner time, the kitchen and its utensils, the school, heating and lighting, the
city, the countryside, transport, gardening and countryside jobs, animals,
games and hobbies, and professions) plus the category ‘‘intelligence’’, which
was chosen to elicit more abstract words. They were allowed 2 min for each
category.
Responses from five categories (animals, body parts, clothes, furniture,
and intelligence) were selected for analysis. Morphological variants of the
same word were combined (e.g., singular and plural). If a participant wrote
the same word twice for a particular category, the first production was taken
and the second ignored. Table 1 shows the total number of different,
category-appropriate words produced for each category and the mean
number of different, category-appropriate words written by each participant
for each category. Regarding the grammatical characteristics of the words an
overall predominance of nouns was observed (73%), followed by verbs (21%)
and adjectives (6%). The lexical availability of each word was calculated
using the formula of Lopez Chavez and Strassburguer Frıas (1992, 1993,
2000), which computes an availability value based on the positions that the
word occupies in a given list, the number of participants who place the word
at those positions, and the lowest position the word occupies in any of the
lists. The formula assigns a high value to words that were written by a large
TABLE 1Mean number of different category-appropriate words produced and mean numberof category-appropriate words per participant (excluding repetitions) for the five
categories used in the study
Categories
Animals
Body
parts Clothing Furniture Intelligence
Total number of different words
produced
563 195 192 223 612
Mean number of words per participant 27.1 16.6 17.5 22.9 11.4
736 HERNANDEZ-MUNOZ, IZURA, ELLIS
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
number of participants and which appeared early in their lists, and
exponentially lower value to words written by few participants late in theirlists. Moreno, Moreno, and de las Heras (1995) integrated the formula in the
computer program Lexidisp which is being developed as a web-based tool
(www.linguas.net/LEXIDISP/). The formula is:
D(Pj)�Xn
i�1
e
��2:3�
�i�1
n�1
��
�fji
I1
Where D(Pj)�/the lexical availability of the word j for a given category; I�/
the total number of participants who completed the task; i�/the position of
word j in a given list; f�/the number of participants who wrote word j at that
position in their list; n�/the lowest position occupied by word j in any list
produced for the category in question; and e�/the natural number
(2.718181818459045). The resulting values for lexical availability are all
less than 1 and were multiplied by 100 for ease of presentation. Values
ranged from 82.77 (perro [dog] for the category animals) to 0.64 (tiestos
[flower pot] for the category furniture).
The predictor variables
For the purpose of analysis, the 100 words from each of the five chosen
categories with the highest lexical availability values were chosen (for the 10
first words produced in each category, see Appendix). Values of the six
predictor variables were obtained for those 500 words. Where the predictorsinvolved ratings (i.e., concept familiarity, typicality, imageability, and age of
acquisition), the 500 words were split into two sets of 250, with each set
containing 50 words from each of the five semantic categories. Ratings were
provided by different groups of 50 participants for each scale. The
participants were all Spanish native speakers. They were all tested at the
end of a lecture in a classroom setting and they were allowed as much time as
needed to complete the scale. None of the scales collected was incomplete.
For each rating, half the participants rated one set of words and half theother. Reliabilities were computed for each scale by correlating the means for
the different words from the first 13 raters with the means from the
remaining 12 raters. All were highly significant.
Concept familiarity ratings were obtained using instructions adapted from
Snodgrass and Vanderwart (1980). Participants used a 7-point scale to
estimate how often they think about or come into contact with each of the
concepts (from 1�/‘‘less than once a month’’ to 7�/‘‘many times a day’’)
evoked by the list of 250 words. The 50 raters, students at the University of
COGNITIVE ASPECTS OF LEXICAL AVAILABILITY 737
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
Salamanca, Spain, had a mean age of 20 years (range 17�53). Reliability was
.92.
Typicality ratings were obtained using a 7-point scale ranging from 1�/‘‘a
very atypical example of the category’’ to 7�/‘‘a very typical example’’. The
instructions were based on those used by Hampton and Gardiner (1983).
Within semantic categories we can encounter elements that seem to be better
examples of a category than others. For example, we will all probably think that
apple shares more characteristics with the category ‘‘fruit’’ than coconut.
In this test you are going to find a series of words under a category label. Please rate
the words based on how good an example of the category each word is. A rating of
‘‘1’’ corresponds to the instance being a poor example of the category. A rating of
‘‘7’’ indicates that you consider the instance to be a very good example of the
category. Remember, rate the words based only on how well they represent the
category and not how much you like them.
The 50 participants that rated typicality had a mean age of 24 years (range
18�36). Reliability was .92.
Imageability ratings were obtained using a 7-point scale ranging from 1�/
‘‘very difficult to arouse a mental image’’ to 7�/‘‘very easy to arouse a
mental image’’ (Morrison, Chappell, & Ellis, 1997). The instructions read as
follows:
Words differ in their capacity to evoke mental images. Thus, while some words are
easy to imagine (e.g., apple) others are not (e.g., fact). The purpose of this test is to
estimate how easy or difficult it is to imagine a list of words. The scale ranges from 1
to 7. Rate a word with 7 if you think the word is very easy to imagine, with 1 if you
think the word is very difficult to imagine, and give an intermediate value for those
words that are neither very easy or very difficult to imagine.
The raters had a mean age of 23 years (range 17�58). Reliability was .91.Word frequency values were taken from Alameda and Cuetos (1995),
which is based on a corpus of written Spanish texts comprising 2 million
words from 606 texts distributed across novels (50%), essays (15%),
newspapers (25%), and scientific publications (10%) published between
1978 and 1993. An average frequency value was calculated for morpholo-
gical variants (e.g., masculine and feminine; singular and plural).
Age of acquisition ratings were obtained from 50 native speakers of
Spanish living in Spain (mean age 23 years; range 18�42) using the
methodology of Ghyselinck, de Moor, and Brysbaert (2000) where
participants estimate how old they were when they first learned different
words. They provide the estimate in years rather than using the 5- or 7-point
scales typical of previous studies which constricts the values of late acquired
738 HERNANDEZ-MUNOZ, IZURA, ELLIS
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
words. Adult estimates of age of acquisition have been shown to have high
validity in that they correlate highly with objective measures (Carroll &White, 1973; Gilhooly & Gilhooly, 1980; Morrison et al., 1997). Reliability
was .88.
Word length was measured as the number of syllables in a word.
RESULTS
Before conducting any analyses on the data, 43 of the 500 words (8.6%) were
removed. These were polysemic words, which can apply to more than one
category (e.g., mono which means monkey [animal] or overalls [clothing]),
foreign words (e.g., slip for an item of clothing), or dialect words. Many of
the raters failed to provide ratings for the foreign and dialect words (which
they presumably did not know) and word frequency values based on writtenSpanish with a single value for polysemic words are unreliable for such
words. Three words were deleted from the category of animals, four from
body parts, eighteen from furniture, fourteen from clothing, and four from
intelligence.
As is common practice in the cognitive psychology literature, word
frequency values were transformed using the formula log(1�/x) to reduce
skew. This also compresses the range of values (0 to 3.57 in the present
study). Table 2 shows the mean, standard deviation, and range for eachvariable and category for the 457 remaining words along with the lexicality
availability values for each category. One-way analyses of variance were used
to compare the scores of the words in each category on each of the variables.
There were significant differences among the categories on all of the
predictor variables. These were analysed further by using post hoc tests
(Tukey’s HSD) to compare the categories pairwise on each of the factors.
Familiarity. Categories differed in the familiarity ratings given to the
items within them, F(4, 452)�/42.57, MSE�/71.23, pB/.001. Post hoc tests
showed that familiarity ratings were significantly higher for words in the
category intelligence than for furniture (M�/0.81, SD�/0.19) and body parts(M�/1.35, SD�/0.18), which were significantly more familiar than clothing
(M�/�/1.39, SD�/0.19; M�/�/0.85, SD�/0.18), which were in turn
significantly more familiar than animals (M�/0.19, SD�/0.26).
Typicality. Categories differed significantly on typicality, F(4, 452)�/
9.72, MSE�/16.85, pB/.001. Post hoc tests showed that typicality ratings
were significantly higher for animals (M�/�/0.81, SD�/0.19), body parts
(M�/0.67, SD�/0.20), and intelligence (M�/0.45, SD�/0.20) than forfurniture and clothing. This implies that furniture and clothing are more
COGNITIVE ASPECTS OF LEXICAL AVAILABILITY 739
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
TABLE 2Mean (M), standard deviation (SD) and ranges on concept familiarity, typicality, imageability, word frequency, age of acquisition,
word length, and lexical availability for words in the five categories used in the study
Predictor variables Animals Body parts Clothing Furniture Intelligence
Concept familiarity M 3.18 4.04 3.57 4.58 5.39
SD 1.00 1.40 1.53 1.51 0.97
Range 1.44�6.04 1.68�6.56 1.16�6.68 1.16�6.76 3.08�6.76
Typicality M 4.82 4.68 3.83 4.01 4.56
SD 1.15 1.20 1.60 1.47 1.16
Range 2.48�6.92 2.44�7.00 1.20�6.72 1.56�7.00 2.44�6.64
Imageability M 6.62 6.07 6.66 6.61 4.36
SD 0.37 0.86 0.32 0.45 1.27
Range 5.36�7.00 3.68�7.00 4.67�7.00 4.16�7.00 2.52�6.96
Log word frequency M 1.00 1.40 1.07 1.19 2.02
SD 0.50 0.79 0.62 0.67 0.66
Range 0.00�2.35 0.00�3.10 0.00�2.23 0.00�2.78 0.00�3.57
Age of acquisition M 5.70 7.00 7.24 6.70 7.05
SD 1.39 2.28 2.63 2.25 1.85
Range 3.00�9.16 2.98�10.95 3.44�15.46 3.08�11.58 3.72�11.48
Word length M 2.67 2.66 2.81 3.06 3.19
SD 0.79 0.78 0.80 0.91 1.05
Range 1.00�5.00 1.00�5.00 1.00�5.00 2.00�5.00 1.00�6.00
Lexical availability (*100) M 11.91 12.34 13.00 10.37 4.09
SD 12.28 17.65 17.55 18.16 4.31
Range 2.64�82.77 0.90�71.69 0.80�70.82 0.64�78.81 1.32�23.89
740
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
widely dispersed categories in which items tend to stray further from the
‘‘core’’ of the category than is the case for animals, body parts, andintelligence.
Imageability. Categories differed significantly on imageability, F(4,
452)�/155.34, MSE�/90.44, pB/.001. Post hoc tests showed that items in
the categories clothing (M�/�/0.59, SD�/0.11), animals (M�/�/0.55, SD�/
0.11), and furniture (M�/�/0.54, SD�/0.12) were rated as significantly more
imageable than body parts, which in turn were significantly more imageable
than items in the category intelligence (M�/1.70, SD�/0.11).
Word frequency. Categories differed significantly on word frequency,
F(4, 452)�/37.24, MSE�/16.01, pB/.001. Post hoc tests showed that items in
the category intelligence had significantly higher word frequencies than body
parts (M�/0.62, SD�/0.09) and furniture (M�/0.82, SD�/0.10), which were
significantly higher than clothing (M�/0.32, SD�/0.09) and animals (M�/
0.39, SD�/0.09).
Age of acquisition. Categories differed significantly on age of acquisi-
tion, F(4, 452)�/8.03, MSE�/35.61, pB/.001. Post hoc tests showed that
animals (M�/�/1.30, SD�/0.30) were rated as significantly earlier acquired
than all the other categories, which did not differ from each other (p�/.1).
Word length. Categories differed significantly on word length, F(4,
452)�/6.97, MSE�/5.30, pB/.001. Post hoc tests showed that intelligence
and furniture items had significantly longer names than animals (M�/0.52,SD�/0.13) and body parts (M�/0.53, SD�/0.13). Clothing items were
intermediate in length and not significantly different (p�/.1) from any of the
other categories.
Lexical availability. The categories also differed overall on lexical
availability, F(4, 452)�/5.70, MSE�/0.12, pB/.001. Post hoc tests showed
that words from the intelligence category (M�/�/0.47, SD�/0.06) had
significantly lower lexical availabilities than words from the other fourcategories, which did not differ significantly (p�/.1). This relates to the fact
noted above that the words generated for the category intelligence were
much more variable across participants than the words generated for the
other categories. High variability means that individual words will have been
produced by fewer participants than words from the other categories. The
formula for determining lexical availability is such that smaller numbers of
participants writing each word results in smaller values of f in the equation
and therefore lower values of lexical availability. Figure 1 shows thesignificant and nonsignificant differences amongst factors for each category.
COGNITIVE ASPECTS OF LEXICAL AVAILABILITY 741
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
The correlations of the predictor variables with each other and with
lexical availability across the combined categories for the 457 words used in
the analyses is shown in Table 3. With so many items, even low correlations
emerge as significant at the .05 significance level, so only correlations that
are significant at the .01 level are marked. Among the predictors, concept
familiarity correlated quite highly with word frequency (words that occur
more often in samples of written language denote concepts that are rated as
being encountered more often). Age of acquisition also had relatively high
correlations with concept familiarity and word frequency (early learned
words tend to denote more familiar concepts and to occur with higher
frequency in adult language than later learned words). None of the
correlations between the predictors was high enough (above .7), however,
to undermine the use of regression analysis (Miles & Shevlin, 2001).
Typicality had the highest raw correlation with lexical availability, with
words judged as typical for their category being more available than words
Familiarity
Intelligence Furniture Body parts Clothing Animals
Typicality
Animals Body parts Intelligence Furniture Clothing
Imageability
Clothing Animals Furniture Body parts Intelligence
Word frequency
Intelligence Body parts Furniture Clothing Animals
Age of acquisition
Animals Furniture Body parts Intelligence Clothing
Word length
Intelligence Furniture Clothing Animals Body parts
Figure 1. Differences between categories on familiarity, typicality, imageability, word frequency,
age of acquisition, and word length. The categories joined by lines represent no significant
differences.
742 HERNANDEZ-MUNOZ, IZURA, ELLIS
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
judged to be relatively atypical. Age of acquisition had the second highest
correlation with early learned words being more available than later learned
words. The correlations of lexical availability with concept familiarity,
imageability, word frequency, and length were progressively lower but still
significant.
When predictor variables are themselves intercorrelated, their individual
correlations with lexical availability should be interpreted with caution.
Multiple regression techniques can help to distinguish real from apparent
relationships under such circumstances. The particular technique used here
is known as multilevel multiple regression (Miles & Shevlin, 2001). It was
developed to analyse data that comes grouped in clusters like the semantic
categories of the present study. It can be thought of as a two-step analysis. In
the first step, variance in lexical availability due to differences between the
five categories on the predictor variables was extracted. In the second step,
the six predictor variables were entered in a simultaneous analysis to
determine the ability of each predictor to account for differences between
words in their lexical availability after overall differences between the
categories have been accounted for. For the purpose of these analyses,
lexical availabilities were log transformed to reduce skew.
The results of the second step of the multilevel multiple regression
analysis are shown in Table 4. Taken together, the independent variables
were able to predict lexical availability to a significant degree, F(4, 446)�/
55.94, MSE�/6.27, pB/.001, accounting for 56% of the variance in lexical
availability after the extraction of variance due to differences between the
categories. Concept familiarity, typicality, and age of acquisition all made
significant independent contributions to predicting lexical availability.
Words representing familiar concepts that are typical of their category and
are learned early in life had high availabilities while words representing less
familiar concepts that are less typical of their categories and are later
TABLE 3Correlation among the predictor variables and with lexical availability
Concept
familiarity Typicality Imageability
Word
frequency
Age of
acquisition
Word
length
Concept familiarity 1.00
Typicality �/.25** 1.00
Imageability �/.09 .00 1.00
Word frequency .66** .29** �/.20** 1.00
Age of acquisition �/.47** �/.24** �/.34** �/.44** 1.00
Word length .03 �/.08 �/.18** �/.19** .23** 1.00
Lexical availability .24** .59** .33** .21** �/.43** �/.14**
**p B/.01.
COGNITIVE ASPECTS OF LEXICAL AVAILABILITY 743
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
acquired had lower availabilities. The independent contribution of word
frequency was very close to significance (p�/.051), reflecting a tendency for
higher frequency words to be more available than lower frequency words.
The independent contributions of imageability and word length were not
significant: There was no significant tendency for more available words to
differ from less available words in terms of imageability or length once
differences in familiarity, typicality, etc. had been accounted for.
GENERAL DISCUSSION
We have analysed the words produced by young adult speakers of Spanish
when asked to generate as many items as they could from five semantic
categories (clothes, furniture, body parts, animals, and intelligence) given 2
minutes per category. Participants were able to generate substantially more
words for some categories than others in the time available. Animals and
furniture elicited the greatest number of category-appropriate words per
participant, while intelligence elicited the fewest. At the same time, the
category of intelligence elicited the greatest number of relevant words (n�/
612) across the 117 participants. That is because participants tended to write
relatively few words in response to the category label ‘‘intelligence’’ but the
words produced were very variable from one participant to another. The
categories of body parts and clothing elicited intermediate numbers of words
per participant but the smallest numbers of different words, indicating that
participants tended to write the same words in response to those category
labels. That is, there was much greater homogeneity across participants in
the responses generated in response to those category labels. Categories
differ in their apparent difficulty (as measured by the mean number of words
per participant) and in the variability of words elicited (as measured by the
total number of different words produced).
TABLE 4Results of a multilevel regression analysis of lexical availability for six predictorvariables entered in the second stage of the analysis after removal of variance
attributable to overall differences between the categories
Predictor variable B Std. error Beta t
Concept familiarity 0.007 0.018 0.226 4.23**
Typicality 0.184 0.013 0.504 14.32**
Imageability 0.000 0.024 0.023 0.41
Word frequency 0.006 0.033 0.097 1.96a
Age of acquisition �/0.003 0.011 �/0.115 �/2.43**
Word length 0.002 0.019 0.041 1.18
ap�/.051, **p B/.01.
744 HERNANDEZ-MUNOZ, IZURA, ELLIS
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
Our analyses focused on the six properties of words and their meanings
selected because they had been implicated as potential determinants of ease
of word retrieval in previous cognitive and neuropsychological studies and/
or because they had been implicated as predicting the speed of deciding that
words belong in different semantic categories (category instance verifica-
tion). The six properties were concept familiarity, typicality, imageability, age
of acquisition, word frequency, and word length. The first point to note is
that the words generated for the different categories varied to some extent on
these properties. For example, animals had the earliest age of acquisition
ratings but had low concept familiarity ratings because while animals like
gato (cat) and perro (dog) are highly familiar (i.e., rated as being encountered
or thought about at frequent intervals), the majority of animals generated
have much lower rated concept familiarities (e.g., sapo [toad]; oso [bear]).
Funnell and Sheridan (1992) noted the lower average familiarity of animals
compared with nonliving categories. Animals also had low frequency and
relatively long names but higher imageability and typicality ratings
compared with other categories.
It is clear that Hernandez-Munoz’s (2002) introduction of the category of
intelligence was partially successful in stimulating the production of more
abstract words. Imageability ratings were lower for intelligence words than
for the other categories. But the mean imageability ratings of the intelligence
words was still above the mid-point of the rating scale and, somewhat
surprisingly, the words produced for the category intelligence were more
familiar and of higher frequency than the other categories and denoted more
familiar concepts. Inspection of the intelligence words shows that the
students who generated the words in the intelligence category chose to write
many education-related words such as profesor (teacher), libro (book), and
estudiar (to study) that are very familiar concepts for them and also relatively
high frequency in written language samples.
We analysed the ability of the six word properties to predict the lexical
availability of different words in relation to the five category labels. The
formula for determining the lexical availability of a word takes into account
the number of times that word is generated at different positions in lists so
that words have high lexical availability if they are generated by many people
early in their lists and low availability if they are generated by few people late
in their lists. The six properties of the words used as predictors were jointly
capable of accounting for 56% of the variance in lexical availability. This is
an impressively high proportion given that there are undoubtedly factors
affecting the selection of words and the order in which they are produced
that we have not investigated here (for example, the use of strategies like
grouping subcategories of animals together, e.g., ‘‘pets, farm animals, wild
animals’’, or parts of the body that are physically adjacent).
COGNITIVE ASPECTS OF LEXICAL AVAILABILITY 745
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
Four of the word properties had independent influences on lexical
availability, taking into account the contributions of all the other predictors.The property that correlated most highly with lexical availability and that
emerged as the strongest predictor in the regression analysis was typicality.
Words that are judged to be highly typical exemplars of a chosen category
are generated by more people earlier in their lists than words that belong in
the category but are judged to be less typical exemplars. Randall, Moss,
Rodd, Greer, and Tyler (2004) have argued that there is more mutual support
between typical concepts that share more features with other, related
concepts than atypical items do. This could help explain why typical itemsare more easily accessed in object naming (Holmes & Ellis, in press) and in
the category instance generation task that underlies lexical availability
studies, as well as being faster to verify as members of a category in a
recognition task (McFarland et al., 1978; Rosch & Mervis, 1975; Smith et al.,
1974; Southgate & Meints, 2000).
The other independent predictors of lexical availability were concept
familiarity (words denoting very familiar objects or concepts are more
available than words denoting less familiar objects or concepts), age ofacquisition (objects or concepts whose names are learned early in life are
more available than those whose names are learned later in life), and word
frequency (objects or concepts whose names are encountered and used with
higher frequency are more available than those whose names are encoun-
tered and used with lower frequency). The results imply that when a
participant is given the name of a category, the search for exemplars to
produce starts with the items that are closest to the core of the category; that
is, with the most typical items. Among those, the most available are thoseitems that are most familiar and earliest acquired, with a small additional
bias towards those with higher frequency names. We would suggest that
familiarity effects, like typicality effects, arise within the semantic system
itself, with the strength of the semantic representations of concepts varying
according to how often they are activated (by either verbal or nonverbal
perceptual experience). This would explain the effect of familiarity on object
naming (Cuetos et al., 1999; Ellis & Morrison, 1998) and on category
instance verification (Larochelle & Pineau, 1994; Malt & Smith, 1982) aswell as explaining the better preservation of familiar meanings in neurop-
sychological patients with known semantic impairments (Hirsh & Funnell,
1995; Lambon Ralph et al., 1998). The effect of age of acquisition on lexical
availability could also arise within the semantic system, with early acquired
meanings being more available that later acquired meanings (cf. Brysbaert et
al., 2000; Lyons et al., 1978; van Loon-Vervoorn, 1989; van Loon-Vervoorn
et al., 1988), but it could equally arise at the lexical level, affecting the ease of
accessing spoken word forms (Brown & Watson, 1987; Gilhooly & Watson,1981; Levelt et al., 1999) or in the ease of mapping between the semantic and
746 HERNANDEZ-MUNOZ, IZURA, ELLIS
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
phonological representations of words (Ellis & Lambon Ralph, 2000; Izura
& Ellis, 2002, 2004). The present data do not arbitrate between thesealternatives. They do, however, add another layer of empirical support to the
claim that age of acquisition is an important determinant of the ease with
which words can be accessed and produced.
The independent contribution of word frequency to predicting lexical
availability was only marginally significant. We note that in the area of
object naming, genuine effects of word frequency have been hard to
demonstrate when factors like familiarity and age of acquisition have been
controlled, with some studies finding effects (e.g., Barry et al., 1997; Ellis &Morrison, 1998) and others not (e.g., Barry et al., 2001; Bonin et al., 2002).
Part of the problem here may be the fact alluded to in the introduction that
even when researchers have gone to the effort of obtaining spoken as well as
written language samples (e.g., Baayen et al., 1993), the ordinary language of
the home has not been sampled adequately. Hence the frequencies recorded
for domestic vocabulary may considerably underestimate the actual,
experienced frequencies of those words. This is particularly true of the
present study where the only word frequency count available for Spanish(Alameda & Cuetos, 1995) is based entirely on written language samples.
The poor relationship between written frequencies and ordinary language
was the original motivation to develop a measure such as lexical availability
(Gougenheim et al., 1964; Michea, 1953). Nevertheless, if there is an effect of
word frequency on lexical availability over and above the effect of concept
familiarity it may reflect a role of frequency in influencing ease of access to
spoken word forms for output in the category generation task (cf. Levelt et
al., 1999).Imageability did not emerge as a significant predictor of lexical
availability when the other factors were taken into account. It may simply
be that abstract concepts and words are no harder to activate than more
concrete concepts of comparable typicality, familiarity, word frequency, and
age of acquisition. We note, though, that Schwanenflugel and colleagues
have shown that any processing differences between concrete and abstract
words when encountered in isolation can be removed if the abstract or low
imageability words are provided with a supportive context such as a sentencestem (Schwanenflugel, Harnishfeger, & Stowe, 1988; Schwanenflugel &
Shoben, 1983; Schwanenflugel & Stowe, 1989). In the category instance
generation task the category label and the preceding items in the written list
provide a context for the retrieval of the next word, which may eliminate any
intrinsic differences in availability between concrete and abstract words.
Word length also failed to emerge as a significant predictor of lexical
availability. As noted in the introduction, studies of word naming have
produced inconsistent findings as regards the possible effect of length onnaming speed, although Cuetos et al. (1999) reported an effect for Spanish
COGNITIVE ASPECTS OF LEXICAL AVAILABILITY 747
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
naming. But length may affect articulation speed or some other relatively
peripheral process. It may not be a factor influencing the probability ofaccessing a word in a situation such as the category instance generation task
on which calculations of lexical availability are based.
The results of this study carry some practical lessons for the use of lexical
availability as a means of selecting words for inclusion in dictionaries or
teaching materials. The first point we would note is that lexical availability is
not a fixed property of a word but rather an interaction between a word and
a category label*a measure of how available a word is as an exemplar of a
particular category. Looking at Hernandez-Munoz’s (2002) full data set, wesee that coche (car) appears in three categories but that it has a high
availability in the category ‘‘transport’’, a somewhat lower availability in the
category ‘‘the city’’, and a low availability in the category ‘‘countryside’’.
Likewise perro (dog) has high availability as an animal but a much lower
availability in the categories ‘‘city’’ and ‘‘countryside’’. The apparent
availability or unavailability of a word depends crucially on the categories
chosen, making the selection of appropriate categories vital.
Secondly, the present experiment might also be a source of valuableinformation for those researchers interested in the study of semantic
memory. With the intention of facilitating research in this area, Battig and
Montague (1969) published a set of norms for words in 56 categories. The
norms included information about three different measures of frequency and
the rank or order in which words were produced as members of a category.
Given the extended used of Battig and Montague’s norms (cited in excess of
1600 times since 1969), van Overschelde, Rawson, and Dunlosky (2004)
produced an updated version of Battig and Montague’s norms, adding 14new categories, data from participants with different geographical back-
grounds, and mean reaction times for the first response in a written category
generation task. In Spanish, Soto, Sebastian, Garcıa, and del Amo (1982)
created a set of norms for nine categories. These norms, derived from the
results of a category generation task, consist of the frequency of occurrence
of each word in the first seven locations of the list and rank position. It is
probable that frequency of occurrence is a factor of recognised importance in
the organisation of categories as are typicality and concept familiarity. Thepresent study shows that in addition to frequency, typicality, and familiarity,
AoA is another factor to consider in categorisation studies. It also reveals a
new measure when computing the resulting data from category generation
tasks. Lexical availability is a reliable and useful measure for this purpose
since it encapsulates not only the frequency of word occurrence but also the
order of word production. It is also of importance to mention here that the
major advantage of the category instance generation task, as opposed to the
word association task to collect data, is that it focuses on the properties ofwords generated rather than the properties of the stimulus words. Brysbaert
748 HERNANDEZ-MUNOZ, IZURA, ELLIS
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
et al. (2000) investigated effects of AoA on the speed of making free word
associations. However, like other studies of word association, they only
looked at the characteristics of the stimulus words used to elicit the
associations, failing to appreciate that a complete study of word association
should look at the properties of both stimulus and response words.
Thirdly, whatever the categories, items having high availability will be
those that are most typical of the categories, whose meanings are most
commonly encountered in everyday life and learned earliest in life, and
which are communicated using words that are of higher frequency. The effect
of familiarity that we have identified is probably desirable. If lexical
availability is to be used to constrain the selection of words for dictionaries
and other purposes, then a criterion that is influenced by the familiarity of
concepts should be desirable as it would introduce words referring to
concepts that are frequent for the speaking community. For most purposes, a
bias towards early learned concepts with higher frequency names should also
be a advantageous, however, although we note that materials for use with
adults learning a language late in life often introduce words and concepts to
do with travel, finding accommodation, dealing with money, etc. early in the
training programme, although native speakers tend to acquire them
relatively late (Izura & Ellis, 2002). Language teaching materials aimed at
adults will either need to derive lexical availabilities from adults in response
to typically adult categories such as travel and money, or will need to admit
words for specific purposes whose availability, as determined from the
standard category list, is low.
The general bias towards typical rather than atypical concepts could be
more of a problem for applications of lexical availability. Some familiar
items with high frequency names have relatively low typicalities, at least in
relation to the standard categories. For example, reloj (wristwatch) has low
typicality as an item of clothing and a correspondingly low availability
but is a very familiar object and a word that one would want to see
included in a dictionary or teaching materials. Ventana (window) is also a
low typicality item of furniture with a correspondingly low availability but
a useful word to know. Finally, most of the words analysed in the present
study are nouns because the category labels encourage noun production. If
lexical availability is to be used to select useful words for inclusion in
dictionaries, then alternative categories that elicit words from other
grammatical classes (e.g., words that emphasise actions) will need to be
added.
Original manuscript received September 2004
Revised manuscript received May 2005
First published online 9 March 2006
COGNITIVE ASPECTS OF LEXICAL AVAILABILITY 749
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
REFERENCES
Alameda, J. R., & Cuetos, F. (1995). Diccionario de frecuencias de las unidas de castellano. Oviedo,
Spain: Servicio de Publicaciones de la Universidad de Oviedo.
Alba, O. (1995). Lexico disponible de la Republica Dominicana. Santiago de los Caballeros,
Dominican Republic: Pontificia Universidad Catolica Madre y Maestra.
Aristazabal, M. (1938). Determination experimentale du vocabulaire ecrit pour servir de base a
l’enseignement de l’ortographie a l’ecole primaire. Paris: H. Champion.
Baayen, R. H., Piepenbrock, R., & van Rijn, H. (1993). The CELEX lexical database (CD-ROM).
Philadelphia: University of Pennsylvania, Linguistic Data Consortium.
Bachoud-Levi, A. C., Dupoux, E., Cohen, L., & Mehler, J. (1998). Where is the length effect? A
cross-linguistic study of speech production. Journal of Memory and Language, 39, 331�346.
Barry, C., Hirsh, K. W., Johnston, R. A., & Williams, C. L. (2001). Age of acquisition, word
frequency, and the locus of repetition priming of picture naming. Journal of Memory and
Language, 44, 350�375.
Barry, C., Morrison, C. M., & Ellis, A. W. (1997). Naming the Snodgrass and Vanderwart pictures:
Effects of age of acquisition, frequency and name agreement. Quarterly Journal of Experimental
Psychology, 50A, 560�585.
Barsalou, L. W. (1983). Ad hoc categories. Memory and Cognition, 11, 211�227.
Battig, W. F., & Montague, W. E. (1969). Category norms of verbal items in 56 categories A
replication and extension of the Connecticut category norms. Journal of Experimental
Psychology, 80, 1�46.
Bonin, P., Barry, C., Meot, A., & Chalard, M. (2004). The influence of age of acquisition in word
reading and other tasks: a never ending story? Journal of Memory and Language, 50, 456�476.
Bonin, P., Chalard, M., Meot, A., & Fayol, M. (2002). The determinants of spoken and written
picture naming latencies. British Journal of Psychology, 93, 89�114.
Bonin, P., Fayol, M., & Chalard, M. (2001). Age of acquisition and word frequency in written
picture naming. Quarterly Journal of Experimental Psychology, 54A, 469�489.
Brown, G. D. A., & Watson, F. L. (1987). First in, first out: Word learning age and spoken word
frequency as predictors of word familiarity and word naming latency. Memory and Cognition,
15, 208�216.
Brysbaert, M., van Wijnendaele, I., & de Deyne, S. (2000). Age of acquisition effects in semantic
tasks. Acta Psychologica, 104, 215�226.
Buchanan, M. (1927). A graded Spanish word book. Toronto, Canada: Toronto University Press.
Carcedo, A. (1998). Tradicion y novedad en las aportaciones hispanicas a los estudios de
disponibilidad lexica. Linguıstica, 10, 5�68.
Carroll, J. B., & White, M. N. (1973). Word frequency and age-of-acquisition as determiners of
picture-naming latency. Quarterly Journal of Experimental Psychology, 25, 85�95.
Catling, J. C., & Johnston, R. A. (2005). Age of acquisition effects on word generation. European
Journal of Cognitive Psychology, 17, 161�177.
Chertkow, H., Bub, D., & Caplan, D. (1992). Constraining theories of semantic memory
processing: Evidence from dementia. Cognitive Neuropsychology, 9, 327�365.
Coltheart, V., & Winograd, E. (1986). Word imagery but not age of acquisition affects episodic
memory. Memory and Cognition, 14, 174�180.
Cuetos, F., Aguado, G., Izura, C., & Ellis, A.W. (2002). Aphasic naming in Spanish: Predictors and
errors. Brain and Language, 82, 344�365.
Cuetos, F., Ellis, A. W., & Alvarez, B. (1999). Naming times for the Snodgrass and Vanderwart
pictures in Spanish. Behavior Research Methods, Instruments, and Computers, 31, 650�658.
De Groot, A. M. B. (1992). Determinants of translations. Journal of Experimental Psychology:
Learning, Memory, and Cognition, 18, 1001�1018.
Dimitrijevic, N. (1969). Lexical availability. Heidelberg, Germany: Julius Gross Verlag.
750 HERNANDEZ-MUNOZ, IZURA, ELLIS
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
Echeverrıa, M., Herrera, O., Moreno, P., & Pradenas, F. (1987). Disponibilidad lexica en
Educacion Media. Revista de Linguistica Teorica y Aplicada, 25, 55�102.
Ellis, A. W., & Lambon Ralph, M. A. (2000). Age of acquisition effects in adult lexical processing
reflect loss of plasticity in maturing systems: Insights from connectionist networks. Journal of
Experimental Psychology: Learning, Memory, and Cognition, 26, 1103�1123.
Ellis, A. W., & Morrison, C. M. (1998). Real age of acquisition effects in lexical retrieval. Journal of
Experimental Psychology: Learning, Memory, and Cognition, 24, 515�523.
Franklin, S., Howard, D., & Patterson, K. E. (1995). Abstract word anomia. Cognitive
Neuropsychology, 12, 549�566.
Funnell, E., & Sheridan, J. (1992). Categories of knowledge? Unfamiliar aspects of living and
nonliving things. Cognitive Neuropsychology, 9, 135�153.
Galloso, V. (2003). El lexico disponible de Avila, Salamanca y Zamora. Burgos, Spain: Instituto
Castellano y Leones de la Lengua.
Ghyselinck, M., de Moor, W., & Brysbaert, M. (2000). Age of acquisition ratings for 2816 Dutch
four and five letter nouns. Psychologica Belgica, 40, 77�98.
Gilhooly, K. J., & Gilhooly, M. L. M. (1980). The validity of age-of-acquisition ratings. British
Journal of Psychology, 71, 105�110.
Gilhooly, K. J., & Watson, F. L. (1981). Word age-of-acquisition effects: A review. Current
Psychological Research, 1, 269�286.
Gougenheim, G., Michea, R., Rivence, P., & Sauvageot, A. (1964). L’elaboration du francais
elementaire: Etude sur l’etablissement d’un vocabulaire et d’une grammaire de base. Paris: Didier.
Groot, P. J. M. (2000). Computer assisted second language vocabulary acquisition. Language
Learning and Technology, 4, 60�81.
Hampton, J. A., & Gardiner, M. M. (1983). Measures of internal category structure: A
correlational analysis of normative data. British Journal of Psychology, 74, 491�516.
Hernandez-Munoz, N. (2002). Disponibilidad lexica en Cuenca. Unpublished doctoral dissertation,
Universidad de Salamanca, Salamanca, Spain.
Hirsh, K. W., & Funnell, E. (1995). Those old, familiar things: Age of acquisition, familiarity and
lexical access in progressive aphasia. Journal of Neurolinguistics, 9, 23�32.
Holmes, S. J., & Ellis, A. W. (in press). Age of acquisition and typicality effects in picture naming,
object decision and category verification. Visual Cognition.
Izura, C., & Ellis, A. W. (2002). Age of acquisition effects in word recognition and production in
first and second languages. Psicologica, 23, 245�281.
Izura, C., & Ellis, A.W. (2004). Age of acquisition effects in translation judgement tasks. Journal of
Memory and Language, 50, 165�181.
Jolicœur, P., Gluck, M. A., & Kosslyn, S. M. (1984). Pictures and names: Making the connections.
Cognitive Psychology, 16, 243�275.
Juhasz, B. (2005). Age of acquisition effects in word and picture identification. Psychological
Bulletin, 131, 684�712.
Lambon Ralph, M. A., Graham, K. S., Ellis, A. W., & Hodges, J. R. (1998). Naming in semantic
dementia*what matters? Neuropsychologia, 36, 775�784.
Larochelle, S., & Pineau, H. (1994). Determinants of response times in the category verification
task. Journal of Memory and Language, 33, 796�823.
Levelt, W. J. M., Roelofs, A., & Meyer, A. S (1999). A theory of lexical access in speech production.
Behavioral and Brain Sciences, 22, 1�75.
Loftus, E. F., & Suppes, P. (1972). Structural variables that determine the speed of retrieving words
from long-term memory. Journal of Verbal Learning and Verbal Behavior, 11, 770�777.
Lopez-Chavez, J., & Strassburguer Frıas, C. (1992). Otro calculo del indice de disponibilidad lexica.
Coleccion Pedagogica Universitaria, 20, 187�207.
COGNITIVE ASPECTS OF LEXICAL AVAILABILITY 751
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
Lopez Chavez, J., & Strassburguer Frıas, C. (1993). Un modelo para el calculo del ındice de
disponibilidad lexica individual. In H. Lopez Morales (Ed.), La ensenanza del espanol como
lengua materna (pp. 91�112). Rıo Piedras, Puerto Rico: Universidad de Puerto Rico.
Lopez Chavez, J., & Strassburguer Frıas, C. (2000). El diseno de una formula matematica para
obtener un ındice de disponibilidad lexica confiable. Anuario de Letras, 38, 227�251.
Lopez Morales, H. (1996). Los estudios de disponibilidad lexica: pasado y presente. Boletın de
Filologıa de la Universidad de Chile, 35, 245�259.
Lopez Morales, H. (1999). Lexico disponible de Puerto Rico. Madrid, Spain: Arco Libros.
Lyons, A. W., Teer, P., & Rubenstein, H. (1978). Age-at-acquisition and word recognition. Journal
of Psycholinguistic Research, 7, 179�187.
Mackey, W. C. (1971). Le vocabulaire disponible du francais. Paris: Didier.
Malt, B. C., & Smith, E. E. (1982). The role of familiarity in determining typicality. Memory and
Cognition, 10, 69�75.
Marmurek, H. H., & Rinaldo, R. (1992). The development of letter and syllable effects in
categorization, reading aloud, and picture naming. Journal of Experimental Child Psychology,
53, 277�299.
Mateo, M. V. (1998). Disponibilidad lexica en el COU almeriense. Estudio de estratificacion social.
Almerıa, Spain: Universidad de Almerıa.
McFarland, C. E., Jr., Duncan, E. M., & Kellas, G. (1978). Isolating the typicality effect in
semantic memory. Quarterly Journal of Experimental Psychology, 30, 251�262.
Mena Osorio, M. (1986). Disponibilidad lexica infantil en tres niveles de ensenanza basica.
Unpublished doctoral dissertation, Universidad de Almerıa, Almerıa, Spain.
Michea, R. (1953). Mots frequents et mots disponibles. Un aspect nouveau de la statistique du
langage. Les Langues Modernes, 47, 338�344.
Miles, J., & Shevlin, M. (2001). Applying regression and correlation: A guide for students and
researchers. London: Sage.
Monaghan, J., & Ellis, A. W. (2002). What, exactly, interacts with spelling�sound consistency in
word naming? Journal of Experimental Psychology: Learning, Memory, and Cognition, 28, 183�206.
Moreno, F., Moreno, E., & de las Heras, G. (1995). Calculo de la disponibilidad lexica, el programa
Lexidisp. Linguıstica, 7, 243�249.
Morrison, C. M., Chappell, T. D., & Ellis, A. W. (1997). Age of acquisition norms for a large set of
object names and their relation to adult estimates and other variables. Quarterly Journal of
Experimental Psychology, 50A, 528�559.
Morrison, C. M., & Ellis, A. W. (1995). The roles of word frequency and age of acquisition in word
naming and lexical decision. Journal of Experimental Psychology: Learning, Memory, and
Cognition, 21, 116�133.
Morrison, C. M., & Ellis, A. W. (2000). Real age of acquisition effects in word naming and lexical
decision. British Journal of Psychology, 91, 167�180.
Moulin, C. J. A., Perfect, T. J., Conway, M. A., North, A. S., Jones, R. W., & James, N. (2002).
Retrieval induced forgetting in Alzheimer’s disease. Neuropsychologia, 40, 862�867.
Mulligan, N. W. (1998). Perceptual interference at encoding enhances recall for high but not low
imageability words. Psychonomic Bulletin and Review, 5, 464�469.
Oldfield, R. C., & Wingfield, A. (1965). Response latencies in naming objects. Quarterly Journal of
Experimental Psychology, 4, 272�281.
Paivio, A., Clark, J. M., Digdon, T., & Bons, T. (1989). Referential processing: Reciprocity and
correlates of naming and imaging. Memory and Cognition, 17, 163�174.
Randall, B., Moss, H. E., Rodd, J., Greer, M., & Tyler, L. K. (2004). Distinctiveness and correlation
in conceptual structure: Behavioural and conceptual studies. Journal of Experimental
Psychology: Language, Memory, and Cognition, 30, 393�406.
752 HERNANDEZ-MUNOZ, IZURA, ELLIS
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
Rosch, E., & Mervis, C. B. (1975). Family resemblances: Studies in the internal structure of
categories. Cognitive Psychology, 7, 573�605.
Samper-Padilla, J. A., Bellon, J. J., & Samper-Hernandez, M. (2003). El proyecto de estudio de la
disponibilidad lexica en espanol. In R. Avila, J. A. Samper, & H. Ueda (Eds.), Pautas y pistas en
el analisis del lexico hispano(americano) (pp. 20�27). Frankfurt, Germany: Vervuert
Iberoamericana.
Schwanenflugel, P. J., Harnishfeger, K. K., & Stowe, R. W. (1988). Context availability and lexical
decisions for abstract and concrete words. Journal of Memory and Language, 27, 499�520.
Schwanenflugel, P. J., & Shoben, E. J. (1983). Differential context effects in the comprehension of
abstract and concrete verbal materials. Journal of Experimental Psychology: Learning, Memory,
and Cognition, 9, 82�102.
Schwanenflugel, P. J., & Stowe, R. W. (1989). Context availability and the processing of abstract
and concrete words in sentences. Reading Research Quarterly, 24, 114�126.
Sinclair, J. M. (1987). Looking up: An account of the COBUILD project in lexical computing.
London: Collins.
Smith, E. E., Shoben, E. J., & Rips, L. J. (1974). Structure and process in semantic memory: A
feature model for semantic decisions. Psychological Review, 81, 214�241.
Snodgrass, J. G., & Vanderwart, M. (1980). A standardized set of 260 pictures: Norms for name
agreement, image agreement, familiarity, and visual complexity. Journal of Experimental
Psychology: Human Learning and Memory, 6, 174�215.
Soto, P., Sebastian, M. A., Garcıa, E., & del Amo, T. (1982). Categorizacion y datos normativos en
Espana. Universidad Autonoma de Madrid, Spain: Coleccion Monografıas del ICE.
Southgate, V., & Meints, K. (2000). Typicality, naming, and category membership in young
children. Cognitive Linguistics, 11, 5�16.
Thorndike, E. L. (1921). A teacher’s word book of 20,000 words. New York: Teacher’s College Press.
Troster, A. I., Salmon, D. P., McCullogh, D., & Butters, N. (1989). A comparison of the category
fluency deficits associated with Alzheimer’s disease and Huntington’s disease. Brain and
Language, 37, 500�513.
Van Loon-Vervoorn, W. A. (1989). Eigenschappen van basiswoorden. Lisse, The Netherlands: Swets
& Zeitlinger.
Van Loon-Vervoorn, W. A., van Ham, P., & van der Koppen, M. (1988). The importance of age of
acquisition for imageability in word processing. In M. Dennis, J. Engelkamp, & J. T. E.
Richardson (Eds.), Cognitive and neuropsychological approaches to mental imagery (pp. 99�107). Dordrecht, The Netherlands: Martinus Nijhoff.
Van Overschelde, J. P., Rawson, K. A., & Dunlosky, J. (2004). Category norms: An updated and
expanded version of the Battig and Montague (1969) norms. Journal of Memory and Language,
50, 289�335.
Warrington, E. K., & McCarthy, R. A. (1987). Categories of knowledge. Brain, 110, 1273�1296.
Williams, J. M. G., Healy, H. G., & Ellis, N. C. (1999). The effect of imageability and predicability
of cues in autobiographical memory. Quarterly Journal of Experimental Psychology, 52A, 555�579.
Zevin, J. D., & Seidenberg, M. S. (2002). Age of acquisition effects in word reading and other tasks.
Journal of Memory and Language, 47, 1�29.
Zevin, J. D., & Seidenberg, M. S. (2004). Age of acquisition effects in reading aloud: Tests of
cumulative frequency and frequency trajectory. Memory and Cognition, 32, 31�38.
COGNITIVE ASPECTS OF LEXICAL AVAILABILITY 753
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
APPENDIX
First 10 words calculated to have the highest lexical availability (LA) scores and theirvalues in typicality (Typ), concept familiarity (CFam), age of acquisition (AoA),imageability (Imag), written frequency (WFreq), and number of syllables (Len)
Spanish word (English translation) Typ CFam AoA Imag LA WFreq Len
Animals
Perro (dog) 6.92 6.04 3.88 7 82.771 112 2
Gato (cat) 6.72 5.48 3 6.84 76.056 70 2
Leon (lion) 6.8 3.12 5.28 6.92 37.723 27 2
Caballo (horse) 6.48 4.2 3.64 7 34.723 93 3
Conejo (rabbit) 6.12 4.2 3.54 6.84 30.302 9 3
Vaca (cow) 6.72 4.32 4.52 6.96 27.32 14 2
Gallina (hen) 5.88 4.88 3.36 6.88 25.464 19 3
Tigre (tiger) 6.52 2.44 6.04 6.88 25.222 7 2
Burro (donkey) 5.56 4.28 3.72 6.92 23.995 17 2
Elefante (elephant) 6.32 3.36 4.36 6.92 21.565 9 4
Body parts
Brazo (arm) 7 6.32 2.98 6.92 71.694 96 2
Cabeza (head) 7 6.28 2.98 6.8 71.287 418 3
Mano (hand) 6.92 6.56 3.96 6.96 63 550 2
Pie (foot) 6.6 5.96 4.08 7 61.8 193 1
Ojo (eye) 6.64 6.24 4.08 6.76 60.1 96 2
Pierna (leg) 6.68 5.92 4.64 6.6 58.1 39 2
Dedo (finger) 6.36 6.04 3.04 6.92 53.948 66 2
Nariz (nose) 6.48 5.48 4.16 6.96 50.4 71 2
Oreja (ear) 4 3.48 8.6 5.56 110 3 2
Boca (mouth) 2.8 2.44 8.6 5 93.5 2 4
Furniture
Silla (chair) 6.8 6.72 4.64 7 78.805 58 2
Mesa (table) 6.92 6.76 4.2 7 78.615 228 2
Cama (bed) 6.92 6.76 3.08 6.84 67.877 199 2
Armario (wardrobe) 7 6.36 4 6.72 63.039 38 3
Sillon (armchair) 6.76 5.84 5 7 61.626 31 2
Sofa (sofa) 6.8 6.36 6.24 7 56.795 4 2
Estanterıa (shelves) 6.28 6.24 5.28 6.88 44.794 5 5
Mesita (bedside table) 6.36 5.48 5.04 6.84 42.025 2 3
Escritorio (desk) 5.88 5.8 7.46 6.76 28.341 14 4
Lampara (lamp) 3.8 5.64 3.96 6.76 24.219 22 3
Clothes
Jersey (pullover) 6.56 5.52 5.78 6.84 70.819 10 2
Pantalon (trousers) 6.72 6.68 4.44 6.76 70.674 35 3
Camiseta (T-shirt) 6.64 6.16 3.64 7 64.236 12 4
Camisa (shirt) 6.48 4.8 4.28 6.84 62.484 61 3
Calcetines (socks) 4.92 5.8 3.44 6.92 58.667 12 3
Calzoncillos (pants) 4.64 4.04 3.84 6.88 46.91 5 4
Bragas (pants) 4.44 5.28 3.64 6.88 45.394 9 2
754 HERNANDEZ-MUNOZ, IZURA, ELLIS
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015
Appendix (Continued )
Spanish word (English translation) Typ CFam AoA Imag LA WFreq Len
Zapato (shoes) 3.76 6 4.76 6.8 39.132 69 3
Chaqueta (jacket) 6.16 4.92 4.64 6.76 37.792 33 3
Sudadera (sweater) 6.4 5.04 8.83 6.92 37.104 0 4
Intelligence
Cerebro (brain) 6.12 5.4 7.04 5.48 23.892 87 3
Listo (smart) 6.2 5.32 5.2 3.84 22.067 16 2
Sabidurıa (wisdom) 5.76 3.96 8.56 2.92 17.492 26 5
Pensamieto (thought) 5.72 5.56 8.32 3.28 14.611 134 4
Saber (to know) 5.52 5.64 6.36 3 14.086 287 2
Memoria (memory) 4.96 5.32 6.48 3.2 13.008 203 3
Estudiar (to study) 5.4 6.52 5.56 4.68 12.902 45 3
Pensar (to think) 6.24 5.76 7.44 3.08 11.034 218 2
Mente (mind) 6.24 4.68 8.2 2.68 10.509 87 2
Libro (book) 3.92 6.76 4.68 6.96 10.342 199 2
COGNITIVE ASPECTS OF LEXICAL AVAILABILITY 755
Dow
nloa
ded
by [
Uni
vers
idad
De
Sala
man
ca]
at 0
1:41
23
Janu
ary
2015