Issues in compiling bilingual dictionaries

Issues in compiling bilingual dictionaries

Arleta Adamska-Sałaciak

[A] 1. Preliminary considerations

Many decisions taken during the compilation of bilingual dictionaries are essentially identical to

those encountered in the preparation of monolingual ones. Consequently, whether the dictionary is

to be organised alphabetically or thematically, published in print or electronically, available online

for a fee or free of charge, are all questions that will only be mentioned in passing, the main focus

of this chapter being on topics specific to the bilingual dictionary genre.

The issues that arise in the process of making a bilingual dictionary follow, in the main, from

what kind of dictionary it is to be. The coverage (i.e. the number and type of items to be included)

and the resulting physical size of the publication need to be decided upon, as does the depth of

individual entries, that is, the amount of information offered about a single headword. At the bottom

of it all lies the most basic consideration: the target audience, a factor which has a direct bearing on

such aspects as the dictionary’s scope and directionality.

[B] 1.1. Target audience, scope, and directionality

Thanks to the fact that one of the languages dealt with in a bilingual dictionary is always known to

its users, the dictionary’s audience can be immensely varied, ranging from complete beginners to

very advanced students of the other language. What any particular user wants from their dictionary

may, of course, vary: some people will treat it as a language learning aid, while others – travellers,

tourists – may have no interest at all in mastering the foreign language. For the latter, the dictionary

can be anything from an essential tool, ensuring survival in a foreign environment, to an optional

extra used – in addition to, or instead of, a phrase book – as a gesture of politeness towards the

inhabitants of the place they are visiting.

It is customary to view bilingual dictionary users in terms of the two parties they naturally

fall into: one native speaker group for each of the two languages of a particular dictionary. This

simple dichotomy glosses over those instances when neither of the dictionary’s object languages is a

given user’s native tongue (L1). For speakers of lesser-used languages, there is often no bilingual

dictionary available which would cover the foreign language they are interested in combined with

their native language. They then have to settle for second best: an L3-L2 dictionary, where L3 is the

foreign language they want and L2 is another foreign language, one they have mastered sufficiently

well to be able to use it as an intermediary. Sadly, the needs of such users are not taken into

consideration in the preparation of bilingual dictionaries, nor is it clear how they could be.

Another special case are dictionaries of languages which no longer have any native

speakers. A bilingual dictionary of Sanskrit, for instance, will typically feature only one part, with

Sanskrit as the source language (SL) and a modern language, such as English, as the target language

(TL). Such dictionaries, used mainly by scholars, frequently rely on descriptive explanations of

meaning rather than TL equivalents. This is because many headwords are so deeply embedded in

the ancient SL culture that translating them by means of single TL words is not possible.

Whether dealing with a dead or a living language, dictionaries with only one part (Lx-Ly)

are called monoscopal (Hausmann and Werner 1991). Some monoscopal dictionaries are

additionally equipped with an Ly-Lx index, meant as a partial substitute for a proper Ly-Lx part in

that it allows Ly-speaking users access into the Lx-Ly section via their native language. Most

currently produced bilingual dictionaries are biscopal (Lx-Ly and Ly-Lx), that is, consist of two

fully-fledged word lists, each item in each list being accompanied by one or more equivalents in the

other language.

The exact contents of a biscopal dictionary for a given pair of languages may vary

considerably depending on the dictionary’s directionality. A so-called bidirectional dictionary

(Hausmann and Werner 1991) is designed with native speakers of both Lx and Ly in mind, while a

monodirectional one is meant for an audience consisting exclusively of speakers of either Lx or Ly.

The majority of contemporary bilingual dictionaries either are or claim to be bidirectional. The

motivation is largely commercial: it is cheaper to produce one dictionary which will be sold in both

markets than invest in two different ones.

Considered from the perspective of any single user, this is far from ideal. The necessity of

addressing two language communities at once results, especially in print dictionaries, in a situation

when some of the information supplied is superfluous from the point of view of one of the user

groups. Thus, speakers of German do not need to be told what the gender of each German noun is

or that a given German expression is very formal. The space occupied by such redundant

information could be put to better use if the needs of only one language community – in this

instance, that of German speakers – were being catered for. When compiling a monodirectional

dictionary, by contrast, lexicographers do not have to constantly switch perspective from one user

group to the other: they can maintain a consistent focus on that of the dictionary’s languages which

is its intended users’ L2.1 This seems to be reason enough for demanding that, whenever possible,

bilingual dictionaries should be monodirectional.

Matters are additionally complicated by the fact that, more often than not, one of the two

languages of a bilingual dictionary is less widely spoken and less often learnt than the other. As a

result, what publishers advertise as a dictionary equally useful to both language communities may,

upon closer inspection, prove to be heavily biased towards the needs of the community speaking the

less popular language. Such a dictionary is only superficially bidirectional, e.g., by virtue of giving

grammatical information about headwords in both the Lx-Ly and Ly-Lx section and/or using both

Lx and Ly as the metalanguage. Where semantic description is concerned, it cannot help but

privilege one of the user groups (see, e.g., Atkins 1985: 15). Most bilingual dictionaries with

English which are produced in non-anglophone countries fit the above description: in spite of the

promises on their back covers, they are directed primarily at speakers of the local language, who are

likely to predominate among the users.

The good news is that the number of monodirectional dictionaries published worldwide has

been growing steadily in recent years. This is especially visible in the area of pedagogical

lexicography, i.e. among dictionaries geared specifically to the needs of less-than-advanced foreign

language learners.2 Even so, it must be remembered that the coverage of such dictionaries is, as a

rule, less comprehensive than that of large, academic reference works, which are still predominantly

bidirectional.

[B] 1.2. Function, organisation, medium of presentation

A dictionary’s target audience, scope, and directionality are intimately connected with its functions.

Two main functions of bilingual dictionaries have been recognised: the receptive (passive,

decoding) and the productive (active, encoding) one. Traditionally, the emphasis has been on

reception. This is hardly surprising, since, for any language pair we care to examine, dictionaries

going from L2 to L1 will turn out to be used more readily (and, consequently, to have had a longer

history) than those going in the opposite direction. The reasons seem obvious. When a person wants

to understand something written or spoken in a foreign language, and there is no-one to help them

with the task, they have little choice but to turn to a bilingual dictionary. By contrast, when someone

wants to express themselves in a foreign language, they may choose to rely on their existing verbal

repertoire, thus forgoing dictionary consultation. Besides, the benefits of consulting a dictionary for

decoding tend to be rather more immediate and more obvious than its effects on the user’s

production.

Despite this traditional dominance of reception, much more attention is being paid to the

productive function in 21st-century bilingual lexicography than used to be the case a mere few

decades ago. This is a welcome development, since it is precisely in this area that (biscopal)

bilingual dictionaries hold an undisputable advantage over monolingual ones: thanks to being

equipped with an L1-L2 part, they enable users to access L2 lexical resources through what those

users already know, i.e. through the lexical items of their L1. Even the most production-oriented

monolingual learner’s dictionary cannot compete with that.3

The organisation of bilingual dictionaries is predominantly semasiological, i.e. based on the

headword’s (written) form. Accordingly, for languages with alphabetical writing systems, the

arrangement of entries follows the order of the alphabet. Each entry is organised around a particular

lexical item (the headword), providing the meanings (senses) which that lexical item can express.

This is the kind of reference work one normally has in mind when talking about bilingual

dictionaries.

In the much rarer onomasiological (thematic, ideological) bilingual dictionaries, the word

list is arranged according to topics, and each entry is built around a concept, listing the lexical items

through which that concept can be expressed. Onomasiological dictionaries are usually monoscopal

and cover only a limited subset of the SL lexicon, such as the vocabulary specific to a particular

domain of knowledge. They are also primarily concerned with the headwords’ semantics, to the

exclusion of aspects such as grammar or pronunciation. There is a certain parallel between an

onomasiological dictionary, whether mono- or bilingual, and a semasiological L1-L2 bilingual

dictionary: both start from the familiar (respectively, a general concept and a word in the user’s

native language) and proceed towards the less familiar (a specific word the user wants, but either

cannot recall or does not know), thus serving primarily an encoding function.

Depending on the medium of presentation, bilingual dictionaries are divided into paper

(print) and electronic ones. As the properties of electronic dictionaries are discussed in detail in

Chapters 5.1. and 5.3., the present contribution deals mainly with medium-independent features.

Otherwise, in the absence of indications to the contrary, when talking about a bilingual dictionary

we mean a printed book.

[A] 2. Compilation

[B] 2.1. Human resources

Whether a dictionary succeeds in meeting the expectations of its target users depends, to a large

extent, on the skill of the lexicographer(s) who have compiled it. Most quality bilingual dictionaries

are prepared by teams of people rather than single-handedly. When perfect bilinguals are not

available (which is in most cases), the minimum requirement is that the prospective lexicographer

should be a native speaker of one of the dictionary’s languages and have a near-native command of

the other. Some project managers insist that the lexicographers and editors working on a

dictionary’s Lx-Ly section should be native speakers of Ly (and vice versa), the assumption being

that TL equivalents are easier to come by when one is translating into, rather than from, one’s native

language. As might be expected, it is not always possible to fulfil this last requirement, either.

One variable that is impossible to control are a candidate’s predispositions for the job. Like

with translating skills, a person’s talent for lexicography often has little to do with whether they

hold a degree in linguistics or another language-related discipline (although being a trained linguist

obviously cannot hurt). The most desirable qualification, i.e. a degree in lexicography, is not a

realistic expectation, since few academic institutions offer courses in lexicography (see Chapter 6),

and those that do rarely teach actual dictionary making. This being so, the best indicator of a

candidate’s suitability for the job is how well they can execute the sample lexicographic tasks they

have been assigned in the course of their preparatory training (which is a necessary stage preceding

any respectable dictionary project).

[B] 2.2. Data sources and methods

Whatever the exact procedure followed in the compilation of a bilingual dictionary, it is certain that

existing dictionaries (bilingual Lx-Ly and Ly-Lx ones, as well as monolingual ones of both Lx and

Ly) will be consulted in the process. For many cheap, low-quality dictionaries, prior publications

are the only source of material complementing the authors’ introspection.

Many specialists believe that a modern dictionary worthy of the name cannot be prepared

without a suitably large, electronically stored language corpus. Corpora provide evidence for SL

meanings and for the frequency of occurrence of lexical items. They also help identify common

syntactic patterns and recurring phraseological combinations, so that lexicographers can tap them

for illustrations of various aspects of language use, which are then presented in the dictionary in the

form of example sentences. In sum, it is on the basis of corpus data that representative, up-to-date

word lists can be drawn up and the use of headwords illustrated, thereby ensuring that nothing of

importance has been overlooked.

All of the above is fairly uncontroversial. Exactly what kind of corpus is required for

making a bilingual dictionary is slightly less so. Ideally, two representative, well-balanced corpora

should be available, one for each of the source languages of a biscopal dictionary. Access to a

parallel corpus (i.e. a corpus of SL texts and their translations) and/or to comparable monolingual

corpora (i.e. corpora containing texts of the same type, coming from the same period, and dealing

with the same kind of topic) would be an additional bonus. There seems to be universal agreement

that such closely matched corpora could be of great help in tracking down equivalent candidates for

particularly obstinate headwords.

Disappointingly, no bilingual dictionaries are known to have been compiled exclusively on

the basis of either parallel or comparable corpora.4 Apart from the obvious obstacle (i.e.

unavailability of the right kind of corpus), the main problem seems to be the enormous amount of

time required to analyse the wealth of corpus data in a satisfactory way (Atkins 2008 (2002): 258f.).

A bilingual corpus offers too many equivalence candidates, each of which might seem important to

the lexicographer at some point, with the likely result that a dictionary compiled in this way would

contain ‘too much detail for most users’ and might end up being ‘too big to appear in print’ (Atkins

and Rundell 2008: 478). In principle, of course, these difficulties could be resolved if enough time –

and, consequently, money – were to be devoted to editing the preliminary version of the dictionary,

allowing the lexicographers to scrutinise equivalence candidates carefully and, ultimately, trim all

oversize entries. Meanwhile, as a sort of half-measure, corpus data are sometimes incorporated into

the electronic versions of bilingual dictionaries (e.g. the CD-ROMs or DVDs accompanying the

printed books) in an attempt to offer additional assistance to more advanced users, especially

translators and professional linguists.

Depending on corpus availability, but also on the time, money, and human resources

allocated to a particular project, the actual compilation of a bilingual dictionary may follow

different paths. As already indicated, ideally, the lexicographers should have at their disposal two

linguistically pre-analysed corpora (databases), one for each of the source languages of the

dictionary. As yet, this is hardly ever the case. Instead, for dictionaries whose SL is a well-described

language Lx, a frequently taken route is to start from an ‘universal’ Lx database built from the

resources of an Lx corpus. The database is universal in the sense that it can serve as a blueprint for a

bilingual dictionary from Lx into any other language. It is made up of a list of Lx headwords, each

complete with the relevant grammatical information, preliminary sense divisions, and definitions of

the distinguished senses; example sentences are sometimes provided as well. Such a pre-constructed

framework is passed on to a team of TL lexicographer-translators, whose main task is to fill the

empty slots, in other words, to supply TL equivalents for all the senses identified for the SL

headwords. Subject to the chief editor’s approval, TL lexicographers may be allowed to modify the

initial framework to varying extents in order to make it fit the target language better; this is

achieved mainly by splitting and lumping senses (see Chapter 4.9.).

What happens with the Ly-Lx part of the dictionary depends on the resources available for

Ly. The procedure may either be fully analogous (i.e. Ly-corpus-based) or, in the absence of an Ly

corpus, it may involve working with several monolingual dictionaries of Ly and/or using the results

of an automatic reversal of the Lx-Ly part (provided the two parts are not compiled simultaneously).

There are also cases when a bilingual dictionary is created through a bilingualisation of an

existing monolingual dictionary. Although seemingly a simpler task, it may actually require more

skill and effort to turn a fully-fledged monoligual dictionary into a bilingual one than to fashion a

bilingual dictionary from a semi-finished product, i.e. a database of the kind discussed above.

Again, depending on the project, different degrees of intervention in the original macro- and

microstructure may be allowed.5

For the sake of completeness, it ought to be mentioned that somewhere between a

monoligual dictionary and a bilingual dictionary created on its basis lies an intermediate genre: a

bilingualised (semi-bilingual) dictionary, i.e. one that offers TL equivalents while retaining the SL

definitions from the monolingual dictionary on which it has been founded. Such dictionaries are

always monodirectional and monoscopal (L2-L1), with only an L1-L2 index in place of a regular

L1-L2 section.

[A] 3. Megastructure

The overall structure of a dictionary (its megastructure) comprises the central word list

(macrostructure) and the outer texts (outside matter).

[B] 3.1. Word lists

In a biscopal dictionary, of course, we have not one word list but two. The main criterion for

inclusion in either of them is a given item’s frequency of occurrence, but other factors play a role as

well. In the language-learning context, that is, in pedagogical bilingual dictionaries, potential

usefulness for learners – which does not always coincide with corpus frequency – must be taken

into account. Thus, vocabulary items connected with the language classroom, complying with

examination requirements, and correlated with the interests of the target age group (e.g.

schoolchildren or young adults) have a fair chance of being included.

Another reason why the word lists cannot simply be determined by corpus frequency is that

this would automatically disqualify older words, which are either extremely rare in, or altogether

absent from, corpora of contemporary language. Although most bilingual dictionaries are

synchronic contemporary dictionaries (Svensén 2009: 23), a comprehensive dictionary of the

general language must also include some obsolescent or even completely obsolete words, in order

to meet the expectations of those users whose proficiency allows them to read older literature in the

foreign language. Since such users will need information about those words for decoding purposes

only, it follows that, in a monodirectional dictionary, older vocabulary items will feature mainly, if

not exclusively, as headwords in the L2-L1 part. By the same logic, in a bidirectional dictionary,

where each of the parts acts as an L2-L1 resource for one of the user groups, both word lists should

include important old words.

What about the other end of the time spectrum, that is, neologisms? On the whole, lexical

items which have not yet become institutionalised do not feature in dictionaries of any sort (except,

naturally, dictionaries of neologisms). Indeed, lack of a dictionary record is one of the main criteria

for deciding that a particular item is still at the neologism stage. It is all the more remarkable that,

occasionally, there may be more justification for admitting a neologism into a bilingual Lx-Ly

dictionary than into a monolingual dictionary of Ly. What we have in mind are cases when no Ly

equivalent can be found for a particular Lx headword and when, additionally, there is reason to

believe that the Lx word will eventually be borrowed into Ly (the tell-tale signs being its

attestations in informal Ly speech and/or in Ly content on the internet). Under such circumstances, a

bilingual lexicographer may decide to sanction the incipient borrowing by listing it among the

proposed Ly equivalents of the Lx headword. The only chance for a neologism to feature in a

bilingual dictionary is thus as a tentative TL equivalent in the Lx-Ly part rather than as a headword

in either Lx-Ly or Ly-Lx.

Finally, not only single words, but increasingly also multiword units can be headwords in

bilingual dictionaries. This is a result of the growing realisation that units of meaning are not always

co-extensive with orthographic words, ample evidence for which comes, among others, from the

study of language corpora. From the users’ point of view, one of the benefits of multiword units

being elevated to the status of independent headwords is that they are easier to locate than when

nested inside entries.

[B] 3.2. Outer texts

Outer texts are additions situated outside the A-Z core of the dictionary, either on the peripheries of

the dictionary proper (front and back matter) or as inserts – plates of drawings, diagrams, etc –

interspersed among the entries (middle matter). Until quite recently, neither dictionary makers nor

theoretical lexicographers used to pay much attention to those optional sections, arguing that very

few users ever consulted them.6 While doubtless true, it is hard to say whether the latter is solely the

cause of the lexicographers’ neglect or also one of its effects. What is certain is that outer texts often

give the impression of an afterthought, carrying material which is relatively easy to get hold of, very

likely added at the last minute and without much consideration to what the dictionary audience

might really need.

Fortunately, things are beginning to look up. In good dictionaries, the A-Z text is now

routinely preceded by a ‘How to use the dictionary’ section. Also present is a list of abbreviations,

including grammatical codes, phonetic symbols, usage labels, and the like. A grammar section –

with a list of noun declensions and/or verb conjugations, irregular verbs, and similar – is a frequent

feature. Occasionally, one can find some or all of the following: a selection of false friends, a

writing guide, a bank of common phrases used in everyday conversation, a list of popular texting

and e-mailing abbreviations (and their TL equivalents). In learners’ dictionaries, there is often a

special section devoted to aspects of the L2 culture. Lists of geographical and personal (given)

names, once quite common, are less frequent these days, partly because important place names tend

to feature as headwords in the A-Z part.

In print dictionaries, the choice of elements as well as the amount of information to be

included in the outside matter is heavily circumscribed by space considerations. In the case of

discursive sections, such as those on culture or essay writing, the language of presentation has to be

decided upon, as lexicographers can rarely afford the luxury of two language versions. Ideally, of

course, any text in L2 should be accompanied by its L1 translation, thus allowing users to choose

the version they fell more comfortable with – an easy thing to do in an electronic dictionary.

In sum, if a dictionary’s outer texts are to be of real use to the language learner, a lot of

thought has to go into their preparation. It may be a good idea to enlist the help of other specialists –

language teachers, grammarians, phoneticians – or even entirely delegate the task to them. All of the

lexicographers’ time and effort can then be spent on their main job: identifying TL equivalents and

presenting them in the most effective and efficient way.

[A] 4. Microstructure

The microstructure, that is, the structure of a single entry, can be quite complex in a modern

dictionary, with many constituent elements following one another in a set order, each conveying

different kinds of information. Here, only two types of entry constituent will be dealt with:

equivalents and examples of usage.

[B] 4.1. Equivalents

The one indispensable element of the microstructure of a bilingual dictionary is the headword with

its translations (known in lexicography as TL equivalents). Both the identification of suitable TL

equivalents and their clear presentation are crucial to the dictionary’s success.

[C] 4.1.1. Equivalent provision

The principal idea of a bilingual dictionary is deceptively simple: to provide equivalents for all

senses of all headwords, such that each equivalent is identical in meaning to the sense it has been

matched with.7 Unfortunately, the execution of this idea is, in most cases, extremely difficult and, at

times, utterly impossible. Three properties of natural languages – two intra- and one interlingual –

are responsible for that: vagueness of meaning, polysemy, and lack of one-to-one correspondence

between different lexical systems (i.e. so-called anisomorphism8 ).

Both vagueness and polysemy carry substantial benefits for language users, making it

possible to express an infinitude of meanings with the help of limited lexical resources. The same

cannot be said about their implications for lexicography. Semantic vagueness (indeterminacy)

significantly complicates the process of identifying the meanings of (decontextualised) lexical items

within each language. Until such identification has been accomplished, one cannot even begin to try

and match meanings interlingually for the purposes of a bilingual dictionary.

Besides, it is not always clear – not only to lexicographers, but also to lexical semanticists –

whether a particular lexical item is best viewed as vague or as polysemous, that is, whether it should

be thought of as having one general, underspecified meaning or several more or less distinct,

independent meanings.9 Opting for the latter, i.e. for polysemy, brings with it the need for careful

sense division of SL headwords (see Chapter 4.9.) as well as for meticulous discrimination and

disambiguation of TL equivalents (see sections 4.1.2. and 4.1.3. of the present chapter).

There is also a connection between polysemy and anisomorphism: the rarity of interlingually

symmetrical polysemy (i.e. of cases where two polysemous words in two different languages have

the same number of exactly the same senses) can be viewed as one of the facets of anisomorphism.

To illustrate this issue in a condensed form, let us consider The Sense of an Ending, the title of a

novel by Julian Barnes. Individually, both sense and ending are polysemous; combined as above,

they not only do not disambiguate each other, but intensify the ambiguity of the resulting phrase. At

least three readings of the phrase seem possible here: ‘the feeling that an end is approaching’; ‘the

meaning of a book’s ending’; ‘the meaning of a life’s ending’10. Indeed, it is possible to read the title

at a more general level, with all three interpretations being ‘turned on’ at once (the overall effect

thus amounting to vagueness rather than polysemy). When it comes to translation, there is,

naturally, little chance of preserving the multi-layered ambiguity – or vagueness, as the case may be

– unless the items proposed as TL equivalents of the two English nouns exhibit polysemy parallel

(at least in the relevant senses) to that of, respectively, sense and ending.11

In general, anisomorphism – a consequence of the fact that different languages structure

reality differently – means that perfect interlingual equivalence is an exception rather than the rule.

Consequently, if bilingual dictionaries are to be viable at all, the sameness-of-meaning requirement

must be relaxed, in recognition of the graded, rather than absolute, nature of interlingual

correspondences. A great deal of ingenuity and effort on the part of the lexicographer may be

required before even a limited degree of equivalence between the headword and its dictionary TL

counterpart can be reached.12

One of the means of achieving correspondence is to extend the scope of the unit to be

translated, in accordance with the principle that, when there is no equivalence at word level, it can

still be reached at the level of the entire message. Sometimes, it may be necessary to embed the

untranslatable SL lexical item in a sentence (and then translate the lot); at other times, adding some

minimal context (e.g. a modifier) and translating the resulting phrase will do the trick.

It would, however, be wrong to conclude that it is always easier to provide equivalents for

SL stretches larger than a single word. In particular, conventional multiword lexical units, especially

figurative ones, bring challenges of their own. On top of the difficulties which follow from lack of

isomorphism, the extra complication here is that the meaning of a figurative expression is often to

some extent motivated (transparent). The motivating link between the actual (that is, figurative)

meaning of an expression (such as an idiom or a proverb) and its lexical structure resides in the so-

called image component,13 i.e. the mental picture the expression evokes in the language user.

Strictly speaking, a perfect TL equivalent of a conventional figurative expression should therefore

correspond to it on both planes: that of the actual meaning and that of the rich image. What usually

happens is that correspondence is present only on one of the planes, if that. If two expressions are

identical exclusively on the level of the image, then, of course, they do not qualify as equivalent at

all; rather, they are phraseological false friends. There is no agreement as to how to treat two

expressions which do mean the same, but are based on markedly different images. Most authors

(including practising lexicographers) will accept them as fully equivalent, but some (e.g.

Dobrovol’skij and Piirainen 2005) will insist, in accordance with the reasoning presented above,

that their equivalence is only partial.

Consider the pair of proverbs which Duval (2008: 280) cites as an example of full

equivalence: English once bitten twice shy and French chat échaudé craint l’eau froide. Despite

their identical figurative meaning, the two are not always substitutable in translation: one cannot be

used in place of the other if, for whatever reason, the literal meaning – either that of having being

bitten or that of having been scalded – gets activated, e.g.:

Once bitten (by Godzilla), twice shy.

Unfortunetely, the nation is not “once bitten, twice shy.” More like multiple bites...14

Of course, bilingual lexicographers could count themsleves lucky if all their problems with

matching phraseological units were of this kind, the sole question being whether the equivalence

works in all contexts or only in some. Most of the time they deal with correspondences which are

much more tenuous than in the case just quoted, and sometimes the target language has no fixed

expression at all which would make a passable equivalent candidate. In such cases, it is better to

give a discursive explanation of the SL meaning in the target language than to propose as an

equivalent a TL expression which exhibits significant differences (whether semantic, syntactic, or

pragmatic) when compared with the headword, thereby potentially misleading the user.

[C] 4.1.2. Equivalent discrimination

Early bilingual dictionaries did not discriminate between equivalents at all, listing them one after

another, separated only by commas. Later, in slightly more ambitious publications, a distinction was

introduced between a comma, separating supposedly interchangeable equivalents, and a semicolon,

separating equivalents which were not fully interchangeable. There are still quite a few dictionaries

which continue this tradition. That it is not a tradition worth continuing shoud be obvious from the

paucity of cases when a SL headword (or a sense thereof) can be supplied with even a single perfect

TL equivalent. To assume that there are more perfect equivalents per headword (sense), and that

they are all fully interchangeable, flies in the face of what we know about interlingual

anisomorphism, on the one hand, and about the rarity of absolute synonyms within one language, on

the other.

In a bilingual L2-L1 dictionary meant to serve reception only, such indiscriminate

accumulation of equivalents, while hardly ideal, might perhaps be defended. When the dictionary’s

target language is the users’ L1, we can, arguably, expect them to be able to pick the appropriate

equivalent from among those provided. However, in a dictionary which also aims to take care of the

users’ productive needs, lack of equivalent discrimination is bound to have disastrous consequences.

Left to their own devices, some users will inevitably make some wrong choices, producing

utterances which are, at best, stylistically inappropriate and, at worst, downright incomprehensible

or ridiculous. Or simply not what they wanted to say.

Guiding the user to the equivalent they need involves making a number of decisions: what

kind of information should be given to identify the equivalent; in which of the two languages it

should be phrased; what non-verbal signs (numbers, punctuation marks, icons) should be employed

for the purpose. The information made use of in equivalent discrimination can be semantic (e.g. a

synonym, hyperonym, or collocate), syntactic (e.g. an indication of part of speech, transitivity, or

type of subject taken by the verb), or pragmatic (e.g. style, domain, regional, or temporal label). The

choice of typography is largely a question of aesthetics; what matters is that the signs should be

distinct enough and not too numerous or difficult to interpret.

The metalanguage in a monodirectional dictionary will, naturally, be the native language of

the users. Unfortunately, the policy cannot be consistently applied in bidirectional dictionaries, and

this necessarily puts one of the user groups at a disadvantage. The next best thing would be to give

all information in both languages – not a viable proposition in a printed book, whose entries would

swell in size and become impossibly hard to navigate, but easily doable in electronic reference

works, in which the choice of the metalanguage can be left in the hands of the user. The entry

quoted below demonstrates how a bidirectional dictionary with English and German (OGD)

attempts to solve the problem by alternating the metalanguage (in this instance, preceding the

German equivalent by an English synonym of the headword in the appropriate sense or subsense

and following it with two or three German collocates):

handsome

1. (good-looking) gut aussehend [Mann, Frau]; schön, edel [Tier, Möbel, Vase usw.]

2. (generous) großzügig [Geschenk, Belohnung, Mitgift]; nobel [Behandlung, Verhalten, Empfang];

(considerable) stattlich, ansehnlich [Vermögen, Summe, Preis]; stolz [Preis, Summe]15

[C] 4.1.3. Equivalent disambiguation

Whenever the lexical item offered as a TL lexicographic equivalent is ambiguous (whether due to

polysemy, homonymy, or vagueness), information should be provided enabling the user to home in

on the sense in which the equivalent is to be interpreted. This is especially important in L2-L1

dictionaries; in L1-L2 ones, the problem should not arise, thanks to the user’s familiarity with the

meaning of the (native) headword. Often, the same methods which serve the purposes of sense

and/or equivalent discrimination simultaneously resolve equivalent ambiguity. Thus, grammar

codes, usage labels, and guide phrases containing different kinds of clues (synonyms, collocates,

etc) can all be effective disambiguating devices. In addition, if two or more near-synonymous

equivalents are given side by side, they often automatically disambiguate each other, their semantic

‘common part’ indicating the relevant sense. If all else fails, the lexicographer can furnish the

equivalent with a gloss, as in this entry from a Russian-English dictionary (CORD):

интерпретáтор interpreter (expounder),

where the gloss deactivates the ‘oral translator’ reading of the equivalent.

[B] 4.2. Examples

Unlike equivalents, without which a bilingual dictionary is simply impossible, examples of usage

are not an obligatory element of the microstructure. Nonetheless, they can be found more and more

often these days, especially in bilingual dictionaries geared specifically to the needs of language

learners. Once the decision has been made to include this feature, a number of questions have to be

answered, such as whether the examples are better invented by the lexicographer or extracted from

a corpus (and, if corpus-based, to what extent they can be modified) and whether or not they should

be translated.

In dictionaries for beginners and/or young schoolchildren, where the examples need to be

simple and short, and where, consequently, the authenticity of the illustrative material is less

important than its attunement to the users’ proficiency level, examples entirely made up by the

lexicographer may be the best solution. On the other hand, simplicity must give way to authenticity

in the exemplification of rare words (or senses), which need to be shown in the context in which

they are normally found rather than in artificially simplified sentences. In sum, neither corpus-based

nor made-up examples are inherently better – it all depends on the level of the user and on the kind

of word (or sense) being illustrated.

Since bilingual dictionaries serve users at all levels of proficiency, the examples they offer

cannot always be quoted exactly in the form in which they appear in the corpus, but need to be

carefully edited before being put in the dictionary. Editorial intervention involves cutting sentences

down to manageable size as well as eliminating excessively difficult lexical items, obscure culture-

specific references, and other potentially distracting detail.

Something more than simple editing may be required when examples are meant to assist in

encoding, as they normally are in bilingual lexicography (given that the decoding function is taken

care of by the equivalents). As demonstrated in a recent experimental study (Frankenberg-Garcia

2012), production-oriented examples are much more effective if they have been hand-picked in

such a way as to address common production errors, thus providing repeated exposure to structures

that are problematic for learners with a particular mother tongue background.16 While not a realistic

requirement for each single entry, this policy should at least be followed in entries containing words

or structures notoriously difficult for the target users.

Opinions differ as to what it is that example sentences ought to illustrate in a bilingual

dictionary. In trying to answer this question, it helps to think about what bilingual dictionaries are

typically used for. It seems that, whatever their motivation, people do not normally consult a

bilingual dictionary in order to find out how their native language works. Consequently, in a

monodirectional dictionary, it makes sense for examples always to illustrate the users’ L2. This

boils down to illustrating the headword in the L2-L1 section (which is what dictionaries have

traditionally been doing) and the equivalent in the L1-L2 section (which is an innovation). The

innovative approach has been taken by LSW, a dictionary for Polish learners of English, whose

Polish-English part contains examples illustrating the equivalents, as in the following:

płynąć

1 (ryba, człowiek) swim:

· fish swimming up the stream

2 (rzeka, woda) flow:

· The River Elbe flows through the Czech Republic.

· A steady stream of cars flowed past her window.

3 (woda, łzy) run:

· Tears ran down her face.

4 (statek, statkiem) sail:

· We sailed along the coast of Alaska.

5 (czas) go by:

· Time goes by so quickly these days.

Regrettably, most bilingual dictionaries which feature examples still employ them

exclusively to illustrate source language material (i.e. the headword and the combinations it enters

into), in both parts of the dictionary and irrespective of its directionality. In bidirectional

dictionaries, there is, of course, no other option: examples must illustrate the headword, because the

dictionary’s source language is always an L2 for one of the user groups. Still, bilingual learners’

dictionaries being, by definition, monodirectional, there is hope that the situation there might

change in the future.

The controversy over what examples should illustrate is intimately connected with another

question, namely, whether they ought to be translated.17 Most examples support information given

earlier in the entry, but some expand on or qualify it, usually by introducing important exceptions

(e.g. by showing that, in certain circumstances, a different translational equivalent is needed or that

the headword is regularly omitted in translation). Examples which merely support the equivalent(s)

can – perhaps even should – be left untranslated, whereas those which illustrate exceptions must be

supplied with translations, either of the whole or of the part for which the so-far-unmentioned

equivalent is needed. In electronic dictionaries, the question loses much of its urgency, as it is

possible to make translation an optional feature, to be switched on and off when needed. Even there,

however, the problem does not disappear completely. After all, it does matter – especially for

advanced users, who should be given maximally authentic models – whether a particular sentence

has originally been uttered in a given language or is merely a translation (and thus potentially

exhibits features of translationese).

[A] 5. Conclusion

While the present chapter has touched upon some important issues involved in the compilation of

bilingual dictionaries, it has not attempted to make any systematic predictions about the future. This

caution seems justifiable, since even the world’s most renowned lexicographers, when asked what

future dictionaries will look like – or, indeed, whether there still will be dictionaries a few years

from now – tend to avoid direct answers.18 While it is hard at present to imagine a world without

dictionaries, especially bilingual ones, no-one really knows how the world of reference science is

going to be transformed by the technological advances we are witnessing. What does seem certain

is that people will always need information about interlingual lexical correspondences and that, no

matter in what form and through what medium that information is conveyed to them, it will

capitalise upon the work which bilingual lexicographers have done over the past centuries.

Notes

1. Or, in general, on the language less known to them (see the remarks about L3-L2 dictionaries

above).

2. The dictionaries in the Oxford Beginners series are a good example.

3. See Adamska-Sałaciak (2010b) for a review of the inherent deficiencies of monoligual learners’

dictionaries and the resulting need for bilingual ones.

4. As of December 2011 (on the authority of Gilles-Maurice de Schryver, guest lecturer at the

Faculty of English, Adam Mickiewicz University in Poznań). Basing a dictionary on a parallel

corpus seems a much more viable proposition where specialist publications are concerned (see, e.g.,

the work in progress on a monoscopal English-Polish dictionary of phrasal verbs reported on in

Perdek 2011).

5. Adamska-Sałaciak (2006a: Chapter One) illustrates the different ways in which bilingual

dictionary compilation can proceed, using English-Polish/Polish-English dictionary projects from

the 1990s and early 2000s as examples.

6. For some background on lexicographic outer texts, see the first two chapters of Bielińska (2010).

7. The intricate relations between lexicographic equivalence and semantic identity are discussed in

Adamska-Sałaciak (2013).

8. The term anisomorphism was introduced by Zgusta (1971: 294ff.).

9. The different diagnostic tests for distinguishing vagueness from polysemy proposed by

philosophers and semanticists are not wholly reliable, often yielding inconclusive or conflicting

results (for an overview, see Geeraerts 1993).

10. Either the life of the aging narrator-protagonist or that of his best friend, whose reasons for

killing himself are revealed towards the end of the book, completely undermining the reader’s

earlier perception of the characters and events described.

11. Since Polish does not afford such a possibility, in Barnes (2012) the translator chose the first –

arguably, most salient – reading, thus eliminating any uncertainty as to the interpretation and

destroying the vague aura of mystery carried by the title.

12. A typology of equivalence types, conceived of as areas on a cline, has been developed in

Adamska-Sałaciak (2006a, 2010a, 2011). A discussion illustrated with examples taken from

dictionaries for different language pairs is forthcoming in Adamska-Sałaciak (to appear).

13. The discussion in this paragraph relies on the terminology of the Conventional Figurative

Language Theory developed in Dobrovol’skij and Piirainen (2005).

14. The examples have been obtained from the enTenTen Corpus with the help of the Sketch Engine

Corpus Query System; original spelling has been retained.

15. In this, as in all other examples of dictionary entries quoted in this chapter, only those elements

of the entry are given which illustrate the point under discussion.

16. Although Frankenberg-Garcia’s study deals with definitions and examples, imitating the process

of monoligual dictionary consultation by (Portuguese) learners of English, it seems that this

particular conclusion reached by the author can be extended to bilingual dictionaries.

17. The problem is considered in detail in Adamska-Sałaciak (2006b).

18. It has become a custom at recent lexicographic conferences (e.g. EURALEX 2010, e-LEX

2011) to organise panel discussions on the topic.

References

Adamska-Sałaciak, Arleta 2006a. Meaning and the Bilingual Dictionary: The Case of English and

Polish. Frankfurt am Main: Peter Lang.

Adamska-Sałaciak, Arleta 2006b. ‘Translation of dictionary examples – notoriously unreliable?’, in

Elisa Corino, Carla Marello, and Cristina Onesti (eds.) Atti del XII Congresso Internazionale di

Lessicografia, Torino, 6-9 settembre 2006. Alessandria: Edizioni dell’Orso. vol. 1. 493-501.

Adamska-Sałaciak, Arleta 2010a. ‘Examining equivalence’, International Journal of Lexicography

23.4. 387-409.

Adamska-Sałaciak, Arleta 2010b. ‘Why we need bilingual learners’ dictionaries’, in Ilan Kernerman

and Paul Bogaards (eds.). English Learners’ Dictionaries at the DSNA 2009. Tel Aviv: K

Dictionaries. 121-137.

Adamska-Sałaciak, Arleta 2011. ‘Between designer drugs and afterburners: A lexicographic-

semantic study of equivalence’, Lexikos 21. 1-22.

Adamska-Sałaciak, Arleta 2013. ‘Equivalence, sameness of meaning, and synonymy in a bilingual

dictionary’, International Journal of Lexicography. Special Issue ed. by Fredric Dolezal.

Adamska-Sałaciak, Arleta. To appear. ‘Explaining meaning in a bilingual dictionary’, in Philip

Durkin (ed.) The Oxford Handbook of Lexicography. Oxford: Oxford University Press.

Atkins, Beryl T. Sue. 1985. ‘Monolingual and bilingual learners’ dictionaries: A comparison’, in:

Robert Ilson (ed.). Dictionaries, Lexicography and Language Learning. Oxford: Pergamon

Press. 15-24.

Atkins, B. T. Sue. 2002. ‘Then and now: Competence and performance in 35 years of

lexicography’, in Anna Braasch and Claus Povlsen (eds.). Proceedings of the Tenth EURALEX

International Congress. Vols. 1-2. Copenhagen: Center for Sprogteknologi. 1-28. (Reprinted in

Thierry Fontenelle (ed.) 2008. Practical Lexicography: A Reader. Oxford: Oxford University

Press. 247-272.)

Atkins, B. T. Sue and Michael Rundell. 2008. The Oxford Guide to Practical Lexicography. Oxford:

Oxford University Press.

Barnes, Julian. 2011. The Sense of an Ending. London: Jonathan Cape.

Barnes, Julian. 2012. Poczucie kresu. Warszawa: Świat Książki. (Polish transl. of Barnes 2011 by

Jan Kabat).

Bielińska, Monika. 2010. Lexikographische Metatexte. Eine Untersuchung nichtintegrierter

Aueβentexte in einsprachigen Wörterbüchern des Deutschen als Fremdsprache. Frankfurt am

Main: Peter Lang.

Dobrovol’skij, Dmitrij and Elisabeth Piirainen. 2005. Figurative Language: Cross-cultural and

Cross-linguistic Perspective. Amsterdam and Boston: Elsevier.

Duval, Alain. 1991. ‘Equivalence in bilingual dictionaries’, in Thierry Fontenelle (ed.) 2008.

Practical Lexicography: A Reader. Oxford: Oxford University Press. 273-82. (Originally

published as Duval, A. 1991. ‘L’equivalence dans le dictionnaire bilingue’, in Franz-Josef

Hausmann, Oskar Reichmann, Herbert Ernst Wiegand and Ladislav Zgusta (eds.).

Wörterbücher/ Dictionaries/ Dictionnaires. Ein internationales Handbuch zur Lexikographie. An

international encyclopedia of lexicography. Encyclopédie internationale de lexicographie. Berlin

and New York: Mouton de Gruyter. vol. 3. 2817-2824.)

Frankenberg-Garcia, Ana. 2012. ‘Learners’ use of corpus examples’. International Journal of

Lexicography (http://ijl.oxfordjournals.org/content/early/2012/06/25/ijl.ecs011.full.pdf+html)

Geeraerts, Dirk. 1993. ‘Vagueness’s puzzles, polysemy’s vagaries’, Cognitive Linguistics 4: 223-

272 (Reprinted in Geeraerts, D. 2006. Words and Other Wonders. Berlin: Mouton de Gruyter. 99-

148.)

Hausmann, Franz-Josef and Reinhold Otto Werner. 1991. ‘Spezifische Bauteile und Strukturen

zweisprachigen Wörterbücher: Eine Übersicht’, in Franz-Josef Hausmann, Oskar Reichmann,

Herbert Ernst Wiegand, and Ladislav Zgusta (eds.), Wörterbücher/ Dictionaries/ Dictionnaires.

Ein internationales Handbuch zur Lexikographie. An international encyclopedia of

lexicography. Encyclopédie internationale de lexicographie. vol. 3 Berlin and New York:

Mouton de Gruyter.

Perdek, Magdalena. 2011. English phrasal verbs in translation: A lexicographic and corpus study of

equivalence. Unpublished PhD dissertation, Adam Mickiewicz University.

Svensén, Bo. 2009. A Handbook of Lexicography. Cambridge: Cambridge University Press.

Zgusta, Ladislav. 1971. Manual of Lexicography. Prague: Academia. The Hague: Mouton.

Dictionaries

Concise Oxford Russian Dictionary. 1998. Revised Edition, based on the Oxford Russian

Dictionary. Oxford: Oxford University Press. [CORD]

Longman Słownik Współczesny Angielsko-Polski, Polsko-Angielski. 2011. Harlow: Pearson

Education Limited. 2nd edition. [LSW]

Oxford German Dictionary. 2008. Oxford: Oxford University Press. 3rd edition. [OGD]

Issues in compiling bilingual dictionaries

Documents

Transcript of Issues in compiling bilingual dictionaries