Gender, animacy, and declensional class assignment: a unified account for Russian moreNorman Fraser & Greville G. Corbett. 1995. Gender, animacy and declensional class assignment: a unified account for Russian. In: Geert Booij and Jaap van Marle (eds) Yearbook of Morphology 1994. Dordrecht: Kluwer. pp. 123-50. [Note: the Yearbook for a given year was published the next year.] |
14 views |
Gender, Animacy, and Declensional Class Assignment:
A Unified Account for Russian
NORMAN M. FRASER and GREVILLE G. CORBETT
I. INTRODUCTION1
In an earlier paper we introduced Network Morphology, an approach to inflec-
tional morphology which relies heavily on the notion of default inheritance
(Corbett & Fraser 1993), and showed how a complex set of data pertaining
lo the inflectional morphology of Russian nominals could be handled in this
framework. We extend that analysis here to include a range of phenomena
which, we believe, has not previously been described in a single formally
explicit framework. The key concept underlying our analysis is that of default
inheritance. We introduce this below, together with a lexical knowledge rep-
resentation language called DATR, which we use to make our analysis fully
explicit.2 In Section 3 we describe our approach to gender assignment in
Russian; animacy assignment is dealt with in Section 4; and declensional
class assignment in Section 5.
The issues tackled here - the interrelation of meaning, gender, declen-
sional class and phonology - have been considered by Aronoff (1992), in
his discussion of Arapesh based on Fortune (1942), and by Corbett (1991:
7-69). Aronoff and Corbett both highlighted the complex interactions of factors
in the assignment of gender, and Aronoff made extensive use of defaults.
However, neither had a formal apparatus to show clearly whether their claims
were valid. In this paper we go further in that we give a formal account,
which allows us to demonstrate that our model does indeed make the correct
predictions. Our demonstration is based on Russian, which provides a good
range of overlapping factors involved in assignment. Russian has the advan-
tage that, following work by Ilola & Mustajoki (1989), based on Zaliznjak
(1977), we have a clear picture of the numbers of nouns of each type in the
lexicon. We "believe that Aronoff's account of Arapesh is similarly robust,
and that it yields the right results if subjected to a comparable formal analysis
(Fraser & Corbett 1993). We offer some brief conclusions in Section 6. The
DATR encoding of our theory and the output listing which demonstrates the
validity of our claims are given as appendices.
2. DEFAULT INHERITANCE
Default inheritance is usually introduced by means of non-linguistic examples,
and we make no exception here. Consider the taxonomic hierarchy in Figure
1. The lines in the taxonomy indicate instantiation rather than sub-classifica-
123
Geerl Booij and Jaap van Mailt (ids.). Yearbook of Morphology 1994, 123-1 SO.
© 1995 Kluwer Academic Publithers. Primed in the Netherlands.
124
Norman M. Frascr and Greville G. Corbett
BIRD
has feathers
can fly
EAGLE ROBIN PENGUIN
cannot fly
Edwina Rupert Percy
Fig. 1. A simple instantiation hierarchy.
tion. So an eagle is a bird, as is a robin and a penguin; Edwina is an eagle,
Rupert is a robin, and Percy is a penguin.
Given an instantiation hierarchy of this kind, default inheritance allows
all attributes of a given node in the hierarchy (such as BIRD) to be inher-
ited by a node which instantiates it (such as EAGLE), except in cases where
the lower node already has a value for some'attribute and thus overrides the
default (i.e. inheritable) value for that attribute. In our example, a BIRD has
feathers and can fly. These facts are inherited by EAGLE and ROBIN and,
indirectly, by Edwina and Rupert. The attribute of having feathers is also
inherited by PENGUIN and, thereby, by Percy. However, specific informa-
tion about the flying abilities of PENGUINs blocks inheritance of the more
general information about BIRDs. Thus, despite the fact that Percy is a BIRD,
he cannot fly.
This approach to information organization allows generalizations to be
expressed once at a high level, and then automatically to apply to everything
which inherits from there. In this way regularities, subregularities and excep-
tions can be encoded with considerable ease and parsimony. This approach has
the added advantage of marking exceptions as such, as in the case of PENGUIN
in Figure I. If Percy were a truly exceptional penguin who had obtained a
pilot's license, this extreme degree of markedness would be visible in the
inheritance structure by virtue of an exceptional fact (PERCY can fly) over-
riding an exceptional fact (PENGUINs cannot fly), overriding a default (BIRDs
can fly).
The history of linguistics offers many examples of linguistic theories which
conflate substance and form. It is now widely acknowledged that the ability
to separate these offers at least two benefits: first, it focuses attention on the
substantive content of theories rather than on their notational sugar; second,
A Unified Account for Russian
125
it facilitates cross-theoretical comparisons (Shieber 1987). Network Morphol-
ev is concerned with substantial matters of linguistic theory, which could,
irf principle, be formalized in a number of different ways. For this purpose,
We use an existing formalism called DATR.3 The DATR knowledge repre-
sentation language was developed by Roger Evans and Gerald Gazdar as a
logic-based formalism for describing inheritance networks (Evans & Gazdar
1989a, b). Since computer implementations of the DATR language are avail-
able, it is possible to check that an inheritance network expressed in DATR
captures the intended generalizations.'1 AH of the analyses in this paper have
been checked in this fashion.
The information in Figure 1 would be expressed in DATR as follows:
(0
BIRD: <has feathers> ==
<can fly> == yes.
EAGLE: <> = = BIRD.
ROBIN: <> = = BIRD.
PENGUIN: <> = = BIRD
<can fly> == no.
Edwina: <> = - EAGLE.
Rupert: <> = = ROBIN.
Percy: <> = = PENGUIN.
yes
The labels preceding colons are 'nodes'; the angle bracket expressions to the
left of the symbol are 'paths' (also called 'attributes'); the words to the
right of non-empty paths are 'values'. Thus, the value of the <can f ly>
path at the PENGUIN node is 'no'.
It should be clear how this kind of formalism might be useful in expressing
linguistic generalizations. Figure 2 shows the inheritance network for Russian
nominals which we presented in our earlier work (Crobett & Fraser 1993).
N_l to N_IV are declensional classes. (Notice that we posited the
existence of a node N_0 from which N_I and N„IV inherit, thereby finding
a principled reason for sitting on the fence in the debate over whether Russian
has three or four noun declensional classes). The following (incomplete)
fragment is taken from our earlier analysis.*
"<stem>" _e
"<stem>" i.
(2) NOUN: <mor loc sg> •==
<mor nom pl> —
N_III: <> « NOUN
<mor loc sg> == "<mor dat sg>".
Kost': <> == N_III
<stem> == kost'.
/The first sentence at NOUN should be read as saying that the locative singular
^consists of the stem followed by an -e ending.6 A path enclosed in double
126
Norman M. Fraser and Greville G. Corbel!
NOMINAL
N_I NJV N_II N_m
zakon v'ino komnafa kost'
Fig. 2. An inheritance siruciure for Russian nominats.
quotes in a DATR sentence is used to retrieve the specified value for the
thing inheriting from that sentence. If we wanted to find the nominative
plural of Kost1, we would inherit the sentence <mor nom pl> == "<stenv>"
_i. Before going any further we would have to find out what the <stem>
of Kost' is. Since the answer is kost', the nominative plural is kost'L If we
wanted to know the locative singular of Kost', we would never inherit the
definition of locative singular at NOUN because it is overridden at N_lll,
from which Kost' inherits. The definition of locative singular at N_III
establishes an asymmetric identity between the locative singular form of an
N_III noun and its dative singular.
It is often desirable to inherit most information from one source, but to have
access to certain kinds of information stored elsewhere. DATR supports this
kind of multiple inheritance. Consider the following fragment.7
(3) N_II: <> -= NOUN
<mor gen sg> == "<stem>" _i.
N_III: <> « NOUN
<mor gen sg> == N_II.
This says that N_III may inherit its schema for forming the genitive singular
from N_ii, even though N_iii (like N_II) inherits primarily from NOUN.
This may be expressed more explicitly as follows;
(4) N_III: <mor gen sg> « N_II: <mor gen sg>.
A Unified Account for Russian
127
The last form of DATR construction we shall consider here is the so-called
'evaluable path'. Consider the following (slightly simplified) fragment:
(5) NOUN: <mor gen pl>
*s< *<mor stem hardness>" mor gen pl>"
<sof t mor gen pl> == "<stem>" „ej.
N_III: <> M NOUN.
Kost': <> == N_III
<stem> == kost'
<mor stem hardness> == soft.
Nouns in Russian can be divided into those having soft (palatalized) stems and
those having hard (non-palatalized) stems. The difference ts not usually of
morphological significance, except in the case of genitive plurals, where it
is crucial. The first equation shown here can be interpreted as follows: to obtain
the genitive plural form, find out the hardness of the stem and prepend the
value to the path <mor gen pl>, then evaluate the resulting path.8 Thus, to
find the genitive plural of Kost', inference would proceed as follows (expressed
syliogistically in terms of premises and conclusions):9
a. NOUN : <mor gen pl> =
"< "<mor stem hardness>" mor gen pl>"
N_III: <> == NOUN
Kost': <> == N_III
Kost': <mor gen pl> =
"< "<mor stem hardness>" mor gen pl>"
b. Kost': <mor gen pl> =
"<mor stem hardness>" mor gen pl>"
Kost': <mor stem hardness> == soft
Kost' : <mor gen pl> = = *<sof t mor gen pl>"
c. NOUN: <sof t mor gen pl> == "<stem>" „e j
N_111 :■ <> « NOUN
Kost': <> == N_III
Kost': <sof t mor gen pl> == "<stem>" _ej
d. Kost': <sof t mor gen pl> == "<stem>" _ej
Kost': <stem> == kost
Kost': <sof t mor gen pl> == kost' „ej
e. Kost': <tnor gen pl> = = "<soft mor gen pl>"
Kost': <sof t mor gen pl> === kost' _ej
Kost': <mor gen pl> s == kost' _ej
128
Norman M. Fraser and Greville G. Corbetl
This concludes our brief inlroduction to default inheritance and to the DATR
formalism. More detailed introductions to default inheritance can be found
in Gazdar (1987) and Daelemans, de Smedt and Gazdar (1992), and to DATR
in Gazdar (1990; forthcoming). In the following sections we show how
Network Morphology uses default inheritance to produce elegant solutions
for some well known problems in the analysts of Russian nouns.
3. GENDER ASSIGNMENT
Russian has three genders, masculine, feminine and neuter.10 We could enter
a value for gender in the lexical entry of each noun. However, this would
miss generalizations at two levels: first, the gender of Russian nouns does
not appear to be random, and second, more generally, we have argued that
languages never have to specify gender for the majority of nouns. We have
claimed that the gender of the overwhelming majority of nouns can always
be predicted, either from semantic information which must, in any case, be
stored in the lexical entry or from semantic information supplemented by
formal information, which may be morphological or phonological (Corbett
1991: 68). Specifically in Russian, the gender assignment rules refer to
semantic and to morphological information:
Semantic assignment rules for gender
1. Sex-differentiable nouns denoting males (humans and higher animals)
are masculine: for example, student '(male) student';
2. Sex-differentiable nouns denoting females are feminine: for example,
uditel'nica '(female) teacher'
Nouns which are sex-differentiable are those where the sex of the referent
matters to humans (those denoting humans and domesticated animals) and
where the difference is striking (as in the case of lions).
These rules leave very few exceptions, but there are many nouns which
are not covered by them. These remaining nouns are distributed over the
three genders and their distribution is accounted for by the morphological
assignment rules. The major morphological assignment rules are as follows:
Morphological assignment rules for gender:
1. nouns of declensional class I are masculine;
2. nouns of declensional class II and III are feminine;
3. nouns of declensional class IV are neuter.
As we shall consider in more detail later, there is some overlap between the
two sets of rules: nouns which denote males are often in declensional class I.
We might try to do away with the semantic assignment rules. However, the
two sets of rules can make conflicting assignments, and when they do it is
the semantic rules which dominate. The crucial case is muzftna 'man* (and
A Unified Account for Russian
129
similar nouns), which ought to be masculine according to its semantics (male),
but feminine according to its morphology (declensional class II). In fact, it
belongs in the masculine gender."
There are also nouns which do not decline, and to which the above mor-
phological assignment rules do not apply. However, their lack of declension
is itself a matter of morphology, and this interacts with the semantic feature
of animacy to allow gender assignment. We treat indeclinable nouns as having
their own declensional class (V); nouns of this class may be subject to the
normal semantic assignment rules. Failing this, they are masculine if animate
and neuter if not.1'
in our Network Morphology account, every noun inherits from the NOUN
node. Since the following path equation is specified at NOUN, every noun
inherits it unless it is overridden by a more specific equation declared in the
lexical entry."
[1] NOUN: <syn gender> == GENDER: < "<sem sex>" >
In order to find a value for the path <syn gender>, the path <sem sex>
is evaluated (i.e. the sex of the noun's referent is retrieved) and then a path
consisting only of the sex is evaluated at the node GENDER (shown below):
[2] GENDER: <male> == masc
<female> == fern
<undifferentiated>
== "<mor formal„gender>".
The interpretation of this fragment of DATR is straightforward in the case
of sex-differentiated nouns. If the referent of the noun is male then the gender
is masculine; if the referent is female then the gender is feminine.14 However,
if the sex is undifferentiated, it is necessary to consider additional formal
criteria, and this is done by evaluating the path <mor f ormal_gender>.
Formal gender is defined for each of the five main declensional classes. Thus,
for example, the following equation is located at the node for declensional
class II nouns, N„ll:
[3] N_II: <formal_gender> == fern
This is used in the evaluation of the path <mor f ormal_gender> (the
function of the initial mor will be spelt out below). Thus, a class II noun
denoting a male (such as muUina 'man') will have masculine gender; a class
II noun denoting a female (such as ufitel'nica 'female teacher') will have
feminine gender; and a class II noun denoting a non-sex-differentiated referent
(such as komnata 'room') will have feminine gender, the default gender for
class II nouns.
As wc noted above, there is an additional complication with declensional
class V, where animacy must also be taken into consideration. Some equa-
tions from the node N„V, from which all class V nouns inherit, are given below.
130
Norman M. Fraser and Greville G. Corbetl
[4] N_V: <formal_gender> « < Vsem animacy>" >
<animate> == itiasc
<inanimate> == neut.
In order lo assign a formal gender for a given class V noun it is necessary
to evaluate the <sem animacy> path for that noun and use it to select
masculine gender if the noun denotes an animate referent and neuter gender
if the referent is inanimate.
Our analysis so far accounts for the gender of the vast majority of Russian
nouns. It is an advance on Corbett (1982) in that it is formally explicit, and
since it is encoded in DATR we can see that the correct predictions are indeed
made (see Appendix B).
4. ANIMACY ASSIGNMENT
Animacy in Russian is of particular interest. It is reflected in the accusative
case forms of certain nouns, and in agreement. There are no exclusively
animate accusative forms in Russian; instead, animacy shows itself by different
patterns of syncretism. With an animate noun of the first declensional class
we find the accusative identical to the genitive.
(7) pervogo (acc - gen) studenta (acc - gen)
first student
'the first student'
With an inanimate, by contrast, the accusative is as the nominative:
(8) pcrvyj (acc - nom) zakon (acc - nom)
first law
'the first law'
In both these examples, animacy is reflected not only in the morphological
form of the noun, but also in the agreement of the adjective. This agreement
can also be found with those nouns which decline according to pattern N_ll,
and so have a unique accusative form, but which are masculine animate, as
is the case with muzfina 'man':
(9) pervogo (acc = gen) muzemu (acc)
first man
'the first man'
Animacy is a sub-gender (Corbett 1991: 165-167). It is narrower in range than
the three main genders, since it affects only the accusative case. It is a more
recent addition to the system than the main genders, and assignment to the
animate sub-gender is (still) largely based on semantics: animate nouns are
those denoting humans, animals and insects. Returning to the morphology,
the crucial point is that we get the same regularity but in different paradigms:
A Unified Account for Russian
13!
the accusative matches the genitive for animate nouns under certain circum-
stances, and the nominative for inanimate nouns, though the phonological form
of the nominative or genitive varies from paradigm to paradigm.
In Corbett & Fraser (1993) we specified animacy in lexical entries as a
semantic feature; then by default, syntactic animacy was the same as semantic
animacy. However, there are further simplifications available. First we can
say that all sex-differentiables will be animate, hence semantic animacy need
not be specified in their lexical entries. Second, by default, nouns arc seman-
tically inanimate. This second default makes sense in the light of the statistical
evidence provided by Ilola & Mustajoki (1989: 15-17), based on Zaliznjak
(1977): of the 45,779 nouns in Zaliznjak's dictionary, 75% (34,178) denote
inanimates. Together, these two generalizations mean that in most lexical
entries there is no need to specify animacy.
The default identity between syntactic animacy and semantic animacy can
be expressed trivially in DATR:
[ 5 ] NOUN: <syn animacy> == "<sem animacy>"
This simply says that the syntactic animacy is identical to the semantic
animacy. Since this default generalization is stated at the level of the NOUN
node, it may be overridden by equations in the lexical entries of words with
conflicting semantic and syntactic animacy. Kon' 'knight (chess piece)' is such
a word: it is semantically inanimate but syntactically animate.
So much for syntactic animacy. For nouns which are sex-differentiable,
semantic animacy is obtained as follows:
[6] NOUN: <sem animacy>:
= = ANIMACY: < Vsem sex>" >.
[7] ANIMACY: <> — animate
<undifferentiated> == inanimate.
The path shown at [6] indicates that the semantic animacy may be found by
evaluating the sex of the word's referent at the ANIMACY node. If the sex is
undifferentiated, the value inanimate is found at the ANIMACY
node, as shown in [7}. For all other values of <sem sex> (i.e. male or
female) the value animate is found. Thus the ANIMACY node can be
thought of as a look-up table specifying default sex/animacy associations.
Nouns are inanimate by default because their sex is undifferentiated by
default and the final equation in {7} establishes the connection between the
two. The undifferentiated sex default is encoded by means of the following
equation, which is located at the NOUN node:
(8] NOUN: <sem sex> == undifferentiated
Our analysis now accounts for the syntactic animacy as well as the gender
of the vast majority of Russian nouns in a formally explicit way. While animacy
is less complex than gender, it is perhaps surprising to see just how simple
132
Norman M. Fraser and Greville G. Corbetl
it becomes: with appropriate use of defaults, animacy need hardly ever be
specified in lexical entries. Again, we make the right predictions as Appendix
B shows.
5. DECLENSIONAL CLASS ASSIGNMENT
So far we have specified the declensional class of each noun in its lexical entry.
This seems to be missing a generalization, since there are common, though
by no means exceptionless, correspondences between meaning, gender and
declensional class. Indeed, some earlier analyses attempt to predict declen-
sional class from other information which was specified.15 For many nouns,
it is the case that declensional class is predictable from semantic or formal
information. The semantic correspondences are as follows:
Semantic assignment rules for declensional class
1. Sex-differentiable nouns denoting males (humans and higher animals)
are of declensional class N„I: for example, student '(male) student';
2. Sex-differentiable nouns denoting females are of declensional class N_II:
for example, uditel'nica '(female) teacher'
There are substantial numbers of nouns whose declensional class must be spec-
ified to override rule one. These are nouns like muz£ina 'man' (as in example
(3», which denote males but which decline according to declensional class
NJI. (Lazova 1974: 942-943 puts the figure at 273, but the number involved
is actually larger because there are many hypocoristics of this type, like Saga
'Sasha' which do not appear in dictionaries like Lazova's). There are fewer
instances of overrides to rule 2, but we find a small number of nouns like
svekrov' 'mother-in-law' in declensional class N_UI.
Forma! assignment rule for declensional class
1. Nouns whose stem ends in a vowel are of declensional class N„V.
The effect of this rule is to make nouns whose stem ends in a vowel indec-
linable (class N_V are the indeclinables). Our rule follows Worth (1966),
though as he points out the idea goes back much further. A noun like taksi
'taxt' is entered as such in the lexicon and this guarantees its indeclinability.
These generalizations can be formalized fairly easily. Consider the following
DATR fragment, which is positioned at the NOUN node:
[9] NOUN: <mor> == "<declensional_class>"
<declensional„class> == DECLENSION:
<"<infl_root final>" *<sem sex>">
The first equation indicates that in order to find a value (or values) for
the <mor> path (or paths), it is necessary to evaluate the <declen-
sional_class> path. The second equation is somewhat more complex: a
value can be retrieved for the <declensional_class> path by evaluating
A Unified Account for Russian
133
a path consisting of the value of the path <inf l_root f inal> followed
by the value of the path <sem sex>, Paths beginning <inf l_root> make
available information about the inflectional root of a word. The <inf l„root
f inal> path is used to store information concerning the final segment of
the inflectional root, specifically whether it is a consonant or a vowel. We
may assume that this information would, in reality, be supplied by a phono-
logical component, though for convenience we simulate this crudely by means
of the following equation recorded at the NOUN node.16
(10] NOUN: <infl_root final> == consonant
This generalization holds for all Russian nouns, except certain indeclinable
nouns of relatively recent foreign origin. In the case of these exceptions the
default generalization is overridden in the lexical entries.
We have already noted how values for the <sem sex> path are supplied
either in the lexical entries or by means of a default assignment of undiffer-
entiated sex. Thus, where declensional class is predictable, it is found by
evaluating a path at the DECLENSION node. This path consists of informa-
tion on the final segment of the inflectional root of the word followed by
the sex of the word's referent. The DECLENSION node is given below:
[11J DECLENSION: <consonant male> == N_I:<>
<consonant female> == N_II:<>
<vowel $sex> == N_V:<>.
The first two paths encode the semantic assignment rules for declensional class.
The first element of each path is consonant so these paths potentially
apply to the bulk of Russian nouns. The second elements of the paths narrow
down on two subsets of these. The first path assigns core nouns (typical
native nouns) denoting males to class N„l; the second assigns core nouns
denoting females to class N_II. The third path picks out those nouns whose
inflectional root ends with a vowel and assigns them to class N_v, the class
for indeclinables. In DATR, every symbol which begins with a dollar sign
($) is a variable. The variable $sex is defined to range over all possible values
for the <sem sex> path, namely male, female and undifferentiated.
One group of nouns is not covered by the equations at this node, namely
those whose stem ends in a consonant and which have non-sex-differenti-
ated referents. Such nouns must typically specify declensional class in their
lexical entries. However, as we shall see shortly (in the discussion leading
to [11']), a substantial proportion of them can be assigned to a declensional
class by the setting of a default.
Let us first look at the working of [Uj. Take a noun such as student
'student', the lexical entry for which is as follows:
[12] Student: <> — NOUN
<gloss> == student
<infl„root all> == student
<sem sex> == male.
134
Norman M. Fraser and Grevitle G. Corbett
To which declensional class should this word be assigned? Interesting though
this question is, it is likely to be of less importance than specific questions
such as 'what is the dative plural form of this word?', or 'what is its nomi-
native singular?' Let us take the second question. In terms of our theory, it
maps onto the question 'what is the value of the <mor nom sg> path?'
Since the Student node inherits directly from the NOUN node, the two
equations shown in [9] are available to it. The first of these can be paraphrased
as saying 'for paths beginning <mor . . .> evaluate the path <declen-
sional_class> \ The definition of that path (also given in [9]) requires the
evaluation at the DECLENSION node of a path consisting of the value of
<infl_root t inal> for student, followed by the value of <sem sex>
for the same word. The value of <inf l_root f inal> is consonant
according to the unoverridden equation [10), and the value of <sem sex>
for student is male according to the last equation in the lexical entry (shown
in [12]). Thus the path <consonant male> must be evaluated at the
DECLENSION node. According to the first equation in [11] this returns a value
of N_I :<> for the path <mor>, where our search originated. However, this
is not the end of the look-up process. We were interested in the value of the
path <mor nom sg>, not just <mor>.
It is necessary at this point to introduce another kind of DATR equation
which was not mentioned earlier. An expression of the form NODE:<> may
appear on the right hand side of an equation, for example, NODE1: Pathl
== NODE2 : <>. In this example, an attempt to evaluate NODE1: SomePath
will succeed and return a value if SomePath can be formed by concate-
nating Pathl with a path defined at NODE2. In effect, the NODE: <> notation
allows the description of a path to be distributed over more than one node.
The entry in the DATR theory for N_I is given in [13].
[13] N_I: <> == N_0
<formal_gender> == masc
<nom sg> == "<stem sg>"
<hard gen pl> == "<stem pl>"_ov.
Evaluating the path <mor nom sg> at the Student node will lead via
NOUN to be following inference:17
(10) Student: <mor> == N_I : <>
This allows the <mor> path from NOUN to be concatenated with the <nom
sg> path from N_l to produce the following inference:
(11) Student: <mor nom sg> =*= "<stem sg>"
In other words, the nominative singular of student has a zero ending;
(12) Student: <mor nom sg> = student
It is worth contrasting declensional class assignment with gender assignment.
A clear difference is the outcome when semantic and formal rules make dif-
A Unified Account for Russian
135
ferenl predictions. For gender assignment, quite generally, it is the semantic
rule which takes precedence. As wc noted earlier, with nouns like muicina
'man*, which ought to be masculine according to its semantics (male), but
feminine according to its morphology (declensional class li), it is the semantic
rule which 'wins' and the noun is masculine. When we find a similar clash
in declensional class assignment, in Russian at least,18 the formal factor dom-
inates. Thus attase 'attache' denotes a male and 'should' be in declensional
class N_l, but it ends in a vowel and so 'should* be in declensional class
N_V. It is actually in declensional class N_V. Conversely it denotes a male and
so 'should' be masculine but it is in class N_V and so 'should' be neuter; as
already noted, semantic factors take precedence in gender assignment and so
the noun is masculine.
A second difference between gender assignment and declensional class
assignment, is that the gender assignment rules make a prediction for every
noun (which has to be overridden in an extremely small number of instances),
while the declensional class assignment rules leave large numbers of nouns
with no prediction. A non-sex-differentiable noun with a stem ending in a
soft consonant could decline according to class N_l, N_ll, N_l 11 or N_IV.19
However, assignment to the four classes is by no means equally likely. Consider
data on the numbers of nouns in each class, to the nearest fifty (derived from
Lazova 1974, especially pp. 942-943), given in Table 1.
Table ). Number of nouns in (he different declensional classes.
Class N_I NJi N_U1 N_IV N„V
Example lakon komnata kosf vino laksi
'taw' 'room' 'bone' 'wine' 'laxi'
No of nouns 20,850 16.050 5.150 11.050 450*
These raw statistics give a rough picture of what is going on. The figures do
not take account of the effect of derivational morphology; for instance, over
4,300 of the nouns in class N_lll have the suffix -ost' which forms abstract
nouns from adjectives (star-yj 'old', star-ost' 'old age'). If the suffix is labelled
as belonging to class n_iii, then the number of distinct members of the class
is substantially reduced. Similarly nominalizations in -aniel-enie (like
ravuSenie 'destruction', derived from razrusit' 'destroy') inflate the figure
for n_iv (see Schupbach 1984 for discussion).
Table i shows that N_l and N_ll have substantially more members than
the other three. We can therefore set a default which will put nouns in the
majority declensional class n_i. Clearly this will be overridden in many
instances, but equally it will simplify a very substantial number of lexical
entries.
This is a considerable simplification, and one which can be justified in three
interrelated ways. First, there is the numerical preponderance of n„l. Second,
136
Norman M. Fraser and Greville G. Corbett
it is the declensional class which takes the majority of borrowings. There is
no need to appeal to any markedness considerations to explain this, it is simply
that N__I has no ending in the citation form, the nominative singular, so that
foreign words ending in a consonant are typically borrowed into this class. The
third argument is more debatable. If N_l is the default declensional class,
and masculine is the gender associated by default with this class, then we make
masculine the default gender for nouns, without specifying it directly. Sources
vary as to the numerical preponderance of the genders. All agree that the neuter
is easily the weakest; Lazova (1974: 942-943) has only marginally more
masculines than feminines, while Mucnik (1971: 196-197) calculates that
the masculine has 46% to the feminine 41% (on a sample of 33,952 nouns)
and Zaliznjak's dictionary has 46% masculine and 38% feminine (sample
47,030: see Ilola and Mustajoki 1989: 9); the masculine is growing fastest,
from the assignment of borrowings (see the sources reported in Corbett 1991-
78).
The only change required in our DATR theory to include all of these
interacting defaults is given below, as a revision to the fragment shown above
as [11].
(11' ) DEF„DECL: <$phonological_type $sex>
=- N_I:<>
DECLENSION: <> == DEF_DECL
<consonant female> == N_II:<>
<vowel $sex> == N_V:<>.
Here, the variable $phonological_type ranges over consonant and
vowel, and $sex ranges over male, female and undifferentiated,
as before. In all cases other than the two explicitly described in the paths
at the DECLENSION node, the maximally general path specified at the
DBF_DECL node is inherited and, thereby, class 1 becomes the default declen-
sional class for nouns.
We argued above that if class I is the default declensional class, and
masculine is the gender associated by default with this class, then we make
masculine the default gender for nouns. Note that we are claiming only that
the masculine is the default gender for nouns. In a fuller analysis of Russian,
we would claim that there is a default for gender at a higher level than the
nodes relating directly to nouns. This higher default is necessary for items other
than nominals which may head syntactic constituents with which gender
agreement is required. The situation arises if, say, an infinitive phrase stands
in subject position (and there is a past tense verb). Under these circumstances
'neutral' agreement forms occur (these are usually identical to neuter forms.
Corbett 1991: 204-205, 216). Thus in an analysis of the full lexicon, at a
high level, perhaps at the level of WORD, the default gender is the neuter; at
the lower level of nouns it is the masculine. Of course, the lower default is
much the more important: gender is a central category for Russian nouns, while
A Unified Account for Russian
137
for the various items (such as infinitives) covered by the higher default, it is
of much less importance. This hierarchy of defaults would thus allow us to
capture the intuition that in one sense the masculine is the default gender
and in another it is the neuter.2'
In this section we have shown how the declensional class can be predicted
for a substantial proportion of Russian nouns. We have seen how declen-
sional class assignment differs from gender assignment in interesting ways;
further research is needed to show whether these differences are specific to
Russian or, as we suspect, are found more generally."
6. CONCLUSION
In previous work we gave an account of gender assignment in Russian. Other
analyses have given at least a partial account of declensional class assignment.
To the best of our knowledge no-one has been able to combine them in a single
analysis, as our present Network Morphology analysis does. It is difficult to
be certain about this since many accounts are vague. DATR requires us to make
our claims explicit. Having laid out our claims, we can demonstrate that they
do indeed make the correct predictions, as shown in Appendix B.
The success of our gender assignment analysis of the Russian data lends
further support to the typology of gender assignment of Corbett (1991: 7-69).
Since the account of Russian gender assignment (one of the more complex
systems) has been shown to be robust, it follows that the entire typology is
more securely founded. The fact that we have been able to combine gender,
animacy and declensional class assignment into a single account, with a
resulting considerable simplification of lexical entries, suggests that this
approach to morphology based, in large part, on default inheritance and
implemented in DATR is an extremely fruitful one.
APPENDIX A: DATR THEORY
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% TITLE: Russian nominal inflection
% AUTHORS: Grev Corbett s Norman Fraser, University
% of Surrey
% NOTE: The labels for lexical entries are
% simple transliterations; linguistic forms
% are in phonological transcription.
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
8 Vars $number: sg pi.
* vars $sex: male female undifferentiated.
138
Norman M. Fraser and Greville G. Corbeii
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% WORD CLASS HIERARCHY
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
NOMINAL: <stem> == Vinf l„root all>"
<phon stem hardness> == hard.
NOUN: <> == NOMINAL
<infl_root final> == consonant [10]
<mor> == Vdeclensional„class>" [ 9 J
Vdeclensional_class>" ==
DECLENSION: < Vinfl_root
final>" *<sem sex>" >
<syn cat> == n
<sem sex> == undifferentiated (8]
<syn gender> == GENDER: < Vsem sex>" > [1]
<syn animacy> == Vsem animacy>" [5]
<sem animacy> ==
ANIMACY:< Vsem sex>" >. (6)
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% SOME CASE SWITCHES
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
GENDER: <male> == masc[2]
<female> ■== fern
<undifferentiated> ==
Vmor formal_gender>".
ANIMACY: <> — animate (7 J
<undif f erentiated> ==: inanimate .
ACCUSATIVE: <> == "<mor nom>"
<pl animate> == *<mor gen pl>"
<sg animate masc> == Vmor gen sg>"
DECLENSION: <consonant male> == N_I:<> (11]
<consonant female> == N_II:<>
<vowel $sex> == N„V:<>
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% DECLENSIONAL CLASS HIERARCHY
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
N_NOM: <stem hardness> ==
Vphon stem hardness>"
<acc $number> == ACCUSATIVE: < $number
"<syn animacy>" Vsyn gender>" >
A Unified Account for Russian
139
N_NOUN:
<dat pl> «
Vstem pl>" Vraor theme„_vowel>" _m
<inst pl> ==
Vstem pl>" Vmor theme„vowel>" __m' i
<loc pl> ==
"<stem pl>" Vmor theme_vowel>" _x.
<> -= N_NOM
<loc sg> == Vstem sg>" _e
<nom pl> — Vstem pl>" „i
<gen pl> *==
V mor Vmor stem hardness>" gen pl>"
<soft gen pl> == *<stem pl>" _ej
<theme_vowel> == _a.
N_0: <> =- N_N0UN
<gen sg> « Vstem sg>"_a
<dat sg> — Vstem sg>"„u
<inst sg> =- Vstem sg>"_om.
<> — N_0 E13l
<formal_gender> == masc
<nom sg> == Vstem sg>"
<hard gen pl> ™ Vstem pl>"_ov.
N_II: <> " N_N0UN
<formal„gender> == fern E 3]
<nom sg> — Vstem sg>"_a
<acc sg> == Vstem sg>"_u
<gen sg> »= Vstem sg>"_i
<dat sg> « Vstem sg>"_e
<inst sg> — Vstem sg>"_oj
<hard gen pl> — Vstem pl>"
N_III: <> — N_NOUN
<formal_gender> == fern
<stem hardness> == soft
<nom sg> == N_I
<gen sg> == N_II
<dat sg> == Vmor gen sg>"
<inst sg> — Vstem sg>"_ju
<loc sg> == Vmor dat sg>" .
N_IV: <> = = N_0
<formal_gender> == neut
<nom sg> « Vstem sg>"_o
<nom pl> == Vstem pl>"_a
<hard gen pl> — N_H ■
140
Norman M. Fraser and Greville G. Corbet!
(12]
N„V: <> « "<inf l_root all>"
<formal_gender> == < "<sem animacy>" >
<animate> == masc
% IF animate THEN masc
<inanimate> == neut.
% IF inanimate THEN neut
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% LEXICAL ENTRIES
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
NZakon: <> == NOUN
<declensional„class> N_I:<>
<gloss> == law
<infl_root all> == zakon.
No2: <> == NOUN
<declensional_class> -= N_I:<>
<gloss> == knife
<inf l_root all> == noz
<mor stem hardness> == soft.
Student: <> == NOUN
<gloss> == student
<infl_root all> — student
<stem sex> == male.
Bolgarin: <> == NOUN
<gloss> == bulgarian_man
<infl_root all> == bolgar
<stem sg> =» "<infl_root all>" _'in)
<mor gen pl> « N_I I: <hard gen pl>
<sem sex> == male.
Komnata: <> == NOUN
<declensional__class> == N_II:<>
<gloss> == room
<infl_root all> == komnat.
Ucitel'nica: <> == NOUN
<gloss> == female_teacher
<infl_root all> == uc'itel'n'ic
<sem sex> == female.
Muzcina: <> == NOUN
<declensional_class> == N_II:<>
<gloss> == man
<infl_root all> == mu§c'in
<sem sex> == male.
A Unified Account for Russian
141
Kost
<> == NOUN
<declensional_class> == N„III:<>
<gloss> == bone
<infl_root all> == kost'.
<> == NOUN
<declensional„class> == N_III:<>
<gloss> == mouse
<infl_root all> === mis'
<sem animacy> == animate.
put' : <> == NOUN
<declensional„class> == N_III:<>
<gloss> == way
<mor inst sg> == N_I:<inst sg>
<mor formal„gender> ==
N_I:<formal_gender>
<infl„root all> == put' .
Vino: <> ==: NOUN
<declensional_class> == N„IV:<>
<gloss> == wine
<infl„root all> == v'in.
More: <> =- NOUN
<declensional_class> == N_IV:<>
<gloss> == sea
<infl„root all> ='= mor'
<phon stem hardness> == soft.
Taksi: <> == NOUN
<gloss> == taxi
<infl_root all> == taks'i
<infl_root final> = = vowel.
Gnu: <> == NOUN
<gloss> == gnu
<infl„root all> == gnu
<infl„root final> == vowel
<sem animacy> == animate.
APPENDIX B: SOME THEOREMS OF THE DATR THEORY
Zakon: <gloss> = law.
Zakon: <mor nora sg> = zakon.
Zakon: <mor acc sg> = zakon.
Zakon: <mor gen sg> = zakon „a.
Zakon: <mor dat sg> = zakon _u.
142
Norman M. Fraser and Greville G. Corbett
Zakon <mor inst sg>. = = zakon _om
Zakon <mor loc sg> = zakon _e.
Zakon <mor nom pl> = zakon _i ■
Zakon <mor acc pl> = zakon „i ■
Zakon <mor gen pl> = zakon _ov.
Zakon <mor dat p±> = zakon _a _
Zakon <mor inst pl> = zakon _a
Zakon <mor loc pl> = zakon _a _
Zakon <syn gender> = masc.
Zakon <syn animacy> = inanimate.
m.
m' i.
x.
Noz:
Noz:
Noz:
Noz:
Noz:
Noz:
Noz:
Noz:
Noz:
Noz:
Noz:
Noz:
Noz:
Noz:
Noz:
<gloss> = knife.
<mor nom sg> = noz\
acc sg> = noz.
gen sg> = noz _a.
dat sg> = noz __u.
inst sg> = noz _om.
loc sg> = noz _e.
nom pl> = noz _i.
acc pl> = noz _i.
gen pl> = noz _ej.
dat pl> = noz _a _m.
inst pl> = nog _a _m'
x.
<mor
<mor
<mor
<mor
<mor
<mor
<mor
<mor
<mor
<mor
<mor
<syn
<syn
1.
loc pl> = noz _
gender> = masc.
animacy> = inanimate.
Student <gloss> = student.
Student <mor nom sg> = student.
Student <mor acc sg> = student _a.
Student <mor gen sg> = student „a.
Student <mor dat sg> = student _u.
Student <mor inst sg> = student „om
Student <mor loc sg> = student _e.
Student <mor nom pl> = student _i.
Student <mor acc pl> = student _ov.
Student <mor gen pl> = student _ov.
Student <mor dat pl> = student _a _
Student <mor inst pl> = student _a
Student <mor loc pl> = student _a _
student <syn gender> = masc.
Student <syn animacy> = animate
x.
Bolgarin
Bolgarin
Bolgarin
Bolgarin
<gloss> = bulgarian_man.
<mor nom sg> = bolgar _'in
<mor acc sg> = bolgar „'in
<mor gen sg> = bolgar _'in
A Unified Account for Russian
143
Bolgarin:
Bolgarin:
Bolgarin:
Bolgarin:
Bolgarin:
Bolgarin:
Bolgarin:
Bolgarin:
Bolgarin:
Bolgarin:
Bolgarin:
Komnata:
Komnata:
Komnata:
Komnata:
Komnata:
Komnata:
Komnata:
Komnata:
Komnata:
Komnata:
Komnata:
Komnata:
Komnata:
Komnata:
Komnata:
<mor dat sg> = bolgar _' in „u.
<mor inst sg> = bolgar in _om.
<mor loc sg> = bolgar in _e.
<mor nom pl> = bolgar _i.
<mor acc pl> = bolgar.
<mor gen pl> = bolgar.
<mor dat pl> = bolgar _a _m.
<mor inst pl> = bolgar _a „m' i.
<mor loc pl> = bolgar _a __x.
<syn gender> =
<syn animacy> =
<gloss> = room
<mor nom sg> =
acc sg> =
gen sg> =
dat sg> =
inst sg>
loc sg> =
nom pl> =
acc pl> =
gen pl> =
dat pl> =
inst pl>
<mor
<mor
<mor
<mor
<mor
<mor
<mor
<mor
<mor
<mor
<mor
<syn
<syn
bolgar
masc.
= animate
komnat
komnat
komnat
komnat
komnat
komnat
komnat
komnat
komnat.
komnat
komnat
komnat
.a •
_u.
_i .
_e.
_oj .
_e •
_i.
i.
loc pl> =
gender> = fern.
animacy> = inanimate.
_m.
x.
U5itel' nica: <gloss> = female_teacher.
ufiitel' nica: <mor nom sg> = uc'itel'n' ic _a.
U5itel' nica: <mor acc sg> = u£'itel'n' ic _u.
Ufiitel' nica: <mor gen sg> = u£'itel'n' ic „i ■
Ufiitel' nica: <mor dat sg> = uc'itel'n' ic _e.
Ocitel' nica: <mor inst sg> = uc'itel'n 'ic „oj
Ucitel' nica: <mor loc sg> = u£'itel'n' ic __e.
Ubitel' nica: <mor nom pl> = u5'itel'n' ic _i -
U£itel' nica: <mor acc pl> - ufi'itel'n' ic.
Ucitel' nica: <mor gen pl> - u£'itel'n' ic.
Ucitel' nica: <mor dat pl> - ufi'itel'n' ic _a _m.
Ucitel' nica: <mor inst pl> = uc'itel'n ic „a m' i
Ufiitel' nica: <mor loc Pl> = uc'itel'n' ic _a _x.
U5itel' nica: <syn gender> = fern.
Ufiitel' nica: <syn animacy> = animate.
Muzcina
Muz5ina
MuzSina
<gloss> = man.
<mor nom sg> = mu§c 'in _a.
<mor acc sg> = mus5'in __u.
144
Norman M. Fraser and Greville G. Corbelt
Muzcina : <mor gen sg> = musc'in _i .
Muzcina : <mor dat sg> = mugc'in _e .
MuzSina : <fitor inst sg> = mugc'in _oj ■
Muzcina : <mor loc ag> = mugc'in _e .
Muzcina : <mor nom pl> = mugc'in _i .
Muzcina : <mor acc pl> = mugc'in.
Muzcina : <mor gen pl> = mugc'in.
Muzcina : <mor dat pl> = mugc'in a _m
Muzcina : <mor inst pl> = mus£' in
Muzcina : <mor loc pl> = musc'in _a _x ■
Muz£ina : <syn gender> = masc.
MUzC Lna : <syn animacy> = animate
Kost ' <gloss> = bone
Kost <mor nom sg> = kost'.
Kost <mor acc sg> = kost'.
Kost ' <mor gen sg> = kost' _i.
Kost ' <mor dat sg> = kost' _i.
Kost i <mor inst sg> = kost' _)u
Kost ' <tnor loc sg> = kost' _i.
Kost <mor nom pl> = kost' _i.
Kost i <mor acc pl> = kost' _i.
KOS t <mor gen pl> = kost' _ej.
V r» 4-KOSt <mor dat pl> = kost' _a jn.
SOS t i <mor inst pl> = kost' _a _m' i.
l\OS U i <mor loc pl> = kost' _a _x.
<syn gender> = fern.
Kost <syn animacy> = inanimate.
Mys <gloss> = mouse
Myg' <mor nom sg> = mis.
Myg' <mor acc sg> = mig.
Myg' <mor gen sg> = miS _i.
Myg' <mor dat sg> = mi§ _i.
Myg' <mor inst sg> = mis _ju.
Myg' <mor loc sg> = mi2 _\,
Myg' <mor nom pl> = mig _i.
Myg' <raor acc pl> = mig _ej.
Myg' <mor gen pl> = mis _ej.
Myg' <mor dat pl> = mig _a _m.
Myg' <mor inst pl> = mig _a _m' i .
Mys' <mor loc pl> = miS _a _x.
Myg' <syn gender> = fern.
Mys' <syn animacy> = animate.
Put' <gloss> = way
Put' <mor nom sg> = put' .
A Unified Account for Russian
145
put' <mor acc sg> = put' .
put' <mor gen sg> = put' _i.
put' <mor dat sg> = put' „i •
put' <mor inst sg> = put' „om.
put' <mor loc sg> = put' _i ■
put' <mor nom sg> = put' _i .
put' <mor acc pl> = put' _i.
put' <mor gen pl> = put' _ej .
put' : <mor dat pl> = put' _a _m
put' : <mor inst pl> = put' _a
put' : <mor loc pl> = put' _a _x
put' : <syn gender> = masc.
Put' : <syn animacy> = inanimate.
Vino : <gloss> = wine.
Vino : <mor nom sg> = v' in _o.
Vino ; <mor acc sg> = v' in _o.
Vino : <mor gen sg> = v' in _a.
Vino : <mor dat sg> = v' in _u.
Vino : <mor inst sg> = v' in „om.
Vino : <mor loc sg> = v' in _e.
Vino : <mor nom pl> = v' in _a.
Vino : <mor acc pl> = v' in _a.
Vino : <mor gen pl> = v'in.
Vino : <mor dat pl> = v' in _a _n
Vino : <mor inst pl> = v' in _a _
Vino : <mor loc pl> = v' in _a _y
Vino : <syn gender> = neut.
Vino : <syn animacy> = inanimate.
More : <gloss> = sea
More : <mor nom sg> = mor' _o.
More : <mor acc sg> = mor' _o.
More : <mor gen sg> = mor' _a.
More : <mor dat sg> = mor' _u.
More : <mor inst sg> = mor' _om.
More : <mor loc sg> = mor' _e.
More : <mor nom pl> = mor' _a .
More : <mor acc pl> = mor' _a.
More .: <mor gen pl> = mor' _ej .
More ;: <mor dat pl> = mor' _a „i
More i: <mor inst pl> = mor _a
Mors ;: <mor loc pl> = mor' _a _
More s: <syn gender> = neut.
Mort 5: <syn animacy> = inanimate
Taksi: <gloss> = taxi
146
Norman M. Fraser and Greville G. Corbet!
Taksi <mo c nom sg> = taks i .
Ta k s i <JJ)OJ " acc sg> = taks' i .
rr> \, i i aks i <mo2 " gen sg> = taks' i .
Ta ks i <moi " dat sg> = taks' i.
I etKS 1 <moi ' inst sg> = taks ' i .
Ta ks i <moi * loc sg> = taks' i .
Taks i: <moi nom pl> = taks' i.
Taksi: <mor acc pl> = taks' i .
Taksi: <mor gen pl> = taks' i .
Taks i: <moi dat pl> = taks' i.
Taksi: inst pi> = taks ' i.
Taksi: <mor loc pl> = taks' i.
Taksi : <syn gender> = neut.
Taksi: <syn animacy> = inanimat
C1 r\ u -o n U , <gloss> - gnu.
pnn ■ IjfJll . <mor nom sg> = gnu.
oil U . <mor acc sg> = gnu.
Gnu: <mor gen sg> = gnu.
Gnu : <mor dat sg> = jnu.
Gnu : <mor inst sg> = gnu.
<mor loc sg> = < jnu.
Gnu: <raor nom pl> = < jnu.
Gnu : <mor acc pl> = c fnu.
Gn U : <mor gen pl> = c fnu.
<mor dat pl> *. c fnu.
Gnu: <mor inst pl> = gnu.
Gnu: <mor loc pl> = g nu.
Gnu: <syn gender> => m asc.
Gnu: <syn anirnacy> = animate.
NOTES
1 Versions of ihis paper were read at the Spring Meeting of the Linguistics Association of
Great Britain, University of Birmingham. March 1993 and at the Finnish Circle of Slavists,
Helsinki, May 1994. We would like to thank those present for helpful comments, and particu-
larly Alan Timberlakc and Ursula Doleschal for discussion. We arc also very grateful to Gerald
Gazdar and Dafydd Gibbon for a number of useful suggestions, and to Petra Barg for comments
on an earlier draft. Any errors are ours. This research was supported by the Economic and
Social Research Council (grant R0O0233633) and the Leverhulme Trust (grant F.242M); the
support of both funding bodies is gratefully acknowledged.
1 Our approach is complementary to Gibbon's work on (LEX. The Utter is an approach to
the lexicon in computational linguistics, implemented in DATR. '. . . the ILEX concept may
be thought of as a set of linguistic constraints on the form of possible DATR representations'
(Gibbon 1992; 47). Network Morphology can be viewed as a set of linguistic constraints on
possible DATR representations of morphology.
A Unified Account for Russian
147
I The choice of DATR as the language of formal expression for Network Morphology is no
accident. The theoretical framework emerged out of our exploratory work with DATR. informed
by our earlier experience of default inheritance in Artificial intelligence and in Word Grammar
(Fraser & Hudson 1992). DATR's special attractions are a conceptually simple syntax and
semantics, and a range of existing computer interpreters for the language.
* The implementations we know of arc by A. Bollz (Konstan*), Roger Evans (Sussex), Norman
Fraser (Surrey), Dafydd Gibbon (Bielefeld), James Kilbury (DOsseldorf). Hagcn Langer
(Bielefeld) and A. Sikorski (Poznari),
5 Morphological forms to be accounted for are given in phonemic transcription (while other
forms, such as in examples (7)-(9) below are in the standard transliteration). The automatic
phonological correspondences we assume are as follows:
1. lil is retracted to its allophone [ij after non-back hard (unpa(atalized) consonants. Thus the
nominative plural form /zakontf will be realized with (i) but /kost'i/ retains li) since [t'J is
soft.
2. All consonants which can be palatalized arc automatically palatalized before Id. Thus the
locative singular of /zakon/, namely /zakonc/. will be realised wiih a palatalized (n'J. If the
consonant is already palatalized as in genitive plural /kost'-ej/, it simply remains palatal-
ized. Some consonants are always hard (/S, i, cl), and remain so before Id. On the other hand,
ti'f and lift arc always soft (palatalized), and naturally remain so before Id. We have
chosen to mark softening redundantly for greater clarity in this instance. In addition, the
gutturals Ik, g, x/ are palatalized before HI, so that the genitive form /knigi/. from /kniga/
'book', will be realized with palatalized [g'j (which then demands the front allophone [i|).
3. There are complex patterns of reduction of vowels in unstressed position, which can safely
be omitted from the transcription since our focus is on morphology. In particular, the
unstressed lol ending of nouns with soft stems such as /pol'o/ 'field' (orthographically pole)
is realized as either!»} or [ij. Although this lol is never realized as anything approximating
to a mid rounded back vowel, positing lol is justified by the stressed (6) which occurs in
Iv'ino] 'wine' and (p'it'j6J 'drink(ing)'.
For an informative sketch of Russian phonology, see Timberlake (1993: 828-832).
6 Some implementations of DATR require value sequences to be enclosed in round brackets;
other implementations allow bracket enclosed sequences but do not require them. Here we follow
the current practice of Gazdar and Evans and omit all round brackets.
' Only facts relevant to the immediate discussion are shown at nodes NJI and N_IIi in (3).
In our full analysis of Russian nominal morphology, each of these nodes records a much richer
set of facts than is shown here. The facts for N_III are quite different from those for N_H, so
an analysis of the form 'N_lli:<> — N_IP is out of the question. See Appendix A for further
details.
* A full account of the complexities of the genitive plural is given in Brown and Hippisley
(1994).
* This presentation is illustrative only, since it masks a formal distinction in DATR between
the '-' and '—' symbols. See Evans & Gazdar (1989a) for further discussion.
10 For a recent discussion of how gender can be defined for Russian see Hubenschmid (1993:
47-77).
" We do not examine the complexities of hybrid nouns here, for which see Corbett (1991:
39, 231-232), and for interesting data see Weiss (1991) and Doleschal (1992, 1993: 40-46,
138-144).
II There is a smalt number of exceptions, for which see Smimova (1979). We are not concerned
here with the special case of acronyms where derivational information may be relevant.
" DATR fragments from our theory of Russian nominal morphology are identified by
square-bracketted numbers. These numbers are also used to pick out the fragments in the full
theory given in Appendix A.
A more extended account to include cases where reference may be to either sex would require
us to state the first equation as: GEHDER: <> == masc. Instances of this kind would be
148
Norman M. Fraser and Greville G. Corbett
conectly assigned masculine gender so long as the indeterminate sex of the referent were
identified by an explicit marker such as either.
" Thus Thelin (1975: 30) claims that 'the inflectional class membership of Russian nouns
can to a great extent be predicted from information on gender and/or the nature of the final
consonam of the stem'; see also Crocket! (1976: 12). And Stankiewicz (1978: 666-667)
postulates just two declensions for all the Slavonic languages, and then predicts the differences
within the paradigms from gender and from the morphological structure of the stem. Corbett
(1982) demonstrates that full prediction of declensional class from gender is not possible, but
that prediction of gender from meaning and declensional class is possible, given four declen-
sional types. Our earlier paper (Corbett & Fraser 1993) allowed us to reconcile the competing
claims of the three- or four-declensional models (our objections to Stankiewicz's two-declen-
sional model are given in Corbett 1982: 207). Here we arc adding in the predictability of
declensional class which, though weaker than that of gender (Corbett 1982: 224), stilt permits
a reduction in the lexical entry of large numbers of nouns. Halle (1992: 37-38, 45) postulates
the prediction of declensional class from gender for Latvian, Latin and Russian, but does not
argue for this position rather than for the converse. Aronoff (1994: 73-74) allows predictions
in both directions.
Some claim to predict declensional class from gender but use the nominative singular as
the lexical entry rather than the stem; for many nouns the nominative singular effectively provides
the stem and the declensional class, hence there is actually very little prediction going on.
16 There is no reason why such a phonological component could not also be formalized using
DATR, as the work of Reinhard & Gibbon (1991) and Gibbon (1992) demonstrates.
" The inferences shown in (IOH'2) may be drawn from the DATR theory but do not belong
to it. Hence they cannot be indexed with any part of the theory by means of square bracketed
identifiers.
11 While the claim relating to gender assignment is a general one, we have not investigated
declensional class assignment beyond Russian. It is a plausible hypothesis that in declensional
class assignment generally, formal factors will take precedence over semantic.
'* If the stem ends in a hard consonant, then type N_lll can be eliminated. We have no rule
to predict membership of class N_I1I; stems are by default 'hard' (they end in a hard conso-
nant). The N_lll class is quite small (see discussion after Table 1).
M This figure includes the small number of nouns (32 in Lazova's table) which are indeclin-
able but which do not end in a vowel. They are typically borrowings. Borrowings tike miss 'miss',
which denote females but end in a consonant are indeclinable. These require art exceptional marker
just to indicate that they are indeclinable: the semantic gender assignment rule will then cor-
rectly assign them to the feminine gender. Rare borrowings which are indeclinable, consonant
final, do not denote an animate, and are masculine or feminine, require two irregular markers
(for declensional class V, and for gender). These marginal cases tend to be integrated over
time or lost.
" It is important to note that grammatical defaults of different types may or may not tine up
together (and when there are only two possibilities, as with two-gender systems, then the coin-
cidence cannot be assumed to be of any great significance). In Arapesh, Aronoff (1992) discovered
remarkable consistency in the way the grammatical defaults lined up. In other cases, as in Russian,
the picture is less straightforward. For an unusual type of gender default, see Corbett & Mtenje
(1987: 15) on Chichewa.
" The declensional class assignments are not given in Appendix B; this gives all the avail-
able morphological forms (which require the declensional class assignment to be correct) and
the gender and animacy. In other words, we provide all the information which might be required
by the syntax (for agreement purposes), and the morphological forms supplied according to
syntactic specifications. This seems to be a realistic set of outputs. The declensional class
assignments are correct, and could be shown; the point is simply that there is nothing outside
the morphological component which requires information about them.
A Unified Account for Russian
149
REFERENCES
Aronoff, Mark. 1992, "Noun Classes in Arapesh". In Geert Booij and Jaap van Marie (cds.),
Yearbook of Morphology {991. Dordrecht: Kluwer, 21-32.
Aronoff, Mark 1994. Morphology By Itself: Stems and Inflectional Classes (- Linguistic Inquiry
Monograph 22). Cambridge MA: MIT Press.
Brown, Dunstan and Andrew Hippisley. 1994. "Conflict in Russian Genitive Plural Assignment:
A Solution Represented in DATR". Journal of Slavic Linguistics 2, 30-48.
Corbett, Greville G. 1982. "Gender in Russian: an Account of Gender Specification and its
Relationship to Declension". Russian Linguistics 6, 197-232.
Corbett, Greville G. 1991. Gender. Cambridge: Cambridge University Press.
Corbett. Greville G. and Norman M. Fraser. 1993. "Network Morphology: A DATR Account
of Russian Inflectional Morphology". Journal of Linguistics 29, 113-142.
Corbett, Greville G. and Alfred D. Mtenje. 1987. "Gender Agreement in Chichewa". Studies
in African Linguistics 18, 1-38.
Crockett, Dina B. 1976. Agreement in Contemporary Standard Russian. Cambridge, MA: Slavka.
Daelemans. Walter, Koenraad De Smedt and Gerald Gazdar. 1992. "Inheritance in Natural
Language Processing". Computational Linguistics 18, 205-218.
Doleschal. Ursula. 1992. "Genus und (Ko-)Referenz". In Tilmann Rcuthcr (cd.). Slavtstische
Linguistik 1991: Referate des XVII. Konstanier Siavistischen Arbeitstreffens, Klagenfurt-St.
GeorglLangsee I0.-I4.9.I99I (- Slavistische Bcitrage 292). Munich: OnoSagner, 123-135.
Doleschal, Ursula. 1993. Genus als grammatische und textlinguistiscfie Kategorie: Eine kognitiv-
funktionalisiische Untersuchung des Russischen. Unpublished doctoral dissertation, University
of Vienna.
Evans, Roger and Gerald Gazdar. 1989a. "Inference in DATR". Proceedings of the 4th Conference
of the European Chapter of the Association for Computational Linguistics. Manchester,
England. 66-71.
Evans, Roger and Gerald Gazdar, 1989b. "The Semantics of DATR". In A. G. Cohn (ed.),
Proceedings of the Seventh Conference of the Society for the Study of Artificial Intelligence
and Simulation of Behaviour. London: Pitman/Morgan Kaufmann, 79-87.
Fortune, Reo F. 1942. Arapesh. (Publications of the American Ethnological Society 19). New
York: J. J. Augustin. (Reprinted 1977. New York: AMS Press).
Fraser, Norman M. and Greville G. Corbett. 1993. "It's not my Default: a Network Morphology
Analysis of Noun Classes in Arapesh". Paper read at the Autumn Meeting of the Linguistics
Association of Great Britain, Bangor, 13-15 September.
Fraser, Norman M. and Richard A. Hudson. 1992. "Inheritance in Word Grammar".
Computational Linguistics 18, 133-158.
Gazdar, Gerald. 1987. "Linguistic Applications of Default inheritance Mechanisms". In
P. Whitelock, M. M. Wood, H. L. Somers, R. Johnson and P. Bennett (cds.), Linguistic Theory
and Computer Applications. London: Academic Press, 37-68.
Gazdar, Gerald. 1990. "An Introduction to DATR". In Roger Evans and Gerald Gazdar (eds.),
77ie DATR Papers (- Cognitive Science Research Paper CSRP 139). School of Cognitive
and Computing Sciences, University of Sussex, 1-14.
Gazdar, Gerald. Forthcoming. "Ceteris paribus". To appear in J. A. W, Kamp and C. Rohrcr
(eds.), Aspects of Computational Linguistics. Berlin: Springer.
Gibbon, Dafydd 1992. "ILEX: A Linguistic Approach to Computational Lcxica". In Ursula Klenk
(ed.), Computatio Linguae: Aufsatze zur algorithmischen und quantitativen Analyse der
Sprache {- Zeitschrift fiir Dialektologie und Linguistik, Beihefi 73). Stuttgart: Franz Stciner
Vcrlag, 32-53.
Halle, Morris. 1992. "The Latvian Declension". In Geert Booij and Jaap van Marie (eds.).
Yearbook of Morphology 1991. Dordrecht: Kluwer, 33-47.
Hubenschmid, Markus 1993. Genus und Kasus der russischen Substantive: Zur Definition und
Identiftkation grammatischer Kategorien (= Vorrage und Abhandlungen zur Stavistik 21).
Munich: Otto Sagner.
150
Norman M. Fraser and Greville G. Corbetl
Ilola, Eeva and Ano Mustajoki. 1989. Repori on Russian Morphology as ii appears in Zaliyiyak's
Grammatical Dictionary (- Slavica Hclsingiensia 7). Helsinki: Department of Slavonic
Languages, University or Helsinki.
Lazova, M. V. (cii.). 1974, Obrainyj slovar" russkogo jazyka: okolo 125 000 slow Moscow:
Sovetskaja Enciklopedija.
Muc'nifc, I. P. 1971. Grammatic'eskie kategorii glagola i imeiti v sovremennom russkom liter-
aturnon jazyke. Moscow: Nauka.
Reinhard, Sabine and Dafydd Gibbon. 1991. "Prosodic Inheriiancc and Morphological
Generalisaiions". Proceedings, Fifth Conference of the European Chapter of the Association
for Computational Linguistics, Berlin 1991, 131-136.
Schupback, R. D. 198-1. Lexical Specialization in Russian (= UCLA Slavic Studies 8). Columbus,
Ohio: Slavica.
Shieber, Swart M. 1987. "Separating Linguistic Analyses from Linguistic Theories". In Peter
Whitclock, Mary McGee Wood, Harold L. Somers, Rod L. Johnson and Paul Bennett (cds.),
Linguistic Theory and Computer Applications. London: Academic Press, 1-36.
Smirnova, G. A. 1979. "Kategorija roda ncsklonjaemyx suScestvitel'nyx". In V. P. Grtgor'ev
(ed.), Lingvistika i poetika. Moscow: Nauka, 86-105.
Stankiewicz, Edward. 1978. "The Inflection of Serbo-Croatian Substantives and Iheir Genitive
Plural Endings". In Henrik Birnbaum (ed.), American Contributions to the 8lh International
Congress of Slavists. I: Linguistics and Poetics. Columbus, Ohio: Slavica, 666-681.
Thelin, Nils B. 1975. Notes on General and Russian Morphology (- Studia Slavica Upsaiiensia
15). Stockholm: Alniqvist & Wiksell.
Timbcrlake, Alan. 1993. "Russian". In Bernard Connie and Greville G. Corbett (eds.), The
Slavonic Languages. London: Routledge, 827-886.
Weiss, Daniel. 1991. "Scxus Distinctions in Polish and Russian". In Maciej Grochowski and
Daniel Weiss (eds.), "Words are physicians for an ailing mind'' (- Sagners Slavistische
Sammtung 17). Munich: Otto Sagner, 449-466.
Worth, Dean S. 1966. "On the Stem/ending Boundary in Slavic Indeclinables". Zbornlk za
filologiju i Ungvistiku 9, 11-16.
Zaliznjak, A. A. 1977. Grammatideskij slovar1 russkogo jazyka: slovoizmenenie. Moscow: Russkij
jazyk.
(Corbett)
Dept. of Linguistic and International Studies
University of Surrey
Guildford, Surrey GV2 5XH
England
e-mail: g.corbelt@surrey.ac.uk.
(Fraser)
Vocalis Ltd.
Chaston House
Mill Court
Great Shelford
Cambridge CB2 5LD
England
e-mail: norman@vocalis.demon.co.uk
Evaluative Affixes in Italian
DONNA JO NAPOU and BILL REYNOLDS
I. INTRODUCTION
Italian nouns and adjectives make use of almost two dozen evaluative affixes
(to borrow Scalise's 1984 term), most of which can also be found on verbs.
While a few of these affixes are strongly productive on N and A and weakly
productive on V, today most of those that occur on V are lexicalized (though
not all; see Cortelazzo & Cardinale 1989). Nevertheless, their evaluative
sense is obvious, and their occurrence suggests a period in the history of Italian
(from around 1300 to 1600, judging by Cortelazzo & Zolli 1979) when all
these affixes were productive. Since prepositions in Italian form an inerl
class with respect to morphology (with the exception of portmanteau prepo-
sition-articles, as in Napoli & Nevis 1987) and always have in the history of
Italian, it is natural that these affixes did not occur on P. Thus, evaluative
suffixes did occur on ail morphologically relevant categories. This means
that most evaluative affixes went through a period when they did not select
for category.
Aronoff claims that word formation rules (WFRs), such as affixation, operate
on a base that is "always specified syntactically" (1976: 47). The general
syntactic and semantic conditions that a WFR might be sensitive to are
"category, subcategory, selection, and lexically governed entailment and pre-
supposition" (1976: 48). The specification is, furthermore, always unique.
Aronoff dubs this claim the Unitary Base Hypothesis and argues that if any
WRF can be found to apply to bolh nouns and verbs, for example, the UBH
would be refuted, given that N and V cannot be stated as a single syntactic
category. In support of the UBH, Aronoff looks at a potential counter-
-example, the English affix-able, which attaches to both N (sizable) and V
(readable), and argues that the identity here is only apparent - that we have,
in fact, two distinct affixes with different properties. Aronoff does not admit
. the possibility of an affix which would put no syntactico-semantic condi-
; tions on the base to which it attaches.
The evaluative suffixes studied here, which definitely attach to N as well
| as V, provide a clear counterexample to Aronoff s UBH, even as modified
^ in Scalise (1984), as we discuss below (see Section 8).
V Still, as Zwicky & Pullum (1987) have pointed out, "plain" morphology
;f contrasts with "expressive" morphology in a number of ways, including pre-
cisely the fact that expressive morphology is promiscuous with respeel to
the base it operates on. By plain morphology they mean morphological rules
lliat belong properly to the rules of grammar of natural languages. By expres-
.fiye morphology they mean extragrammatical phenomena in word formation,
151
peer! Boolj and Jaap van Marie (eds.). Yearbook of Morphology 1994, 151-178.
-© 1995 Kluwtr Academic Publishers. Printed in the Netherlands.