Gender, animacy, and declensional class assignment: a unified account for Russian more

Norman Fraser & Greville G. Corbett. 1995. Gender, animacy and declensional class assignment: a unified account for Russian. In: Geert Booij and Jaap van Marle (eds) Yearbook of Morphology 1994. Dordrecht: Kluwer. pp. 123-50. [Note: the Yearbook for a given year was published the next year.]

Gender, Animacy, and Declensional Class Assignment: A Unified Account for Russian NORMAN M. FRASER and GREVILLE G. CORBETT I. INTRODUCTION1 In an earlier paper we introduced Network Morphology, an approach to inflec- tional morphology which relies heavily on the notion of default inheritance (Corbett & Fraser 1993), and showed how a complex set of data pertaining lo the inflectional morphology of Russian nominals could be handled in this framework. We extend that analysis here to include a range of phenomena which, we believe, has not previously been described in a single formally explicit framework. The key concept underlying our analysis is that of default inheritance. We introduce this below, together with a lexical knowledge rep- resentation language called DATR, which we use to make our analysis fully explicit.2 In Section 3 we describe our approach to gender assignment in Russian; animacy assignment is dealt with in Section 4; and declensional class assignment in Section 5. The issues tackled here - the interrelation of meaning, gender, declen- sional class and phonology - have been considered by Aronoff (1992), in his discussion of Arapesh based on Fortune (1942), and by Corbett (1991: 7-69). Aronoff and Corbett both highlighted the complex interactions of factors in the assignment of gender, and Aronoff made extensive use of defaults. However, neither had a formal apparatus to show clearly whether their claims were valid. In this paper we go further in that we give a formal account, which allows us to demonstrate that our model does indeed make the correct predictions. Our demonstration is based on Russian, which provides a good range of overlapping factors involved in assignment. Russian has the advan- tage that, following work by Ilola & Mustajoki (1989), based on Zaliznjak (1977), we have a clear picture of the numbers of nouns of each type in the lexicon. We "believe that Aronoff's account of Arapesh is similarly robust, and that it yields the right results if subjected to a comparable formal analysis (Fraser & Corbett 1993). We offer some brief conclusions in Section 6. The DATR encoding of our theory and the output listing which demonstrates the validity of our claims are given as appendices. 2. DEFAULT INHERITANCE Default inheritance is usually introduced by means of non-linguistic examples, and we make no exception here. Consider the taxonomic hierarchy in Figure 1. The lines in the taxonomy indicate instantiation rather than sub-classifica- 123 Geerl Booij and Jaap van Mailt (ids.). Yearbook of Morphology 1994, 123-1 SO. © 1995 Kluwer Academic Publithers. Primed in the Netherlands. 124 Norman M. Frascr and Greville G. Corbett BIRD has feathers can fly EAGLE ROBIN PENGUIN cannot fly Edwina Rupert Percy Fig. 1. A simple instantiation hierarchy. tion. So an eagle is a bird, as is a robin and a penguin; Edwina is an eagle, Rupert is a robin, and Percy is a penguin. Given an instantiation hierarchy of this kind, default inheritance allows all attributes of a given node in the hierarchy (such as BIRD) to be inher- ited by a node which instantiates it (such as EAGLE), except in cases where the lower node already has a value for some'attribute and thus overrides the default (i.e. inheritable) value for that attribute. In our example, a BIRD has feathers and can fly. These facts are inherited by EAGLE and ROBIN and, indirectly, by Edwina and Rupert. The attribute of having feathers is also inherited by PENGUIN and, thereby, by Percy. However, specific informa- tion about the flying abilities of PENGUINs blocks inheritance of the more general information about BIRDs. Thus, despite the fact that Percy is a BIRD, he cannot fly. This approach to information organization allows generalizations to be expressed once at a high level, and then automatically to apply to everything which inherits from there. In this way regularities, subregularities and excep- tions can be encoded with considerable ease and parsimony. This approach has the added advantage of marking exceptions as such, as in the case of PENGUIN in Figure I. If Percy were a truly exceptional penguin who had obtained a pilot's license, this extreme degree of markedness would be visible in the inheritance structure by virtue of an exceptional fact (PERCY can fly) over- riding an exceptional fact (PENGUINs cannot fly), overriding a default (BIRDs can fly). The history of linguistics offers many examples of linguistic theories which conflate substance and form. It is now widely acknowledged that the ability to separate these offers at least two benefits: first, it focuses attention on the substantive content of theories rather than on their notational sugar; second, A Unified Account for Russian 125 it facilitates cross-theoretical comparisons (Shieber 1987). Network Morphol- ev is concerned with substantial matters of linguistic theory, which could, irf principle, be formalized in a number of different ways. For this purpose, We use an existing formalism called DATR.3 The DATR knowledge repre- sentation language was developed by Roger Evans and Gerald Gazdar as a logic-based formalism for describing inheritance networks (Evans & Gazdar 1989a, b). Since computer implementations of the DATR language are avail- able, it is possible to check that an inheritance network expressed in DATR captures the intended generalizations.'1 AH of the analyses in this paper have been checked in this fashion. The information in Figure 1 would be expressed in DATR as follows: (0 BIRD: <has feathers> == <can fly> == yes. EAGLE: <> = = BIRD. ROBIN: <> = = BIRD. PENGUIN: <> = = BIRD <can fly> == no. Edwina: <> = - EAGLE. Rupert: <> = = ROBIN. Percy: <> = = PENGUIN. yes The labels preceding colons are 'nodes'; the angle bracket expressions to the left of the symbol are 'paths' (also called 'attributes'); the words to the right of non-empty paths are 'values'. Thus, the value of the <can f ly> path at the PENGUIN node is 'no'. It should be clear how this kind of formalism might be useful in expressing linguistic generalizations. Figure 2 shows the inheritance network for Russian nominals which we presented in our earlier work (Crobett & Fraser 1993). N_l to N_IV are declensional classes. (Notice that we posited the existence of a node N_0 from which N_I and N„IV inherit, thereby finding a principled reason for sitting on the fence in the debate over whether Russian has three or four noun declensional classes). The following (incomplete) fragment is taken from our earlier analysis.* "<stem>" _e "<stem>" i. (2) NOUN: <mor loc sg> •== <mor nom pl> — N_III: <> « NOUN <mor loc sg> == "<mor dat sg>". Kost': <> == N_III <stem> == kost'. /The first sentence at NOUN should be read as saying that the locative singular ^consists of the stem followed by an -e ending.6 A path enclosed in double 126 Norman M. Fraser and Greville G. Corbel! NOMINAL N_I NJV N_II N_m zakon v'ino komnafa kost' Fig. 2. An inheritance siruciure for Russian nominats. quotes in a DATR sentence is used to retrieve the specified value for the thing inheriting from that sentence. If we wanted to find the nominative plural of Kost1, we would inherit the sentence <mor nom pl> == "<stenv>" _i. Before going any further we would have to find out what the <stem> of Kost' is. Since the answer is kost', the nominative plural is kost'L If we wanted to know the locative singular of Kost', we would never inherit the definition of locative singular at NOUN because it is overridden at N_lll, from which Kost' inherits. The definition of locative singular at N_III establishes an asymmetric identity between the locative singular form of an N_III noun and its dative singular. It is often desirable to inherit most information from one source, but to have access to certain kinds of information stored elsewhere. DATR supports this kind of multiple inheritance. Consider the following fragment.7 (3) N_II: <> -= NOUN <mor gen sg> == "<stem>" _i. N_III: <> « NOUN <mor gen sg> == N_II. This says that N_III may inherit its schema for forming the genitive singular from N_ii, even though N_iii (like N_II) inherits primarily from NOUN. This may be expressed more explicitly as follows; (4) N_III: <mor gen sg> « N_II: <mor gen sg>. A Unified Account for Russian 127 The last form of DATR construction we shall consider here is the so-called 'evaluable path'. Consider the following (slightly simplified) fragment: (5) NOUN: <mor gen pl> *s< *<mor stem hardness>" mor gen pl>" <sof t mor gen pl> == "<stem>" „ej. N_III: <> M NOUN. Kost': <> == N_III <stem> == kost' <mor stem hardness> == soft. Nouns in Russian can be divided into those having soft (palatalized) stems and those having hard (non-palatalized) stems. The difference ts not usually of morphological significance, except in the case of genitive plurals, where it is crucial. The first equation shown here can be interpreted as follows: to obtain the genitive plural form, find out the hardness of the stem and prepend the value to the path <mor gen pl>, then evaluate the resulting path.8 Thus, to find the genitive plural of Kost', inference would proceed as follows (expressed syliogistically in terms of premises and conclusions):9 a. NOUN : <mor gen pl> = "< "<mor stem hardness>" mor gen pl>" N_III: <> == NOUN Kost': <> == N_III Kost': <mor gen pl> = "< "<mor stem hardness>" mor gen pl>" b. Kost': <mor gen pl> = "<mor stem hardness>" mor gen pl>" Kost': <mor stem hardness> == soft Kost' : <mor gen pl> = = *<sof t mor gen pl>" c. NOUN: <sof t mor gen pl> == "<stem>" „e j N_111 :■ <> « NOUN Kost': <> == N_III Kost': <sof t mor gen pl> == "<stem>" _ej d. Kost': <sof t mor gen pl> == "<stem>" _ej Kost': <stem> == kost Kost': <sof t mor gen pl> == kost' „ej e. Kost': <tnor gen pl> = = "<soft mor gen pl>" Kost': <sof t mor gen pl> === kost' _ej Kost': <mor gen pl> s == kost' _ej 128 Norman M. Fraser and Greville G. Corbetl This concludes our brief inlroduction to default inheritance and to the DATR formalism. More detailed introductions to default inheritance can be found in Gazdar (1987) and Daelemans, de Smedt and Gazdar (1992), and to DATR in Gazdar (1990; forthcoming). In the following sections we show how Network Morphology uses default inheritance to produce elegant solutions for some well known problems in the analysts of Russian nouns. 3. GENDER ASSIGNMENT Russian has three genders, masculine, feminine and neuter.10 We could enter a value for gender in the lexical entry of each noun. However, this would miss generalizations at two levels: first, the gender of Russian nouns does not appear to be random, and second, more generally, we have argued that languages never have to specify gender for the majority of nouns. We have claimed that the gender of the overwhelming majority of nouns can always be predicted, either from semantic information which must, in any case, be stored in the lexical entry or from semantic information supplemented by formal information, which may be morphological or phonological (Corbett 1991: 68). Specifically in Russian, the gender assignment rules refer to semantic and to morphological information: Semantic assignment rules for gender 1. Sex-differentiable nouns denoting males (humans and higher animals) are masculine: for example, student '(male) student'; 2. Sex-differentiable nouns denoting females are feminine: for example, uditel'nica '(female) teacher' Nouns which are sex-differentiable are those where the sex of the referent matters to humans (those denoting humans and domesticated animals) and where the difference is striking (as in the case of lions). These rules leave very few exceptions, but there are many nouns which are not covered by them. These remaining nouns are distributed over the three genders and their distribution is accounted for by the morphological assignment rules. The major morphological assignment rules are as follows: Morphological assignment rules for gender: 1. nouns of declensional class I are masculine; 2. nouns of declensional class II and III are feminine; 3. nouns of declensional class IV are neuter. As we shall consider in more detail later, there is some overlap between the two sets of rules: nouns which denote males are often in declensional class I. We might try to do away with the semantic assignment rules. However, the two sets of rules can make conflicting assignments, and when they do it is the semantic rules which dominate. The crucial case is muzftna 'man* (and A Unified Account for Russian 129 similar nouns), which ought to be masculine according to its semantics (male), but feminine according to its morphology (declensional class II). In fact, it belongs in the masculine gender." There are also nouns which do not decline, and to which the above mor- phological assignment rules do not apply. However, their lack of declension is itself a matter of morphology, and this interacts with the semantic feature of animacy to allow gender assignment. We treat indeclinable nouns as having their own declensional class (V); nouns of this class may be subject to the normal semantic assignment rules. Failing this, they are masculine if animate and neuter if not.1' in our Network Morphology account, every noun inherits from the NOUN node. Since the following path equation is specified at NOUN, every noun inherits it unless it is overridden by a more specific equation declared in the lexical entry." [1] NOUN: <syn gender> == GENDER: < "<sem sex>" > In order to find a value for the path <syn gender>, the path <sem sex> is evaluated (i.e. the sex of the noun's referent is retrieved) and then a path consisting only of the sex is evaluated at the node GENDER (shown below): [2] GENDER: <male> == masc <female> == fern <undifferentiated> == "<mor formal„gender>". The interpretation of this fragment of DATR is straightforward in the case of sex-differentiated nouns. If the referent of the noun is male then the gender is masculine; if the referent is female then the gender is feminine.14 However, if the sex is undifferentiated, it is necessary to consider additional formal criteria, and this is done by evaluating the path <mor f ormal_gender>. Formal gender is defined for each of the five main declensional classes. Thus, for example, the following equation is located at the node for declensional class II nouns, N„ll: [3] N_II: <formal_gender> == fern This is used in the evaluation of the path <mor f ormal_gender> (the function of the initial mor will be spelt out below). Thus, a class II noun denoting a male (such as muUina 'man') will have masculine gender; a class II noun denoting a female (such as ufitel'nica 'female teacher') will have feminine gender; and a class II noun denoting a non-sex-differentiated referent (such as komnata 'room') will have feminine gender, the default gender for class II nouns. As wc noted above, there is an additional complication with declensional class V, where animacy must also be taken into consideration. Some equa- tions from the node N„V, from which all class V nouns inherit, are given below. 130 Norman M. Fraser and Greville G. Corbetl [4] N_V: <formal_gender> « < Vsem animacy>" > <animate> == itiasc <inanimate> == neut. In order lo assign a formal gender for a given class V noun it is necessary to evaluate the <sem animacy> path for that noun and use it to select masculine gender if the noun denotes an animate referent and neuter gender if the referent is inanimate. Our analysis so far accounts for the gender of the vast majority of Russian nouns. It is an advance on Corbett (1982) in that it is formally explicit, and since it is encoded in DATR we can see that the correct predictions are indeed made (see Appendix B). 4. ANIMACY ASSIGNMENT Animacy in Russian is of particular interest. It is reflected in the accusative case forms of certain nouns, and in agreement. There are no exclusively animate accusative forms in Russian; instead, animacy shows itself by different patterns of syncretism. With an animate noun of the first declensional class we find the accusative identical to the genitive. (7) pervogo (acc - gen) studenta (acc - gen) first student 'the first student' With an inanimate, by contrast, the accusative is as the nominative: (8) pcrvyj (acc - nom) zakon (acc - nom) first law 'the first law' In both these examples, animacy is reflected not only in the morphological form of the noun, but also in the agreement of the adjective. This agreement can also be found with those nouns which decline according to pattern N_ll, and so have a unique accusative form, but which are masculine animate, as is the case with muzfina 'man': (9) pervogo (acc = gen) muzemu (acc) first man 'the first man' Animacy is a sub-gender (Corbett 1991: 165-167). It is narrower in range than the three main genders, since it affects only the accusative case. It is a more recent addition to the system than the main genders, and assignment to the animate sub-gender is (still) largely based on semantics: animate nouns are those denoting humans, animals and insects. Returning to the morphology, the crucial point is that we get the same regularity but in different paradigms: A Unified Account for Russian 13! the accusative matches the genitive for animate nouns under certain circum- stances, and the nominative for inanimate nouns, though the phonological form of the nominative or genitive varies from paradigm to paradigm. In Corbett & Fraser (1993) we specified animacy in lexical entries as a semantic feature; then by default, syntactic animacy was the same as semantic animacy. However, there are further simplifications available. First we can say that all sex-differentiables will be animate, hence semantic animacy need not be specified in their lexical entries. Second, by default, nouns arc seman- tically inanimate. This second default makes sense in the light of the statistical evidence provided by Ilola & Mustajoki (1989: 15-17), based on Zaliznjak (1977): of the 45,779 nouns in Zaliznjak's dictionary, 75% (34,178) denote inanimates. Together, these two generalizations mean that in most lexical entries there is no need to specify animacy. The default identity between syntactic animacy and semantic animacy can be expressed trivially in DATR: [ 5 ] NOUN: <syn animacy> == "<sem animacy>" This simply says that the syntactic animacy is identical to the semantic animacy. Since this default generalization is stated at the level of the NOUN node, it may be overridden by equations in the lexical entries of words with conflicting semantic and syntactic animacy. Kon' 'knight (chess piece)' is such a word: it is semantically inanimate but syntactically animate. So much for syntactic animacy. For nouns which are sex-differentiable, semantic animacy is obtained as follows: [6] NOUN: <sem animacy>: = = ANIMACY: < Vsem sex>" >. [7] ANIMACY: <> — animate <undifferentiated> == inanimate. The path shown at [6] indicates that the semantic animacy may be found by evaluating the sex of the word's referent at the ANIMACY node. If the sex is undifferentiated, the value inanimate is found at the ANIMACY node, as shown in [7}. For all other values of <sem sex> (i.e. male or female) the value animate is found. Thus the ANIMACY node can be thought of as a look-up table specifying default sex/animacy associations. Nouns are inanimate by default because their sex is undifferentiated by default and the final equation in {7} establishes the connection between the two. The undifferentiated sex default is encoded by means of the following equation, which is located at the NOUN node: (8] NOUN: <sem sex> == undifferentiated Our analysis now accounts for the syntactic animacy as well as the gender of the vast majority of Russian nouns in a formally explicit way. While animacy is less complex than gender, it is perhaps surprising to see just how simple 132 Norman M. Fraser and Greville G. Corbetl it becomes: with appropriate use of defaults, animacy need hardly ever be specified in lexical entries. Again, we make the right predictions as Appendix B shows. 5. DECLENSIONAL CLASS ASSIGNMENT So far we have specified the declensional class of each noun in its lexical entry. This seems to be missing a generalization, since there are common, though by no means exceptionless, correspondences between meaning, gender and declensional class. Indeed, some earlier analyses attempt to predict declen- sional class from other information which was specified.15 For many nouns, it is the case that declensional class is predictable from semantic or formal information. The semantic correspondences are as follows: Semantic assignment rules for declensional class 1. Sex-differentiable nouns denoting males (humans and higher animals) are of declensional class N„I: for example, student '(male) student'; 2. Sex-differentiable nouns denoting females are of declensional class N_II: for example, uditel'nica '(female) teacher' There are substantial numbers of nouns whose declensional class must be spec- ified to override rule one. These are nouns like muz£ina 'man' (as in example (3», which denote males but which decline according to declensional class NJI. (Lazova 1974: 942-943 puts the figure at 273, but the number involved is actually larger because there are many hypocoristics of this type, like Saga 'Sasha' which do not appear in dictionaries like Lazova's). There are fewer instances of overrides to rule 2, but we find a small number of nouns like svekrov' 'mother-in-law' in declensional class N_UI. Forma! assignment rule for declensional class 1. Nouns whose stem ends in a vowel are of declensional class N„V. The effect of this rule is to make nouns whose stem ends in a vowel indec- linable (class N_V are the indeclinables). Our rule follows Worth (1966), though as he points out the idea goes back much further. A noun like taksi 'taxt' is entered as such in the lexicon and this guarantees its indeclinability. These generalizations can be formalized fairly easily. Consider the following DATR fragment, which is positioned at the NOUN node: [9] NOUN: <mor> == "<declensional_class>" <declensional„class> == DECLENSION: <"<infl_root final>" *<sem sex>"> The first equation indicates that in order to find a value (or values) for the <mor> path (or paths), it is necessary to evaluate the <declen- sional_class> path. The second equation is somewhat more complex: a value can be retrieved for the <declensional_class> path by evaluating A Unified Account for Russian 133 a path consisting of the value of the path <inf l_root f inal> followed by the value of the path <sem sex>, Paths beginning <inf l_root> make available information about the inflectional root of a word. The <inf l„root f inal> path is used to store information concerning the final segment of the inflectional root, specifically whether it is a consonant or a vowel. We may assume that this information would, in reality, be supplied by a phono- logical component, though for convenience we simulate this crudely by means of the following equation recorded at the NOUN node.16 (10] NOUN: <infl_root final> == consonant This generalization holds for all Russian nouns, except certain indeclinable nouns of relatively recent foreign origin. In the case of these exceptions the default generalization is overridden in the lexical entries. We have already noted how values for the <sem sex> path are supplied either in the lexical entries or by means of a default assignment of undiffer- entiated sex. Thus, where declensional class is predictable, it is found by evaluating a path at the DECLENSION node. This path consists of informa- tion on the final segment of the inflectional root of the word followed by the sex of the word's referent. The DECLENSION node is given below: [11J DECLENSION: <consonant male> == N_I:<> <consonant female> == N_II:<> <vowel $sex> == N_V:<>. The first two paths encode the semantic assignment rules for declensional class. The first element of each path is consonant so these paths potentially apply to the bulk of Russian nouns. The second elements of the paths narrow down on two subsets of these. The first path assigns core nouns (typical native nouns) denoting males to class N„l; the second assigns core nouns denoting females to class N_II. The third path picks out those nouns whose inflectional root ends with a vowel and assigns them to class N_v, the class for indeclinables. In DATR, every symbol which begins with a dollar sign ($) is a variable. The variable $sex is defined to range over all possible values for the <sem sex> path, namely male, female and undifferentiated. One group of nouns is not covered by the equations at this node, namely those whose stem ends in a consonant and which have non-sex-differenti- ated referents. Such nouns must typically specify declensional class in their lexical entries. However, as we shall see shortly (in the discussion leading to [11']), a substantial proportion of them can be assigned to a declensional class by the setting of a default. Let us first look at the working of [Uj. Take a noun such as student 'student', the lexical entry for which is as follows: [12] Student: <> — NOUN <gloss> == student <infl„root all> == student <sem sex> == male. 134 Norman M. Fraser and Grevitle G. Corbett To which declensional class should this word be assigned? Interesting though this question is, it is likely to be of less importance than specific questions such as 'what is the dative plural form of this word?', or 'what is its nomi- native singular?' Let us take the second question. In terms of our theory, it maps onto the question 'what is the value of the <mor nom sg> path?' Since the Student node inherits directly from the NOUN node, the two equations shown in [9] are available to it. The first of these can be paraphrased as saying 'for paths beginning <mor . . .> evaluate the path <declen- sional_class> \ The definition of that path (also given in [9]) requires the evaluation at the DECLENSION node of a path consisting of the value of <infl_root t inal> for student, followed by the value of <sem sex> for the same word. The value of <inf l_root f inal> is consonant according to the unoverridden equation [10), and the value of <sem sex> for student is male according to the last equation in the lexical entry (shown in [12]). Thus the path <consonant male> must be evaluated at the DECLENSION node. According to the first equation in [11] this returns a value of N_I :<> for the path <mor>, where our search originated. However, this is not the end of the look-up process. We were interested in the value of the path <mor nom sg>, not just <mor>. It is necessary at this point to introduce another kind of DATR equation which was not mentioned earlier. An expression of the form NODE:<> may appear on the right hand side of an equation, for example, NODE1: Pathl == NODE2 : <>. In this example, an attempt to evaluate NODE1: SomePath will succeed and return a value if SomePath can be formed by concate- nating Pathl with a path defined at NODE2. In effect, the NODE: <> notation allows the description of a path to be distributed over more than one node. The entry in the DATR theory for N_I is given in [13]. [13] N_I: <> == N_0 <formal_gender> == masc <nom sg> == "<stem sg>" <hard gen pl> == "<stem pl>"_ov. Evaluating the path <mor nom sg> at the Student node will lead via NOUN to be following inference:17 (10) Student: <mor> == N_I : <> This allows the <mor> path from NOUN to be concatenated with the <nom sg> path from N_l to produce the following inference: (11) Student: <mor nom sg> =*= "<stem sg>" In other words, the nominative singular of student has a zero ending; (12) Student: <mor nom sg> = student It is worth contrasting declensional class assignment with gender assignment. A clear difference is the outcome when semantic and formal rules make dif- A Unified Account for Russian 135 ferenl predictions. For gender assignment, quite generally, it is the semantic rule which takes precedence. As wc noted earlier, with nouns like muicina 'man*, which ought to be masculine according to its semantics (male), but feminine according to its morphology (declensional class li), it is the semantic rule which 'wins' and the noun is masculine. When we find a similar clash in declensional class assignment, in Russian at least,18 the formal factor dom- inates. Thus attase 'attache' denotes a male and 'should' be in declensional class N_l, but it ends in a vowel and so 'should* be in declensional class N_V. It is actually in declensional class N_V. Conversely it denotes a male and so 'should' be masculine but it is in class N_V and so 'should' be neuter; as already noted, semantic factors take precedence in gender assignment and so the noun is masculine. A second difference between gender assignment and declensional class assignment, is that the gender assignment rules make a prediction for every noun (which has to be overridden in an extremely small number of instances), while the declensional class assignment rules leave large numbers of nouns with no prediction. A non-sex-differentiable noun with a stem ending in a soft consonant could decline according to class N_l, N_ll, N_l 11 or N_IV.19 However, assignment to the four classes is by no means equally likely. Consider data on the numbers of nouns in each class, to the nearest fifty (derived from Lazova 1974, especially pp. 942-943), given in Table 1. Table ). Number of nouns in (he different declensional classes. Class N_I NJi N_U1 N_IV N„V Example lakon komnata kosf vino laksi 'taw' 'room' 'bone' 'wine' 'laxi' No of nouns 20,850 16.050 5.150 11.050 450* These raw statistics give a rough picture of what is going on. The figures do not take account of the effect of derivational morphology; for instance, over 4,300 of the nouns in class N_lll have the suffix -ost' which forms abstract nouns from adjectives (star-yj 'old', star-ost' 'old age'). If the suffix is labelled as belonging to class n_iii, then the number of distinct members of the class is substantially reduced. Similarly nominalizations in -aniel-enie (like ravuSenie 'destruction', derived from razrusit' 'destroy') inflate the figure for n_iv (see Schupbach 1984 for discussion). Table i shows that N_l and N_ll have substantially more members than the other three. We can therefore set a default which will put nouns in the majority declensional class n_i. Clearly this will be overridden in many instances, but equally it will simplify a very substantial number of lexical entries. This is a considerable simplification, and one which can be justified in three interrelated ways. First, there is the numerical preponderance of n„l. Second, 136 Norman M. Fraser and Greville G. Corbett it is the declensional class which takes the majority of borrowings. There is no need to appeal to any markedness considerations to explain this, it is simply that N__I has no ending in the citation form, the nominative singular, so that foreign words ending in a consonant are typically borrowed into this class. The third argument is more debatable. If N_l is the default declensional class, and masculine is the gender associated by default with this class, then we make masculine the default gender for nouns, without specifying it directly. Sources vary as to the numerical preponderance of the genders. All agree that the neuter is easily the weakest; Lazova (1974: 942-943) has only marginally more masculines than feminines, while Mucnik (1971: 196-197) calculates that the masculine has 46% to the feminine 41% (on a sample of 33,952 nouns) and Zaliznjak's dictionary has 46% masculine and 38% feminine (sample 47,030: see Ilola and Mustajoki 1989: 9); the masculine is growing fastest, from the assignment of borrowings (see the sources reported in Corbett 1991- 78). The only change required in our DATR theory to include all of these interacting defaults is given below, as a revision to the fragment shown above as [11]. (11' ) DEF„DECL: <$phonological_type $sex> =- N_I:<> DECLENSION: <> == DEF_DECL <consonant female> == N_II:<> <vowel $sex> == N_V:<>. Here, the variable $phonological_type ranges over consonant and vowel, and $sex ranges over male, female and undifferentiated, as before. In all cases other than the two explicitly described in the paths at the DECLENSION node, the maximally general path specified at the DBF_DECL node is inherited and, thereby, class 1 becomes the default declen- sional class for nouns. We argued above that if class I is the default declensional class, and masculine is the gender associated by default with this class, then we make masculine the default gender for nouns. Note that we are claiming only that the masculine is the default gender for nouns. In a fuller analysis of Russian, we would claim that there is a default for gender at a higher level than the nodes relating directly to nouns. This higher default is necessary for items other than nominals which may head syntactic constituents with which gender agreement is required. The situation arises if, say, an infinitive phrase stands in subject position (and there is a past tense verb). Under these circumstances 'neutral' agreement forms occur (these are usually identical to neuter forms. Corbett 1991: 204-205, 216). Thus in an analysis of the full lexicon, at a high level, perhaps at the level of WORD, the default gender is the neuter; at the lower level of nouns it is the masculine. Of course, the lower default is much the more important: gender is a central category for Russian nouns, while A Unified Account for Russian 137 for the various items (such as infinitives) covered by the higher default, it is of much less importance. This hierarchy of defaults would thus allow us to capture the intuition that in one sense the masculine is the default gender and in another it is the neuter.2' In this section we have shown how the declensional class can be predicted for a substantial proportion of Russian nouns. We have seen how declen- sional class assignment differs from gender assignment in interesting ways; further research is needed to show whether these differences are specific to Russian or, as we suspect, are found more generally." 6. CONCLUSION In previous work we gave an account of gender assignment in Russian. Other analyses have given at least a partial account of declensional class assignment. To the best of our knowledge no-one has been able to combine them in a single analysis, as our present Network Morphology analysis does. It is difficult to be certain about this since many accounts are vague. DATR requires us to make our claims explicit. Having laid out our claims, we can demonstrate that they do indeed make the correct predictions, as shown in Appendix B. The success of our gender assignment analysis of the Russian data lends further support to the typology of gender assignment of Corbett (1991: 7-69). Since the account of Russian gender assignment (one of the more complex systems) has been shown to be robust, it follows that the entire typology is more securely founded. The fact that we have been able to combine gender, animacy and declensional class assignment into a single account, with a resulting considerable simplification of lexical entries, suggests that this approach to morphology based, in large part, on default inheritance and implemented in DATR is an extremely fruitful one. APPENDIX A: DATR THEORY %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% % TITLE: Russian nominal inflection % AUTHORS: Grev Corbett s Norman Fraser, University % of Surrey % NOTE: The labels for lexical entries are % simple transliterations; linguistic forms % are in phonological transcription. %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 8 Vars $number: sg pi. * vars $sex: male female undifferentiated. 138 Norman M. Fraser and Greville G. Corbeii %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% % WORD CLASS HIERARCHY %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% NOMINAL: <stem> == Vinf l„root all>" <phon stem hardness> == hard. NOUN: <> == NOMINAL <infl_root final> == consonant [10] <mor> == Vdeclensional„class>" [ 9 J Vdeclensional_class>" == DECLENSION: < Vinfl_root final>" *<sem sex>" > <syn cat> == n <sem sex> == undifferentiated (8] <syn gender> == GENDER: < Vsem sex>" > [1] <syn animacy> == Vsem animacy>" [5] <sem animacy> == ANIMACY:< Vsem sex>" >. (6) %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% % SOME CASE SWITCHES %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% GENDER: <male> == masc[2] <female> ■== fern <undifferentiated> == Vmor formal_gender>". ANIMACY: <> — animate (7 J <undif f erentiated> ==: inanimate . ACCUSATIVE: <> == "<mor nom>" <pl animate> == *<mor gen pl>" <sg animate masc> == Vmor gen sg>" DECLENSION: <consonant male> == N_I:<> (11] <consonant female> == N_II:<> <vowel $sex> == N„V:<> %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% % DECLENSIONAL CLASS HIERARCHY %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% N_NOM: <stem hardness> == Vphon stem hardness>" <acc $number> == ACCUSATIVE: < $number "<syn animacy>" Vsyn gender>" > A Unified Account for Russian 139 N_NOUN: <dat pl> « Vstem pl>" Vraor theme„_vowel>" _m <inst pl> == Vstem pl>" Vmor theme„vowel>" __m' i <loc pl> == "<stem pl>" Vmor theme_vowel>" _x. <> -= N_NOM <loc sg> == Vstem sg>" _e <nom pl> — Vstem pl>" „i <gen pl> *== V mor Vmor stem hardness>" gen pl>" <soft gen pl> == *<stem pl>" _ej <theme_vowel> == _a. N_0: <> =- N_N0UN <gen sg> « Vstem sg>"_a <dat sg> — Vstem sg>"„u <inst sg> =- Vstem sg>"_om. <> — N_0 E13l <formal_gender> == masc <nom sg> == Vstem sg>" <hard gen pl> ™ Vstem pl>"_ov. N_II: <> " N_N0UN <formal„gender> == fern E 3] <nom sg> — Vstem sg>"_a <acc sg> == Vstem sg>"_u <gen sg> »= Vstem sg>"_i <dat sg> « Vstem sg>"_e <inst sg> — Vstem sg>"_oj <hard gen pl> — Vstem pl>" N_III: <> — N_NOUN <formal_gender> == fern <stem hardness> == soft <nom sg> == N_I <gen sg> == N_II <dat sg> == Vmor gen sg>" <inst sg> — Vstem sg>"_ju <loc sg> == Vmor dat sg>" . N_IV: <> = = N_0 <formal_gender> == neut <nom sg> « Vstem sg>"_o <nom pl> == Vstem pl>"_a <hard gen pl> — N_H ■ 140 Norman M. Fraser and Greville G. Corbet! (12] N„V: <> « "<inf l_root all>" <formal_gender> == < "<sem animacy>" > <animate> == masc % IF animate THEN masc <inanimate> == neut. % IF inanimate THEN neut %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% % LEXICAL ENTRIES %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% NZakon: <> == NOUN <declensional„class> N_I:<> <gloss> == law <infl_root all> == zakon. No2: <> == NOUN <declensional_class> -= N_I:<> <gloss> == knife <inf l_root all> == noz <mor stem hardness> == soft. Student: <> == NOUN <gloss> == student <infl_root all> — student <stem sex> == male. Bolgarin: <> == NOUN <gloss> == bulgarian_man <infl_root all> == bolgar <stem sg> =» "<infl_root all>" _'in) <mor gen pl> « N_I I: <hard gen pl> <sem sex> == male. Komnata: <> == NOUN <declensional__class> == N_II:<> <gloss> == room <infl_root all> == komnat. Ucitel'nica: <> == NOUN <gloss> == female_teacher <infl_root all> == uc'itel'n'ic <sem sex> == female. Muzcina: <> == NOUN <declensional_class> == N_II:<> <gloss> == man <infl_root all> == mu§c'in <sem sex> == male. A Unified Account for Russian 141 Kost <> == NOUN <declensional_class> == N„III:<> <gloss> == bone <infl_root all> == kost'. <> == NOUN <declensional„class> == N_III:<> <gloss> == mouse <infl_root all> === mis' <sem animacy> == animate. put' : <> == NOUN <declensional„class> == N_III:<> <gloss> == way <mor inst sg> == N_I:<inst sg> <mor formal„gender> == N_I:<formal_gender> <infl„root all> == put' . Vino: <> ==: NOUN <declensional_class> == N„IV:<> <gloss> == wine <infl„root all> == v'in. More: <> =- NOUN <declensional_class> == N_IV:<> <gloss> == sea <infl„root all> ='= mor' <phon stem hardness> == soft. Taksi: <> == NOUN <gloss> == taxi <infl_root all> == taks'i <infl_root final> = = vowel. Gnu: <> == NOUN <gloss> == gnu <infl„root all> == gnu <infl„root final> == vowel <sem animacy> == animate. APPENDIX B: SOME THEOREMS OF THE DATR THEORY Zakon: <gloss> = law. Zakon: <mor nora sg> = zakon. Zakon: <mor acc sg> = zakon. Zakon: <mor gen sg> = zakon „a. Zakon: <mor dat sg> = zakon _u. 142 Norman M. Fraser and Greville G. Corbett Zakon <mor inst sg>. = = zakon _om Zakon <mor loc sg> = zakon _e. Zakon <mor nom pl> = zakon _i ■ Zakon <mor acc pl> = zakon „i ■ Zakon <mor gen pl> = zakon _ov. Zakon <mor dat p±> = zakon _a _ Zakon <mor inst pl> = zakon _a Zakon <mor loc pl> = zakon _a _ Zakon <syn gender> = masc. Zakon <syn animacy> = inanimate. m. m' i. x. Noz: Noz: Noz: Noz: Noz: Noz: Noz: Noz: Noz: Noz: Noz: Noz: Noz: Noz: Noz: <gloss> = knife. <mor nom sg> = noz\ acc sg> = noz. gen sg> = noz _a. dat sg> = noz __u. inst sg> = noz _om. loc sg> = noz _e. nom pl> = noz _i. acc pl> = noz _i. gen pl> = noz _ej. dat pl> = noz _a _m. inst pl> = nog _a _m' x. <mor <mor <mor <mor <mor <mor <mor <mor <mor <mor <mor <syn <syn 1. loc pl> = noz _ gender> = masc. animacy> = inanimate. Student <gloss> = student. Student <mor nom sg> = student. Student <mor acc sg> = student _a. Student <mor gen sg> = student „a. Student <mor dat sg> = student _u. Student <mor inst sg> = student „om Student <mor loc sg> = student _e. Student <mor nom pl> = student _i. Student <mor acc pl> = student _ov. Student <mor gen pl> = student _ov. Student <mor dat pl> = student _a _ Student <mor inst pl> = student _a Student <mor loc pl> = student _a _ student <syn gender> = masc. Student <syn animacy> = animate x. Bolgarin Bolgarin Bolgarin Bolgarin <gloss> = bulgarian_man. <mor nom sg> = bolgar _'in <mor acc sg> = bolgar „'in <mor gen sg> = bolgar _'in A Unified Account for Russian 143 Bolgarin: Bolgarin: Bolgarin: Bolgarin: Bolgarin: Bolgarin: Bolgarin: Bolgarin: Bolgarin: Bolgarin: Bolgarin: Komnata: Komnata: Komnata: Komnata: Komnata: Komnata: Komnata: Komnata: Komnata: Komnata: Komnata: Komnata: Komnata: Komnata: Komnata: <mor dat sg> = bolgar _' in „u. <mor inst sg> = bolgar in _om. <mor loc sg> = bolgar in _e. <mor nom pl> = bolgar _i. <mor acc pl> = bolgar. <mor gen pl> = bolgar. <mor dat pl> = bolgar _a _m. <mor inst pl> = bolgar _a „m' i. <mor loc pl> = bolgar _a __x. <syn gender> = <syn animacy> = <gloss> = room <mor nom sg> = acc sg> = gen sg> = dat sg> = inst sg> loc sg> = nom pl> = acc pl> = gen pl> = dat pl> = inst pl> <mor <mor <mor <mor <mor <mor <mor <mor <mor <mor <mor <syn <syn bolgar masc. = animate komnat komnat komnat komnat komnat komnat komnat komnat komnat. komnat komnat komnat .a • _u. _i . _e. _oj . _e • _i. i. loc pl> = gender> = fern. animacy> = inanimate. _m. x. U5itel' nica: <gloss> = female_teacher. ufiitel' nica: <mor nom sg> = uc'itel'n' ic _a. U5itel' nica: <mor acc sg> = u£'itel'n' ic _u. Ufiitel' nica: <mor gen sg> = u£'itel'n' ic „i ■ Ufiitel' nica: <mor dat sg> = uc'itel'n' ic _e. Ocitel' nica: <mor inst sg> = uc'itel'n 'ic „oj Ucitel' nica: <mor loc sg> = u£'itel'n' ic __e. Ubitel' nica: <mor nom pl> = u5'itel'n' ic _i - U£itel' nica: <mor acc pl> - ufi'itel'n' ic. Ucitel' nica: <mor gen pl> - u£'itel'n' ic. Ucitel' nica: <mor dat pl> - ufi'itel'n' ic _a _m. Ucitel' nica: <mor inst pl> = uc'itel'n ic „a m' i Ufiitel' nica: <mor loc Pl> = uc'itel'n' ic _a _x. U5itel' nica: <syn gender> = fern. Ufiitel' nica: <syn animacy> = animate. Muzcina Muz5ina MuzSina <gloss> = man. <mor nom sg> = mu§c 'in _a. <mor acc sg> = mus5'in __u. 144 Norman M. Fraser and Greville G. Corbelt Muzcina : <mor gen sg> = musc'in _i . Muzcina : <mor dat sg> = mugc'in _e . MuzSina : <fitor inst sg> = mugc'in _oj ■ Muzcina : <mor loc ag> = mugc'in _e . Muzcina : <mor nom pl> = mugc'in _i . Muzcina : <mor acc pl> = mugc'in. Muzcina : <mor gen pl> = mugc'in. Muzcina : <mor dat pl> = mugc'in a _m Muzcina : <mor inst pl> = mus£' in Muzcina : <mor loc pl> = musc'in _a _x ■ Muz£ina : <syn gender> = masc. MUzC Lna : <syn animacy> = animate Kost ' <gloss> = bone Kost <mor nom sg> = kost'. Kost <mor acc sg> = kost'. Kost ' <mor gen sg> = kost' _i. Kost ' <mor dat sg> = kost' _i. Kost i <mor inst sg> = kost' _)u Kost ' <tnor loc sg> = kost' _i. Kost <mor nom pl> = kost' _i. Kost i <mor acc pl> = kost' _i. KOS t <mor gen pl> = kost' _ej. V r» 4-KOSt <mor dat pl> = kost' _a jn. SOS t i <mor inst pl> = kost' _a _m' i. l\OS U i <mor loc pl> = kost' _a _x. <syn gender> = fern. Kost <syn animacy> = inanimate. Mys <gloss> = mouse Myg' <mor nom sg> = mis. Myg' <mor acc sg> = mig. Myg' <mor gen sg> = miS _i. Myg' <mor dat sg> = mi§ _i. Myg' <mor inst sg> = mis _ju. Myg' <mor loc sg> = mi2 _\, Myg' <mor nom pl> = mig _i. Myg' <raor acc pl> = mig _ej. Myg' <mor gen pl> = mis _ej. Myg' <mor dat pl> = mig _a _m. Myg' <mor inst pl> = mig _a _m' i . Mys' <mor loc pl> = miS _a _x. Myg' <syn gender> = fern. Mys' <syn animacy> = animate. Put' <gloss> = way Put' <mor nom sg> = put' . A Unified Account for Russian 145 put' <mor acc sg> = put' . put' <mor gen sg> = put' _i. put' <mor dat sg> = put' „i • put' <mor inst sg> = put' „om. put' <mor loc sg> = put' _i ■ put' <mor nom sg> = put' _i . put' <mor acc pl> = put' _i. put' <mor gen pl> = put' _ej . put' : <mor dat pl> = put' _a _m put' : <mor inst pl> = put' _a put' : <mor loc pl> = put' _a _x put' : <syn gender> = masc. Put' : <syn animacy> = inanimate. Vino : <gloss> = wine. Vino : <mor nom sg> = v' in _o. Vino ; <mor acc sg> = v' in _o. Vino : <mor gen sg> = v' in _a. Vino : <mor dat sg> = v' in _u. Vino : <mor inst sg> = v' in „om. Vino : <mor loc sg> = v' in _e. Vino : <mor nom pl> = v' in _a. Vino : <mor acc pl> = v' in _a. Vino : <mor gen pl> = v'in. Vino : <mor dat pl> = v' in _a _n Vino : <mor inst pl> = v' in _a _ Vino : <mor loc pl> = v' in _a _y Vino : <syn gender> = neut. Vino : <syn animacy> = inanimate. More : <gloss> = sea More : <mor nom sg> = mor' _o. More : <mor acc sg> = mor' _o. More : <mor gen sg> = mor' _a. More : <mor dat sg> = mor' _u. More : <mor inst sg> = mor' _om. More : <mor loc sg> = mor' _e. More : <mor nom pl> = mor' _a . More : <mor acc pl> = mor' _a. More .: <mor gen pl> = mor' _ej . More ;: <mor dat pl> = mor' _a „i More i: <mor inst pl> = mor _a Mors ;: <mor loc pl> = mor' _a _ More s: <syn gender> = neut. Mort 5: <syn animacy> = inanimate Taksi: <gloss> = taxi 146 Norman M. Fraser and Greville G. Corbet! Taksi <mo c nom sg> = taks i . Ta k s i <JJ)OJ " acc sg> = taks' i . rr> \, i i aks i <mo2 " gen sg> = taks' i . Ta ks i <moi " dat sg> = taks' i. I etKS 1 <moi ' inst sg> = taks ' i . Ta ks i <moi * loc sg> = taks' i . Taks i: <moi nom pl> = taks' i. Taksi: <mor acc pl> = taks' i . Taksi: <mor gen pl> = taks' i . Taks i: <moi dat pl> = taks' i. Taksi: inst pi> = taks ' i. Taksi: <mor loc pl> = taks' i. Taksi : <syn gender> = neut. Taksi: <syn animacy> = inanimat C1 r\ u -o n U , <gloss> - gnu. pnn ■ IjfJll . <mor nom sg> = gnu. oil U . <mor acc sg> = gnu. Gnu: <mor gen sg> = gnu. Gnu : <mor dat sg> = jnu. Gnu : <mor inst sg> = gnu. <mor loc sg> = < jnu. Gnu: <raor nom pl> = < jnu. Gnu : <mor acc pl> = c fnu. Gn U : <mor gen pl> = c fnu. <mor dat pl> *. c fnu. Gnu: <mor inst pl> = gnu. Gnu: <mor loc pl> = g nu. Gnu: <syn gender> => m asc. Gnu: <syn anirnacy> = animate. NOTES 1 Versions of ihis paper were read at the Spring Meeting of the Linguistics Association of Great Britain, University of Birmingham. March 1993 and at the Finnish Circle of Slavists, Helsinki, May 1994. We would like to thank those present for helpful comments, and particu- larly Alan Timberlakc and Ursula Doleschal for discussion. We arc also very grateful to Gerald Gazdar and Dafydd Gibbon for a number of useful suggestions, and to Petra Barg for comments on an earlier draft. Any errors are ours. This research was supported by the Economic and Social Research Council (grant R0O0233633) and the Leverhulme Trust (grant F.242M); the support of both funding bodies is gratefully acknowledged. 1 Our approach is complementary to Gibbon's work on (LEX. The Utter is an approach to the lexicon in computational linguistics, implemented in DATR. '. . . the ILEX concept may be thought of as a set of linguistic constraints on the form of possible DATR representations' (Gibbon 1992; 47). Network Morphology can be viewed as a set of linguistic constraints on possible DATR representations of morphology. A Unified Account for Russian 147 I The choice of DATR as the language of formal expression for Network Morphology is no accident. The theoretical framework emerged out of our exploratory work with DATR. informed by our earlier experience of default inheritance in Artificial intelligence and in Word Grammar (Fraser & Hudson 1992). DATR's special attractions are a conceptually simple syntax and semantics, and a range of existing computer interpreters for the language. * The implementations we know of arc by A. Bollz (Konstan*), Roger Evans (Sussex), Norman Fraser (Surrey), Dafydd Gibbon (Bielefeld), James Kilbury (DOsseldorf). Hagcn Langer (Bielefeld) and A. Sikorski (Poznari), 5 Morphological forms to be accounted for are given in phonemic transcription (while other forms, such as in examples (7)-(9) below are in the standard transliteration). The automatic phonological correspondences we assume are as follows: 1. lil is retracted to its allophone [ij after non-back hard (unpa(atalized) consonants. Thus the nominative plural form /zakontf will be realized with (i) but /kost'i/ retains li) since [t'J is soft. 2. All consonants which can be palatalized arc automatically palatalized before Id. Thus the locative singular of /zakon/, namely /zakonc/. will be realised wiih a palatalized (n'J. If the consonant is already palatalized as in genitive plural /kost'-ej/, it simply remains palatal- ized. Some consonants are always hard (/S, i, cl), and remain so before Id. On the other hand, ti'f and lift arc always soft (palatalized), and naturally remain so before Id. We have chosen to mark softening redundantly for greater clarity in this instance. In addition, the gutturals Ik, g, x/ are palatalized before HI, so that the genitive form /knigi/. from /kniga/ 'book', will be realized with palatalized [g'j (which then demands the front allophone [i|). 3. There are complex patterns of reduction of vowels in unstressed position, which can safely be omitted from the transcription since our focus is on morphology. In particular, the unstressed lol ending of nouns with soft stems such as /pol'o/ 'field' (orthographically pole) is realized as either!»} or [ij. Although this lol is never realized as anything approximating to a mid rounded back vowel, positing lol is justified by the stressed (6) which occurs in Iv'ino] 'wine' and (p'it'j6J 'drink(ing)'. For an informative sketch of Russian phonology, see Timberlake (1993: 828-832). 6 Some implementations of DATR require value sequences to be enclosed in round brackets; other implementations allow bracket enclosed sequences but do not require them. Here we follow the current practice of Gazdar and Evans and omit all round brackets. ' Only facts relevant to the immediate discussion are shown at nodes NJI and N_IIi in (3). In our full analysis of Russian nominal morphology, each of these nodes records a much richer set of facts than is shown here. The facts for N_III are quite different from those for N_H, so an analysis of the form 'N_lli:<> — N_IP is out of the question. See Appendix A for further details. * A full account of the complexities of the genitive plural is given in Brown and Hippisley (1994). * This presentation is illustrative only, since it masks a formal distinction in DATR between the '-' and '—' symbols. See Evans & Gazdar (1989a) for further discussion. 10 For a recent discussion of how gender can be defined for Russian see Hubenschmid (1993: 47-77). " We do not examine the complexities of hybrid nouns here, for which see Corbett (1991: 39, 231-232), and for interesting data see Weiss (1991) and Doleschal (1992, 1993: 40-46, 138-144). II There is a smalt number of exceptions, for which see Smimova (1979). We are not concerned here with the special case of acronyms where derivational information may be relevant. " DATR fragments from our theory of Russian nominal morphology are identified by square-bracketted numbers. These numbers are also used to pick out the fragments in the full theory given in Appendix A. A more extended account to include cases where reference may be to either sex would require us to state the first equation as: GEHDER: <> == masc. Instances of this kind would be 148 Norman M. Fraser and Greville G. Corbett conectly assigned masculine gender so long as the indeterminate sex of the referent were identified by an explicit marker such as either. " Thus Thelin (1975: 30) claims that 'the inflectional class membership of Russian nouns can to a great extent be predicted from information on gender and/or the nature of the final consonam of the stem'; see also Crocket! (1976: 12). And Stankiewicz (1978: 666-667) postulates just two declensions for all the Slavonic languages, and then predicts the differences within the paradigms from gender and from the morphological structure of the stem. Corbett (1982) demonstrates that full prediction of declensional class from gender is not possible, but that prediction of gender from meaning and declensional class is possible, given four declen- sional types. Our earlier paper (Corbett & Fraser 1993) allowed us to reconcile the competing claims of the three- or four-declensional models (our objections to Stankiewicz's two-declen- sional model are given in Corbett 1982: 207). Here we arc adding in the predictability of declensional class which, though weaker than that of gender (Corbett 1982: 224), stilt permits a reduction in the lexical entry of large numbers of nouns. Halle (1992: 37-38, 45) postulates the prediction of declensional class from gender for Latvian, Latin and Russian, but does not argue for this position rather than for the converse. Aronoff (1994: 73-74) allows predictions in both directions. Some claim to predict declensional class from gender but use the nominative singular as the lexical entry rather than the stem; for many nouns the nominative singular effectively provides the stem and the declensional class, hence there is actually very little prediction going on. 16 There is no reason why such a phonological component could not also be formalized using DATR, as the work of Reinhard & Gibbon (1991) and Gibbon (1992) demonstrates. " The inferences shown in (IOH'2) may be drawn from the DATR theory but do not belong to it. Hence they cannot be indexed with any part of the theory by means of square bracketed identifiers. 11 While the claim relating to gender assignment is a general one, we have not investigated declensional class assignment beyond Russian. It is a plausible hypothesis that in declensional class assignment generally, formal factors will take precedence over semantic. '* If the stem ends in a hard consonant, then type N_lll can be eliminated. We have no rule to predict membership of class N_I1I; stems are by default 'hard' (they end in a hard conso- nant). The N_lll class is quite small (see discussion after Table 1). M This figure includes the small number of nouns (32 in Lazova's table) which are indeclin- able but which do not end in a vowel. They are typically borrowings. Borrowings tike miss 'miss', which denote females but end in a consonant are indeclinable. These require art exceptional marker just to indicate that they are indeclinable: the semantic gender assignment rule will then cor- rectly assign them to the feminine gender. Rare borrowings which are indeclinable, consonant final, do not denote an animate, and are masculine or feminine, require two irregular markers (for declensional class V, and for gender). These marginal cases tend to be integrated over time or lost. " It is important to note that grammatical defaults of different types may or may not tine up together (and when there are only two possibilities, as with two-gender systems, then the coin- cidence cannot be assumed to be of any great significance). In Arapesh, Aronoff (1992) discovered remarkable consistency in the way the grammatical defaults lined up. In other cases, as in Russian, the picture is less straightforward. For an unusual type of gender default, see Corbett & Mtenje (1987: 15) on Chichewa. " The declensional class assignments are not given in Appendix B; this gives all the avail- able morphological forms (which require the declensional class assignment to be correct) and the gender and animacy. In other words, we provide all the information which might be required by the syntax (for agreement purposes), and the morphological forms supplied according to syntactic specifications. This seems to be a realistic set of outputs. The declensional class assignments are correct, and could be shown; the point is simply that there is nothing outside the morphological component which requires information about them. A Unified Account for Russian 149 REFERENCES Aronoff, Mark. 1992, "Noun Classes in Arapesh". In Geert Booij and Jaap van Marie (cds.), Yearbook of Morphology {991. Dordrecht: Kluwer, 21-32. Aronoff, Mark 1994. Morphology By Itself: Stems and Inflectional Classes (- Linguistic Inquiry Monograph 22). Cambridge MA: MIT Press. Brown, Dunstan and Andrew Hippisley. 1994. "Conflict in Russian Genitive Plural Assignment: A Solution Represented in DATR". Journal of Slavic Linguistics 2, 30-48. Corbett, Greville G. 1982. "Gender in Russian: an Account of Gender Specification and its Relationship to Declension". Russian Linguistics 6, 197-232. Corbett, Greville G. 1991. Gender. Cambridge: Cambridge University Press. Corbett. Greville G. and Norman M. Fraser. 1993. "Network Morphology: A DATR Account of Russian Inflectional Morphology". Journal of Linguistics 29, 113-142. Corbett, Greville G. and Alfred D. Mtenje. 1987. "Gender Agreement in Chichewa". Studies in African Linguistics 18, 1-38. Crockett, Dina B. 1976. Agreement in Contemporary Standard Russian. Cambridge, MA: Slavka. Daelemans. Walter, Koenraad De Smedt and Gerald Gazdar. 1992. "Inheritance in Natural Language Processing". Computational Linguistics 18, 205-218. Doleschal. Ursula. 1992. "Genus und (Ko-)Referenz". In Tilmann Rcuthcr (cd.). Slavtstische Linguistik 1991: Referate des XVII. Konstanier Siavistischen Arbeitstreffens, Klagenfurt-St. GeorglLangsee I0.-I4.9.I99I (- Slavistische Bcitrage 292). Munich: OnoSagner, 123-135. Doleschal, Ursula. 1993. Genus als grammatische und textlinguistiscfie Kategorie: Eine kognitiv- funktionalisiische Untersuchung des Russischen. Unpublished doctoral dissertation, University of Vienna. Evans, Roger and Gerald Gazdar. 1989a. "Inference in DATR". Proceedings of the 4th Conference of the European Chapter of the Association for Computational Linguistics. Manchester, England. 66-71. Evans, Roger and Gerald Gazdar, 1989b. "The Semantics of DATR". In A. G. Cohn (ed.), Proceedings of the Seventh Conference of the Society for the Study of Artificial Intelligence and Simulation of Behaviour. London: Pitman/Morgan Kaufmann, 79-87. Fortune, Reo F. 1942. Arapesh. (Publications of the American Ethnological Society 19). New York: J. J. Augustin. (Reprinted 1977. New York: AMS Press). Fraser, Norman M. and Greville G. Corbett. 1993. "It's not my Default: a Network Morphology Analysis of Noun Classes in Arapesh". Paper read at the Autumn Meeting of the Linguistics Association of Great Britain, Bangor, 13-15 September. Fraser, Norman M. and Richard A. Hudson. 1992. "Inheritance in Word Grammar". Computational Linguistics 18, 133-158. Gazdar, Gerald. 1987. "Linguistic Applications of Default inheritance Mechanisms". In P. Whitelock, M. M. Wood, H. L. Somers, R. Johnson and P. Bennett (cds.), Linguistic Theory and Computer Applications. London: Academic Press, 37-68. Gazdar, Gerald. 1990. "An Introduction to DATR". In Roger Evans and Gerald Gazdar (eds.), 77ie DATR Papers (- Cognitive Science Research Paper CSRP 139). School of Cognitive and Computing Sciences, University of Sussex, 1-14. Gazdar, Gerald. Forthcoming. "Ceteris paribus". To appear in J. A. W, Kamp and C. Rohrcr (eds.), Aspects of Computational Linguistics. Berlin: Springer. Gibbon, Dafydd 1992. "ILEX: A Linguistic Approach to Computational Lcxica". In Ursula Klenk (ed.), Computatio Linguae: Aufsatze zur algorithmischen und quantitativen Analyse der Sprache {- Zeitschrift fiir Dialektologie und Linguistik, Beihefi 73). Stuttgart: Franz Stciner Vcrlag, 32-53. Halle, Morris. 1992. "The Latvian Declension". In Geert Booij and Jaap van Marie (eds.). Yearbook of Morphology 1991. Dordrecht: Kluwer, 33-47. Hubenschmid, Markus 1993. Genus und Kasus der russischen Substantive: Zur Definition und Identiftkation grammatischer Kategorien (= Vorrage und Abhandlungen zur Stavistik 21). Munich: Otto Sagner. 150 Norman M. Fraser and Greville G. Corbetl Ilola, Eeva and Ano Mustajoki. 1989. Repori on Russian Morphology as ii appears in Zaliyiyak's Grammatical Dictionary (- Slavica Hclsingiensia 7). Helsinki: Department of Slavonic Languages, University or Helsinki. Lazova, M. V. (cii.). 1974, Obrainyj slovar" russkogo jazyka: okolo 125 000 slow Moscow: Sovetskaja Enciklopedija. Muc'nifc, I. P. 1971. Grammatic'eskie kategorii glagola i imeiti v sovremennom russkom liter- aturnon jazyke. Moscow: Nauka. Reinhard, Sabine and Dafydd Gibbon. 1991. "Prosodic Inheriiancc and Morphological Generalisaiions". Proceedings, Fifth Conference of the European Chapter of the Association for Computational Linguistics, Berlin 1991, 131-136. Schupback, R. D. 198-1. Lexical Specialization in Russian (= UCLA Slavic Studies 8). Columbus, Ohio: Slavica. Shieber, Swart M. 1987. "Separating Linguistic Analyses from Linguistic Theories". In Peter Whitclock, Mary McGee Wood, Harold L. Somers, Rod L. Johnson and Paul Bennett (cds.), Linguistic Theory and Computer Applications. London: Academic Press, 1-36. Smirnova, G. A. 1979. "Kategorija roda ncsklonjaemyx suScestvitel'nyx". In V. P. Grtgor'ev (ed.), Lingvistika i poetika. Moscow: Nauka, 86-105. Stankiewicz, Edward. 1978. "The Inflection of Serbo-Croatian Substantives and Iheir Genitive Plural Endings". In Henrik Birnbaum (ed.), American Contributions to the 8lh International Congress of Slavists. I: Linguistics and Poetics. Columbus, Ohio: Slavica, 666-681. Thelin, Nils B. 1975. Notes on General and Russian Morphology (- Studia Slavica Upsaiiensia 15). Stockholm: Alniqvist & Wiksell. Timbcrlake, Alan. 1993. "Russian". In Bernard Connie and Greville G. Corbett (eds.), The Slavonic Languages. London: Routledge, 827-886. Weiss, Daniel. 1991. "Scxus Distinctions in Polish and Russian". In Maciej Grochowski and Daniel Weiss (eds.), "Words are physicians for an ailing mind'' (- Sagners Slavistische Sammtung 17). Munich: Otto Sagner, 449-466. Worth, Dean S. 1966. "On the Stem/ending Boundary in Slavic Indeclinables". Zbornlk za filologiju i Ungvistiku 9, 11-16. Zaliznjak, A. A. 1977. Grammatideskij slovar1 russkogo jazyka: slovoizmenenie. Moscow: Russkij jazyk. (Corbett) Dept. of Linguistic and International Studies University of Surrey Guildford, Surrey GV2 5XH England e-mail: g.corbelt@surrey.ac.uk. (Fraser) Vocalis Ltd. Chaston House Mill Court Great Shelford Cambridge CB2 5LD England e-mail: norman@vocalis.demon.co.uk Evaluative Affixes in Italian DONNA JO NAPOU and BILL REYNOLDS I. INTRODUCTION Italian nouns and adjectives make use of almost two dozen evaluative affixes (to borrow Scalise's 1984 term), most of which can also be found on verbs. While a few of these affixes are strongly productive on N and A and weakly productive on V, today most of those that occur on V are lexicalized (though not all; see Cortelazzo & Cardinale 1989). Nevertheless, their evaluative sense is obvious, and their occurrence suggests a period in the history of Italian (from around 1300 to 1600, judging by Cortelazzo & Zolli 1979) when all these affixes were productive. Since prepositions in Italian form an inerl class with respect to morphology (with the exception of portmanteau prepo- sition-articles, as in Napoli & Nevis 1987) and always have in the history of Italian, it is natural that these affixes did not occur on P. Thus, evaluative suffixes did occur on ail morphologically relevant categories. This means that most evaluative affixes went through a period when they did not select for category. Aronoff claims that word formation rules (WFRs), such as affixation, operate on a base that is "always specified syntactically" (1976: 47). The general syntactic and semantic conditions that a WFR might be sensitive to are "category, subcategory, selection, and lexically governed entailment and pre- supposition" (1976: 48). The specification is, furthermore, always unique. Aronoff dubs this claim the Unitary Base Hypothesis and argues that if any WRF can be found to apply to bolh nouns and verbs, for example, the UBH would be refuted, given that N and V cannot be stated as a single syntactic category. In support of the UBH, Aronoff looks at a potential counter- -example, the English affix-able, which attaches to both N (sizable) and V (readable), and argues that the identity here is only apparent - that we have, in fact, two distinct affixes with different properties. Aronoff does not admit . the possibility of an affix which would put no syntactico-semantic condi- ; tions on the base to which it attaches. The evaluative suffixes studied here, which definitely attach to N as well | as V, provide a clear counterexample to Aronoff s UBH, even as modified ^ in Scalise (1984), as we discuss below (see Section 8). V Still, as Zwicky & Pullum (1987) have pointed out, "plain" morphology ;f contrasts with "expressive" morphology in a number of ways, including pre- cisely the fact that expressive morphology is promiscuous with respeel to the base it operates on. By plain morphology they mean morphological rules lliat belong properly to the rules of grammar of natural languages. By expres- .fiye morphology they mean extragrammatical phenomena in word formation, 151 peer! Boolj and Jaap van Marie (eds.). Yearbook of Morphology 1994, 151-178. -© 1995 Kluwtr Academic Publishers. Printed in the Netherlands.
x

Log In

or reset password

Reset Password

Enter the email address you signed up with, and we'll send a reset password email to that address

Academia © 2012