Morphophonological nature of Mari accentuation as viewed from the Uralic perspective.

1. Background

The system of accentuation in Mari seems atypical for the Uralic languages, as Mari has variable but clearly definable stress. (1) The variable position of Mari stress was already noted by Castrén (1845: 8-9), along with a few remarks about its location (e.g. that in disyllabic words the stress is more often on the first syllable, although it can also be placed on the second syllable).

In the course of the 20th century, several attempts were made to formulate a set of rules that would predict the position of stress in Mari words. Below I give a few quotations from different sources describing Mari stress. The works are listed in chronological order.

1.1. V. M. Vasil'jev ([TEXT NOT REPRODUCIBLE IN ASCII] 1927: 8-9) noted: The stress in Mari can be placed in the beginning, in the middle, or at the end of the word.

1. In words having <<a, [??]>> (2) in the final syllable (both open and closed), the final syllable is stressed.

2. Words with <<[??]>> in the final closed syllable have stress on this syllable. Words with <<[??]>> in the final open syllable have stress on the penultimate syllable, unless the vowel in the penultimate syllable is <<bl>>. In the latter case the stress shifts to the antepenultimate syllable. Exception: a few words ending with <<[??]>> have stress on it.

3. Short <<bl, bl, u>> are usually unstressed. If a word containing <<bl>> and <<bl>> also has some other vowels, the stress is placed on the rightmost other vowel. If there are no other vowels in non-final syllables except for <<[??]>>, the stress is on the first syllable of the word.

4. In words having <<o, o, y, y>> in the final open syllable the stress falls on the penultimate syllable, unless the vowel in the penultimate syllable is <<bl>> or <<bl>>. In the latter case, the stress moves to the antepenultimate syllable. Very few words ending in <<y, y>> have stress on these vowels (mostly in Meadow Mari).

1.2. G. G. Karmazin ([TEXT NOT REPRODUCIBLE IN ASCII] 1936: 15-16) added some morphological criteria to define the position of stress:

In order to have a true understanding of Mari stress one has to consider each morphological category separately. Additionally, Mari has three basic varieties: Eastern, Meadow and Hill Mari. Each dialect has its own specific accentuation. [---] (3)

1. The stress in Meadow Mari can be located at the beginning, in the middle, or at the end of the word.

2. The <<bl>> vowel is not stressed if there are any other vowels in the word.

3. If there are no other vowels in the word except for <<bl>>, the stress comes on the first syllable.

4. In Meadow Mari, the word-final vowels <<o, o, [??]>> are always unstressed and pronounced similarly to the <<bl>> vowel (see the exceptions below), thus the stress never falls on the final <<o, o, [??]>>. [---]

8. The main rule: the stress is on the final syllable of the word if the final syllable (both open and closed) contains the vowels <<a, a, u>> or if the final syllable is closed and contains the vowel <<[??]>>.

Exceptions to the main rule:

9. A few nouns in Meadow and Eastern Mari have the stress on the word-final <<o>> and <<[??]>>.

10. A few adjectives and nouns in Meadow Mari have stress placed on the word-final <<y>> and <<y>>.

1.3. L. P. Gruzov ([TEXT NOT REPRODUCIBLE IN ASCII] 1960: 132-138) did not formulate any definite rules for Mari stress, but pointed out its correlation with the length of the vowel and the grammatical form of the word:

The Mari stress is not fixed. [---] The fact that the stress is not fixed does not allow formulating strict rules of its location in a word. [---] In nouns, the first syllable is quite often stressed, but in verbs the situation is different. [---] The stress is tightly connected with the grammatical form of the word. [---] The restriction on stressing <<bl>> is explained by the nature of the stress. As Mari stress correlates with length, and the vowel <<bl>> is shorter than other vowels, <<bl>> is usually unstressed. [---] Thus, Mari vowels are to some degree dependent on the stress, and conversely the location of the stress correlates to some extent with the length of the vowels.

1.4. J. I. Kovedjajeva ([TEXT NOT REPRODUCIBLE IN ASCII] 1970) addressed the stress in different types of words separately (she discussed disyllabic words with the same root, words with a closed stressed final syllable, polysyllabic words with the same root, compound words, etc.). The author did not distinguish between stress as a prosodic-phonetic mechanism and stress as a part of the phonological system; she therefore considered many different accentuation types but did not formulate any compact rules. Her general principle, however, is the following: "The position of stress in Mari (the Morki-Sernur variety of Meadow Mari) depends on the phonetic structure of the word: the stress is on the full vowel closest to the end of the word. There are some exceptions." ([TEXT NOT REPRODUCIBLE IN ASCII] 1970:96).

1.5. Alho Alhoniemi (1993:21-22) suggested that reduced and full vowels should be treated separately, and that the latter should be classified into strong and weak. Weak full vowels are those that change into reduced vowels before a suffix. The stress falls on the last strong full vowel, or on the very first vowel if there are no strong full vowels in the word. Some inflectional suffixes are considered inconsistent with respect to stress. As will be seen from the following discussion, this rule is often correct, but it does not explain the placement of stress in some forms. For example, in the 1st and 2nd plural past forms the final vowels do not change into reduced vowels, but they still cannot be stressed, and thus they should be considered accentually weak.

1.6. The most detailed rules of Mari accentuation are found in [TEXT NOT REPRODUCIBLE IN ASCII] 2003:104-108. The main rules are the following:

If a word ends in <<a, u, y, y>> or if these vowels are in the final closed syllable, then the stress is on these vowels. However, in imperative and past 1 forms the stress comes on the root, and not on the final <<a>>.

If the final closed syllable contains <<[??]>>, it is stressed.

If the final closed syllable contains <<[??]>> (orthographic <<e>>), the stress comes on the preceding syllable. Many nouns, adjectives, postpositions and particles are exceptions.

If a syllable with <<[??]>> is preceded by a syllable with <<bl>>, the stress is on the preceding full vowel.

If the final closed syllable contains <<bl>> and there are other vowels in the word, the stress is on the full vowel of the syllable preceding the syllable with <<bl>>.

If all syllables contain <<bl>>, the stress is on the first syllable.

If all syllables contain <<bl>>, but the final open syllable contains <<[??]>>, the stress is on the first syllable with <<bl>>. Gerunds ending in <<-de>> have the stress on the <<[??]>> vowel (spelled as <<e>> in the orthography).

If a word ends in <<o, o>>, the stress is on the preceding syllable.

If a word ends in <<o, o>> and the preceding syllable has <<bl>>, the stress comes on the full vowel of the preceding syllable.

The quotations listed above create the impression that Mari stress hardly follows any rules, and is dependent on the quality of the vowel, openness/closedness of the syllable, the vowel length, the morphological form and part of speech. However, my research has shown that the situation is not so complicated.

2. Aim of the paper

The main aim of the paper is to explain the structure of the Mari system of accentuation and to formulate a compact rule that describes the position of stress in Mari. The study covers all inflectional forms of nouns, verbs and adjectives.

Another goal of the paper is to discuss Mari stress in the context of other Uralic languages. I suggest that in spite of certain specific features Mari stress does not look alien to the Uralic family. This question will be addressed in the last section of the paper.

3. Data and methods

The study is based on the Staryj Torjal variety of Meadow Mari. There are certain phonetic differences between this variety and Standard Mari, (4) but usually they do not concern the position of stress. The differences in accentuation that I observed are discussed in this paper.

The transcription of examples is the Latin transliteration of standard orthography (phonetic dialectal differences in pronunciation are not indicated, as they do not affect accentuation).

The material was collected by the author in field trips organized by the Department of Theoretical and Applied Linguistics of Lomonosov Moscow State University in 2000, 2001 and 2004.

The main data collecting method was elicitation (a native speaker translated words or simple sentences from Russian into Mari). If necessary, speakers were asked additional questions concerning the position of stress. More than ten native speakers of different ages were questioned, and no significant variation in the position of stress was observed between the speakers. It should also be pointed out that Mari stress is very distinct (the only two exceptions will be discussed below) and confusing the stress position is not likely.

4. Mari accentuation

4.1. The rule of Mari accentuation

As shown above, previous researchers tried to formulate the rules of Mari accentuation in terms of phonetic units (segments, syllables) and sometimes morphological units (grammatical forms). Let us compare two Mari words: serge (5) 'expensive:NOM' and serge 'comb:NOM'. These words have the same segmental, syllabic and morphological structure (both consist of one root morpheme). However, the position of stress in these words is different. This example demonstrates that Mari accentuation cannot be described on the phonetic and/or morphological level. What I suggest instead is formulating the rule of Mari stress in terms of morphophonology, using the notion of a formative as a unit of morphophonological representation. (6)

The morphophonological representation of a word is a sequence of formatives separated by "+" symbols. A border between morphemes is always a border between formatives, but not vice versa, as one morpheme can consist of one or more formatives. For example, the form portemen 'house:POSS.1SG:GEN' consists of three morphemes: port-em-en 'house-POSS.1SG-GEN'. Its morphophonological representation is port+em+en, and the borders between morphemes correspond to the borders between formatives. The form tolmeske 'to come:GERPOST' consists of two morphemes: tol-meske 'to come-GERPOST'. Its morphophonological representation is tol+me+sk+e (the suffix meske is split into three formatives). Cases in which a morpheme should be split into several formatives will be discussed below.

From the point of view of accentuation every formative can be qualified as weak or strong (the qualifying principles are discussed in section 4.1.1).

Based on the opposition of strong and weak formatives I formulate the following simple rule that allows prediction of the position of stress in all nouns, adjectives and verbs (and also in most words of other classes):

1. The stress is placed on the rightmost strong formative in the word. If there are no strong formatives, the stress is placed on the first vowel in the word.

2. If there are several vowels in a strong formative, the stress is placed on the rightmost full vowel (7) in the formative.
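The two parts of the rule can be sketched in code. The following Python fragment is only an illustration under stated assumptions, not part of the original analysis: the `Formative` class, the function name and its return value are hypothetical, the reduced vowel is written `ə` by assumption (the transliteration used in this text does not mark it), and the weak/strong value of each formative is supplied by the caller rather than computed.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Assumed inventories, for illustration only; the reduced vowel is written "ə".
REDUCED_VOWEL = "ə"
FULL_VOWELS = set("aeiouäöü")


@dataclass
class Formative:
    text: str     # segmental content of the formative
    strong: bool  # accentual class, supplied by the analyst


def stress_position(formatives: List[Formative]) -> Optional[Tuple[int, int]]:
    """Return (formative index, character index) of the stressed vowel."""
    # Part 1: scan from the right for a strong formative.
    for i in range(len(formatives) - 1, -1, -1):
        f = formatives[i]
        if not f.strong:
            continue
        # Part 2: inside that formative, take the rightmost full vowel.
        for j in range(len(f.text) - 1, -1, -1):
            if f.text[j] in FULL_VOWELS:
                return (i, j)
    # No strong formative was found: stress the first vowel of the word.
    for i, f in enumerate(formatives):
        for j, ch in enumerate(f.text):
            if ch in FULL_VOWELS or ch == REDUCED_VOWEL:
                return (i, j)
    return None  # the word contains no vowel at all


# Forms and weak/strong values as described in section 4.2.1 (ASCII transliteration).
infinitive = [Formative("pur", True), Formative("as", True)]
past2_1sg = [Formative("pur", True), Formative("en", False), Formative("am", True)]
imperative = [Formative("solt", True), Formative("o", False)]
# Invented string without any strong formative, to exercise the fallback.
all_weak = [Formative("kəl", False), Formative("əm", False)]
```

With these inputs the function selects the a of as in pur+as, the a of am in pur+en+am, and the root vowel in solt+o, which matches the descriptions of these forms in section 4.2.1. The invented all-weak string receives stress on its first vowel.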

Before proceeding to the analysis of formatives I would like to make a terminological remark: when a morpheme consists of one formative I will use the simplified wording "morpheme X is strong/weak" instead of the more precise "the formative corresponding to the morpheme X is strong/weak".

4.1.1. How can one determine whether a formative is weak or strong?

First of all, all formatives that do not contain vowels should be defined as weak, as they cannot carry the stress and do not influence the stress position. Only formatives that contain vowels will be discussed below.

Formatives are divided into two groups: stem formatives (that form roots and derivational suffixes) and inflectional formatives.

For stem formatives the following rule applies: if a formative contains at least one vowel other than the reduced vowel, it is strong. If all of its vowels are reduced, the formative is weak. There are two exceptions among the adverbalizers (see 4.3).
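The classification of stem formatives can be written as a one-line test. The sketch below is hypothetical in its names and assumes that the reduced vowel is written `ə` and that the listed characters are the full vowels; the two exceptional adverbalizers are not modelled, because they are homonymous with strong formatives and cannot be told apart by their segments.

```python
# Assumed inventories, for illustration only; the reduced vowel is written "ə".
REDUCED_VOWEL = "ə"
FULL_VOWELS = set("aeiouäöü")


def stem_formative_is_strong(text: str) -> bool:
    """A stem formative is strong if it contains at least one full vowel.

    Formatives with only reduced vowels, or with no vowel at all, are weak.
    Lexical exceptions (the weak adverbalizers) are deliberately ignored.
    """
    return any(ch in FULL_VOWELS for ch in text)


# "kutu", "c" and "alt" are stem formatives cited in the text; "kəl" is invented.
examples = {name: stem_formative_is_strong(name) for name in ["kutu", "c", "alt", "kəl"]}
```

Under these assumptions the root kutu and the derivational formative alt come out strong, while the vowelless formative c and the invented string with only a reduced vowel come out weak.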

Inflectional formatives can be either weak or strong. The property "to be weak" or "to be strong" is explicitly attributed to every formative. A detailed discussion of inflectional formatives follows in section 4.2 and the full list of inflectional formatives with their accentuation characteristics is given in Table 10.

4.1.2. Harmonic formatives

A very important innovation that I introduce is an inflectional formative that I call "harmonic formative". (8) It consists of only one vowel o, o or e (usually it is the final vowel of the form), which is always unstressed. The three variants of the harmonic formative are distributed according to the vowel harmony rule. (9) For example, the inessive marker has three allomorphs (e)sto, (e)sto, (e)ste, cf. podesto 'cauldron:INE', portesto 'house:INE' and jaleste 'village:INE'. I suggest representing these forms on the morphophonological level as pod+est+o, port+est+o and jal+est+e, i.e. splitting the inessive marker into two formatives.

The harmonic formative is always weak. It is found in the following forms:

* imperative 2 singular (solto 'to cook:IMP:2SG'--solt+o, kece 'to hang:IMP:2SG'--kec+e, julo 'to burn:IMP:2SG'--jul+o);

* 3 singular past 1 form (uzo 'to see:PAST1:3SG'--uz+o; pucko 'to cut:PAST1:3SG' --puck+o; pide 'to knit:PAST1:3SG'--pid+e);

* passive participle (ludmo 'to read:PRTPASS'--lud+m+o, kustemo 'to give orders:PRTPASS'--kust+em+o, sendeme 'to put:PRTPASS'--send+em+e);

* negative participle (soltedemo 'to cook:PRTNEG'--solt+edem+o, kustedemo 'to give orders:PRTNEG'--kust+edem+o, iledeme 'to live:PRTNEG'--il+edem+e);

* 3 singular possessive form (poskudezo 'neighbour:POSS.3SG'--poskud+ez+o, uderzo 'daughter:POSS.3SG'--uder+z+o, kajekse 'bird:POSS.3SG'--kajek+s+e);

* inessive case (see examples above);

* illative case (podesko 'cauldron:ILL'--pod+esk+o, portesko 'house:ILL'--port+esk+o, jaleske 'village:ILL'--jal+esk+e).

The same formative is part of the following suffixes (traditionally described as derivational): demo/demo/deme, (10) lo/lo/le, so/so/se (derive adjectives from nouns), zo/zo/ze, co/co/ce (derive nouns from nouns), and eske (in this suffix the final vowel is always e because it is preceded by another e that unambiguously defines the harmonic variant). Therefore I suggest representing e.g. kutuco 'shepherd' as kutu+c+o (the root kutu, the derivational formative c and the inflectional harmonic formative o) instead of the traditional kutu+co (the root kutu and the derivational suffix co). Similarly, the morphophonological representation of imneske 'rider' (from imne 'horse') should be imn+esk+e. (11)

The final e in the gerund markers meske and meke is also considered as a harmonic formative defined by the previous vowel e (i.e. the same as in eske).

It is easy to notice that many nominative forms of nouns and adjectives end in the harmonic formative, as their final vowel correlates with the quality of the preceding vowel, compare for example, sud+o 'grass', muskend+o 'fist', ust+o 'belt', surg+o 'face', erg+e 'son', karm+e 'fly'. (12) However, the harmonic formative should not be confused with those final o and e that are stressed and therefore do not follow the vowel harmony rule: sopke 'aspen', kino 'cinema'. In these examples the final vowel is a part of the root but not a separate formative.

Summing up, if a nominative form ends in an unstressed o, o or e, these are always the harmonic formative. If the nominative ends in a consonant or in a stressed u, u, i, a, e, o, there is no harmonic formative in the word. Thus, the words serge 'expensive:NOM' and serge 'comb:NOM', which were discussed above, have different morphophonological representations: the first consists of two formatives (serg+e), while the second consists only of one formative (serge). It should be emphasized that the harmonic formative is not preserved in oblique case forms: (13) it either disappears or transforms into e (14) (cf. sudo 'grass:NOM' and suden 'grass:GEN'). The only exception is plural forms with the plural marker vlak or samec following the stem (e.g. subo-vlak-este 'fur coat-PL-INE').

4.1.3. Restrictions to the rule

The proposed accentuation rule has the following limitations:

1. It applies first of all to native Mari words. Loan words often preserve the original stress and can contradict the rule, e.g. termos 'thermos', but not *termos.

2. The rule concerns only primary stress. Secondary stress does not distinguish words in Mari and is not considered in this article.

3. In this paper I do not study the stress in compounds and in word combinations that form one phonetic word (like some analytic verbal forms do). It seems however that it should not be difficult to adapt the rule to compound words.

4. The rule was elaborated for the main morphological classes of words (nouns, verbs, and adjectives). In some peripheral morphological classes (particles, onomatopoetic words, etc.) one can find examples that possibly break the rule, e.g. teve 'here', seve-seve 'about whispering' (an onomatopoetic word). (15)

Adverbs pose a separate problem, not because they contradict the rule, but because their morphophonological structure is blurred by lexicalization. Adverbs are discussed in section 4.3.

5. There are specific cases where the position of stress can vary and is difficult to define even for native speakers of Mari. I observed two such cases in my material: the dative case forms (see the discussion in section 4.2.2), and the sela gerunds with a possessive suffix (see section 4.2.1).

4.2. Inflectional formatives

In order to predict the position of stress in a word one has to correctly identify the formatives as weak or strong. As noted above, there is no problem in qualifying stem formatives: they are strong if they contain any full vowel(s) (except for the weak adverbalizers ge and la, see section 4.3). On the other hand, an inflectional formative can be either weak or strong, and there is no direct correlation between its segmental structure and accentuation characteristics (the correlations that do exist are described in section 4.2.4). In this section I will go through the Mari inflection system and define the accentuation characteristics for all inflectional formatives. An inflectional marker may consist of one or more formatives (a full list of markers is given in Table 10). All strong formatives are marked in bold italics.

4.2.1. Inflectional formatives in verbs

Mari finite verbal forms have the following structure:

<Stem>--<Tense/mood marker>--<Personal marker>

Present forms

There is no tense marker in the present forms.

All personal markers are strong except for the 3Pl marker in conjugation I.

Examples: (conjugation I) pur+es 'to gnaw-PRS:3SG', pur+ena 'to gnaw-PRS:1PL', pur+et 'to gnaw-PRS:3PL'; (conjugation II) solt+a 'to cook-PRS:3SG', solt+ena 'to cook-PRS:1PL', solt+at 'to cook-PRS:3PL'.

Past 1 forms

There is no tense marker in conjugation I. In conjugation II the marker is es, and it is weak.

All personal markers are weak.

Examples: (conjugation I) pur+em 'to gnaw-PAST1:1SG', pur+o 'to gnaw-PAST1:3SG', pur+na 'to gnaw-PAST1:1PL', pur+ec 'to gnaw-PAST1:3PL'; (conjugation II) solt+es+em 'to cook-PAST1-1SG', solt+es+da 'to cook-PAST1-2PL', solt+es+t 'to cook-PAST1-3PL'.

Past 2 forms

The tense marker en of conjugation I is weak, while the tense marker en of conjugation II is strong.

Personal markers are the same for both conjugations. The 1Sg and 2Sg personal markers are strong. All other personal markers are weak.

Examples: (conjugation I) pur+en+am 'to gnaw-PAST2-1SG', pur+en 'to gnaw-PAST2-3SG', pur+en+et 'to gnaw-PAST2-3PL'; (conjugation II) solt+en+at 'to cook-PAST2-2SG', solt+en 'to cook-PAST2:3SG', solt+en+na 'to cook-PAST2-1PL'.

Imperative forms

There is no mood marker in the imperative. All formatives in personal markers are weak.

Examples: (conjugation I) pur 'to gnaw:IMP:2SG', pur+z+o 'to gnaw-IMP:3SG', pur+za 'to gnaw-IMP:2PL', pur+est 'to gnaw-IMP:3PL'; (conjugation II) solt+o 'to cook-IMP:2SG', solt+ez+o 'to cook-IMP:3SG', solt+eza 'to cook-IMP:2PL', solt+est 'to cook-IMP:3PL'.

Desiderative mood

There are two markers of the desiderative: ne for conjugation I and ene for conjugation II. Both are strong. The 1Pl and 2Pl personal markers are strong; all other personal markers are weak.

Examples: (conjugation I) pur+ne+z+e 'to gnaw-DES-3SG', pur+ne+da 'to gnaw-DES-2PL'; (conjugation II) solt+ene+m 'to cook-DES-1SG', solt+ene+na 'to cook-DES-1PL'.

Nonfinite verbal forms

In Mari, the nonfinite verbal forms are participles, gerunds and the infinitive.

There are four different participles in Mari: active, passive, negative and future participle. The marker of the future participle is strong while all other participles have weak markers.

Examples: (conjugation I) pur+s+o 'to gnaw-PRTACT', pur+m+o 'to gnaw-PRTPASS', pur+sas 'to gnaw-PRTFUT'; (conjugation II) solt+em+o 'to cook-PRTPASS', solt+edem+o 'to cook-PRTNEG'.

There are several gerunds in Mari: affirmative, negative, posterior, anterior and simultaneous.

The formatives en and en can possibly be identified with the markers of the past 2 tense. The marker (e)sela is considered as consisting of two formatives because it can be split by a possessive marker (e.g. portelsemla ← portel+se+em+la 'to return:GERSIM:POSS.1SG'); see also [TEXT NOT REPRODUCIBLE IN ASCII] 1964:174-175 on the origin of s(e) and la. The part ske in the marker (e)meske originates from a lative case marker ([TEXT NOT REPRODUCIBLE IN ASCII] 1964:173). The part me in (e)meke can be identified with me in (e)meske.

Examples: pur+en 'to gnaw-GERAFF', kol+me+k+e 'to hear-GERANT', lud+de 'to read-GERNEG', tol+me+sk+ana 'to come-GERPOST-POSS.1PL', tol+me+sk+est 'to come-GERPOST-POSS.3PL', tol+se+la 'to come-GERSIM'.

A specific feature of sela gerunds is the possibility of attaching a possessive marker, which is placed between the two parts of the gerund marker (i.e. between two formatives) (cf. [TEXT NOT REPRODUCIBLE IN ASCII] 1961:264-265). If the possessive marker is strong, the position of stress is not clearly definable. Native speakers admit that the stress can be either on the possessive marker or on the final la (compare with the dative marker lan discussed below). For example, pid+s+ed+la ~ pid+s+ed+la (17) 'to knit-GERSIM-POSS.2SG', kuc+es+em+la ~ kuc+es+em+la 'to catch-GERSIM-POSS.1SG'.

The infinitive marker as is strong. For example, pur+as 'to gnaw-INF', solt+as 'to cook-INF'.

4.2.2. Inflectional formatives in nouns

Nominal forms consist of the stem plus case, number and possessive markers. The order of affixes can vary (Luutonen 1997).

As mentioned above, I distinguish inflectional formatives in the nominative singular form of some words. These formatives are the final e, o and o, which are always weak and either change into the reduced vowel or disappear in oblique case forms (erg+e 'son'--erg+en 'son-GEN', sud+o 'grass'--sud+en 'grass-GEN', surg+o 'face'--surg+en 'face-GEN'). Again I would like to emphasize that in words ending in stressed e and o (18) these vowels are part of the root and are preserved in other case forms (sokte 'sieve'--sokte+n 'sieve-GEN'). These stressed final vowels are not considered separate formatives.

All case markers except for the dative, lative and comitative cases consist of weak formatives. Dative forms are unique in the Mari declension system, because the position of stress cannot be unambiguously defined there (in all other forms the position of stress is clear both for the native speakers and researchers). The dative forms can be pronounced with the stress on the case marker or on the stem or without a prominent stress at all. This specific feature of the dative case was noted previously ([TEXT NOT REPRODUCIBLE IN ASCII] 1960:136; [TEXT NOT REPRODUCIBLE IN ASCII] 1961:72).

The markers of the plural vlak, la, samec are strong. (21)

The 1st and 2nd person possessive markers are strong (if they contain a vowel), while the 3rd person possessive markers are weak.

Examples: jal+la+st+ana 'village-PL-INE-POSS.1PL', serge+vlak+est+en 'comb-PL-POSS.3PL-GEN', port+esk+et 'house-ILL-POSS.2SG', poskud+ez+em 'neighbour-POSS.3SG-ACC'.

The comparative degree of adjectives (and also adverbs) is marked with the suffix rak, which is strong, e.g. motor+rak 'more beautiful', suksu+rak 'worse'. (22)

4.2.3. A list of inflectional markers and their accentuation characteristics

Table 10 lists inflectional markers with their accentuation characteristic. Each marker consists of one, two or three formatives (strong formatives are marked in bold italics). Markers that do not contain vowels are not included in the table.

Two formatives (lan of the dative case and la in gerunds with a possessive marker) are defined as "conventionally strong" and marked with bold. These formatives occur in forms where the position of the stress is not evident (see sections 4.2.2 and 4.2.1). However, their behavior is not the same. The position of the stress is not evident in all dative forms, while the formative la in sela gerunds usually behaves as a normal strong formative. Accentual ambiguity appears only if a strong possessive marker is inserted between se and la.

Table 11 gives several examples of Mari forms divided into formatives. The position of stress as predicted by the accentuation rule is indicated in the second column.

The last two examples in Table 11 contain a derivational formative alt. These examples demonstrate that drawing a border between stem formatives is not significant from the point of view of accentuation (except for two weak formatives ge and la, which are discussed in section 4.3). Even if the derivational suffix is not considered as a separate formative (todelalt+ne+ze and todelalt+et) the accentuation rule gives the same result. In general, any sequence consisting of strong formatives is not sensitive to the borders between them, as these borders do not influence the rule: in such a sequence the stress is always on the rightmost full vowel. (23)

4.2.4. Correlations between the segmental structure and the accentuation characteristics of a formative

The analyzed data reveal certain correlations between the segmental structure of formatives and their accentuation characteristics.

1. If there are no full vowels in a formative (of any type), the formative is weak.

2. If a formative ends in a consonant and contains at least one full vowel, it is strong. Similar to the previous one, this rule applies to any type of formatives.

3. If an inflectional marker contains o or o vowels, these can only represent the harmonic formative. As there are no other full vowels in inflectional markers with the harmonic formative o or o, and the latter one is always weak, any marker containing o or o cannot be stressed.

4. Vowels u, u and i do not occur in inflectional formatives. Taking into account what was said above about stem formatives, any formative containing u, u or i is strong.

5. The presence of a and e in an inflectional formative does not indicate its accentuation characteristics, cf. the weak da in the past tense 2Pl forms with the strong da in 2Pl possessive forms, or the strong ge in the comitative with the weak ze in the 3Pl form of the desiderative mood.

These observations can be summed up in the following statements:

1. All inflectional formatives that do not contain a or e are weak.

2. Inflectional formatives that contain a or e and end in a consonant are always strong.

3. Inflectional formatives that contain a or e and do not end in a consonant can be either weak or strong.
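The three statements amount to a small decision procedure, sketched below in Python. The function name and the returned labels are hypothetical, and the sketch assumes a transcription in which the reduced vowel is written `ə` and is thus kept apart from full e; it should not be applied directly to the flattened transliteration, where this contrast is not visible. Formatives described in the text as "conventionally strong" are not treated specially.

```python
# Assumed inventories, for illustration only; the reduced vowel is written "ə".
FULL_VOWELS = set("aeiouäöü")
VOWELS = FULL_VOWELS | {"ə"}


def classify_inflectional(text: str) -> str:
    """Classify an inflectional formative by its segmental shape alone."""
    # Statement 1: no a and no e means the formative is weak.
    if not any(ch in "ae" for ch in text):
        return "weak"
    # Statement 2: a or e is present and the formative ends in a consonant.
    if text[-1] not in VOWELS:
        return "strong"
    # Statement 3: a or e is present and the formative ends in a vowel,
    # so its class has to be listed for the individual formative.
    return "lexically specified"


# "as", "rak", "o", "da" and "ge" are cited in the text; "əm" is invented.
labels = {f: classify_inflectional(f) for f in ["as", "rak", "o", "da", "ge", "əm"]}
```

The outcome for da and ge reflects the observation above that the same segmental shape can be weak in one marker and strong in another, so only the first two statements yield a definite class.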

4.3. Stress in adverbs

Adverbs pose a separate problem; compare words with similar segmental structure but different position of stress: ende 'now, already' but pekse 'hardly, only just'; ize 'only now' but ese 'still, yet'.

The source of the problem is evident: adverbs often originate from lexicalized grammatical forms, so splitting them into formatives can be problematic. If an adverb is formally a regular grammatical form of a noun or an adjective, it can be divided into the corresponding formatives (e.g. solaske 'to the left' ← sola 'left' + ske 'ILL'). In other cases the situation is more complicated. Adverbs do not appear to violate the proposed accentuation rule; the problem arises because there is no accurate morphophonological description of adverbs. As providing such a description is beyond the scope of this study, I will only discuss the most important points concerning adverbs here.

1. There are two homonymous adverbalizers la in Mari. One of them is accentually strong and is used to derive adverbs from nouns denoting nationality: tatarla 'in Tatar; in the Tatar way', rusla 'in Russian; in the Russian way'. (24) The second one is accentually weak and should probably be identified with the comparative suffix (see section 4.2.2), as it "derives the adverbs of manner with a shade of comparative meaning" ([TEXT NOT REPRODUCIBLE IN ASCII] 1961:280): pirela 'as a wolf', jocala 'as a child'.

2. There are two different adverbalizers ge. One of them is strong and should probably be identified with the comitative marker, as "the suffix ge derives adverbs with meaning of something aggregated" ([TEXT NOT REPRODUCIBLE IN ASCII] 1961:281): e.g. tejge 'with you'.

The other ge is weak, e.g. ruzge 'hand in hand, together'. (25)

3. As in other parts of speech, many adverbs end in a harmonic formative, i.e. in a vowel that obviously follows the vowel harmony rule: mongo '(to) home', munderko 'far', suko 'many', umbake 'far', erdene 'in the morning', ende 'now, already', tace 'today', ize 'just now'. However, since adverbs cannot be declined, there is no accentually independent criterion that proves the existence of the harmonic formative (unlike in nouns, where the harmonic formative is easily revealed through oblique case forms).

Once again I would like to point out that the morphophonology of Mari adverbs needs a separate investigation, but a preliminary study did not show noticeable contradictions to the proposed accentuation rule.

4.4. Conclusions

The analysis presented above allows the following conclusions to be drawn about the accentuation system in the Staryj Torjal variety of Mari:

1. There is a formal rule that defines the position of stress in a form. The rule operates on the morphophonological level; it is not enough to consider only phonetics and phonology (there are lexemes with the same segmental structure but with a different position of stress).

2. The position of stress does not correlate directly with the grammatical class of a word (i.e. part of speech).

3. In order to locate the position of stress in a form the following steps should be made:

a) Splitting the form into formatives;

b) Defining the accentuation characteristics of each formative (weak or strong) with the help of Table 10.

c) Applying the accentuation rule described in section 4.1.

4. It is crucial to distinguish unstressed final vowels e/o/o as a separate harmonic formative. This formative is also important for the description of the Mari morphophonological system in general. For example, nouns ending in this formative constitute a separate paradigmatic type ([TEXT NOT REPRODUCIBLE IN ASCII] 2003).

5. There is a correlation between the vowel harmony and accentuation. Harmonic formatives are never accentually strong. In most (but not in all (26)) cases the vowel that determines the harmonic variant is stressed.

6. The e vowel has a special status in Mari. It is not only shorter than other vowels (Lehiste, Teras, Help, Lippus, Meister, Pajusalu, Viitso 2005), but it is also opposed to full vowels from the point of view of accentuation--strong formatives cannot contain only reduced vowels.

5. Mari accentuation system in the Uralic context

This section addresses the question of whether the described system of Mari accentuation is exceptional or typical for the Uralic languages. The analysis I give on the subject is not comprehensive, but it allows some preliminary answers to be formulated.

From what we can find in grammars, the Uralic languages can be divided into three groups from the point of view of accentuation:

1. Languages with distinct fixed stress.

2. Languages with distinct variable stress.

3. Languages with non-distinct stress.

This classification is obviously rather imprecise.

First, the prosodic features of many Uralic languages are not studied well enough and the existing descriptions give only basic (and sometimes contradictory) information.

For example, for Enets (the Forest dialect) N. M. Terescenko ([TEXT NOT REPRODUCIBLE IN ASCII] 1993b:345) noted that "stress comes mainly on the first syllable". Florian Siegl (2011:91) wrote that "Stress is fixed and falls on the first syllable. [---] A weak secondary stress falls on following odd-numbered syllables". A. Sluinskij (in private communication) claimed that although the first syllable is undoubtedly marked from the point of view of accentuation, the situation is much more complicated: a preliminary analysis of the main prosodic features (intensity, length and pitch of vowels) did not show any interpretable correlation with the stress, and additionally some affixes can affect the stress position.

Second, there is significant variation between dialects, and accentuation characteristics should be attributed to a particular dialect rather than to the language in general (for example, in Eastern Mari there are many words that have two stress patterns (Sebeok, Ingemann 1961:9), but this is entirely atypical of Meadow Mari).

Third, there are languages that combine features from different groups.

In any case, it is evident that the Uralic family is not homogeneous from the point of view of accentuation systems. Nevertheless, I suggest that there are features typical of accentuation in most Uralic languages: the correlation with the distribution of weak and strong units in a word, and a close tie between prosody and morphology on the synchronic level.

5.1. Languages with distinct fixed stress

This group of languages is the biggest in the Uralic family. According to grammars it includes all Finnic and Sami languages, Moksha, Udmurt, Nganasan, Hungarian and Mansi.

In most cases languages from this group have the stress on the first syllable, but in some languages another syllable carries the main stress (e.g. the last syllable in Udmurt and the penultimate syllable in Nganasan). In grammars one can mostly find only short passages about stress, such as: "In this language the primary stress comes on the first syllable and the secondary stress comes on odd syllables". However, papers with a more detailed analysis of the stress show that apart from the primary stress (and the secondary stress tied to the odd syllables) these languages demonstrate many additional, rather nontrivial prosodic features and interesting cases that exceed the bounds of the main principle.

Let us consider several examples:

5.1.1. In Estonian, the main stress falls on the first syllable (with rare exceptions usually in loan words) and does not mark differences in lexical meaning or grammatical function. Both differences are marked, however, by three quantity degrees (traditionally referred to as Q1, Q2 and Q3) that are contrastive in prosodic feet (Ross, Lehiste 2001:38). The contrast of quantities is based on a whole set of characteristics that are strongly intertwined, including the syllabic length, syllabic weight and pitch contour (Viitso 2003:10-20). The quantity is often the only feature that distinguishes morphological forms of a word; compare, for example, 'linna 'town:GEN.SG' of Q2 and "linna 'town:PART.SG' of Q3 (Viitso 2003:14). It is also possible to find correlations between the quantity degrees and morphological markers. (27)

5.1.2. In Ingrian (Soikkola dialect), which also has the primary stress on the first syllable, the prolongation and reduction of vowels and consonants can be described via the contrast of light, heavy and extra heavy feet ([TEXT NOT REPRODUCIBLE IN ASCII] 2009; 2012). Some morphological markers determine the weight of the foot (e.g. the inessive marker conditions the extra heavy foot, cf. light 'town.NOM' and extra heavy lin.naz 'town.INESS').

5.1.3. In Udmurt, the last syllable is usually stressed but there are a number of deviations from this principle. For instance, according to Geisler (2005) in the imperative forms the stress usually comes on the first syllable; the adverbalizer ak makes the position of stress not distinct; in some dialects there are suffixes that move stress onto the first syllable, and so on.

5.1.4. In Nganasan, usually the penultimate syllable is stressed. However, according to E. Helimski (1998:486-487) "This general principle is optionally violated by the retraction of stress from a high vowel or o to the vowel (usually an open one) in the preceding syllable: barusji ~ barusji 'devil'. Longer words (with five syllables and more--such words are very common in Nganasan) are usually divided into two, three or four--potentially even more--rhythmic groups. Each group typically contains two syllables (two phonological vowels), and the last group has two or three syllables. It is very common--especially for verbal forms--that the stem and the derivational suffix (or suffixes) are bisyllabic, so that the boundaries between groups in most cases coincide with the boundaries between morphemes, while the last group includes a cluster of inflectional suffixes (sometimes together with a monosyllabic derivational suffix). [---] The last group has, according to the general rule, the main stress on its penultimate vowel, and all preceding groups receive additional stresses on their first vowels. [---] The rhythmic organization of words plays an important role also in the morphophonology of Nganasan, regulating the phenomenon of rhythmic gradation".

5.2. Languages with distinct variable stress

The second group of languages is relatively small. It includes Mari (analyzed in detail in the first part of this paper), Selkup and Komi-Permyak.

5.2.1. In Selkup "the position of stress demonstrates a dualistic phonetic-morphological dependence" ([TEXT NOT REPRODUCIBLE IN ASCII] 1980:137). There are several types of affixes that condition a different position of stress. One type includes affixes without vowels or only with the u vowel. In words containing only such affixes the stress comes on the last long vowel or on the first syllable, if there are no long vowels in the word. Affixes of the second type contain short vowels other than u. These affixes are stressed if they are not followed by long vowels or other affixes of the same type in subsequent syllables. The third type of affixes makes the form double-stressed. It is easy to notice the similarity between this system and the Mari accentuation that operates with weak formatives, strong formatives and lan and la formatives that distribute the stress between two syllables.
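The default case described above (words whose affixes all belong to the first type) can be sketched as a small function. This is only an illustration under stated assumptions: the function name and the representation of a word as a list of per-syllable vowel lengths are hypothetical, and the second and third affix types are deliberately not modelled.

```python
from typing import Sequence


def selkup_default_stress(long_vowel: Sequence[bool]) -> int:
    """Return the 0-based index of the stressed syllable.

    `long_vowel[i]` is True when syllable i contains a long vowel.
    Covers only words whose affixes are all of the first type
    (no vowel, or only the u vowel); other affix types are ignored.
    """
    if not long_vowel:
        raise ValueError("a word must contain at least one syllable")
    # Scan from the right: the last long vowel attracts the stress.
    for index in range(len(long_vowel) - 1, -1, -1):
        if long_vowel[index]:
            return index
    # No long vowels at all: the stress falls on the first syllable.
    return 0
```

For a four-syllable word with long vowels in the second and third syllables, `selkup_default_stress([False, True, True, False])` selects the third syllable (index 2); with no long vowels it falls back to index 0.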

5.2.2. In Komi-Permyak the stress is variable. R. M. Batalova ([TEXT NOT REPRODUCIBLE IN ASCII] 1993:230) noted: "The Komi-Permyak stress takes a special place among other Permian and Finno-Ugric languages because it is morphologically dependent". V. I. Lytkin ([TEXT NOT REPRODUCIBLE IN ASCII] 1966:303) wrote that the stress "always comes on one of the syllables within the stem. The position of stress depends on a morpheme: some morphemes are always unstressed, some are stressed, and some draw the stress to themselves or to the previous syllable". M. Geisler (2005:162-172) conducted a detailed analysis of the Komi-Permyak stress and distinguished two groups of dialects that have different accentuation schemes. In the first group the stress is "vocalic-qualitative", and the general accentuation rule is based on the division of the first syllable vowels into three groups: a) a, e, e, o, which are stressed; b) i, u, which are stressed in some words and unstressed in other words; c) i, which is always unstressed. However, this rule has a number of exceptions, because some morphemes can draw the stress to themselves. In the second dialectal group the stress is morphological and its position depends on derivational suffixes. There are two types of such suffixes: those that are stressed and those that are not stressed but draw the stress to the previous syllable.

5.2.3. There are rather contradictory data about Khanty accentuation (possibly due to a strong dialectal variation). L. Honti ([TEXT NOT REPRODUCIBLE IN ASCII] 1993:303) wrote that in the Surgut variety of the Eastern dialect "the word stress is dynamic, it comes usually on the first syllable, but in certain cases can move to the second syllable". A. Filchenko (2007:57-60) described a much more complex system in Eastern Khanty (Vasygan and Alexandrovo varieties), where the stress can be on the first or second syllable but "it is typical for poly-syllabic words to have multiple stress" and "in tri-syllabic words, stress appears to fluctuate between the first and second syllable and no decisive pattern appears to be clear". I. Nikolaeva (1999:10) gave a different description for the Northern dialect (Obdorsk variety): "The stress system is based on an unbounded quantity-sensitive foot constructed from left to right. The primary stress falls on the leftmost heavy syllable. In the absence of heavy syllables, stress lodges on the first syllable in the word". Again, it is easy to note some similarity between Khanty accentuation as described by I. Nikolaeva and Mari accentuation. (28)
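The rule quoted from I. Nikolaeva can be restated procedurally. The sketch below is a hypothetical illustration: the function name is invented, and syllable weight is taken as given input, since the criteria for a heavy syllable are not stated in the quotation.

```python
from typing import Sequence


def leftmost_heavy_stress(heavy: Sequence[bool]) -> int:
    """Return the 0-based index of the syllable with primary stress.

    `heavy[i]` is True when syllable i counts as heavy. How weight is
    determined is an assumption left to the caller.
    """
    if not heavy:
        raise ValueError("a word must contain at least one syllable")
    for index, is_heavy in enumerate(heavy):
        if is_heavy:
            return index  # the leftmost heavy syllable is stressed
    return 0  # no heavy syllables: stress lodges on the first syllable
```

The scan runs from the left edge of the word and stops at the first heavy syllable; the fallback to the first syllable applies only when no heavy syllable is found.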

5.3. Languages with non-distinct stress

This group includes Erzya, Nenets and possibly Komi-Zyrian and Enets (the latter was discussed in section 5).

5.3.1. Erzya is the most striking example of this group of languages. A. P. Feoktistov ([TEXT NOT REPRODUCIBLE IN ASCII] 1993:192) noted: "In Erzya the free stress prevails and its position depends on the speech rhythmic unit that includes a particular word or form". Lehiste et al. gave a detailed analysis of the acoustic characteristics of stress and concluded that "in disyllabic and even trisyllabic words, the opposition between a stressed and an unstressed syllable is hardly perceptible; the location of stress in a word can alternate without a change in meaning (e.g. mo*ro/moro* 'song', ko*moro/komo*ro 'handful, palm of a hand'); it is maintained that alternations in the location of word stress are phonologically non-distinctive, nor do they change a word into a non-word" (Lehiste, Aasmae, Meister, Pajusalu, Teras, Viitso 2003). N. Aasmae (2006:162-164) noted that the position of stress in Erzya depends on the rhythmic features of the utterance and that Erzya dialects vary in both the degree of stress mobility and the acoustic characteristics of the syllables.

5.3.2. Nenets can also be considered as belonging to this group, because of the ambiguous nature of the stress. N. M. Terescenko ([TEXT NOT REPRODUCIBLE IN ASCII] 1993a:328) noted: "The Nenets stress is variable and movable: veva 'bad'--vevdvna 'badly'--vevarkdvna 'worse'. There are also words with a double or multiple stress. In some words the stress is distributed evenly among the syllables and some words are pronounced without any stress at all. The unstressed final a is reduced both in quality and in quantity. The stress can distinguish lexical meaning: toda(s) 'to vomit'--toda(s) 'warm oneself by the fire' ". In Nenets there is a strong correlation between stress and vowel alternations. T. Salminen (1997:42) analyzed the process of vowel reduction, where the schwa ° is derived from the reduced vowel ø in unstressed syllables (with the condition that a syllable becomes stressed if followed by a syllable with °). The stress and correspondingly the distribution of the schwa ° and the reduced ø in some cases are dependent on the morphological characteristics of the form.

5.3.3. Komi-Zyrian combines features of languages with both fixed and non-distinct stress (and to some extent, with variable stress). The accentuation systems of Komi-Zyrian dialects vary significantly. In most cases there is a tendency to stress the first syllable, but the stressed syllable can be ill-defined and the stress can often move to some other syllable. In some dialects there are formatives that draw the stress to themselves (Geisler 2005:157-162).

5.4. Conclusions

This short overview of the Uralic accentuation systems allows several conclusions to be drawn.

In most Uralic languages (including those with fixed stress) the prosodic system is not trivial. At the core of the Uralic accentuation systems there is an opposition of weak and strong units (syllables, feet or morphemes--depending on the language and on the particular model of description). This opposition can influence the position of stress and the phonetic realization of segments, and very often conditions the appearance of reduced vowels. (29) In some languages reduction is phonetic (e.g. in Ingrian reduced vowels are restored in a distinct pronunciation) while in some other languages the reduced vowel became a phoneme. As a result the segmental and supra-segmental levels are strongly intertwined and the analysis of prosody should also address the level of morphophonology. Being often dependent on grammatical forms, accentuation cannot be considered a purely phonetic phenomenon, and attempts to describe accentuation only on the phonetic level often lead to incorrect statements. (30)

The Mari accentuation system is a completely typical example of such a system. It is based on the contrast of weak and strong formatives and demonstrates a correlation between the stress and the quality of vowels (the accentuation behavior of the reduced vowel e and full vowels is crucially different).

The fact that similar accentuation systems can be observed in other Uralic languages (first of all, in Selkup) suggests that the main principles of the Mari accentuation system have Uralic roots and should not be interpreted as a contact-induced innovation. I would like to note that the thesis regarding the identity of Mari and Chuvash accentuation systems ([TEXT NOT REPRODUCIBLE IN ASCII] 1979:130), which leads to the idea that the Mari accentuation system was borrowed, does not seem correct. Probably, E. Helimski ([TEXT NOT REPRODUCIBLE IN ASCII] 1979) was basing his judgement on the description of Mari accentuation in [TEXT NOT REPRODUCIBLE IN ASCII] 1970:96, where the deviations from the principle that "the stress is placed on the rightmost full vowel" are considered as exceptions and such an interpretation is supported by the statement that "along with numerous foreign borrowings, especially Turkic, and along with the adoption of the language of the neighbouring Turkic nations, the accentuation system of these languages was also adopted". The Turkic languages definitely had an influence on Mari, but the Mari accentuation system demonstrates typically Uralic features.


ACC--accusative; COM--comitative; COMP--comparative (case); CMPR--comparative degree; DAT--dative; DES--desiderative mood; GEN--genitive; GERAFF--affirmative gerund; GERANT--anterior gerund; GERNEG--negative gerund; GERPOST--posterior gerund; GERSIM--simultaneous gerund; ILL--illative; IMP--imperative; IMPRS--impersonal form; INE--inessive; INF--infinitive; LAT--lative; NOM--nominative; PART--partitive; PAST1--past 1 tense; PAST2--past 2 tense; PL--plural; POSS--possessive; PRS--present/future tense; PRTACT--active participle; PRTFUT--future participle; PRTNEG--negative participle; PRTPASS--passive participle; SG--singular; (I), (II)--number of conjugation; 1SG, 2SG, ...--person and number.


Aasmae, N. 2006, Stress and Quantity in Erzya, Tartu.

Alhoniemi, A. 1993, Grammatik des Tscheremissischen (Mari), Hamburg.

Bauer, L. 2003, Introducing Linguistic Morphology. 2nd edition, Edinburgh.

Castren, M. A. 1845, Elementa grammatices Tscheremissae, Kuopio.

Filchenko, A. Yu. 2007, A Grammar of Eastern Khanty, Houston, Texas (A thesis submitted in partial fulfillment of the requirements for the degree "Doctor of Philosophy". Rice University).

Geisler, M. 2005, Vokal-Null-Alternation, Synkope und Akzent in den permischen Sprachen, Wiesbaden.

Helimski, E. 1998, Nganasan.--The Uralic Languages, London--New York, 480-515.

Lehiste, I., Aasmae, N., Meister, E., Pajusalu, K., Teras, P., Viitso, T.-R. 2003, Erzya prosody, Helsinki (MSFOu 245).

Lehiste, I., Teras, P., Help, T., Lippus, P., Meister, E., Pajusalu, K., Viitso, T.-R. 2005, Meadow Mari Prosody, Tallinn (Linguistica Uralica. Supplementary Series / Volume 3).

Lewy, E. 1962, Zur Betonung des Ostjakischen.--Commentationes Fenno-Ugricae in honorem Paavo Ravila, Helsinki (MSFOu 125), 285-287.

Luutonen, J. 1997, The Variation of Morpheme Order in Mari Declension, Helsinki (MSFOu 226).

Nikolaeva, I. 1999, Ostyak, Munchen--Newcastle (Languages of the World / Materials 305).

Ross, J., Lehiste, I. 2001, The Temporal Structure of Estonian Runic Songs, Berlin--New York.

Salminen, T. 1997, Tundra Nenets Inflexion, Helsinki (MSFOu 227).

Sebeok, Th. A., Ingemann, F. J. 1961, An Eastern Cheremis Manual. Phonology, Grammar, Texts and Glossary, Bloomington (Studies in Cheremis 9. UAS 5).

Siegl, F. 2011, Materials on Forest Enets, an indigenous language of Northern Siberia, Tartu (Dissertationes Philologiae Uralicae Universitatis Tartuensis 9).

Van der Hulst, H. 1999, Word Prosodic Systems in the Languages of Europe, Berlin--New York.

Viitso, T.-R. 2003, Structure of the Estonian Language.--Estonian Language, Tallinn, 9-129 (Linguistica Uralica. Supplementary series / Volume 1).



Fedor Rozhanskiy

University of Tartu


* The research was supported by the Estonian Research Council grant IUT2-37.

(1) Cf. [TEXT NOT REPRODUCIBLE IN ASCII] 1977:28: "The study of Mari accentuation is one of the traditional areas of Finno-Ugric studies, as Mari has a distinct system of variable stress".

(2) Here and in the following quotations I leave the Cyrillic symbols for vowels as they are presented in the sources. Symbols bl and bl correspond to the reduced vowel e, bl is a front variant of the reduced vowel, [??] is e, u is i, y is u, y is u, and other symbols are the same as in the transcription based on the Roman script. To avoid confusion, Cyrillic symbols are given in quotes <<>>.

(3) Here and below the parts of quotations that contain illustrative material or belong to a dialect other than Meadow Mari are omitted.

(4) These differences are usually regular and concern the pronunciation of certain segments, e.g. s in Staryj Torjal corresponds to c in Standard Mari.

(5) Here and below the stressed vowel in Mari forms is marked with an acute. In quotations from other papers the original accentuation marks are preserved.

(6) See, for example, Bauer 2003 regarding the notion of the "formative".

(7) A vowel is full if it is not e.

(8) Previous works on Mari did not operate with the harmonic formative. However, some researchers came very close to this idea (Alhoniemi 1993:21-22). G. I. Lavrentjev ([TEXT NOT REPRODUCIBLE IN ASCII] 1975:51) considers that nominative forms have a null flexion but at the same time he analyzes the 3Sg imperative forms as containing the suffix -e (-o, -o), which depends on the vowel harmony.

(9) The vowel harmony rule in the Staryj Torjal variety is the following: the quality of the final unstressed vowel depends on the preceding full vowel. If the preceding full vowel is a, e or i (or if there are no full vowels in the word), the final unstressed vowel is e (vate 'wife', leve 'butterfly'). If the preceding full vowel is o or u, the final vowel is o (muskendo 'fist'); if the preceding full vowel is ö or ü, the final vowel is ö (sürgö 'face'). In Standard Mari (and in some other varieties) the vowel harmony is controlled by the stressed vowel (Alhoniemi 1993:41). The difference between the varieties is seen in loan words where the rightmost full vowel is not stressed. Compare, for example, the Staryj Torjal form augustesto 'August:INE' (the final harmonic o is conditioned by the non-stressed u) with the Standard Mari avgusteste 'August:INE' (the final e is conditioned by the stressed a).
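The Staryj Torjal rule in this footnote amounts to a lookup on the rightmost full vowel. The following sketch is a hypothetical illustration, not part of the original analysis: the function name is invented, and it assumes a transcription in which the reduced vowel is written ə and the front rounded vowels are written ö and ü (these diacritics are not preserved in the ASCII rendering of the examples).

```python
import unicodedata

# Assumed mapping from the rightmost full vowel to the harmonic variant.
FULL_TO_HARMONIC = {
    "a": "e", "e": "e", "i": "e",   # unrounded full vowels -> e
    "o": "o", "u": "o",             # back rounded vowels -> o
    "ö": "ö", "ü": "ö",             # front rounded vowels -> ö
}


def harmonic_vowel(stem: str) -> str:
    """Return the harmonic variant for the material preceding the final vowel.

    The reduced vowel (written "ə" here by assumption) is not a full
    vowel and is therefore skipped, like all consonants.
    """
    # Normalize so that precomposed "ö"/"ü" match the dictionary keys.
    stem = unicodedata.normalize("NFC", stem)
    for char in reversed(stem):
        if char in FULL_TO_HARMONIC:
            return FULL_TO_HARMONIC[char]
    return "e"  # no full vowels in the word
```

Because the lookup uses the rightmost full vowel regardless of stress, a stem such as `"augustəst"` yields `"o"`, which corresponds to the Staryj Torjal behaviour described above; modelling Standard Mari would instead require knowing which vowel is stressed.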

(10) This suffix should probably be identified with the similar suffix of the participles.

(11) The final vowel of the root is dropped before eske (imn+e+esk+e → imneske).

(12) The effect of the vowel harmony is especially evident in loan words, cf. petrusko 'parsley', koftecko 'blouse' and zanaveske 'window curtain', tarelke 'plate' (in all original Russian words the final vowel is a).

(13) This fact is very important because it gives us a possibility to avoid circulus vitiosus, when the accentuation rule is based on splitting a word into formatives, but the list of formatives is compiled on the basis of accentuation characteristics. The difference in the behavior of stressed and unstressed final vowels in the declension system is an independent argument.

(14) There are forms where both interpretations are possible, e.g. the form poskuden 'neighbour:GEN' can be interpreted on the morphophonological level either as poskud+e+n (with a reduced variant of the harmonic formative--e) or as poskud+en (in this case e is a part of the marker). If the interpretation is ambiguous, I prefer the second variant, where e is considered to be a part of the marker.

(15) The accentuation rule does work for these examples if we assume that the last vowel e is a harmonic formative. However, as these words do not decline or conjugate there is no way to prove this assumption.

(16) I identify the personal marker ze in the desiderative with the personal marker z+o/o/e in the imperative. The vowel e in the desiderative markers ne or ene conditions the vowel in the personal marker, so only the variant ze is possible.

(17) The voiced consonant in the possessive marker is the result of the assimilation.

(18) In the Staryj Torjal variety there are very few nouns with final stressed e ([TEXT NOT REPRODUCIBLE IN ASCII] 2002), and final stressed o is attested only in loan words (e.g. kino 'cinema').

(19) There are different opinions about the number of morphological cases in Mari. I use the list of cases as presented in [TEXT NOT REPRODUCIBLE IN ASCII] 1987:73.

(20) In the Staryj Torjal variety there are two variants of the adverbial case marker: es and esan (for more details see [TEXT NOT REPRODUCIBLE IN ASCII] 2002). Both variants consist of strong formatives.

(21) [TEXT NOT REPRODUCIBLE IN ASCII] 1961:72 describes the plural marker vlak as "usually unstressed", but in the Staryj Torjal variety the situation is different.

(22) In Standard Mari the word 'bad' is sukso and it consists of two formatives (root plus harmonic): suks+o. Hence, the comparative degree is sukserak. In the Staryj Torjal variety the word for 'bad' is suksu. It does not have the harmonic formative and its final vowel belongs to the stem.

(23) The same is true for the sequence "weak formative + strong formative", but not for the sequence "strong formative + weak formative".

(24) [TEXT NOT REPRODUCIBLE IN ASCII] 1961:286 claims that there are only two adverbs (rusla 'in Russian' and marla 'in Mari') with the final stressed la. In the Staryj Torjal variety this group of adverbs is bigger.

(25) [TEXT NOT REPRODUCIBLE IN ASCII] 1961:281 mentions the adverbalizer ge, which derives adverbs of manner from imitative words. Most likely it should be identified with the weak ge. However, usually the etymology of adverbs with this formative is not transparent on the synchronic level, and thus we cannot use the features of the source word as a reliable criterion for distinguishing between the weak and strong ge formatives.

(26) The vowel harmony rule in the Staryj Torjal variety and in Standard Mari was discussed above in footnote 9.

(27) For example, in Viitso 2003:12: "Except for some foreign proper names, the co-occurrence of both a long monophthong or a diphthong and a geminate obstruent in a word with a syllable of Q2 is restricted (a) to genitive plural forms of some nouns and (b) to the second-person present-tense forms of monosyllabic vocalic verb stems, both of which have the suffix -te".

(28) The similarity between Khanty and Mari accentuation was mentioned in Lewy 1962:287.

(29) It does not concern only vowels. For example, Lewy (1962:287) suggested that there is a correlation between the position of stress and grade alternations in the Uralic languages. This idea is disputable but possibly it has a rational kernel.

(30) Cf. for example the description of Mari accentuation in Van der Hulst 1999:451: "In Literary Mari accent falls on the last full vowel and in words with only reduced vowels on the initial syllable. [---] One complicating factor in Literary Mari is that final open syllables are never accented".

Table 1

                   Personal markers (present tense)

                    Conjugation I    Conjugation II

         Number       SG    PL         SG     PL


1                     am    ena        em     ena
2                     at    eda        et     eda
3                     es    et         a      at

Table 2

                   Personal markers (past 1 tense)

                   Conjugation I    Conjugation II

         Number     SG       PL      SG       PL


1                   em       na      em       na
2                   ec       da      ec       da
3                   o/o/e    ec      --       t

Table 3

Personal markers (past 2 tense)

         Number    SG    PL


1                  am    na
2                  at    da
3                  --    et

Table 4

 Personal markers (imperative forms)

                    Conjugation I       Conjugation II

          Number     SG           PL      SG         PL


2                    --           za/sa   o/o/e      eza
3                    s+o/o/e      est     ez+o/o/e   est

Table 5

Personal markers (desiderative mood)

        Number    SG          PL


1                 m           na
2                 t           da
3                 z+e (16)    st

Table 6


Participle                  Formative

Active                      (e)s+o/o/e
Passive                     (e)m+o/o/e
Negative                    (e)dem+o/o/e
Future                      (e)sas

Table 7


Gerund                  Formative

Affirmative             en/en
Negative                (e)de
Posterior               (e)me+sk+e
Anterior                (e)me+k+e
Simultaneous            (e)se+la

Table 8

Case markers (19)

NOM          - / o/o/e
GEN          (e)n
ACC          (e)m
DAT          lan
ILL          (e)s(k+o/o/e)
INE          (e)st+o/o/e
LAT          es / es+an (20)
COM          ge
COMP         la

Table 9

Possessive markers

1SG      m/em
2SG      t/et
3SG      (e)z+o/o/e / s+o/o/e
1PL      (e)na
2PL      (e)da
3PL      (e)st

Table 10

A list of inflectional markers

a               PRS.3SG (II)
am              PRS.1SG (I)
am              PAST2.1SG
as              INF
at              PRS.2SG (I)
at              PRS.3PL (II)
at              PAST2.2SG
da              PAST1.2PL
da              PAST2.2PL
da              DES.2PL
da              POSS.2PL
(e)de           GERNEG
(e)dem+o/o/e    PRTNEG
eda             PRS.2PL (II)
em              PRS.1SG (II)
em              POSS.1SG
en              PAST2 (II)
en              GERAFF
ena             PRS.1PL (II)
es              PRS.3SG (I)
es(+an)         LAT
et              PRS.2SG (II)
et              POSS.2SG
ge              COM
ec              PAST1.2SG (I)
ec              PAST1.3PL (I)
ec              PAST1.2SG (II)
eda             PRS.2PL (I)
em              PAST1.1SG (I)
em              ACC
en              PAST2 (I)
en              GERAFF
en              GEN
ena             PRS.1PL (I)
es              PAST1 (II)
es              ILL
est             IMP.3PL
est             POSS.3PL
et              PRS.3PL (I)
et              PAST2.3PL
la              COMP
la              PL
lan             DAT
m+o/o/e         PRTPASS
(e)me+k+e       GERANT
(e)me+sk+e      GERPOST
na              PAST1.1PL
na              PAST2.1PL
na              DES.1PL
na              POSS.1PL
(e)ne           DES
o/o/e           PAST1.3SG (I)
o/o/e           IMP.2SG
o/o/e           NOM
rak             CMPR
sa              IMP.2PL (I)
samec           PL
(e)sas          PRTFUT
s+o/o/e         IMP.3SG (I)
(e)s+o/o/e      PRTACT
(e)se+la        GERSIM
s+o/o/e         POSS.3SG
(e)sk+o/o/e     ILL
(e)st+o/o/e     INE
vlak            PL
(e)za           IMP.2PL (I, II)
(e)z+o/o/e      IMP.3SG (I, II)
z+e             DES.3SG
z+o/o/e         POSS.3SG

Table 11

Samples of accentuation in various forms

Formative structure      Resulting form          Gloss
                         with a stress mark

este+dem+e               estedeme                to do-PRTNEG
port+em+em               portemem                house-POSS.1SG-ACC
jal+esk+em               jaleskem                village-ILL-POSS.1SG
codera+st+e              coderáste               forest-INE
kudevec+e                kudevece                yard.NOM
ru+eda                   ruedá                   to cut-PRS.2PL
kenel+en+et              kenelenet               to get up-PAST2-3PL
usan+es+na               usanesna                to believe-PAST1-1PL
usan+ene+na              usanenena               to believe-DES-1PL
todal+alt+ne+z+e         todalaltneze            to break-DES-3SG
todal+alt+et             todaláltet              to break-PRS.3PL
