This study discusses the origins and subsequent development of conditional and non-first-conjugation imperfect indicative desinences in varieties of Occitan. The systematic identity between these series of desinences can be traced to their common origin (Latin imperfect indicative forms in ‑ēbam, ‑ēbas, ‑ēbat, etc.), and predisposes them to undergo the same sound changes as each other: a change of particular interest is the irregular loss of intervocalic -b-, which is here argued to originate in the conditional. Strikingly, the identity between the series of desinences is also maintained in three cases of analogical change (levelling and extension) reported here, indicating strong implicational relationships between the series. In the light of these historical and comparative facts, the study proposes that the identity between conditional and non-first-conjugation imperfect indicative desinences is most accurately captured by treating it as morphomic.




1. Introduction

This study discusses the implicational relationships holding between exponents of the paradigm categories conventionally labelled imperfect indicative (henceforth ipfv.ind) and synthetic conditional (henceforth cond) in varieties of Occitan (Gallo-Romance)1.

Most Occitan varieties present systematic identity between series of desinences in the cond forms of all lexemes and in the ipfv.ind forms of non-first-conjugation lexemes. These desinences have a common historical origin, as all cond and non-first-conjugation ipfv.ind forms continue Latin forms in ‑ēbam, ‑ēbas, ‑ēbat, etc. (section 2), and consequently each series tends to undergo the same sound changes as the other. Moreover, the formal identity between the series is maintained in several cases of analogical change, suggesting that it has some measure of reality for speakers as a systematic distributional template (section 3).

The diachronic behaviour of the pattern observed here recalls the existing morphomic structures identified for Romance languages by Maiden (e.g. 2009a, 2011a, 2011b, 2016a, 2018), though these patterns differ in scope. Maiden’s patterns, defined in terms of lexical root or stem distribution, can act as templates for the distribution of almost all types of inflectional exponents, while the analogical changes reported in this study affect only desinences and thematic elements. Additionally, Maiden’s patterns are defined paradigm-internally and independently of conjugational class, while the relationship studied here involves multiple paradigm types and makes crucial reference to conjugational class.

To account for this combination of similarities and differences, the study makes the following proposal (section 4). Two separate implicational relationships are assumed to hold: one between cond forms of all lexemes, and another, paradigm-internal, between cond and ipfv.ind forms in a subset of lexemes. The latter relationship is classed, like Maiden’s patterns, as a metamorphome2 in the terms of Round (2015), namely a set of paradigm cells which systematically share exponents. The behavioural differences between the distribution described here and Maiden’s patterns are argued to emerge from such factors as lexical type frequency and the function of the exponents on which generalisations about identity are based. The proposed analysis is consistent with observations that multiple and conflicting paradigmatic distribution templates may co-exist (e.g. Smith, 2011, for sociolinguistic variation between templates for stem distribution; Maiden, 2000, 2012a, 2018, pp. 288-289; and Esher, 2015b, for ‘clash’ between templates for the distribution of different exponents in a single paradigm), and contributes to understanding of the interactions between such templates.

2. Occitan conjugation and the historical source of identity between cond and non-first-conjugation ipfv.ind forms

2.1. Modern Occitan systems

The majority of Occitan varieties, like most Ibero-Romance varieties, present two distinct types of desinence in the ipfv.ind, both of which continue Latin ipfv.ind forms. One is characterised by a theme vowel /a/ and a labial consonant, and is confined to first-conjugation ipfv.ind forms; the other, common to all non-first-conjugation ipfv.ind forms, usually presents either a yod followed by a stressed vowel or an /i/ followed by an unstressed vowel. The latter type is also found in cond forms of all conjugations, as the Gallo- and Ibero-Romance cond has its origin in a periphrasis collocating the lexical infinitive and an ipfv.ind form of the second-conjugation verb habere ‘have’ (e.g. cantare habebat > Fr. chanterait, Cast. cantaría, Cat. cantaria).

Some illustrative examples are shown in Tables 1 and 2. Table 1 gives full ipfv.ind and cond paradigms for the three conjugations traditionally recognised for Occitan (continuants of Latin conjugations I, IV and III respectively; II was early assimilated to III) in the variety of Fauch. Table 2 shows sample ipfv.ind and cond forms from varieties spanning a selection of traditional dialect groups.

[Table 1 columns: cantar ‘sing’, I | bastir ‘build’, IV | vendre ‘sell’, III. Cell contents not recoverable.]

Table 1. Comparison of cond and ipfv.ind forms for continuants of Latin conjugations I, IV and III in the Occitan variety of Fauch (Languedoc, ALLOc survey point 81.12). Exponents shared across cond and non-first-conjugation ipfv.ind forms are highlighted in bold face.






[Table 2: surviving label Val d’Aran. Cell contents not recoverable.]

Table 2. Comparison of 3sg cond and ipfv.ind forms for continuants of Latin conjugations I, IV and III in the Occitan varieties of Graulhet (Languedoc: Lieutard, 2004), Var (Provence: Domenge, 2002), Nice (Nice: Toscano, 1998), Nontron (Limousin: Reydy, 2008), Aubenas (Vivarais: Moulin, 2006), Val d’Aran (Gascony: Carrera, 2007). Orthography replicates that of the source material. Exponents shared across cond and non-first-conjugation ipfv.ind forms are highlighted in bold face.

As can be seen from Table 2, the phonological form of the desinences varies by region, but the basic distribution pattern, in which cond and non-first-conjugation ipfv.ind forms pattern together, contrasting with first-conjugation ipfv.ind forms, remains constant across varieties, with the exception of Gascony. In the latter area, it is common for a three-way conjugational distinction to be preserved in the ipfv.ind (compare also 3sg forms cantava vs. sentiva vs. batèva/batè in Béarn: Romieu & Bianchi, 2005, p. 277), and for the cond to present a unique series of desinences (compare also cantaré, sentiré, bateré in Béarn: Romieu & Bianchi, 2005, p. 278). The history and behaviour of these forms fall outside the scope of the present study.

Note also that in the conjugational systems discussed here, distinction between continuants of Latin conjugations IV and III is based entirely on stem formatives: continuants of IV are characterised by a theme vowel /i/ and/or a thematic ‘augment’, which usually takes the form /is/ in the ipfv.ind (Esher, 2016).

2.2. Historical background

The Occitan two-way system represents a reduction of the three-way contrast found in Latin ipfv.ind forms and exemplified in Table 3. Reflexes of Latin first-conjugation ipfv.ind forms in -ābam, etc. have remained distinct, whereas -iēbam, etc. characteristic of the fourth conjugation was reduced to ‑ēbam, etc. (Ronjat, 1937, p. 71), thus falling together with etymological -ēbam, etc. in the second and third conjugations. The cond, due to its historical source, patterns with the -ēbam, etc. group. Exemplar ipfv.ind forms for mediaeval Occitan are shown in Table 4, illustrating the two-way contrast which is continued by modern varieties.

[Table 3 columns: ‘sing’, I | ‘have’, II | ‘send’, III | ‘hear’, IV. Cell contents not recoverable.]

Table 3. Exemplar ipfv.ind forms for Latin conjugations I-IV.

[Table 4 columns: cantar ‘sing’ | aver ‘have’ | metre ‘put’ | auzir ‘hear’. Cell contents not recoverable.]

Table 4. Exemplar ipfv.ind forms for mediaeval Occitan reflexes of the Latin lexemes in Table 3 (Anglade, 1921, pp. 271, 285, 294, 318, 396; Skårup, 1997, pp. 98-99). Vowels bearing primary stress are underlined. Palatalisation of d > z before i constitutes a regular sound change in these varieties.

The sound changes undergone by first and non-first-conjugation ipfv.ind forms are deserving of comment, since these forms differ not only in theme vowel but in their treatment of intervocalic -b-. In first-conjugation forms, -b- undergoes lenition to [β] or [v], a development which is demonstrably regular (compare e.g. faba > [faβɔ], [fava] ‘bean’). In forms continuing -ēbam, etc., by contrast, intervocalic -b- is deleted, a change which is usually considered idiosyncratic.

Historical grammars of Gallo-Romance varieties (e.g. Anglade, 1921, p. 286; Fouché, 1931, p. 623; Ronjat, 1937, p. 171; Zink, 1989, p. 74; Buridant, 2000, p. 271) typically seek to motivate the deletion of -b- in non-first-conjugation ipfv.ind forms in isolation, assuming that the cond forms will automatically undergo identical developments. According to these grammars, the loss of -b- first occurs by dissimilation in the ipfv.ind of verbs with a stem in -b-, particularly habēbam ‘have.ipfv.ind.1sg’ etc., but also debēbam ‘have_to.ipfv.ind.1sg’, bibēbam ‘drink.ipfv.ind.1sg’, scribēbam ‘write.ipfv.ind.1sg’ etc., and is then spread by analogy to other non-first-conjugation ipfv.ind forms (together with the cond, presumably).

Yet this account is unconvincing (Lecoy, 1967, p. 280), from both dissimilatory and analogical points of view (see also Posner, 1961, p. 183). The dissimilation claim is based on Grammont’s (1895, p. 79) law XVII, but the predictions of this law differ substantially from the development claimed for habēbam, etc., since, according to Grammont, where two identical intervocalic consonants occur successively, it is the first consonant of the pair (not the second) which is liable to be dissimilated (not deleted).

The analogy claim is similarly problematic. According to the general principles identified by Bybee (e.g. 1985, 2001) and Albright (2009), patterns of high lexical type frequency constitute good candidates for models for analogical change, while patterns of high token frequency but low lexical type frequency may resist change, or may undergo idiosyncratic changes (compare e.g. Fr. mon sieur ‘my lord’ [mɔ̃ sjœʁ] vs. monsieur ‘Mr’ [msjø]; Cast. Vuestra Merced ‘Your Grace’ > Usted ‘you [polite/formal]’), but do not in general serve as models for change to other items. For the early Romance case under discussion, it may be estimated that non-first-conjugation items represent approximately half of verb lexemes4, while around 50 verb lexemes have stems in -b-5. The traditional account, in which loss of -b- from -ēbam, etc. forms originates in verb lexemes with stems in -b-, thus entails the improbable claim that the ipfv.ind forms of half of all verb lexemes, together with the cond forms of all verb lexemes (in both cases several thousand items), were remodelled based on a few dozen lexemes.

An alternative and more convincing proposal by Togeby (1964, p. 4) asserts that the loss of -b- from -ēbam, etc. forms began instead in the cond, due to the grammaticalisation of habebam, etc. from auxiliary to desinence. This case of grammaticalisation is known to involve significant phonological reduction of the periphrases in which the Romance fut and cond originate, with deletion of unstressed vowels from the erstwhile lexical infinitive and deletion of the stem of habēre (e.g. mittere habebat > Occ. metriá/*metraviá)6. Loss of intervocalic -b- would, under Togeby’s view, form part of such phonological reduction.

The resulting forms in *ea < -eba- are then assumed to have spread from the cond to all non-first-conjugation ipfv.ind forms: “[d]’après ce modèle [the cond], l’imparfait a également réduit sa désinence à -ea, -ia, mais seulement dans les conjugaisons dont les désinences correspondent à celles du conditionnel”7 (Togeby, 1964, p. 4). It is not clear whether Togeby himself envisages the mechanism of spread to be morphological analogy or diffuse sound change. Indeed, one of the striking aspects of his proposal is its concision, a testimony to the strength of the implicative relationship between cond and non-first-conjugation ipfv.ind desinences. It is not a priori self-evident that novel cond forms in *ea should constitute a suitable model for conservative ipfv.ind forms in -eba-, yet Togeby’s analysis offers no justification for the spread of *ea from cond to ipfv.ind, instead taking it as axiomatic that cond and non-first-conjugation ipfv.ind desinences should consistently display and maintain identity.

In practice, diffuse sound change is a more plausible candidate mechanism for this development than morphological analogy. The diachrony of Maiden’s well-documented patterns repeatedly shows that, for a given set of paradigm cells forming a metamorphome, a sound change which causes differentiation between two subsets of these cells ordinarily leads to a novel morphological generalisation splitting the metamorphome in two, rather than to analogical levelling restoring identity of exponence between the cells8. It is thus improbable that the identity of cond and ipfv.ind desinences, if initially morphomic and compromised by phonological innovation in the cond, should have been restored by morphological analogy spreading the innovative forms to the ipfv.ind. The spread of *-ea, etc. from the cond to the ipfv.ind is more plausibly explained as a diffuse phonological change, with loss of -b- from the ipfv.ind beginning while there is still considerable variation between forms with and without -b- in the cond (and thus forms with -b- still occur sufficiently often in the cond for a formal similarity with the ipfv.ind to be perceived).

In their subsequent history, the phonological similarity of cond and non-first-conjugation ipfv.ind desinences predisposes them to undergo the same sound changes as each other, though these may vary by region. For instance, the regular continuant of -ēbat is [je] in Provence (e.g. metrié ‘put.cond.3sg’, metié ‘put.ipfv.ind.3sg’ in Table 2, also Ronjat, 1930, p. 390, 1937, p. 172), [jε] in the eastern Languedoc (e.g. vendriè ‘sell.cond.3sg’, vendiè ‘sell.ipfv.ind.3sg’ in Sète: Thérond, 2002, pp. 156-157) and [jɔ] in much of the western Languedoc (e.g. vendriá [be͂ndˈɾjɔ] ‘sell.cond.3sg’, vendiá [be͂ndˈjɔ] ‘sell.ipfv.ind.3sg’, in the variety of Roussayrolles: ALLOc survey point 81.01). As well as the raising of final /a/ and the reduction of /i/ to /j/, these forms have undergone a stress shift, in which primary stress has moved from the penult to the final syllable (Ronjat, 1937, p. 172). The inherited accentuation pattern with stress on the penult in the singular and 3pl forms (as in Table 4) survives only in certain varieties of the Pyrenees (e.g. Massat: Laurent, 2001, p. 13, pp. 28-29) and of the Nice area (Dalbera, 1994, also section 3.2 below); in some Pyrenean varieties the final unstressed vowel itself undergoes deletion in closed syllables, thus bastía [basˈti.ə] ‘build.ipfv.ind.1sg/3sg’, bastiría [bastiˈɾi.ə] ‘build.cond.1sg/3sg’ vs. bastís [basˈtis] ‘build.ipfv.ind.2sg’, bastirís [bastiˈɾis] ‘build.cond.2sg’, bastín [basˈti͂n] ‘build.ipfv.ind.3pl’, bastirín [bastiˈɾi͂n] ‘build.cond.3pl’ in the variety of Quérigut (ALLOc survey point 09.33).
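The Quérigut alternation just cited can be restated as a single conditional rule. The following Python sketch is purely illustrative: the function name `querigut_form`, and the decomposition of each form into a stem ending in stressed í, an unstressed final vowel and an optional person/number consonant, are assumptions made here for exposition rather than an analysis taken from the sources.

```python
def querigut_form(stem: str, person_ending: str) -> str:
    """Attach the desinence to a stem ending in stressed í.

    Assumption (illustrative only): the desinence consists of an
    unstressed final vowel 'a' plus an optional final consonant
    marking person/number ('' for 1sg/3sg, 's' for 2sg, 'n' for 3pl).
    """
    final_vowel = "a"
    if person_ending:
        # Closed syllable: the unstressed final vowel is deleted.
        return stem + person_ending
    # Open syllable: the unstressed final vowel survives.
    return stem + final_vowel


# Stems as they appear in the ipfv.ind and cond forms quoted above.
for stem in ("bastí", "bastirí"):
    print([querigut_form(stem, ending) for ending in ("", "s", "n")])
```

Applied to the two stems, the sketch yields the orthographic forms quoted above (bastía, bastís, bastín; bastiría, bastirís, bastirín), with the same rule operating identically in the ipfv.ind and the cond.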

While these changes affect cond and non-first-conjugation ipfv.ind forms equally, they are not confined to these categories or indeed to the verb: raising of final unstressed /a/, stress shift and /i/ > /j/ are found in nouns presenting the same phonological context, e.g. in the western Languedoc porcariá [purkaˈɾjɔ] ‘pigsty’ (ALLOc map 440) and resegariá [reseɡaˈɾjɔ] ‘sawmill’ (ALLOc map 777). As general phonological changes applying across the language, they provide no information about what morphological relationship, if any, speakers perceive between cond and non-first-conjugation ipfv.ind forms.

3. Analogical changes sensitive to the existing pattern of identity between cond and non-first-conjugation ipfv.ind forms in Occitan

In the Gallo-Romance varieties considered here, unequivocally morphological, analogical change applying to all and only cond and non-first-conjugation ipfv.ind forms was found to be much rarer than such change applying to all and only the cells of a given metamorphome defined by root or stem material (for which latter see the extensive survey by Maiden, 2018). Among Occitan varieties with a two-way conjugational distinction in the ipfv.ind, this study found three cases in which analogical change appears to show sensitivity to a relationship between cond and non-first-conjugation ipfv.ind forms, namely: the generalisation of final -i as an exponent of 1sg, widespread across the Languedoc; the spread of a thematic formative -av- into 1pl and 2pl cond and non-first-conjugation ipfv.ind forms in varieties of the Nice area; and the replacement of [a] by [ɔ] across 1pl and 2pl cond and non-first-conjugation ipfv.ind forms in varieties of the Toulouse area. This section sets out the developments observed in historical and comparative dialect data, prior to theoretical discussion in section 4.

3.1. Generalisation of final -i across 1sg forms

In many varieties of Occitan, 1sg forms in most or all paradigm categories present an exponent -i, realised [i] or [j] (Oliviéri & Sauzet, 2016, pp. 333-338). This exponent is etymological in pfv.pst.ind.1sg forms, in all fut.1sg forms, and in the prs.ind forms of a subset of lexemes: first-conjugation lexemes with a stem in -i e.g. *cambio > cambi ‘change.prs.ind.1sg’, continuants of Latin conjugation IV, e.g. audio > auzi ‘hear.prs.ind.1sg’, lexemes with a stem-final consonant cluster requiring a supporting vowel, e.g. *simulo > sembli ‘seem.prs.ind.1sg’, and irregular lexemes with -e- in hiatus, e.g. debeo > dei ‘owe.prs.ind.1sg’ (the outcomes shown are for mediaeval Occitan, following Sutherland, 1959, pp. 39-40 and Skårup, 1997, pp. 93-95). From the mediaeval period onward, the exponent is progressively extended to the prs.ind forms of remaining lexemes, and to other TAM [tense, aspect, mood] categories, in some varieties colonising all TAM categories (Field, 2003; Esher, 2017a).

Study of diatopic variation in linguistic atlas data across the Languedoc region (Esher, 2017a, 2017b) reveals that in any given system with a two-way distinction in the ipfv.ind, the cond.1sg and non-first-conjugation ipfv.ind.1sg forms systematically pattern together. Either all such forms have -i (e.g. variety of Puycelsi, Table 5) or none do (e.g. variety of Cordes, Table 5); no intermediate systems are attested.

[Table 5 columns: cantar ‘sing’, I | bastir ‘build’, IV | vendre ‘sell’, III. Cell contents not recoverable.]

Table 5. ipfv.ind.1sg and cond.1sg forms for exemplars of the major conjugational types in the varieties of Cordes (ALLOc survey point 81.02) and Puycelsi (ALLOc survey point 81.03).

The consistency of behaviour between cond and non-first-conjugation ipfv.ind forms is striking in the context of analogical extension of -i. In this development, the only other distinct paradigm categories found to pattern together across the survey area are the prs.ind and pret of all conjugations, which systematically present -i, often for etymological reasons as discussed above; note also that in some varieties the etymological -i of the fut has fallen, and that in general the correlation between presence of -i in the fut and in the cond is not strong. The exceptionless correlation between cond and non-first-conjugation ipfv.ind forms with respect to the presence or absence of -i is thus remarkable, particularly since -i in these categories can be neither etymological nor the product of regular sound change. The presence of -i results from an analogical change which treats cond.1sg and non-first-conjugation ipfv.ind.1sg forms as a single block; such treatment indicates a strong formal, implicational relationship between the forms.
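The all-or-none distribution described in this section can be expressed as a simple consistency check over the relevant paradigm cells. The sketch below is a hedged illustration only: the cell labels, the function name `patterns_together` and the Boolean encoding are assumptions introduced here. The values given for Puycelsi (all forms with -i) and Cordes (no forms with -i) follow the prose description above, while the mixed system is a hypothetical configuration of the kind reported to be unattested.

```python
# Cells treated as a single block: cond.1sg of every conjugation and
# ipfv.ind.1sg of the non-first conjugations (IV and III).
BLOCK = [
    ("cond", "I"), ("cond", "IV"), ("cond", "III"),
    ("ipfv.ind", "IV"), ("ipfv.ind", "III"),
]


def patterns_together(has_i: dict) -> bool:
    """Return True if every cell in BLOCK agrees on presence/absence of -i."""
    return len({has_i[cell] for cell in BLOCK}) == 1


# Encodings based on the prose description (True = 1sg form ends in -i).
puycelsi = {cell: True for cell in BLOCK}
cordes = {cell: False for cell in BLOCK}

# Hypothetical intermediate system, of a type not found in the survey.
mixed = dict(puycelsi)
mixed[("ipfv.ind", "III")] = False

for name, system in [("Puycelsi", puycelsi), ("Cordes", cordes), ("mixed", mixed)]:
    print(name, patterns_together(system))
```

The first-conjugation ipfv.ind.1sg cell is deliberately left out of `BLOCK`, since the generalisation concerns only cond and non-first-conjugation ipfv.ind forms.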

3.2. Analogical extension of -[av]- in Nissart

In the variety of Nice, variation is found in the 1pl and 2pl forms of the cond and ipfv.ind, between the historically expected forms in -[iˈãŋ], -[iˈas] and forms of more recent, analogical origin, in -[iaˈvãŋ], -[iaˈvas]. Illustrative examples are given in Table 6.

Table 6. ipfv.ind and cond forms for continuants of Latin conjugations I, IV and III, in the variety of Nice (Toscano, 1998), showing variation in 1pl and 2pl cond and non-first-conjugation ipfv.ind forms due to analogical extension of the formative [av] from the first-conjugation ipfv.ind. Orthography is the usual standard for this area. Vowels bearing primary stress are underlined.


Whether forms in -[iaˈvãŋ], -[iaˈvas] result from the spread of a formative [av] (as assumed by Ronjat, 1937, p. 171) or of entire desinences -[aˈvãŋ], -[aˈvas], the analogically extended material must originate from first-conjugation ipfv.ind forms in -[av]-, the only area of the verb system in which such a formative was historically present (see e.g. reconstructed paradigms given by Dalbera, 1994, pp. 588, 590, 609). The direction of change is confirmed by the fact that the formative -[i]-, characteristic of cond and non-first-conjugation ipfv.ind forms, is not spread to first-conjugation ipfv.ind forms (this study found no attestations of forms such as *parliavam ‘speak.ipfv.ind.1pl’). This fact also indicates that the development is not a general levelling of ipfv.ind and cond forms, but is instead restricted to cond and non-first-conjugation ipfv.ind forms, a set of cells which is uniquely characterised only by the form of its exponents.

The inflectional distribution illustrated in Table 6 is highly localised, being confined to the immediate area of Nice. In a few of the surrounding varieties, such as that of Tende, -[av]- has spread across ipfv.ind forms of all conjugations, without affecting the cond (Dalbera, 1994, p. 621). This development differs from that observed in Nice, since in the variety of Tende -av- replaces inherited -i-, giving e.g. [vendaˈvamu] ‘sell.ipfv.ind.1pl’, [vendaˈvai] ‘sell.ipfv.ind.2pl’ (Dalbera, 1994, p. 621), as opposed to the corresponding Nissart forms vendiavam, vendiavatz which present both formatives. Nevertheless, it invites the interpretation that the development observed in Nice has two components: firstly, the generalisation of a given exponent across a functionally coherent category [ipfv.ind.1pl or ipfv.ind.2pl respectively], from the conjugation with highest lexical type frequency to conjugations of lower type frequency; and secondly, the spread of this exponent to cond forms, restoring the strong implicational relationship between cond and non-first-conjugation ipfv.ind forms. Such an analysis is supported by the presentation of the variant forms in dialect descriptions: av-forms tend to be recommended over av-less forms in the ipfv.ind, while av-less forms are preferred in the cond, indicating that the av-forms are less strongly established in the cond than in the ipfv.ind. Moreover, there are no attestations of varieties in which -iavam, -iavatz forms are present in the cond but absent from the ipfv.ind. Taken together, these data support a view under which the extension of -av- is mediated via the non-first-conjugation ipfv.ind, without direct influence of the first-conjugation ipfv.ind on the cond; the key role of the non-first-conjugation ipfv.ind again points to the importance of the formal, implicational relationship.

A striking aspect of the Nissart development is the restriction of -av- to 1pl and 2pl forms. In its etymological distribution, the formative -av- is characteristic of all first-conjugation ipfv.ind forms, and is not associated with any particular person/number value(s). One might therefore expect analogical extension to spread -av- to all cond and non-first-conjugation ipfv.ind cells, yet instead only 1pl and 2pl forms are affected. It may be significant that, in the paradigm categories concerned, 1pl and 2pl forms are distinguished from the other person/number forms with respect to stress assignment: primary stress is final in 1pl and 2pl forms, but penultimate in the other forms, thus in practice the extension is consistently limited to unstressed -av-9.

3.3. Levelling of final vowels in Haute-Garonne

In many varieties of the Languedoc (e.g. Graulhet: Lieutard, 2004 and Table 7) the cond and non-first-conjugation ipfv.ind present a contrast between final stressed [a] in the 1pl and 2pl forms, and final stressed [ɔ] in the singular and 3pl forms. This contrast is due to a combination of the sound changes discussed in section 2 above. The alternation in vowel quality arises at an early period, when final /a/ undergoes context-sensitive differentiation according to stress: when tonic the realisation [a] is maintained, while when post-tonic the vowel raises to [ɔ] (see reconstructed forms in Table 7). At a later period, when the context-sensitive changes to quality were no longer active, stress shifted from the penult to the final syllable in the singular and 3pl forms of the cond and non-first-conjugation ipfv.ind, resulting in the distribution typically found today.

[Table 7 columns: partir, reconstructed | durbir, Graulhet | bastir, Merville. Cell contents not recoverable.]

Table 7. ipfv.ind and cond forms for continuants of Latin conjugation IV: partir ‘leave’ prior to stress shift (based on Anglade, 1921, pp. 285-286; Skårup, 1997, pp. 98, 103), durbir ‘open’ in the modern variety of Graulhet (Tarn: Lieutard, 2004, p. 229) and bastir ‘build’ in the modern variety of Merville (Haute-Garonne: ALLOc survey point 31.01).
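The relative chronology described in this section (raising of post-tonic /a/ first, stress shift later) can be made explicit as two ordered rewrite steps. The Python sketch below is a schematic illustration under stated assumptions: the desinences `ia`, `iam`, `iats`, `ian` and their stress positions are simplified stand-ins rather than the reconstructed forms of Table 7, nasalisation is ignored, and all function names are introduced here for exposition.

```python
from typing import Tuple


def raise_posttonic_a(segments: str, stress: int) -> Tuple[str, int]:
    """Earlier change: /a/ raises to [ɔ] only when it follows the stressed vowel."""
    raised = "".join(
        "ɔ" if seg == "a" and i > stress else seg
        for i, seg in enumerate(segments)
    )
    return raised, stress


def shift_stress(segments: str, stress: int) -> Tuple[str, int]:
    """Later change: /i/ becomes the glide [j]; stress falls on the next vowel."""
    if not segments.startswith("i"):
        raise ValueError("schematic desinences are expected to begin with i")
    return "j" + segments[1:], 1


def render(segments: str, stress: int) -> str:
    """Place the stress mark immediately before the stressed vowel."""
    return segments[:stress] + "ˈ" + segments[stress:]


def derive(segments: str, stress: int, attested_order: bool = True) -> str:
    """Apply both changes, in the attested order or (counterfactually) reversed."""
    steps = [raise_posttonic_a, shift_stress]
    if not attested_order:
        steps.reverse()
    for step in steps:
        segments, stress = step(segments, stress)
    return render(segments, stress)


# Schematic inputs: (desinence, index of the stressed vowel).
CELLS = {"3sg": ("ia", 0), "1pl": ("iam", 1), "2pl": ("iats", 1), "3pl": ("ian", 0)}

for cell, (segments, stress) in CELLS.items():
    print(cell, derive(segments, stress), derive(segments, stress, attested_order=False))
```

In the attested order the sketch gives [ɔ] in the 3sg and 3pl but [a] in the 1pl and 2pl, matching the alternation described above. With the order reversed, every cell would retain [a], which is why the raising must have ceased to be active before stress shifted.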

An exception to the common pattern is found in the area of Toulouse, where Ronjat (1937, p. 172) notes that the 1pl and 2pl endings in the cond and non-first-conjugation ipfv.ind are not -ian, -iats as expected, but -ion, -iots. Such forms are independently attested in linguistic atlas data: forms in [jɔ̃n], [jɔts] are found at eight of the ten ALLOc survey points in the département Haute-Garonne, including Merville10 (ALLOc 31.01, Table 7), while the other two points continue the historically expected pattern of alternation between [a] and [ɔ].

The development must be analogical, since there is no phonological motivation for the replacement of [ˈjãn], [ˈjats] by [ˈjɔ̃n], [ˈjɔts]: final stressed [ˈjãn], [ˈjats] remain entirely licit in these varieties, e.g. vendiam [bẽnˈdjãn] ‘sell.prs.sbjv.1pl’, vendiatz [bẽnˈdjats] ‘sell.prs.sbjv.2pl’ in the varieties of Merville11, Mauressac (ALLOc 31.21) and Aignes (ALLOc 31.23).

No model for this change, though, is offered by etymological 1pl and 2pl desinences elsewhere in the paradigm. In the variety of Toulouse (Table 8), the exponents of highest paradigmatic type frequency for 1pl and 2pl forms in general, and indeed for 1pl and 2pl forms with final stress, are [ẽn], [ets], while the first-conjugation prs.ind forms, of high lexical type frequency, are [ˈãn], [ˈats]; [ɔ̃n] is found only in 3pl forms, which are rarely syncretic with 1pl forms.

Instead, this appears to be a case of analogical levelling, generalising the vowel [ɔ] found in the singular and 3pl desinences of the cond and non-first-conjugation ipfv.ind to all desinences of these categories. The result is a reduction of allomorphy within the relevant series of desinences, reinforcing the formal similarity between the constituent cells.


Table 8. Finite synthetic forms of cantar ‘sing’, bastir ‘build’ and vendre ‘sell’ in the variety of Toulouse (ALLOc survey point 31.12). Note that the ipfv.ind forms [kãnˈta.i] etc., and the third-conjugation pret forms [bẽnˈdɛ.i] etc., result from a regular sound change specific to this area, in which intervocalic β > ∅. Variation in pret exponents according to inflectional class is rare but not unknown in Occitan varieties of the Languedoc area. The ipfv.sbjv.1sg form for bastir is not given in the transcriptions of the fieldwork interviews; [bastiskˈɛsi] would be expected.

While the internal formal consistency of the series is increased by the analogical extension of /jɔ/, the overall distribution of the series is affected by a separate development: a sound change which reduces the sequence /ɾj/ in cond forms12 to either /ɾ/ or /j/ (Esher, 2015a). Which element is deleted depends on the preceding segments: in general, /ɾ/ is deleted where the sequence follows a vowel, while /j/ is deleted where the sequence follows a consonant, hence in the variety of Toulouse (Table 8) [kãntaɾjˈɔ] > [kãntajˈɔ] and [bastiɾjˈɔ] > [bastijˈɔ] but [bẽndɾjˈɔ] > [bẽndɾˈɔ]13. As a result, a subset of cond forms in the modern varieties of this area lack the glide /j/ common to all other cond and non-first-conjugation ipfv.ind forms.
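The conditioning of the /ɾj/ reduction described above can be sketched as a small rewrite function. This is an illustrative approximation only: the function name `reduce_rj` and the vowel inventory in `VOWELS` are assumptions made here, and the rule encodes the general tendency stated in the text rather than an exceptionless law.

```python
# Assumed inventory of oral vowel symbols, for illustration only.
VOWELS = set("aeiouɔɛə")


def reduce_rj(form: str) -> str:
    """Reduce the sequence ɾj according to the preceding segment.

    After a vowel, ɾ is deleted; after a consonant, j is deleted.
    Forms without ɾj, or with nothing before it, are returned unchanged.
    """
    idx = form.find("ɾj")
    if idx <= 0:
        return form
    if form[idx - 1] in VOWELS:
        # Post-vocalic context: delete ɾ, keep j.
        return form[:idx] + form[idx + 1:]
    # Post-consonantal context: keep ɾ, delete j.
    return form[:idx + 1] + form[idx + 2:]


# Pre-reduction cond.3sg forms for the variety of Toulouse, as cited above.
for form in ("kãntaɾjˈɔ", "bastiɾjˈɔ", "bẽndɾjˈɔ"):
    print(form, ">", reduce_rj(form))
```

For the three forms cited, the sketch removes /ɾ/ after the vowels of cantar and bastir but removes /j/ after the consonant of vendre, so that only the last loses the glide shared with the ipfv.ind.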

3.4. Summary

In the three developments described above, systematic similarities of exponence between cond and non-first-conjugation ipfv.ind forms are variously maintained (extension of -i and -av-) and reinforced (levelling to -/ɔ/) in cases of analogical change.

The association of common semantic content with the ipfv.ind and the cond (see e.g. Iatridou, 2000; Touratier, 1996; Vincent, 2013) may be a contributing factor in such parallelism, but is not sufficient to explain the observed distributions, in which first-conjugation and non-first-conjugation ipfv.ind forms are treated differently: while inflectional classes may align to some extent with function, they are intrinsically a phenomenon of autonomous morphology (Aronoff, 1994). The changes observed also show sensitivity to morphosyntactic feature values (notably 1sg) and phonological properties (stress assignment, which is lexically specified in the varieties discussed here).

The picture which emerges is of implicational relationships between paradigm cells, which have psychological reality for speakers demonstrated by their diachronic productivity as templates: for any given person/number value, cond and non-first-conjugation ipfv.ind forms systematically share exponents; across person/number values, cond and non-first-conjugation ipfv.ind forms may share exponents.

4. Comparison between the study data and established metamorphomes

The dialect data outlined in section 3 are evocative of implicational, morphomic relationships insofar as they illustrate an inherited identity between exponents being preserved over time and referred to in the process of analogical change: while the relationships themselves are defined synchronically by linguists (see e.g. Aronoff, 1994; Round, 2015) and presumably exist synchronically for speakers, diachronic persistence is a valuable indicator of their psychological reality (Maiden, 2018, pp. 12-13). The descriptions found in historical grammars are similarly indicative of strong implicational relationships between cond forms and non-first-conjugation ipfv.ind forms: so commonplace and systematic is this identity that many authors consider a reference to the ipfv.ind to provide a reliable, informative and sufficient description of the cond.

For these reasons, it would be desirable to formalise the relationship described here as a morphomic object which is part of the structure of the inflectional system, similar to Maiden’s patterns. Indeed, neither Aronoff’s (1994) initial conception of morphomic functions, nor Round’s (2015) notion of metamorphomes as grouping paradigm cells based on exponence, specifies any particular type of exponent to be crucial; thus the existence of metamorphomes based on desinential rather than root material is not a priori excluded. Yet such a formalisation should also account for a number of apparent or real contrasts between the relationships described here and existing examples of metamorphomic phenomena. The nature and significance of these contrasts are addressed over the following sections.

4.1. Characteristics of metamorphomes discussed in existing literature

In a vast comparative-historical survey of Romance data, Maiden (e.g. 2009a, 2011a, 2011b, 2016a) identifies a number of metamorphomic patterns for the distribution of inflectional material in the verb paradigm of Romance languages, showing how these patterns act as templates for the distribution of inflectional allomorphy. Once a template is established, its constituent cells systematically pattern together in analogical change, behaviour labelled coherence by Maiden (2005). Furthermore, analogy may operate between the exponents of this set of cells in separate lexemes, such that these exponents come to share a characteristic phonological shape, a phenomenon labelled convergence by Maiden (2005).

An illustrative example of such developments is offered by the Romance fut and cond in the variety of Nice described above (section 3.2, Table 6), which together constitute a metamorphome. Historically, these forms are reflexes of the construction infinitive+‘have’, and thus systematically share a stem, which is differentiated from the infinitive by early Romance sound changes, and often unique within the inflectional paradigm (compare e.g. *uolere > voler ‘want.inf’ vs. *uolere habet > vorrà ‘want.fut.3sg’, *uolere habebat > vorria ‘want.cond.3sg’ in the modern variety of Nice, Toscano, 1998, p. 130). Subsequent analogical changes taking the set of twelve fut and cond cells as a template include the spread of the augment -iss- < -isc- into all twelve cells (contrasting with most other Occitan varieties, in which the augment is not found in either fut or cond; Esher, 2016a), and the replacement of etymological theme vowels differentiating conjugational class in the fut and cond by a single theme vowel /e/ characterising this set of cells across conjugations (Esher, 2013); thus, in Nice, parleria ‘speak.cond.3sg’, finisseria ‘finish.cond.3sg’ replace etymological parlaria and finiria (conserved in other varieties).

Analogical redistribution of exponents according to metamorphomic patterns is not limited to thematic elements. In other Romance varieties (see e.g. Maiden, 2018), metamorphomes act as templates for suppletive forms, novel root allomorphy, heteroclisis and defectiveness. Additionally, in a few rare cases, inflectional desinences are redistributed according to metamorphomic templates based on roots: in Daco-Romance, Maiden (2009b) documents the spread of an exponent -şi originally unique to plpf.2sg forms into other 2sg cells forming part of the same established metamorphomic pattern as the plpf.

4.2. Influence on distribution of exponents within the paradigm

In contrast to existing metamorphomes based on lexical roots, which can act as templates for the redistribution of roots, stems, thematic elements and desinences, the identity of cond and non-first-conjugation ipfv.ind desinences does not appear to serve as a template for redistribution of roots or stems14: this study only found examples of analogical redistribution of desinential material (-i) and thematic material (-av-). The apparent restriction on what the template can manipulate may be merely an artefact of the particular data set considered here; alternatively, it may reflect a potentially general property of distributional templates based on similarity of inflectional desinences.

One interpretation of this contrast is simply that if a metamorphome is inferred by speakers based on a particular type of exponent (e.g. mainly lexical roots or mainly inflectional desinences), speakers are more likely to exploit it as a template for redistributing broadly similar types of exponent. Although there are occasional cases of inflectional desinences being redistributed according to Maiden’s patterns, these are overall much less frequent than the cases of root or stem material being similarly redistributed. As Maiden’s patterns, which arise as morphological generalisations about the distribution of root allomorphy, are chiefly of use to speakers in organising phenomena which affect the root, it would not be illogical to expect that patterns which arise as morphological generalisations about the distribution of non-root material would be preferentially exploited for organising non-root material.

An alternative interpretation is that templates based on similarity of root (or, at least, lexical) material have wider influence over the distribution of other exponents than templates based on similarity of desinences. Such a view would be congruent with Maiden’s hypothesis that the function of metamorphomes is to provide speakers with a predictable means of distributing arbitrary allomorphy associated with a single lexical item (2013, 2018, pp. 310-314): Maiden observes that, in diachrony, metamorphomes persist where their constituent cells share lexical meaning, but may break down where a given inflectional form acquires a lexicalised meaning distinct from that of its original paradigm (2012b, 2018, p. 262). If Maiden is correct, then metamorphomic distributions pertaining to exponents of lexical meaning (chiefly roots) are expected to be of greater value to speakers, and thus more influential in the paradigm, than metamorphomic distributions pertaining to other, less arbitrary material.

An interesting comparator in this regard is Bybee’s (1985, p. 35) observation of a crosslinguistic tendency for exponents of tense and aspect to occur nearest to the stem, while exponents of person/number occur furthest from the stem. Bybee maps this ordering to a cline of semantic relevance, from material central to the meaning of the verb form, which occurs in the stem, to material referring to participants, which occurs furthest from the stem (1985, p. 35); but it may also be characterised as a cline of decreasing arbitrariness. Lexical content, associated with the innermost exponents, is inherent to the lexeme and arbitrary by definition (principle of the Saussurean sign); the morphosemantic features tense, aspect and mood are inherent to a given inflected form, and contribute meaning not recoverable from syntactic context; while the morphosyntactic features person and number are (on the verb) contextual, dependent on agreement and visible in syntax15. To the extent that exponents of lexical content, morphosemantic features and morphosyntactic features can be discerned in Romance verb paradigms (rife with cumulative exponence, extended exponence and so-called ‘empty morphs’), the same tendency is visible: relevance in Bybee’s sense and arbitrariness both decrease as distance from the stem increases, and one might reasonably expect the influence of metamorphomes to decrease with these properties of the exponents they are based on.

4.3. Internal structure, coherence and convergence

The study data illustrate a degree of coherence, without convergence. These findings are largely inconclusive: they do not constitute sufficient evidence either to assert that the set of cond and non-first-conjugation ipfv.ind forms is a single metamorphome, or to dismiss the possibility.

It is a feature of the analogical extensions discussed in sections 3.1. and 3.2. that they involve only cells with certain person/number values among cond and non-first-conjugation ipfv.ind forms, rather than treating all such forms as a single block. Such phenomena are not unprecedented in the literature on metamorphomes: Maiden (2018, p. 73) gives several examples of allomorphy within metamorphomes in Italo-Romance and Ibero-Romance varieties, and there are also cases in which the domain of a given analogical change is demonstrably the intersection of a metamorphome with an extramorphological feature (Maiden, 2009b; O’Neill, 2014); though it should be noted that in these examples, robust independent evidence of the metamorphomic status of the entire set of cells was available.

The more fundamental point to be made is that even established metamorphomes, typically discussed with emphasis on the coherence of their constituent cells, are groupings of individual cells, each with its own bundle of feature values and exponents. As such, the groupings retain internal structure: consider, for example, the Daco-Romance patterns described by Maiden (2011c) as a series of probabilistic, implicational relationships between individual cells. The evidence of coherence adduced in this study does not permit discrimination between an analysis in which all relevant person/number forms belong to the same set of cells (cross-cut by the distribution of morphosyntactic feature values and of stress) and an analysis in which each set of person/number forms constitutes a distinct unit.

Examples of convergence (in which the exponents of a given metamorphome acquire a characteristic phonological shape across lexemes) were not found in this study. Yet the finding is hardly surprising, since the desinences participating in the formal identity are already constant across lexemes: there is effectively no margin for their similarity to increase.

It should also be noted that certain of the developments illustrate conflicting tendencies: for example, the extension of -av- in Nissart increases differentiation between person/number forms within the series, while such differentiation is reduced by the levelling of vowel alternations in the varieties of Haute-Garonne. The comparison of these two cases thus yields no overall generalisation concerning the relationships between the cells involved; indeed, if there is a generalisation to be made at all, it concerns a tendency to align the distribution of segmental allomorphy with the distribution of stress patterns (see e.g. Maiden, 2000; Esher, 2015b).

4.4. Shape, definition and interaction

The most significant difference observed concerns the shape of the distribution: defining the formal relationships of cond and non-first-conjugation ipfv.ind forms requires reference to multiple paradigms, while familiar metamorphomes (although applying to multiple paradigms) can be defined in terms of cells within a single paradigm.

Maiden’s patterns can be represented visually in a manner similar to a distribution schema (Pirrelli & Battista, 2000) or stem space diagram (Bonami & Boyé, 2003), by setting out a grid which represents the cells of a single complete verb paradigm, and shading, outlining or indexing cells to indicate which cells are grouped together. A single grid is sufficient to capture a generalisation which holds for multiple lexemes (e.g. in the variety of Nice it is true for each individual lexeme that all its fut and cond forms share a stem).
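The single-grid representation can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the row inventory is deliberately reduced, and the labels and the `render` helper are hypothetical names introduced here, not part of any of the cited formalisms. The pattern itself is simply the set of twelve fut and cond cells of one paradigm.

```python
from itertools import product

PERSONS = ("1sg", "2sg", "3sg", "1pl", "2pl", "3pl")
# Assumption: a reduced inventory of tense/mood rows, for display only.
ROWS = ("prs.ind", "ipfv.ind", "fut", "cond", "prs.sbjv")

# The pattern is a set of cells within ONE paradigm: every fut and cond cell.
FUT_COND = set(product(("fut", "cond"), PERSONS))


def render(pattern):
    """Draw the paradigm grid, marking cells in the pattern with '#'."""
    lines = [" " * 10 + " ".join(PERSONS)]
    for row in ROWS:
        marks = ["#" if (row, p) in pattern else "." for p in PERSONS]
        lines.append(f"{row:<10}" + "   ".join(marks))
    return "\n".join(lines)


print(len(FUT_COND))  # 12 cells
print(render(FUT_COND))
```

Because the pattern is stated over cell labels alone, the same set can be checked against the paradigm of any lexeme, which is what allows one grid to stand for a generalisation across the lexicon.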

By contrast, in the case of the cond and non-first-conjugation ipfv.ind, the similarity of exponents spans multiple paradigms, and concerns different cells in different inflectional classes (only cond cells in first-conjugation lexemes; cond and ipfv.ind cells in most other lexemes; only ipfv.ind cells for certain lexemes in Haute-Garonne). This relation cannot be diagrammed within a single paradigm, or indeed a single conjugational class.

The solution proposed here, in order to account for the data, is to consider that the identity of cond and ipfv.ind cells is not a unitary object, but that there are at least two relations involved. One holds across cond forms of different lexemes: speakers associate cantariá with finiriá, vendriá and so on. The other, a metamorphome in Round’s terms, holds between the cond and ipfv.ind of a subset of lexemes: in these lexemes, speakers associate finiriá with finissiá, vendriá with vendiá, and so on.
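The two relations can be made concrete with a toy data set. The sketch below is illustrative only: the forms cantariá, finiriá, vendriá, finissiá and vendiá are those cited above, while cantava (the usual first-conjugation imperfect), the citation forms used as keys, the desinence inventory and the function names are assumptions introduced for the example.

```python
# 3sg forms only. cantava and the segmentation into desinences are
# assumptions made for this illustration.
FORMS = {
    "cantar": {"cond": "cantariá", "ipfv.ind": "cantava"},
    "finir": {"cond": "finiriá", "ipfv.ind": "finissiá"},
    "vendre": {"cond": "vendriá", "ipfv.ind": "vendiá"},
}
DESINENCES = ("iá", "ava")  # hypothetical inventory for this toy data set


def desinence(form):
    """Return the first listed desinence that the form ends with."""
    for d in DESINENCES:
        if form.endswith(d):
            return d
    raise ValueError(f"no known desinence in {form!r}")


def cond_desinences(forms):
    """Relation 1: desinences found in cond forms across all lexemes."""
    return {desinence(cells["cond"]) for cells in forms.values()}


def lexemes_with_identity(forms):
    """Relation 2: lexemes whose cond and ipfv.ind share a desinence."""
    return sorted(
        lex for lex, cells in forms.items()
        if desinence(cells["cond"]) == desinence(cells["ipfv.ind"])
    )


print(cond_desinences(FORMS))        # {'iá'}
print(lexemes_with_identity(FORMS))  # ['finir', 'vendre']
```

In this toy data the first relation holds for every lexeme, whereas the second picks out only the non-first-conjugation lexemes, which is the asymmetry that motivates treating the two relations separately.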

The proposed metamorphome directly conflicts with the distribution found in first-conjugation verbs, which is of somewhat higher lexical type frequency16. Conflict between metamorphomes within a given lexeme or across lexemes is not unprecedented. For example, there is overlap between the respective domains of the L-pattern and the N-pattern (Maiden, 2012a); the modern distribution of stem forms in the Italo-Romance preterite results from the intersection of two patterns (Maiden, 2000; Esher, 2015b); and in Occitan varieties such as that of Toulouse (Table 8) the inherited identity between fut and cond forms is maintained in some lexemes but lost in others, chiefly due to the differential treatment of the cluster [ɾj] in the cond. In the case of the cond and ipfv.ind, the lower lexical type frequency of this pattern is a plausible contributing factor to its limited influence and productivity, in comparison to Maiden’s patterns, many of which are valid across the lexicon.

A further advantage of this account is that it maintains a clear separation between metamorphomes (categories which group cells within a paradigm) and conjugational classes (categories which group paradigm types; Round, 2015), with the two phenomena remaining independent and orthogonal. Such a view is of descriptive advantage for Occitan dialect data (Esher, 2016a) and is equally compatible with theories which consider conjugational class to be a property of entire lexemes (e.g. Spencer, 2013) and theories which assume that conjugational class is a property of stems (e.g. Stump, 2016).

4.5. Summary

Overall, the behaviours of the study data are shown to be compatible with the existence of a metamorphome comprising cond and non-first-conjugation ipfv.ind cells. While salient properties of the putative metamorphome superficially differ from the characteristic behaviour of Maiden’s patterns, the same properties are nevertheless attested for various of Maiden’s patterns, and the differences are argued to be gradient and circumstantial rather than categorial. Among the advantages of this approach is that, in rejecting a sharp distinction between ‘root-metamorphomes’ (Maiden’s patterns) and paradigmatic distribution patterns based on similarity of inflectional exponents other than roots, it does not require definitive, categorial segmentation of inflectional forms into root and non-root material (see e.g. Spencer (2012) for the theoretical difficulty in doing so, Blevins (2006) and Maiden (2016b) for the multiple, overlapping and changing segmentations which speakers may infer in reality).

5. Conclusions

This study discusses the historical origins and diachronic development of cond and non-first-conjugation ipfv.ind desinences, and the formal relationship between them, in varieties of Occitan. The desinences share a common origin, and tend to undergo the same sound changes as each other, due to their similar or identical phonological form and context. While the usual convention of historical descriptions is to present the ipfv.ind as primary in such changes, it is argued (following Togeby, 1964) that at least one change, the loss of intervocalic -b- in early Gallo-Romance, is best accounted for by assuming that the change begins in the cond.

The forms are also subject to analogical changes, by which the cond and non-first-conjugation ipfv.ind are equally affected. The study describes three examples of extension and levelling attested in varieties of Occitan, in which thematic formatives and person/number desinences are redistributed (no examples of analogical change affecting roots were found).

The behaviour of these forms shows a number of similarities with established Romance metamorphomes, suggesting that the phenomena are indeed related. Examination of apparent differences compared to established metamorphomes indicates that the majority of these differences do not form significant obstacles to analysing the identity of cond and ipfv.ind forms in non-first-conjugation lexemes as a metamorphome. On the contrary, the data discussed here highlight the variability and internal structure of metamorphomes (previously observed by e.g. Maiden, 2011c; Smith, 2011): familiar, established Romance metamorphomes consist of multiple implicational relationships between individual paradigm cells, and exist alongside distributional patterns of lower frequency, generality and/or predictive value; while analogical changes affecting the distribution of inflectional exponents may make reference to multiple intersecting metamorphomes as well as to extramorphological properties.


1 Versions of this paper were presented at the 1st International Symposium of Morphology (Lille, December 2017) and the 8th Obrador de Lingüistica Occitana (Pau, July 2018). I thank both conference audiences, and especially the two anonymous Lexique reviewers, for their constructive comments and suggestions; Xavièr Bach kindly assisted with the French abstract. Remaining flaws are the sole responsibility of the author.

2 For consistency and clarity I will apply Round’s terminology throughout, even where it conflicts with the practice of other authors (Maiden, for example, refers systematically to morphomes).

3 Some scholars have suggested an articulatory explanation hinging on the difference of vowel quality between low [a] and high front [i], [e]: this view is adopted by Posner (1961), who additionally considers -ēbam and ‑iēbam to have merged as *iba rather than *eba.

4 For Latin, Ernout (1927, pp. 124, 143, 138) estimates 3620 lexemes in conjugation I, 570 in conjugation II and 2400 in conjugation III; regrettably no figure is given for conjugation IV. Wiktionary lists 2608 lexemes in conjugation I, 531 in conjugation II, 1786 in conjugation III and 334 in conjugation IV, plus a further 166 lexemes classed as ‘irregular’ (accessed 16 September 2018). A search of the lemmatiser Collatinus yields approximately 1698 lexemes of conjugation I, 301 in conjugation II, 1382 in conjugation III and 199 in conjugation IV (version 11, accessed 16 September 2018). While the raw figures differ, the ranking of conjugations by size is consistent, and both the Wiktionary and Collatinus counts suggest that around 48% of verb lexemes belong to the first conjugation and around 52% to the other conjugations.

5 Estimate based on Collatinus.

6 This point is discussed in more detail by Matthews (1982), based on Ibero- and Italo-Romance data.

7 ‘Following this model [the cond], the imperfect also reduced its desinence to -ea, -ia, but only in the conjugations whose desinences match those of the conditional’ [my translation].

8 Consider, for example, the split of Latin imperfectum forms following context-sensitive palatalisation, which gives rise to Maiden’s L-pattern [metamorphome consisting of 1sg.prs.ind and all prs.sbjv forms (see Maiden, 2009a, 2018); the label is deliberately arbitrary]; or the split of synthetic future and conditional forms in western varieties of Occitan, following context-sensitive reduction of consonant clusters (Esher, 2015a).

9 Compare the case of analogical levelling discussed in section 3.3., where no stress alternation is present and distinctions of vowel quality are eliminated.

10 Also Toulouse (31.12), Clermont-le-Fort (31.20), Mauressac (31.21), Mascarville (31.30), Montgaillard-Lauragais (31.31), Dreuilhe (31.32), Aignes (31.33), and areas of the départements Ariège and Aude. The survey points Villaudric (31.10) and Garidech (31.11) present, like the variety of Graulhet, the historically expected pattern of alternation between [a] and [ɔ].

11 A variant form [ˈbẽndɔts] ‘sell.prs.sbjv.2pl’ is given for Merville. This form, lacking yod and bearing stress on the root as opposed to the final syllable, is an analogical creation based on the corresponding singular and 3pl forms [ˈbẽndɔj], [ˈbẽndɔs], [ˈbẽndɔ], [ˈbẽndɔ̃n].

12 The change does not affect nouns in -[aɾjˈɔ], nor ipfv.ind forms with stems in -[ɾ]- such as moriá [muɾjˈɔ] ‘die.ipfv.ind.3sg’.

13 Treatment of the cluster /ɾj/ following a falling diphthong varies by area: e.g. in the ALLOc data plauriá ‘rain.cond.3sg’ is realised [plawɾˈɔ] in Toulouse (31.12), [plawjˈɔ] in Merville (31.01).

14 In the variety of Gartempe (Limousin: Quint, 1996), some third-conjugation verbs display syncretism between ipfv.ind and cond forms which is not limited to identity of desinences, but affects the entire inflected wordform, e.g. [nø vãdjã] ‘sell.ipfv.ind/cond.1pl’. This phenomenon is due to regular sound change, specifically the loss of -r- in the cond (Esher, 2015a, forthcoming). As such, it provides no evidence for analogical redistribution of root/stem material based on the morphomic relationship between ipfv.ind and cond forms discussed here.

15 The terms ‘morphosemantic features’ and ‘morphosyntactic features’ are taken from Corbett (2012, p. 49). Note that Corbett makes no claim about the relative arbitrariness of such features, and that the claim made in this study relates only to such features on verbs.

16 While first-conjugation verbs are certainly more numerous than non-first-conjugation verbs, the precise ratio between the two is unclear. For varieties of the Languedoc, based on Alibèrt (1965), Oliviéri & Sauzet (2016, p. 333) estimate 10,644 first-conjugation lexemes, 1,266 continuants of Latin conjugation IV, and 446 continuants of Latin conjugation III; according to these figures, identity of cond and ipfv.ind desinences would be found in only 14% of verb lexemes. This estimate contrasts considerably, and rather surprisingly, with the results of Tang & Nevins (2013) for diachronic corpora of Spanish, Portuguese and Italian, according to which non-first-conjugation lexemes represent around 30-40% of verb lexemes. Further investigation would be required to determine whether Occitan is indeed such an outlier in Romance with regard to the relative size of conjugations, or whether the disparity is an artefact of the considerable methodological differences between the two studies.
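
The 14% figure in note 16 can be recovered directly from the lexeme counts cited there. The short check below is a sketch only; the dictionary keys and variable names are labels chosen for this illustration.

```python
# Lexeme counts as cited in note 16 (Oliviéri & Sauzet, 2016, p. 333).
counts = {"I": 10644, "IV": 1266, "III": 446}

# Non-first-conjugation lexemes are those showing cond/ipfv.ind identity.
non_first = counts["IV"] + counts["III"]
share = non_first / sum(counts.values())

print(f"{share:.1%}")  # 13.9%, i.e. roughly 14%
```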


