Proto-Uralic: Difference between revisions

From FrathWiki
Jump to navigationJump to search
(new category)
 
(69 intermediate revisions by 2 users not shown)
Line 1: Line 1:
==Subpages & such==
Fully a work in progress. Mistakes may occur.
 
Abbreviations used on these pages:
'''B.''' = Baltic, '''Cf.''' = 'compare', '''En''' = Enets, '''Er.''' = Erźa, '''Es.''' = Estonian, '''F.''' = Finnish, '''Gmc''' = Germanic, '''H.''' = [[Hungarian]], '''Hi.''' = Hill Mari, '''IA''' = Indo-Aryan, '''IE''' = Indo-European, '''II''' = Indo-Iranian, '''K.''' = Komi, '''Ka.''' = Kamass, '''Kh.''' = [[Khanty]], '''Li.''' = Livonian, '''Ma.''' = [[Mari]], '''Me.''' = Meadow Mari, '''Mk.''' = Mokša, '''Mo.''' = [[Mordvinic]], '''Ms.''' = [[Mansi]], '''N''' = North, '''Ne.''' = Nenets, '''Ng.''' = Nganasan, '''P.''' = [[Permic]], '''PU''' = Proto-Uralic, '''S.''' = Samic / South, '''Se.''' = Selkup, '''Smy.''' = Samoyedic, '''U.''' = Udmurt, '''Ve.''' = Veps, '''Võ.''' =
South Estonian (Võro)
 
'''[http://protouralic.wordpress.com/ Now with a blog!]''' <span style="color:red">NB: New URL<span>
 
__NOTOC__
==Development==
* Sound changes to [[Finnish]]
* Sound changes to [[Finnish]]
* Sound changes to [[Mordvinic]]
* Sound changes to [[Proto-Samic]]
* Sound changes to [[Proto-Samoyedic]]
* Sound changes to [[Proto-Samoyedic]]
* [[Proto-Uralic/new|Data for reduced vowels]]
 
* [[Proto-Uralic/A|Data for back unrounded vowels *a *ë]]
==Data subpages==
* [[Proto-Uralic/Ä|Data for *ä]]
In the vowel tables, '''bold''' marks vocalic irregularities, ''italic'' uncertainties in what the regular vocalic reflex is, <font color="red">red</font> consonantal irregularities.
* [[Proto-Uralic/Ü|Data for *ü]]
 
* [[Proto-Uralic/derived|Potential derivativs]]
{|
* [[Proto-Uralic/CC|Cluster issues]]
|-
| Close
| [[Proto-Uralic/I|*i]] • [[Proto-Uralic/Ü|*ü]] || [[Proto-Uralic/new|*ï?]] • [[Proto-Uralic/U|*u]]
|-
| Mid
| [[Proto-Uralic/E|*e ~ *ê ~ *E]] || [[Proto-Uralic/O|*o ~ *ô ~ *O]]
|-
| Open
| [[Proto-Uralic/Ä|*ä]] || [[Proto-Uralic/A|*a, *ë]]
|}
 
Known derivativs with
* [[Proto-Uralic/-ka|-kA]]
* [[Proto-Uralic/-ma|-mA]]
* [[Proto-Uralic/-ta|-tA]]
[[Proto-Uralic/derived|Potential derivativs]] [[Proto-Uralic/CC|Cluster issues]] • [[Proto-Uralic/harmony|Co-occurrence of coronals]] • [[Proto-Uralic/WE|West-East discrepancies]]
 
==Reconstructed phoneme inventory==
 
;Vowels:
<nowiki>*/i ü u e ë o ä a/</nowiki> in the initial syllable. Only a two-way height-based contrast */I A/ is normally reconstructed in later syllables, which may have been realized as [i æ] after front vowels and [ɯ ɑ] after back vowels (ie. with vowel harmony); or as unalternating [ə a]. These pages will use the notation *a~*ä, *ə. A couple family terms suggest different vowels, including #nato "brother's wife", #kälü "spouse's sister", #wäŋü/#wiŋü "son-in-law".
 
Opinions vary on if (1st-syllable) *ë was [ɯ] or [ɤ], and *a [ɑ] or [ɒ]. As of July 2014 I ([[User:Tropylium|Tropylium]]) support [ɤ] for the former, as suggested by e.g. the substitution of Indo-Iranian *a by *ë in loans (unless these words are post-PU in date.) The latter seems like an open question; I am investigating the possibility of [ɒ] as the default value, [ɑ] as a positional allophone.
 
Several amendments have been proposed at times:
* Long vowels or equivalent entities have been a popular proposal, but the main evidence, from Finnic, has recently been adequately explained otherwise.
* A number of studies have proposed, on the basis of the Ugric evidence, to reinterpret several quality contrasts as quantity contrasts instead.<sup>[citation needed]</sup> No wholly systemic account of this idea seems to have been presented to this day.
* Korhonen (1988) proposes an *ï, which would have split at an early date to to front and back allophones, the former then developing into standard *ü.
* This *ï has been reshuffled into two "reduced" or "semi-rounded" vowels *ê *ô ([ɪ ʊ]?) by Häkkinen (2007), which also aim to explain some apparent exceptions in the development of *e and *o. In light of some recently identified conditional sound laws, this may not be a necessary hypothesis. (I still suspect that at least a raised allophone of *e might have existed though.)
 
;Consonants
Nasals */m n ń ŋ/, voiceless stops/affricates */p t ć č k/, voiceless sibilants */s ś š/, a "laryngeal" */x/ (a voiceless velar fricativ [x] seems like the most plausible sound value, though voiced [ɣ] is not ruled out), two "spirants" */d₁ d₂/ (traditionally interpreted as [ð] and [ðʲ] respectivly, though this is far from certain), two liquids */l r/ and two semivowels */w j/.
 
The existence of a distinction between *ć and *ś is not certain. *ć can be securely established only in the clusters *ćć, *ńć, where *ś is not found. The Proto-Uralic status of *š is also uncertain: most words with the consonant are not found in languages east to Permic, and frequently moreover probable Indo-European loanwords.
 
A palatal liquid *ĺ is also commonly found in old reconstructions (e.g. in the ''Uralisches Etymologisches Wörterbuch''), but many of the etyma involved do not really behave. Some may be late inter-branch loans. Most others have been by now accounted for by a widespread development *l > *ĺ in Samoyedic. The "palatal spirant" *d₂ may thus have been the actual palatal liquid; obstruent reflexes are limited to western branches, and external comparisions generally involve laterals. (The "dental spirant" also develops to /l/ widely, but was certainly distinct from *l.)
 
Three further consonants of dubious nature were supported until the mid-20th century: the palatal shibilant *š́ (distinguished from regular *ś only in Mansi), and the retroflex sonorants *ṇ, *ḷ (distinguished from regular *n, *l only in Khanty).
 
;Phonotactics
A notable distributional feature was that *ŋ, *x and probably also *d₁, *r could not occur word-initially.
 
Roots generally had the form (C)V(C)C{A I}, with initial stress. A couple longer stems of the shape (C)V(C)C{A I}C(I) are also attested. Shorter CV roots occurred in grammatical words: pronouns, prepositions and the copula also CV; there was also a single lone-V root, the negativ verb *e-.
 
See below for allowed consonant clusters.
 
Basic consonant correspondences (gradation not included in Finno-Samic, asterisks for Mari and Mordvinic largely superfluous):
 
{| class="wikitable"
|-
! C !! [[Finnish|Finnic]] !! [[Proto-Samic|S.]] !! [[Mordvinic|Mordv.]] !! [[Mari]] !! [[Permic]] !! [[Hungarian|Hung.]] !! [[Mansi|Ms.]] !! [[Khanty|Kh.]] !! [[Proto-Samoyedic|Smy.]] !! Comments
|-
! *m
| *m || *m || *m, *v || *m || *m || m, -v- || *m || *m || *m
| Sporadic lenition in Mo, H. Regular in suffixes in H.
|-
! *n
| rowspan="2"| *n || *n || *n, *ń-&nbsp;/&nbsp;_F || rowspan="2"| *n,<br/>*-ń-&nbsp;/&nbsp;F_F || *n || n || *n || *n || *n
|-
! *ń
| *ń || *ń,<br/>*n-&nbsp;/#_B || *ń || ń || *ń || *ń || *ń
|-
! rowspan="2"| *-ŋ
| rowspan="4"| *v, *ː || rowspan="2"| *ŋ || rowspan="2"| *j&nbsp;/&nbsp;F_,<br/>*v&nbsp;/&nbsp;B_ || rowspan="2"| *ŋ
| rowspan="2"| ń&nbsp;/&nbsp;F_,<br/>n&nbsp;/&nbsp;C_,<br/>m&nbsp;/&nbsp;B_
| rowspan="2"| g || *ŋk || *ŋk || rowspan="2"| *ŋ
| rowspan="2"| Irregularly split in ObU (the more general development is *ŋk)<br/>
Retained in some Erzya & Udmurt dialects
|-
| *ŋ || *ŋ
|-
! *w
| rowspan="2"| *v || rowspan="2"| *v || *w || *v || v || *w || *w || *w
|-
! *-w-
| rowspan="3"| *j&nbsp;/&nbsp;F_,<br/>∅&nbsp;/&nbsp;B_ || rowspan="3"| ∅ || rowspan="3"| *v → -ː- || rowspan="3"| *ɣ || rowspan="3"| *ɣ || rowspan="3"| ∅
|-
! *-x-
| *ː || rowspan="3"| *k || *j
|-
! *-k-
| rowspan="2"| *k || *j, *v
| Mo. split by vowel backness/frontness
|-
! *k
| k || *k || *k, *g || k, h&nbsp;/_B || *k || *k || *k
| rowspan="4"| Stop voicing irregularly split in P.
|-
! *p
| *p || *p || *p, *-v- || *p, *-w- || *p, *b || f, -v- || *p || *p || *p
|-
! *č
| *t, *h || *c || *č || *č || *č, *dž || č, š || *š || *č || *č
|-
! *ć
| rowspan="3"| *s || rowspan="2"| *č || *ś, *-ć- || *ć, *-ź- || *ć, *dź || č, s || *ć, *s || *ć, *s || rowspan="2"| *s
|-
! *ś
| ś, -ź- || rowspan="3"| *š, -ž- || *ś, *-ź- || *s, *š ||  s || *s
|-
! *s
| rowspan="2"| *s || *s, *-z- || *s, *-z- || rowspan="2"| ∅ || rowspan="4"| *t || rowspan="2"| *ɬ || rowspan="4"| *t
|-
! *š
| *h || *š, *-ž- || *š, *-ž-
|-
! *t
| rowspan="4"| *t || rowspan="2"| *t || *t, *tʲ || *t || *t, *d || t || rowspan="2"| *t
|-
! *-t-
| rowspan="3"| *d, *dʲ || *ð || ∅ || z
|-
! *d₂
| rowspan="2"| *ð || *l, *-ð- || *ĺ || ɟ || *ĺ || *j || *j
|-
! *-d₁-
| ∅ || *l, *-∅- || rowspan="2"| l || rowspan="2"| *l || rowspan="2"| *l, *-ɬ- || *r
| lost in Permic only intervocally, not in clusters
|-
! *l
| *l || *l || *l || *l || *l || *l, *j
| In Kh. also irregularly *l → *ɭ
|-
! *j
| *j, *ː || *j || *j || *j || *j || j, ɟ || *j || *j || *j
|-
! *r
| *r || *r || *r || *r || *r || r || *r || *r || *r
<!--
|-
! *-kt-
| *kt || *kt || vt || * || * || t || rowspan="2"| *ɣət,<br/>*təɣ || *ɣəɬ, *ɬəɣ || rowspan="2"| *t
| rowspan="2"| Metathesis in Ob-Ugric in i-stems
|-
! *-ks-
| *ks || *ks || ks || *ks || *sk || v || *ɣət, *təɣ
|-
! *
| * || * ||  || * || * ||  || * || * || *-->
|}
 
Major consonant developments involving specific '''vowel environments''' include:
* the widespread loss of *w before non-open labial vowels, *j before non-open front vowels
** best retained in Permic and Khanty
* the labialization of *k, *ŋ, *x in Mansi to *kʷ, *ŋʷ, *w before and after labial vowels
* the assibilation of *t in Finnic (of any origin, i.e. < *t, *č, *d₁, *d₂) to *c before *i
 
Major soundlaws affecting '''consonant clusters''' include:
* the widespread loss or vocalization of *j and *w
** only Samic consistently retains these, though Finnic in most cases as well and Samoyedic also frequently
** the cluster *lj has widely coalesced to /lʲ/; in Mordvinic all j-clusters yield palatalized consonants
** Mansi and Khanty have retained a few direct traces of *w (generally as delabialized *ɣ)
** indirect traces of such clusters are widely found in the development of vowels
* the shortening of geminates everywhere except in Samic and Finnic
** generally leading to new medial voiceless stops/affricates, after the lenition of the original ones
** geminates other than *kk are merged with the corresponding singletons in Mari, Mansi, Khanty and Samoyedic
* the assimilation of *mt to *nt in Finnic, Mordvinic, Mansi, and probably Permic & Hungarian
* the denasalization of nasal + stop/affricate clusters to voiced stops/affricates in Permic and Hungarian
** the pre-Hungarian voiced affricates have lost their sibilancy: *nč *ńć > *[ʤ ʥ] > /r ɟ/
* the metathesis of *k in Mansi and Khanty, when following a heterorganic consonant
* the loss of *k in Samoyedic, under the same conditions
* after the loss of final vowels, the breaking of most heterorganic consonant clusters by an epenthetic schwa in Hungarian, Mansi and Khanty
** entirely general in Khanty, somewhat limited in Hungarian and Mansi; in many cases consonant clusters are still attested in Old Hungarian
** some traces of a similar process are found in Mari (for *ŋ + sibilant) and Finnic (for *kś) as well
* the loss of *p before other obstruents in Mansi
* the "palatality metathesis" (*Ć-lk, *d₂k, *d₂w >) *ĺɣ > /lɟ/ in Hungarian
* the assimilation of *lm (incl. earlier *d₁m) to *nm in Permic
* the merger and lenition of *pt *kt to *ht in Finnic, *ft in Mordvinic
* the simplification of *čč, *kš to *h in Finnic (via *th, *kh?)


==Medial consonant clusters==
==Medial consonant clusters==


Words included chiefly from appendix from this: http://urn.fi/URN:NBN:fi-fe20071746 (see comments in table code)
Includes chiefly the words from the appendix in Häkkinen (2007).


CF: http://books.google.fi/books?id=TM2NQ78dP2wC&pg=PA492&dq=phonotactics+of+PFU
Cf. Sammallahti (1988): http://books.google.fi/books?id=TM2NQ78dP2wC&pg=PA492&dq=phonotactics+of+PFU


{|
{|
! colspan="2"|2nd →<br/>1st ↓
! colspan="2"|2nd →<br/>1st ↓
! p !! t !! č !! k !! s !! ś !! ð !! w !! j !! l !! r !! m !! n !! ŋ
! p !! t !! č !! k !! s !! ś !! š !! d₁ !! l !! r !! w !! j !! m !! n !! ŋ
! ∑
! ∑
! width="25"| !! Notes !! width="25"| !! Frequency color code
! width="25"| !! Notes !! width="25"| !! Frequency color code
|-
|-
! rowspan="4" valign="center" style="background:#f0f0f0"| N
! rowspan="3" valign="center" style="background:#f0f0f0"| N
! m
! m
| mp <!--lämpi-->|| style="background:#ffffe0"| mt <!--omti tumti---> || ||
| style="background:#ffffe0"| mp <!--kompa lämpe-->|| style="background:#fff0c0"| mt <!--ëmta- omte tumte---> || ||
| ms <!--pImsimi-->|| mś <!--n'ims'a-->||
| ms <!--pImseme-->|| style="background:#ffffe0"| mś <!--n'ims'a kums'e (= Häkk. kuxs'e)-->||
| colspan="7" rowspan="10" align="center" valign="center" style="background:#f0f0f0"| N/A
| colspan="3" rowspan="3" align="center" valign="center" style="background:#f0f0f0"| N/A
| 5
| colspan="5" rowspan="11" align="center" valign="center" style="background:#f0f0f0"| N/A
| || mostly i-stems
| 8
| || mostly i-stems except *kompa *ńimśa
| || single root
| || single root
|-
|-
! n
! n
| || style="background:#fff0c0"| nt <!--tanti- konta- lUnta kënta-->|| style="background:#ffc8c0"| nč <!--kyntSi n'antSa pantSa pontSa wantSa tSantSi--> ||
| || style="background:#ffc8c0"| nt <!--jäntik pintä tante- konta- lUnta kënta-->|| style="background:#ffc8c0"| nč <!--kyntSe n'antSa pantSa pontSa wantSa tSantSe--> ||
| || style="background:#fff0c0"| nś <!--ons'a luns'a kuns'i-->||  
| style="background:#c0f0ff"| || style="background:#fff0c0"| nś <!--ons'a luns'a kuns'e sa/äns'a-->||
| 13
| 16
| ||
| || mostly back-harmonic a-stems; *ns → *nč?
| || style="background:#ffffe0"| two roots
| || style="background:#ffffe0"| two roots
|-
|-
! ń
! ŋ
| || || ||
| || style="background:#ffffe0"| ŋt <!--oNte uNta--> || || style="background:#ffc8c0"| ŋk <!--t(s)uŋke- jäNka s'äNke s'aNka woNki pëNka --->
| || ńś <!--san's'a-->||
| ŋs <!--jUNsi--> || || …
| 1
| 9
| || might not be distinct from *
| || back-harmonic or *ä
| || style="background:#fff0c0"| 3-4 roots
| || style="background:#fff0c0"| 3-4 roots
|-
|-
! ŋ
! rowspan="5" valign="center" style="background:#f0f0f0"| P
| || style="background:#ffffe0"| ŋt <!--oNti uNta--> || || style="background:#ffc8c0"| ŋk <!--jäNka s'äNki s'aNka woNki pëNka--->
! p
| ŋs <!--jUNsi--> || ||
| pp <!--säppa-->|| style="background:#ffffe0"| pt <!--jupta- ëpte-->|| ||
| 8
| style="background:#ffffe0"| ps <!--ipse kupsa--->|| pś <!--lëps'e-->|| … || pd₁ <!--l/däpdä-->
| … ||
| 7
| ||
| ||
| || style="background:#ffc8c0"| 5-6 roots
| || style="background:#ffc8c0"| 5-6 roots
|-
|-
! rowspan="4" valign="center" style="background:#f0f0f0"| P
! t
! p
| || || || style="background:#fff0c0"| tk <!--kätke- kytke- tytke--->
| pp <!--säppa-->|| style="background:#ffffe0"| pt <!--jupta- ëpti-->|| ||
| || || ||
| style="background:#ffffe0"| ps <!--ipsi kupsa--->|| pś <!--lëps'i-->|| pð <!--däpdä-->
| 7
| ||
| ||
| || style="background:#f0c0f0"| ~10 roots
| 3
| || all front-harmonic ə-stems
| || style="background:#f0c0f0"| 10+ roots
|-
|-
! t
! č
| || || || style="background:#fff0c0"| tk <!--kätki- kytki- tytki--->
| || || čč <!--këčče-ra-->|| style="background:#ffffe0"| čk <!--putSke nëtSke-->
| || ||
| || || ||
| <!--"kečlä" = *keččä-lä-->||
| 3
| 3
| ||
| ||
| || style="background:#c0f0ff"| Most suspiciously none
|-
|-
! č
! ć
| || || || čk <!--putSki-->
| || || ||
| || ||
| || || ||
| 1
| … ||
|
|-
|-
! k
! k
| || style="background:#fff0c0"| kt <!--kakta tUkta tUkti ïkta-->|| || kk <!--lykka-->
| || style="background:#ffc8c0"| kt <!--kakta tUkta tUkte ïkta täkta-->|| || kk <!--lükka-->
| style="background:#fff0c0"| ks <!--peksa- suksi mïksa-->|| style="background:#fff0c0"| kś <!--s'yks'i n'uks'i n'ëks'imi-->||
| style="background:#fff0c0"| ks <!--peksa- sukse mïksa-->|| style="background:#fff0c0"| kś <!--s'üks'e n'uks'e n'ëks'eme-->|| style="background:#ffffe0"| kš <!--mIkSe wokSe-->||
| 11
| … || …
| 13
| || but *mekšə "bee" probably separate loans in FP and H.<!-- since reflected differently from *ks in the latter-->
|-
|-
! rowspan="2" valign="center" style="background:#f0f0f0"| S
! rowspan="3" valign="center" style="background:#f0f0f0"| S
! s
! s
| || || || style="background:#ffffe0"| sk <!--poski soski-->
| || || || style="background:#ffffe0"| sk <!--poske soske-->
| || ||
| colspan="6" rowspan="6" align="center" valign="center" style="background:#f0f0f0"| N/A
| 2
| 2
| || both i-stems
| || both o_ə
|-
|-
! ś
! ś
| || || || style="background:#fff0c0"| śk <!--wäs'ka as'kili mos'ki kUs'ki--->
| || || || style="background:#ffc8c0"| śk <!--kis'ko- wäs'ka as'kel mos'ke kUs'ke--->
| || ||
| 5
| 4
| || mostly ə-stems + *wäśka
| || mostly i-stems + *wäśka
|-
|-
| align="center"| (?)
! š
! x
| || || ||
| || style="background:#ffffe0"| xt <!--wixti kuxti-->|| xč <!--oxtSi-->||
|
| xs <!--koxsi-->|| xś <!--kuxs'i-->|| (xð) <!--oxdi-->
| || style="background:#ffffe0"| xj <!--sexji (woxji)-->|| style="background:#f0c0f0"| xl <!--käxli pexli n'exli koxli n'oxli poxli woxli kUxli tuxli n'ëxli-->|| (xr) <!--woxri-->
| style="background:#fff0c0"| xm <!--läxmi d'ëxmi s'ëxmi-->|| xn <!--sëxni--> ||
| 23
| || i-stems only, some may be loans with long vowels
|-
|-
! rowspan="3" valign="center" style="background:#f0f0f0"| L
! rowspan="2" valign="center" style="background:#f0f0f0"| L
! l
! l
| || lt <!--kUlta--> || || style="background:#f0c0f0"| lk <!--jälki mälki pelka wolka tulka ulki kulki s'ëlka- wëlka--->
| || lt <!--kUlta--> || || style="background:#f0c0f0"| lk <!--jälke mälke pelka s'ülke wolka tulka ulke kulke n(')olke kIlke- s'ëlka- wëlka jalka--->
| || ||
| style="background:#fff0c0"| lw <!--talwa tolwa pëlwa-->|| style="background:#ffffe0"| lj <!--n'elja pelja-->
| style="background:#fff0c0"| lw <!--talwa tolwa pëlwa-->|| style="background:#ffffe0"| lj <!--n'elja pelja-->|| ||
|style="background:#ffc8c0"| lm <!--n'elma Ilma s'ilma pilmeta kUlme-->|| || lŋ <!--pIlNe-->
| style="background:#ffc8c0"| lm <!--n'elma Ilma s'ilma pilmita kUlmi-->|| || lŋ <!--pIlNi-->
| 25
| 21
| || *lw *lj only a-stems; *lt may be derived ← *-lk-t-
|-
! lʲ
| || || || lʲk <!--syl'ki-->
| || ||
| || || ||
| || ||
| 1
|-
|-
! r
! r
| style="background:#ffffe0"| rp <!--korpi tUrpa--->|| style="background:#ffffe0"| rt <!--pärta sarta-->|| || style="background:#fff0c0"| rk <!--sarka kurki purki n'ërki-->
| style="background:#ffffe0"| rp <!--korpi tUrpa--->|| style="background:#ffffe0"| rt <!--pärta sarta-->|| || style="background:#fff0c0"| rk <!--sarka kurke purke n'ërke-->
| || ||
| style="background:#ffffe0"| rw <!--jerwV- kIrwa---> ||
| rw <!--kIrwa---> || || ||
| rm <!--n'urme--> || ||
| rm <!--n'urmi--> || ||
| 11
| 10
| || mostly back-harmonic
| || mostly back-harmonic
|-
|-
| align="center"| (?)
| align="center" valign="center" | (?)
! ðʲ
! d₁/d₂
| || || || ðʲk <!--s'ëd'ka-->
| || || || d₁k <!--s'ëdka-->
| || ||
| d₂w <!--käd'wa-->||
| ðʲw <!--käd'wa-->|| || ||
| dₓm <!--s'eDmä--> || ||
| || ||
| 3
| 2
|-
|-
! rowspan="2" valign="center" style="background:#f0f0f0"| sV
! rowspan="2" valign="center" style="background:#f0f0f0"| sV
! w
! w
| colspan="6" rowspan="2" align="center" valign="center" style="background:#f0f0f0"| N/A?
| || … || ||
| style="background:#ffffe0"| <!--käwdi täwdi--->
| ws <!--kowse--> || ||
| <!--s'awNa--> || wj <!--jäwji-->|| wl <!--lewli-->||
| style="background:#fff0c0"| wd₁ <!--owdem käwde täwde--->
| || || wŋ
| wl <!--lewle-->|| || || wj <!--jewje-->
| 5
| || wn <!--säwnV--> || style="background:#ffffe0"| wŋ <!--owNe s'awNa-->
| || after e ä a only
| 9
| || after e ä a o only
|-
|-
! j
! j
|
| … || … || || ||
| style="background:#fff0c0"| jw <!--päjwa kajwa ojwa-->|| || ||
| || ||
| jm <!--äjmä-->|| || style="background:#ffffe0"| jŋ <!--wajNi ajNi-->
| || … || style="background:#fff0c0"| jw <!--päjwa kajwa ojwa--> ||
| jm <!--äjmä-->|| || style="background:#ffffe0"| jŋ <!--wajNe ajNe-->
| 6
| 6
| || after ä a o only
| || after ä a o only
|-
|-
! colspan="2" align="center"| ∑
! colspan="2" align="center"| ∑
| 4 || 17 || 7 || 31
| 5 || 19 || 7 || 37
| 8 || 10 || 4
| 8 || 10 || 2 || 4
| 8 || 5 || 11 || (1)
| 1 || 0 || 9 || 5
| 10 || 1 || 4
| 8 || 1 || 5
|}
|}


[[Category:Natlangs]]
Not all blank'd cells were necessarily impossible: some roots of limited distribution have examples of *kč, *pš, *kš, *pl, *ćl, *kl, *kr, *čt, *tt, *st, *śt, *št, *šk, *ćk, *nš, *ŋš, *mč, *lp, *lč, *ln, *rč, *rj, *rn, *rŋ, *jp, *jt, *jr, *jj, *jń, *wt (mark'd with an ellipsis in the table).
 
==Literature==
;Current standard literature
* Janhunen, Juha (1981): Uralilaisen kantakielen sanastosta
* Janhunen, Juha (1982): On the structure of Proto-Uralic
* Sammallahti, Pekka (1988): Historical Phonology of the Uralic languages
 
; Nonstandard proposals
* Häkkinen, Jaakko (2007): ''Kantauralin murteutuminen vokaalivastaavuuksien valossa''. Master's thesis. http://urn.fi/URN:NBN:fi-fe20071746
* Korhonen, Mikko (1988): Uralilaisten kielten jälkitavujen vokaaliston historiaa
 
;Dated reconstructions
* Collinder, Björn (1960): ''Comparative Grammar of the Uralic languages''
* Itkonen, Erkki (1946): Zur Frage nach der Entwicklung des Vokalismus der ersten Silbe in den finnisch-ugrischen Sprachen, insbesondere in Mordwinischen. ''Finnisch-Ugrische Forschungen'' 29.
* Steinitz, Wolfgang (1944): ''Geschichte des finnisch-ugrischen Vokalismus''. Stockholm.
* Sammallahti, Pekka (1979): Über die Laut- und Morphemstruktur der uralischen Grundsprache. ''Finnisch-Ugrische Forschungen'' 43.
 
{{Natlangs}}
[[Category:Reconstructed languages]]
[[Category:Reconstructed languages]]

Latest revision as of 12:59, 3 July 2014

Fully a work in progress. Mistakes may occur.

Abbreviations used on these pages: B. = Baltic, Cf. = 'compare', En = Enets, Er. = Erźa, Es. = Estonian, F. = Finnish, Gmc = Germanic, H. = Hungarian, Hi. = Hill Mari, IA = Indo-Aryan, IE = Indo-European, II = Indo-Iranian, K. = Komi, Ka. = Kamass, Kh. = Khanty, Li. = Livonian, Ma. = Mari, Me. = Meadow Mari, Mk. = Mokša, Mo. = Mordvinic, Ms. = Mansi, N = North, Ne. = Nenets, Ng. = Nganasan, P. = Permic, PU = Proto-Uralic, S. = Samic / South, Se. = Selkup, Smy. = Samoyedic, U. = Udmurt, Ve. = Veps, Võ. = South Estonian (Võro)

Now with a blog! NB: New URL


Development

Data subpages

In the vowel tables, bold marks vocalic irregularities, italic uncertainties in what the regular vocalic reflex is, red consonantal irregularities.

Close *i *ï?*u
Mid *e ~ *ê ~ *E *o ~ *ô ~ *O
Open *a, *ë

Known derivativs with

Potential derivativsCluster issuesCo-occurrence of coronalsWest-East discrepancies

Reconstructed phoneme inventory

Vowels

*/i ü u e ë o ä a/ in the initial syllable. Only a two-way height-based contrast */I A/ is normally reconstructed in later syllables, which may have been realized as [i æ] after front vowels and [ɯ ɑ] after back vowels (ie. with vowel harmony); or as unalternating [ə a]. These pages will use the notation *a~*ä, *ə. A couple family terms suggest different vowels, including #nato "brother's wife", #kälü "spouse's sister", #wäŋü/#wiŋü "son-in-law".

Opinions vary on if (1st-syllable) *ë was [ɯ] or [ɤ], and *a [ɑ] or [ɒ]. As of July 2014 I (Tropylium) support [ɤ] for the former, as suggested by e.g. the substitution of Indo-Iranian *a by *ë in loans (unless these words are post-PU in date.) The latter seems like an open question; I am investigating the possibility of [ɒ] as the default value, [ɑ] as a positional allophone.

Several amendments have been proposed at times:

  • Long vowels or equivalent entities have been a popular proposal, but the main evidence, from Finnic, has recently been adequately explained otherwise.
  • A number of studies have proposed, on the basis of the Ugric evidence, to reinterpret several quality contrasts as quantity contrasts instead.[citation needed] No wholly systemic account of this idea seems to have been presented to this day.
  • Korhonen (1988) proposes an *ï, which would have split at an early date to to front and back allophones, the former then developing into standard *ü.
  • This *ï has been reshuffled into two "reduced" or "semi-rounded" vowels *ê *ô ([ɪ ʊ]?) by Häkkinen (2007), which also aim to explain some apparent exceptions in the development of *e and *o. In light of some recently identified conditional sound laws, this may not be a necessary hypothesis. (I still suspect that at least a raised allophone of *e might have existed though.)
Consonants

Nasals */m n ń ŋ/, voiceless stops/affricates */p t ć č k/, voiceless sibilants */s ś š/, a "laryngeal" */x/ (a voiceless velar fricativ [x] seems like the most plausible sound value, though voiced [ɣ] is not ruled out), two "spirants" */d₁ d₂/ (traditionally interpreted as [ð] and [ðʲ] respectivly, though this is far from certain), two liquids */l r/ and two semivowels */w j/.

The existence of a distinction between *ć and *ś is not certain. *ć can be securely established only in the clusters *ćć, *ńć, where *ś is not found. The Proto-Uralic status of *š is also uncertain: most words with the consonant are not found in languages east to Permic, and frequently moreover probable Indo-European loanwords.

A palatal liquid *ĺ is also commonly found in old reconstructions (e.g. in the Uralisches Etymologisches Wörterbuch), but many of the etyma involved do not really behave. Some may be late inter-branch loans. Most others have been by now accounted for by a widespread development *l > *ĺ in Samoyedic. The "palatal spirant" *d₂ may thus have been the actual palatal liquid; obstruent reflexes are limited to western branches, and external comparisions generally involve laterals. (The "dental spirant" also develops to /l/ widely, but was certainly distinct from *l.)

Three further consonants of dubious nature were supported until the mid-20th century: the palatal shibilant *š́ (distinguished from regular *ś only in Mansi), and the retroflex sonorants *ṇ, *ḷ (distinguished from regular *n, *l only in Khanty).

Phonotactics

A notable distributional feature was that *ŋ, *x and probably also *d₁, *r could not occur word-initially.

Roots generally had the form (C)V(C)C{A I}, with initial stress. A couple longer stems of the shape (C)V(C)C{A I}C(I) are also attested. Shorter CV roots occurred in grammatical words: pronouns, prepositions and the copula also CV; there was also a single lone-V root, the negativ verb *e-.

See below for allowed consonant clusters.

Basic consonant correspondences (gradation not included in Finno-Samic, asterisks for Mari and Mordvinic largely superfluous):

C Finnic S. Mordv. Mari Permic Hung. Ms. Kh. Smy. Comments
*m *m *m *m, *v *m *m m, -v- *m *m *m Sporadic lenition in Mo, H. Regular in suffixes in H.
*n *n *n *n, *ń- / _F *n,
*-ń- / F_F
*n n *n *n *n
*ń,
*n- /#_B
ń
*-ŋ *v, *ː *j / F_,
*v / B_
ń / F_,
n / C_,
m / B_
g *ŋk *ŋk Irregularly split in ObU (the more general development is *ŋk)

Retained in some Erzya & Udmurt dialects

*w *v *v *w *v v *w *w *w
*-w- *j / F_,
∅ / B_
*v → -ː-
*-x- *k *j
*-k- *k *j, *v Mo. split by vowel backness/frontness
*k k *k *k, *g k, h /_B *k *k *k Stop voicing irregularly split in P.
*p *p *p *p, *-v- *p, *-w- *p, *b f, -v- *p *p *p
*t, *h *c *č, *dž č, š
*s *ś, *-ć- *ć, *-ź- *ć, *dź č, s *ć, *s *ć, *s *s
ś, -ź- *š, -ž- *ś, *-ź- *s, *š s *s
*s *s *s, *-z- *s, *-z- *t *t
*h *š, *-ž- *š, *-ž-
*t *t *t *t, *tʲ *t *t, *d t *t
*-t- *d, *dʲ z
*d₂ *l, *-ð- ɟ *j *j
*-d₁- *l, *-∅- l *l *l, *-ɬ- *r lost in Permic only intervocally, not in clusters
*l *l *l *l *l *l *l, *j In Kh. also irregularly *l → *ɭ
*j *j, *ː *j *j *j *j j, ɟ *j *j *j
*r *r *r *r *r *r r *r *r *r

Major consonant developments involving specific vowel environments include:

  • the widespread loss of *w before non-open labial vowels, *j before non-open front vowels
    • best retained in Permic and Khanty
  • the labialization of *k, *ŋ, *x in Mansi to *kʷ, *ŋʷ, *w before and after labial vowels
  • the assibilation of *t in Finnic (of any origin, i.e. < *t, *č, *d₁, *d₂) to *c before *i

Major soundlaws affecting consonant clusters include:

  • the widespread loss or vocalization of *j and *w
    • only Samic consistently retains these, though Finnic in most cases as well and Samoyedic also frequently
    • the cluster *lj has widely coalesced to /lʲ/; in Mordvinic all j-clusters yield palatalized consonants
    • Mansi and Khanty have retained a few direct traces of *w (generally as delabialized *ɣ)
    • indirect traces of such clusters are widely found in the development of vowels
  • the shortening of geminates everywhere except in Samic and Finnic
    • generally leading to new medial voiceless stops/affricates, after the lenition of the original ones
    • geminates other than *kk are merged with the corresponding singletons in Mari, Mansi, Khanty and Samoyedic
  • the assimilation of *mt to *nt in Finnic, Mordvinic, Mansi, and probably Permic & Hungarian
  • the denasalization of nasal + stop/affricate clusters to voiced stops/affricates in Permic and Hungarian
    • the pre-Hungarian voiced affricates have lost their sibilancy: *nč *ńć > *[ʤ ʥ] > /r ɟ/
  • the metathesis of *k in Mansi and Khanty, when following a heterorganic consonant
  • the loss of *k in Samoyedic, under the same conditions
  • after the loss of final vowels, the breaking of most heterorganic consonant clusters by an epenthetic schwa in Hungarian, Mansi and Khanty
    • entirely general in Khanty, somewhat limited in Hungarian and Mansi; in many cases consonant clusters are still attested in Old Hungarian
    • some traces of a similar process are found in Mari (for *ŋ + sibilant) and Finnic (for *kś) as well
  • the loss of *p before other obstruents in Mansi
  • the "palatality metathesis" (*Ć-lk, *d₂k, *d₂w >) *ĺɣ > /lɟ/ in Hungarian
  • the assimilation of *lm (incl. earlier *d₁m) to *nm in Permic
  • the merger and lenition of *pt *kt to *ht in Finnic, *ft in Mordvinic
  • the simplification of *čč, *kš to *h in Finnic (via *th, *kh?)

Medial consonant clusters

Includes chiefly the words from the appendix in Häkkinen (2007).

Cf. Sammallahti (1988): http://books.google.fi/books?id=TM2NQ78dP2wC&pg=PA492&dq=phonotactics+of+PFU

2nd →
1st ↓
p t č k s ś š d₁ l r w j m n ŋ Notes Frequency color code
N m mp mt ms N/A N/A 8 mostly i-stems except *kompa *ńimśa single root
n nt 16 mostly back-harmonic a-stems; *ns → *nč? two roots
ŋ ŋt ŋk ŋs 9 back-harmonic or *ä 3-4 roots
P p pp pt ps pd₁ 7 5-6 roots
t tk 3 all front-harmonic ə-stems 10+ roots
č čč čk 3 Most suspiciously none
ć
k kt kk ks 13 but *mekšə "bee" probably separate loans in FP and H.
S s sk N/A 2 both o_ə
ś śk 5 mostly ə-stems + *wäśka
š
L l lt lk lw lj lm 25 *lw *lj only a-stems; *lt may be derived ← *-lk-t-
r rp rt rk rw rm 11 mostly back-harmonic
(?) d₁/d₂ d₁k d₂w dₓm 3
sV w ws wd₁ wl wj wn 9 after e ä a o only
j jw jm 6 after ä a o only
5 19  37 8 10 2 4 1 0 9 5 8 1 5

Not all blank'd cells were necessarily impossible: some roots of limited distribution have examples of *kč, *pš, *kš, *pl, *ćl, *kl, *kr, *čt, *tt, *st, *śt, *št, *šk, *ćk, *nš, *ŋš, *mč, *lp, *lč, *ln, *rč, *rj, *rn, *rŋ, *jp, *jt, *jr, *jj, *jń, *wt (mark'd with an ellipsis in the table).

Literature

Current standard literature
  • Janhunen, Juha (1981): Uralilaisen kantakielen sanastosta
  • Janhunen, Juha (1982): On the structure of Proto-Uralic
  • Sammallahti, Pekka (1988): Historical Phonology of the Uralic languages
Nonstandard proposals
  • Häkkinen, Jaakko (2007): Kantauralin murteutuminen vokaalivastaavuuksien valossa. Master's thesis. http://urn.fi/URN:NBN:fi-fe20071746
  • Korhonen, Mikko (1988): Uralilaisten kielten jälkitavujen vokaaliston historiaa
Dated reconstructions
  • Collinder, Björn (1960): Comparative Grammar of the Uralic languages
  • Itkonen, Erkki (1946): Zur Frage nach der Entwicklung des Vokalismus der ersten Silbe in den finnisch-ugrischen Sprachen, insbesondere in Mordwinischen. Finnisch-Ugrische Forschungen 29.
  • Steinitz, Wolfgang (1944): Geschichte des finnisch-ugrischen Vokalismus. Stockholm.
  • Sammallahti, Pekka (1979): Über die Laut- und Morphemstruktur der uralischen Grundsprache. Finnisch-Ugrische Forschungen 43.
This article is one of quite a few pages about Natlangs.

Indo-european natlangs:

Balto-Slavic Natlangs: Czech * Russian
Celtic Natlangs: Revived Middle Cornish * Pictish
Germanic Natlangs:
North Germanic Natlangs: Norwegian
West Germanic Natlangs: Anglo-Saxon * Dutch * English (Old English * Middle English * Modern English * Scots) * German (High German * Low German)
Indo-Iranian Natlangs: Pahlavi
Italic Natlangs: French * Italian * Latin * Spanish
Debated: Cimmerian

Uralic Natlangs: Finnish * Khanty * Mansi * Mordvinic * Proto-Uralic
Altaic (controversial): Japanese
Sino-Tibetan Natlangs:
Uto-Aztecan Natlangs: Nahuatl

-

Isolate Natlangs: Basque * *
Hypothetical/debated Natlangs and Natlang families: Danubian * Europic (obsolete)