Proto-Uralic: Difference between revisions

Latest revision as of 12:59, 3 July 2014

Fully a work in progress. Mistakes may occur.

Abbreviations used on these pages: B. = Baltic, Cf. = 'compare', En = Enets, Er. = Erźa, Es. = Estonian, F. = Finnish, Gmc = Germanic, H. = Hungarian, Hi. = Hill Mari, IA = Indo-Aryan, IE = Indo-European, II = Indo-Iranian, K. = Komi, Ka. = Kamass, Kh. = Khanty, Li. = Livonian, Ma. = Mari, Me. = Meadow Mari, Mk. = Mokša, Mo. = Mordvinic, Ms. = Mansi, N = North, Ne. = Nenets, Ng. = Nganasan, P. = Permic, PU = Proto-Uralic, S. = Samic / South, Se. = Selkup, Smy. = Samoyedic, U. = Udmurt, Ve. = Veps, Võ. = South Estonian (Võro)

Now with a blog! NB: New URL

Development

Sound changes to Finnish
Sound changes to Mordvinic
Sound changes to Proto-Samic
Sound changes to Proto-Samoyedic

Data subpages

In the vowel tables, bold marks vocalic irregularities, italic uncertainties in what the regular vocalic reflex is, red consonantal irregularities.

Close	i • ü	ï? • u
Mid	e ~ ê ~ *E	o ~ ô ~ *O
Open	*ä	a, ë

Known derivativs with

Potential derivativs • Cluster issues • Co-occurrence of coronals • West-East discrepancies

Reconstructed phoneme inventory

Vowels

*/i ü u e ë o ä a/ in the initial syllable. Only a two-way height-based contrast */I A/ is normally reconstructed in later syllables, which may have been realized as [i æ] after front vowels and [ɯ ɑ] after back vowels (ie. with vowel harmony); or as unalternating [ə a]. These pages will use the notation *a~*ä, *ə. A couple family terms suggest different vowels, including #nato "brother's wife", #kälü "spouse's sister", #wäŋü/#wiŋü "son-in-law".

Opinions vary on if (1st-syllable) *ë was [ɯ] or [ɤ], and *a [ɑ] or [ɒ]. As of July 2014 I (Tropylium) support [ɤ] for the former, as suggested by e.g. the substitution of Indo-Iranian *a by *ë in loans (unless these words are post-PU in date.) The latter seems like an open question; I am investigating the possibility of [ɒ] as the default value, [ɑ] as a positional allophone.

Several amendments have been proposed at times:

Long vowels or equivalent entities have been a popular proposal, but the main evidence, from Finnic, has recently been adequately explained otherwise.
A number of studies have proposed, on the basis of the Ugric evidence, to reinterpret several quality contrasts as quantity contrasts instead.^{[citation needed]} No wholly systemic account of this idea seems to have been presented to this day.
Korhonen (1988) proposes an *ï, which would have split at an early date to to front and back allophones, the former then developing into standard *ü.
This *ï has been reshuffled into two "reduced" or "semi-rounded" vowels *ê *ô ([ɪ ʊ]?) by Häkkinen (2007), which also aim to explain some apparent exceptions in the development of *e and *o. In light of some recently identified conditional sound laws, this may not be a necessary hypothesis. (I still suspect that at least a raised allophone of *e might have existed though.)

Consonants

Nasals */m n ń ŋ/, voiceless stops/affricates */p t ć č k/, voiceless sibilants */s ś š/, a "laryngeal" */x/ (a voiceless velar fricativ [x] seems like the most plausible sound value, though voiced [ɣ] is not ruled out), two "spirants" */d₁ d₂/ (traditionally interpreted as [ð] and [ðʲ] respectivly, though this is far from certain), two liquids */l r/ and two semivowels */w j/.

The existence of a distinction between *ć and *ś is not certain. *ć can be securely established only in the clusters *ćć, *ńć, where *ś is not found. The Proto-Uralic status of *š is also uncertain: most words with the consonant are not found in languages east to Permic, and frequently moreover probable Indo-European loanwords.

A palatal liquid *ĺ is also commonly found in old reconstructions (e.g. in the Uralisches Etymologisches Wörterbuch), but many of the etyma involved do not really behave. Some may be late inter-branch loans. Most others have been by now accounted for by a widespread development *l > *ĺ in Samoyedic. The "palatal spirant" *d₂ may thus have been the actual palatal liquid; obstruent reflexes are limited to western branches, and external comparisions generally involve laterals. (The "dental spirant" also develops to /l/ widely, but was certainly distinct from *l.)

Three further consonants of dubious nature were supported until the mid-20th century: the palatal shibilant *š́ (distinguished from regular *ś only in Mansi), and the retroflex sonorants *ṇ, *ḷ (distinguished from regular *n, *l only in Khanty).

Phonotactics

A notable distributional feature was that *ŋ, *x and probably also *d₁, *r could not occur word-initially.

Roots generally had the form (C)V(C)C{A I}, with initial stress. A couple longer stems of the shape (C)V(C)C{A I}C(I) are also attested. Shorter CV roots occurred in grammatical words: pronouns, prepositions and the copula also CV; there was also a single lone-V root, the negativ verb *e-.

See below for allowed consonant clusters.

Basic consonant correspondences (gradation not included in Finno-Samic, asterisks for Mari and Mordvinic largely superfluous):

C	Finnic	S.	Mordv.	Mari	Permic	Hung.	Ms.	Kh.	Smy.	Comments
*m	*m	*m	m, v	*m	*m	m, -v-	*m	*m	*m	Sporadic lenition in Mo, H. Regular in suffixes in H.
*n	*n	*n	n, ń- / _F	n, -ń- / F_F	*n	n	*n	*n	*n
*ń	*n	*ń	ń, n- /#_B	n, -ń- / F_F	*ń	ń	*ń	*ń	*ń
*-ŋ	v, ː	*ŋ	j / F_, v / B_	*ŋ	ń / F_, n / C_, m / B_	g	*ŋk	*ŋk	*ŋ	Irregularly split in ObU (the more general development is *ŋk) Retained in some Erzya & Udmurt dialects
*-ŋ		*ŋ	j / F_, v / B_	*ŋ	ń / F_, n / C_, m / B_	g	*ŋ	*ŋ	*ŋ
*w		*v	*v	*w	*v	v	*w	*w	*w
*-w-		*v	*v	*j / F_, ∅ / B_	∅	*v → -ː-	*ɣ	*ɣ	∅
*-x-	*ː	*k	*j
*-k-	*k		j, v							Mo. split by vowel backness/frontness
*k	*k		k	*k	k, g	k, h /_B	*k	*k	*k	Stop voicing irregularly split in P.
*p	*p	*p	p, -v-	p, -w-	p, b	f, -v-	*p	*p	*p
*č	t, h	*c	*č	*č	č, dž	č, š	*š	*č	*č
*ć	*s	*č	ś, -ć-	ć, -ź-	ć, dź	č, s	ć, s	ć, s	*s
*ś		*č	ś, -ź-	*š, -ž-	ś, -ź-	s, š	s	*s	*s
*s		*s	s, -z-		s, -z-	∅	*t	*ɬ	*t
*š	*h	*s	š, -ž-		š, -ž-	∅		*ɬ
*t	*t	*t	t, tʲ	*t	t, d	t		*t
*-t-		*t	d, dʲ	*ð	∅	z		*t
*d₂		*ð		l, -ð-	*ĺ	ɟ	*ĺ	*j	*j
*-d₁-		*ð		∅	l, -∅-	l	*l	l, -ɬ-	*r	lost in Permic only intervocally, not in clusters
*l	*l	*l	*l	*l	*l	l	*l	l, -ɬ-	l, j	In Kh. also irregularly l → ɭ
*j	j, ː	*j	*j	*j	*j	j, ɟ	*j	*j	*j
*r	*r	*r	*r	*r	*r	r	*r	*r	*r

Major consonant developments involving specific vowel environments include:

the widespread loss of *w before non-open labial vowels, *j before non-open front vowels
- best retained in Permic and Khanty
the labialization of *k, *ŋ, *x in Mansi to *kʷ, *ŋʷ, *w before and after labial vowels
the assibilation of *t in Finnic (of any origin, i.e. < *t, *č, *d₁, *d₂) to *c before *i

Major soundlaws affecting consonant clusters include:

the widespread loss or vocalization of *j and *w
- only Samic consistently retains these, though Finnic in most cases as well and Samoyedic also frequently
- the cluster *lj has widely coalesced to /lʲ/; in Mordvinic all j-clusters yield palatalized consonants
- Mansi and Khanty have retained a few direct traces of *w (generally as delabialized *ɣ)
- indirect traces of such clusters are widely found in the development of vowels
the shortening of geminates everywhere except in Samic and Finnic
- generally leading to new medial voiceless stops/affricates, after the lenition of the original ones
- geminates other than *kk are merged with the corresponding singletons in Mari, Mansi, Khanty and Samoyedic
the assimilation of *mt to *nt in Finnic, Mordvinic, Mansi, and probably Permic & Hungarian
the denasalization of nasal + stop/affricate clusters to voiced stops/affricates in Permic and Hungarian
- the pre-Hungarian voiced affricates have lost their sibilancy: *nč *ńć > *[ʤ ʥ] > /r ɟ/
the metathesis of *k in Mansi and Khanty, when following a heterorganic consonant
the loss of *k in Samoyedic, under the same conditions
after the loss of final vowels, the breaking of most heterorganic consonant clusters by an epenthetic schwa in Hungarian, Mansi and Khanty
- entirely general in Khanty, somewhat limited in Hungarian and Mansi; in many cases consonant clusters are still attested in Old Hungarian
- some traces of a similar process are found in Mari (for *ŋ + sibilant) and Finnic (for *kś) as well
the loss of *p before other obstruents in Mansi
the "palatality metathesis" (*Ć-lk, *d₂k, *d₂w >) *ĺɣ > /lɟ/ in Hungarian
the assimilation of *lm (incl. earlier *d₁m) to *nm in Permic
the merger and lenition of *pt *kt to *ht in Finnic, *ft in Mordvinic
the simplification of *čč, *kš to *h in Finnic (via *th, *kh?)

Medial consonant clusters

Includes chiefly the words from the appendix in Häkkinen (2007).

Cf. Sammallahti (1988): http://books.google.fi/books?id=TM2NQ78dP2wC&pg=PA492&dq=phonotactics+of+PFU

2nd → 1st ↓		p	t	č	k	s	ś	š	d₁	l	r	w	j	m	n	ŋ	∑	Notes	Frequency color code
N	m	mp	mt	…		ms	mś		N/A			N/A					8	mostly i-stems except kompa ńimśa	single root
	n		nt	nč			nś	…									16	mostly back-harmonic a-stems; ns → nč?	two roots
	ŋ		ŋt		ŋk	ŋs		…									9	back-harmonic or *ä	3-4 roots
P	p	pp	pt			ps	pś	…	pd₁	…							7		5-6 roots
	t		…		tk												3	all front-harmonic ə-stems	10+ roots
	č		…	čč	čk												3		Most suspiciously none
	ć				…					…							…
	k		kt	…	kk	ks	kś	kš		…	…						13	but *mekšə "bee" probably separate loans in FP and H.
S	s		…		sk	N/A											2	both o_ə
	ś		…		śk												5	mostly ə-stems + *wäśka
	š		…		…												…
L	l	…	lt	…	lk							lw	lj	lm	…	lŋ	25	lw lj only a-stems; lt may be derived ← -lk-t-
L	r	rp	rt	…	rk							rw	…	rm	…	…	11	mostly back-harmonic
(?)	d₁/d₂				d₁k							d₂w		dₓm			3
sV	w		…			ws			wd₁	wl			wj		wn	wŋ	9	after e ä a o only
sV	j	…	…								…	jw	…	jm	…	jŋ	6	after ä a o only
∑		5	19	7	37	8	10	2	4	1	0	9	5	8	1	5

Not all blank'd cells were necessarily impossible: some roots of limited distribution have examples of *kč, *pš, *kš, *pl, *ćl, *kl, *kr, *čt, *tt, *st, *śt, *št, *šk, *ćk, *nš, *ŋš, *mč, *lp, *lč, *ln, *rč, *rj, *rn, *rŋ, *jp, *jt, *jr, *jj, *jń, *wt (mark'd with an ellipsis in the table).

Literature

Current standard literature

Janhunen, Juha (1981): Uralilaisen kantakielen sanastosta
Janhunen, Juha (1982): On the structure of Proto-Uralic
Sammallahti, Pekka (1988): Historical Phonology of the Uralic languages

Nonstandard proposals

Häkkinen, Jaakko (2007): Kantauralin murteutuminen vokaalivastaavuuksien valossa. Master's thesis. http://urn.fi/URN:NBN:fi-fe20071746
Korhonen, Mikko (1988): Uralilaisten kielten jälkitavujen vokaaliston historiaa

Dated reconstructions

Collinder, Björn (1960): Comparative Grammar of the Uralic languages
Itkonen, Erkki (1946): Zur Frage nach der Entwicklung des Vokalismus der ersten Silbe in den finnisch-ugrischen Sprachen, insbesondere in Mordwinischen. Finnisch-Ugrische Forschungen 29.
Steinitz, Wolfgang (1944): Geschichte des finnisch-ugrischen Vokalismus. Stockholm.
Sammallahti, Pekka (1979): Über die Laut- und Morphemstruktur der uralischen Grundsprache. Finnisch-Ugrische Forschungen 43.

This article is one of quite a few pages about Natlangs.

Indo-european natlangs:

Balto-Slavic Natlangs: Czech * Russian

Celtic Natlangs: Revived Middle Cornish * Pictish

Germanic Natlangs:

North Germanic Natlangs: Norwegian

West Germanic Natlangs: Anglo-Saxon * Dutch * English (Old English * Middle English * Modern English * Scots) * German (High German * Low German)

Indo-Iranian Natlangs: Pahlavi

Italic Natlangs: French * Italian * Latin * Spanish

Debated: Cimmerian

Uralic Natlangs: Finnish * Khanty * Mansi * Mordvinic * Proto-Uralic
Altaic (controversial): Japanese
Sino-Tibetan Natlangs:
Uto-Aztecan Natlangs: Nahuatl

-

Isolate Natlangs: Basque * *
Hypothetical/debated Natlangs and Natlang families: Danubian * Europic (obsolete)

Close	i • ü	ï? • u
Mid	e ~ ê ~ *E	o ~ ô ~ *O
Open	*ä	a, ë

Proto-Uralic: Difference between revisions

Latest revision as of 12:59, 3 July 2014

Development

Data subpages

Reconstructed phoneme inventory

Medial consonant clusters

Literature

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools

@@ Line 2: / Line 2: @@
 Abbreviations used on these pages:
-'''B.''' = Baltic, '''Cf.''' = 'compare', '''En''' = Enets, '''Er.''' = Erźa, '''Es.''' = Estonian, '''F.''' = Finnish, '''Gmc.''' = Germanic, '''H.''' = Hungarian, '''Hi.''' = Hill Mari, '''IA''' = Indo-Aryan, '''IE''' = Indo-European, '''K.''' = Komi, '''Ka.''' = Kamass, '''Kh.''' = Khanty, '''Li.''' = Livonian, '''Ma.''' = Mari, '''Me.''' = Meadow Mari, '''Mk.''' = Mokša, '''Mo.''' = Mordvinic, '''Ms.''' = Mansi, '''N''' = North, '''Ne.''' = Nenets, '''Ng.''' = Nganasan, '''P.''' = Permic, '''PU''' = Proto-Uralic, '''S.''' = Samic, South, '''Se.''' = Selkup, '''Smy.''' = Samoyedic, '''U.''' = Udmurt, '''Ve.''' = Veps, '''Võ.''' =
+'''B.''' = Baltic, '''Cf.''' = 'compare', '''En''' = Enets, '''Er.''' = Erźa, '''Es.''' = Estonian, '''F.''' = Finnish, '''Gmc''' = Germanic, '''H.''' = [[Hungarian]], '''Hi.''' = Hill Mari, '''IA''' = Indo-Aryan, '''IE''' = Indo-European, '''II''' = Indo-Iranian, '''K.''' = Komi, '''Ka.''' = Kamass, '''Kh.''' = [[Khanty]], '''Li.''' = Livonian, '''Ma.''' = [[Mari]], '''Me.''' = Meadow Mari, '''Mk.''' = Mokša, '''Mo.''' = [[Mordvinic]], '''Ms.''' = [[Mansi]], '''N''' = North, '''Ne.''' = Nenets, '''Ng.''' = Nganasan, '''P.''' = [[Permic]], '''PU''' = Proto-Uralic, '''S.''' = Samic / South, '''Se.''' = Selkup, '''Smy.''' = Samoyedic, '''U.''' = Udmurt, '''Ve.''' = Veps, '''Võ.''' =
 South Estonian (Võro)
+'''[http://protouralic.wordpress.com/ Now with a blog!]''' <span style="color:red">NB: New URL<span>
 __NOTOC__
 ==Development==
 * Sound changes to [[Finnish]]
+* Sound changes to [[Mordvinic]]
 * Sound changes to [[Proto-Samic]]
 * Sound changes to [[Proto-Samoyedic]]
 ==Data subpages==
+In the vowel tables, '''bold''' marks vocalic irregularities, ''italic'' uncertainties in what the regular vocalic reflex is, <font color="red">red</font> consonantal irregularities.
 {|
 |-
-| [[Proto-Uralic/I|*i]] • [[Proto-Uralic/Ü|*ü]] • [[Proto-Uralic/U|*u]]
+| Close
+| [[Proto-Uralic/I|*i]] • [[Proto-Uralic/Ü|*ü]] || [[Proto-Uralic/new|*ï?]] • [[Proto-Uralic/U|*u]]
 |-
-| [[Proto-Uralic/new|reduced vowels?]] • [[Proto-Uralic/E|remaining cases of *e]] • [[Proto-Uralic/O|*o]]
+| Mid
+| [[Proto-Uralic/E|*e ~ *ê ~ *E]] || [[Proto-Uralic/O|*o ~ *ô ~ *O]]
 |-
-| [[Proto-Uralic/A|back unrounded vowels *a *ë]] • [[Proto-Uralic/Ä|*ä]]
+| Open
-|-
+| [[Proto-Uralic/Ä|*ä]] || [[Proto-Uralic/A|*a, *ë]]
-| Known derivativs with [[Proto-Uralic/-ka|-kA]] • [[Proto-Uralic/-ma|-mA]] • [[Proto-Uralic/derived|Potential derivativs]] • [[Proto-Uralic/CC|Cluster issues]]
 |}
+Known derivativs with
+* [[Proto-Uralic/-ka|-kA]]
+* [[Proto-Uralic/-ma|-mA]]
+* [[Proto-Uralic/-ta|-tA]]
+[[Proto-Uralic/derived|Potential derivativs]] • [[Proto-Uralic/CC|Cluster issues]] • [[Proto-Uralic/harmony|Co-occurrence of coronals]] • [[Proto-Uralic/WE|West-East discrepancies]]
 ==Reconstructed phoneme inventory==
-'''Vowels:''' */i ü u e ë o ä a/ in the initial syllable. Only a two-way height-based contrast */I A/ is reconstructed in later syllables, which may have been realized as [i æ] after front vowels and [ɯ ɑ] after back vowels (ie. with vowel harmony); or as unalternating [i a]. Mid values [e ~ ə ~ ɤ] for */I/ are also possible.
+;Vowels:
+<nowiki>*/i ü u e ë o ä a/</nowiki> in the initial syllable. Only a two-way height-based contrast */I A/ is normally reconstructed in later syllables, which may have been realized as [i æ] after front vowels and [ɯ ɑ] after back vowels (ie. with vowel harmony); or as unalternating [ə a]. These pages will use the notation *a~*ä, *ə. A couple family terms suggest different vowels, including #nato "brother's wife", #kälü "spouse's sister", #wäŋü/#wiŋü "son-in-law".
-It is furthermore unclear if (1st-syllable) *ë was [ɯ] or [ɤ], and *a [ɑ] or [ɒ], but substitution of Indo-Iranian *a by *ë in loans suggests the latter values (unless these particular words are newer loans.)
+Opinions vary on if (1st-syllable) *ë was [ɯ] or [ɤ], and *a [ɑ] or [ɒ]. As of July 2014 I ([[User:Tropylium|Tropylium]]) support [ɤ] for the former, as suggested by e.g. the substitution of Indo-Iranian *a by *ë in loans (unless these words are post-PU in date.) The latter seems like an open question; I am investigating the possibility of [ɒ] as the default value, [ɑ] as a positional allophone.
-Two "reduced" or "semi-rounded" vowels */ê ô/ have been proposed recently.
+Several amendments have been proposed at times:
+* Long vowels or equivalent entities have been a popular proposal, but the main evidence, from Finnic, has recently been adequately explained otherwise.
+* A number of studies have proposed, on the basis of the Ugric evidence, to reinterpret several quality contrasts as quantity contrasts instead.<sup>[citation needed]</sup> No wholly systemic account of this idea seems to have been presented to this day.
+* Korhonen (1988) proposes an *ï, which would have split at an early date to to front and back allophones, the former then developing into standard *ü.
+* This *ï has been reshuffled into two "reduced" or "semi-rounded" vowels *ê *ô ([ɪ ʊ]?) by Häkkinen (2007), which also aim to explain some apparent exceptions in the development of *e and *o. In light of some recently identified conditional sound laws, this may not be a necessary hypothesis. (I still suspect that at least a raised allophone of *e might have existed though.)
-'''Consonants:''' nasals */m n ń ŋ/, voiceless stops/affricates */p t ć č k/, voiceless sibilants */s ś š/, a "laryngeal" *x (likely a voiceless velar fricativ & a recent pre-Uralic split from *k), two "spirants" */ð ðʲ/, two liquids */l r/ and two semivowels */w j/.
+;Consonants
+Nasals */m n ń ŋ/, voiceless stops/affricates */p t ć č k/, voiceless sibilants */s ś š/, a "laryngeal" */x/ (a voiceless velar fricativ [x] seems like the most plausible sound value, though voiced [ɣ] is not ruled out), two "spirants" */d₁ d₂/ (traditionally interpreted as [ð] and [ðʲ] respectivly, though this is far from certain), two liquids */l r/ and two semivowels */w j/.
-<nowiki>*ć</nowiki> (as distinct from *ś?) and *š (as original Proto-Uralic?) are reconstructed less securely than the other consonants. A palatal liquid *ĺ is also found in old reconstructions, but the etyma involved do not really behave. The "palatal spirant" may be the actual palatal liquid; obstruent reflexes are limited to western branches, and external comparisions generally involve laterals. The dental spirant, while also merging with *l in most branches, is however certainly distinct.
+The existence of a distinction between *ć and *ś is not certain. *ć can be securely established only in the clusters *ćć, *ńć, where *ś is not found. The Proto-Uralic status of *š is also uncertain: most words with the consonant are not found in languages east to Permic, and frequently moreover probable Indo-European loanwords.
-A notable distributional feature was that *ŋ, *x and probably *ð, *r could not occur word-initially.
+A palatal liquid *ĺ is also commonly found in old reconstructions (e.g. in the ''Uralisches Etymologisches Wörterbuch''), but many of the etyma involved do not really behave. Some may be late inter-branch loans. Most others have been by now accounted for by a widespread development *l > *ĺ in Samoyedic. The "palatal spirant" *d₂ may thus have been the actual palatal liquid; obstruent reflexes are limited to western branches, and external comparisions generally involve laterals. (The "dental spirant" also develops to /l/ widely, but was certainly distinct from *l.)
-Roots generally had the form (C)V(C)C{A I}, with initial stress; in pronouns and prepositions and the copula also CV; and a single lone-V root, the negativ verb *e-
+Three further consonants of dubious nature were supported until the mid-20th century: the palatal shibilant *š́ (distinguished from regular *ś only in Mansi), and the retroflex sonorants *ṇ, *ḷ (distinguished from regular *n, *l only in Khanty).
-Basic consonant correspondences (gradation not included in Finno-Samic):
+;Phonotactics
+A notable distributional feature was that *ŋ, *x and probably also *d₁, *r could not occur word-initially.
+Roots generally had the form (C)V(C)C{A I}, with initial stress. A couple longer stems of the shape (C)V(C)C{A I}C(I) are also attested. Shorter CV roots occurred in grammatical words: pronouns, prepositions and the copula also CV; there was also a single lone-V root, the negativ verb *e-.
+See below for allowed consonant clusters.
+Basic consonant correspondences (gradation not included in Finno-Samic, asterisks for Mari and Mordvinic largely superfluous):
 {| class="wikitable"
 |-
-! C !! [[Finnish|Finnic]] !! [[Proto-Samic|Samic]] !! [[Mordvinic|Mo.]] !! [[Mari]] !! [[Permic]] !! [[Hungarian|Hung.]] !! [[Mansi|Ms.]] !! [[Khanty|Kh.]] !! [[Proto-Samoyedic|Smy.]] !! Comments
+! C !! [[Finnish|Finnic]] !! [[Proto-Samic|S.]] !! [[Mordvinic|Mordv.]] !! [[Mari]] !! [[Permic]] !! [[Hungarian|Hung.]] !! [[Mansi|Ms.]] !! [[Khanty|Kh.]] !! [[Proto-Samoyedic|Smy.]] !! Comments
 |-
 ! *m
-| *m || *m || m, v || *m || *m || m, -v- || *m || *m || *m
+| *m || *m || *m, *v || *m || *m || m, -v- || *m || *m || *m
-| Sporadic lenition in Mo, H.
+| Sporadic lenition in Mo, H. Regular in suffixes in H.
 |-
 ! *n
-| rowspan="2"| *n || *n || n, ń-&nbsp;/&nbsp;_F || rowspan="2"| *n,<br/>*-ń-&nbsp;/&nbsp;F_F || *n || n || *n || *n || *n
+| rowspan="2"| *n || *n || *n, *ń-&nbsp;/&nbsp;_F || rowspan="2"| *n,<br/>*-ń-&nbsp;/&nbsp;F_F || *n || n || *n || *n || *n
 |-
 ! *ń
-| *ń || ń,<br/>n-&nbsp;/#_B || *ń || ń || *ń || *ń || *ń
+| *ń || *ń,<br/>*n-&nbsp;/#_B || *ń || ń || *ń || *ń || *ń
 |-
 ! rowspan="2"| *-ŋ
-| rowspan="4"| *v, *ː || rowspan="2"| *ŋ || rowspan="2"| j&nbsp;/&nbsp;F_,<br/>v&nbsp;/&nbsp;B_ || rowspan="2"| *ŋ
+| rowspan="4"| *v, *ː || rowspan="2"| *ŋ || rowspan="2"| *j&nbsp;/&nbsp;F_,<br/>*v&nbsp;/&nbsp;B_ || rowspan="2"| *ŋ
 | rowspan="2"| ń&nbsp;/&nbsp;F_,<br/>n&nbsp;/&nbsp;C_,<br/>m&nbsp;/&nbsp;B_
 | rowspan="2"| g || *ŋk || *ŋk || rowspan="2"| *ŋ
-| rowspan="2"| Irregularly split in ObU. (the more general development is *ŋk)<br/>
+| rowspan="2"| Irregularly split in ObU (the more general development is *ŋk)<br/>
 Retained in some Erzya & Udmurt dialects
 |-
@@ Line 65: / Line 90: @@
 |-
 ! *w
-| rowspan="2"| *v || rowspan="2"| v || rowspan="2"| *w || *v || rowspan="4"| v, -ː- || *w || *w || *w
+| rowspan="2"| *v || rowspan="2"| *v || *w || *v || v || *w || *w || *w
 |-
 ! *-w-
-|  || rowspan="3"| *ɣ || rowspan="3"| *ɣ || rowspan="3"| ∅
+| rowspan="3"| *j&nbsp;/&nbsp;F_,<br/>∅&nbsp;/&nbsp;B_ || rowspan="3"| ∅ || rowspan="3"| *v → -ː- || rowspan="3"| *ɣ || rowspan="3"| *ɣ || rowspan="3"| ∅
 |-
 ! *-x-
-| *ː || rowspan="3"| *k || j || ∅ || ∅
+| *ː || rowspan="3"| *k || *j
 |-
 ! *-k-
-| rowspan="2"| *k || j, v || g || ∅
+| rowspan="2"| *k || *j, *v
-| Mordvinic split by vowel backness/frontness
+| Mo. split by vowel backness/frontness
 |-
 ! *k
@@ Line 82: / Line 107: @@
 |-
 ! *p
-| *p || *p || p, -v- || *p || *p, *b || f, -v- || *p || *p || *p
+| *p || *p || *p, *-v- || *p, *-w- || *p, *b || f, -v- || *p || *p || *p
 |-
 ! *č
-| *t, *h || *c || č || *č || *č, *dž || č, š || *š || *č || *č
+| *t, *h || *c || *č || *č || *č, *dž || č, š || *š || *č || *č
 |-
 ! *ć
-| rowspan="3"| *s || rowspan="2"| *č || ś || *ć ? || *ć, *dź || s, č || rowspan="2"| *s, *š || *s, *ť || rowspan="2"| *s
+| rowspan="3"| *s || rowspan="2"| *č || *ś, *-ć- || *ć, *-ź- || *ć, *dź || č, s || *ć, *s || *ć, *s || rowspan="2"| *s
 |-
 ! *ś
-| ś, -ź- || rowspan="3"| *š || *ś, *-ź- || s || *s
+| ś, -ź- || rowspan="3"| *š, -ž- || *ś, *-ź- || *s, *š ||  s || *s
 |-
 ! *s
-| rowspan="2"| *s || s, -z- || *s, *-z- || rowspan="2"| ∅ || rowspan="3"| *t || rowspan="2"| *ɬ || rowspan="3"| *t
+| rowspan="2"| *s || *s, *-z- || *s, *-z- || rowspan="2"| ∅ || rowspan="4"| *t || rowspan="2"| *ɬ || rowspan="4"| *t
 |-
 ! *š
-| *h || š, -ž- || *š, *-ž-
+| *h || *š, *-ž- || *š, *-ž-
 |-
 ! *t
-| rowspan="3"| *t || *t || rowspan="3"| t, -d-<br/> tʲ, -dʲ- || *t, *-d- || *t, *d || t, -z- || *t
+| rowspan="4"| *t || rowspan="2"| *t || *t, *tʲ || *t || *t, *d || t || rowspan="2"| *t
+|-
+! *-t-
+| rowspan="3"| *d, *dʲ || *ð || ∅ || z
 |-
-! *ðʲ
+! *d₂
-| rowspan="2"| *ð || *l || *ĺ || ɟ || *ĺ || *j || *j
+| rowspan="2"| *ð || *l, *-ð- || *ĺ || ɟ || *ĺ || *j || *j
 |-
-! *ð
+! *-d₁-
-| *l, *-d- || rowspan="2"| *l || rowspan="2"| l || rowspan="2"| *l || rowspan="2"| *l, *-ɬ- || *r
+| ∅ || *l, *-∅- || rowspan="2"| l || rowspan="2"| *l || rowspan="2"| *l, *-ɬ- || *r
+| lost in Permic only intervocally, not in clusters
 |-
 ! *l
-| *l || *l || l || *l || *l, *j
+| *l || *l || *l || *l || *l || *l, *j
+| In Kh. also irregularly *l → *ɭ
 |-
 ! *j
-| *j, *ː || *j || j || *j || *j || j, ɟ || *j || *j || *j
+| *j, *ː || *j || *j || *j || *j || j, ɟ || *j || *j || *j
 |-
 ! *r
-| *r || *r || r || *r || *r || r || *r || *r || *r
+| *r || *r || *r || *r || *r || r || *r || *r || *r
-<!--|-
+<!--
+|-
 ! *-kt-
 | *kt || *kt || vt || * || * || t || rowspan="2"| *ɣət,<br/>*təɣ || *ɣəɬ, *ɬəɣ || rowspan="2"| *t
@@ Line 127: / Line 158: @@
 | * || * ||  || * || * ||  || * || * || *-->
 |}
+Major consonant developments involving specific '''vowel environments''' include:
+* the widespread loss of *w before non-open labial vowels, *j before non-open front vowels
+** best retained in Permic and Khanty
+* the labialization of *k, *ŋ, *x in Mansi to *kʷ, *ŋʷ, *w before and after labial vowels
+* the assibilation of *t in Finnic (of any origin, i.e. < *t, *č, *d₁, *d₂) to *c before *i
+Major soundlaws affecting '''consonant clusters''' include:
+* the widespread loss or vocalization of *j and *w
+** only Samic consistently retains these, though Finnic in most cases as well and Samoyedic also frequently
+** the cluster *lj has widely coalesced to /lʲ/; in Mordvinic all j-clusters yield palatalized consonants
+** Mansi and Khanty have retained a few direct traces of *w (generally as delabialized *ɣ)
+** indirect traces of such clusters are widely found in the development of vowels
+* the shortening of geminates everywhere except in Samic and Finnic
+** generally leading to new medial voiceless stops/affricates, after the lenition of the original ones
+** geminates other than *kk are merged with the corresponding singletons in Mari, Mansi, Khanty and Samoyedic
+* the assimilation of *mt to *nt in Finnic, Mordvinic, Mansi, and probably Permic & Hungarian
+* the denasalization of nasal + stop/affricate clusters to voiced stops/affricates in Permic and Hungarian
+** the pre-Hungarian voiced affricates have lost their sibilancy: *nč *ńć > *[ʤ ʥ] > /r ɟ/
+* the metathesis of *k in Mansi and Khanty, when following a heterorganic consonant
+* the loss of *k in Samoyedic, under the same conditions
+* after the loss of final vowels, the breaking of most heterorganic consonant clusters by an epenthetic schwa in Hungarian, Mansi and Khanty
+** entirely general in Khanty, somewhat limited in Hungarian and Mansi; in many cases consonant clusters are still attested in Old Hungarian
+** some traces of a similar process are found in Mari (for *ŋ + sibilant) and Finnic (for *kś) as well
+* the loss of *p before other obstruents in Mansi
+* the "palatality metathesis" (*Ć-lk, *d₂k, *d₂w >) *ĺɣ > /lɟ/ in Hungarian
+* the assimilation of *lm (incl. earlier *d₁m) to *nm in Permic
+* the merger and lenition of *pt *kt to *ht in Finnic, *ft in Mordvinic
+* the simplification of *čč, *kš to *h in Finnic (via *th, *kh?)
 ==Medial consonant clusters==
-Words included chiefly from appendix from this: http://urn.fi/URN:NBN:fi-fe20071746 (see comments in table code)
+Includes chiefly the words from the appendix in Häkkinen (2007).
-CF: http://books.google.fi/books?id=TM2NQ78dP2wC&pg=PA492&dq=phonotactics+of+PFU
+Cf. Sammallahti (1988): http://books.google.fi/books?id=TM2NQ78dP2wC&pg=PA492&dq=phonotactics+of+PFU
 {|
 ! colspan="2"|2nd →<br/>1st ↓
-! p !! t !! č !! k !! s !! ś !! š !! ð !! l !! r !! w !! j !! m !! n !! ŋ
+! p !! t !! č !! k !! s !! ś !! š !! d₁ !! l !! r !! w !! j !! m !! n !! ŋ
 ! ∑
 ! width="25"| !! Notes !! width="25"| !! Frequency color code
@@ Line 142: / Line 202: @@
 ! rowspan="3" valign="center" style="background:#f0f0f0"| N
 ! m
-| mp <!--lämpi-->|| style="background:#ffffe0"| mt <!--omti tumti---> || ||
+| style="background:#ffffe0"| mp <!--kompa lämpe-->|| style="background:#fff0c0"| mt <!--ëmta- omte tumte---> || … ||
-| ms <!--pImsimi-->|| style="background:#ffffe0"| mś <!--n'ims'a kums'i; Häkk. kuxs'i-->|| ||
+| ms <!--pImseme-->|| style="background:#ffffe0"| mś <!--n'ims'a kums'e (= Häkk. kuxs'e)-->||
-| colspan="2" rowspan="3" align="center" valign="center" style="background:#f0f0f0"| N/A
+| colspan="3" rowspan="3" align="center" valign="center" style="background:#f0f0f0"| N/A
-| colspan="5" rowspan="10" align="center" valign="center" style="background:#f0f0f0"| N/A
+| colspan="5" rowspan="11" align="center" valign="center" style="background:#f0f0f0"| N/A
-| 6
+| 8
-| || mostly i-stems except *ńimśa
+| || mostly i-stems except *kompa *ńimśa
 | || single root
 |-
 ! n
-| || style="background:#fff0c0"| nt <!--tanti- konta- lUnta kënta-->|| style="background:#ffc8c0"| nč <!--kyntSi n'antSa pantSa pontSa wantSa tSantSi--> ||
+| || style="background:#ffc8c0"| nt <!--jäntik pintä tante- konta- lUnta kënta-->|| style="background:#ffc8c0"| nč <!--kyntSe n'antSa pantSa pontSa wantSa tSantSe--> ||
-| style="background:#c0f0ff"| || style="background:#fff0c0"| nś <!--ons'a luns'a kuns'i-->|| … ||
+| style="background:#c0f0ff"| || style="background:#fff0c0"| nś <!--ons'a luns'a kuns'e sa/äns'a-->|| …
-| 13
+| 16
-| || mostly back-harmonic a-stems
+| || mostly back-harmonic a-stems; *ns → *nč?
 | || style="background:#ffffe0"| two roots
 |-
 ! ŋ
-| || style="background:#ffffe0"| ŋt <!--oNti uNta--> || || style="background:#ffc8c0"| ŋk <!--jäNka s'äNki s'aNka woNki pëNka--->
+| || style="background:#ffffe0"| ŋt <!--oNte uNta--> || || style="background:#ffc8c0"| ŋk <!--t(s)uŋke- jäNka s'äNke s'aNka woNki pëNka --->
-| ŋs <!--jUNsi--> || ŋś <!--sa/äNs'a--> || … ||
+| ŋs <!--jUNsi--> || || …
 | 9
 | || back-harmonic or *ä
 | || style="background:#fff0c0"| 3-4 roots
 |-
-! rowspan="4" valign="center" style="background:#f0f0f0"| P
+! rowspan="5" valign="center" style="background:#f0f0f0"| P
 ! p
-| pp <!--säppa-->|| style="background:#ffffe0"| pt <!--jupta- ëpti-->|| ||
+| pp <!--säppa-->|| style="background:#ffffe0"| pt <!--jupta- ëpte-->|| ||
-| style="background:#ffffe0"| ps <!--ipsi kupsa--->|| pś <!--lëps'i-->|| … || pð <!--däpdä-->
+| style="background:#ffffe0"| ps <!--ipse kupsa--->|| pś <!--lëps'e-->|| … || pd₁ <!--l/däpdä-->
 | … ||
 | 7
@@ Line 174: / Line 234: @@
 |-
 ! t
-| || … || || style="background:#fff0c0"| tk <!--kätki- kytki- tytki--->
+| || … || || style="background:#fff0c0"| tk <!--kätke- kytke- tytke--->
 | || || ||
 | ||
 | 3
-| || all front-harmonic i-stems
+| || all front-harmonic ə-stems
-| || style="background:#f0c0f0"| ~10 roots
+| || style="background:#f0c0f0"| 10+ roots
 |-
 ! č
-| || … || || čk <!--putSki-->
+| || … || čč <!--këčče-ra-->|| style="background:#ffffe0"| čk <!--putSke nëtSke-->
 | || || ||
-| … ||
+| <!--"kečlä" = *keččä-lä-->||
-| 1
+| 3
 | ||
 | || style="background:#c0f0ff"| Most suspiciously none
+|-
+! ć
+| || || || …
+| || || ||
+| … ||
+| …
 |-
 ! k
-| || style="background:#fff0c0"| kt <!--kakta tUkta tUkti ïkta-->|| … || kk <!--lükka-->
+| || style="background:#ffc8c0"| kt <!--kakta tUkta tUkte ïkta täkta-->|| … || kk <!--lükka-->
-| style="background:#fff0c0"| ks <!--peksa- suksi mïksa-->|| style="background:#fff0c0"| kś <!--s'üks'i n'uks'i n'ëks'imi-->|| … ||
+| style="background:#fff0c0"| ks <!--peksa- sukse mïksa-->|| style="background:#fff0c0"| kś <!--s'üks'e n'uks'e n'ëks'eme-->|| style="background:#ffffe0"| kš <!--mIkSe wokSe-->||
 | … || …
-| 11
+| 13
+| || but *mekšə "bee" probably separate loans in FP and H.<!-- since reflected differently from *ks in the latter-->
 |-
 ! rowspan="3" valign="center" style="background:#f0f0f0"| S
 ! s
-| || … || || style="background:#ffffe0"| sk <!--poski soski-->
+| || … || || style="background:#ffffe0"| sk <!--poske soske-->
-| colspan="6" rowspan="7" align="center" valign="center" style="background:#f0f0f0"| N/A
+| colspan="6" rowspan="6" align="center" valign="center" style="background:#f0f0f0"| N/A
 | 2
-| || both o_i
+| || both o_ə
 |-
 ! ś
-| || … || || style="background:#fff0c0"| śk <!--wäs'ka as'kili mos'ki kUs'ki--->
+| || … || || style="background:#ffc8c0"| śk <!--kis'ko- wäs'ka as'kel mos'ke kUs'ke--->
-| 4
+| 5
-| || mostly i-stems + *wäśka
+| || mostly ə-stems + *wäśka
 |-
 ! š
@@ Line 211: / Line 278: @@
 | …
 |-
-! rowspan="3" valign="center" style="background:#f0f0f0"| L
+! rowspan="2" valign="center" style="background:#f0f0f0"| L
 ! l
-| … || lt <!--kUlta--> || … || style="background:#f0c0f0"| lk <!--jälki mälki pelka wolka tulka ulki kulki s'ëlka- wëlka--->
+| … || lt <!--kUlta--> || … || style="background:#f0c0f0"| lk <!--jälke mälke pelka s'ülke wolka tulka ulke kulke n(')olke kIlke- s'ëlka- wëlka jalka--->
 | style="background:#fff0c0"| lw <!--talwa tolwa pëlwa-->|| style="background:#ffffe0"| lj <!--n'elja pelja-->
-|style="background:#ffc8c0"| lm <!--n'elma Ilma s'ilma pilmita kUlmi-->|| … || lŋ <!--pIlNi-->
+|style="background:#ffc8c0"| lm <!--n'elma Ilma s'ilma pilmeta kUlme-->|| … || lŋ <!--pIlNe-->
-| 21
+| 25
 | || *lw *lj only a-stems; *lt may be derived ← *-lk-t-
-|-
-! lʲ
-| || || || lʲk <!--syl'ki-->
-| ||
-| || ||
-| 1
 |-
 ! r
-| style="background:#ffffe0"| rp <!--korpi tUrpa--->|| style="background:#ffffe0"| rt <!--pärta sarta-->|| … || style="background:#fff0c0"| rk <!--sarka kurki purki n'ërki-->
+| style="background:#ffffe0"| rp <!--korpi tUrpa--->|| style="background:#ffffe0"| rt <!--pärta sarta-->|| … || style="background:#fff0c0"| rk <!--sarka kurke purke n'ërke-->
-| rw <!--kIrwa---> || …
+| style="background:#ffffe0"| rw <!--jerwV- kIrwa---> || …
-| rm <!--n'urmi--> || … || …
+| rm <!--n'urme--> || … || …
-| 10
+| 11
 | || mostly back-harmonic
 |-
-| align="center" valign="center" rowspan="2"| (?)
+| align="center" valign="center" | (?)
-! ðʲ
+! d₁/d₂
-| || || || ðʲk <!--s'ëd'ka-->
+| || || || d₁k <!--s'ëdka-->
-| ðʲw <!--käd'wa-->||
+| d₂w <!--käd'wa-->||
-| || ||
+| dₓm <!--s'eDmä--> || ||
-| 2
+| 3
-|-
-! x
-| || style="background:#ffffe0"| xt? <!--wixti kuxti-->|| (xč) <!--oxtSi-->|| style="background:#c0f0ff"| &nbsp;
-| xs? <!--koxsi - or kowsi?-->|| || || (xð) <!--oxdi-->
-| style="background:#f0c0f0"| xl <!--käxli pexli n'exli koxli n'oxli poxli woxli kUxli tuxli n'ëxli-->|| (xr) <!--woxri - or woori, cf. IE-->|| || style="background:#ffffe0"| xj <!--sexji (woxji)-->
-| style="background:#fff0c0"| xm <!--läxmi d'ëxmi s'ëxmi-->|| xn <!--sëxni--> ||
-| 22
-| || i-stems only, some may be loans with long vowels
 |-
 ! rowspan="2" valign="center" style="background:#f0f0f0"| sV
 ! w
-| || … || colspan="5" rowspan="2" align="center" valign="center" style="background:#f0f0f0"| N/A?
+| || … || ||
-| style="background:#ffffe0"| wð <!--käwdi täwdi--->
+| ws <!--kowse--> || ||
-| wl <!--lewli-->|| || || wj <!--jewji-->
+| style="background:#fff0c0"| wd₁ <!--owdem käwde täwde--->
-| || || wŋ <!--s'awNa-->
+| wl <!--lewle-->|| || || wj <!--jewje-->
-| 5
+| || wn <!--säwnV--> || style="background:#ffffe0"| wŋ <!--owNe s'awNa-->
-| || after e ä a only
+| 9
+| || after e ä a o only
 |-
 ! j
-| … || … ||
+| … || … || || ||
+| || ||
 | || … || style="background:#fff0c0"| jw <!--päjwa kajwa ojwa--> || …
-| jm <!--äjmä-->|| … || style="background:#ffffe0"| jŋ <!--wajNi ajNi-->
+| jm <!--äjmä-->|| … || style="background:#ffffe0"| jŋ <!--wajNe ajNe-->
 | 6
 | || after ä a o only
 |-
 ! colspan="2" align="center"| ∑
-| 4 || 17 || 7 || 31
+| 5 || 19 || 7 || 37
-| 8 || 10 || … || 4
+| 8 || 10 || 2 || 4
-| 11 || (1) || 8 || 5
+| 1 || 0 || 9 || 5
-| 10 || 1 || 4
+| 8 || 1 || 5
 |}
-Not all blank'd cells were necessarily impossible: some roots of limited distribution have examples of *kč, *pš, *kš, *pl, '''*ćl''', *čl, *kl, *kr, *tt, *čt, *st, *śt, *št, *šk, '''*ćk''', *nš, *ŋš, *lp, *lč, *ln, *rč, *rj, *rn, *rŋ, *jp, *jt, *jr, *jj, *jń, *wt (mark'd with an ellipsis in the table where applicable, others bolded).
+Not all blank'd cells were necessarily impossible: some roots of limited distribution have examples of *kč, *pš, *kš, *pl, *ćl, *kl, *kr, *čt, *tt, *st, *śt, *št, *šk, *ćk, *nš, *ŋš, *mč, *lp, *lč, *ln, *rč, *rj, *rn, *rŋ, *jp, *jt, *jr, *jj, *jń, *wt (mark'd with an ellipsis in the table).
+==Literature==
+;Current standard literature
+* Janhunen, Juha (1981): Uralilaisen kantakielen sanastosta
+* Janhunen, Juha (1982): On the structure of Proto-Uralic
+* Sammallahti, Pekka (1988): Historical Phonology of the Uralic languages
+; Nonstandard proposals
+* Häkkinen, Jaakko (2007): ''Kantauralin murteutuminen vokaalivastaavuuksien valossa''. Master's thesis. http://urn.fi/URN:NBN:fi-fe20071746
+* Korhonen, Mikko (1988): Uralilaisten kielten jälkitavujen vokaaliston historiaa
+;Dated reconstructions
+* Collinder, Björn (1960): ''Comparative Grammar of the Uralic languages''
+* Itkonen, Erkki (1946): Zur Frage nach der Entwicklung des Vokalismus der ersten Silbe in den finnisch-ugrischen Sprachen, insbesondere in Mordwinischen. ''Finnisch-Ugrische Forschungen'' 29.
+* Steinitz, Wolfgang (1944): ''Geschichte des finnisch-ugrischen Vokalismus''. Stockholm.
+* Sammallahti, Pekka (1979): Über die Laut- und Morphemstruktur der uralischen Grundsprache. ''Finnisch-Ugrische Forschungen'' 43.
-[[Category:Natlangs]]
+{{Natlangs}}
 [[Category:Reconstructed languages]]