Proto-Uralic: Difference between revisions

Revision as of 09:13, 14 January 2013

Fully a work in progress. Mistakes may occur.

Abbreviations used on these pages: B. = Baltic, Cf. = 'compare', En = Enets, Er. = Erźa, Es. = Estonian, F. = Finnish, Gmc = Germanic, H. = Hungarian, Hi. = Hill Mari, IA = Indo-Aryan, IE = Indo-European, II = Indo-Iranian, K. = Komi, Ka. = Kamass, Kh. = Khanty, Li. = Livonian, Ma. = Mari, Me. = Meadow Mari, Mk. = Mokša, Mo. = Mordvinic, Ms. = Mansi, N = North, Ne. = Nenets, Ng. = Nganasan, P. = Permic, PU = Proto-Uralic, S. = Samic / South, Se. = Selkup, Smy. = Samoyedic, U. = Udmurt, Ve. = Veps, Võ. = South Estonian (Võro)

Development

Sound changes to Finnish
Sound changes to Proto-Samic
Sound changes to Proto-Samoyedic

Data subpages

In the vowel tables, bold marks vocalic irregularities, italic uncertainties in what the regular vocalic reflex is, red consonantal irregularities.

Close	i • ü	ï? • u
Mid	e ~ ê ~ *E	o ~ ô ~ *O
Open	*ä	a, ë

Known derivativs with

Potential derivativs • Cluster issues • Co-occurrence of coronals

Reconstructed phoneme inventory

Vowels

*/i ü u e ë o ä a/ in the initial syllable. Only a two-way height-based contrast */I A/ is normally reconstructed in later syllables, which may have been realized as [i æ] after front vowels and [ɯ ɑ] after back vowels (ie. with vowel harmony); or as unalternating [ə a]. (That *-i fails to cause palatalization anywhere outside of Finnic seems like a good argument against an original close front value.) These pages will use the notation *a~*ä, *ə. A couple family terms suggest different vowels, including *nato "brother's wife", *kälü "spouse's sister", *wäŋü/*wiŋü "son-in-law".

It is furthermore unclear if (1st-syllable) *ë was [ɯ] or [ɤ], and *a [ɑ] or [ɒ], but substitution of Indo-Iranian *a by *ë in loans suggests the latter values (unless these particular words are newer loans.)

Two "reduced" or "semi-rounded" vowels */ê ô/ have been proposed recently.

Consonants

Nasals */m n ń ŋ/, voiceless stops/affricates */p t ć č k/, voiceless sibilants */s ś š/, a "laryngeal" *x (likely a voiceless velar fricativ & a recent pre-Uralic split from *k), two "spirants" */ð ðʲ/, two liquids */l r/ and two semivowels */w j/.

*ć (as distinct from *ś?) and *š (as original Proto-Uralic?) are reconstructed less securely than the other consonants. A palatal liquid *ĺ is also found in old reconstructions, but the etyma involved do not really behave (they may be late inter-branch loans). The "palatal spirant" may be the actual palatal liquid; obstruent reflexes are limited to western branches, and external comparisions generally involve laterals. The dental spirant, while also merging with *l in most branches, is however certainly distinct.

A notable distributional feature was that *ŋ, *x and probably also *ð, *r could not occur word-initially.

Roots generally had the form (C)V(C)C{A I}, with initial stress; in pronouns and prepositions and the copula also CV; and a single lone-V root, the negativ verb *e-

Basic consonant correspondences (gradation not included in Finno-Samic, asterisks for Mari and Mordvinic largely superfluous):

C	Finnic	S.	Mordv.	Mari	Permic	Hung.	Ms.	Kh.	Smy.	Comments
*m	*m	*m	m, v	*m	*m	m, -v-	*m	*m	*m	Sporadic lenition in Mo, H. Regular in suffixes in H.
*n	*n	*n	n, ń- / _F	n, -ń- / F_F	*n	n	*n	*n	*n
*ń	*n	*ń	ń, n- /#_B	n, -ń- / F_F	*ń	ń	*ń	*ń	*ń
*-ŋ	v, ː	*ŋ	j / F_, v / B_	*ŋ	ń / F_, n / C_, m / B_	g	*ŋk	*ŋk	*ŋ	Irregularly split in ObU (the more general development is *ŋk) Retained in some Erzya & Udmurt dialects
*-ŋ		*ŋ	j / F_, v / B_	*ŋ	ń / F_, n / C_, m / B_	g	*ŋ	*ŋ	*ŋ
*w		*v	*v	*w	*v	v	*w	*w	*w
*-w-		*v	*v	*j / F_, ∅ / B_	∅	*v → -ː-	*ɣ	*ɣ	∅
*-x-	*ː	*k	*j
*-k-	*k		j, v							Mo. split by vowel backness/frontness
*k	*k		k	*k	k, g	k, h /_B	*k	*k	*k	Stop voicing irregularly split in P.
*p	*p	*p	p, -v-	p, -w-	p, b	f, -v-	*p	*p	*p
*č	t, h	*c	*č	*č	č, dž	č, š	*š	*č	*č
*ć	*s	*č	ś, -ć-	ć, -ź-	ć, dź	č, s	ć, s	ć, s	*s
*ś		*č	ś, -ź-	*š, -ž-	ś, -ź-	s, š	s	*s	*s
*s		*s	s, -z-		s, -z-	∅	*t	*ɬ	*t
*š	*h	*s	š, -ž-		š, -ž-	∅		*ɬ
*t	*t	*t	t, tʲ	*t	t, d	t		*t
*-t-		*t	d, dʲ	*ð	∅	z		*t
*ðʲ		*ð		l, -ð-	*ĺ	ɟ	*ĺ	*j	*j
*ð		*ð		l, -∅-	*l	l	*l	l, -ɬ-	*r
*l	*l	*l	*l	*l	*l	l	*l	l, -ɬ-	l, j	In Kh. also irregularly l → ɭ
*j	j, ː	*j	*j	*j	*j	j, ɟ	*j	*j	*j
*r	*r	*r	*r	*r	*r	r	*r	*r	*r

Medial consonant clusters

Words included chiefly from appendix from this: http://urn.fi/URN:NBN:fi-fe20071746 (see comments in table code)

CF: http://books.google.fi/books?id=TM2NQ78dP2wC&pg=PA492&dq=phonotactics+of+PFU

2nd → 1st ↓		p	t	č	k	s	ś	š	ð	l	r	w	j	m	n	ŋ	∑	Notes	Frequency color code
N	m	mp	mt	…		ms	mś		N/A			N/A					7	mostly i-stems except *ńimśa	single root
	n		nt	nč			nś	…									16	mostly back-harmonic a-stems; ns → nč?	two roots
	ŋ		ŋt		ŋk	ŋs		…									9	back-harmonic or *ä	3-4 roots
P	p	pp	pt			ps	pś	…	pð	…							7		5-6 roots
	t		…		tk												3	all front-harmonic ə-stems	10+ roots
	č		…	čč	čk												3		Most suspiciously none
	ć				…					…							…
	k		kt	…	kk	ks	kś	(kš)		…	…						12	*kš probably separate loans in FP and H.
S	s		…		sk	N/A											2	both o_ə
	ś		…		śk												5	mostly ə-stems + *wäśka
	š		…		…												…
L	l	…	lt	…	lk							lw	lj	lm	…	lŋ	25	lw lj only a-stems; lt may be derived ← -lk-t-
L	r	rp	rt	…	rk							rw	…	rm	…	…	10	mostly back-harmonic
(?)	ð⁽ʲ⁾				ð⁽ʲ⁾k							ðʲw		…			2
sV	w		…	N/A?					wð	wl			wj		wn	wŋ	6	after e ä a only
sV	j	…	…	N/A?							…	jw	…	jm	…	jŋ	6	after ä a o only
∑		4	19	7	37	7	10	(1)	3	1	0	6	5	7	1	4

Not all blank'd cells were necessarily impossible: some roots of limited distribution have examples of *kč, *pš, *kš, *pl, *ćl, *kl, *kr, *čt, *tt, *st, *śt, *št, *šk, *ćk, *nš, *ŋš, *mč, *lp, *lč, *ln, *rč, *rj, *rn, *rŋ, *ðm, *jp, *jt, *jr, *jj, *jń, *wt (mark'd with an ellipsis in the table).

This article is one of quite a few pages about Natlangs.

Indo-european natlangs:

Balto-Slavic Natlangs: Czech * Russian

Celtic Natlangs: Revived Middle Cornish * Pictish

Germanic Natlangs:

North Germanic Natlangs: Norwegian

West Germanic Natlangs: Anglo-Saxon * Dutch * English (Old English * Middle English * Modern English * Scots) * German (High German * Low German)

Indo-Iranian Natlangs: Pahlavi

Italic Natlangs: French * Italian * Latin * Spanish

Debated: Cimmerian

Uralic Natlangs: Finnish * Khanty * Mansi * Mordvinic * Proto-Uralic
Altaic (controversial): Japanese
Sino-Tibetan Natlangs:
Uto-Aztecan Natlangs: Nahuatl

-

Isolate Natlangs: Basque * *
Hypothetical/debated Natlangs and Natlang families: Danubian * Europic (obsolete)

@@ Line 34: / Line 34: @@
 ==Reconstructed phoneme inventory==
-'''Vowels:''' */i ü u e ë o ä a/ in the initial syllable. Only a two-way height-based contrast */I A/ is reconstructed in later syllables, which may have been realized as [i æ] after front vowels and [ɯ ɑ] after back vowels (ie. with vowel harmony); or as unalternating [i a]. Mid values [e ~ ə ~ ɤ] for */I/ are also possible.
+;Vowels:
+<nowiki>*/i ü u e ë o ä a/</nowiki> in the initial syllable. Only a two-way height-based contrast */I A/ is normally reconstructed in later syllables, which may have been realized as [i æ] after front vowels and [ɯ ɑ] after back vowels (ie. with vowel harmony); or as unalternating [ə a]. (That *-i fails to cause palatalization anywhere outside of Finnic seems like a good argument against an original close front value.) These pages will use the notation *a~*ä, *ə. A couple family terms suggest different vowels, including *nato "brother's wife", *kälü "spouse's sister", *wäŋü/*wiŋü "son-in-law".
 It is furthermore unclear if (1st-syllable) *ë was [ɯ] or [ɤ], and *a [ɑ] or [ɒ], but substitution of Indo-Iranian *a by *ë in loans suggests the latter values (unless these particular words are newer loans.)
@@ Line 40: / Line 41: @@
 Two "reduced" or "semi-rounded" vowels */ê ô/ have been proposed recently.
-'''Consonants:''' nasals */m n ń ŋ/, voiceless stops/affricates */p t ć č k/, voiceless sibilants */s ś š/, a "laryngeal" *x (likely a voiceless velar fricativ & a recent pre-Uralic split from *k), two "spirants" */ð ðʲ/, two liquids */l r/ and two semivowels */w j/.
+;Consonants:
+Nasals */m n ń ŋ/, voiceless stops/affricates */p t ć č k/, voiceless sibilants */s ś š/, a "laryngeal" *x (likely a voiceless velar fricativ & a recent pre-Uralic split from *k), two "spirants" */ð ðʲ/, two liquids */l r/ and two semivowels */w j/.
-<nowiki>*ć</nowiki> (as distinct from *ś?) and *š (as original Proto-Uralic?) are reconstructed less securely than the other consonants. A palatal liquid *ĺ is also found in old reconstructions, but the etyma involved do not really behave. The "palatal spirant" may be the actual palatal liquid; obstruent reflexes are limited to western branches, and external comparisions generally involve laterals. The dental spirant, while also merging with *l in most branches, is however certainly distinct.
+<nowiki>*ć</nowiki> (as distinct from *ś?) and *š (as original Proto-Uralic?) are reconstructed less securely than the other consonants. A palatal liquid *ĺ is also found in old reconstructions, but the etyma involved do not really behave (they may be late inter-branch loans). The "palatal spirant" may be the actual palatal liquid; obstruent reflexes are limited to western branches, and external comparisions generally involve laterals. The dental spirant, while also merging with *l in most branches, is however certainly distinct.
-A notable distributional feature was that *ŋ, *x and probably *ð, *r could not occur word-initially.
+A notable distributional feature was that *ŋ, *x and probably also *ð, *r could not occur word-initially.
 Roots generally had the form (C)V(C)C{A I}, with initial stress; in pronouns and prepositions and the copula also CV; and a single lone-V root, the negativ verb *e-
-Basic consonant correspondences (gradation not included in Finno-Samic):
+Basic consonant correspondences (gradation not included in Finno-Samic, asterisks for Mari and Mordvinic largely superfluous):
 {| class="wikitable"
@@ Line 55: / Line 57: @@
 |-
 ! *m
-| *m || *m || m, v || *m || *m || m, -v- || *m || *m || *m
+| *m || *m || *m, *v || *m || *m || m, -v- || *m || *m || *m
-| Sporadic lenition in Mo, H.
+| Sporadic lenition in Mo, H. Regular in suffixes in H.
 |-
 ! *n
-| rowspan="2"| *n || *n || n, ń-&nbsp;/&nbsp;_F || rowspan="2"| *n,<br/>*-ń-&nbsp;/&nbsp;F_F || *n || n || *n || *n || *n
+| rowspan="2"| *n || *n || *n, *ń-&nbsp;/&nbsp;_F || rowspan="2"| *n,<br/>*-ń-&nbsp;/&nbsp;F_F || *n || n || *n || *n || *n
 |-
 ! *ń
-| *ń || ń,<br/>n-&nbsp;/#_B || *ń || ń || *ń || *ń || *ń
+| *ń || *ń,<br/>*n-&nbsp;/#_B || *ń || ń || *ń || *ń || *ń
 |-
 ! rowspan="2"| *-ŋ
-| rowspan="4"| *v, *ː || rowspan="2"| *ŋ || rowspan="2"| j&nbsp;/&nbsp;F_,<br/>v&nbsp;/&nbsp;B_ || rowspan="2"| *ŋ
+| rowspan="4"| *v, *ː || rowspan="2"| *ŋ || rowspan="2"| *j&nbsp;/&nbsp;F_,<br/>*v&nbsp;/&nbsp;B_ || rowspan="2"| *ŋ
 | rowspan="2"| ń&nbsp;/&nbsp;F_,<br/>n&nbsp;/&nbsp;C_,<br/>m&nbsp;/&nbsp;B_
 | rowspan="2"| g || *ŋk || *ŋk || rowspan="2"| *ŋ
-| rowspan="2"| Irregularly split in ObU. (the more general development is *ŋk)<br/>
+| rowspan="2"| Irregularly split in ObU (the more general development is *ŋk)<br/>
 Retained in some Erzya & Udmurt dialects
 |-
@@ Line 74: / Line 76: @@
 |-
 ! *w
-| rowspan="2"| *v || rowspan="2"| v || *w || *v || v || *w || *w || *w
+| rowspan="2"| *v || rowspan="2"| *v || *w || *v || v || *w || *w || *w
 |-
 ! *-w-
@@ Line 80: / Line 82: @@
 |-
 ! *-x-
-| *ː || rowspan="3"| *k || j
+| *ː || rowspan="3"| *k || *j
 |-
 ! *-k-
-| rowspan="2"| *k || j, v
+| rowspan="2"| *k || *j, *v
 | Mo. split by vowel backness/frontness
 |-
@@ Line 91: / Line 93: @@
 |-
 ! *p
-| *p || *p || p, -v- || *p, *-w- || *p, *b || f, -v- || *p || *p || *p
+| *p || *p || *p, *-v- || *p, *-w- || *p, *b || f, -v- || *p || *p || *p
 |-
 ! *č
-| *t, *h || *c || č || *č || *č, *dž || č, š || *š || *č || *č
+| *t, *h || *c || *č || *č || *č, *dž || č, š || *š || *č || *č
 |-
 ! *ć
-| rowspan="3"| *s || rowspan="2"| *č || ś || *ć ? || *ć, *dź || č, s || *ć, *s || *ć, *s || rowspan="2"| *s
+| rowspan="3"| *s || rowspan="2"| *č || *ś, *-ć- || *ć, *-ź- || *ć, *dź || č, s || *ć, *s || *ć, *s || rowspan="2"| *s
 |-
 ! *ś
@@ Line 103: / Line 105: @@
 |-
 ! *s
-| rowspan="2"| *s || s, -z- || *s, *-z- || rowspan="2"| ∅ || rowspan="4"| *t || rowspan="2"| *ɬ || rowspan="4"| *t
+| rowspan="2"| *s || *s, *-z- || *s, *-z- || rowspan="2"| ∅ || rowspan="4"| *t || rowspan="2"| *ɬ || rowspan="4"| *t
 |-
 ! *š
-| *h || š, -ž- || *š, *-ž-
+| *h || *š, *-ž- || *š, *-ž-
 |-
 ! *t
@@ Line 121: / Line 123: @@
 |-
 ! *l
-| *l || *l || l || *l || *l, *j
+| *l || *l || *l || *l || *l, *j
 | In Kh. also irregularly *l → *ɭ
 |-
 ! *j
-| *j, *ː || *j || j || *j || *j || j, ɟ || *j || *j || *j
+| *j, *ː || *j || *j || *j || *j || j, ɟ || *j || *j || *j
 |-
 ! *r
-| *r || *r || r || *r || *r || r || *r || *r || *r
+| *r || *r || *r || *r || *r || r || *r || *r || *r
 <!--
 |-
 ! *NP
-| *NP || *BB || NB || *NB || *B || B || *NP || *NP || *NP
+| *NP || *NB || *NB || *NB || *B || B || *NP || *NP || *NP
+|-
+! *nč
+| *nt || *nc || *nž || *č || *dž, *ž || ?? || *nš || *nč || *nč
+|-
+! *nś
+| *s || *nč || *ź ? || *nć, *ž || *dź || gy || *nć || *nć || *ns
 |-
 ! *-kt-

Close	i • ü	ï? • u
Mid	e ~ ê ~ *E	o ~ ô ~ *O
Open	*ä	a, ë

Proto-Uralic: Difference between revisions

Revision as of 09:13, 14 January 2013

Development

Data subpages

Reconstructed phoneme inventory

Medial consonant clusters

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools