Proto-Uralic: Difference between revisions

From FrathWiki
Jump to navigationJump to search
Line 34: Line 34:
==Reconstructed phoneme inventory==
==Reconstructed phoneme inventory==


'''Vowels:''' */i ü u e ë o ä a/ in the initial syllable. Only a two-way height-based contrast */I A/ is reconstructed in later syllables, which may have been realized as [i æ] after front vowels and [ɯ ɑ] after back vowels (ie. with vowel harmony); or as unalternating [i a]. Mid values [e ~ ə ~ ɤ] for */I/ are also possible.
;Vowels:
<nowiki>*/i ü u e ë o ä a/</nowiki> in the initial syllable. Only a two-way height-based contrast */I A/ is normally reconstructed in later syllables, which may have been realized as [i æ] after front vowels and [ɯ ɑ] after back vowels (ie. with vowel harmony); or as unalternating [ə a]. (That *-i fails to cause palatalization anywhere outside of Finnic seems like a good argument against an original close front value.) These pages will use the notation *a~*ä, *ə. A couple family terms suggest different vowels, including *nato "brother's wife", *kälü "spouse's sister", *wäŋü/*wiŋü "son-in-law".


It is furthermore unclear if (1st-syllable) *ë was [ɯ] or [ɤ], and *a [ɑ] or [ɒ], but substitution of Indo-Iranian *a by *ë in loans suggests the latter values (unless these particular words are newer loans.)
It is furthermore unclear if (1st-syllable) *ë was [ɯ] or [ɤ], and *a [ɑ] or [ɒ], but substitution of Indo-Iranian *a by *ë in loans suggests the latter values (unless these particular words are newer loans.)
Line 40: Line 41:
Two "reduced" or "semi-rounded" vowels */ê ô/ have been proposed recently.
Two "reduced" or "semi-rounded" vowels */ê ô/ have been proposed recently.


'''Consonants:''' nasals */m n ń ŋ/, voiceless stops/affricates */p t ć č k/, voiceless sibilants */s ś š/, a "laryngeal" *x (likely a voiceless velar fricativ & a recent pre-Uralic split from *k), two "spirants" */ð ðʲ/, two liquids */l r/ and two semivowels */w j/.
;Consonants:
Nasals */m n ń ŋ/, voiceless stops/affricates */p t ć č k/, voiceless sibilants */s ś š/, a "laryngeal" *x (likely a voiceless velar fricativ & a recent pre-Uralic split from *k), two "spirants" */ð ðʲ/, two liquids */l r/ and two semivowels */w j/.


<nowiki>*ć</nowiki> (as distinct from *ś?) and *š (as original Proto-Uralic?) are reconstructed less securely than the other consonants. A palatal liquid *ĺ is also found in old reconstructions, but the etyma involved do not really behave. The "palatal spirant" may be the actual palatal liquid; obstruent reflexes are limited to western branches, and external comparisions generally involve laterals. The dental spirant, while also merging with *l in most branches, is however certainly distinct.
<nowiki>*ć</nowiki> (as distinct from *ś?) and *š (as original Proto-Uralic?) are reconstructed less securely than the other consonants. A palatal liquid *ĺ is also found in old reconstructions, but the etyma involved do not really behave (they may be late inter-branch loans). The "palatal spirant" may be the actual palatal liquid; obstruent reflexes are limited to western branches, and external comparisions generally involve laterals. The dental spirant, while also merging with *l in most branches, is however certainly distinct.


A notable distributional feature was that *ŋ, *x and probably *ð, *r could not occur word-initially.
A notable distributional feature was that *ŋ, *x and probably also *ð, *r could not occur word-initially.


Roots generally had the form (C)V(C)C{A I}, with initial stress; in pronouns and prepositions and the copula also CV; and a single lone-V root, the negativ verb *e-
Roots generally had the form (C)V(C)C{A I}, with initial stress; in pronouns and prepositions and the copula also CV; and a single lone-V root, the negativ verb *e-


Basic consonant correspondences (gradation not included in Finno-Samic):
Basic consonant correspondences (gradation not included in Finno-Samic, asterisks for Mari and Mordvinic largely superfluous):


{| class="wikitable"
{| class="wikitable"
Line 55: Line 57:
|-
|-
! *m
! *m
| *m || *m || m, v || *m || *m || m, -v- || *m || *m || *m
| *m || *m || *m, *v || *m || *m || m, -v- || *m || *m || *m
| Sporadic lenition in Mo, H.
| Sporadic lenition in Mo, H. Regular in suffixes in H.
|-
|-
! *n
! *n
| rowspan="2"| *n || *n || n, ń-&nbsp;/&nbsp;_F || rowspan="2"| *n,<br/>*-ń-&nbsp;/&nbsp;F_F || *n || n || *n || *n || *n
| rowspan="2"| *n || *n || *n, *ń-&nbsp;/&nbsp;_F || rowspan="2"| *n,<br/>*-ń-&nbsp;/&nbsp;F_F || *n || n || *n || *n || *n
|-
|-
! *ń
! *ń
| *ń || ń,<br/>n-&nbsp;/#_B || *ń || ń || *ń || *ń || *ń
| *ń || *ń,<br/>*n-&nbsp;/#_B || *ń || ń || *ń || *ń || *ń
|-
|-
! rowspan="2"| *-ŋ
! rowspan="2"| *-ŋ
| rowspan="4"| *v, *ː || rowspan="2"| *ŋ || rowspan="2"| j&nbsp;/&nbsp;F_,<br/>v&nbsp;/&nbsp;B_ || rowspan="2"| *ŋ
| rowspan="4"| *v, *ː || rowspan="2"| *ŋ || rowspan="2"| *j&nbsp;/&nbsp;F_,<br/>*v&nbsp;/&nbsp;B_ || rowspan="2"| *ŋ
| rowspan="2"| ń&nbsp;/&nbsp;F_,<br/>n&nbsp;/&nbsp;C_,<br/>m&nbsp;/&nbsp;B_
| rowspan="2"| ń&nbsp;/&nbsp;F_,<br/>n&nbsp;/&nbsp;C_,<br/>m&nbsp;/&nbsp;B_
| rowspan="2"| g || *ŋk || *ŋk || rowspan="2"| *ŋ
| rowspan="2"| g || *ŋk || *ŋk || rowspan="2"| *ŋ
| rowspan="2"| Irregularly split in ObU. (the more general development is *ŋk)<br/>
| rowspan="2"| Irregularly split in ObU (the more general development is *ŋk)<br/>
Retained in some Erzya & Udmurt dialects
Retained in some Erzya & Udmurt dialects
|-
|-
Line 74: Line 76:
|-
|-
! *w
! *w
| rowspan="2"| *v || rowspan="2"| v || *w || *v || v || *w || *w || *w
| rowspan="2"| *v || rowspan="2"| *v || *w || *v || v || *w || *w || *w
|-
|-
! *-w-
! *-w-
Line 80: Line 82:
|-
|-
! *-x-
! *-x-
| *ː || rowspan="3"| *k || j
| *ː || rowspan="3"| *k || *j
|-
|-
! *-k-
! *-k-
| rowspan="2"| *k || j, v
| rowspan="2"| *k || *j, *v
| Mo. split by vowel backness/frontness
| Mo. split by vowel backness/frontness
|-
|-
Line 91: Line 93:
|-
|-
! *p
! *p
| *p || *p || p, -v- || *p, *-w- || *p, *b || f, -v- || *p || *p || *p
| *p || *p || *p, *-v- || *p, *-w- || *p, *b || f, -v- || *p || *p || *p
|-
|-
! *č
! *č
| *t, *h || *c || č || *č || *č, *dž || č, š || *š || *č || *č
| *t, *h || *c || *č || *č || *č, *dž || č, š || *š || *č || *č
|-
|-
! *ć
! *ć
| rowspan="3"| *s || rowspan="2"| *č || ś || *ć ? || *ć, *dź || č, s || *ć, *s || *ć, *s || rowspan="2"| *s
| rowspan="3"| *s || rowspan="2"| *č || *ś, *-ć- || *ć, *-ź- || *ć, *dź || č, s || *ć, *s || *ć, *s || rowspan="2"| *s
|-
|-
! *ś
! *ś
Line 103: Line 105:
|-
|-
! *s
! *s
| rowspan="2"| *s || s, -z- || *s, *-z- || rowspan="2"| ∅ || rowspan="4"| *t || rowspan="2"| *ɬ || rowspan="4"| *t
| rowspan="2"| *s || *s, *-z- || *s, *-z- || rowspan="2"| ∅ || rowspan="4"| *t || rowspan="2"| *ɬ || rowspan="4"| *t
|-
|-
! *š
! *š
| *h || š, -ž- || *š, *-ž-
| *h || *š, *-ž- || *š, *-ž-
|-
|-
! *t
! *t
Line 121: Line 123:
|-
|-
! *l
! *l
| *l || *l || l || *l || *l, *j
| *l || *l || *l || *l || *l, *j
| In Kh. also irregularly *l → *ɭ
| In Kh. also irregularly *l → *ɭ
|-
|-
! *j
! *j
| *j, *ː || *j || j || *j || *j || j, ɟ || *j || *j || *j
| *j, *ː || *j || *j || *j || *j || j, ɟ || *j || *j || *j
|-
|-
! *r
! *r
| *r || *r || r || *r || *r || r || *r || *r || *r
| *r || *r || *r || *r || *r || r || *r || *r || *r
<!--
<!--
|-
|-
! *NP
! *NP
| *NP || *BB || NB || *NB || *B || B || *NP || *NP || *NP
| *NP || *NB || *NB || *NB || *B || B || *NP || *NP || *NP
|-
! *nč
| *nt || *nc || *nž || *č || *dž, *ž || ?? || *nš || *nč || *nč
|-
! *nś
| *s || *nč || *ź ? || *nć, *ž || *dź || gy || *nć || *nć || *ns
|-
|-
! *-kt-
! *-kt-

Revision as of 09:13, 14 January 2013

Fully a work in progress. Mistakes may occur.

Abbreviations used on these pages: B. = Baltic, Cf. = 'compare', En = Enets, Er. = Erźa, Es. = Estonian, F. = Finnish, Gmc = Germanic, H. = Hungarian, Hi. = Hill Mari, IA = Indo-Aryan, IE = Indo-European, II = Indo-Iranian, K. = Komi, Ka. = Kamass, Kh. = Khanty, Li. = Livonian, Ma. = Mari, Me. = Meadow Mari, Mk. = Mokša, Mo. = Mordvinic, Ms. = Mansi, N = North, Ne. = Nenets, Ng. = Nganasan, P. = Permic, PU = Proto-Uralic, S. = Samic / South, Se. = Selkup, Smy. = Samoyedic, U. = Udmurt, Ve. = Veps, Võ. = South Estonian (Võro)


Development

Data subpages

In the vowel tables, bold marks vocalic irregularities, italic uncertainties in what the regular vocalic reflex is, red consonantal irregularities.

Close *i *ï?*u
Mid *e ~ *ê ~ *E *o ~ *ô ~ *O
Open *a, *ë

Known derivativs with

Potential derivativsCluster issuesCo-occurrence of coronals

Reconstructed phoneme inventory

Vowels

*/i ü u e ë o ä a/ in the initial syllable. Only a two-way height-based contrast */I A/ is normally reconstructed in later syllables, which may have been realized as [i æ] after front vowels and [ɯ ɑ] after back vowels (ie. with vowel harmony); or as unalternating [ə a]. (That *-i fails to cause palatalization anywhere outside of Finnic seems like a good argument against an original close front value.) These pages will use the notation *a~*ä, *ə. A couple family terms suggest different vowels, including *nato "brother's wife", *kälü "spouse's sister", *wäŋü/*wiŋü "son-in-law".

It is furthermore unclear if (1st-syllable) *ë was [ɯ] or [ɤ], and *a [ɑ] or [ɒ], but substitution of Indo-Iranian *a by *ë in loans suggests the latter values (unless these particular words are newer loans.)

Two "reduced" or "semi-rounded" vowels */ê ô/ have been proposed recently.

Consonants

Nasals */m n ń ŋ/, voiceless stops/affricates */p t ć č k/, voiceless sibilants */s ś š/, a "laryngeal" *x (likely a voiceless velar fricativ & a recent pre-Uralic split from *k), two "spirants" */ð ðʲ/, two liquids */l r/ and two semivowels */w j/.

*ć (as distinct from *ś?) and *š (as original Proto-Uralic?) are reconstructed less securely than the other consonants. A palatal liquid *ĺ is also found in old reconstructions, but the etyma involved do not really behave (they may be late inter-branch loans). The "palatal spirant" may be the actual palatal liquid; obstruent reflexes are limited to western branches, and external comparisions generally involve laterals. The dental spirant, while also merging with *l in most branches, is however certainly distinct.

A notable distributional feature was that *ŋ, *x and probably also *ð, *r could not occur word-initially.

Roots generally had the form (C)V(C)C{A I}, with initial stress; in pronouns and prepositions and the copula also CV; and a single lone-V root, the negativ verb *e-

Basic consonant correspondences (gradation not included in Finno-Samic, asterisks for Mari and Mordvinic largely superfluous):

C Finnic S. Mordv. Mari Permic Hung. Ms. Kh. Smy. Comments
*m *m *m *m, *v *m *m m, -v- *m *m *m Sporadic lenition in Mo, H. Regular in suffixes in H.
*n *n *n *n, *ń- / _F *n,
*-ń- / F_F
*n n *n *n *n
*ń,
*n- /#_B
ń
*-ŋ *v, *ː *j / F_,
*v / B_
ń / F_,
n / C_,
m / B_
g *ŋk *ŋk Irregularly split in ObU (the more general development is *ŋk)

Retained in some Erzya & Udmurt dialects

*w *v *v *w *v v *w *w *w
*-w- *j / F_,
∅ / B_
*v → -ː-
*-x- *k *j
*-k- *k *j, *v Mo. split by vowel backness/frontness
*k k *k *k, *g k, h /_B *k *k *k Stop voicing irregularly split in P.
*p *p *p *p, *-v- *p, *-w- *p, *b f, -v- *p *p *p
*t, *h *c *č, *dž č, š
*s *ś, *-ć- *ć, *-ź- *ć, *dź č, s *ć, *s *ć, *s *s
ś, -ź- *š, -ž- *ś, *-ź- *s, *š s *s
*s *s *s, *-z- *s, *-z- *t *t
*h *š, *-ž- *š, *-ž-
*t *t *t *t, *tʲ *t *t, *d t *t
*-t- *d, *dʲ z
*ðʲ *l, *-ð- ɟ *j *j
*l, *-∅- *l l *l *l, *-ɬ- *r
*l *l *l *l *l *l, *j In Kh. also irregularly *l → *ɭ
*j *j, *ː *j *j *j *j j, ɟ *j *j *j
*r *r *r *r *r *r r *r *r *r

Medial consonant clusters

Words included chiefly from appendix from this: http://urn.fi/URN:NBN:fi-fe20071746 (see comments in table code)

CF: http://books.google.fi/books?id=TM2NQ78dP2wC&pg=PA492&dq=phonotactics+of+PFU

2nd →
1st ↓
p t č k s ś š ð l r w j m n ŋ Notes Frequency color code
N m mp mt ms N/A N/A 7 mostly i-stems except *ńimśa single root
n nt 16 mostly back-harmonic a-stems; *ns → *nč? two roots
ŋ ŋt ŋk ŋs 9 back-harmonic or *ä 3-4 roots
P p pp pt ps 7 5-6 roots
t tk 3 all front-harmonic ə-stems 10+ roots
č čč čk 3 Most suspiciously none
ć
k kt kk ks (kš) 12 *kš probably separate loans in FP and H.
S s sk N/A 2 both o_ə
ś śk 5 mostly ə-stems + *wäśka
š
L l lt lk lw lj lm 25 *lw *lj only a-stems; *lt may be derived ← *-lk-t-
r rp rt rk rw rm 10 mostly back-harmonic
(?) ð(ʲ) ð(ʲ)k ðʲw 2
sV w N/A? wl wj wn 6 after e ä a only
j jw jm 6 after ä a o only
4 19  37 7 10 (1) 3 1 0 6 5 7 1 4

Not all blank'd cells were necessarily impossible: some roots of limited distribution have examples of *kč, *pš, *kš, *pl, *ćl, *kl, *kr, *čt, *tt, *st, *śt, *št, *šk, *ćk, *nš, *ŋš, *mč, *lp, *lč, *ln, *rč, *rj, *rn, *rŋ, *ðm, *jp, *jt, *jr, *jj, *jń, *wt (mark'd with an ellipsis in the table).

This article is one of quite a few pages about Natlangs.

Indo-european natlangs:

Balto-Slavic Natlangs: Czech * Russian
Celtic Natlangs: Revived Middle Cornish * Pictish
Germanic Natlangs:
North Germanic Natlangs: Norwegian
West Germanic Natlangs: Anglo-Saxon * Dutch * English (Old English * Middle English * Modern English * Scots) * German (High German * Low German)
Indo-Iranian Natlangs: Pahlavi
Italic Natlangs: French * Italian * Latin * Spanish
Debated: Cimmerian

Uralic Natlangs: Finnish * Khanty * Mansi * Mordvinic * Proto-Uralic
Altaic (controversial): Japanese
Sino-Tibetan Natlangs:
Uto-Aztecan Natlangs: Nahuatl

-

Isolate Natlangs: Basque * *
Hypothetical/debated Natlangs and Natlang families: Danubian * Europic (obsolete)