Finnish

From FrathWiki
Jump to navigationJump to search

Proto-Uralic to Finnish sound changes

Mostly based on:

Currently in process of reformatting and reordering to include the information from the last two documents.

Technotes

  • Here, /@/ is NOT an ASCIIfication of /ə/, but any vowel that assimilates to the preceding vowel. This comes useful with cases of compensatory lengthening and ecko vowels.
  • Similarly, /A O U/ are harmonic vowels which will assimilate to either /a o u/ or /æ ø y/ depending on the harmony. /a/ is to be understood as [ɑ].
  • /ˣ/ is the assimilatory final, pronounced as lengthening of the next word's initial consonant, or in case of null initial, [ʔː] or hiatus. Very rarely, it occurs within words, too (usually sandwiched between two instances of the same vowel.)
  • /C/ represents any consonant; /V/ represents any vowel; and /X/ represents any 2nd mora in a syllable (be it consonantal, difthongal or chronemical).

I've grouped similar changes together under sub-headings, so the order of unrelated changes might not be exactly chronological whenever no reference was available. Also, since the document is headed towards Standard Finnish, I've had to cut a few corners anyway when maneuvering around dialectal changes... in a few cases picking the most represented outcome wasn't all that clear.

Proto-Uralic to Proto-Finno-Samic

[Ca. 4000 BCE to 3000 BCE]

PU roots generally had the form (C)V(C)C{A I}, with initial stress; in pronouns and prepositions also CV; and two lone-V roots, the negativ verb e- and the root "self", o-.

Unclear issues:

  • the quality of the vowel - [ɯ] or [ɤ]?
  • the quality of the vowel *a - [ɑ] or [ɒ]?
  • was the 2nd-syllable vocalism phonemically /a i/ with fronted & backed allophones, or overtly vowel-harmonic /æ a i ɯ/?
  • the quality of the consonant *x - [h], [x], [ɣ], something else?

The existence of "Proto-Finno-Samic" as distinct from PU is unclear. Changes shared with Samic are in indigo, those also shared with Mordvinic in green, and those with even wider distribution in orange.

Word-final */ŋ/ → k in the lativ ending, → n elsewhere (!dubious, pre-Uralic?)

Introduction of length from loss of preconsonantal *x.

  • x → @ / _C (leaves no evidence in Mordvinic and Ob-Ugric)

Stressed merges with *a

Loss of /w/ before labial vowels (partially also in Mari)

  • w → ∅ / #_{o u y}

The consonant may have persisted before long vowels, but since a glide was epenthetically later added there anyway, there's no way to tell.

Loss of */j/ before /i/ likely goes around here too, but Samic seems inconsistent.

Other stressed vowel changes

  • aː æː → oː eː (but part of a general a æ → oː eː shift in Samic)
  • iw → y / _C (unsure about the distribution of this, or if this feeds vowel length)
  • V# → Vː (affects most CV words with the exception of se.)
  • *ê *ô → e o / _(X)Ci (new hypothetical vowels for PU)
            → y ɯ → y i / _(X)CA

Unstressed vowels

  • ij iw → i u
  • i → e / _C (≠ j, w) (but part of a general i → ɤ shift in Samic)
  • æ → e / _j
  • aw æw → o (the -w rarely goes back to Proto-Uralic, so this may also be analogical)
  • a → e / {o u}[+STR](X)C_j
       → o / {a e i}[+STR](X)C_j
       → a / elsewhere

(Other instances of unstressed /Qj/ shift too, but analogical leveling has rendered it impossible to tell whether the original result was /ej/ or /oj/.)

Proto-Finno-Samic to Proto-Finnic

[Ca. 3000 BC to 2000 BC] (likely also incomplete; this is the section of changes not shared by other branches of Uralic)

Loss of remaining /x/

  • ixi → øː
  • uxi → oː
  • xi → @ / elsewhere

(*xA, *x# apparently did not occur)

Loss of /ŋ/

  • UŋA → Oː
  • ŋi → @ / {A i u}_
  • ŋ → j / _Cʲ
       → remains _k
       → w / _U_ _O_, other _C

Loss of medial semivowels in i-stems

  • Uwi → Uː
  • ewi → øː
  • wI → i (medial unstressed reduced /i/!)
  • ji → @ / front V_
       → j / A_#, O_ U_
       → @ / A_{l r}(C)V (due to [je]?)
       → i / C_{# C)
  • /yje/ → */øː/ →

Initial deaffrication. Newer initial affricates are found in loanwords and onomatopoeia.

  • ʧ ʦʲ → ʃ sʲ / #_

Depalatalization, commonly attributed to Germanic superstratum influence.

  • ʦʲ(ː) sʲ ðʲ lʲ → ʦ(ː) s ð l
  • nʲ → ni / #(C)V_V (i.e. before a short stressed vowel), → n (elsewhere)

Loss of /ð/

  • ð → t (may be gradation-related, shared with Mordvinic but not Samic. Put here to avoid requiring postulating intermediate *tʲ for the development of *ðʲ)

---

This givs as the phonology of Proto-Finnic:

Consonant inventory
lab dnt alv palv vel
/m       n         / nasals
/p   t   ts  tʃ   k/ plosivs/affricates
/        s   ʃ     / sibilants
/        l         / lateral
/        r         / rhotic
/v             j   / semivowels

(I'm marking [ʋ] as /v/ for brevity from now on.)

Syllable structure (C)V(@, i, U, C)(C) /N/ didn't occur morpheme-initially. Morpheme-finally, only /t k s m n j/ occured. Word-initial /r/ was rare (non-existant in PU) /#ji #je #vu/ did not occur.

Allowed medial clusters included the following (and possibly more, if consonantal root forms were in existence yet by this stage):

  • /pː pt tː tk kt tːs tʃk ktʃ kː/ (/tsk kts/?)
  • /mp mt mts nt nts ntʃ ŋk/
  • /ns nʃ/
  • /ps ks kʃ kʃt/ (/kst/?)
  • /tn km/ (only intermorphemically)
  • /sm st sn sl sk ʃm ʃt ʃn ʃl ʃr ʃk/
  • just about all approximant + non-approximant combinations
  • /lj rj lv rv jv/(/vj/ is forbidden and metathesizes to /jv/ in loans)
  • /ntː ŋkː rtː rkː lkː/?
  • almost all allowed CC combinations preceded by Vj, VU or V@
Vowel inventory
/i iː y yː      u uː /
/e eː   øː      o oː /
/æ æː      a aː      /
/ew æw     aw   ow   /
/ej æj     aj   oj uj/

/a: æ:/ were rare, only occuring in about half a dozen roots each. (These new instances are of fuzzy origin, apparently loanwords acquired between PU and PF?)

/i e A o u/ could occur in non-initial root syllables (plus /ej oj/ due to suffixal j).

Early Proto-Finnic to Late Proto-Finnic

[Ca. 2000 BCE to 1000 CE]

Loss of /ʧ/

  • ʧ ʧː → t tʃ


Difthong paradigm shift j w → i U / V_{C #} (required for pre-difthongal consonants not to gradate)

Birth of consonantal suffixes i → ∅ / VC_, ks_ suffix-finally (with /ts tʃ/ counted as clusters, not phonemes)

Collapse of labial codas

  • m → n / _{t ts #}
  • p → U / _# (probably via [β]?)

Consonant gradation These all occur on the general condition that the folloing syllable is closed.

  • pː tː tsː kː → pˑ tˑ tsˑ kˑ → p t ts k / {sonorant}_V (the half-long stage ensures that gradated consonants can still themself trigger gradation)
  • p t ts s k → b d s z ɡ / {V l r}_V
  • b d ɡ → β ð ɣ / except N_

(NB: gradation of modern /ht hk/ is analogy-borne)

Gradation-related changes

  • p t s k → β ð z ɣ / V_{l r j} (mostly in loanwords)
  • ɣ → j/v (a possible change involving a -ɣA suffix of fuzzy origin)
  • ð ɣ → ∅ / V_V#

Around this time there's also a paradigm shift wrt. /f/ in loanwords: the reflex of /f/ preceded by a vowel changes from /p/ to /v/. This could signify a change of [w] to [ʋ] in the position, but also of [ɸ] to [f] in the loaning languages!

Vowel shifts

  • oi → o / [-STR] (but reverted back in many, tho not all, cases where the -i was morphological)
  • ai → ei / [+STR] (with many exceptions; also, surprizingly, /æi/ stays put)
  • V: → V / _i

Birth of consonantal root forms

  • e → ∅ / stem-finally after a coronal

(This change could be much older and is actually more complex, but I don't kno what's the latest understanding)

Assibilation

  • t → s / _i

except before a coronal obstruent (/t s ʃ/)

Esh-drift

  • ʃ → ʂ → x (postdates old Baltic and Germanic loanwords)

Assimilation of many consonant clusters to geminates, etc. All of these require a morpheme boundary somewhere in the cluster. A basically equivalent criterion is requiring a preceding unstressed syllable. Of these, only /pt kt rn/ occurred root-medially, and these were retained.

  • (t)(ː)sn → sː
  • kx (tx) → xː
  • rn ln → rː lː
  • pn tn kn ktn ptn (etc.) → nː
  • pm tm km (etc.) → mː
  • xk → kː (happens also across word boundaries, precluding the formation of /?/)
  • pst tst kst → st

(The consequent obscuring of many inflected forms due to this and the previous change caused many words to revert back, however.)

Fricativ collapse, part 3

  • ʦ(ː) → θ(ː)
  • z → h
  • x(ː) → h (a spirantic pronunciation can still be found in coda position)


V-epenthesis

  • ∅ → ʋ / #_{}

Shifts involving /h/ (unfinished)

  • e → @ / h_ in suffixes
  • p k → h / _t (With IE loanwords continuing to feed new /pt kt/, this rule remained activ up until to the 20th century.) (NB Mordvin has → f)

Late Proto-Fennic to Standard Finnish

[Ca. 1000-1900 CE] These changes are, for the most part, only attested in the Finnish-Carelian continuum.

"Flavor": Voiced prenasal stops become geminate nasals, and (around the same time as in a whole lot of other European languages!) long mid vowels become opening difthongs:

  • mb nd ŋg → mː nː ŋː
  • eː øː oː → ie yø uo

Changes involving /j/

  • j → i / C_ suffix-initially

More shifts with /h/

  • t → 0 / h_r
  • s → h / _l
  • Vh → hV / {Vi n l r}_# in eg. vaihe venhe orhi urho alhainen ylhäinen
  • k h → ? / _#

Spirant loss

  • β → 0 / _UC
        → v / other _V
        → U / _C
  • ð remains V[+STR](X)_
  • ið → j / V[-STR]_V
  • ð → U / {A, e}[+STR]_{l r j n} (paitsi @ in <vaatia>)
        → t / _{v, j}
        → l / l_
        → r / r_
        → 0 / elsewhere
  • ɣ → @ / _j, e_r{i e a o u}
        → i / {i e æ}_C (C!=j, r)
        → U / {O U a}_C (C!=j); æ_r; e_r{æ ø y}
        → j / C_e
        → v / U_U
        → ? / V1V2_V2 (including the cases of V1=V2; also V2≠U)
        → 0 / elsewhere
  • h → 0 / V[-STR](X)_V

Subsequent vowel changes in unstressed syllables (unfinished, may need to be meshed with the prev. section)

  • AO → A:, O: or U: (seemingly irregularly)
  • Ae → Ai
  • Ue → e:
  • VU → V: / _#
  • iU → U:
  • OU → O: (kokoontu-; but aitous etc.)

Initial-syllable labialization

  • ey → øy
  • e i ie → ø y yø | _(X)(C)Cy (if the /y/ is a part of the root)
  • i → y / _væ (this one is actually older than the others, but fits here better)

The final stages of interdental loss began after or around the time of the creation of the literary language, seen in spellings such as <tz dh>. By standardization it was however practically complete. The standard outcome is largely a spelling pronunciation based on the example of German and Swedish:

  • θ(ː) → ts
  • ð → d (commonly alveolar)

Most common dialectal variations for the former are t(ː) and ht~t, for the latter r and ∅.