Finnish
Proto-Uralic to Finnish sound changes
Thus far mostly based on:
- Lauri Hakulinen: Suomen kielen rakenne ja kehitys • Otava 1979
- Kaisa Häkkinen: Nykysuomen etymologinen sanakirja • WSOY 2004
- Petri Kallio: Kantasuomen konsonanttihistoriaa · Mémoires de la Société Finno-Ougrienne 253 · 2007
- Juha Janhunen: The primary laryngeal in Uralic and beyond · (same)
Currently in process of reformatting and reordering to include the information from the last two documents.
Technotes
- Here, /@/ is NOT an ASCIIfication of /ə/, but any vowel that assimilates to the preceding vowel. This comes useful with cases of compensatory lengthening and ecko vowels.
- Similarly, /A O U/ are harmonic vowels which will assimilate to either /a o u/ or /æ ø y/ depending on the harmony. /a/ is to be understood as [ɑ].
- /ˣ/ is the assimilatory final, pronounced as lengthening of the next word's initial consonant, or in case of null initial, [ʔː] or hiatus. Very rarely, it occurs within words, too (usually sandwiched between two instances of the same vowel.)
- /C/ represents any consonant; /V/ represents any vowel; and /X/ represents any 2nd mora in a syllable (be it consonantal, difthongal or chronemical).
I've grouped similar changes together under sub-headings, so the order of unrelated changes might not be exactly chronological whenever no reference was available. Also, since the document is headed towards Standard Finnish, I've had to cut a few corners anyway when maneuvering around dialectal changes... in a few cases picking the most represented outcome wasn't all that clear.
Proto-Uralic to Pre-Finnic
[Supposedly 4000 BCE to 3000 BCE]
The existence of "Proto-Finno-Samic" ("-Volgaic", "-Permic", "-Ugric") as distinct from PU is unclear, hence "Pre-Finnic". Changes shared with Samic are in indigo, those also shared with Mordvinic in green, and those with even wider distribution in orange.
Word-final */ŋ/ → k in the lativ ending, → n elsewhere (!dubious, pre-Uralic?)
Dephthongization (dubious)
- iw → y / _C (distribution? does this feed vowel length?)
- potentially: ow → uː / _C
Introduction of length from loss of preconsonantal *x.
- x → @ / _C (leaves no evidence in Ob-Ugric)
Coda nasal simplification
- m → n / _{t tsʲ #} (Finnic, Mordvinic; medially also Permic, Mansi)
Stressed *ë merges with *a
Other stressed vowel changes
- aː æː → oː eː (but part of a general a æ → oː eː / [+STR] shift in Samic)
Unstressed vowels
- a → æ / {æ e ê i ü}(X)(C)C_ (if not an original distinction)
- aw æw → o (the presence of -w rarely is shared, so this may also be analogical)
- i → e / _C (≠ j, w) (but part of a general i → ɤ shift in Samic)
- ij iw → i u ?
Pre-Finnic to Proto-Finnic
[Ca. 3000 BC to 2000 BC] (likely also incomplete; this is the section of changes not shared by other branches of Uralic)
Vowel changes
- V# → Vː (affects most old CV words with the exception of me te he se ne, which per Livonian had *-k.)
- *ê *ô → e o / _(X)Ci (new hypothetical vowels for PU, possibly semi-rounded [ɪ ʊ])
→ y ɯ → y i / _(X)CA - æ → e / _j unstress'd
- a → e / {o u}[+STR](X)C_j
→ o / {a e i}[+STR](X)C_j
→ a / elsewhere
(Other instances of unstressed /aj/ shift too, but analogical leveling has rendered it impossible to tell whether the original result was /ej/ or /oj/.)
Loss of remaining /x/
- ixi → øː
- uxi → oː
- xi → @ / elsewhere
- Possibly in newer formations, x → w / {U O}_C (souta- nouta-), → j / {I E}_C (keima-
(*xA, *x# apparently did not occur)
Loss of /ŋ/
- UŋA → Oː (but *piŋa → pivo)
- ŋi → @ / {A i u}_
- ŋ → remains _k
→ n / _t (ensi, possibly onte-)
→ j / _Cʲ (seiso- veitsi, punaise-)
→ w / _{U O}_, other _C (jauha- jousi joutsen)
Note the similar fate of postconsonantal *x *ŋ. Merger?
Loss of medial semivowels in i-stems
- Uwi → Uː
- ewi → øː
- wI → i (medial unstressed reduced /i/!)
- ji → @ / front V_
→ j / A_#, O_ U_
→ @ / A_{l r}(C)V (due to [je]?)
→ i / C_{# C) - /yje/ → */øː/ → yö
Dissimilation
- uw ij → ow ej / _C (souta- nouta- keima- again; and perhaps jousi?!)
Initial deaffrication. Newer initial affricates are found in loanwords and onomatopoeia.
- ʧ ʦʲ → ʃ sʲ / #_
Depalatalization, commonly attributed to Germanic superstratum influence.
- ʦʲ(ː) sʲ ðʲ lʲ → ʦ(ː) s ð l
- nʲ → ni / #(C)V_V (i.e. after a short stressed vowel; miniä), → n (elsewhere)
Loss of /ð/
- ð → t (may be gradation-related, shared with Mordvinic but not Samic. Put here to avoid requiring postulating intermediate *tʲ for the development of *ðʲ)
Loss of /ʧ/
- ʧ ʧː → t tʃ (In South Estonian, → ts / _k)
Assibilation, fed by all three previous changes
- t → ʦ / _i
- tj → ʦ
except after a coronal obstruent (/tt st ʃt/) or before a derivational suffix
Esh-drift
- ʃ → ʂ → x (postdates old Baltic and Germanic loanwords; as well as the previous on account of blocking assibilation)
Nasal simplification part 2. Note old Germanic loans feeding -ntj-! Probably postdates the previous, given no simplification in *nx (vanha inho etc.)
- n → ∅ / _(t)s (asia rasia nisä osa otsa kusi) (distribution?)
Paradigmatically reverted in cases of *-ntsi ~ *-nte- such as kansi, ponsi; not however in ordinals: kolmas ~ kolmante- (also note Est. kaas vel. sim. - a later reprise per compensatory lengthening? - and Liv. vuontsa - suggesting Gmc. loan origin?)
---
This givs as the phonology of Proto-Finnic:
Consonant inventory
Lab. | Dnt. | Alv. | Vel. | |
---|---|---|---|---|
Nasals | m | n | ||
Stops & affricate | p | t | ʦ | k |
Fricatives | s | x | ||
Lateral | l | |||
Rhotic | r | |||
Semivowels | v | j |
(I'm marking [ʋ] as /v/ for brevity from now on.)
Syllable structure (C)V(@, i, U, C)(C) Morpheme-finally, only /t k s m n j/ occured. Word-initial /r/ was rare outside of recent IE loanwords (non-existant in PU?) /#ji #je #vu/ did not occur.
Allowed medial clusters included the following (and possibly more, if consonantal root forms were in existence yet by this stage):
- /pː pt pʦ tː tk ʦː kː kt kʦ/ (/ʦk/?)
- /mp nt nʦ ŋk/
- /nx/
- /ps ks/ (/kst/?) and perhaps /tx kx(t)/ (if not simplified to /x(t)/ yet; check!)
- /tn km/ (only intermorphemically)
- /sm st sn sl sk xm xt xn xl xr xk/
- just about all approximant + non-approximant combinations
- /lj rj lv rv jv/
- /ntː ŋkː rtː rkː lkː/?
- various CC combinations preceded by Vj, VU or V@
/vj vr vs (vx?)/ are forbidden and metathesize to /jv rv sv (hv?)/ in loans (haiven laiva raiva- raivo, järvi karva tarvas torvi, rasva; kaivata with analogous gradation?) (The last of these will have to be dated as erlier than *joŋsi → *jousi).
Note also kauha ~ kahva, kiuru ~ kirvinen, kuilu ~ kulju
While several later common innovations affect both South Estonian and the remainder of Finnic, pre-Finnic *č is reconstructable in some cases per the former (*kačke- *kačku *kičke- *nočko *pučki *sičkajn)
Vowel inventory /i iː y yː u uː / /e eː øː o oː / /æ æː a aː / /ej ew oj ow/ /æj æw aj aw uj /
/aː æː/ were rare, originally only occuring in about half a dozen roots each. (These new instances are of fuzzy origin, apparently loanwords acquired after the change to */oː eː/?)
/i e A o (u?)/ could occur in non-initial root syllables (plus /ej oj/ due to suffixal j).
Proto-Finnic to Proto-Finnish
[Ca. 2000 BCE to 1000 CE]
Difthong paradigm shift j w → i U / V_{C #} (not really phonetical; required for pre-difthongal consonants not to gradate)
Consonantal suffix and stem-forms
- i → ∅ / VC_, ks_ suffix-finally
- e → ∅ / stem-finally after a coronal
(This change could be much older and is actually more complex, but I don't kno what's the latest understanding)
Consonant gradation. These all occur on the general condition that the folloing syllable is closed.
- pː tː tsː kː → pˑ tˑ tsˑ kˑ → p t ts k / {sonorant}_V (the half-long stage ensures that gradated consonants can still themself trigger gradation; no gradation is found in Veps or Livonian)
- p t ts s k → b d s z ɡ / {sonorant}_V
(NB: gradation of modern /ht hk/ is analogy-borne)
Suffixal gradation (needs elaboration)
- p t s k → b d z ɡ / V[-STR]_V
Spirantization
- b d ɡ → β ð ɣ / except N_ (may be later - not evident in Votic)
- ɣ → j~v (in kataja, jalava, kajava etc.)
- βi → U / _# (3PS -Pi)
Around this time there's also a paradigm shift wrt. /f/ in loanwords: the reflex of initial /f/ changes from /p/ to /v/. This could signify a change of [w] to [ʋ] in the position, but also of [ɸ] to [f] in the loaning languages (or even the onset of Grimm's law)! Medial /f/ does not seem to ever turn to /p/.
Loss of /v/ before labial vowels
- v → ∅ / #_{o u y}
The consonant may have persisted before long vowels, but since a glide was epenthetically added there anyway (see next), there's no way to tell. Compensatory lengthening may have occurred before *u (*wuðʲi → uusi). Shared with Samic, Mordvinic & Mari, but note that this seems to be a late areal change — it is later than the South Finnic change from *o to *ɤ per *wo-tta- "to take" → *vɤtta- → Es. võtma- (however, note the counterexample *wolka → õlg).
Loss of */j/ before /i/ is likely of similar age, but Samic seems inconsistent (due to é → jé / #_ ? cf. also no medial loss: *velji → veli, ~ *véljɤ)
V-epenthesis
- ∅ → v / #_{yː øː oː}
Notable exceptions: yö uoma, the former probably homonym avoidance (vyö "belt"), the latter a Samic loan - allowing to date the similar change in Samic as later. Per *voohla → vohla "goat kid" (*voohi → vuohi "goat") this would seem to be the later of the two changes involving initial *v.
Vowel shifts
- oi → o / [-STR] (but reverted back in many, tho not all, cases where the -i was morphological)
- Vː → V / _i (Clearly postdates epenthesis: voi ← *vooji)
Assimilation of many consonant clusters to geminates, etc. (Dating provisional, perhaps older.) All of these require a morpheme boundary somewhere in the cluster. A basically equivalent criterion is requiring a preceding unstressed syllable. Of these, /rn pt kt kx tx/ (/kʃ tʃ/?) occurred root-medially, and the first three were retained (though rn → rː may have occurred in aarre; cf. aarni - and kt → tː is required for tytär, which appears to be the only loan with the cluster around this timeframe. South Estonian has even root-medially pt kt → tː.)
- kt(s) pt(s) → tː(s)
- *ntt → tː (katta-, tuta)
- xk → kː (happens also across word boundaries, precluding the formation of /?/)
- (t)(ː)sn → sː
- kx (tx) → xː
→ @x / _C (vaahtera, jäähty-, huuhto- ?) - rn ln → rː lː
- wst → st / o_ ? (nouse- ~ nosta-)
- pn tn kn ktn ptn (etc.) → nː
- pm tm km (etc.) → mː
- pst tst kst → st
(The consequent obscuring of many inflected forms due to this and the previous change caused many words to revert back, however. Note especially *pekstä, *pekse- → *peestä, peekse-)
Fricativ collapse, part 3
- ʦ → s / _{# s}_
- ʦ(ː) → θ(ː) (remains gradational)
- z → h
- x(ː) → h (a spirantic pronunciation can still be found in coda position)
Shifts involving /h/
(unfinished)
- e → @ / h_ in suffixes
- p k → h / _t (With IE loanwords continuing to feed new /pt kt/, this rule remained activ up until to the 20th century.)
Proto-Finnish to Standard Finnish
[Ca. 1000-1900 CE] These changes are, for the most part, only attested in the Finnish-Carelian continuum.
"Flavor": Voiced prenasal stops become geminate nasals, and (around the same time as in a whole lot of other European languages!) long mid vowels become opening difthongs:
- mb nd ŋg → mː nː ŋː
- eː øː oː → ie yø uo
Changes involving /j/
- j → i / C_ suffix-initially
More shifts with /h/
- Vh → hV / {Vi n l r}_# in eg. vaihe venhe perhe erhe orhi urho alhainen ylhäinen (dialectally regular)
- dubbelmetathesis: imeh → imhe → ihme
- k h → ˣ / _#
- t → ∅ / h_r (ahrain ihra kehrä ohra) (cf. next)
Pre-sonorant stop vocalization (with an intermediate spirant stage)
Predominantly Germanic loanwords; a few Baltic, and a Uralic etymology exists for *kopra *kotva *kupla *nakris *syklä. By the evidence of other Finnic languages, *Tl in loanwords is initially substituted by *kl (eg. *seeθla → *seekla).
- p → U / _S (hauras kauris koura seura taulu teuras vauras äyräs; also note kupla, from a conservativ dialect)
- t remains _{v, j} (katve ketju kotva latva lotja patja patvi vitja)
→ U / _r{A, O} (aura nöyrä peura puuro uuras)
→ @ / _r{i, e} (teeri)
(any coda examples before i O??) - k → @ / _j (laaja raaja taaja vaaja)[1]
→ i / {i, e}_S{i, e} (eilen keila leili leiri neilikka peili teili teini tiili) (May have rather occurred in loaning Finland Swedish dialects, except eilen, of unkno'n origin & where Karelian explicitly retains /kl/.)
→ U / {A, O, U}_S (S≠j) (hauli kaula kaura käyrä myyrä mäyrä naula nauris naura- paula vaula väylä sauna syylä taula uuni vaunu); {i e}_Sa (neula seula siula siuna-)
Spirant loss
- β → ∅ / _UC
→ v / other _V - ið → j / V[-STR]_V
- ð remains V[+STR](X)_
→ l / l_
→ r / r_
→ ∅ / elsewhere - ɣ → j / C_e
→ v / U_U
→ ? / V1V2_V2 (including the cases of V1=V2; also V2≠U)
→ ∅ / elsewhere - h → ∅ / V[-STR](X)_V
Subsequent vowel changes in unstressed syllables (unfinished, may need to be meshed with the prev. section)
- AO → Aː, Oː or Uː (seemingly irregularly)
- Ae → Ai
- Ue → eː
- VU → Vː / _#
- iU → Uː
- OU → Oː (kokoontu-; but aitous etc.)
Initial-syllable labialization
- ey → øy
- e i ie → ø y yø | _(X)(C)Cy (if the /y/ is a part of the root)
- i → y / _væ (this one is actually older than the others, but fits here better)
The final stages of interdental loss began after or around the time of the creation of the literary language, seen in spellings such as <tz dh>. By standardization it was however practically complete. The standard outcome is largely a spelling pronunciation based on the example of German and Swedish:
- θ(ː) → ts
- ð → d (commonly alveolar)
Most common dialectal variations for the former are t(ː) and ht~t, for the latter r and ∅.
Modern Standard Finnish
Modern Standard Finnish has the following consonant phonemes:
p t k (ʔ) b d ɡ m n ŋ f s (ʃ) h ʋ l r j
/b ɡ f ʃ/ only occur in loanwords and neologisms. /ʃ/ is rather marginal and usually replaced by /s/. [ʔ] occurs optionally word-initially before a vowel, and between two identical vowels.
The vowels are /i e æ ø y ɑ o u/. Vowel harmony continues to apply to suffixes, however in loanwords and neologisms /æ ø y/ and /ɑ o u/ can co-occur (psyko-, hyla, sekundääri). Examples of /æ/ and /ɑ/ co-occurring remain rare; even the lexicalized compound tällainen is usually pronounced /ˈt̪ælːæ(i)ne̞/. All vowels may occur long at any position of the word; however, (C)VVCC syllables occur only in very recent loans (pointti).
Any difthongs that end in -i/-y/-u and adhere to vowel harmony and the Obligatory Contour Principle are possible, tho in stressed syllables /ey/ is exceedingly rare (due to the change to /øy/; found only in the verb leyhy-) and /iy/ nonexistant.
The following sound changes are commonplace in spoken Standard Finnish:
- n → ∅ / _#
- d → ∅/r in inherited vocabulary
- Va → Vː / unstress'd
- ie yö uo → ii yy uu / _A
Proto-Finnic to Livonian
Palatalization 1 (everywhere except in Votic, Finnish proper, some Estonian & Karelian dialects; possibly needs splitting)
- t ts s(C) n l r → tʲ tsʲ sʲ(C) nʲ lʲ rʲ / _i#
Deaffrication (near universal in Finnic, but note the exception here)
- ts(ʲ) → s(ʲ) / except n_
Common South Finnic changes (Estonian, Võro, Votic)
- e → ɤ / _C(C){a o u}
- ä → a / [-STR] (phonemicizes the prev. - also in Veps)
- h → ∅ / {# C}_ (initially only dialectally in Estonian)
- Vn → Vː / _s (in Livonian only in new loans such as kansa, pensas; in E/Võ/Vo also from *nts, see prev.)
Umlaut
- a ä → ä e / _C*i
Vowel reduction
- V → ə / [-STR] ≠a (*u *i survive in certain suffixes, but never in roots)
- a → ə / [-STR] VC(C)aC(C)_#, also always in verbal stems
Early length
- Vh → Vːɦ / _C (except hj hv?)
Liquid metathesis
- LV → VL / [-sonorant]_ (*atra → *atar, *putro → *putor, etc.) (may need relocation)
Voicing
- p t(ʲ) k s(ʲ) → b d(ʲ) g z(ʲ) / when not initial or next to another voiceless segment (yes, final consonants voice too!)
Early glottalization
- (C)VCV → (C)VʔCV
Apocope
- ə → ∅ / _#, VC_CV
Gemination etc.
- C → Cː / ʔ_V
- gj dj lj rj → jg dʲ lʲː rʲː
- dv zv lv rv jv → d z lː rː jː
- Cv → C / elsewh.
Length II / degemination
- VCːa → VːCa / [+sonorant] (preceeds general presonorant lengthening since *a → ō, *o → ūo from here)
- Cː → C / except V_ə in verbs (unless this is a reflex of the infinitiv!)
Vowel shift 1
- aː au → ɔː ɔu (some new *aː then develop in words like _paarm_ "gadfly")
- eː (øː) oː → iːe (yːø) uːo
- ɤ(ː) → ɨ(ː)
2nd glottalization
- ɦ → ʔ
Sibilants (hard to date)
- (t)sʲ (d)zʲ → (t)ʃ (d)ʒ
Vowel shift 2
- y ø → i e
- except äy ey → äu eu
- V → Vː / _[+sonorant]C(C)a (including difthongs in -i, eg. *aitta → āita)
- a → aː / VC_
- e o → eː oː / _Ca
- o → oː / _[+sonorant]C#, _i
- eː oː → ie uo
- furthermore → je vo / #_
- ɔ(ː) → o(ː)
Notes for other Finnic languages
- Veps: loss of vowel length, js → ʃ, ts(ː) → ʧ(ː), ü → u / [-STR]
- Estonian & Votic: o → ɤ / _(C*){i e}, _u
- Estonian: jh wh → hj hv / V_, akj akr → ai aer, V → ∅ / VXC_#, high V → mid V / _nonhigh V, ü → i [-STR]
- Votic: k → ʧ / _{i ü e ö ä}, kj → ɟɟ, h → ∅ / #_, st → sː